101
|
Barquist L, Boinett CJ, Cain AK. Approaches to querying bacterial genomes with transposon-insertion sequencing. RNA Biol 2013; 10:1161-9. [PMID: 23635712 PMCID: PMC3849164 DOI: 10.4161/rna.24765] [Citation(s) in RCA: 104] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
In this review, we discuss transposon-insertion sequencing, variously known in the literature as TraDIS, Tn-seq, INSeq, and HITS. By monitoring a large library of single transposon-insertion mutants with high-throughput sequencing, these methods can rapidly identify genomic regions that contribute to organismal fitness under any condition assayable in the laboratory with exquisite resolution. We discuss the various protocols that have been developed and methods for analysis. We provide an overview of studies that have examined the reproducibility and accuracy of these methods, as well as studies showing the advantages offered by the high resolution and dynamic range of high-throughput sequencing over previous methods. We review a number of applications in the literature, from predicting genes essential for in vitro growth to directly assaying requirements for survival under infective conditions in vivo. We also highlight recent progress in assaying non-coding regions of the genome in addition to known coding sequences, including the combining of RNA-seq with high-throughput transposon mutagenesis.
Collapse
Affiliation(s)
- Lars Barquist
- Wellcome Trust Sanger Institute; Hinxton, Cambridge, UK; EMBL-European Bioinformatics Institute; Hinxton, Cambridge, UK
| | | | | |
Collapse
|
102
|
Barquist L, Langridge GC, Turner DJ, Phan MD, Turner AK, Bateman A, Parkhill J, Wain J, Gardner PP. A comparison of dense transposon insertion libraries in the Salmonella serovars Typhi and Typhimurium. Nucleic Acids Res 2013; 41:4549-64. [PMID: 23470992 PMCID: PMC3632133 DOI: 10.1093/nar/gkt148] [Citation(s) in RCA: 83] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
Salmonella Typhi and Typhimurium diverged only ∼50 000 years ago, yet have very different host ranges and pathogenicity. Despite the availability of multiple whole-genome sequences, the genetic differences that have driven these changes in phenotype are only beginning to be understood. In this study, we use transposon-directed insertion-site sequencing to probe differences in gene requirements for competitive growth in rich media between these two closely related serovars. We identify a conserved core of 281 genes that are required for growth in both serovars, 228 of which are essential in Escherichia coli. We are able to identify active prophage elements through the requirement for their repressors. We also find distinct differences in requirements for genes involved in cell surface structure biogenesis and iron utilization. Finally, we demonstrate that transposon-directed insertion-site sequencing is not only applicable to the protein-coding content of the cell but also has sufficient resolution to generate hypotheses regarding the functions of non-coding RNAs (ncRNAs) as well. We are able to assign probable functions to a number of cis-regulatory ncRNA elements, as well as to infer likely differences in trans-acting ncRNA regulatory networks.
Collapse
Affiliation(s)
- Lars Barquist
- Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, UK.
| | | | | | | | | | | | | | | | | |
Collapse
|
103
|
Beauregard A, Smith EA, Petrone BL, Singh N, Karch C, McDonough KA, Wade JT. Identification and characterization of small RNAs in Yersinia pestis. RNA Biol 2013; 10:397-405. [PMID: 23324607 PMCID: PMC3672283 DOI: 10.4161/rna.23590] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Yersinia pestis, the etiologic agent of plague, is closely related to Yersinia pseudotuberculosis evolutionarily but has a very different mode of infection. The RNA-binding regulatory protein, Hfq, mediates regulation by small RNAs (sRNAs) and is required for virulence of both Y. pestis and Y. pseudotuberculosis. Moreover, Hfq is required for growth of Y. pestis, but not Y. pseudotuberculosis, at 37°C. Together, these observations suggest that sRNAs play important roles in the virulence and survival of Y. pestis, and that regulation by sRNAs may account for some of the differences between Y. pestis and Y. pseudotuberculosis. We have used a deep sequencing approach to identify 31 sRNAs in Y. pestis. The majority of these sRNAs are not conserved outside the Yersiniae. Expression of the sRNAs was confirmed by Northern analysis and we developed deep sequencing approaches to map 5ʹ and 3ʹ ends of many sRNAs simultaneously. Expression of the majority of the sRNAs we identified is dependent upon Hfq. We also observed temperature-dependent effects on the expression of many sRNAs, and differences in expression patterns between Y. pestis and Y. pseudotuberculosis. Thus, our data suggest that regulation by sRNAs plays an important role in the lifestyle switch from flea to mammalian host, and that regulation by sRNAs may contribute to the phenotypic differences between Y. pestis and Y. pseudotuberculosis.
Collapse
Affiliation(s)
- Arthur Beauregard
- Wadsworth Center; New York State Department of Health; Albany, NY USA
| | | | | | | | | | | | | |
Collapse
|
104
|
Coornaert A, Chiaruttini C, Springer M, Guillier M. Post-transcriptional control of the Escherichia coli PhoQ-PhoP two-component system by multiple sRNAs involves a novel pairing region of GcvB. PLoS Genet 2013; 9:e1003156. [PMID: 23300478 PMCID: PMC3536696 DOI: 10.1371/journal.pgen.1003156] [Citation(s) in RCA: 65] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2012] [Accepted: 10/25/2012] [Indexed: 01/02/2023] Open
Abstract
PhoQ/PhoP is a central two-component system involved in magnesium homeostasis, pathogenicity, cell envelope composition, and acid resistance in several bacterial species. The small RNA GcvB is identified here as a novel direct regulator of the synthesis of PhoQ/PhoP in Escherichia coli, and this control relies on a novel pairing region of GcvB. After MicA, this is the second Hfq-dependent small RNA that represses expression of the phoPQ operon. Both MicA and GcvB bind phoPQ mRNA in vivo and in vitro around the translation initiation region of phoP. Binding of either small RNA is sufficient to inhibit ribosome binding and induce mRNA degradation. Surprisingly, however, MicA and GcvB have different effects on the levels of the PhoP protein and therefore on the expression of the PhoP regulon. These results highlight the complex connections between small RNAs and transcriptional regulation networks in bacteria. Regulation of bacterial gene expression participates in the ability of these microorganisms to quickly adapt to their environment. This regulation can occur at every level of gene expression. For instance, two-component systems are involved in transcriptional control, while small RNAs usually act at the post-transcriptional level. In this study, the pleiotropic small RNA GcvB is identified as the second small RNA regulator of the central PhoQ/PhoP two-component system, which highlights the connections between the different types of regulation.
Collapse
Affiliation(s)
- Audrey Coornaert
- UPR9073 du CNRS, Université Paris Diderot, Sorbonne Paris Cité, Institut de Biologie Physico-Chimique, Paris, France
| | - Claude Chiaruttini
- UPR9073 du CNRS, Université Paris Diderot, Sorbonne Paris Cité, Institut de Biologie Physico-Chimique, Paris, France
| | - Mathias Springer
- UPR9073 du CNRS, Université Paris Diderot, Sorbonne Paris Cité, Institut de Biologie Physico-Chimique, Paris, France
| | - Maude Guillier
- UPR9073 du CNRS, Université Paris Diderot, Sorbonne Paris Cité, Institut de Biologie Physico-Chimique, Paris, France
- * E-mail:
| |
Collapse
|
105
|
Haas BJ, Chin M, Nusbaum C, Birren BW, Livny J. How deep is deep enough for RNA-Seq profiling of bacterial transcriptomes? BMC Genomics 2012; 13:734. [PMID: 23270466 PMCID: PMC3543199 DOI: 10.1186/1471-2164-13-734] [Citation(s) in RCA: 169] [Impact Index Per Article: 14.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2012] [Accepted: 12/15/2012] [Indexed: 12/17/2022] Open
Abstract
Background High-throughput sequencing of cDNA libraries (RNA-Seq) has proven to be a highly effective approach for studying bacterial transcriptomes. A central challenge in designing RNA-Seq-based experiments is estimating a priori the number of reads per sample needed to detect and quantify thousands of individual transcripts with a large dynamic range of abundance. Results We have conducted a systematic examination of how changes in the number of RNA-Seq reads per sample influences both profiling of a single bacterial transcriptome and the comparison of gene expression among samples. Our findings suggest that the number of reads typically produced in a single lane of the Illumina HiSeq sequencer far exceeds the number needed to saturate the annotated transcriptomes of diverse bacteria growing in monoculture. Moreover, as sequencing depth increases, so too does the detection of cDNAs that likely correspond to spurious transcripts or genomic DNA contamination. Finally, even when dozens of barcoded individual cDNA libraries are sequenced in a single lane, the vast majority of transcripts in each sample can be detected and numerous genes differentially expressed between samples can be identified. Conclusions Our analysis provides a guide for the many researchers seeking to determine the appropriate sequencing depth for RNA-Seq-based studies of diverse bacterial species.
Collapse
Affiliation(s)
- Brian J Haas
- Genome Sequencing and Analysis Program, The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | | | | | | | | |
Collapse
|
106
|
Bobay LM, Rocha EPC, Touchon M. The adaptation of temperate bacteriophages to their host genomes. Mol Biol Evol 2012; 30:737-51. [PMID: 23243039 PMCID: PMC3603311 DOI: 10.1093/molbev/mss279] [Citation(s) in RCA: 129] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Rapid turnover of mobile elements drives the plasticity of bacterial genomes. Integrated bacteriophages (prophages) encode host-adaptive traits and represent a sizable fraction of bacterial chromosomes. We hypothesized that natural selection shapes prophage integration patterns relative to the host genome organization. We tested this idea by detecting and studying 500 prophages of 69 strains of Escherichia and Salmonella. Phage integrases often target not only conserved genes but also intergenic positions, suggesting purifying selection for integration sites. Furthermore, most integration hotspots are conserved between the two host genera. Integration sites seem also selected at the large chromosomal scale, as they are nonrandomly organized in terms of the origin-terminus axis and the macrodomain structure. The genes of lambdoid prophages are systematically co-oriented with the bacterial replication fork and display the host high frequency of polarized FtsK-orienting polar sequences motifs required for chromosome segregation. matS motifs are strongly avoided by prophages suggesting counter selection of motifs disrupting macrodomains. These results show how natural selection for seamless integration of prophages in the chromosome shapes the evolution of the bacterium and the phage. First, integration sites are highly conserved for many millions of years favoring lysogeny over the lytic cycle for temperate phages. Second, the global distribution of prophages is intimately associated with the chromosome structure and the patterns of gene expression. Third, the phage endures selection for DNA motifs that pertain exclusively to the biology of the prophage in the bacterial chromosome. Understanding prophage genetic adaptation sheds new lights on the coexistence of horizontal transfer and organized bacterial genomes.
Collapse
Affiliation(s)
- Louis-Marie Bobay
- Microbial Evolutionary Genomics Group, Institut Pasteur, Paris, France.
| | | | | |
Collapse
|
107
|
Johnson R, Noble W, Tartaglia GG, Buckley NJ. Neurodegeneration as an RNA disorder. Prog Neurobiol 2012; 99:293-315. [PMID: 23063563 PMCID: PMC7116994 DOI: 10.1016/j.pneurobio.2012.09.006] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2012] [Revised: 09/14/2012] [Accepted: 09/26/2012] [Indexed: 12/12/2022]
Abstract
Neurodegenerative diseases constitute one of the single most important public health challenges of the coming decades, and yet we presently have only a limited understanding of the underlying genetic, cellular and molecular causes. As a result, no effective disease-modifying therapies are currently available, and no method exists to allow detection at early disease stages, and as a result diagnoses are only made decades after disease pathogenesis, by which time the majority of physical damage has already occurred. Since the sequencing of the human genome, we have come to appreciate that the transcriptional output of the human genome is extremely rich in non-protein coding RNAs (ncRNAs). This heterogeneous class of transcripts is widely expressed in the nervous system, and is likely to play many crucial roles in the development and functioning of this organ. Most exciting, evidence has recently been presented that ncRNAs play central, but hitherto unappreciated roles in neurodegenerative processes. Here, we review the diverse available evidence demonstrating involvement of ncRNAs in neurodegenerative diseases, and discuss their possible implications in the development of therapies and biomarkers for these conditions.
Collapse
Key Words
- neurodegeneration
- neurodegenerative disease
- non-coding rna
- alzheimer's disease
- parkinson's disease
- huntington's disease
- trinucleotide repeat disorder
- bace1
- rest
- long non-coding rna
- microrna
- har1
- sox2ot
- mir-9
- mir-132
- mir-124
- ndds, neurodegenerative disorders
- ad, alzheimer's disease
- hd, huntington's disease
- pd, parkinson's disease
- als, amyotrophic lateral sclerosis
- app, amyloid precursor protein
- cftr, cystic fibrosis
- csf, cerebrospinal fluid
- sod1, superoxide dismutase 1
- tardbp, tar dna binding protein
- psen-1, presenilin 1
- psen-2, presenilin 1
- mapt, microtubule-associated protein tau
- snca, α-synuclein
- ups, ubiquitin-proteasome system
- aββ, -amyloid
- er, endoplasmic reticulum
- ber, base excision repair
- parp-1, poly-adp ribose polymerase-1
- lncrnas, long non-coding rnas
- mirnas, microrna
- ncrna, non-coding rnas
- ngs, next generation sequencing
- pcr, polymerase chain reaction
- sars, severe acute respiratory disorder
- sca, spinal cerebellar ataxia
- dm, myotonic dystrophy
- hdl2, huntington's disease-like 2
- tnds, trinucleotide repeat disorders
Collapse
Affiliation(s)
- Rory Johnson
- Centre for Genomic Regulation (CRG) and UPF, Dr. Aiguader, 88, 08003 Barcelona, Catalunya, Spain
| | - Wendy Noble
- Kings College London, Institute of Psychiatry, London, UK
| | - Gian Gaetano Tartaglia
- Centre for Genomic Regulation (CRG) and UPF, Dr. Aiguader, 88, 08003 Barcelona, Catalunya, Spain
| | | |
Collapse
|
108
|
Abstract
Bacterial small non-coding RNA (sRNA) plays an important role in post-transcriptional gene regulation. Although the number of annotated sRNA is steadily increasing, their functional characterization is still lagging behind. Various computational strategies for finding sRNA−mRNA interactions, and thus putative sRNA targets, were developed. Most of them suffer from a high false positive rate. Here, we present a qualitative model to simulate the effect of an sRNA on the translation initiation of a potential target. Information about the ribosome−mRNA interaction, sRNA−mRNA interaction and expression information from deep sequencing experiments is integrated to calculate the change in translation initiation complex formation, as a proxy for translational activity. This model can be used to post-evaluate predicted targets, hence condensing the list of potential targets. We show that our translation initiation model, under the influence of an sRNA, can successfully simulate thirteen out of fifteen tested sRNA−mRNA interactions in a qualitative manner. To show the gain in specificity, we applied our method to a target search for the Escherichia coli sRNA RyhB. Compared with simple target prediction without post-evaluation, we reduce the number of targets to less than one fourth potential targets, considerably reducing the burden of experimental validation.
Collapse
|
109
|
Li W, Ying X, Lu Q, Chen L. Predicting sRNAs and their targets in bacteria. GENOMICS PROTEOMICS & BIOINFORMATICS 2012. [PMID: 23200137 PMCID: PMC5054197 DOI: 10.1016/j.gpb.2012.09.004] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Bacterial small RNAs (sRNAs) are an emerging class of regulatory RNAs of about 40–500 nucleotides in length and, by binding to their target mRNAs or proteins, get involved in many biological processes such as sensing environmental changes and regulating gene expression. Thus, identification of bacterial sRNAs and their targets has become an important part of sRNA biology. Current strategies for discovery of sRNAs and their targets usually involve bioinformatics prediction followed by experimental validation, emphasizing a key role for bioinformatics prediction. Here, therefore, we provided an overview on prediction methods, focusing on the merits and limitations of each class of models. Finally, we will present our thinking on developing related bioinformatics models in future.
Collapse
Affiliation(s)
- Wuju Li
- Beijing Institute of Basic Medical Sciences, Beijing 100850, China.
| | | | | | | |
Collapse
|
110
|
Reddy JS, Kumar R, Watt JM, Lawrence ML, Burgess SC, Nanduri B. Transcriptome profile of a bovine respiratory disease pathogen: Mannheimia haemolytica PHL213. BMC Bioinformatics 2012; 13 Suppl 15:S4. [PMID: 23046475 PMCID: PMC3439734 DOI: 10.1186/1471-2105-13-s15-s4] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open
Abstract
Background Computational methods for structural gene annotation have propelled gene discovery but face certain drawbacks with regards to prokaryotic genome annotation. Identification of transcriptional start sites, demarcating overlapping gene boundaries, and identifying regulatory elements such as small RNA are not accurate using these approaches. In this study, we re-visit the structural annotation of Mannheimia haemolytica PHL213, a bovine respiratory disease pathogen. M. haemolytica is one of the causative agents of bovine respiratory disease that results in about $3 billion annual losses to the cattle industry. We used RNA-Seq and analyzed the data using freely-available computational methods and resources. The aim was to identify previously unannotated regions of the genome using RNA-Seq based expression profile to complement the existing annotation of this pathogen. Results Using the Illumina Genome Analyzer, we generated 9,055,826 reads (average length ~76 bp) and aligned them to the reference genome using Bowtie. The transcribed regions were analyzed using SAMTOOLS and custom Perl scripts in conjunction with BLAST searches and available gene annotation information. The single nucleotide resolution map enabled the identification of 14 novel protein coding regions as well as 44 potential novel sRNA. The basal transcription profile revealed that 2,506 of the 2,837 annotated regions were expressed in vitro, at 95.25% coverage, representing all broad functional gene categories in the genome. The expression profile also helped identify 518 potential operon structures involving 1,086 co-expressed pairs. We also identified 11 proteins with mutated/alternate start codons. Conclusions The application of RNA-Seq based transcriptome profiling to structural gene annotation helped correct existing annotation errors and identify potential novel protein coding regions and sRNA. We used computational tools to predict regulatory elements such as promoters and terminators associated with the novel expressed regions for further characterization of these novel functional elements. Our study complements the existing structural annotation of Mannheimia haemolytica PHL213 based on experimental evidence. Given the role of sRNA in virulence gene regulation and stress response, potential novel sRNA described in this study can form the framework for future studies to determine the role of sRNA, if any, in M. haemolytica pathogenesis.
Collapse
Affiliation(s)
- Joseph S Reddy
- College of Veterinary Medicine, Mississippi State University, Mississippi State, MS 39762, USA
| | | | | | | | | | | |
Collapse
|
111
|
Abstract
Some unidentified RNA molecules, together with the nucleoid protein HU, were suggested to be involved in the nucleoid structure of Escherichia coli. HU is a conserved protein known for its role in binding to DNA and maintaining negative supercoils in the latter. HU also binds to a few RNAs, but the full spectrum of its binding targets in the cell is not known. To understand any interaction of HU with RNA in the nucleoid structure, we immunoprecipitated potential HU-RNA complexes from cells and examined bound RNAs by hybridization to whole-genome tiling arrays. We identified associations between HU and 10 new intragenic and intergenic noncoding RNAs (ncRNAs), 2 of which are homologous to the annotated bacterial interspersed mosaic elements (BIMEs) and boxC DNA repeat elements. We confirmed direct binding of HU to BIME RNA in vitro. We also studied the nucleoid shape of HU and two of the ncRNA mutants (nc1 and nc5) by transmission electron microscopy and showed that both HU and the two ncRNAs play a role in nucleoid morphology. We propose that at least two of the ncRNA species complex with HU and help the formation or maintenance of the architecture of the E. coli chromosome. We also observed binding of HU with rRNA and tRNA segments, a few small RNAs, and a distinct small set of mRNAs, although the significance, if any, of these associations is not known.
Collapse
|
112
|
A selective force favoring increased G+C content in bacterial genes. Proc Natl Acad Sci U S A 2012; 109:14504-7. [PMID: 22908296 DOI: 10.1073/pnas.1205683109] [Citation(s) in RCA: 97] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Bacteria display considerable variation in their overall base compositions, which range from 13% to over 75% G+C. This variation in genomic base compositions has long been considered to be a strictly neutral character, due solely to differences in the mutational process; however, recent sequence comparisons indicate that mutational input alone cannot produce the observed base compositions, implying a role for natural selection. Because bacterial genomes have high gene content, forces that operate on the base composition of individual genes could help shape the overall genomic base composition. To explore this possibility, we tested whether genes that encode the same protein but vary only in their base compositions at synonymous sites have effects on bacterial fitness. Escherichia coli strains harboring G+C-rich versions of genes display higher growth rates, indicating that despite a pervasive mutational bias toward A+T, a selective force, independent of adaptive codon use, is driving genes toward higher G+C contents.
Collapse
|
113
|
Kim D, Hong JSJ, Qiu Y, Nagarajan H, Seo JH, Cho BK, Tsai SF, Palsson BØ. Comparative analysis of regulatory elements between Escherichia coli and Klebsiella pneumoniae by genome-wide transcription start site profiling. PLoS Genet 2012; 8:e1002867. [PMID: 22912590 PMCID: PMC3415461 DOI: 10.1371/journal.pgen.1002867] [Citation(s) in RCA: 108] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2012] [Accepted: 06/14/2012] [Indexed: 01/08/2023] Open
Abstract
Genome-wide transcription start site (TSS) profiles of the enterobacteria Escherichia coli and Klebsiella pneumoniae were experimentally determined through modified 5′ RACE followed by deep sequencing of intact primary mRNA. This identified 3,746 and 3,143 TSSs for E. coli and K. pneumoniae, respectively. Experimentally determined TSSs were then used to define promoter regions and 5′ UTRs upstream of coding genes. Comparative analysis of these regulatory elements revealed the use of multiple TSSs, identical sequence motifs of promoter and Shine-Dalgarno sequence, reflecting conserved gene expression apparatuses between the two species. In both species, over 70% of primary transcripts were expressed from operons having orthologous genes during exponential growth. However, expressed orthologous genes in E. coli and K. pneumoniae showed a strikingly different organization of upstream regulatory regions with only 20% identical promoters with TSSs in both species. Over 40% of promoters had TSSs identified in only one species, despite conserved promoter sequences existing in the other species. 662 conserved promoters having TSSs in both species resulted in the same number of comparable 5′ UTR pairs, and that regulatory element was found to be the most variant region in sequence among promoter, 5′ UTR, and ORF. In K. pneumoniae, 48 sRNAs were predicted and 36 of them were expressed during exponential growth. Among them, 34 orthologous sRNAs between two species were analyzed in depth, and the analysis showed that many sRNAs of K. pneumoniae, including pleiotropic sRNAs such as rprA, arcZ, and sgrS, may work in the same way as in E. coli. These results reveal a new dimension of comparative genomics such that a comparison of two genomes needs to be comprehensive over all levels of genome organization. In order to investigate similarities and differences of closely related species, most of the comparative genomics studies focus on comparing the gene contents either shared or specific for each genome. However, it is also important to investigate the differences in non-coding regulatory elements because they influence the transcriptional and post-transcriptional processes. Thus, we performed a genome-wide profiling of transcription start sites (TSSs) in two species, E. coli K-12 MG1655 and K. pneumoniae MGH78578. Experimental identification of TSSs is important for precise definition of promoter regions and 5′ untranslated regions upstream of coding genes. Comparative analysis of these regulatory elements revealed the use of multiple TSSs, identical sequence motifs of promoter and Shine-Dalgarno sequence. However, we observed that the upstream regulatory regions of the majority of operons having orthologous genes were organized with different usage of promoters and TSSs, resulting in diverse and complex gene regulation. We also found that the 5′ UTR is the least conserved regulatory element in sequence between the two species. Moreover, 34 orthologous sRNAs between E. coli and K. pneumoniae were analyzed in depth. The analysis suggested many of K. pneumoniae sRNAs might regulate the target genes as in E. coli.
Collapse
Affiliation(s)
- Donghyuk Kim
- Department of Bioengineering, University of California San Diego, La Jolla, California, United States of America
| | - Jay Sung-Joong Hong
- Department of Bioengineering, University of California San Diego, La Jolla, California, United States of America
| | - Yu Qiu
- Department of Bioengineering, University of California San Diego, La Jolla, California, United States of America
| | - Harish Nagarajan
- Department of Bioengineering, University of California San Diego, La Jolla, California, United States of America
| | - Joo-Hyun Seo
- Department of Bioengineering, University of California San Diego, La Jolla, California, United States of America
| | - Byung-Kwan Cho
- Department of Bioengineering, University of California San Diego, La Jolla, California, United States of America
| | - Shih-Feng Tsai
- Division of Molecular and Genomic Medicine, National Health Research Institutes, Miaoli, Taiwan
| | - Bernhard Ø. Palsson
- Department of Bioengineering, University of California San Diego, La Jolla, California, United States of America
- * E-mail:
| |
Collapse
|
114
|
Abstract
Noncoding RNAs, including antisense RNAs (asRNAs) that originate from the complementary strand of protein-coding genes, are involved in the regulation of gene expression in all domains of life. Recent application of deep-sequencing technologies has revealed that the transcription of asRNAs occurs genome-wide in bacteria. Although the role of the vast majority of asRNAs remains unknown, it is often assumed that their presence implies important regulatory functions, similar to those of other noncoding RNAs. Alternatively, many antisense transcripts may be produced by chance transcription events from promoter-like sequences that result from the degenerate nature of bacterial transcription factor binding sites. To investigate the biological relevance of antisense transcripts, we compared genome-wide patterns of asRNA expression in closely related enteric bacteria, Escherichia coli and Salmonella enterica serovar Typhimurium, by performing strand-specific transcriptome sequencing. Although antisense transcripts are abundant in both species, less than 3% of asRNAs are expressed at high levels in both species, and only about 14% appear to be conserved among species. And unlike the promoters of protein-coding genes, asRNA promoters show no evidence of sequence conservation between, or even within, species. Our findings suggest that many or even most bacterial asRNAs are nonadaptive by-products of the cell’s transcription machinery. Application of high-throughput methods has revealed the expression throughout bacterial genomes of transcripts encoded on the strand complementary to protein-coding genes. Because transcription is costly, it is usually assumed that these transcripts, termed antisense RNAs (asRNAs), serve some function; however, the role of most asRNAs is unclear, raising questions about their relevance in cellular processes. Because natural selection conserves functional elements, comparisons between related species provide a method for assessing functionality genome-wide. Applying such an approach, we assayed all transcripts in two closely related bacteria, Escherichia coli and Salmonella enterica serovar Typhimurium, and demonstrate that, although the levels of genome-wide antisense transcription are similarly high in both bacteria, only a small fraction of asRNAs are shared across species. Moreover, the promoters associated with asRNAs show no evidence of sequence conservation between, or even within, species. These findings indicate that despite the genome-wide transcription of asRNAs, many of these transcripts are likely nonfunctional.
Collapse
|
115
|
Sobrero P, Valverde C. The bacterial protein Hfq: much more than a mere RNA-binding factor. Crit Rev Microbiol 2012; 38:276-99. [DOI: 10.3109/1040841x.2012.664540] [Citation(s) in RCA: 116] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
|
116
|
Abstract
Riboswitches are RNA sensors that regulate gene expression upon binding specific metabolites or ions. Bacterial riboswitches control gene expression primarily by promoting intrinsic transcription termination or by inhibiting translation initiation. We now report a third general mechanism of riboswitch action: governing the ability of the RNA-dependent helicase Rho to terminate transcription. We establish that Rho promotes transcription termination in the Mg(2+)-sensing mgtA riboswitch from Salmonella enterica serovar Typhimurium and the flavin mononucleotide-sensing ribB riboswitch from Escherichia coli when the corresponding riboswitch ligands are present. The Rho-specific inhibitor bicyclomycin enabled transcription of the coding regions at these two loci in bacteria experiencing repressing concentrations of the riboswitch ligands in vivo. A mutation in the mgtA leader that favors the "high Mg(2+)" conformation of the riboswitch promoted Rho-dependent transcription termination in vivo and in vitro and enhanced the ability of the RNA to stimulate Rho's ATPase activity in vitro. These effects were overcome by mutations in a C-rich region of the mRNA that is alternately folded at high and low Mg(2+), suggesting a role for this region in regulating the activity of Rho. Our results reveal a potentially widespread mode of gene regulation whereby riboswitches dictate whether a protein effector can interact with the transcription machinery to prematurely terminate transcription.
Collapse
|
117
|
|
118
|
Skippington E, Ragan MA. Evolutionary dynamics of small RNAs in 27 Escherichia coli and Shigella genomes. Genome Biol Evol 2012; 4:330-45. [PMID: 22223756 PMCID: PMC3318445 DOI: 10.1093/gbe/evs001] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Small RNAs (sRNAs) are widespread in bacteria and play critical roles in regulating physiological processes. They are best characterized in Escherichia coli K-12 MG1655, where 83 sRNAs constitute nearly 2% of the gene complement. Most sRNAs act by base pairing with a target mRNA, modulating its translation and/or stability; many of these RNAs share only limited complementarity to their mRNA target, and require the chaperone Hfq to facilitate base pairing. Little is known about the evolutionary dynamics of bacterial sRNAs. Here, we apply phylogenetic and network analyses to investigate the evolutionary processes and principles that govern sRNA gene distribution in 27 E. coli and Shigella genomes. We identify core (encoded in all 27 genomes) and variable sRNAs; more than two-thirds of the E. coli K-12 MG1655 sRNAs are core, whereas the others show patterns of presence and absence that are principally due to genetic loss, not duplication or lateral genetic transfer. We present evidence that variable sRNAs are less tightly integrated into cellular genetic regulatory networks than are the core sRNAs, and that Hfq facilitates posttranscriptional cross talk between the E. coli–Shigella core and variable genomes. Finally, we present evidence that more than 80% of genes targeted by Hfq-associated core sRNAs have been transferred within the E. coli–Shigella clade, and that most of these genes have been transferred intact. These results suggest that Hfq and sRNAs help integrate laterally acquired genes into established regulatory networks.
Collapse
|
119
|
Abstract
Small regulatory RNAs (sRNAs) are versatile regulators that have been shown to be involved in the gene regulation of a growing number of biological pathways in bacteria. While finding the targets of a given sRNA has been the focus of many studies, fewer methods have been described to uncover which, if any, sRNAs regulate a given gene. Here I present two genetic screens that are designed to search for sRNAs regulating a gene of interest. Before the screens are performed, a translational fusion is made between the gene of interest and lacZ, designed so that mostly post-transcriptional effects on the gene's expression can be analyzed. I describe here a simple and rapid way to obtain this fusion, even when the transcriptional start site is unknown, by combining PCR or 5'RACE with recombination in the chromosome of a special strain of Escherichia coli. The first genetic screen uses a genomic multicopy library to find regulator genes that, when overexpressed, affect the expression of the fusion. While this technique is a classical genetic screen, particular attention is paid to how it can be used to specifically find sRNAs. A second screen is described that takes advantage of a specific library of sRNAs of E. coli that provides an easier and more rapid way to look for sRNA regulation. The library is transformed into the fusion containing strain using a serial transformation protocol developed in microtiter plates. The transformants can then be directly assayed for effects on the beta-galactosidase activity of the fusion in liquid, providing a precise and rapid way to evaluate sRNA regulation. Use of one or both of these screens should help uncover new pathways of regulation by sRNAs.
Collapse
Affiliation(s)
- Pierre Mandin
- Laboratoire de Chimie Bactérienne, Aix-Marseille Université, Institut de Microbiologie de la Méditéranée, Marseille, Cedex 20, France.
| |
Collapse
|
120
|
Metatranscriptomic analysis of microbes in an Oceanfront deep-subsurface hot spring reveals novel small RNAs and type-specific tRNA degradation. Appl Environ Microbiol 2011; 78:1015-22. [PMID: 22156430 DOI: 10.1128/aem.06811-11] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Studies of small noncoding RNAs (sRNAs) have been conducted predominantly using culturable organisms, and the acquisition of further information about sRNAs from global environments containing uncultured organisms now is very important. In this study, hot spring water (57°C, pH 8.1) was collected directly from the underground environment at depths of 250 to 1,000 m in Yunohama, Japan, and small RNA sequences obtained from the environment were analyzed. A phylogenetic analysis of both archaeal and bacterial 16S rRNA gene sequences was conducted, and the results suggested the presence of unique species in the environment, corresponding to the Archaeal Richmond Mine Acidophilic Nanoorganisms (ARMAN) group and three new Betaproteobacteria. A metatranscriptomic analysis identified 64,194 (20,057 nonredundant) cDNA sequences. Of these cDNAs, 90% were either tRNAs, tRNA fragments, rRNAs, or rRNA fragments, whereas 2,181 reads (10%) were classified as previously uncharacterized putative candidate sRNAs. Among these, 15 were particularly abundant, 14 of which showed no sequence similarity to any known noncoding RNA, and at least six of which form very stable RNA secondary structures. The analysis of a large number of tRNA fragments suggested that unique relationships exist between the anticodons of the tRNAs and the sites of tRNA degradation. Previous bacterial tRNA degradation studies have been limited to specific organisms, such as Escherichia coli and Streptomyces coelicolor, and the current results suggest that specific tRNA decay occurs more frequently than previously expected.
Collapse
|
121
|
Keller TE, Mis SD, Jia KE, Wilke CO. Reduced mRNA secondary-structure stability near the start codon indicates functional genes in prokaryotes. Genome Biol Evol 2011; 4:80-8. [PMID: 22138151 PMCID: PMC3269970 DOI: 10.1093/gbe/evr129] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Several recent studies have found that selection acts on synonymous mutations at the beginning of genes to reduce mRNA secondary-structure stability, presumably to aid in translation initiation. This observation suggests that a metric of relative mRNA secondary-structure stability, Z(ΔG), could be used to test whether putative genes are likely to be functionally important. Using the Escherichia coli genome, we compared the mean Z(ΔG) of genes with known functions, genes with known orthologs, genes where function and orthology are unknown, and pseudogenes. Genes in the first two categories demonstrated similar levels of selection for reduced stability (increased Z(ΔG)), whereas for pseudogenes stability did not differ from our null expectation. Surprisingly, genes where function and orthology were unknown were also not different from the null expectation, suggesting that many of these open reading frames are not functionally important. We extended our analysis by constructing a Bayesian phylogenetic mixed model based on data from 145 prokaryotic genomes. As in E. coli, genes with no known function had consistently lower Z(ΔG), even though we expect that many of the currently unannotated genes will ultimately have their functional utility discovered. Our findings suggest that functional genes tend to evolve increased Z(ΔG), whereas nonfunctional ones do not. Therefore, Z(ΔG) may be a useful metric for identifying genes of potentially important function and could be used to target genes for further functional study.
Collapse
Affiliation(s)
- Thomas E Keller
- Section of Integrative Biology, The University of Texas at Austin, TX, USA
| | | | | | | |
Collapse
|
122
|
Pichon C, du Merle L, Caliot ME, Trieu-Cuot P, Le Bouguénec C. An in silico model for identification of small RNAs in whole bacterial genomes: characterization of antisense RNAs in pathogenic Escherichia coli and Streptococcus agalactiae strains. Nucleic Acids Res 2011; 40:2846-61. [PMID: 22139924 PMCID: PMC3326304 DOI: 10.1093/nar/gkr1141] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Abstract
Characterization of small non-coding ribonucleic acids (sRNA) among the large volume of data generated by high-throughput RNA-seq or tiling microarray analyses remains a challenge. Thus, there is still a need for accurate in silico prediction methods to identify sRNAs within a given bacterial species. After years of effort, dedicated software were developed based on comparative genomic analyses or mathematical/statistical models. Although these genomic analyses enabled sRNAs in intergenic regions to be efficiently identified, they all failed to predict antisense sRNA genes (asRNA), i.e. RNA genes located on the DNA strand complementary to that which encodes the protein. The statistical models enabled any genomic region to be analyzed theorically but not efficiently. We present a new model for in silico identification of sRNA and asRNA candidates within an entire bacterial genome. This model was successfully used to analyze the Gram-negative Escherichia coli and Gram-positive Streptococcus agalactiae. In both bacteria, numerous asRNAs are transcribed from the complementary strand of genes located in pathogenicity islands, strongly suggesting that these asRNAs are regulators of the virulence expression. In particular, we characterized an asRNA that acted as an enhancer-like regulator of the type 1 fimbriae production involved in the virulence of extra-intestinal pathogenic E. coli.
Collapse
Affiliation(s)
- Christophe Pichon
- Institut Pasteur, Unité de Biologie des Bactéries Pathogènes à Gram Positif, 25-28 Rue du Docteur Roux, F-75724 Paris, France and CNRS, URA2172, F-75724 Paris, France
| | - Laurence du Merle
- Institut Pasteur, Unité de Biologie des Bactéries Pathogènes à Gram Positif, 25-28 Rue du Docteur Roux, F-75724 Paris, France and CNRS, URA2172, F-75724 Paris, France
| | - Marie Elise Caliot
- Institut Pasteur, Unité de Biologie des Bactéries Pathogènes à Gram Positif, 25-28 Rue du Docteur Roux, F-75724 Paris, France and CNRS, URA2172, F-75724 Paris, France
| | - Patrick Trieu-Cuot
- Institut Pasteur, Unité de Biologie des Bactéries Pathogènes à Gram Positif, 25-28 Rue du Docteur Roux, F-75724 Paris, France and CNRS, URA2172, F-75724 Paris, France
| | - Chantal Le Bouguénec
- Institut Pasteur, Unité de Biologie des Bactéries Pathogènes à Gram Positif, 25-28 Rue du Docteur Roux, F-75724 Paris, France and CNRS, URA2172, F-75724 Paris, France
- *To whom correspondence should be addressed. Tel: +33 1 40 61 32 80; Fax: +33 1 40 61 36 40;
| |
Collapse
|
123
|
Chang X, Li Y, Ping J, Xing XB, Sun H, Jia P, Wang C, Li YY, Li YX. EcoBrowser: a web-based tool for visualizing transcriptome data of Escherichia coli. BMC Res Notes 2011; 4:405. [PMID: 21992408 PMCID: PMC3203075 DOI: 10.1186/1756-0500-4-405] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2011] [Accepted: 10/13/2011] [Indexed: 11/23/2022] Open
Abstract
Background Escherichia coli has been extensively studied as a prokaryotic model organism whose whole genome was determined in 1997. However, it is difficult to identify all the gene products involved in diverse functions by using whole genome sequencesalone. The high-resolution transcriptome mapping using tiling arrays has proved effective to improve the annotation of transcript units and discover new transcripts of ncRNAs. While abundant tiling array data have been generated, the lack of appropriate visualization tools to accommodate and integrate multiple sources of data has emerged. Findings EcoBrowser is a web-based tool for visualizing genome annotations and transcriptome data of E. coli. Important tiling array data of E. coli from different experimental platforms are collected and processed for query. An AJAX based genome browser is embedded for visualization. Thus, genome annotations can be compared with transcript profiling and genome occupancy profiling from independent experiments, which will be helpful in discovering new transcripts including novel mRNAs and ncRNAs, generating a detailed description of the transcription unit architecture, further providing clues for investigation of prokaryotic transcriptional regulation that has proved to be far more complex than previously thought. Conclusions With the help of EcoBrowser, users can get a systemic view both from the vertical and parallel sides, as well as inspirations for the design of new experiments which will expand our understanding of the regulation mechanism.
Collapse
Affiliation(s)
- Xiao Chang
- Bioinformatics Center, Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 320 Yueyang Road, Shanghai 200031, China.
| | | | | | | | | | | | | | | | | |
Collapse
|
124
|
Shinhara A, Matsui M, Hiraoka K, Nomura W, Hirano R, Nakahigashi K, Tomita M, Mori H, Kanai A. Deep sequencing reveals as-yet-undiscovered small RNAs in Escherichia coli. BMC Genomics 2011; 12:428. [PMID: 21864382 PMCID: PMC3175480 DOI: 10.1186/1471-2164-12-428] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2011] [Accepted: 08/24/2011] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In Escherichia coli, approximately 100 regulatory small RNAs (sRNAs) have been identified experimentally and many more have been predicted by various methods. To provide a comprehensive overview of sRNAs, we analysed the low-molecular-weight RNAs (< 200 nt) of E. coli with deep sequencing, because the regulatory RNAs in bacteria are usually 50-200 nt in length. RESULTS We discovered 229 novel candidate sRNAs (≥ 50 nt) with computational or experimental evidence of transcription initiation. Among them, the expression of seven intergenic sRNAs and three cis-antisense sRNAs was detected by northern blot analysis. Interestingly, five novel sRNAs are expressed from prophage regions and we note that these sRNAs have several specific characteristics. Furthermore, we conducted an evolutionary conservation analysis of the candidate sRNAs and summarised the data among closely related bacterial strains. CONCLUSIONS This comprehensive screen for E. coli sRNAs using a deep sequencing approach has shown that many as-yet-undiscovered sRNAs are potentially encoded in the E. coli genome. We constructed the Escherichia coli Small RNA Browser (ECSBrowser; http://rna.iab.keio.ac.jp/), which integrates the data for previously identified sRNAs and the novel sRNAs found in this study.
Collapse
Affiliation(s)
- Atsuko Shinhara
- Institute for Advanced Biosciences, Keio University, Tsuruoka 997-0017, Japan
| | | | | | | | | | | | | | | | | |
Collapse
|