1. El Mouali Y, Tawk C, Huang KD, Amend L, Lesker TR, Ponath F, Vogel J, Strowig T. The RNA landscape of the human commensal Segatella copri reveals a small RNA essential for gut colonization. Cell Host Microbe 2024:S1931-3128(24)00352-4. [PMID: 39368472 DOI: 10.1016/j.chom.2024.09.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/28/2024] [Revised: 07/19/2024] [Accepted: 09/11/2024] [Indexed: 10/07/2024]
Abstract
The bacterium Segatella copri is a prevalent member of the human gut microbiota associated with health and disease states. However, the intrinsic factors that determine its ability to colonize the gut effectively remain largely unknown. By extensive transcriptome mapping of S. copri and examining human-derived samples, we discover a small RNA, which we name Segatella RNA colonization factor (SrcF), and show that SrcF is essential for S. copri gut colonization in gnotobiotic mice. SrcF regulates genes involved in nutrient acquisition, and complex carbohydrates, particularly fructans, control its expression. Furthermore, SrcF expression is strongly influenced by human microbiome composition and by the breakdown of fructans by cohabitating commensals, suggesting that the breakdown of complex carbohydrates mediates interspecies signaling among commensals beyond its established function in generating energy. Together, this study highlights the contribution of a small RNA as a critical regulator in gut colonization.
Affiliation(s)
- Youssef El Mouali
  - Department of Microbial Immune Regulation, Helmholtz Centre for Infection Research (HZI), Braunschweig, Germany
- Caroline Tawk
  - Department of Microbial Immune Regulation, Helmholtz Centre for Infection Research (HZI), Braunschweig, Germany
- Kun D Huang
  - Department of Microbial Immune Regulation, Helmholtz Centre for Infection Research (HZI), Braunschweig, Germany
- Lena Amend
  - Department of Microbial Immune Regulation, Helmholtz Centre for Infection Research (HZI), Braunschweig, Germany
- Till Robin Lesker
  - Department of Microbial Immune Regulation, Helmholtz Centre for Infection Research (HZI), Braunschweig, Germany
- Falk Ponath
  - Helmholtz Institute for RNA-Based Infection Research (HIRI), Helmholtz Centre for Infection Research (HZI), Würzburg, Germany
- Jörg Vogel
  - Helmholtz Institute for RNA-Based Infection Research (HIRI), Helmholtz Centre for Infection Research (HZI), Würzburg, Germany; Institute for Molecular Infection Biology (IMIB), University of Würzburg, Würzburg, Germany
- Till Strowig
  - Department of Microbial Immune Regulation, Helmholtz Centre for Infection Research (HZI), Braunschweig, Germany; Centre for Individualized Infection Medicine, Hannover, Germany
2. Digby B, Finn S, Ó Broin P. Computational approaches and challenges in the analysis of circRNA data. BMC Genomics 2024; 25:527. [PMID: 38807085 PMCID: PMC11134749 DOI: 10.1186/s12864-024-10420-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/13/2024] [Accepted: 05/15/2024] [Indexed: 05/30/2024] [Open Access]
Abstract
Circular RNAs (circRNA) are a class of non-coding RNA, forming a single-stranded covalently closed loop structure generated via back-splicing. Advancements in sequencing methods and technologies in conjunction with algorithmic developments of bioinformatics tools have enabled researchers to characterise the origin and function of circRNAs, with practical applications as a biomarker of diseases becoming increasingly relevant. Computational methods developed for circRNA analysis are predicated on detecting the chimeric back-splice junction of circRNAs whilst mitigating false-positive sequencing artefacts. In this review, we discuss in detail the computational strategies developed for circRNA identification, highlighting a selection of tool strengths, weaknesses and assumptions. In addition to circRNA identification tools, we describe methods for characterising the role of circRNAs within the competing endogenous RNA (ceRNA) network, their interactions with RNA-binding proteins, and publicly available databases for rich circRNA annotation.
Affiliation(s)
- Barry Digby
  - School of Mathematical and Statistical Sciences, University of Galway, Galway, Ireland
- Stephen Finn
  - Discipline of Histopathology, School of Medicine, Trinity College Dublin and Cancer Molecular Diagnostic Laboratory, Dublin, Ireland
- Pilib Ó Broin
  - School of Mathematical and Statistical Sciences, University of Galway, Galway, Ireland
3. Ponath F, Zhu Y, Vogel J. Transcriptome fine-mapping in Fusobacterium nucleatum reveals FoxJ, a new σE-dependent small RNA with unusual mRNA activation activity. mBio 2024; 15:e0353623. [PMID: 38436569 PMCID: PMC11005410 DOI: 10.1128/mbio.03536-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/19/2024] [Accepted: 01/23/2024] [Indexed: 03/05/2024] [Open Access]
Abstract
The oral commensal Fusobacterium nucleatum can spread to extra-oral sites, where it is associated with diverse pathologies, including pre-term birth and cancer. Due to the evolutionary distance of F. nucleatum to other model bacteria, we lack a deeper understanding of the RNA regulatory networks that allow this bacterium to adapt to its various niches. As a first step in that direction, we recently showed that F. nucleatum harbors a global stress response governed by the extracytoplasmic function sigma factor, σE, which displays a striking functional conservation with Proteobacteria and includes a noncoding arm in the form of a regulatory small RNA (sRNA), FoxI. To search for putative additional σE-dependent sRNAs, we comprehensively mapped the 5' and 3' ends of transcripts in the model strain ATCC 23726. This enabled the discovery of FoxJ, a ~156-nucleotide sRNA previously misannotated as the 5' untranslated region (UTR) of ylmH. FoxJ is tightly controlled by σE and activated by the same stress conditions as is FoxI. Both sRNAs act as mRNA repressors of the abundant porin FomA, but FoxJ also regulates genes that are distinct from the target suite of FoxI. Moreover, FoxJ differs from other σE-dependent sRNAs in that it also positively regulates genes at the post-transcriptional level. We provide preliminary evidence for a new mode of sRNA-mediated mRNA activation, which involves the targeting of intra-operonic terminators. Overall, our study provides an important resource through the comprehensive annotation of 5' and 3' UTRs in F. nucleatum and expands our understanding of the σE response in this evolutionarily distant bacterium.
IMPORTANCE: The oral microbe Fusobacterium nucleatum can colonize secondary sites, including cancer tissue, and likely deploys complex regulatory systems to adapt to these new environments. These systems are largely unknown, partly due to the phylogenetic distance of F. nucleatum to other model organisms. Previously, we identified a global stress response mediated by σE that displays functional conservation with the envelope stress response in Proteobacteria, comprising a coding and noncoding regulatory arm. Through global identification of transcriptional start and stop sites, we uncovered the small RNA (sRNA) FoxJ as a novel component of the noncoding arm of the σE response in F. nucleatum. Together with its companion sRNA FoxI, FoxJ post-transcriptionally modulates the synthesis of envelope proteins, revealing a conserved function for σE-dependent sRNAs between Fusobacteriota and Proteobacteria. Moreover, FoxJ activates the gene expression for several targets, which is a mode of regulation previously unseen in the noncoding arm of the σE response.
Affiliation(s)
- Falk Ponath
  - Helmholtz Institute for RNA-based Infection Research (HIRI), Helmholtz Centre for Infection Research (HZI), Würzburg, Germany
- Yan Zhu
  - Helmholtz Institute for RNA-based Infection Research (HIRI), Helmholtz Centre for Infection Research (HZI), Würzburg, Germany
- Jörg Vogel
  - Helmholtz Institute for RNA-based Infection Research (HIRI), Helmholtz Centre for Infection Research (HZI), Würzburg, Germany
  - RNA Biology Group, Institute for Molecular Infection Biology (IMIB), University of Würzburg, Würzburg, Germany
4. Ryan D, Bornet E, Prezza G, Alampalli SV, Franco de Carvalho T, Felchle H, Ebbecke T, Hayward RJ, Deutschbauer AM, Barquist L, Westermann AJ. An expanded transcriptome atlas for Bacteroides thetaiotaomicron reveals a small RNA that modulates tetracycline sensitivity. Nat Microbiol 2024; 9:1130-1144. [PMID: 38528147 PMCID: PMC10994844 DOI: 10.1038/s41564-024-01642-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/17/2023] [Accepted: 02/07/2024] [Indexed: 03/27/2024]
Abstract
Plasticity in gene expression allows bacteria to adapt to diverse environments. This is particularly relevant in the dynamic niche of the human intestinal tract; however, transcriptional networks remain largely unknown for gut-resident bacteria. Here we apply differential RNA sequencing (RNA-seq) and conventional RNA-seq to the model gut bacterium Bacteroides thetaiotaomicron to map transcriptional units and profile their expression levels across 15 in vivo-relevant growth conditions. We infer stress- and carbon source-specific transcriptional regulons and expand the annotation of small RNAs (sRNAs). Integrating this expression atlas with published transposon mutant fitness data, we predict conditionally important sRNAs. These include MasB, which downregulates tetracycline tolerance. Using MS2 affinity purification and RNA-seq, we identify a putative MasB target and assess its role in the context of the MasB-associated phenotype. These data, publicly available through the Theta-Base web browser (http://micromix.helmholtz-hiri.de/bacteroides/), constitute a valuable resource for the microbiome community.
Affiliation(s)
- Daniel Ryan
  - Helmholtz Institute for RNA-based Infection Research, Helmholtz Centre for Infection Research, Würzburg, Germany
- Elise Bornet
  - Helmholtz Institute for RNA-based Infection Research, Helmholtz Centre for Infection Research, Würzburg, Germany
- Gianluca Prezza
  - Helmholtz Institute for RNA-based Infection Research, Helmholtz Centre for Infection Research, Würzburg, Germany
- Shuba Varshini Alampalli
  - Helmholtz Institute for RNA-based Infection Research, Helmholtz Centre for Infection Research, Würzburg, Germany
- Taís Franco de Carvalho
  - Helmholtz Institute for RNA-based Infection Research, Helmholtz Centre for Infection Research, Würzburg, Germany
- Hannah Felchle
  - Helmholtz Institute for RNA-based Infection Research, Helmholtz Centre for Infection Research, Würzburg, Germany
  - Department of Radiation Oncology, Technical University of Munich, School of Medicine, Klinikum rechts der Isar, Munich, Germany
- Titus Ebbecke
  - Helmholtz Institute for RNA-based Infection Research, Helmholtz Centre for Infection Research, Würzburg, Germany
- Regan J Hayward
  - Helmholtz Institute for RNA-based Infection Research, Helmholtz Centre for Infection Research, Würzburg, Germany
- Adam M Deutschbauer
  - Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
  - Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, CA, USA
- Lars Barquist
  - Helmholtz Institute for RNA-based Infection Research, Helmholtz Centre for Infection Research, Würzburg, Germany
  - Faculty of Medicine, University of Würzburg, Würzburg, Germany
  - Department of Biology, University of Toronto Mississauga, Mississauga, Ontario, Canada
- Alexander J Westermann
  - Helmholtz Institute for RNA-based Infection Research, Helmholtz Centre for Infection Research, Würzburg, Germany
  - Institute of Molecular Infection Biology, University of Würzburg, Würzburg, Germany
  - Department of Microbiology, Biocentre, University of Würzburg, Würzburg, Germany
5. Lee H, Yu SH, Shim JE, Yong D. Use of a combined antibacterial synergy approach and the ANNOgesic tool to identify novel targets within the gene networks of multidrug-resistant Klebsiella pneumoniae. mSystems 2024; 9:e0087723. [PMID: 38349171 PMCID: PMC10949472 DOI: 10.1128/msystems.00877-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/26/2023] [Accepted: 01/13/2024] [Indexed: 03/20/2024] [Open Access]
Abstract
Since the 1980s, the development of new drug classes for the treatment of multidrug-resistant Klebsiella pneumoniae has become limited, highlighting the urgent need for novel antibiotics. To address this challenge, this study aimed to explore the synergistic interactions between chemical compounds and representative antibiotics, such as carbapenem and colistin. The primary objective of this study was not only to mitigate the adverse impact of multidrug-resistant K. pneumoniae on public health but also to establish a sustainable balance among humans, animals, and the environment. Phenotypical measurements were conducted using the broth microdilution technique to determine the drug sensitivity of bacterial strains. Additionally, a genotypical approach was employed, involving traditional RNA sequencing analysis to identify differentially expressed genes and the computational ANNOgesic tool to detect noncoding RNAs. This study revealed the existence of various pathways and regulatory RNA elements that form a functional network. These pathways, characterized by the expression of specific genes, contribute to the combined treatment effect and bacterial survival strategies. The connections between pathways are facilitated by regulatory RNA elements that respond to environmental changes. These findings suggest an adaptive response of bacteria to harsh environmental conditions.
IMPORTANCE: Noncoding RNAs were identified as key players in post-transcriptional regulation. Moreover, this study predicted the presence of novel small regulatory RNAs that interact with target genes, as well as the involvement of riboswitches and RNA thermometers in conjunction with associated genes. These findings will contribute to the discovery of potential antimicrobial therapeutic candidates. Overall, this study offers valuable insights into the synergistic effects of chemical compounds and antibiotics, highlighting the role of regulatory RNA elements in bacterial response and survival strategies. The identification of novel noncoding RNAs and their interactions with target genes, riboswitches, and RNA thermometers holds promise for the development of antimicrobial therapies.
Affiliation(s)
- Hyunsook Lee
  - Department of Laboratory Medicine and Research Institute of Bacterial Resistance, Yonsei University College of Medicine, Seoul, South Korea
  - Brain Korea 21 PLUS Project for Medical Science, Yonsei University College of Medicine, Seoul, South Korea
- Sung-Huan Yu
  - Institute of Precision Medicine, College of Medicine, National Sun Yat-sen University, Kaohsiung, Taiwan
  - School of Medicine, College of Medicine, National Sun Yat-sen University, Kaohsiung, Taiwan
- Jung Eun Shim
  - Bioinformatics Collaboration Unit, Yonsei Biomedical Research Institute, Yonsei University College of Medicine, Seoul, South Korea
- Dongeun Yong
  - Department of Laboratory Medicine and Research Institute of Bacterial Resistance, Yonsei University College of Medicine, Seoul, South Korea
6. García-Tomsig NI, Guedes-García SK, Jiménez-Zurdo JI. A Workflow for the Functional Characterization of Noncoding RNAs in Legume Symbiotic Bacteria. Methods Mol Biol 2024; 2751:179-203. [PMID: 38265717 DOI: 10.1007/978-1-0716-3617-6_12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/25/2024]
Abstract
Computational comparative genomics and, later, high-throughput transcriptome profiling (RNAseq) have uncovered a plethora of small noncoding RNA species (sRNAs) with potential regulatory roles in bacteria. A large fraction of sRNAs are differentially regulated in response to different biotic and abiotic stimuli and have the ability to fine-tune posttranscriptional reprogramming of gene expression through protein-assisted antisense interactions with trans-encoded target mRNAs. However, this level of gene regulation is still understudied in most non-model bacteria. Here, we compile experimental methods to detect expression, determine 5'/3'-ends, assess transcriptional regulation, generate mutants, and validate candidate target mRNAs of trans-acting sRNAs (trans-sRNAs) identified in the nitrogen-fixing α-rhizobium Sinorhizobium meliloti. The workflow, molecular tools, and methods are suited to investigate the function of newly identified base-pairing trans-sRNAs in phylogenetically related α-rhizobia.
Affiliation(s)
- Natalia I García-Tomsig
  - Structure, Dynamics and Function of Rhizobacterial Genomes (RhizoRNA Lab), Estación Experimental del Zaidín, Consejo Superior de Investigaciones Científicas (CSIC), Granada, Spain
- Sabina K Guedes-García
  - Structure, Dynamics and Function of Rhizobacterial Genomes (RhizoRNA Lab), Estación Experimental del Zaidín, Consejo Superior de Investigaciones Científicas (CSIC), Granada, Spain
- José I Jiménez-Zurdo
  - Structure, Dynamics and Function of Rhizobacterial Genomes (RhizoRNA Lab), Estación Experimental del Zaidín, Consejo Superior de Investigaciones Científicas (CSIC), Granada, Spain
7. Ferrara S, Bertoni G. Genome-Scale Analysis of the Structure and Function of RNA Pathways and Networks in Pseudomonas aeruginosa. Methods Mol Biol 2024; 2721:183-195. [PMID: 37819523 DOI: 10.1007/978-1-0716-3473-8_13] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/13/2023]
Abstract
In recent years, several genome-wide approaches based on RNA sequencing (RNA-seq) have been developed. These methods allow a comprehensive and dynamic view of the structure and function of the multi-layered RNA pathways and networks. Many of these approaches, including the promising one of single-cell transcriptome analysis, have been successfully applied to Pseudomonas aeruginosa. However, we are only at the beginning because only a few surrounding conditions have been considered. Here, we aim to illustrate the different types of approaches based on RNA-seq that will lead us in the future to a better understanding of the dynamics of RNA biology in P. aeruginosa.
Affiliation(s)
- Silvia Ferrara
  - Department of Biosciences, Università degli Studi di Milano, Milan, Milano, Italy
- Giovanni Bertoni
  - Department of Biosciences, Università degli Studi di Milano, Milan, Milano, Italy
8. Tai CH, Hinton D, Yu SH. Discovering Novel Bacterial Small RNA by RNA-seq Analysis Toolkit ANNOgesic. Methods Mol Biol 2024; 2741:35-69. [PMID: 38217648 DOI: 10.1007/978-1-0716-3565-0_4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/15/2024]
Abstract
ANNOgesic is an RNA-seq analysis pipeline that can detect sRNAs and many other genomic features in bacteria and archaea. In addition to listing sRNA candidates, ANNOgesic also generates various formats of data files for visual examination and downstream experimental design. Based on validations from previous studies, the sRNA predictions are accurate and reliable. In this chapter, we outline the sRNA detection algorithm, important parameters used, step-by-step execution, and data interpretation with a B. pertussis study as an example. Following those procedures, novel sRNA can be revealed by ANNOgesic.
Affiliation(s)
- Chin-Hsien Tai
  - Laboratory of Molecular Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
- Deborah Hinton
  - Gene Expression and Regulation Section, Laboratory of Biochemistry and Genetics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD, USA
- Sung-Huan Yu
  - Institute of Precision Medicine, National Sun Yat-sen University, Kaohsiung, Taiwan
  - School of Medicine, College of Medicine, National Sun Yat-sen University, Kaohsiung, Taiwan
9. Waldburger L, Thompson MG, Weisberg AJ, Lee N, Chang JH, Keasling JD, Shih PM. Transcriptome architecture of the three main lineages of agrobacteria. mSystems 2023; 8:e0033323. [PMID: 37477440 PMCID: PMC10469942 DOI: 10.1128/msystems.00333-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/04/2023] [Accepted: 06/15/2023] [Indexed: 07/22/2023] [Open Access]
Abstract
Agrobacteria are a diverse, polyphyletic group of prokaryotes with multipartite genomes capable of transferring DNA into the genomes of host plants, making them an essential tool in plant biotechnology. Despite their utility in plant transformation, genome-wide transcriptional regulation is not well understood across the three main lineages of agrobacteria. Transcription start sites (TSSs) are a necessary component of gene expression and regulation. In this study, we used differential RNA-seq and a TSS identification algorithm, optimized on manually annotated TSSs and then validated with existing TSSs, to identify thousands of TSSs with nucleotide resolution for representatives of each lineage. We extend upon the 356 TSSs previously reported in Agrobacterium fabrum C58 by identifying 1,916 TSSs. In addition, we completed genomes and phenotyping of Rhizobium rhizogenes C16/80 and Allorhizobium vitis T60/94, identifying 2,650 and 2,432 TSSs, respectively. Parameter optimization was crucial for an accurate, high-resolution view of genome and transcriptional dynamics, highlighting the importance of algorithm optimization in genome-wide TSS identification and genomics at large. The optimized algorithm reduced the number of TSSs identified internal and antisense to the coding sequence on average by 90.5% and 91.9%, respectively. Comparison of TSS conservation between orthologs of the three lineages revealed differences in cell cycle regulation of ctrA as well as divergence of transcriptional regulation of chemotaxis-related genes when grown in conditions that simulate the plant environment. These results provide a framework to elucidate the mechanistic basis and evolution of pathology across the three main lineages of agrobacteria.
IMPORTANCE: Transcription start sites (TSSs) are fundamental for understanding gene expression and regulation. Agrobacteria, a group of prokaryotes with the ability to transfer DNA into the genomes of host plants, are widely used in plant biotechnology. However, the genome-wide transcriptional regulation of agrobacteria is not well understood, especially in less-studied lineages. Differential RNA-seq and an optimized algorithm enabled identification of thousands of TSSs with nucleotide resolution for representatives of each lineage. The results of this study provide a framework for elucidating the mechanistic basis and evolution of pathology across the three main lineages of agrobacteria. The optimized algorithm also highlights the importance of parameter optimization in genome-wide TSS identification and genomics at large.
Affiliation(s)
- Lucas Waldburger
  - Department of Bioengineering, University of California, Berkeley, California, USA
  - Joint BioEnergy Institute, Emeryville, California, USA
  - Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
  - Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
- Mitchell G. Thompson
  - Joint BioEnergy Institute, Emeryville, California, USA
  - Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
- Alexandra J. Weisberg
  - Department of Botany and Plant Pathology, Oregon State University, Corvallis, Oregon, USA
- Namil Lee
  - Joint BioEnergy Institute, Emeryville, California, USA
  - Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
  - Department of Chemical and Biomolecular Engineering, University of California, Berkeley, California, USA
- Jeff H. Chang
  - Department of Botany and Plant Pathology, Oregon State University, Corvallis, Oregon, USA
- Jay D. Keasling
  - Joint BioEnergy Institute, Emeryville, California, USA
  - Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
  - Department of Chemical and Biomolecular Engineering, University of California, Berkeley, California, USA
  - Institute for Quantitative Biosciences, University of California, Berkeley, California, USA
  - Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kongens Lyngby, Denmark
  - Center for Synthetic Biochemistry, Institute for Synthetic Biology, Shenzhen Institutes for Advanced Technologies, Shenzhen, China
- Patrick M. Shih
  - Joint BioEnergy Institute, Emeryville, California, USA
  - Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
  - Department of Plant and Microbial Biology, University of California, Berkeley, California, USA
10. Brenes-Álvarez M, Vioque A, Muro-Pastor AM. Nitrogen-regulated antisense transcription in the adaptation to nitrogen deficiency in Nostoc sp. PCC 7120. PNAS Nexus 2023; 2:pgad187. [PMID: 37361547 PMCID: PMC10287535 DOI: 10.1093/pnasnexus/pgad187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/13/2023] [Revised: 05/24/2023] [Accepted: 05/30/2023] [Indexed: 06/28/2023]
Abstract
Transcriptomic analyses using high-throughput methods have revealed abundant antisense transcription in bacteria. Antisense transcription is often due to the overlap of mRNAs with long 5' or 3' regions that extend beyond the coding sequence. In addition, antisense RNAs that do not contain any coding sequence are also observed. Nostoc sp. PCC 7120 is a filamentous cyanobacterium that, under nitrogen limitation, behaves as a multicellular organism with division of labor between two different cell types that depend on each other, the vegetative CO2-fixing cells and the nitrogen-fixing heterocysts. The differentiation of heterocysts depends on the global nitrogen regulator NtcA and requires the specific regulator HetR. To identify antisense RNAs potentially involved in heterocyst differentiation, we assembled the Nostoc transcriptome using RNA-seq analysis of cells subjected to nitrogen limitation (9 or 24 h after nitrogen removal) in combination with a genome-wide set of transcriptional start sites and a prediction of transcriptional terminators. Our analysis resulted in the definition of a transcriptional map that includes >4,000 transcripts, 65% of which contain regions in antisense orientation to other transcripts. In addition to overlapping mRNAs, we identified nitrogen-regulated noncoding antisense RNAs transcribed from NtcA- or HetR-dependent promoters. As an example of this last category, we further analyzed an antisense RNA (as_gltA) of the gene encoding citrate synthase and showed that transcription of as_gltA takes place specifically in heterocysts. Since the overexpression of as_gltA reduces citrate synthase activity, this antisense RNA could eventually contribute to the metabolic remodeling that occurs during the differentiation of vegetative cells into heterocysts.
Affiliation(s)
- Agustín Vioque
  - Instituto de Bioquímica Vegetal y Fotosíntesis, Consejo Superior de Investigaciones Científicas and Universidad de Sevilla, Américo Vespucio 49, 41092 Sevilla, Spain
11. Geissler AS, Fehler AO, Poulsen LD, González-Tortuero E, Kallehauge TB, Alkan F, Anthon C, Seemann SE, Rasmussen MD, Breüner A, Hjort C, Vinther J, Gorodkin J. CRISPRi screen for enhancing heterologous α-amylase yield in Bacillus subtilis. J Ind Microbiol Biotechnol 2023; 50:kuac028. [PMID: 36564025 PMCID: PMC9936203 DOI: 10.1093/jimb/kuac028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/21/2022] [Accepted: 12/19/2022] [Indexed: 12/25/2022]
Abstract
Yield improvements in cell factories can potentially be obtained by fine-tuning the regulatory mechanisms for gene candidates. In pursuit of such candidates, we performed RNA-sequencing of two α-amylase producing Bacillus strains and predicted hundreds of putative novel non-coding transcribed regions. Surprisingly, we found among hundreds of non-coding and structured RNA candidates that non-coding genomic regions are proportionally undergoing the highest changes in expression during fermentation. Since these classes of RNA are also understudied, we targeted the corresponding genomic regions with CRISPRi knockdown to test for any potential impact on the yield. From differential expression analysis, we selected 53 non-coding candidates. Because CRISPRi knockdowns target both the sense and the antisense strand, the CRISPRi experiment cannot link causes for yield changes to the sense or antisense disruption. Nevertheless, we observed several instances with strong changes in enzyme yield. The knockdown targeting the genomic region for a putative antisense RNA of the 3' UTR of the skfA-skfH operon led to a 21% increase in yield. In contrast, the knockdown targeting the genomic regions of putative antisense RNAs of the cytochrome c oxidase subunit 1 (ctaD), the sigma factor sigH, and the uncharacterized gene yhfT decreased yields by 31 to 43%.
Affiliation(s)
- Adrian Sven Geissler
- Center for non-coding RNA in Technology and Health, Department of Veterinary and Animal Sciences, University of Copenhagen, 1870 Frederiksberg, Denmark
| | - Annaleigh Ohrt Fehler
- Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, 2200 Copenhagen, Denmark
| | - Line Dahl Poulsen
- Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, 2200 Copenhagen, Denmark
| | - Enrique González-Tortuero
- Center for non-coding RNA in Technology and Health, Department of Veterinary and Animal Sciences, University of Copenhagen, 1870 Frederiksberg, Denmark
| | | | - Ferhat Alkan
- Center for non-coding RNA in Technology and Health, Department of Veterinary and Animal Sciences, University of Copenhagen, 1870 Frederiksberg, Denmark
| | - Christian Anthon
- Center for non-coding RNA in Technology and Health, Department of Veterinary and Animal Sciences, University of Copenhagen, 1870 Frederiksberg, Denmark
| | - Stefan Ernst Seemann
- Center for non-coding RNA in Technology and Health, Department of Veterinary and Animal Sciences, University of Copenhagen, 1870 Frederiksberg, Denmark
| | | | | | | | - Jeppe Vinther
- Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, 2200 Copenhagen, Denmark
| | - Jan Gorodkin
- Center for non-coding RNA in Technology and Health, Department of Veterinary and Animal Sciences, University of Copenhagen, 1870 Frederiksberg, Denmark
| |
|
12
|
Ryan D, Bornet E, Prezza G, Alampalli SV, de Carvalho TF, Felchle H, Ebbecke T, Hayward R, Deutschbauer AM, Barquist L, Westermann AJ. An integrated transcriptomics-functional genomics approach reveals a small RNA that modulates Bacteroides thetaiotaomicron sensitivity to tetracyclines. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.16.528795. [PMID: 36824877 PMCID: PMC9949090 DOI: 10.1101/2023.02.16.528795] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Indexed: 02/18/2023]
Abstract
Gene expression plasticity allows bacteria to adapt to diverse environments, tie their metabolism to available nutrients, and cope with stress. This is particularly relevant in a niche as dynamic and hostile as the human intestinal tract, yet transcriptional networks remain largely unknown in gut Bacteroides spp. Here, we map transcriptional units and profile their expression levels in Bacteroides thetaiotaomicron over a suite of 15 defined experimental conditions that are relevant in vivo, such as variation of temperature, pH, and oxygen tension, exposure to antibiotic stress, and growth on simple carbohydrates or on host mucin-derived glycans. Thereby, we infer stress- and carbon source-specific transcriptional regulons, including conditional expression of capsular polysaccharides and polysaccharide utilization loci, and expand the annotation of small regulatory RNAs (sRNAs) in this organism. Integrating this comprehensive expression atlas with transposon mutant fitness data, we identify conditionally important sRNAs. One example is MasB, whose inactivation led to increased bacterial tolerance of tetracyclines. Using MS2 affinity purification coupled with RNA sequencing, we predict targets of this sRNA and discuss their potential role in the context of the MasB-associated phenotype. Together, this transcriptomic compendium in combination with functional sRNA genomics, publicly available through a new iteration of the 'Theta-Base' web browser (www.helmholtz-hiri.de/en/datasets/bacteroides-v2), constitutes a valuable resource for the microbiome and sRNA research communities alike.
|
13
|
Alipanahi R, Safari L, Khanteymoori A. CRISPR genome editing using computational approaches: A survey. FRONTIERS IN BIOINFORMATICS 2023; 2:1001131. [PMID: 36710911 PMCID: PMC9875887 DOI: 10.3389/fbinf.2022.1001131] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Received: 07/22/2022] [Accepted: 12/19/2022] [Indexed: 01/13/2023] Open
Abstract
Clustered regularly interspaced short palindromic repeats (CRISPR)-based gene editing has been widely used in various cell types and organisms. To make genome editing with CRISPR far more precise and practical, we must concentrate on the design of optimal gRNA and the selection of appropriate Cas enzymes. Numerous computational tools have been created in recent years to help researchers design the best gRNA for CRISPR research. There are two approaches for designing an appropriate gRNA sequence (one that targets the desired sites with high precision): experimental and prediction-based approaches. It is essential to reduce off-target sites when designing an optimal gRNA. Here we review both traditional and machine learning-based approaches for designing an appropriate gRNA sequence and predicting off-target sites. In this review, we summarize the key characteristics of all available tools (as far as possible) and compare them. Machine learning-based tools and web servers are expected to become the most effective and reliable methods for predicting on-target and off-target activities of CRISPR in the future. However, these predictions are not yet very precise, and the performance of these algorithms, especially deep learning ones, depends on the amount of data used during the training phase. As more features are discovered and incorporated into these models, predictions should become more in line with experimental observations. We must concentrate on the creation of ideal gRNA and the choice of suitable Cas enzymes in order to make genome editing with CRISPR far more accurate and feasible.
Affiliation(s)
| | - Leila Safari
- Department of Computer Engineering, University of Zanjan, Zanjan, Iran; correspondence: Leila Safari
| | | |
|
14
|
Wei G, Li S, Ye S, Wang Z, Zarringhalam K, He J, Wang W, Shao Z. High-Resolution Small RNAs Landscape Provides Insights into Alkane Adaptation in the Marine Alkane-Degrader Alcanivorax dieselolei B-5. Int J Mol Sci 2022; 23:ijms232415995. [PMID: 36555635 PMCID: PMC9788540 DOI: 10.3390/ijms232415995] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/10/2022] [Revised: 12/07/2022] [Accepted: 12/07/2022] [Indexed: 12/23/2022] Open
Abstract
Alkanes are widespread in the ocean, and Alcanivorax is one of the most ubiquitous alkane-degrading bacteria in the marine ecosystem. Small RNAs (sRNAs) are usually at the heart of regulatory pathways, but sRNA-mediated alkane metabolic adaptability remains largely unknown because sRNAs are difficult to identify. Here, differential RNA sequencing (dRNA-seq) modified with a size selection (~50-nt to 500-nt) strategy was used to generate high-resolution sRNA profiling in the model species Alcanivorax dieselolei B-5 under alkane (n-hexadecane) and non-alkane (acetate) conditions. As a result, we identified 549 sRNA candidates at single-nucleotide resolution of 5'-ends, 63.4% of which have transcription start sites (TSSs) and 36.6% of which have processing sites (PSSs) at the 5'-ends. These sRNAs originate from almost any location in the genome, including intragenic (65.8%), antisense (20.6%) and intergenic (6.2%) regions, and RNase E may function in the maturation of sRNAs. Most sRNAs are locally distributed across the 15 reference genomes of Alcanivorax, and only 7.5% of sRNAs are broadly conserved in this genus. Expression responses to the alkane of several core conserved sRNAs, including 6S RNA, M1 RNA and tmRNA, indicate that they may participate in alkane metabolism and result in more active global transcription, RNA processing and stress mitigation. Two novel CsrA-related sRNAs are identified, which may be involved in the translational activation of alkane metabolism-related genes by sequestering the global repressor CsrA. The relationships of sRNAs with the characterized genes of alkane sensing (ompS), chemotaxis (mcp, cheR, cheW2), transporting (ompT1, ompT2, ompT3) and hydroxylation (alkB1, alkB2, almA) were established based on the genome-wide predicted sRNA-mRNA interactions. Overall, the sRNA landscape lays the ground for uncovering cryptic regulation in this critical marine bacterium, in which both the core and species-specific sRNAs are implicated in alkane-adaptive metabolism.
Affiliation(s)
- Guangshan Wei
- School of Marine Sciences, Sun Yat-Sen University, Zhuhai 519082, China
- Key Laboratory of Marine Genetic Resources, Third Institute of Oceanography, Ministry of Natural Resources, Xiamen 361005, China
- State Key Laboratory Breeding Base of Marine Genetic Resources, Key Laboratory of Marine Genetic Resources of Fujian Province, Xiamen 361005, China
- Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai 519000, China
| | - Sujie Li
- Key Laboratory of Marine Genetic Resources, Third Institute of Oceanography, Ministry of Natural Resources, Xiamen 361005, China
- State Key Laboratory Breeding Base of Marine Genetic Resources, Key Laboratory of Marine Genetic Resources of Fujian Province, Xiamen 361005, China
| | - Sida Ye
- Department of Mathematics, University of Massachusetts Boston, Boston, MA 02125, USA
- Center for Personalized Cancer Therapy, University of Massachusetts Boston, Boston, MA 02125, USA
| | - Zining Wang
- Key Laboratory of Marine Genetic Resources, Third Institute of Oceanography, Ministry of Natural Resources, Xiamen 361005, China
- State Key Laboratory Breeding Base of Marine Genetic Resources, Key Laboratory of Marine Genetic Resources of Fujian Province, Xiamen 361005, China
| | - Kourosh Zarringhalam
- Department of Mathematics, University of Massachusetts Boston, Boston, MA 02125, USA
- Center for Personalized Cancer Therapy, University of Massachusetts Boston, Boston, MA 02125, USA
| | - Jianguo He
- School of Marine Sciences, Sun Yat-Sen University, Zhuhai 519082, China
- Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai 519000, China
| | - Wanpeng Wang
- Key Laboratory of Marine Genetic Resources, Third Institute of Oceanography, Ministry of Natural Resources, Xiamen 361005, China
- State Key Laboratory Breeding Base of Marine Genetic Resources, Key Laboratory of Marine Genetic Resources of Fujian Province, Xiamen 361005, China
- Correspondence: (W.W.); (Z.S.)
| | - Zongze Shao
- School of Marine Sciences, Sun Yat-Sen University, Zhuhai 519082, China
- Key Laboratory of Marine Genetic Resources, Third Institute of Oceanography, Ministry of Natural Resources, Xiamen 361005, China
- State Key Laboratory Breeding Base of Marine Genetic Resources, Key Laboratory of Marine Genetic Resources of Fujian Province, Xiamen 361005, China
- Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai 519000, China
- Correspondence: (W.W.); (Z.S.)
| |
|
15
|
Chihara K, Gerovac M, Hör J, Vogel J. Global profiling of the RNA and protein complexes of Escherichia coli by size exclusion chromatography followed by RNA sequencing and mass spectrometry (SEC-seq). RNA (NEW YORK, N.Y.) 2022; 29:rna.079439.122. [PMID: 36328526 PMCID: PMC9808575 DOI: 10.1261/rna.079439.122] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 09/02/2022] [Accepted: 10/27/2022] [Indexed: 06/16/2023]
Abstract
New methods for the global identification of RNA-protein interactions have led to greater recognition of the abundance and importance of RNA-binding proteins (RBPs) in bacteria. Here, we expand this tool kit by developing SEC-seq, a method based on a similar concept as the established Grad-seq approach. In Grad-seq, cellular RNA and protein complexes of a bacterium of interest are separated in a glycerol gradient, followed by high-throughput RNA-sequencing and mass spectrometry analyses of individual gradient fractions. New RNA-protein complexes are predicted based on the similarity of their elution profiles. In SEC-seq, we have replaced the glycerol gradient with separation by size exclusion chromatography, which shortens operation times and offers greater potential for automation. Applying SEC-seq to Escherichia coli, we find that the method provides a higher resolution than Grad-seq in the lower molecular weight range up to ~500 kDa. This is illustrated by the ability of SEC-seq to resolve two distinct, but similarly sized complexes of the global translational repressor CsrA with either of its antagonistic small RNAs, CsrB and CsrC. We also characterized changes in the SEC-seq profiles of the small RNA MicA upon deletion of its RNA chaperones Hfq and ProQ and investigated the redistribution of these two proteins upon RNase treatment. Overall, we demonstrate that SEC-seq is a tractable and reproducible method for the global profiling of bacterial RNA-protein complexes that offers the potential to discover yet-unrecognized associations between bacterial RNAs and proteins.
Affiliation(s)
- Kotaro Chihara
- Helmholtz Institute for RNA-based Infection Research, Würzburg, Germany
| | | | - Jens Hör
- Weizmann Institute, Rehovot, Israel
| | | |
|
16
|
Li S, Lam J, Souliotis L, Alam MT, Constantinidou C. Posttranscriptional Regulation in Response to Different Environmental Stresses in Campylobacter jejuni. Microbiol Spectr 2022; 10:e0020322. [PMID: 35678555 PMCID: PMC9241687 DOI: 10.1128/spectrum.00203-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/25/2022] [Accepted: 05/10/2022] [Indexed: 11/20/2022] Open
Abstract
The survival strategies that Campylobacter jejuni (C. jejuni) employs throughout its transmission and infection life cycles remain largely elusive. Specifically, there is a lack of understanding about the posttranscriptional regulation of stress adaptations resulting from small noncoding RNAs (sRNAs). Published C. jejuni sRNAs have been discovered in specific conditions but with limited insights into their biological activities. Many more sRNAs are yet to be discovered as they may be condition-dependent. Here, we have generated transcriptomic data from 21 host- and transmission-relevant conditions. The data uncovered transcription start sites, expression patterns and posttranscriptional regulation during various stress conditions. This data set helped predict a list of putative sRNAs. We further explored the sRNAs' biological functions by integrating differential gene expression analysis, coexpression analysis, and genome-wide sRNA target prediction. The results showed that the C. jejuni gene expression was influenced primarily by nutrient deprivation and food storage conditions. Further exploration revealed a putative sRNA (CjSA21) that targeted tlp1 to 4 under food processing conditions. tlp1 to 4 are transcripts that encode methyl-accepting chemotaxis proteins (MCPs), which are responsible for chemosensing. These results suggested CjSA21 inhibits chemotaxis and promotes survival under food processing conditions. This study presents the broader research community with a comprehensive data set and highlights a novel sRNA as a potential chemotaxis inhibitor. IMPORTANCE The foodborne pathogen C. jejuni is a significant challenge for the global health care system. It is crucial to investigate C. jejuni posttranscriptional regulation by small RNAs (sRNAs) in order to understand how it adapts to different stress conditions. However, limited data are available for investigating sRNA activity under stress. In this study, we generate gene expression data of C. jejuni under 21 stress conditions. Our data analysis indicates that one of the novel sRNAs mediates the adaptation to food processing conditions. Results from our work shed light on the posttranscriptional regulation of C. jejuni and identify an sRNA associated with food safety.
Affiliation(s)
- Stephen Li
- Warwick Medical School, University of Warwick, Coventry, United Kingdom
| | - Jenna Lam
- Warwick Medical School, University of Warwick, Coventry, United Kingdom
| | | | - Mohammad Tauqeer Alam
- Department of Biology, College of Science, United Arab Emirates University, Al-Ain, United Arab Emirates
| | | |
|
17
|
Goldmann O, Sauerwein T, Molinari G, Rohde M, Förstner KU, Medina E. Cytosolic Sensing of Intracellular Staphylococcus aureus by Mast Cells Elicits a Type I IFN Response That Enhances Cell-Autonomous Immunity. JOURNAL OF IMMUNOLOGY (BALTIMORE, MD. : 1950) 2022; 208:1675-1685. [PMID: 35321877 DOI: 10.4049/jimmunol.2100622] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 06/24/2021] [Accepted: 01/20/2022] [Indexed: 06/14/2023]
Abstract
Strategically located at mucosal sites, mast cells are instrumental in sensing invading pathogens and modulating the quality of the ensuing immune responses depending on the nature of the infecting microbe. It is believed that mast cells produce type I IFN (IFN-I) in response to viruses, but not to bacterial infections, because of the incapacity of bacterial pathogens to internalize within mast cells, where signaling cascades leading to IFN-I production are generated. However, we have previously reported that, in contrast with other bacterial pathogens, Staphylococcus aureus can internalize into mast cells and therefore could trigger a unique response. In this study, we have investigated the molecular cross-talk between internalized S. aureus and the human mast cells HMC-1 using a dual RNA sequencing approach. We found that a proportion of internalized S. aureus underwent profound transcriptional reprogramming within HMC-1 cells to adapt to the nutrients and stress encountered in the intracellular environment and remained viable. HMC-1 cells, in turn, recognized intracellular S. aureus via cGMP-AMP synthase-STING-TANK-binding kinase 1 signaling pathway, leading to the production of IFN-I. Bacterial internalization and viability were crucial for IFN-I induction because inhibition of S. aureus internalization or infection with heat-killed bacteria completely prevented the production of IFN-I by HMC-1 cells. Feeding back in an autocrine manner in S. aureus-harboring HMC-1 cells and in a paracrine manner in noninfected neighboring HMC-1 cells, IFN-I promoted a cell-autonomous antimicrobial state by inducing the transcription of IFN-I-stimulated genes. This study provides unprecedented evidence of the capacity of mast cells to produce IFN-I in response to a bacterial pathogen.
Affiliation(s)
- Oliver Goldmann
- Infection Immunology Research Group, Helmholtz Centre for Infection Research, 38124 Braunschweig, Germany
| | - Till Sauerwein
- Institute for Molecular Infection Biology, University of Würzburg, 97080 Würzburg, Germany
- ZB MED-Information Centre for Life Science, 50931 Cologne, Germany
| | - Gabriella Molinari
- Central Facility for Microscopy, Helmholtz Centre for Infection Research, 38124 Braunschweig, Germany
| | - Manfred Rohde
- Central Facility for Microscopy, Helmholtz Centre for Infection Research, 38124 Braunschweig, Germany
| | - Konrad U Förstner
- Institute for Molecular Infection Biology, University of Würzburg, 97080 Würzburg, Germany
- ZB MED-Information Centre for Life Science, 50931 Cologne, Germany
- TH Köln, University of Applied Sciences, Faculty of Information Science and Communication Studies, 50678 Cologne, Germany
| | - Eva Medina
- Infection Immunology Research Group, Helmholtz Centre for Infection Research, 38124 Braunschweig, Germany
| |
|
18
|
Weidenbach K, Gutt M, Cassidy L, Chibani C, Schmitz RA. Small Proteins in Archaea, a Mainly Unexplored World. J Bacteriol 2022; 204:e0031321. [PMID: 34543104 PMCID: PMC8765429 DOI: 10.1128/jb.00313-21] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Indexed: 01/12/2023] Open
Abstract
In recent years, increasing numbers of small proteins have moved into the focus of science. Small proteins have been identified and characterized in all three domains of life, but the majority remain functionally uncharacterized, lack secondary structure, and exhibit limited evolutionary conservation. While quite a few have already been described for bacteria and eukaryotic organisms, the number of known and functionally analyzed archaeal small proteins is still very limited. In this review, we compile the current state of research, show strategies for systematic approaches for global identification of small archaeal proteins, and address selected functionally characterized examples. In addition, using one archaeon as an example, we document the development and optimization of tools to identify small proteins using genome-wide approaches.
Affiliation(s)
- Katrin Weidenbach
- Institute for General Microbiology, Christian Albrechts University, Kiel, Germany
| | - Miriam Gutt
- Institute for General Microbiology, Christian Albrechts University, Kiel, Germany
| | - Liam Cassidy
- AG Proteomics & Bioanalytics, Institute for Experimental Medicine, Christian Albrechts University, Kiel, Germany
| | - Cynthia Chibani
- Institute for General Microbiology, Christian Albrechts University, Kiel, Germany
| | - Ruth A. Schmitz
- Institute for General Microbiology, Christian Albrechts University, Kiel, Germany
| |
|
19
|
Stiens J, Arnvig KB, Kendall SL, Nobeli I. Challenges in defining the functional, non-coding, expressed genome of members of the Mycobacterium tuberculosis complex. Mol Microbiol 2021; 117:20-31. [PMID: 34894010 DOI: 10.1111/mmi.14862] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 09/14/2021] [Revised: 12/08/2021] [Accepted: 12/09/2021] [Indexed: 12/14/2022]
Abstract
A definitive transcriptome atlas for the non-coding expressed elements of the members of the Mycobacterium tuberculosis complex (MTBC) does not exist. Incomplete lists of non-coding transcripts can be obtained for some of the reference genomes (e.g., M. tuberculosis H37Rv) but to what extent these transcripts have homologues in closely related species or even strains is not clear. This has implications for the analysis of transcriptomic data; non-coding parts of the transcriptome are often ignored in the absence of formal, reliable annotation. Here, we review the state of our knowledge of non-coding RNAs in pathogenic mycobacteria, emphasizing the disparities in the information included in commonly used databases. We then proceed to review ways of combining computational solutions for predicting the non-coding transcriptome with experiments that can help refine and confirm these predictions.
Affiliation(s)
- Jennifer Stiens
- Institute of Structural and Molecular Biology, Biological Sciences, Birkbeck, University of London, London, UK
| | - Kristine B Arnvig
- Institute of Structural and Molecular Biology, Division of Biosciences, University College London, London, UK
| | - Sharon L Kendall
- Centre for Emerging, Endemic and Exotic Diseases, Pathobiology and Population Sciences, Royal Veterinary College, Hatfield, UK
| | - Irene Nobeli
- Institute of Structural and Molecular Biology, Biological Sciences, Birkbeck, University of London, London, UK
| |
|
20
|
Chevez-Guardado R, Peña-Castillo L. Promotech: a general tool for bacterial promoter recognition. Genome Biol 2021; 22:318. [PMID: 34789306 PMCID: PMC8597233 DOI: 10.1186/s13059-021-02514-9] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Received: 10/31/2020] [Accepted: 10/11/2021] [Indexed: 12/14/2022] Open
Abstract
Promoters are genomic regions where the transcription machinery binds to initiate the transcription of specific genes. Computational tools for identifying bacterial promoters have been around for decades. However, most of these tools were designed to recognize promoters in one or a few bacterial species. Here, we present Promotech, a machine-learning-based method for promoter recognition in a wide range of bacterial species. We compare Promotech's performance with the performance of five other promoter prediction methods. Promotech outperforms these other programs in terms of area under the precision-recall curve (AUPRC) or precision at the same level of recall. Promotech is available at https://github.com/BioinformaticsLabAtMUN/PromoTech.
Affiliation(s)
- Ruben Chevez-Guardado
- Department of Computer Science, Memorial University of Newfoundland, 230 Elizabeth Ave, St. John's, Newfoundland, A1C 5S7, Canada
| | - Lourdes Peña-Castillo
- Department of Computer Science, Memorial University of Newfoundland, 230 Elizabeth Ave, St. John's, Newfoundland, A1C 5S7, Canada
- Department of Biology, Memorial University of Newfoundland, 230 Elizabeth Ave, St. John's, Newfoundland, A1C 5S7, Canada
| |
|
21
|
Oogai Y, Nakata M. Small regulatory RNAs of oral streptococci and periodontal bacteria. JAPANESE DENTAL SCIENCE REVIEW 2021; 57:209-216. [PMID: 34745393 PMCID: PMC8551640 DOI: 10.1016/j.jdsr.2021.09.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/29/2021] [Revised: 09/20/2021] [Accepted: 09/24/2021] [Indexed: 11/27/2022] Open
Abstract
Small regulatory RNAs (sRNAs) belong to a family of non-coding RNAs, many of which regulate expression of genes via interaction with mRNA. The recent popularity of high-throughput next generation sequencers has yielded abundant sRNA-related data, including sRNAs of several different oral bacterial species. Some sRNA candidates have been validated in terms of their expression and interaction with target mRNAs. Since the oral cavity is an environment constantly exposed to various stimuli, such as fluctuations in temperature, pH, and osmotic pressure, as well as changes in nutrient availability, oral bacteria require rapid control of gene expression for adaptation to such diverse conditions, and regulation via interactions of sRNAs with mRNA provides advantages for rapid adaptation. This review summarizes methods effective for identification and validation of sRNAs, as well as sRNAs identified to be associated with oral bacterial species, including cariogenic and periodontal pathogens, together with their confirmed and putative target genes.
Affiliation(s)
- Yuichi Oogai
- Department of Oral Microbiology, Kagoshima University Graduate School of Medical and Dental Sciences, Kagoshima, 890-8544, Japan
| | - Masanobu Nakata
- Department of Oral Microbiology, Kagoshima University Graduate School of Medical and Dental Sciences, Kagoshima, 890-8544, Japan
| |
|
22
|
Identification of BvgA-Dependent and BvgA-Independent Small RNAs (sRNAs) in Bordetella pertussis Using the Prokaryotic sRNA Prediction Toolkit ANNOgesic. Microbiol Spectr 2021; 9:e0004421. [PMID: 34550019 PMCID: PMC8557813 DOI: 10.1128/spectrum.00044-21] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Indexed: 11/20/2022] Open
Abstract
Noncoding small RNAs (sRNAs) are crucial for the posttranscriptional regulation of gene expression in all organisms and are known to be involved in the regulation of bacterial virulence. In the human pathogen Bordetella pertussis, which causes whooping cough, virulence is controlled primarily by the master two-component system BvgA (response regulator)/BvgS (sensor kinase). In this system, BvgA is phosphorylated (Bvg+ mode) or nonphosphorylated (Bvg- mode), with global transcriptional differences between the two. B. pertussis also carries the bacterial sRNA chaperone Hfq, which has previously been shown to be required for virulence. Here, we conducted transcriptomic analyses to identify possible B. pertussis sRNAs and to determine their BvgAS dependence using transcriptome sequencing (RNA-seq) and the prokaryotic sRNA prediction program ANNOgesic. We identified 143 possible candidates (25 Bvg+ mode specific and 53 Bvg- mode specific), of which 90 were previously unreported. Northern blot analyses confirmed all 10 of the ANNOgesic candidates that we tested. Homology searches demonstrated that 9 of the confirmed sRNAs are highly conserved among B. pertussis, Bordetella parapertussis, and Bordetella bronchiseptica, with one that also has homologues in other species of the Alcaligenaceae family. Using coimmunoprecipitation with a B. pertussis FLAG-tagged Hfq, we demonstrated that 3 of the sRNAs interact directly with Hfq, which is the first identification of sRNA binding to B. pertussis Hfq. Our study demonstrates that ANNOgesic is a highly useful tool for the identification of sRNAs in this system and that its combination with molecular techniques is a successful way to identify various BvgAS-dependent and Hfq-binding sRNAs. IMPORTANCE Noncoding small RNAs (sRNAs) are crucial for posttranscriptional regulation of gene expression in all organisms and are known to be involved in the regulation of bacterial virulence. We have investigated the presence of sRNAs in the obligate human pathogen B. pertussis, using transcriptome sequencing (RNA-seq) and the recently developed prokaryotic sRNA search program ANNOgesic. This analysis has identified 143 sRNA candidates (90 previously unreported). We have classified their dependence on the B. pertussis two-component system required for virulence, namely, BvgAS, based on their expression in the presence/absence of the phosphorylated response regulator BvgA, confirmed several by Northern analyses, and demonstrated that 3 bind directly to B. pertussis Hfq, the RNA chaperone involved in mediating sRNA effects. Our study demonstrates the utility of combining RNA-seq, ANNOgesic, and molecular techniques to identify various BvgAS-dependent and Hfq-binding sRNAs, which may unveil the roles of sRNAs in pertussis pathogenesis.
|
23
|
Hill V, Akarsu H, Barbarroja RS, Cippà VL, Kuhnert P, Heller M, Falquet L, Heller M, Stoffel MH, Labroussaa F, Jores J. Minimalistic mycoplasmas harbor different functional toxin-antitoxin systems. PLoS Genet 2021; 17:e1009365. [PMID: 34673769 PMCID: PMC8562856 DOI: 10.1371/journal.pgen.1009365] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 01/15/2021] [Revised: 11/02/2021] [Accepted: 09/29/2021] [Indexed: 11/19/2022] Open
Abstract
Mycoplasmas are minute bacteria controlled by very small genomes ranging from 0.6 to 1.4 Mbp. They encompass several important medical and veterinary pathogens that are often associated with a wide range of chronic diseases. The long persistence of mycoplasma cells in their hosts can exacerbate the spread of antimicrobial resistance observed for many species. However, the nature of the virulence factors driving this phenomenon in mycoplasmas is still unclear. Toxin-antitoxin systems (TA systems) are genetic elements widespread in many bacteria that were historically associated with bacterial persistence. Their presence on mycoplasma genomes has never been carefully assessed, especially for pathogenic species. Here we investigated three candidate TA systems in M. mycoides subsp. capri encoding (i) a novel AAA-ATPase/subtilisin-like serine protease module, (ii) a putative AbiEii/AbiEi pair and (iii) a putative Fic/RelB pair. We analyzed the sequences of fourteen genomes of M. mycoides subsp. capri and confirmed the presence of at least one TA module in each of them. Interestingly, horizontal gene transfer signatures were also found in several genomic loci containing TA systems for several mycoplasma species. Transcriptomic and proteomic data confirmed differential expression profiles of these TA systems during mycoplasma growth in vitro. While the use of heterologous expression systems based on E. coli and B. subtilis showed clear limitations, the functionality and neutralization capacities of all three candidate TA systems were successfully confirmed using M. capricolum subsp. capricolum as a host. Additionally, M. capricolum subsp. capricolum was used to confirm the presence of functional TA system homologs in mycoplasmas of the Hominis and Pneumoniae phylogenetic groups. Finally, we showed that several of the M. mycoides subsp. capri toxins tested in this study, and particularly the subtilisin-like serine protease, could be used to establish a kill switch in mycoplasmas for industrial applications.
Affiliation(s)
- Virginia Hill
- Institute of Veterinary Bacteriology, University of Bern, Bern, Switzerland
- Graduate School for Biomedical Science, University of Bern, Bern, Switzerland
| | - Hatice Akarsu
- Institute of Veterinary Bacteriology, University of Bern, Bern, Switzerland
| | | | - Valentina L. Cippà
- Institute of Veterinary Bacteriology, University of Bern, Bern, Switzerland
| | - Peter Kuhnert
- Institute of Veterinary Bacteriology, University of Bern, Bern, Switzerland
| | - Martin Heller
- Friedrich-Loeffler-Institute—Federal Research Institute for Animal Health, Jena, Germany
| | - Laurent Falquet
- Biochemistry Unit, University of Fribourg and Swiss Institute of Bioinformatics, Fribourg, Switzerland
| | - Manfred Heller
- Proteomics and Mass Spectrometry Core Facility, Department for BioMedical Research (DBMR), University of Bern, Bern, Switzerland
| | - Michael H. Stoffel
- Division of Veterinary Anatomy, Department of Clinical Research and Veterinary Public Health, University of Bern, Bern, Switzerland
| | - Fabien Labroussaa
- Institute of Veterinary Bacteriology, University of Bern, Bern, Switzerland
| | - Joerg Jores
- Institute of Veterinary Bacteriology, University of Bern, Bern, Switzerland
|
24
|
Burning the Candle at Both Ends: Have Exoribonucleases Driven Divergence of Regulatory RNA Mechanisms in Bacteria? mBio 2021; 12:e0104121. [PMID: 34372700 PMCID: PMC8406224 DOI: 10.1128/mbio.01041-21] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Indexed: 01/03/2023] Open
Abstract
Regulatory RNAs have emerged as ubiquitous gene regulators in all bacterial species studied to date. The combination of sequence-specific RNA interactions and malleable RNA structure has allowed regulatory RNA to adopt different mechanisms of gene regulation in a diversity of genetic backgrounds. In the model Gammaproteobacteria Escherichia coli and Salmonella, the regulatory RNA chaperone Hfq appears to play a global role in gene regulation, directly controlling ∼20 to 25% of the entire transcriptome. While the model Firmicutes Bacillus subtilis and Staphylococcus aureus encode a Hfq homologue, its role has been significantly depreciated. These bacteria also have marked differences in RNA turnover. E. coli and Salmonella degrade RNA through internal endonucleolytic and 3′→5′ exonucleolytic cleavage that appears to allow transient accumulation of mRNA 3′ UTR cleavage fragments that contain stabilizing 3′ structures. In contrast, B. subtilis and S. aureus are able to exonucleolytically attack internally cleaved RNA from both the 5′ and 3′ ends, efficiently degrading mRNA 3′ UTR fragments. Here, we propose that the lack of 5′→3′ exoribonuclease activity in Gammaproteobacteria has allowed the accumulation of mRNA 3′ UTR ends as the “default” setting. This in turn may have provided a larger pool of unconstrained RNA sequences that has fueled the expansion of Hfq function and small RNA (sRNA) regulation in E. coli and Salmonella. Conversely, the exoribonuclease RNase J may be a significant barrier to the evolution of 3′ UTR sRNAs in B. subtilis and S. aureus that has limited the pool of RNA ligands available to Hfq and other sRNA chaperones, depreciating their function in these model Firmicutes.
|
25
|
Ponath F, Tawk C, Zhu Y, Barquist L, Faber F, Vogel J. RNA landscape of the emerging cancer-associated microbe Fusobacterium nucleatum. Nat Microbiol 2021; 6:1007-1020. [PMID: 34239075 DOI: 10.1038/s41564-021-00927-7] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Received: 11/28/2020] [Accepted: 05/24/2021] [Indexed: 12/14/2022]
Abstract
Fusobacterium nucleatum, long known as a constituent of the oral microflora, has recently garnered renewed attention for its association with several different human cancers. The growing interest in this emerging cancer-associated bacterium contrasts with a paucity of knowledge about its basic gene expression features and physiological responses. As fusobacteria lack all established small RNA-associated proteins, post-transcriptional networks in these bacteria are also unknown. In the present study, using differential RNA-sequencing, we generate high-resolution global RNA maps for five clinically relevant fusobacterial strains (F. nucleatum subspecies nucleatum, animalis, polymorphum and vincentii, as well as F. periodonticum) for early, mid-exponential growth and early stationary phase. These data are made available in an online browser, and we use these to uncover fundamental aspects of fusobacterial gene expression architecture and a suite of non-coding RNAs. Developing a vector for functional analysis of fusobacterial genes, we discover a conserved fusobacterial oxygen-induced small RNA, FoxI, which serves as a post-transcriptional repressor of the major outer membrane porin FomA. Our findings provide a crucial step towards delineating the regulatory networks enabling F. nucleatum adaptation to different environments, which may elucidate how these bacteria colonize different compartments of the human body.
Affiliation(s)
- Falk Ponath
- Helmholtz Institute for RNA-based Infection Research, Helmholtz Centre for Infection Research, Würzburg, Germany
| | - Caroline Tawk
- Institute for Molecular Infection Biology, University of Würzburg, Würzburg, Germany
| | - Yan Zhu
- Institute for Molecular Infection Biology, University of Würzburg, Würzburg, Germany
| | - Lars Barquist
- Helmholtz Institute for RNA-based Infection Research, Helmholtz Centre for Infection Research, Würzburg, Germany; Faculty of Medicine, University of Würzburg, Würzburg, Germany
| | - Franziska Faber
- Institute for Molecular Infection Biology, University of Würzburg, Würzburg, Germany
| | - Jörg Vogel
- Helmholtz Institute for RNA-based Infection Research, Helmholtz Centre for Infection Research, Würzburg, Germany; Institute for Molecular Infection Biology, University of Würzburg, Würzburg, Germany; Faculty of Medicine, University of Würzburg, Würzburg, Germany
|
26
|
Ibrahim AGAER, Vêncio RZN, Lorenzetti APR, Koide T. Halobacterium salinarum and Haloferax volcanii Comparative Transcriptomics Reveals Conserved Transcriptional Processing Sites. Genes (Basel) 2021; 12:1018. [PMID: 34209065 PMCID: PMC8303175 DOI: 10.3390/genes12071018] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 03/30/2021] [Revised: 05/25/2021] [Accepted: 05/27/2021] [Indexed: 01/15/2023] Open
Abstract
Post-transcriptional processing of messenger RNA is an important regulatory strategy that allows relatively fast responses to changes in environmental conditions. In halophile systems biology, the protein perspective of this problem (i.e., ribonucleases which implement the cleavages) is generally more studied than the RNA perspective (i.e., processing sites). In the present in silico work, we mapped genome-wide transcriptional processing sites (TPS) in two halophilic model organisms, Halobacterium salinarum NRC-1 and Haloferax volcanii DS2. TPS were established by reanalysis of publicly available differential RNA-seq (dRNA-seq) data, searching for non-primary (monophosphorylated RNAs) enrichment. We found 2093 TPS in 43% of H. salinarum genes and 3515 TPS in 49% of H. volcanii chromosomal genes. Of the 244 conserved TPS sites found, the majority were located around start and stop codons of orthologous genes. Specific genes are highlighted when discussing antisense, ribosome and insertion sequence associated TPS. Examples include the cell division gene ftsZ2, whose differential processing signal along growth was detected and correlated with post-transcriptional regulation, and biogenesis of sense overlapping transcripts associated with IS200/IS605. We hereby present the comparative, transcriptomics-based processing site maps with a companion browsing interface.
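The abstract above describes calling processing sites by looking for enrichment of non-primary (monophosphorylated) 5′ ends in dRNA-seq data. A minimal sketch of that idea follows, assuming per-position 5′-end read counts from an untreated library and a TEX-treated library that are already on a comparable scale. The function name, thresholds, and pseudocount are hypothetical illustrations, not the authors' procedure.

```python
from typing import List, Sequence


def call_processing_sites(
    untreated: Sequence[int],
    tex_treated: Sequence[int],
    min_reads: int = 10,      # hypothetical minimum 5'-end count in the untreated library
    min_fold: float = 2.0,    # hypothetical enrichment threshold (untreated over treated)
    pseudocount: float = 1.0, # avoids division by zero at uncovered positions
) -> List[int]:
    """Return positions whose 5'-end signal is enriched in the untreated library.

    Processed (monophosphorylated) 5' ends are depleted by TEX treatment,
    so they appear enriched in the untreated library relative to the treated one.
    """
    if len(untreated) != len(tex_treated):
        raise ValueError("both count vectors must cover the same positions")

    sites = []
    for pos, (u, t) in enumerate(zip(untreated, tex_treated)):
        if u < min_reads:
            continue  # too little signal to call anything
        fold = (u + pseudocount) / (t + pseudocount)
        if fold >= min_fold:
            sites.append(pos)
    return sites


if __name__ == "__main__":
    untreated = [0, 50, 12, 5, 30]
    tex_treated = [0, 5, 40, 0, 14]
    print(call_processing_sites(untreated, tex_treated))
```

The sketch ignores library-size normalization and replicate handling, both of which a real analysis would need before comparing counts.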
Affiliation(s)
- Amr Galal Abd El-Raheem Ibrahim
- Department of Computation and Mathematics, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto 14040-900, Brazil
| | - Ricardo Z. N. Vêncio
- Department of Computation and Mathematics, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto 14040-900, Brazil
| | - Alan P. R. Lorenzetti
- Department of Biochemistry and Immunology, Ribeirão Preto Medical School, Universidade de São Paulo, Ribeirão Preto 14040-900, Brazil
| | - Tie Koide
- Department of Biochemistry and Immunology, Ribeirão Preto Medical School, Universidade de São Paulo, Ribeirão Preto 14040-900, Brazil
- Correspondence: Tel.: +55-16-3315-3107
|
27
|
Chen L, Wang C, Sun H, Wang J, Liang Y, Wang Y, Wong G. The bioinformatics toolbox for circRNA discovery and analysis. Brief Bioinform 2021; 22:1706-1728. [PMID: 32103237 PMCID: PMC7986655 DOI: 10.1093/bib/bbaa001] [Citation(s) in RCA: 193] [Impact Index Per Article: 64.3] [Received: 09/25/2019] [Revised: 12/16/2019] [Accepted: 01/02/2020] [Indexed: 12/21/2022] Open
Abstract
Circular RNAs (circRNAs) are a unique class of RNA molecule identified more than 40 years ago which are produced by a covalent linkage via back-splicing of linear RNA. Recent advances in sequencing technologies and bioinformatics tools have led directly to an ever-expanding field of types and biological functions of circRNAs. In parallel with technological developments, practical applications of circRNAs have arisen including their utilization as biomarkers of human disease. Currently, circRNA-associated bioinformatics tools can support projects including circRNA annotation, circRNA identification and network analysis of competing endogenous RNA (ceRNA). In this review, we collected about 100 circRNA-associated bioinformatics tools and summarized their current attributes and capabilities. We also performed network analysis and text mining on circRNA tool publications in order to reveal trends in their ongoing development.
Affiliation(s)
- Liang Chen
- Department of Computer Science, Key Laboratory of Intelligent Manufacturing Technology of Ministry of Education, Shantou University
| | | | - Huiyan Sun
- School of Artificial Intelligence, Jilin University
| | - Juexin Wang
- Department of Electrical Engineering and Computer Science and Bond Life Science Center, University of Missouri
| | - Yanchun Liang
- College of Computer Science and Technology, Jilin University
| | - Yan Wang
- College of Computer Science and Technology, Jilin University
| | - Garry Wong
- Faculty of Health Sciences, University of Macau
|
28
|
Geissler AS, Anthon C, Alkan F, González-Tortuero E, Poulsen LD, Kallehauge TB, Breüner A, Seemann SE, Vinther J, Gorodkin J. BSGatlas: a unified Bacillus subtilis genome and transcriptome annotation atlas with enhanced information access. Microb Genom 2021; 7:000524. [PMID: 33539279 PMCID: PMC8208703 DOI: 10.1099/mgen.0.000524] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 07/29/2020] [Accepted: 01/11/2021] [Indexed: 12/26/2022] Open
Abstract
A large part of our current understanding of gene regulation in Gram-positive bacteria is based on Bacillus subtilis, as it is one of the most well studied bacterial model systems. The rapid growth in data concerning its molecular and genomic biology is distributed across multiple annotation resources. Consequently, the interpretation of data from further B. subtilis experiments becomes increasingly challenging in both low- and large-scale analyses. Additionally, B. subtilis annotation of structured RNA and non-coding RNA (ncRNA), as well as the operon structure, is still lagging behind the annotation of the coding sequences. To address these challenges, we created the B. subtilis genome atlas, BSGatlas, which integrates and unifies multiple existing annotation resources. Compared to any of the individual resources, the BSGatlas contains twice as many ncRNAs, while improving the positional annotation for 70 % of the ncRNAs. Furthermore, we combined known transcription start and termination sites with lists of known co-transcribed gene sets to create a comprehensive transcript map. The combination with transcription start/termination site annotations resulted in 717 new sets of co-transcribed genes and 5335 untranslated regions (UTRs). In comparison to existing resources, the number of 5' and 3' UTRs increased nearly fivefold, and the number of internal UTRs doubled. The transcript map is organized in 2266 operons, which provides transcriptional annotation for 92 % of all genes in the genome compared to the at most 82 % by previous resources. We predicted an off-target-aware genome-wide library of CRISPR-Cas9 guide RNAs, which we also linked to polycistronic operons. We provide the BSGatlas in multiple forms: as a website (https://rth.dk/resources/bsgatlas/), an annotation hub for display in the UCSC genome browser, supplementary tables and standardized GFF3 format, which can be used in large scale -omics studies. 
By complementing existing resources, the BSGatlas supports analyses of the B. subtilis genome and its molecular biology with respect to not only non-coding genes but also genome-wide transcriptional relationships of all genes.
Affiliation(s)
- Adrian Sven Geissler
- Center for Non-coding RNA in Technology and Health, Department of Veterinary and Animal Sciences, University of Copenhagen, 1871 Frederiksberg, Denmark
| | - Christian Anthon
- Center for Non-coding RNA in Technology and Health, Department of Veterinary and Animal Sciences, University of Copenhagen, 1871 Frederiksberg, Denmark
| | - Ferhat Alkan
- Center for Non-coding RNA in Technology and Health, Department of Veterinary and Animal Sciences, University of Copenhagen, 1871 Frederiksberg, Denmark
- Division of Oncogenomics, Netherlands Cancer Institute, 1066 CX Amsterdam, The Netherlands
| | - Enrique González-Tortuero
- Center for Non-coding RNA in Technology and Health, Department of Veterinary and Animal Sciences, University of Copenhagen, 1871 Frederiksberg, Denmark
- Present address: School of Science, Engineering and Environment, University of Salford, Salford, UK
| | - Line Dahl Poulsen
- Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, 1165 Copenhagen, Denmark
| | | | | | - Stefan Ernst Seemann
- Center for Non-coding RNA in Technology and Health, Department of Veterinary and Animal Sciences, University of Copenhagen, 1871 Frederiksberg, Denmark
| | - Jeppe Vinther
- Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, 1165 Copenhagen, Denmark
| | - Jan Gorodkin
- Center for Non-coding RNA in Technology and Health, Department of Veterinary and Animal Sciences, University of Copenhagen, 1871 Frederiksberg, Denmark
|
29
|
Michaux C, Hansen EE, Jenniches L, Gerovac M, Barquist L, Vogel J. Single-Nucleotide RNA Maps for the Two Major Nosocomial Pathogens Enterococcus faecalis and Enterococcus faecium. Front Cell Infect Microbiol 2020; 10:600325. [PMID: 33324581 PMCID: PMC7724050 DOI: 10.3389/fcimb.2020.600325] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Received: 08/29/2020] [Accepted: 10/19/2020] [Indexed: 12/19/2022] Open
Abstract
Enterococcus faecalis and faecium are two major representative clinical strains of the Enterococcus genus and are notorious for being among the top agents responsible for nosocomial infections. Despite their critical implication in worldwide public healthcare, essential resources such as deep transcriptome annotations remain poor, which also limits our understanding of post-transcriptional control and small regulatory RNA (sRNA) functions in these bacteria. Here, using the dRNA-seq technique in combination with ANNOgesic analysis, we successfully mapped and annotated transcription start sites (TSS) of both E. faecalis V583 and E. faecium AUS0004 at single nucleotide resolution. Analyzing bacteria in late exponential phase, we capture ~40% (E. faecalis) and 43% (E. faecium) of the annotated protein-coding genes, determine 5′ and 3′ UTR (untranslated region) length, and detect instances of leaderless mRNAs. The transcriptome maps revealed sRNA candidates in both bacteria, some found in previous studies and some new. Expression of candidate sRNAs is being confirmed under biologically relevant environmental conditions. This comprehensive global TSS mapping atlas provides a valuable resource for RNA biology and gene expression analysis in the Enterococci. It can be accessed online at www.helmholtz-hiri.de/en/datasets/enterococcus through an instance of the genomic viewer JBrowse.
Affiliation(s)
- Charlotte Michaux
- Institute for Molecular Infection Biology, University of Würzburg, Würzburg, Germany
| | - Elisabeth E Hansen
- Institute for Molecular Infection Biology, University of Würzburg, Würzburg, Germany
| | - Laura Jenniches
- Helmholtz Institute for RNA-based Infection Research (HIRI), Helmholtz Center for Infection Research (HZI), Würzburg, Germany
| | - Milan Gerovac
- Institute for Molecular Infection Biology, University of Würzburg, Würzburg, Germany
| | - Lars Barquist
- Helmholtz Institute for RNA-based Infection Research (HIRI), Helmholtz Center for Infection Research (HZI), Würzburg, Germany; Faculty of Medicine, University of Würzburg, Würzburg, Germany
| | - Jörg Vogel
- Institute for Molecular Infection Biology, University of Würzburg, Würzburg, Germany; Helmholtz Institute for RNA-based Infection Research (HIRI), Helmholtz Center for Infection Research (HZI), Würzburg, Germany
|
30
|
Hör J, Di Giorgio S, Gerovac M, Venturini E, Förstner KU, Vogel J. Grad-seq shines light on unrecognized RNA and protein complexes in the model bacterium Escherichia coli. Nucleic Acids Res 2020; 48:9301-9319. [PMID: 32813020 PMCID: PMC7498339 DOI: 10.1093/nar/gkaa676] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Received: 06/27/2020] [Revised: 07/29/2020] [Accepted: 08/14/2020] [Indexed: 12/21/2022] Open
Abstract
Stable protein complexes, including those formed with RNA, are major building blocks of every living cell. Escherichia coli has been the leading bacterial organism with respect to global protein-protein networks. Yet, there has been no global census of RNA/protein complexes in this model species of microbiology. Here, we performed Grad-seq to establish an RNA/protein complexome, reconstructing sedimentation profiles in a glycerol gradient for ∼85% of all E. coli transcripts and ∼49% of the proteins. These include the majority of small noncoding RNAs (sRNAs) detectable in this bacterium as well as the general sRNA-binding proteins, CsrA, Hfq and ProQ. In presenting use cases for utilization of these RNA and protein maps, we show that a stable association of RyeG with 30S ribosomes gives this seemingly noncoding RNA of prophage origin away as an mRNA of a toxic small protein. Similarly, we show that the broadly conserved uncharacterized protein YggL is a 50S subunit factor in assembled 70S ribosomes. Overall, this study crucially extends our knowledge about the cellular interactome of the primary model bacterium E. coli through providing global RNA/protein complexome information and should facilitate functional discovery in this and related species.
Affiliation(s)
- Jens Hör
- Institute of Molecular Infection Biology, University of Würzburg, D-97080 Würzburg, Germany
| | - Silvia Di Giorgio
- Institute of Molecular Infection Biology, University of Würzburg, D-97080 Würzburg, Germany; ZB MED - Information Centre for Life Sciences, D-50931 Cologne, Germany
| | - Milan Gerovac
- Institute of Molecular Infection Biology, University of Würzburg, D-97080 Würzburg, Germany
| | - Elisa Venturini
- Institute of Molecular Infection Biology, University of Würzburg, D-97080 Würzburg, Germany
| | - Konrad U Förstner
- ZB MED - Information Centre for Life Sciences, D-50931 Cologne, Germany; TH Köln, Faculty of Information Science and Communication Studies, D-50678 Cologne, Germany
| | - Jörg Vogel
- Institute of Molecular Infection Biology, University of Würzburg, D-97080 Würzburg, Germany; Helmholtz Institute for RNA-based Infection Research (HIRI), Helmholtz Centre for Infection Research (HZI), D-97080 Würzburg, Germany
|
31
|
Wicke L, Ponath F, Coppens L, Gerovac M, Lavigne R, Vogel J. Introducing differential RNA-seq mapping to track the early infection phase for Pseudomonas phage ɸKZ. RNA Biol 2020; 18:1099-1110. [PMID: 33103565 PMCID: PMC8244752 DOI: 10.1080/15476286.2020.1827785] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Indexed: 01/06/2023] Open
Abstract
As part of the ongoing renaissance of phage biology, more phage genomes are becoming available through DNA sequencing. However, our understanding of the transcriptome architecture that allows these genomes to be expressed during host infection is generally poor. Transcription start sites (TSSs) and operons have been mapped for very few phages, and an annotated global RNA map of a phage – alone or together with its infected host – is not available at all. Here, we applied differential RNA-seq (dRNA-seq) to study the early, host takeover phase of infection by assessing the transcriptome structure of Pseudomonas aeruginosa jumbo phage ɸKZ, a model phage for viral genetics and structural research. This map substantially expands the number of early expressed viral genes, defining TSSs that are active ten minutes after ɸKZ infection. Simultaneously, we record gene expression changes in the host transcriptome during this critical metabolism conversion. In addition to previously reported upregulation of genes associated with amino acid metabolism, we observe strong activation of genes with functions in biofilm formation (cdrAB) and iron storage (bfrB), as well as an activation of the antitoxin ParD. Conversely, ɸKZ infection rapidly down-regulates complexes IV and V of oxidative phosphorylation (atpCDGHF and cyoABCDE). Taken together, our data provide new insights into the transcriptional organization and infection process of the giant bacteriophage ɸKZ and add a framework for the genome-wide transcriptomic analysis of phage–host interactions.
Affiliation(s)
- Laura Wicke
- Institute for Molecular Infection Biology (IMIB), Medical Faculty, University of Würzburg, Würzburg, Germany; Department of Biosystems, Laboratory of Gene Technology, KU Leuven, Leuven, Belgium
| | - Falk Ponath
- Helmholtz Institute for RNA-based Infection Research (HIRI), Helmholtz Centre for Infection Research (HZI), Würzburg, Germany
| | - Lucas Coppens
- Department of Biosystems, Laboratory of Gene Technology, KU Leuven, Leuven, Belgium
| | - Milan Gerovac
- Institute for Molecular Infection Biology (IMIB), Medical Faculty, University of Würzburg, Würzburg, Germany
| | - Rob Lavigne
- Department of Biosystems, Laboratory of Gene Technology, KU Leuven, Leuven, Belgium
| | - Jörg Vogel
- Institute for Molecular Infection Biology (IMIB), Medical Faculty, University of Würzburg, Würzburg, Germany; Helmholtz Institute for RNA-based Infection Research (HIRI), Helmholtz Centre for Infection Research (HZI), Würzburg, Germany
|
32
|
A high-resolution transcriptome map identifies small RNA regulation of metabolism in the gut microbe Bacteroides thetaiotaomicron. Nat Commun 2020; 11:3557. [PMID: 32678091 PMCID: PMC7366714 DOI: 10.1038/s41467-020-17348-5] [Citation(s) in RCA: 35] [Impact Index Per Article: 8.8] [Received: 02/24/2020] [Accepted: 06/23/2020] [Indexed: 12/15/2022] Open
Abstract
Bacteria of the genus Bacteroides are common members of the human intestinal microbiota and important degraders of polysaccharides in the gut. Among them, the species Bacteroides thetaiotaomicron has emerged as the model organism for functional microbiota research. Here, we use differential RNA sequencing (dRNA-seq) to generate a single-nucleotide resolution transcriptome map of B. thetaiotaomicron grown under defined laboratory conditions. An online browser, called ‘Theta-Base’ (www.helmholtz-hiri.de/en/datasets/bacteroides), is launched to interrogate the obtained gene expression data and annotations of ~4500 transcription start sites, untranslated regions, operon structures, and 269 noncoding RNA elements. Among the latter is GibS, a conserved, 145 nt-long small RNA that is highly expressed in the presence of N-acetyl-D-glucosamine as sole carbon source. We use computational predictions and experimental data to determine the secondary structure of GibS and identify its target genes. Our results indicate that sensing of N-acetyl-D-glucosamine induces GibS expression, which in turn modifies the transcript levels of metabolic enzymes. Bacteroides thetaiotaomicron is a human gut microbe and an emergent model organism. Here, Ryan et al. generate single-nucleotide resolution RNA-seq data for this bacterium and map transcription start sites and noncoding RNAs, one of which modulates expression of metabolic enzymes.
|
33
|
Dual RNA-seq of Orientia tsutsugamushi informs on host-pathogen interactions for this neglected intracellular human pathogen. Nat Commun 2020; 11:3363. [PMID: 32620750 PMCID: PMC7335160 DOI: 10.1038/s41467-020-17094-8] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Received: 11/01/2019] [Accepted: 06/11/2020] [Indexed: 12/12/2022] Open
Abstract
Studying emerging or neglected pathogens is often challenging due to insufficient information and absence of genetic tools. Dual RNA-seq provides insights into host-pathogen interactions, and is particularly informative for intracellular organisms. Here we apply dual RNA-seq to Orientia tsutsugamushi (Ot), an obligate intracellular bacterium that causes the vector-borne human disease scrub typhus. Half the Ot genome is composed of repetitive DNA, and there is minimal collinearity in gene order between strains. Integrating RNA-seq, comparative genomics, proteomics, and machine learning to study the transcriptional architecture of Ot, we find evidence for wide-spread post-transcriptional antisense regulation. Comparing the host response to two clinical isolates, we identify distinct immune response networks for each strain, leading to predictions of relative virulence that are validated in a mouse infection model. Thus, dual RNA-seq can provide insight into the biology and host-pathogen interactions of a poorly characterized and genetically intractable organism such as Ot.
|
34
|
Adams PP, Storz G. Prevalence of small base-pairing RNAs derived from diverse genomic loci. Biochim Biophys Acta Gene Regul Mech 2020; 1863:194524. [PMID: 32147527 DOI: 10.1016/j.bbagrm.2020.194524] [Citation(s) in RCA: 45] [Impact Index Per Article: 11.3] [Received: 11/27/2019] [Revised: 03/03/2020] [Accepted: 03/03/2020] [Indexed: 12/21/2022]
Abstract
Small RNAs (sRNAs) that act by base-pairing have been shown to play important roles in fine-tuning the levels and translation of their target transcripts across a variety of model and pathogenic organisms. Work from many different groups in a wide range of bacterial species has provided evidence for the importance and complexity of sRNA regulatory networks, which allow bacteria to quickly respond to changes in their environment. However, despite the expansive literature, much remains to be learned about all aspects of sRNA-mediated regulation, particularly in bacteria beyond the well-characterized Escherichia coli and Salmonella enterica species. Here we discuss what is known, and what remains to be learned, about the identification of regulatory base-pairing RNAs produced from diverse genomic loci including how their expression is regulated. This article is part of a Special Issue entitled: RNA and gene control in bacteria edited by Dr. M. Guillier and F. Repoila.
Affiliation(s)
- Philip P Adams
- Division of Molecular and Cellular Biology, Eunice Kennedy Shriver National Institute of Child Health and Human Development, Bethesda, MD 20892-5430, USA; Postdoctoral Research Associate Program, National Institute of General Medical Sciences, National Institutes of Health, Bethesda, MD 20892-6200, USA.
| | - Gisela Storz
- Division of Molecular and Cellular Biology, Eunice Kennedy Shriver National Institute of Child Health and Human Development, Bethesda, MD 20892-5430, USA
|
35
|
Ozuna A, Liberto D, Joyce RM, Arnvig KB, Nobeli I. baerhunter: an R package for the discovery and analysis of expressed non-coding regions in bacterial RNA-seq data. Bioinformatics 2020; 36:966-969. [PMID: 31418770 DOI: 10.1093/bioinformatics/btz643] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 04/17/2019] [Revised: 07/29/2019] [Accepted: 08/13/2019] [Indexed: 12/12/2022] Open
Abstract
SUMMARY: Standard bioinformatics pipelines for the analysis of bacterial transcriptomic data commonly ignore non-coding but functional elements, e.g. small RNAs, long antisense RNAs or untranslated regions (UTRs) of mRNA transcripts. The root of this problem is the use of incomplete genome annotation files. Here, we present baerhunter, a coverage-based method implemented in R, that automates the discovery of expressed non-coding RNAs and UTRs from RNA-seq reads mapped to a reference genome. The core algorithm is part of a pipeline that facilitates downstream analysis of both coding and non-coding features. The method is simple, easy to extend and customize and, in limited tests with simulated and real data, compares favourably against the currently most popular alternative. AVAILABILITY AND IMPLEMENTATION: The baerhunter R package is available from: https://github.com/irilenia/baerhunter. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
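To illustrate what a coverage-based search for expressed unannotated regions can look like, here is a minimal Python sketch. It is not baerhunter's algorithm or API (baerhunter is an R package). The function name, the half-open coordinate convention, and the depth and length thresholds are all assumptions made for this example.

```python
from typing import List, Sequence, Tuple


def expressed_unannotated_regions(
    coverage: Sequence[int],
    annotated: Sequence[Tuple[int, int]],
    min_depth: int = 10,   # hypothetical per-base depth cut-off
    min_length: int = 40,  # hypothetical minimum region length in bases
) -> List[Tuple[int, int]]:
    """Return half-open [start, end) intervals that are covered but not annotated.

    A position is kept when its depth is at least min_depth and it lies
    outside every annotated feature; consecutive kept positions are merged
    and runs shorter than min_length are discarded.
    """
    n = len(coverage)
    masked = [False] * n
    for start, end in annotated:
        for i in range(max(start, 0), min(end, n)):
            masked[i] = True  # position belongs to a known feature

    regions: List[Tuple[int, int]] = []
    run_start = None
    for i, depth in enumerate(coverage):
        keep = depth >= min_depth and not masked[i]
        if keep and run_start is None:
            run_start = i  # open a new candidate region
        elif not keep and run_start is not None:
            if i - run_start >= min_length:
                regions.append((run_start, i))
            run_start = None
    # Close a region that runs to the end of the sequence.
    if run_start is not None and n - run_start >= min_length:
        regions.append((run_start, n))
    return regions


if __name__ == "__main__":
    coverage = [0] * 10 + [20] * 50 + [0] * 10 + [30] * 20 + [0] * 10
    print(expressed_unannotated_regions(coverage, annotated=[(0, 15)]))
```

A strand-aware version would run the same scan separately on forward and reverse coverage, which is what allows antisense transcripts to be distinguished from UTRs.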
Affiliation(s)
- A Ozuna
- Department of Biological Sciences, Institute of Structural and Molecular Biology, London, WC1E 7HX, UK
| | - D Liberto
- Department of Biological Sciences, Institute of Structural and Molecular Biology, London, WC1E 7HX, UK
| | - R M Joyce
- Department of Biological Sciences, Institute of Structural and Molecular Biology, London, WC1E 7HX, UK
| | - K B Arnvig
- Institute of Structural and Molecular Biology, Division of Biosciences, University College London, London, WC1E 6BT, UK
| | - I Nobeli
- Department of Biological Sciences, Institute of Structural and Molecular Biology, London, WC1E 7HX, UK
| |
|
36
|
Conditional Hfq Association with Small Noncoding RNAs in Pseudomonas aeruginosa Revealed through Comparative UV Cross-Linking Immunoprecipitation Followed by High-Throughput Sequencing. mSystems 2019; 4:4/6/e00590-19. [PMID: 31796567 PMCID: PMC6890931 DOI: 10.1128/msystems.00590-19] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Indexed: 01/19/2023]
Abstract
Bacterial small noncoding RNAs (sRNAs) play posttranscriptional regulatory roles in cellular responses to changing environmental cues and in adaptation to harsh conditions. Generally, the RNA-binding protein Hfq helps sRNAs associate with target mRNAs to modulate their translation and to modify global RNA pools depending on physiological state. Here, a combination of in vivo UV cross-linking immunoprecipitation followed by high-throughput sequencing (CLIP-seq) and total RNA-seq showed that Hfq interacts with different regions of the Pseudomonas aeruginosa transcriptome under planktonic versus biofilm conditions. In the present approach, P. aeruginosa Hfq preferentially interacted with repeats of the AAN triplet motif at mRNA 5′ untranslated regions (UTRs) and sRNAs and U-rich sequences at rho-independent terminators.
Further transcriptome analysis suggested that the association of sRNAs with Hfq is primarily a function of their expression levels, strongly supporting the notion that the pool of Hfq-associated RNAs is equilibrated by RNA concentration-driven cycling on and off Hfq. Overall, our combinatorial CLIP-seq and total RNA-seq approach highlights conditional sRNA associations with Hfq as a novel aspect of posttranscriptional regulation in P. aeruginosa. IMPORTANCE The Gram-negative bacterium P. aeruginosa is ubiquitously distributed in diverse environments and can cause severe biofilm-related infections in at-risk individuals. Although the presence of a large number of putative sRNAs and widely conserved RNA chaperones in this bacterium implies the importance of posttranscriptional regulatory networks for environmental fluctuations, limited information is available regarding the global role of RNA chaperones such as Hfq in the P. aeruginosa transcriptome, especially under different environmental conditions. Here, we characterize Hfq-dependent differences in gene expression and biological processes in two physiological states: the planktonic and biofilm forms. A combinatorial comparative CLIP-seq and total RNA-seq approach uncovered condition-dependent association of RNAs with Hfq in vivo and expands the potential direct regulatory targets of Hfq in the P. aeruginosa transcriptome.
|
37
|
Leonard S, Meyer S, Lacour S, Nasser W, Hommais F, Reverchon S. APERO: a genome-wide approach for identifying bacterial small RNAs from RNA-Seq data. Nucleic Acids Res 2019; 47:e88. [PMID: 31147705 PMCID: PMC6735904 DOI: 10.1093/nar/gkz485] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Received: 09/18/2018] [Revised: 05/06/2019] [Accepted: 05/20/2019] [Indexed: 12/02/2022]
Abstract
Small non-coding RNAs (sRNAs) regulate numerous cellular processes in all domains of life. Several approaches have been developed to identify them from RNA-seq data, which are efficient for eukaryotic sRNAs but remain inaccurate for the longer and highly structured bacterial sRNAs. We present APERO, a new algorithm to detect small transcripts from paired-end bacterial RNA-seq data. In contrast to previous approaches that start from the read coverage distribution, APERO analyzes boundaries of individual sequenced fragments to infer the 5′ and 3′ ends of all transcripts. Since sRNAs are about the same size as individual fragments (50–350 nucleotides), this algorithm provides a significantly higher accuracy and robustness, e.g., with respect to spontaneous internal breaking sites. To demonstrate this improvement, we develop a comparative assessment on datasets from Escherichia coli and Salmonella enterica, based on experimentally validated sRNAs. We also identify the small transcript repertoire of Dickeya dadantii, including putative intergenic RNAs, 5′ UTR- or 3′ UTR-derived RNA products and antisense RNAs. Comparisons to annotations as well as RACE-PCR experimental data confirm the precision of the detected transcripts. Altogether, APERO outperforms all existing methods in terms of sRNA detection and boundary precision, which is crucial for comprehensive genome annotations. It is freely available as an open source R package at https://github.com/Simon-Leonard/APERO.
Affiliation(s)
- Simon Leonard
- Université de Lyon, INSA-Lyon, Université Claude Bernard Lyon1, CNRS UMR5240, Laboratoire de Microbiologie, Adaptation, Pathogénie, 11 avenue Jean Capelle, F-69621 Villeurbanne, France
| | - Sam Meyer
- Université de Lyon, INSA-Lyon, Université Claude Bernard Lyon1, CNRS UMR5240, Laboratoire de Microbiologie, Adaptation, Pathogénie, 11 avenue Jean Capelle, F-69621 Villeurbanne, France
| | - Stephan Lacour
- Univ. Grenoble Alpes, CNRS, Inria, LiPhy (UMR5588), 38000 Grenoble, France
| | - William Nasser
- Université de Lyon, INSA-Lyon, Université Claude Bernard Lyon1, CNRS UMR5240, Laboratoire de Microbiologie, Adaptation, Pathogénie, 11 avenue Jean Capelle, F-69621 Villeurbanne, France
| | - Florence Hommais
- Université de Lyon, INSA-Lyon, Université Claude Bernard Lyon1, CNRS UMR5240, Laboratoire de Microbiologie, Adaptation, Pathogénie, 11 avenue Jean Capelle, F-69621 Villeurbanne, France
| | - Sylvie Reverchon
- Université de Lyon, INSA-Lyon, Université Claude Bernard Lyon1, CNRS UMR5240, Laboratoire de Microbiologie, Adaptation, Pathogénie, 11 avenue Jean Capelle, F-69621 Villeurbanne, France
| |
|
38
|
Grünberger F, Reichelt R, Bunk B, Spröer C, Overmann J, Rachel R, Grohmann D, Hausner W. Next Generation DNA-Seq and Differential RNA-Seq Allow Re-annotation of the Pyrococcus furiosus DSM 3638 Genome and Provide Insights Into Archaeal Antisense Transcription. Front Microbiol 2019; 10:1603. [PMID: 31354685 PMCID: PMC6640164 DOI: 10.3389/fmicb.2019.01603] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Received: 03/22/2019] [Accepted: 06/26/2019] [Indexed: 01/07/2023]
Abstract
Pyrococcus furiosus DSM 3638 is a model organism for hyperthermophilic archaea with an optimal growth temperature near 100°C. The genome was sequenced about 18 years ago. However, some publications suggest that in contrast to other Pyrococcus species, the genome of P. furiosus DSM 3638 is prone to genomic rearrangements. Therefore, we re-sequenced the genome using third generation sequencing techniques. The new de novo assembled genome is 1,889,914 bp in size and exhibits high sequence identity to the published sequence. However, two major deviations were detected: (1) The genome is 18,342 bp smaller than the NCBI reference genome due to a recently described deletion. (2) The region between PF0349 and PF0388 is inverted, most likely due to an assembly problem for the original sequence. In addition, numerous minor variations, including single nucleotide exchanges, deletions and insertions, were identified. The total number of insertion sequence (IS) elements is also reduced from 30 to 24 in the new sequence. Re-sequencing of a 2-year-old “lab culture” using Nanopore sequencing confirmed the overall stability of the P. furiosus DSM 3638 genome even under normal lab conditions without taking any special care. To improve genome annotation, the updated DNA sequence was combined with an RNA sequencing approach. Here, RNAs from eight different growth conditions were pooled to increase the number of detected transcripts. Furthermore, a differential RNA-Seq approach was employed for the identification of transcription start sites (TSSs). In total, 2515 TSSs were detected and classified into 834 primary (pTSS), 797 antisense (aTSS), 739 internal and 145 secondary TSSs. Our analysis of the upstream regions revealed a well conserved archaeal promoter structure. Interrogation of the distances between pTSSs and aTSSs revealed a significant number of antisense transcripts, which are a result of bidirectional transcription from the same TATA box.
This mechanism of antisense transcript production could be further confirmed by in vitro transcription experiments. We assume that bidirectional transcription gives rise to non-functional antisense RNAs and that this is a widespread phenomenon in archaea due to the architecture of the TATA element and the symmetric structure of the TATA-binding protein.
Affiliation(s)
- Felix Grünberger
- Institute of Microbiology and Archaea Center, University of Regensburg, Regensburg, Germany
| | - Robert Reichelt
- Institute of Microbiology and Archaea Center, University of Regensburg, Regensburg, Germany
| | - Boyke Bunk
- Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany
| | - Cathrin Spröer
- Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany
| | - Jörg Overmann
- Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany.,Institute of Microbiology, Technical University of Braunschweig, Braunschweig, Germany
| | - Reinhard Rachel
- Institute of Microbiology and Archaea Center, University of Regensburg, Regensburg, Germany
| | - Dina Grohmann
- Institute of Microbiology and Archaea Center, University of Regensburg, Regensburg, Germany
| | - Winfried Hausner
- Institute of Microbiology and Archaea Center, University of Regensburg, Regensburg, Germany
| |
|
39
|
Characterization of the transcriptome of Haloferax volcanii, grown under four different conditions, with mixed RNA-Seq. PLoS One 2019; 14:e0215986. [PMID: 31039177 PMCID: PMC6490895 DOI: 10.1371/journal.pone.0215986] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Received: 12/07/2018] [Accepted: 04/11/2019] [Indexed: 12/21/2022]
Abstract
Haloferax volcanii is a well-established model species for haloarchaea. Small scale RNomics and bioinformatics predictions were used to identify small non-coding RNAs (sRNAs), and deletion mutants revealed that sRNAs have important regulatory functions. A recent dRNA-Seq study was used to characterize the primary transcriptome. Unexpectedly, it was revealed that, under optimal conditions, H. volcanii contains more non-coding sRNAs than protein-encoding mRNAs. However, the dRNA-Seq approach did not contain any length information. Therefore, a mixed RNA-Seq approach was used to determine transcript length and to identify additional transcripts, which are not present under optimal conditions. In total, 50 million paired end reads of 150 nt length were obtained. 1861 protein-coding RNAs (cdRNAs) were detected, which encoded 3092 proteins. This nearly doubled the coverage of cdRNAs, compared to the previous dRNA-Seq study. About 2/3 of the cdRNAs were monocistronic, and 1/3 covered more than one gene. In addition, 1635 non-coding sRNAs were identified. The highest fraction of non-coding RNAs were cis antisense RNAs (asRNAs). Analysis of the length distribution revealed that sRNAs have a median length of about 150 nt. Based on the RNA-Seq and dRNA-Seq results, genes were chosen to exemplify characteristics of the H. volcanii transcriptome by Northern blot analyses, e.g. 1) the transcript patterns of gene clusters can be straightforward, but also very complex, 2) many transcripts differ in expression level under the four analyzed conditions, 3) some genes are transcribed into RNA isoforms of different length, which can be differentially regulated, 4) transcripts with very long 5'-UTRs and with very long 3'-UTRs exist, and 5) about 30% of all cdRNAs have overlapping 3'-ends, which indicates, together with the asRNAs, that H. volcanii makes ample use of sense-antisense interactions. 
Taken together, this RNA-Seq study and a previous dRNA-Seq study enabled an unprecedented view of the H. volcanii transcriptome.
|
40
|
Yu SH, Vogel J, Förstner KU. ANNOgesic: a Swiss army knife for the RNA-seq based annotation of bacterial/archaeal genomes. Gigascience 2018; 7:5087959. [PMID: 30169674 PMCID: PMC6123526 DOI: 10.1093/gigascience/giy096] [Citation(s) in RCA: 38] [Impact Index Per Article: 6.3] [Received: 01/27/2018] [Accepted: 08/23/2018] [Indexed: 11/13/2022]
Abstract
To understand the gene regulation of an organism of interest, a comprehensive genome annotation is essential. While some features, such as coding sequences, can be computationally predicted with high accuracy based purely on the genomic sequence, others, such as promoter elements or noncoding RNAs, are harder to detect. RNA sequencing (RNA-seq) has proven to be an efficient method to identify these genomic features and to improve genome annotations. However, processing and integrating RNA-seq data in order to generate high-resolution annotations is challenging, time consuming, and requires numerous steps. We have constructed a powerful and modular tool called ANNOgesic that provides the required analyses and simplifies RNA-seq-based bacterial and archaeal genome annotation. It can integrate data from conventional RNA-seq and differential RNA-seq and predicts and annotates numerous features, including small noncoding RNAs, with high precision. The software is available under an open source license (ISCL) at https://pypi.org/project/ANNOgesic/.
Affiliation(s)
- Sung-Huan Yu
- Institute of Molecular Infection Biology (IMIB), University of Würzburg, Josef-Schneider-Straße 2, 97080 Würzburg, Germany
| | - Jörg Vogel
- Institute of Molecular Infection Biology (IMIB), University of Würzburg, Josef-Schneider-Straße 2, 97080 Würzburg, Germany.,Helmholtz Institute for RNA-based Infection Research (HIRI), Josef-Schneider-Straße 2, 97080 Würzburg, Germany
| | - Konrad U Förstner
- Institute of Molecular Infection Biology (IMIB), University of Würzburg, Josef-Schneider-Straße 2, 97080 Würzburg, Germany.,ZB MED - Information Center for Life Sciences, Informationservices, Gleueler Straße 60, 50931 Cologne (Köln), Germany.,Technical University of Cologne, Faculty for Information and Communication Sciences, Claudiusstraße 1, 50678 Cologne (Köln), Germany
| |
|