1
|
Vill AC, Rice EJ, De Vlaminck I, Danko CG, Brito IL. Precision run-on sequencing (PRO-seq) for microbiome transcriptomics. Nat Microbiol 2024; 9:241-250. [PMID: 38172625 PMCID: PMC11059318 DOI: 10.1038/s41564-023-01558-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Accepted: 11/14/2023] [Indexed: 01/05/2024]
Abstract
Bacteria respond to environmental stimuli through precise regulation of transcription initiation and elongation. Bulk RNA sequencing primarily characterizes mature transcripts, so to identify actively transcribed loci we need to capture RNA polymerase (RNAP) complexed with nascent RNA. However, such capture methods have only previously been applied to culturable, genetically tractable organisms such as E. coli and B. subtilis. Here we apply precision run-on sequencing (PRO-seq) to profile nascent transcription in cultured E. coli and diverse uncultured bacteria. We demonstrate that PRO-seq can characterize the transcription of small, structured, or post-transcriptionally modified RNAs, which are often absent from bulk RNA-seq libraries. Applying PRO-seq to the human microbiome highlights taxon-specific RNAP pause motifs and pause-site distributions across non-coding RNA loci that reflect structure-coincident pausing. We also uncover concurrent transcription and cleavage of CRISPR guide RNAs and transfer RNAs. We demonstrate the utility of PRO-seq for exploring transcriptional dynamics in diverse microbial communities.
Collapse
Affiliation(s)
- Albert C Vill
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA
| | - Edward J Rice
- Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, NY, USA
| | - Iwijn De Vlaminck
- Meinig School of Biomedical Engineering, Cornell University, Ithaca, NY, USA
| | - Charles G Danko
- Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, NY, USA
| | - Ilana L Brito
- Meinig School of Biomedical Engineering, Cornell University, Ithaca, NY, USA.
| |
Collapse
|
2
|
Gomes-Filho JV, Breuer R, Morales-Filloy HG, Pozhydaieva N, Borst A, Paczia N, Soppa J, Höfer K, Jäschke A, Randau L. Identification of NAD-RNA species and ADPR-RNA decapping in Archaea. Nat Commun 2023; 14:7597. [PMID: 37989750 PMCID: PMC10663502 DOI: 10.1038/s41467-023-43377-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Accepted: 11/07/2023] [Indexed: 11/23/2023] Open
Abstract
NAD is a coenzyme central to metabolism that also serves as a 5'-terminal cap for bacterial and eukaryotic transcripts. Thermal degradation of NAD can generate nicotinamide and ADP-ribose (ADPR). Here, we use LC-MS/MS and NAD captureSeq to detect and identify NAD-RNAs in the thermophilic model archaeon Sulfolobus acidocaldarius and in the halophilic mesophile Haloferax volcanii. None of the four Nudix proteins of S. acidocaldarius catalyze NAD-RNA decapping in vitro, but one of the proteins (Saci_NudT5) promotes ADPR-RNA decapping. NAD-RNAs are converted into ADPR-RNAs, which we detect in S. acidocaldarius total RNA. Deletion of the gene encoding the 5'-3' exonuclease Saci-aCPSF2 leads to a 4.5-fold increase in NAD-RNA levels. We propose that the incorporation of NAD into RNA acts as a degradation marker for Saci-aCPSF2. In contrast, ADPR-RNA is processed by Saci_NudT5 into 5'-p-RNAs, providing another layer of regulation for RNA turnover in archaeal cells.
Collapse
Affiliation(s)
| | - Ruth Breuer
- Faculty of Biology, Philipps-Universität Marburg, Marburg, Germany
| | | | | | - Andreas Borst
- Institute for Molecular Biosciences, Biocentre, Goethe-University, Frankfurt am Main, Germany
| | - Nicole Paczia
- Max Planck Institute for Terrestrial Microbiology, Marburg, Germany
| | - Jörg Soppa
- Institute for Molecular Biosciences, Biocentre, Goethe-University, Frankfurt am Main, Germany
| | - Katharina Höfer
- Max Planck Institute for Terrestrial Microbiology, Marburg, Germany
- SYNMIKRO, Center for Synthetic Microbiology, Marburg, Germany
| | - Andres Jäschke
- Institute of Pharmacy and Molecular Biotechnology (IPMB), Heidelberg University, Heidelberg, Germany
| | - Lennart Randau
- Faculty of Biology, Philipps-Universität Marburg, Marburg, Germany.
- SYNMIKRO, Center for Synthetic Microbiology, Marburg, Germany.
| |
Collapse
|
3
|
Waldburger L, Thompson MG, Weisberg AJ, Lee N, Chang JH, Keasling JD, Shih PM. Transcriptome architecture of the three main lineages of agrobacteria. mSystems 2023; 8:e0033323. [PMID: 37477440 PMCID: PMC10469942 DOI: 10.1128/msystems.00333-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Accepted: 06/15/2023] [Indexed: 07/22/2023] Open
Abstract
Agrobacteria are a diverse, polyphyletic group of prokaryotes with multipartite genomes capable of transferring DNA into the genomes of host plants, making them an essential tool in plant biotechnology. Despite their utility in plant transformation, genome-wide transcriptional regulation is not well understood across the three main lineages of agrobacteria. Transcription start sites (TSSs) are a necessary component of gene expression and regulation. In this study, we used differential RNA-seq and a TSS identification algorithm optimized on manually annotated TSS, then validated with existing TSS to identify thousands of TSS with nucleotide resolution for representatives of each lineage. We extend upon the 356 TSSs previously reported in Agrobacterium fabrum C58 by identifying 1,916 TSSs. In addition, we completed genomes and phenotyping of Rhizobium rhizogenes C16/80 and Allorhizobium vitis T60/94, identifying 2,650 and 2,432 TSSs, respectively. Parameter optimization was crucial for an accurate, high-resolution view of genome and transcriptional dynamics, highlighting the importance of algorithm optimization in genome-wide TSS identification and genomics at large. The optimized algorithm reduced the number of TSSs identified internal and antisense to the coding sequence on average by 90.5% and 91.9%, respectively. Comparison of TSS conservation between orthologs of the three lineages revealed differences in cell cycle regulation of ctrA as well as divergence of transcriptional regulation of chemotaxis-related genes when grown in conditions that simulate the plant environment. These results provide a framework to elucidate the mechanistic basis and evolution of pathology across the three main lineages of agrobacteria. IMPORTANCE Transcription start sites (TSSs) are fundamental for understanding gene expression and regulation. Agrobacteria, a group of prokaryotes with the ability to transfer DNA into the genomes of host plants, are widely used in plant biotechnology. However, the genome-wide transcriptional regulation of agrobacteria is not well understood, especially in less-studied lineages. Differential RNA-seq and an optimized algorithm enabled identification of thousands of TSSs with nucleotide resolution for representatives of each lineage. The results of this study provide a framework for elucidating the mechanistic basis and evolution of pathology across the three main lineages of agrobacteria. The optimized algorithm also highlights the importance of parameter optimization in genome-wide TSS identification and genomics at large.
Collapse
Affiliation(s)
- Lucas Waldburger
- Department of Bioengineering, University of California, Berkeley, California, USA
- Joint BioEnergy Institute, Emeryville, California, USA
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Mitchell G. Thompson
- Joint BioEnergy Institute, Emeryville, California, USA
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Alexandra J. Weisberg
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, Oregon, USA
| | - Namil Lee
- Joint BioEnergy Institute, Emeryville, California, USA
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
- Department of Chemical and Biomolecular Engineering, University of California, Berkeley, California, USA
| | - Jeff H. Chang
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, Oregon, USA
| | - Jay D. Keasling
- Joint BioEnergy Institute, Emeryville, California, USA
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
- Department of Chemical and Biomolecular Engineering, University of California, Berkeley, California, USA
- Institute for Quantitative Biosciences, University of California, Berkeley, California, USA
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kongens Lyngby, Denmark
- Center for Synthetic Biochemistry, Institute for Synthetic Biology, Shenzhen Institutes for Advanced Technologies, Shenzhen, China
| | - Patrick M. Shih
- Joint BioEnergy Institute, Emeryville, California, USA
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
- Department of Plant and Microbial Biology, University of California, Berkeley, California, USA
| |
Collapse
|
4
|
Cheah HL, Ahmed SA, Tang TH. Transcription start site mapping and small RNA profiling of Leptospira biflexa serovar Patoc. World J Microbiol Biotechnol 2023; 39:104. [PMID: 36808011 DOI: 10.1007/s11274-023-03540-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Accepted: 02/03/2023] [Indexed: 02/23/2023]
Abstract
Leptospirosis is an emerging zoonotic disease caused by bacterial species of the genus Leptospira. However, the regulatory mechanisms and pathways underlying the adaptation of pathogenic and non-pathogenic Leptospira spp. in different environmental conditions remain elusive. Leptospira biflexa is a non-pathogenic species of Leptospira that lives exclusively in a natural environment. It is an ideal model not only for exploring molecular mechanisms underlying the environmental survival of Leptospira species but also for identifying virulence factors unique to Leptospira's pathogenic species. In this study, we aim to establish the transcription start site (TSS) landscape and the small RNA (sRNA) profile of L. biflexa serovar Patoc grown to exponential and stationary phases via differential RNA-seq (dRNA-seq) and small RNA-seq (sRNA-seq) analyses, respectively. Our dRNA-seq analysis uncovered a total of 2726 TSSs, which are also used to identify other elements, e.g., promoter and untranslated regions (UTRs). Besides, our sRNA-seq analysis revealed a total of 603 sRNA candidates, comprising 16 promoter-associated sRNAs, 184 5'UTR-derived sRNAs, 230 true intergenic sRNAs, 136 5'UTR-antisense sRNAs, and 130 open reading frame (ORF)-antisense sRNAs. In summary, these findings reflect the transcriptional complexity of L. biflexa serovar Patoc under different growth conditions and help to facilitate our understanding of regulatory networks in L. biflexa. To the best of our knowledge, this is the first study reporting the TSS landscape of L. biflexa. The TSS and sRNA landscapes of L. biflexa can also be compared with its pathogenic counterparts, e.g., L. borgpetersenii and L. interrogans, to identify features contributing to their environmental survival and virulence.
Collapse
Affiliation(s)
- Hong-Leong Cheah
- Advanced Medical & Dental Institute (AMDI), Universiti Sains Malaysia, Bertam, 13200, Kepala Batas, Penang, Malaysia
| | - Siti Aminah Ahmed
- Advanced Medical & Dental Institute (AMDI), Universiti Sains Malaysia, Bertam, 13200, Kepala Batas, Penang, Malaysia
| | - Thean-Hock Tang
- Advanced Medical & Dental Institute (AMDI), Universiti Sains Malaysia, Bertam, 13200, Kepala Batas, Penang, Malaysia.
| |
Collapse
|
5
|
Genome-Wide Transcription Start Sites Mapping in Methylorubrum Grown with Dichloromethane and Methanol. Microorganisms 2022; 10:microorganisms10071301. [PMID: 35889020 PMCID: PMC9320726 DOI: 10.3390/microorganisms10071301] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2022] [Revised: 06/17/2022] [Accepted: 06/22/2022] [Indexed: 02/04/2023] Open
Abstract
Dichloromethane (DCM, methylene chloride) is a toxic halogenated volatile organic compound massively used for industrial applications, and consequently often detected in the environment as a major pollutant. DCM biotransformation suggests a sustainable decontamination strategy of polluted sites. Among methylotrophic bacteria able to use DCM as a sole source of carbon and energy for growth, Methylorubrum extorquens DM4 is a longstanding reference strain. Here, the primary 5′-ends of transcripts were obtained using a differential RNA-seq (dRNA-seq) approach to provide the first transcription start site (TSS) genome-wide landscape of a methylotroph using DCM or methanol. In total, 7231 putative TSSs were annotated and classified with respect to their localization to coding sequences (CDSs). TSSs on the opposite strand of CDS (antisense TSS) account for 31% of all identified TSSs. One-third of the detected TSSs were located at a distance to the start codon inferior to 250 nt (average of 84 nt) with 7% of leaderless mRNA. Taken together, the global TSS map for bacterial growth using DCM or methanol will facilitate future studies in which transcriptional regulation is crucial, and efficient DCM removal at polluted sites is limited by regulatory processes.
Collapse
|
6
|
Webb IUC, Xu J, Sánchez-Cañizares C, Karunakaran R, Ramachandran VK, Rutten PJ, East AK, Huang WE, Watmough NJ, Poole PS. Regulation and Characterization of Mutants of fixABCX in Rhizobium leguminosarum. MOLECULAR PLANT-MICROBE INTERACTIONS : MPMI 2021; 34:1167-1180. [PMID: 34110256 DOI: 10.1094/mpmi-02-21-0037-r] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Symbiosis between Rhizobium leguminosarum and Pisum sativum requires tight control of redox balance in order to maintain respiration under the microaerobic conditions required for nitrogenase while still producing the eight electrons and sixteen molecules of ATP needed for nitrogen fixation. FixABCX, a cluster of electron transfer flavoproteins essential for nitrogen fixation, is encoded on the Sym plasmid (pRL10), immediately upstream of nifA, which encodes the general transcriptional regulator of nitrogen fixation. There is a symbiotically regulated NifA-dependent promoter upstream of fixA (PnifA1), as well as an additional basal constitutive promoter driving background expression of nifA (PnifA2). These were confirmed by 5'-end mapping of transcription start sites using differential RNA-seq. Complementation of polar fixAB and fixX mutants (Fix- strains) confirmed expression of nifA from PnifA1 in symbiosis. Electron microscopy combined with single-cell Raman microspectroscopy characterization of fixAB mutants revealed previously unknown heterogeneity in bacteroid morphology within a single nodule. Two morphotypes of mutant fixAB bacteroids were observed. One was larger than wild-type bacteroids and contained high levels of polyhydroxy-3-butyrate, a complex energy/reductant storage product. A second bacteroid phenotype was morphologically and compositionally different and resembled wild-type infection thread cells. From these two characteristic fixAB mutant bacteroid morphotypes, inferences can be drawn on the metabolism of wild-type nitrogen-fixing bacteroids.[Formula: see text] Copyright © 2021 The Author(s). This is an open access article distributed under the CC BY 4.0 International license.
Collapse
Affiliation(s)
- Isabel U C Webb
- Department of Plant Sciences, University of Oxford, South Parks Road, Oxford OX1 3RB, U.K
- Department of Molecular Microbiology, John Innes Centre, Norwich Research Park, Norwich NR4 7UH, U.K
| | - Jiabao Xu
- Department of Engineering, University of Oxford, Parks Road, Oxford OX1 3PJ, U.K
| | | | - Ramakrishnan Karunakaran
- Department of Molecular Microbiology, John Innes Centre, Norwich Research Park, Norwich NR4 7UH, U.K
| | - Vinoy K Ramachandran
- Department of Plant Sciences, University of Oxford, South Parks Road, Oxford OX1 3RB, U.K
| | - Paul J Rutten
- Department of Plant Sciences, University of Oxford, South Parks Road, Oxford OX1 3RB, U.K
| | - Alison K East
- Department of Plant Sciences, University of Oxford, South Parks Road, Oxford OX1 3RB, U.K
| | - Wei E Huang
- Department of Engineering, University of Oxford, Parks Road, Oxford OX1 3PJ, U.K
| | - Nicholas J Watmough
- School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich, Norfolk NR4 7TJ, U.K
| | - Philip S Poole
- Department of Plant Sciences, University of Oxford, South Parks Road, Oxford OX1 3RB, U.K
- Department of Molecular Microbiology, John Innes Centre, Norwich Research Park, Norwich NR4 7UH, U.K
| |
Collapse
|
7
|
Ibrahim AGAER, Vêncio RZN, Lorenzetti APR, Koide T. Halobacterium salinarum and Haloferax volcanii Comparative Transcriptomics Reveals Conserved Transcriptional Processing Sites. Genes (Basel) 2021; 12:genes12071018. [PMID: 34209065 PMCID: PMC8303175 DOI: 10.3390/genes12071018] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Revised: 05/25/2021] [Accepted: 05/27/2021] [Indexed: 01/15/2023] Open
Abstract
Post-transcriptional processing of messenger RNA is an important regulatory strategy that allows relatively fast responses to changes in environmental conditions. In halophile systems biology, the protein perspective of this problem (i.e., ribonucleases which implement the cleavages) is generally more studied than the RNA perspective (i.e., processing sites). In the present in silico work, we mapped genome-wide transcriptional processing sites (TPS) in two halophilic model organisms, Halobacterium salinarum NRC-1 and Haloferax volcanii DS2. TPS were established by reanalysis of publicly available differential RNA-seq (dRNA-seq) data, searching for non-primary (monophosphorylated RNAs) enrichment. We found 2093 TPS in 43% of H. salinarum genes and 3515 TPS in 49% of H. volcanii chromosomal genes. Of the 244 conserved TPS sites found, the majority were located around start and stop codons of orthologous genes. Specific genes are highlighted when discussing antisense, ribosome and insertion sequence associated TPS. Examples include the cell division gene ftsZ2, whose differential processing signal along growth was detected and correlated with post-transcriptional regulation, and biogenesis of sense overlapping transcripts associated with IS200/IS605. We hereby present the comparative, transcriptomics-based processing site maps with a companion browsing interface.
Collapse
Affiliation(s)
- Amr Galal Abd El-Raheem Ibrahim
- Department of Computation and Mathematics, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto 14040-900, Brazil; (A.G.A.E.-R.I.); (R.Z.N.V.)
| | - Ricardo Z. N. Vêncio
- Department of Computation and Mathematics, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto 14040-900, Brazil; (A.G.A.E.-R.I.); (R.Z.N.V.)
| | - Alan P. R. Lorenzetti
- Department of Biochemistry and Immunology, Ribeirão Preto Medical School, Universidade de São Paulo, Ribeirão Preto 14040-900, Brazil;
| | - Tie Koide
- Department of Biochemistry and Immunology, Ribeirão Preto Medical School, Universidade de São Paulo, Ribeirão Preto 14040-900, Brazil;
- Correspondence: ; Tel.: +55-16-3315-3107
| |
Collapse
|
8
|
Narra HP, Sahni A, Alsing J, Schroeder CLC, Golovko G, Nia AM, Fofanov Y, Khanipov K, Sahni SK. Comparative transcriptomic analysis of Rickettsia conorii during in vitro infection of human and tick host cells. BMC Genomics 2020; 21:665. [PMID: 32977742 PMCID: PMC7519539 DOI: 10.1186/s12864-020-07077-w] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2020] [Accepted: 09/17/2020] [Indexed: 01/04/2023] Open
Abstract
BACKGROUND Pathogenic Rickettsia species belonging to the spotted fever group are arthropod-borne, obligate intracellular bacteria which exhibit preferential tropism for host microvascular endothelium in the mammalian hosts, resulting in disease manifestations attributed primarily to endothelial damage or dysfunction. Although rickettsiae are known to undergo evolution through genomic reduction, the mechanisms by which these pathogens regulate their transcriptome to ensure survival in tick vectors and maintenance by transovarial/transstadial transmission, in contrast to their ability to cause debilitating infections in human hosts remain unknown. In this study, we compare the expression profiles of rickettsial sRNAome/transcriptome and determine the transcriptional start sites (TSSs) of R. conorii transcripts during in vitro infection of human and tick host cells. RESULTS We performed deep sequencing on total RNA from Amblyomma americanum AAE2 cells and human microvascular endothelial cells (HMECs) infected with R. conorii. Strand-specific RNA sequencing of R. conorii transcripts revealed the expression 32 small RNAs (Rc_sR's), which were preferentially expressed above the limit of detection during tick cell infection, and confirmed the expression of Rc_sR61, sR71, and sR74 by quantitative RT-PCR. Intriguingly, a total of 305 and 132 R. conorii coding genes were differentially upregulated (> 2-fold) in AAE2 cells and HMECs, respectively. Further, enrichment for primary transcripts by treatment with Terminator 5'-Phosphate-dependent Exonuclease resulted in the identification of 3903 and 2555 transcription start sites (TSSs), including 214 and 181 primary TSSs in R. conorii during the infection to tick and human host cells, respectively. Seventy-five coding genes exhibited different TSSs depending on the host environment. Finally, we also observed differential expression of 6S RNA during host-pathogen and vector-pathogen interactions in vitro, implicating an important role for this noncoding RNA in the regulation of rickettsial transcriptome depending on the supportive host niche. CONCLUSIONS In sum, the findings of this study authenticate the presence of novel Rc_sR's in R. conorii, reveal the first evidence for differential expression of coding transcripts and utilization of alternate transcriptional start sites depending on the host niche, and implicate a role for 6S RNA in the regulation of coding transcriptome during tripartite host-pathogen-vector interactions.
Collapse
Affiliation(s)
- Hema P Narra
- Department of Pathology, University of Texas Medical Branch, Galveston, TX, 77555, USA.
| | - Abha Sahni
- Department of Pathology, University of Texas Medical Branch, Galveston, TX, 77555, USA
| | - Jessica Alsing
- Department of Pathology, University of Texas Medical Branch, Galveston, TX, 77555, USA
| | - Casey L C Schroeder
- Department of Pathology, University of Texas Medical Branch, Galveston, TX, 77555, USA
| | - George Golovko
- Department of Pharmacology and Toxicology, University of Texas Medical Branch, Galveston, TX, 77555, USA
| | - Anna M Nia
- Biochemistry and Molecular Biology, University of Texas Medical Branch, Galveston, TX, 77555, USA
| | - Yuriy Fofanov
- Department of Pharmacology and Toxicology, University of Texas Medical Branch, Galveston, TX, 77555, USA
| | - Kamil Khanipov
- Department of Pharmacology and Toxicology, University of Texas Medical Branch, Galveston, TX, 77555, USA
| | - Sanjeev K Sahni
- Department of Pathology, University of Texas Medical Branch, Galveston, TX, 77555, USA.
| |
Collapse
|
9
|
Cervantes-Rivera R, Puhar A. Whole-genome Identification of Transcriptional Start Sites by Differential RNA-seq in Bacteria. Bio Protoc 2020; 10:e3757. [PMID: 33659416 PMCID: PMC7842792 DOI: 10.21769/bioprotoc.3757] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2020] [Revised: 07/25/2020] [Accepted: 07/23/2020] [Indexed: 11/02/2022] Open
Abstract
Gene transcription in bacteria often starts some nucleotides upstream of the start codon. Identifying the specific Transcriptional Start Site (TSS) is essential for genetic manipulation, as in many cases upstream of the start codon there are sequence elements that are involved in gene expression regulation. Taken into account the classical gene structure, we are able to identify two kinds of transcriptional start site: primary and secondary. A primary transcriptional start site is located some nucleotides upstream of the translational start site, while a secondary transcriptional start site is located within the gene encoding sequence. Here, we present a step by step protocol for genome-wide transcriptional start sites determination by differential RNA-sequencing (dRNA-seq) using the enteric pathogen Shigella flexneri serotype 5a strain M90T as model. However, this method can be employed in any other bacterial species of choice. In the first steps, total RNA is purified from bacterial cultures using the hot phenol method. Ribosomal RNA (rRNA) is specifically depleted via hybridization probes using a commercial kit. A 5'-monophosphate-dependent exonuclease (TEX)-treated RNA library enriched in primary transcripts is then prepared for comparison with a library that has not undergone TEX-treatment, followed by ligation of an RNA linker adaptor of known sequence allowing the determination of TSS with single nucleotide precision. Finally, the RNA is processed for Illumina sequencing library preparation and sequenced as purchased service. TSS are identified by in-house bioinformatic analysis. Our protocol is cost-effective as it minimizes the use of commercial kits and employs freely available software.
Collapse
Affiliation(s)
- Ramón Cervantes-Rivera
- The Laboratory for Molecular Infection Medicine Sweden (MIMS), Sweden
- Umeå Centre for Microbial Research (UCMR), Umeå University, 90 187 Umeå, Sweden
- Department of Molecular Biology, Umeå University, 90 187 Umeå, Sweden
| | - Andrea Puhar
- The Laboratory for Molecular Infection Medicine Sweden (MIMS), Sweden
- Umeå Centre for Microbial Research (UCMR), Umeå University, 90 187 Umeå, Sweden
- Department of Molecular Biology, Umeå University, 90 187 Umeå, Sweden
| |
Collapse
|
10
|
Soutourina O, Dubois T, Monot M, Shelyakin PV, Saujet L, Boudry P, Gelfand MS, Dupuy B, Martin-Verstraete I. Genome-Wide Transcription Start Site Mapping and Promoter Assignments to a Sigma Factor in the Human Enteropathogen Clostridioides difficile. Front Microbiol 2020; 11:1939. [PMID: 32903654 PMCID: PMC7438776 DOI: 10.3389/fmicb.2020.01939] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2020] [Accepted: 07/23/2020] [Indexed: 12/12/2022] Open
Abstract
The emerging human enteropathogen Clostridioides difficile is the main cause of diarrhea associated with antibiotherapy. Regulatory pathways underlying the adaptive responses remain understudied and the global view of C. difficile promoter structure is still missing. In the genome of C. difficile 630, 22 genes encoding sigma factors are present suggesting a complex pattern of transcription in this bacterium. We present here the first transcriptional map of the C. difficile genome resulting from the identification of transcriptional start sites (TSS), promoter motifs and operon structures. By 5′-end RNA-seq approach, we mapped more than 1000 TSS upstream of genes. In addition to these primary TSS, this analysis revealed complex structure of transcriptional units such as alternative and internal promoters, potential RNA processing events and 5′ untranslated regions. By following an in silico iterative strategy that used as an input previously published consensus sequences and transcriptomic analysis, we identified candidate promoters upstream of most of protein-coding and non-coding RNAs genes. This strategy also led to refine consensus sequences of promoters recognized by major sigma factors of C. difficile. Detailed analysis focuses on the transcription in the pathogenicity locus and regulatory genes, as well as regulons of transition phase and sporulation sigma factors as important components of C. difficile regulatory network governing toxin gene expression and spore formation. Among the still uncharacterized regulons of the major sigma factors of C. difficile, we defined the SigL regulon by combining transcriptome and in silico analyses. We showed that the SigL regulon is largely involved in amino-acid degradation, a metabolism crucial for C. difficile gut colonization. Finally, we combined our TSS mapping, in silico identification of promoters and RNA-seq data to improve gene annotation and to suggest operon organization in C. difficile. These data will considerably improve our knowledge of global regulatory circuits controlling gene expression in C. difficile and will serve as a useful rich resource for scientific community both for the detailed analysis of specific genes and systems biology studies.
Collapse
Affiliation(s)
- Olga Soutourina
- Laboratoire Pathogenèses des Bactéries Anaérobies, Institut Pasteur, UMR CNRS 2001, Université de Paris, Paris, France.,Institut Universitaire de France, Paris, France.,Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), Gif-sur-Yvette, France
| | - Thomas Dubois
- Laboratoire Pathogenèses des Bactéries Anaérobies, Institut Pasteur, UMR CNRS 2001, Université de Paris, Paris, France
| | - Marc Monot
- Laboratoire Pathogenèses des Bactéries Anaérobies, Institut Pasteur, UMR CNRS 2001, Université de Paris, Paris, France
| | | | - Laure Saujet
- Laboratoire Pathogenèses des Bactéries Anaérobies, Institut Pasteur, UMR CNRS 2001, Université de Paris, Paris, France
| | - Pierre Boudry
- Laboratoire Pathogenèses des Bactéries Anaérobies, Institut Pasteur, UMR CNRS 2001, Université de Paris, Paris, France
| | - Mikhail S Gelfand
- Institute for Information Transmission Problems, Moscow, Russia.,Skolkovo Institute of Science and Technology, Moscow, Russia
| | - Bruno Dupuy
- Laboratoire Pathogenèses des Bactéries Anaérobies, Institut Pasteur, UMR CNRS 2001, Université de Paris, Paris, France
| | - Isabelle Martin-Verstraete
- Laboratoire Pathogenèses des Bactéries Anaérobies, Institut Pasteur, UMR CNRS 2001, Université de Paris, Paris, France.,Institut Universitaire de France, Paris, France
| |
Collapse
|
11
|
de la Fuente L, Arzalluz-Luque Á, Tardáguila M, Del Risco H, Martí C, Tarazona S, Salguero P, Scott R, Lerma A, Alastrue-Agudo A, Bonilla P, Newman JRB, Kosugi S, McIntyre LM, Moreno-Manzano V, Conesa A. tappAS: a comprehensive computational framework for the analysis of the functional impact of differential splicing. Genome Biol 2020; 21:119. [PMID: 32423416 PMCID: PMC7236505 DOI: 10.1186/s13059-020-02028-w] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2019] [Accepted: 04/23/2020] [Indexed: 12/26/2022] Open
Abstract
Recent advances in long-read sequencing solve inaccuracies in alternative transcript identification of full-length transcripts in short-read RNA-Seq data, which encourages the development of methods for isoform-centered functional analysis. Here, we present tappAS, the first framework to enable a comprehensive Functional Iso-Transcriptomics (FIT) analysis, which is effective at revealing the functional impact of context-specific post-transcriptional regulation. tappAS uses isoform-resolved annotation of coding and non-coding functional domains, motifs, and sites, in combination with novel analysis methods to interrogate different aspects of the functional readout of transcript variants and isoform regulation. tappAS software and documentation are available at https://app.tappas.org.
Collapse
Affiliation(s)
- Lorena de la Fuente
- Genomics of Gene Expression Laboratory, Prince Felipe Research Center, Valencia, Spain
- Present Address: Bioinformatics Unit, IIS Fundación Jiménez Díaz, Madrid, Spain
| | - Ángeles Arzalluz-Luque
- Department of Statistics and Operational Research, Polytechnical University of Valencia, Valencia, Spain
| | - Manuel Tardáguila
- Department of Microbiology and Cell Science, Institute for Food and Agricultural Sciences, University of Florida, Gainesville, FL, USA
- Present Address: Human Genetics Department, Wellcome Trust Sanger Institute, Hinxton, Cambridge, UK
| | - Héctor Del Risco
- Department of Microbiology and Cell Science, Institute for Food and Agricultural Sciences, University of Florida, Gainesville, FL, USA
| | - Cristina Martí
- Genomics of Gene Expression Laboratory, Prince Felipe Research Center, Valencia, Spain
| | - Sonia Tarazona
- Department of Statistics and Operational Research, Polytechnical University of Valencia, Valencia, Spain
| | - Pedro Salguero
- Genomics of Gene Expression Laboratory, Prince Felipe Research Center, Valencia, Spain
| | - Raymond Scott
- Department of Microbiology and Cell Science, Institute for Food and Agricultural Sciences, University of Florida, Gainesville, FL, USA
| | - Alberto Lerma
- Genomics of Gene Expression Laboratory, Prince Felipe Research Center, Valencia, Spain
| | - Ana Alastrue-Agudo
- Present Address: Human Genetics Department, Wellcome Trust Sanger Institute, Hinxton, Cambridge, UK
| | - Pablo Bonilla
- Present Address: Human Genetics Department, Wellcome Trust Sanger Institute, Hinxton, Cambridge, UK
| | - Jeremy R B Newman
- Genetics Institute, University of Florida, Gainesville, FL, USA
- Department of Pathology, University of Florida, Gainesville, FL, USA
| | - Shunichi Kosugi
- Genetics Institute, University of Florida, Gainesville, FL, USA
- Laboratory for Statistical and Translational Genetics, Center for Integrative Medical Sciences, RIKEN, Wako, Japan
| | - Lauren M McIntyre
- Genetics Institute, University of Florida, Gainesville, FL, USA
- Department of Molecular Genetics and Microbiology, University of Florida, Gainesville, FL, USA
| | | | - Ana Conesa
- Department of Microbiology and Cell Science, Institute for Food and Agricultural Sciences, University of Florida, Gainesville, FL, USA.
- Genetics Institute, University of Florida, Gainesville, FL, USA.
| |
Collapse
|
12
|
Ozuna A, Liberto D, Joyce RM, Arnvig KB, Nobeli I. baerhunter: an R package for the discovery and analysis of expressed non-coding regions in bacterial RNA-seq data. Bioinformatics 2020; 36:966-969. [PMID: 31418770 DOI: 10.1093/bioinformatics/btz643] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2019] [Revised: 07/29/2019] [Accepted: 08/13/2019] [Indexed: 12/12/2022] Open
Abstract
SUMMARY Standard bioinformatics pipelines for the analysis of bacterial transcriptomic data commonly ignore non-coding but functional elements e.g. small RNAs, long antisense RNAs or untranslated regions (UTRs) of mRNA transcripts. The root of this problem is the use of incomplete genome annotation files. Here, we present baerhunter, a coverage-based method implemented in R, that automates the discovery of expressed non-coding RNAs and UTRs from RNA-seq reads mapped to a reference genome. The core algorithm is part of a pipeline that facilitates downstream analysis of both coding and non-coding features. The method is simple, easy to extend and customize and, in limited tests with simulated and real data, compares favourably against the currently most popular alternative. AVAILABILITY AND IMPLEMENTATION The baerhunter R package is available from: https://github.com/irilenia/baerhunter. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- A Ozuna
- Department of Biological Sciences, Institute of Structural and Molecular Biology, London, WC1E 7HX, UK
| | - D Liberto
- Department of Biological Sciences, Institute of Structural and Molecular Biology, London, WC1E 7HX, UK
| | - R M Joyce
- Department of Biological Sciences, Institute of Structural and Molecular Biology, London, WC1E 7HX, UK
| | - K B Arnvig
- Institute of Structural and Molecular Biology, Division of Biosciences, University College London, London, WC1E 6BT, UK
| | - I Nobeli
- Department of Biological Sciences, Institute of Structural and Molecular Biology, London, WC1E 7HX, UK
| |
Collapse
|
13
|
de Souza Pinto Lemgruber R, Valgepea K, Gonzalez Garcia RA, de Bakker C, Palfreyman RW, Tappel R, Köpke M, Simpson SD, Nielsen LK, Marcellin E. A TetR-Family Protein (CAETHG_0459) Activates Transcription From a New Promoter Motif Associated With Essential Genes for Autotrophic Growth in Acetogens. Front Microbiol 2019; 10:2549. [PMID: 31803150 PMCID: PMC6873888 DOI: 10.3389/fmicb.2019.02549] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2018] [Accepted: 10/22/2019] [Indexed: 01/08/2023] Open
Abstract
Acetogens can fix carbon (CO or CO2) into acetyl-CoA via the Wood-Ljungdahl pathway (WLP) that also makes them attractive cell factories for the production of fuels and chemicals from waste feedstocks. Although most biochemical details of the WLP are well understood and systems-level characterization of acetogen metabolism has recently improved, key transcriptional features such as promoter motifs and transcriptional regulators are still unknown in acetogens. Here, we use differential RNA-sequencing to identify a previously undescribed promoter motif associated with essential genes for autotrophic growth of the model-acetogen Clostridium autoethanogenum. RNA polymerase was shown to bind to the new promoter motif using a DNA-binding protein assay and proteomics enabled the discovery of four candidates to potentially function directly in control of transcription of the WLP and other key genes of C1 fixation metabolism. Next, in vivo experiments showed that a TetR-family transcriptional regulator (CAETHG_0459) and the housekeeping sigma factor (σA) activate expression of a reporter protein (GFP) in-frame with the new promoter motif from a fusion vector in Escherichia coli. Lastly, a protein-protein interaction assay with the RNA polymerase (RNAP) shows that CAETHG_0459 directly binds to the RNAP. Together, the data presented here advance the fundamental understanding of transcriptional regulation of C1 fixation in acetogens and provide a strategy for improving the performance of gas-fermenting bacteria by genetic engineering.
Collapse
Affiliation(s)
| | - Kaspar Valgepea
- Australian Institute for Bioengineering and Nanotechnology (AIBN), The University of Queensland, Brisbane, QLD, Australia
- ERA Chair in Gas Fermentation Technologies, Institute of Technology, University of Tartu, Tartu, Estonia
| | | | - Christopher de Bakker
- Australian Institute for Bioengineering and Nanotechnology (AIBN), The University of Queensland, Brisbane, QLD, Australia
| | - Robin William Palfreyman
- Australian Institute for Bioengineering and Nanotechnology (AIBN), The University of Queensland, Brisbane, QLD, Australia
- Queensland Node of Metabolomics Australia, The University of Queensland, Brisbane, QLD, Australia
| | | | | | | | - Lars Keld Nielsen
- Australian Institute for Bioengineering and Nanotechnology (AIBN), The University of Queensland, Brisbane, QLD, Australia
| | - Esteban Marcellin
- Australian Institute for Bioengineering and Nanotechnology (AIBN), The University of Queensland, Brisbane, QLD, Australia
- Queensland Node of Metabolomics Australia, The University of Queensland, Brisbane, QLD, Australia
| |
Collapse
|
14
|
Comparative Transcriptomic Profiling of Yersinia enterocolitica O:3 and O:8 Reveals Major Expression Differences of Fitness- and Virulence-Relevant Genes Indicating Ecological Separation. mSystems 2019; 4:mSystems00239-18. [PMID: 31020044 PMCID: PMC6478967 DOI: 10.1128/msystems.00239-18] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2018] [Accepted: 02/27/2019] [Indexed: 01/16/2023] Open
Abstract
Yersinia enterocolitica is a major diarrheal pathogen and is associated with a large range of gut-associated diseases. Members of this species have evolved into different phylogroups with genotypic variations. We performed the first characterization of the Y. enterocolitica transcriptional landscape and tracked the consequences of the genomic variations between two different pathogenic phylogroups by comparing their RNA repertoire, promoter usage, and expression profiles under four different virulence-relevant conditions. Our analysis revealed major differences in the transcriptional outputs of the closely related strains, pointing to an ecological separation in which one is more adapted to an environmental lifestyle and the other to a mostly mammal-associated lifestyle. Moreover, a variety of pathoadaptive alterations, including alterations in acid resistance genes, colonization factors, and toxins, were identified which affect virulence and host specificity. This illustrates that comparative transcriptomics is an excellent approach to discover differences in the functional output from closely related genomes affecting niche adaptation and virulence, which cannot be directly inferred from DNA sequences. Yersinia enterocolitica is a zoonotic pathogen and an important cause of bacterial gastrointestinal infections in humans. Large-scale population genomic analyses revealed genetic and phenotypic diversity of this bacterial species, but little is known about the differences in the transcriptome organization, small RNA (sRNA) repertoire, and transcriptional output. Here, we present the first comparative high-resolution transcriptome analysis of Y. enterocolitica strains representing highly pathogenic phylogroup 2 (serotype O:8) and moderately pathogenic phylogroup 3 (serotype O:3) grown under four infection-relevant conditions. Our transcriptome sequencing (RNA-seq) approach revealed 1,299 and 1,076 transcriptional start sites and identified strain-specific sRNAs that could contribute to differential regulation among the phylogroups. Comparative transcriptomics further uncovered major gene expression differences, in particular, in the temperature-responsive regulon. Multiple virulence-relevant genes are differentially regulated between the two strains, supporting an ecological separation of phylogroups with certain niche-adapted properties. Strong upregulation of the ystA enterotoxin gene in combination with constitutive high expression of cell invasion factor InvA further showed that the toxicity of recent outbreak O:3 strains has increased. Overall, our report provides new insights into the specific transcriptome organization of phylogroups 2 and 3 and reveals gene expression differences contributing to the substantial phenotypic differences that exist between the lineages. IMPORTANCEYersinia enterocolitica is a major diarrheal pathogen and is associated with a large range of gut-associated diseases. Members of this species have evolved into different phylogroups with genotypic variations. We performed the first characterization of the Y. enterocolitica transcriptional landscape and tracked the consequences of the genomic variations between two different pathogenic phylogroups by comparing their RNA repertoire, promoter usage, and expression profiles under four different virulence-relevant conditions. Our analysis revealed major differences in the transcriptional outputs of the closely related strains, pointing to an ecological separation in which one is more adapted to an environmental lifestyle and the other to a mostly mammal-associated lifestyle. Moreover, a variety of pathoadaptive alterations, including alterations in acid resistance genes, colonization factors, and toxins, were identified which affect virulence and host specificity. This illustrates that comparative transcriptomics is an excellent approach to discover differences in the functional output from closely related genomes affecting niche adaptation and virulence, which cannot be directly inferred from DNA sequences.
Collapse
|
15
|
The Primary Antisense Transcriptome of Halobacterium salinarum NRC-1. Genes (Basel) 2019; 10:genes10040280. [PMID: 30959844 PMCID: PMC6523106 DOI: 10.3390/genes10040280] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2019] [Revised: 04/01/2019] [Accepted: 04/01/2019] [Indexed: 12/17/2022] Open
Abstract
Antisense RNAs (asRNAs) are present in diverse organisms and play important roles in gene regulation. In this work, we mapped the primary antisense transcriptome in the halophilic archaeon Halobacterium salinarum NRC-1. By reanalyzing publicly available data, we mapped antisense transcription start sites (aTSSs) and inferred the probable 3′ ends of these transcripts. We analyzed the resulting asRNAs according to the size, location, function of genes on the opposite strand, expression levels and conservation. We show that at least 21% of the genes contain asRNAs in H. salinarum. Most of these asRNAs are expressed at low levels. They are located antisense to genes related to distinctive characteristics of H. salinarum, such as bacteriorhodopsin, gas vesicles, transposases and other important biological processes such as translation. We provide evidence to support asRNAs in type II toxin–antitoxin systems in archaea. We also analyzed public Ribosome profiling (Ribo-seq) data and found that ~10% of the asRNAs are ribosome-associated non-coding RNAs (rancRNAs), with asRNAs from transposases overrepresented. Using a comparative transcriptomics approach, we found that ~19% of the asRNAs annotated in H. salinarum belong to genes with an ortholog in Haloferax volcanii, in which an aTSS could be identified with positional equivalence. This shows that most asRNAs are not conserved between these halophilic archaea.
Collapse
|
16
|
Kusmierek M, Heroven AK, Beckstette M, Nuss AM, Dersch P. Discovering Yersinia-Host Interactions by Tissue Dual RNA-Seq. Methods Mol Biol 2019; 2010:99-116. [PMID: 31177434 DOI: 10.1007/978-1-4939-9541-7_8] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
A detailed knowledge about virulence-relevant genes, as well as where and when they are expressed during the course of an infection is required to obtain a comprehensive understanding of the complex host-pathogen interactions. The development of unbiased probe-independent RNA sequencing (RNA-seq) approaches has dramatically changed transcriptomics. It allows simultaneous monitoring of genome-wide, infection-linked transcriptional alterations of the host tissue and colonizing pathogens. Here, we provide a detailed protocol for the preparation and analysis of lymphatic tissue infected with the mainly extracellularly growing pathogen Yersinia pseudotuberculosis. This method can be used as a powerful tool for the discovery of Yersinia-induced host responses, colonization and persistence strategies of the pathogen, and underlying regulatory processes. Furthermore, we describe computational methods with which we analyzed obtained datasets.
Collapse
Affiliation(s)
- Maria Kusmierek
- Department of Molecular Infection Biology, Helmholtz Centre for Infection Research, Braunschweig, Germany
| | - Ann Kathrin Heroven
- Department of Molecular Infection Biology, Helmholtz Centre for Infection Research, Braunschweig, Germany
| | - Michael Beckstette
- Department of Molecular Infection Biology, Helmholtz Centre for Infection Research, Braunschweig, Germany
| | - Aaron M Nuss
- Department of Molecular Infection Biology, Helmholtz Centre for Infection Research, Braunschweig, Germany
| | - Petra Dersch
- Department of Molecular Infection Biology, Helmholtz Centre for Infection Research, Braunschweig, Germany.
- Institute of Infectiology, University of Münster, Münster, Germany.
| |
Collapse
|
17
|
Garanina IA, Fisunov GY, Govorun VM. BAC-BROWSER: The Tool for Visualization and Analysis of Prokaryotic Genomes. Front Microbiol 2018; 9:2827. [PMID: 30519231 PMCID: PMC6258810 DOI: 10.3389/fmicb.2018.02827] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2018] [Accepted: 11/05/2018] [Indexed: 11/13/2022] Open
Abstract
Prokaryotes are actively studied objects in the scope of genomic regulation. Microbiologists need special tools for complex analysis of data to study and identification of regulatory mechanism in bacteria and archaea. We developed a tool BAC-BROWSER, specifically for visualization and analysis of small prokaryotic genomes. BAC-BROWSER provides tools for different types of analysis to study a wide set of regulatory mechanisms of prokaryotes: -transcriptional regulation by transcription factors (TFs), analysis of TFs, their targets, and binding sites.-other regulatory motifs, promoters, terminators and ribosome binding sites-transcriptional regulation by variation of operon structure, alternative starts or ends of transcription.-non-coding RNAs, antisense RNAs-RNA secondary structure, riboswitches-GC content, GC skew, codon usage BAC-browser incorporated free programs accelerating the verification of obtained results: primer design and oligocalculator, vector visualization, the tool for synthetic gene construction. The program is designed for Windows operating system and freely available for download in http://smdb.rcpcm.org/tools/index.html.
Collapse
Affiliation(s)
- Irina A Garanina
- Federal Research and Clinical Centre of Physical-Chemical Medicine, Moscow, Russia.,Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, Russia
| | - Gleb Y Fisunov
- Federal Research and Clinical Centre of Physical-Chemical Medicine, Moscow, Russia
| | - Vadim M Govorun
- Federal Research and Clinical Centre of Physical-Chemical Medicine, Moscow, Russia.,Moscow Institute of Physics and Technology, Dolgoprudny, Russia
| |
Collapse
|
18
|
Ten-Caten F, Vêncio RZN, Lorenzetti APR, Zaramela LS, Santana AC, Koide T. Internal RNAs overlapping coding sequences can drive the production of alternative proteins in archaea. RNA Biol 2018; 15:1119-1132. [PMID: 30175688 PMCID: PMC6161675 DOI: 10.1080/15476286.2018.1509661] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
Abstract
Prokaryotic genomes show a high level of information compaction often with different molecules transcribed from the same locus. Although antisense RNAs have been relatively well studied, RNAs in the same strand, internal RNAs (intraRNAs), are still poorly understood. The question of how common is the translation of overlapping reading frames remains open. We address this question in the model archaeon Halobacterium salinarum. In the present work we used differential RNA-seq (dRNA-seq) in H. salinarum NRC-1 to locate intraRNA signals in subsets of internal transcription start sites (iTSS) and establish the open reading frames associated to them (intraORFs). Using C-terminally flagged proteins, we experimentally observed isoforms accurately predicted by intraRNA translation for kef1, acs3 and orc4 genes. We also recovered from the literature and mass spectrometry databases several instances of protein isoforms consistent with intraRNA translation such as the gas vesicle protein gene gvpC1. We found evidence for intraRNAs in horizontally transferred genes such as the chaperone dnaK and the aerobic respiration related cydA in both H. salinarum and Escherichia coli. Also, intraRNA translation evidence in H. salinarum, E. coli and yeast of a universal elongation factor (aEF-2, fusA and eEF-2) suggests that this is an ancient phenomenon present in all domains of life.
Collapse
Affiliation(s)
- Felipe Ten-Caten
- a Department of Biochemistry and Immunology , Ribeirão Preto Medical School, University of São Paulo , Ribeirão Preto , Brazil
| | - Ricardo Z N Vêncio
- b Department of Computation and Mathematics, Faculdade de Filosofia , Ciências e Letras de Ribeirão Preto, University of São Paulo , Ribeirão Preto , Brazil
| | - Alan Péricles R Lorenzetti
- a Department of Biochemistry and Immunology , Ribeirão Preto Medical School, University of São Paulo , Ribeirão Preto , Brazil
| | - Livia Soares Zaramela
- a Department of Biochemistry and Immunology , Ribeirão Preto Medical School, University of São Paulo , Ribeirão Preto , Brazil
| | - Ana Carolina Santana
- c Department of Cell and Molecular Biology and Pathogenic Bioagents , Ribeirão Preto Medical School, University of São Paulo , Ribeirão Preto , Brazil
| | - Tie Koide
- a Department of Biochemistry and Immunology , Ribeirão Preto Medical School, University of São Paulo , Ribeirão Preto , Brazil
| |
Collapse
|
19
|
Yu SH, Vogel J, Förstner KU. ANNOgesic: a Swiss army knife for the RNA-seq based annotation of bacterial/archaeal genomes. Gigascience 2018; 7:5087959. [PMID: 30169674 PMCID: PMC6123526 DOI: 10.1093/gigascience/giy096] [Citation(s) in RCA: 40] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2018] [Accepted: 08/23/2018] [Indexed: 11/13/2022] Open
Abstract
To understand the gene regulation of an organism of interest, a comprehensive genome annotation is essential. While some features, such as coding sequences, can be computationally predicted with high accuracy based purely on the genomic sequence, others, such as promoter elements or noncoding RNAs, are harder to detect. RNA sequencing (RNA-seq) has proven to be an efficient method to identify these genomic features and to improve genome annotations. However, processing and integrating RNA-seq data in order to generate high-resolution annotations is challenging, time consuming, and requires numerous steps. We have constructed a powerful and modular tool called ANNOgesic that provides the required analyses and simplifies RNA-seq-based bacterial and archaeal genome annotation. It can integrate data from conventional RNA-seq and differential RNA-seq and predicts and annotates numerous features, including small noncoding RNAs, with high precision. The software is available under an open source license (ISCL) at https://pypi.org/project/ANNOgesic/.
Collapse
Affiliation(s)
- Sung-Huan Yu
- Institute of Molecular Infection Biology (IMIB), University of Würzburg, Josef-Schneider-Straße 2, 97080 Würzburg, Germany
| | - Jörg Vogel
- Institute of Molecular Infection Biology (IMIB), University of Würzburg, Josef-Schneider-Straße 2, 97080 Würzburg, Germany.,Helmholtz Institute for RNA-based Infection Research (HIRI), Josef-Schneider-Straße 2, 97080 Würzburg Germany
| | - Konrad U Förstner
- Institute of Molecular Infection Biology (IMIB), University of Würzburg, Josef-Schneider-Straße 2, 97080 Würzburg, Germany.,ZB MED - Information Center for Life Sciences, Informationservices, Gleueler Straße 60, 50931 Cologne (Köln), Germany.,Technical University of Cologne, Faculty for Information and Communication Sciences, Claudiusstraße 1, 50678 Cologne (Köln), Germany
| |
Collapse
|
20
|
Moldován N, Tombácz D, Szűcs A, Csabai Z, Balázs Z, Kis E, Molnár J, Boldogkői Z. Third-generation Sequencing Reveals Extensive Polycistronism and Transcriptional Overlapping in a Baculovirus. Sci Rep 2018; 8:8604. [PMID: 29872099 PMCID: PMC5988703 DOI: 10.1038/s41598-018-26955-8] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2018] [Accepted: 05/22/2018] [Indexed: 12/11/2022] Open
Abstract
The Autographa californica multiple nucleopolyhedrovirus (AcMNPV) is an insect-pathogen baculovirus. In this study, we applied the Oxford Nanopore Technologies platform for the analysis of the polyadenylated fraction of the viral transcriptome using both cDNA and direct RNA sequencing methods. We identified and annotated altogether 132 novel transcripts and transcript isoforms, including 4 coding and 4 non-coding RNA molecules, 47 length variants, 5 splice isoforms, as well as 23 polycistronic and 49 complex transcripts. All of the identified novel protein-coding genes were 5'-truncated forms of longer host genes. In this work, we demonstrated that in the case of transcript start site isoforms, the promoters and the initiator sequence of the longer and shorter variants belong to the same kinetic class. Long-read sequencing also revealed a complex meshwork of transcriptional overlaps, the function of which needs to be clarified. Additionally, we developed bioinformatics methods to improve the transcript annotation and to eliminate the non-specific transcription reads generated by template switching and false priming.
Collapse
Affiliation(s)
- Norbert Moldován
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, 6720, Hungary
| | - Dóra Tombácz
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, 6720, Hungary
| | - Attila Szűcs
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, 6720, Hungary
| | - Zsolt Csabai
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, 6720, Hungary
| | - Zsolt Balázs
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, 6720, Hungary
| | - Emese Kis
- Solvo Biotechnology, Szeged, 6720, Hungary
| | | | - Zsolt Boldogkői
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, 6720, Hungary.
| |
Collapse
|
21
|
Amman F, D'Halluin A, Antoine R, Huot L, Bibova I, Keidel K, Slupek S, Bouquet P, Coutte L, Caboche S, Locht C, Vecerek B, Hot D. Primary transcriptome analysis reveals importance of IS elements for the shaping of the transcriptional landscape of Bordetella pertussis. RNA Biol 2018; 15:967-975. [PMID: 29683387 PMCID: PMC6161684 DOI: 10.1080/15476286.2018.1462655] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2018] [Accepted: 04/03/2018] [Indexed: 12/25/2022] Open
Abstract
Bordetella pertussis is the causative agent of whooping cough, a respiratory disease still considered as a major public health threat and for which recent re-emergence has been observed. Constant reshuffling of Bordetella pertussis genome organization was observed during evolution. These rearrangements are essentially mediated by Insertion Sequences (IS), a mobile genetic elements present in more than 230 copies in the genome, which are supposed to be one of the driving forces enabling the pathogen to escape from vaccine-induced immunity. Here we use high-throughput sequencing approaches (RNA-seq and differential RNA-seq), to decipher Bordetella pertussis transcriptome characteristics and to evaluate the impact of IS elements on transcriptome architecture. Transcriptional organization was determined by identification of transcription start sites and revealed also a large variety of non-coding RNAs including sRNAs, leaderless mRNAs or long 3' and 5'UTR including seven riboswitches. Unusual topological organizations, such as overlapping 5'- or 3'-extremities between oppositely orientated mRNA were also unveiled. The pivotal role of IS elements in the transcriptome architecture and their effect on the transcription of neighboring genes was examined. This effect is mediated by the introduction of IS harbored promoters or by emergence of hybrid promoters. This study revealed that in addition to their impact on genome rearrangements, most of the IS also impact on the expression of their flanking genes. Furthermore, the transcripts produced by IS are strain-specific due to the strain to strain variation in IS copy number and genomic context.
Collapse
Affiliation(s)
- Fabian Amman
- University of Vienna, Theoretical Biochemistry Group, Institute for Theoretical Chemistry, Vienna, Austria
| | - Alexandre D'Halluin
- Univ. Lille, CNRS, Inserm, CHU Lille, Institut Pasteur de Lille, U1019 - UMR8204 - CIIL - Center for Infection and Immunity of Lille, Lille, France
| | - Rudy Antoine
- Univ. Lille, CNRS, Inserm, CHU Lille, Institut Pasteur de Lille, U1019 - UMR8204 - CIIL - Center for Infection and Immunity of Lille, Lille, France
| | - Ludovic Huot
- Univ. Lille, CNRS, Inserm, CHU Lille, Institut Pasteur de Lille, U1019 - UMR8204 - CIIL - Center for Infection and Immunity of Lille, Lille, France
| | - Ilona Bibova
- Institute of Microbiology of the ASCR; Laboratory of post-transcriptional control of gene expression, Prague, Czech Republic
| | - Kristina Keidel
- Institute of Microbiology of the ASCR; Laboratory of post-transcriptional control of gene expression, Prague, Czech Republic
| | - Stéphanie Slupek
- Univ. Lille, CNRS, Inserm, CHU Lille, Institut Pasteur de Lille, U1019 - UMR8204 - CIIL - Center for Infection and Immunity of Lille, Lille, France
| | - Peggy Bouquet
- Univ. Lille, CNRS, Inserm, CHU Lille, Institut Pasteur de Lille, U1019 - UMR8204 - CIIL - Center for Infection and Immunity of Lille, Lille, France
| | - Loïc Coutte
- Univ. Lille, CNRS, Inserm, CHU Lille, Institut Pasteur de Lille, U1019 - UMR8204 - CIIL - Center for Infection and Immunity of Lille, Lille, France
| | - Ségolène Caboche
- Univ. Lille, CNRS, Inserm, CHU Lille, Institut Pasteur de Lille, U1019 - UMR8204 - CIIL - Center for Infection and Immunity of Lille, Lille, France
| | - Camille Locht
- Univ. Lille, CNRS, Inserm, CHU Lille, Institut Pasteur de Lille, U1019 - UMR8204 - CIIL - Center for Infection and Immunity of Lille, Lille, France
| | - Branislav Vecerek
- Institute of Microbiology of the ASCR; Laboratory of post-transcriptional control of gene expression, Prague, Czech Republic
| | - David Hot
- Univ. Lille, CNRS, Inserm, CHU Lille, Institut Pasteur de Lille, U1019 - UMR8204 - CIIL - Center for Infection and Immunity of Lille, Lille, France
| |
Collapse
|
22
|
Le Scornet A, Redder P. Post-transcriptional control of virulence gene expression in Staphylococcus aureus. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2018; 1862:734-741. [PMID: 29705591 DOI: 10.1016/j.bbagrm.2018.04.004] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/15/2018] [Revised: 04/25/2018] [Accepted: 04/25/2018] [Indexed: 12/12/2022]
Abstract
Opportunistic pathogens have to be ready to change life-style whenever the occasion arises, and therefore need to keep tight control over the expression of their virulence factors. Doubly so for commensal bacteria, such as Staphylococcus aureus, which should avoid harming their hosts when they are in a state of peaceful co-existence. S. aureus carries very few sigma factors to help define the transcriptional programs, but instead uses a plethora of small RNA molecules and RNA-RNA interactions to regulate gene expression post-transcriptionally. The endoribonucleases RNase III and RNase Y contribute to this regulatory diversity, and provide a link to RNA-decay and intra-cellular spatiotemporal control of expression. In this review we describe some of these post-transcriptional mechanisms as well as some of the novel transcriptomic approaches that have been used to find and to study them.
Collapse
Affiliation(s)
- Alexandre Le Scornet
- LMGM, Centre de Biologie Integrative, Paul Sabatier University, 118, Route de Narbonne, 31062 Toulouse, France
| | - Peter Redder
- LMGM, Centre de Biologie Integrative, Paul Sabatier University, 118, Route de Narbonne, 31062 Toulouse, France.
| |
Collapse
|
23
|
Balázs Z, Tombácz D, Szűcs A, Csabai Z, Megyeri K, Petrov AN, Snyder M, Boldogkői Z. Long-Read Sequencing of Human Cytomegalovirus Transcriptome Reveals RNA Isoforms Carrying Distinct Coding Potentials. Sci Rep 2017; 7:15989. [PMID: 29167532 PMCID: PMC5700075 DOI: 10.1038/s41598-017-16262-z] [Citation(s) in RCA: 53] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2017] [Accepted: 11/07/2017] [Indexed: 12/22/2022] Open
Abstract
The human cytomegalovirus (HCMV) is a ubiquitous, human pathogenic herpesvirus. The complete viral genome is transcriptionally active during infection; however, a large part of its transcriptome has yet to be annotated. In this work, we applied the amplified isoform sequencing technique from Pacific Biosciences to characterize the lytic transcriptome of HCMV strain Towne varS. We developed a pipeline for transcript annotation using long-read sequencing data. We identified 248 transcriptional start sites, 116 transcriptional termination sites and 80 splicing events. Using this information, we have annotated 291 previously undescribed or only partially annotated transcript isoforms, including eight novel antisense transcripts and their isoforms, as well as a novel transcript (RS2) in the short repeat region, partially antisense to RS1. Similarly to other organisms, we discovered a high transcriptional diversity in HCMV, with many transcripts only slightly differing from one another. Comparing our transcriptome profiling results to an earlier ribosome footprint analysis, we have concluded that the majority of the transcripts contain multiple translationally active ORFs, and also that most isoforms contain unique combinations of ORFs. Based on these results, we propose that one important function of this transcriptional diversity may be to provide a regulatory mechanism at the level of translation.
Collapse
Affiliation(s)
- Zsolt Balázs
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, 6720, Hungary
| | - Dóra Tombácz
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, 6720, Hungary.,Department of Genetics, School of Medicine, Stanford University, Stanford, California, 94305, USA
| | - Attila Szűcs
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, 6720, Hungary
| | - Zsolt Csabai
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, 6720, Hungary
| | - Klára Megyeri
- Department of Medical Microbiology and Immunobiology, Faculty of Medicine, University of Szeged, Szeged, 6720, Hungary
| | - Alexey N Petrov
- Department of Structural Biology, School of Medicine, Stanford University, Stanford, California, 94305, USA.,Department of Biological Sciences, College of Sciences and Mathematics, Auburn University, Auburn, Alabama, 36849, USA
| | - Michael Snyder
- Department of Genetics, School of Medicine, Stanford University, Stanford, California, 94305, USA
| | - Zsolt Boldogkői
- Department of Medical Biology, Faculty of Medicine, University of Szeged, Szeged, 6720, Hungary.
| |
Collapse
|
24
|
Lott SC, Wolfien M, Riege K, Bagnacani A, Wolkenhauer O, Hoffmann S, Hess WR. Customized workflow development and data modularization concepts for RNA-Sequencing and metatranscriptome experiments. J Biotechnol 2017; 261:85-96. [DOI: 10.1016/j.jbiotec.2017.06.1203] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2017] [Revised: 06/22/2017] [Accepted: 06/26/2017] [Indexed: 12/14/2022]
|
25
|
Nitrogen cost minimization is promoted by structural changes in the transcriptome of N-deprived Prochlorococcus cells. ISME JOURNAL 2017; 11:2267-2278. [PMID: 28585937 PMCID: PMC5607370 DOI: 10.1038/ismej.2017.88] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/02/2016] [Revised: 04/20/2017] [Accepted: 04/28/2017] [Indexed: 01/17/2023]
Abstract
Prochlorococcus is a globally abundant marine cyanobacterium with many adaptations that reduce cellular nutrient requirements, facilitating growth in its nutrient-poor environment. One such genomic adaptation is the preferential utilization of amino acids containing fewer N-atoms, which minimizes cellular nitrogen requirements. We predicted that transcriptional regulation might further reduce cellular N budgets during transient N limitation. To explore this, we compared transcription start sites (TSSs) in Prochlorococcus MED4 under N-deprived and N-replete conditions. Of 64 genes with primary and internal TSSs in both conditions, N-deprived cells initiated transcription downstream of primary TSSs more frequently than N-replete cells. Additionally, 117 genes with only an internal TSS demonstrated increased internal transcription under N-deprivation. These shortened transcripts encode predicted proteins with an average of 21% less N content compared to full-length transcripts. We hypothesized that low translation rates, which afford greater control over protein abundances, would be beneficial to relatively slow-growing organisms like Prochlorococcus. Consistent with this idea, we found that Prochlorococcus exhibits greater usage of glycine–glycine motifs, which causes translational pausing, when compared to faster growing microbes. Our findings indicate that structural changes occur within the Prochlorococcus MED4 transcriptome during N-deprivation, potentially altering the size and structure of proteins expressed under nutrient limitation.
Collapse
|
26
|
Promworn Y, Kaewprommal P, Shaw PJ, Intarapanich A, Tongsima S, Piriyapongsa J. ToNER: A tool for identifying nucleotide enrichment signals in feature-enriched RNA-seq data. PLoS One 2017; 12:e0178483. [PMID: 28542466 PMCID: PMC5444824 DOI: 10.1371/journal.pone.0178483] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2017] [Accepted: 05/12/2017] [Indexed: 11/25/2022] Open
Abstract
Background Biochemical methods are available for enriching 5′ ends of RNAs in prokaryotes, which are employed in the differential RNA-seq (dRNA-seq) and the more recent Cappable-seq protocols. Computational methods are needed to locate RNA 5′ ends from these data by statistical analysis of the enrichment. Although statistical-based analysis methods have been developed for dRNA-seq, they may not be suitable for Cappable-seq data. The more efficient enrichment method employed in Cappable-seq compared with dRNA-seq could affect data distribution and thus algorithm performance. Results We present Transformation of Nucleotide Enrichment Ratios (ToNER), a tool for statistical modeling of enrichment from RNA-seq data obtained from enriched and unenriched libraries. The tool calculates nucleotide enrichment scores and determines the global transformation for fitting to the normal distribution using the Box-Cox procedure. From the transformed distribution, sites of significant enrichment are identified. To increase power of detection, meta-analysis across experimental replicates is offered. We tested the tool on Cappable-seq and dRNA-seq data for identifying Escherichia coli transcript 5′ ends and compared the results with those from the TSSAR tool, which is designed for analyzing dRNA-seq data. When combining results across Cappable-seq replicates, ToNER detects more known transcript 5′ ends than TSSAR. In general, the transcript 5′ ends detected by ToNER but not TSSAR occur in regions which cannot be locally modeled by TSSAR. Conclusion ToNER uses a simple yet robust statistical modeling approach, which can be used for detecting RNA 5′ends from Cappable-seq data, in particular when combining information from experimental replicates. The ToNER tool could potentially be applied for analyzing other RNA-seq datasets in which enrichment for other structural features of RNA is employed. The program is freely available for download at ToNER webpage (http://www4a.biotec.or.th/GI/tools/toner) and GitHub repository (https://github.com/PavitaKae/ToNER).
Collapse
Affiliation(s)
- Yuttachon Promworn
- National Center for Genetic Engineering and Biotechnology (BIOTEC), National Science and Technology Development Agency (NSTDA), Pathum Thani, Thailand
| | - Pavita Kaewprommal
- National Center for Genetic Engineering and Biotechnology (BIOTEC), National Science and Technology Development Agency (NSTDA), Pathum Thani, Thailand
| | - Philip J. Shaw
- National Center for Genetic Engineering and Biotechnology (BIOTEC), National Science and Technology Development Agency (NSTDA), Pathum Thani, Thailand
| | - Apichart Intarapanich
- National Electronics and Computer Technology Center (NECTEC), National Science and Technology Development Agency (NSTDA), Pathum Thani, Thailand
| | - Sissades Tongsima
- National Center for Genetic Engineering and Biotechnology (BIOTEC), National Science and Technology Development Agency (NSTDA), Pathum Thani, Thailand
| | - Jittima Piriyapongsa
- National Center for Genetic Engineering and Biotechnology (BIOTEC), National Science and Technology Development Agency (NSTDA), Pathum Thani, Thailand
- * E-mail:
| |
Collapse
|
27
|
James K, Cockell SJ, Zenkin N. Deep sequencing approaches for the analysis of prokaryotic transcriptional boundaries and dynamics. Methods 2017; 120:76-84. [PMID: 28434904 DOI: 10.1016/j.ymeth.2017.04.016] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2016] [Revised: 04/13/2017] [Accepted: 04/18/2017] [Indexed: 01/13/2023] Open
Abstract
The identification of the protein-coding regions of a genome is straightforward due to the universality of start and stop codons. However, the boundaries of the transcribed regions, conditional operon structures, non-coding RNAs and the dynamics of transcription, such as pausing of elongation, are non-trivial to identify, even in the comparatively simple genomes of prokaryotes. Traditional methods for the study of these areas, such as tiling arrays, are noisy, labour-intensive and lack the resolution required for densely-packed bacterial genomes. Recently, deep sequencing has become increasingly popular for the study of the transcriptome due to its lower costs, higher accuracy and single nucleotide resolution. These methods have revolutionised our understanding of prokaryotic transcriptional dynamics. Here, we review the deep sequencing and data analysis techniques that are available for the study of transcription in prokaryotes, and discuss the bioinformatic considerations of these analyses.
Collapse
Affiliation(s)
- Katherine James
- Centre for Bacterial Cell Biology, Institute for Cell and Molecular Bioscience, Newcastle University, Baddiley-Clark Building, Richardson Road, Newcastle Upon Tyne NE2 4AX, UK.
| | - Simon J Cockell
- Bioinformatics Support Unit, Newcastle University, William Leech Building, Framlington Place, Newcastle Upon Tyne NE2 4HH, UK
| | - Nikolay Zenkin
- Centre for Bacterial Cell Biology, Institute for Cell and Molecular Bioscience, Newcastle University, Baddiley-Clark Building, Richardson Road, Newcastle Upon Tyne NE2 4AX, UK
| |
Collapse
|
28
|
Hilker R, Stadermann KB, Schwengers O, Anisiforov E, Jaenicke S, Weisshaar B, Zimmermann T, Goesmann A. ReadXplorer 2-detailed read mapping analysis and visualization from one single source. Bioinformatics 2016; 32:3702-3708. [PMID: 27540267 PMCID: PMC5167064 DOI: 10.1093/bioinformatics/btw541] [Citation(s) in RCA: 69] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2016] [Revised: 08/02/2016] [Accepted: 08/15/2016] [Indexed: 01/29/2023] Open
Abstract
MOTIVATION The vast amount of already available and currently generated read mapping data requires comprehensive visualization, and should benefit from bioinformatics tools offering a wide spectrum of analysis functionality from just one source. Appropriate handling of multiple mapped reads during mapping analyses remains an issue that demands improvement. RESULTS The capabilities of the read mapping analysis and visualization tool ReadXplorer were vastly enhanced. Here, we present an even finer granulated read mapping classification, improving the level of detail for analyses and visualizations. The spectrum of automatic analysis functions has been broadened to include genome rearrangement detection as well as correlation analysis between two mapping data sets. Existing functions were refined and enhanced, namely the computation of differentially expressed genes, the read count and normalization analysis and the transcription start site detection. Additionally, ReadXplorer 2 features a highly improved support for large eukaryotic data sets and a command line version, enabling its integration into workflows. Finally, the new version is now able to display any kind of tabular results from other bioinformatics tools. AVAILABILITY AND IMPLEMENTATION http://www.readxplorer.org CONTACT: readxplorer@computational.bio.uni-giessen.deSupplementary information: Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Rolf Hilker
- Bioinformatics and Systems Biology, Faculty of Biology and Chemistry, Justus-Liebig-University, Giessen 35392, Germany
| | - Kai Bernd Stadermann
- Faculty of Biology, Chair of Genome Research, Bielefeld University, Bielefeld 33615, Germany
| | - Oliver Schwengers
- Bioinformatics and Systems Biology, Faculty of Biology and Chemistry, Justus-Liebig-University, Giessen 35392, Germany
| | - Evgeny Anisiforov
- Bioinformatics and Systems Biology, Faculty of Biology and Chemistry, Justus-Liebig-University, Giessen 35392, Germany
| | - Sebastian Jaenicke
- Bioinformatics and Systems Biology, Faculty of Biology and Chemistry, Justus-Liebig-University, Giessen 35392, Germany
| | - Bernd Weisshaar
- Faculty of Biology, Chair of Genome Research, Bielefeld University, Bielefeld 33615, Germany
| | - Tobias Zimmermann
- Bioinformatics and Systems Biology, Faculty of Biology and Chemistry, Justus-Liebig-University, Giessen 35392, Germany
| | - Alexander Goesmann
- Bioinformatics and Systems Biology, Faculty of Biology and Chemistry, Justus-Liebig-University, Giessen 35392, Germany
| |
Collapse
|
29
|
Characterization of a putative NsrR homologue in Streptomyces venezuelae reveals a new member of the Rrf2 superfamily. Sci Rep 2016; 6:31597. [PMID: 27605472 PMCID: PMC5015018 DOI: 10.1038/srep31597] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2016] [Accepted: 07/25/2016] [Indexed: 01/06/2023] Open
Abstract
Members of the Rrf2 superfamily of transcription factors are widespread in bacteria but their functions are largely unexplored. The few that have been characterized in detail sense nitric oxide (NsrR), iron limitation (RirA), cysteine availability (CymR) and the iron sulfur (Fe-S) cluster status of the cell (IscR). In this study we combined ChIP- and dRNA-seq with in vitro biochemistry to characterize a putative NsrR homologue in Streptomyces venezuelae. ChIP-seq analysis revealed that rather than regulating the nitrosative stress response like Streptomyces coelicolor NsrR, Sven6563 binds to a conserved motif at a different, much larger set of genes with a diverse range of functions, including a number of regulators, genes required for glutamine synthesis, NADH/NAD(P)H metabolism, as well as general DNA/RNA and amino acid/protein turn over. Our biochemical experiments further show that Sven6563 has a [2Fe-2S] cluster and that the switch between oxidized and reduced cluster controls its DNA binding activity in vitro. To our knowledge, both the sensing domain and the putative target genes are novel for an Rrf2 protein, suggesting Sven6563 represents a new member of the Rrf2 superfamily. Given the redox sensitivity of its Fe-S cluster we have tentatively named the protein RsrR for Redox sensitive response Regulator.
Collapse
|
30
|
Cohen O, Doron S, Wurtzel O, Dar D, Edelheit S, Karunker I, Mick E, Sorek R. Comparative transcriptomics across the prokaryotic tree of life. Nucleic Acids Res 2016; 44:W46-53. [PMID: 27154273 PMCID: PMC4987935 DOI: 10.1093/nar/gkw394] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2016] [Accepted: 04/28/2016] [Indexed: 12/23/2022] Open
Abstract
Whole-transcriptome sequencing studies from recent years revealed an unexpected complexity in transcriptomes of bacteria and archaea, including abundant non-coding RNAs, cis-antisense transcription and regulatory untranslated regions (UTRs). Understanding the functional relevance of the plethora of non-coding RNAs in a given organism is challenging, especially since some of these RNAs were attributed to ‘transcriptional noise’. To allow the search for conserved transcriptomic elements we produced comparative transcriptome maps for multiple species across the microbial tree of life. These transcriptome maps are detailed in annotations, comparable by gene families, and BLAST-searchable by user provided sequences. Our transcriptome collection includes 18 model organisms spanning 10 phyla/subphyla of bacteria and archaea that were sequenced using standardized RNA-seq methods. The utility of the comparative approach, as implemented in our web server, is demonstrated by highlighting genes with exceptionally long 5′UTRs across species, which correspond to many known riboswitches and further suggest novel putative regulatory elements. Our study provides a standardized reference transcriptome to major clinically and environmentally important microbial phyla. The viewer is available at http://exploration.weizmann.ac.il/TCOL, setting a framework for comparative studies of the microbial non-coding genome.
Collapse
Affiliation(s)
- Ofir Cohen
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA
| | - Shany Doron
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Omri Wurtzel
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
| | - Daniel Dar
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Sarit Edelheit
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Iris Karunker
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Eran Mick
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Rotem Sorek
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel
| |
Collapse
|
31
|
Sass AM, Van Acker H, Förstner KU, Van Nieuwerburgh F, Deforce D, Vogel J, Coenye T. Genome-wide transcription start site profiling in biofilm-grown Burkholderia cenocepacia J2315. BMC Genomics 2015; 16:775. [PMID: 26462475 PMCID: PMC4603805 DOI: 10.1186/s12864-015-1993-3] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2015] [Accepted: 10/06/2015] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND Burkholderia cenocepacia is a soil-dwelling Gram-negative Betaproteobacterium with an important role as opportunistic pathogen in humans. Infections with B. cenocepacia are very difficult to treat due to their high intrinsic resistance to most antibiotics. Biofilm formation further adds to their antibiotic resistance. B. cenocepacia harbours a large, multi-replicon genome with a high GC-content, the reference genome of strain J2315 includes 7374 annotated genes. This study aims to annotate transcription start sites and identify novel transcripts on a whole genome scale. METHODS RNA extracted from B. cenocepacia J2315 biofilms was analysed by differential RNA-sequencing and the resulting dataset compared to data derived from conventional, global RNA-sequencing. Transcription start sites were annotated and further analysed according to their position relative to annotated genes. RESULTS Four thousand ten transcription start sites were mapped over the whole B. cenocepacia genome and the primary transcription start site of 2089 genes expressed in B. cenocepacia biofilms were defined. For 64 genes a start codon alternative to the annotated one was proposed. Substantial antisense transcription for 105 genes and two novel protein coding sequences were identified. The distribution of internal transcription start sites can be used to identify genomic islands in B. cenocepacia. A potassium pump strongly induced only under biofilm conditions was found and 15 non-coding small RNAs highly expressed in biofilms were discovered. CONCLUSIONS Mapping transcription start sites across the B. cenocepacia genome added relevant information to the J2315 annotation. Genes and novel regulatory RNAs putatively involved in B. cenocepacia biofilm formation were identified. These findings will help in understanding regulation of B. cenocepacia biofilm formation.
Collapse
Affiliation(s)
- Andrea M Sass
- Laboratory of Pharmaceutical Microbiology, Ghent University, Ottergemsesteenweg 460, 9000, Ghent, Belgium.
| | - Heleen Van Acker
- Laboratory of Pharmaceutical Microbiology, Ghent University, Ottergemsesteenweg 460, 9000, Ghent, Belgium.
| | - Konrad U Förstner
- Core Unit Systems Medicine, University of Würzburg, Würzburg, Germany.
| | | | - Dieter Deforce
- Laboratory of Pharmaceutical Biotechnology, Ghent University, Ghent, Belgium.
| | - Jörg Vogel
- Institute for Molecular Infection Biology, University of Würzburg, Würzburg, Germany.
| | - Tom Coenye
- Laboratory of Pharmaceutical Microbiology, Ghent University, Ottergemsesteenweg 460, 9000, Ghent, Belgium.
| |
Collapse
|
32
|
Stazic D, Voß B. The complexity of bacterial transcriptomes. J Biotechnol 2015; 232:69-78. [PMID: 26450562 DOI: 10.1016/j.jbiotec.2015.09.041] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2015] [Revised: 09/07/2015] [Accepted: 09/29/2015] [Indexed: 01/09/2023]
Abstract
For eukaryotes there seems to be no doubt that differences on the trancriptomic level substantially contribute to the process of species diversification, whereas for bacteria this is thought to be less important. Recent years saw a significant increase in full transcriptome studies for bacteria, which provided deep insight into the architecture of bacterial transcriptomes. Most notably, it became evident that, in contrast to previous scientific consensus, bacterial transcriptomes are quite complex. There exist a large number of cis-antisense RNAs, non-coding RNAs, overlapping transcripts and RNA elements that regulate transcription, such as riboswitches. Furthermore, processing and degradation of RNA has gained interest, because it has a significant impact on the composition of the transcriptome. In this review, we summarize recent findings and put them into a broader context with respect to the complexity of bacterial transcriptomes and its putative biological meanings.
Collapse
Affiliation(s)
- D Stazic
- University of Freiburg, Faculty of Biology, Computational Transcriptomics, Schänzlestr. 1, 79104 Freiburg, Germany.
| | - B Voß
- University of Freiburg, Faculty of Biology, Computational Transcriptomics, Schänzlestr. 1, 79104 Freiburg, Germany.
| |
Collapse
|
33
|
Bischler T, Tan HS, Nieselt K, Sharma CM. Differential RNA-seq (dRNA-seq) for annotation of transcriptional start sites and small RNAs in Helicobacter pylori. Methods 2015; 86:89-101. [PMID: 26091613 DOI: 10.1016/j.ymeth.2015.06.012] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2015] [Revised: 06/07/2015] [Accepted: 06/09/2015] [Indexed: 12/29/2022] Open
Abstract
The global mapping of transcription boundaries is a key step in the elucidation of the full complement of transcriptional features of an organism. It facilitates the annotation of operons and untranslated regions as well as novel transcripts, including cis- and trans-encoded small RNAs (sRNAs). So called RNA sequencing (RNA-seq) based on deep sequencing of cDNAs has greatly facilitated transcript mapping with single nucleotide resolution. However, conventional RNA-seq approaches typically cannot distinguish between primary and processed transcripts. Here we describe the recently developed differential RNA-seq (dRNA-seq) approach, which facilitates the annotation of transcriptional start sites (TSS) based on deep sequencing of two differentially treated cDNA library pairs, with one library being enriched for primary transcripts. Using the human pathogen Helicobacter pylori as a model organism, we describe the application of dRNA-seq together with an automated TSS annotation approach for generation of a genome-wide TSS map in bacteria. Besides a description of transcriptome and regulatory features that can be identified by this approach, we discuss the impact of different library preparation protocols and sequencing platforms as well as manual and automated TSS annotation. Moreover, we have set up an easily accessible online browser for visualization of the H. pylori transcriptome data from this and our previous H. pylori dRNA-seq study.
Collapse
Affiliation(s)
- Thorsten Bischler
- Research Center for Infectious Diseases (ZINF), University of Würzburg, Josef-Schneider-Str. 2/Bau D15, 97080 Würzburg, Germany
| | - Hock Siew Tan
- Research Center for Infectious Diseases (ZINF), University of Würzburg, Josef-Schneider-Str. 2/Bau D15, 97080 Würzburg, Germany
| | - Kay Nieselt
- Integrative Transcriptomics, ZBIT (Center for Bioinformatics Tübingen), University of Tübingen, Sand 14, D-72076 Tübingen, Germany
| | - Cynthia M Sharma
- Research Center for Infectious Diseases (ZINF), University of Würzburg, Josef-Schneider-Str. 2/Bau D15, 97080 Würzburg, Germany.
| |
Collapse
|
34
|
Nuss AM, Heroven AK, Waldmann B, Reinkensmeier J, Jarek M, Beckstette M, Dersch P. Transcriptomic profiling of Yersinia pseudotuberculosis reveals reprogramming of the Crp regulon by temperature and uncovers Crp as a master regulator of small RNAs. PLoS Genet 2015; 11:e1005087. [PMID: 25816203 PMCID: PMC4376681 DOI: 10.1371/journal.pgen.1005087] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2014] [Accepted: 02/20/2015] [Indexed: 12/20/2022] Open
Abstract
One hallmark of pathogenic yersiniae is their ability to rapidly adjust their life-style and pathogenesis upon host entry. In order to capture the range, magnitude and complexity of the underlying gene control mechanisms we used comparative RNA-seq-based transcriptomic profiling of the enteric pathogen Y. pseudotuberculosis under environmental and infection-relevant conditions. We identified 1151 individual transcription start sites, multiple riboswitch-like RNA elements, and a global set of antisense RNAs and previously unrecognized trans-acting RNAs. Taking advantage of these data, we revealed a temperature-induced and growth phase-dependent reprogramming of a large set of catabolic/energy production genes and uncovered the existence of a thermo-regulated ‘acetate switch’, which appear to prime the bacteria for growth in the digestive tract. To elucidate the regulatory architecture linking nutritional status to virulence we also refined the CRP regulon. We identified a massive remodelling of the CRP-controlled network in response to temperature and discovered CRP as a transcriptional master regulator of numerous conserved and newly identified non-coding RNAs which participate in this process. This finding highlights a novel level of complexity of the regulatory network in which the concerted action of transcriptional regulators and multiple non-coding RNAs under control of CRP adjusts the control of Yersinia fitness and virulence to the requirements of their environmental and virulent life-styles. Many bacterial pathogens cycle between environmental sources and mammalian hosts. Adaptation to the different natural habitats and host niches is achieved through complex regulatory networks which adjust synthesis of the large repertoire of crucial virulence factors and fitness determinants. To uncover underlying control circuits, we determined the first in-depth single-nucleotide resolution transcriptome of Yersinia. This revealed important novel genetic information, such as global locations of transcriptional start sites, non-coding RNAs, potential riboswitches and provided a set of virulence-relevant expression profiles, which constitute a valuable tool for the research community. The analysis further uncovered a temperature-induced global reprogramming of central metabolic functions, likely to support intestinal colonization of the pathogen. This is accompanied by a major reorganization of the CRP regulon, which involves a multitude of regulatory RNAs. The primary consequence is a fine-tuned, coordinated control of metabolism and virulence through a plethora of environmentally controlled regulatory RNAs allowing rapid adaptation and high flexibility during life-style changes.
Collapse
Affiliation(s)
- Aaron M. Nuss
- Department of Molecular Infection Biology, Helmholtz Centre for Infection Research, Braunschweig, Germany
| | - Ann Kathrin Heroven
- Department of Molecular Infection Biology, Helmholtz Centre for Infection Research, Braunschweig, Germany
| | - Barbara Waldmann
- Department of Molecular Infection Biology, Helmholtz Centre for Infection Research, Braunschweig, Germany
| | - Jan Reinkensmeier
- Faculty of Technology and Center for Biotechnology (CeBiTec), Bielefeld University, Germany
| | - Michael Jarek
- Department of Genome Analytics, Helmholtz Centre for Infection Research, Braunschweig, Germany
| | - Michael Beckstette
- Department of Molecular Infection Biology, Helmholtz Centre for Infection Research, Braunschweig, Germany
| | - Petra Dersch
- Department of Molecular Infection Biology, Helmholtz Centre for Infection Research, Braunschweig, Germany
- * E-mail:
| |
Collapse
|
35
|
Wolfinger MT, Fallmann J, Eggenhofer F, Amman F. ViennaNGS: A toolbox for building efficient next- generation sequencing analysis pipelines. F1000Res 2015; 4:50. [PMID: 26236465 PMCID: PMC4513691 DOI: 10.12688/f1000research.6157.2] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 07/10/2015] [Indexed: 11/26/2022] Open
Abstract
Recent achievements in next-generation sequencing (NGS) technologies lead to a high demand for reuseable software components to easily compile customized analysis workflows for big genomics data. We present ViennaNGS, an integrated collection of Perl modules focused on building efficient pipelines for NGS data processing. It comes with functionality for extracting and converting features from common NGS file formats, computation and evaluation of read mapping statistics, as well as normalization of RNA abundance. Moreover, ViennaNGS provides software components for identification and characterization of splice junctions from RNA-seq data, parsing and condensing sequence motif data, automated construction of Assembly and Track Hubs for the UCSC genome browser, as well as wrapper routines for a set of commonly used NGS command line tools.
Collapse
Affiliation(s)
- Michael T Wolfinger
- Institute for Theoretical Chemistry, University of Vienna, Währingerstraße 17, A-1090, Vienna, Austria ; Center for Integrative Bioinformatics Vienna, Max F. Perutz Laboratories, University of Vienna, Medical University of Vienna, Dr. Bohr-Gasse 9, A-1030 Vienna, Austria ; Department of Biochemistry and Molecular Cell Biology, Max F. Perutz Laboratories, University of Vienna, Dr. Bohr-Gasse 9, A-1030 Vienna, Austria
| | - Jörg Fallmann
- Institute for Theoretical Chemistry, University of Vienna, Währingerstraße 17, A-1090, Vienna, Austria
| | - Florian Eggenhofer
- Institute for Theoretical Chemistry, University of Vienna, Währingerstraße 17, A-1090, Vienna, Austria
| | - Fabian Amman
- Institute for Theoretical Chemistry, University of Vienna, Währingerstraße 17, A-1090, Vienna, Austria ; Department of Chromosome Biology, Max F. Perutz Laboratories, University of Vienna, Medical University of Vienna, Dr. Bohr-Gasse 9, A-1030 Vienna, Austria
| |
Collapse
|
36
|
Global transcriptional start site mapping using differential RNA sequencing reveals novel antisense RNAs in Escherichia coli. J Bacteriol 2014; 197:18-28. [PMID: 25266388 DOI: 10.1128/jb.02096-14] [Citation(s) in RCA: 218] [Impact Index Per Article: 21.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
While the model organism Escherichia coli has been the subject of intense study for decades, the full complement of its RNAs is only now being examined. Here we describe a survey of the E. coli transcriptome carried out using a differential RNA sequencing (dRNA-seq) approach, which can distinguish between primary and processed transcripts, and an automated prediction algorithm for transcriptional start sites (TSS). With the criterion of expression under at least one of three growth conditions examined, we predicted 14,868 TSS candidates, including 5,574 internal to annotated genes (iTSS) and 5,495 TSS corresponding to potential antisense RNAs (asRNAs). We examined expression of 14 candidate asRNAs by Northern analysis using RNA from wild-type E. coli and from strains defective for RNases III and E, two RNases reported to be involved in asRNA processing. Interestingly, nine asRNAs detected as distinct bands by Northern analysis were differentially affected by the rnc and rne mutations. We also compared our asRNA candidates with previously published asRNA annotations from RNA-seq data and discuss the challenges associated with these cross-comparisons. Our global transcriptional start site map represents a valuable resource for identification of transcription start sites, promoters, and novel transcripts in E. coli and is easily accessible, together with the cDNA coverage plots, in an online genome browser.
Collapse
|
37
|
Zaramela LS, Vêncio RZN, ten-Caten F, Baliga NS, Koide T. Transcription start site associated RNAs (TSSaRNAs) are ubiquitous in all domains of life. PLoS One 2014; 9:e107680. [PMID: 25238539 PMCID: PMC4169567 DOI: 10.1371/journal.pone.0107680] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2014] [Accepted: 08/18/2014] [Indexed: 01/06/2023] Open
Abstract
A plethora of non-coding RNAs has been discovered using high-resolution transcriptomics tools, indicating that transcriptional and post-transcriptional regulation is much more complex than previously appreciated. Small RNAs associated with transcription start sites of annotated coding regions (TSSaRNAs) are pervasive in both eukaryotes and bacteria. Here, we provide evidence for existence of TSSaRNAs in several archaeal transcriptomes including: Halobacterium salinarum, Pyrococcus furiosus, Methanococcus maripaludis, and Sulfolobus solfataricus. We validated TSSaRNAs from the model archaeon Halobacterium salinarum NRC-1 by deep sequencing two independent small-RNA enriched (RNA-seq) and a primary-transcript enriched (dRNA-seq) strand-specific libraries. We identified 652 transcripts, of which 179 were shown to be primary transcripts (∼7% of the annotated genome). Distinct growth-associated expression patterns between TSSaRNAs and their cognate genes were observed, indicating a possible role in environmental responses that may result from RNA polymerase with varying pausing rhythms. This work shows that TSSaRNAs are ubiquitous across all domains of life.
Collapse
Affiliation(s)
- Livia S. Zaramela
- Department Biochemistry and Immunology, Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil
| | - Ricardo Z. N. Vêncio
- Department of Computing and Mathematics, Faculdade de Filosofia Ciências e Letras de Ribeirão Preto, University of São Paulo, Ribeirão Preto, Brazil
| | - Felipe ten-Caten
- Department Biochemistry and Immunology, Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil
| | - Nitin S. Baliga
- Institute for Systems Biology, Seattle, Washington, United States of America
| | - Tie Koide
- Department Biochemistry and Immunology, Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil
- * E-mail:
| |
Collapse
|
38
|
The primary transcriptome of the marine diazotroph Trichodesmium erythraeum IMS101. Sci Rep 2014; 4:6187. [PMID: 25155278 PMCID: PMC4143802 DOI: 10.1038/srep06187] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2014] [Accepted: 08/04/2014] [Indexed: 01/03/2023] Open
Abstract
Blooms of the dinitrogen-fixing marine cyanobacterium Trichodesmium considerably contribute to new nitrogen inputs into tropical oceans. Intriguingly, only 60% of the Trichodesmium erythraeum IMS101 genome sequence codes for protein, compared with ~85% in other sequenced cyanobacterial genomes. The extensive non-coding genome fraction suggests space for an unusually high number of unidentified, potentially regulatory non-protein-coding RNAs (ncRNAs). To identify the transcribed fraction of the genome, here we present a genome-wide map of transcriptional start sites (TSS) at single nucleotide resolution, revealing the activity of 6,080 promoters. We demonstrate that T. erythraeum has the highest number of actively splicing group II introns and the highest percentage of TSS yielding ncRNAs of any bacterium examined to date. We identified a highly transcribed retroelement that serves as template repeat for the targeted mutation of at least 12 different genes by mutagenic homing. Our findings explain the non-coding portion of the T. erythraeum genome by the transcription of an unusually high number of non-coding transcripts in addition to the known high incidence of transposable elements. We conclude that riboregulation and RNA maturation-dependent processes constitute a major part of the Trichodesmium regulatory apparatus.
Collapse
|
39
|
Sharma CM, Vogel J. Differential RNA-seq: the approach behind and the biological insight gained. Curr Opin Microbiol 2014; 19:97-105. [PMID: 25024085 DOI: 10.1016/j.mib.2014.06.010] [Citation(s) in RCA: 142] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2014] [Revised: 06/15/2014] [Accepted: 06/19/2014] [Indexed: 01/14/2023]
Abstract
RNA-sequencing has revolutionized the quantitative and qualitative analysis of transcriptomes in both prokaryotes and eukaryotes. It provides a generic approach for gene expression profiling, annotation of transcript boundaries and operons, as well as identifying novel transcripts including small noncoding RNA molecules and antisense RNAs. We recently developed a differential RNA-seq (dRNA-seq) method which in addition to the above, yields information as to whether a given RNA is a primary or processed transcript. Originally applied to describe the primary transcriptome of the gastric pathogen Helicobacter pylori, dRNA-seq has since provided global maps of transcriptional start sites in diverse species, informed new biology in the CRISPR-Cas9 system, advanced to a tool for comparative transcriptomics, and inspired simultaneous RNA-seq of pathogen and host.
Collapse
Affiliation(s)
- Cynthia M Sharma
- University of Würzburg, Institute for Molecular Infection Biology & Research Center for Infectious Diseases, Josef-Schneider-Straße 2/D15, D-97080 Würzburg, Germany.
| | - Jörg Vogel
- University of Würzburg, Institute for Molecular Infection Biology & Research Center for Infectious Diseases, Josef-Schneider-Straße 2/D15, D-97080 Würzburg, Germany.
| |
Collapse
|
40
|
Wright PR, Georg J, Mann M, Sorescu DA, Richter AS, Lott S, Kleinkauf R, Hess WR, Backofen R. CopraRNA and IntaRNA: predicting small RNA targets, networks and interaction domains. Nucleic Acids Res 2014; 42:W119-23. [PMID: 24838564 PMCID: PMC4086077 DOI: 10.1093/nar/gku359] [Citation(s) in RCA: 249] [Impact Index Per Article: 24.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
CopraRNA (Comparative prediction algorithm for small RNA targets) is the most recent asset to the Freiburg RNA Tools webserver. It incorporates and extends the functionality of the existing tool IntaRNA (Interacting RNAs) in order to predict targets, interaction domains and consequently the regulatory networks of bacterial small RNA molecules. The CopraRNA prediction results are accompanied by extensive postprocessing methods such as functional enrichment analysis and visualization of interacting regions. Here, we introduce the functionality of the CopraRNA and IntaRNA webservers and give detailed explanations on their postprocessing functionalities. Both tools are freely accessible at http://rna.informatik.uni-freiburg.de.
Collapse
Affiliation(s)
- Patrick R Wright
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, Georges-Köhler-Allee 106, D-79110 Freiburg, Germany Genetics and Experimental Bioinformatics, Faculty of Biology, Schänzlestr. 1, D-79104 Freiburg, Germany
| | - Jens Georg
- Genetics and Experimental Bioinformatics, Faculty of Biology, Schänzlestr. 1, D-79104 Freiburg, Germany
| | - Martin Mann
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, Georges-Köhler-Allee 106, D-79110 Freiburg, Germany
| | - Dragos A Sorescu
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, Georges-Köhler-Allee 106, D-79110 Freiburg, Germany
| | - Andreas S Richter
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, Georges-Köhler-Allee 106, D-79110 Freiburg, Germany Max Planck Institute of Immunobiology and Epigenetics, Stübeweg 51, D-79108 Freiburg, Germany
| | - Steffen Lott
- Genetics and Experimental Bioinformatics, Faculty of Biology, Schänzlestr. 1, D-79104 Freiburg, Germany
| | - Robert Kleinkauf
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, Georges-Köhler-Allee 106, D-79110 Freiburg, Germany
| | - Wolfgang R Hess
- Genetics and Experimental Bioinformatics, Faculty of Biology, Schänzlestr. 1, D-79104 Freiburg, Germany
| | - Rolf Backofen
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, Georges-Köhler-Allee 106, D-79110 Freiburg, Germany BIOSS Centre for Biological Signalling Studies, Cluster of Excellence, Albert-Ludwigs-University Freiburg, Germany Center for non-coding RNA in Technology and Health, University of Copenhagen, Gronnegardsvej 3, DK-1870 Frederiksberg C, Denmark ZBSA Centre for Biological Systems Analysis, Albert-Ludwigs-University Freiburg, Habsburgerstr. 49, D-79104 Freiburg, Germany
| |
Collapse
|
41
|
Backofen R, Amman F, Costa F, Findeiß S, Richter AS, Stadler PF. Bioinformatics of prokaryotic RNAs. RNA Biol 2014; 11:470-83. [PMID: 24755880 PMCID: PMC4152356 DOI: 10.4161/rna.28647] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2014] [Revised: 03/17/2014] [Accepted: 03/25/2014] [Indexed: 02/02/2023] Open
Abstract
The genome of most prokaryotes gives rise to surprisingly complex transcriptomes, comprising not only protein-coding mRNAs, often organized as operons, but also harbors dozens or even hundreds of highly structured small regulatory RNAs and unexpectedly large levels of anti-sense transcripts. Comprehensive surveys of prokaryotic transcriptomes and the need to characterize also their non-coding components is heavily dependent on computational methods and workflows, many of which have been developed or at least adapted specifically for the use with bacterial and archaeal data. This review provides an overview on the state-of-the-art of RNA bioinformatics focusing on applications to prokaryotes.
Collapse
Affiliation(s)
- Rolf Backofen
- Bioinformatics Group; Department of Computer Science; University of Freiburg; Georges-Köhler-Allee 106; D-79110 Freiburg, Germany
- Center for non-coding RNA in Technology and Health; University of Copenhagen; Grønnegårdsvej 3; DK-1870 Frederiksberg C, Denmark
| | - Fabian Amman
- Institute for Theoretical Chemistry; University of Vienna; Währingerstraße 17; A-1090 Wien, Austria
- Bioinformatics Group; Department of Computer Science, and Interdisciplinary Center for Bioinformatics; University of Leipzig; Härtelstraße 16-18; D-04107 Leipzig, Germany
| | - Fabrizio Costa
- Bioinformatics Group; Department of Computer Science; University of Freiburg; Georges-Köhler-Allee 106; D-79110 Freiburg, Germany
| | - Sven Findeiß
- Institute for Theoretical Chemistry; University of Vienna; Währingerstraße 17; A-1090 Wien, Austria
- Bioinformatics and Computational Biology Research Group; University of Vienna; Währingerstraße 29; A-1090 Wien, Austria
| | - Andreas S Richter
- Bioinformatics Group; Department of Computer Science; University of Freiburg; Georges-Köhler-Allee 106; D-79110 Freiburg, Germany
- Max Planck Institute of Immunobiology and Epigenetics; Stübeweg 51; D-79108 Freiburg, Germany
| | - Peter F Stadler
- Center for non-coding RNA in Technology and Health; University of Copenhagen; Grønnegårdsvej 3; DK-1870 Frederiksberg C, Denmark
- Institute for Theoretical Chemistry; University of Vienna; Währingerstraße 17; A-1090 Wien, Austria
- Bioinformatics Group; Department of Computer Science, and Interdisciplinary Center for Bioinformatics; University of Leipzig; Härtelstraße 16-18; D-04107 Leipzig, Germany
- Max Planck Institute for Mathematics in the Sciences; Inselstraße 22; D-04103 Leipzig, Germany
- Fraunhofer Institute for Cell Therapy and Immunology – IZI; Perlickstraße 1; D-04103 Leipzig, Germany
- Santa Fe Institute; Santa Fe, NM USA
| |
Collapse
|