1
|
Ibrahim AGAER, Vêncio RZN, Lorenzetti APR, Koide T. Halobacterium salinarum and Haloferax volcanii Comparative Transcriptomics Reveals Conserved Transcriptional Processing Sites. Genes (Basel) 2021; 12:genes12071018. [PMID: 34209065 PMCID: PMC8303175 DOI: 10.3390/genes12071018] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Revised: 05/25/2021] [Accepted: 05/27/2021] [Indexed: 01/15/2023] Open
Abstract
Post-transcriptional processing of messenger RNA is an important regulatory strategy that allows relatively fast responses to changes in environmental conditions. In halophile systems biology, the protein perspective of this problem (i.e., ribonucleases which implement the cleavages) is generally more studied than the RNA perspective (i.e., processing sites). In the present in silico work, we mapped genome-wide transcriptional processing sites (TPS) in two halophilic model organisms, Halobacterium salinarum NRC-1 and Haloferax volcanii DS2. TPS were established by reanalysis of publicly available differential RNA-seq (dRNA-seq) data, searching for non-primary (monophosphorylated RNAs) enrichment. We found 2093 TPS in 43% of H. salinarum genes and 3515 TPS in 49% of H. volcanii chromosomal genes. Of the 244 conserved TPS sites found, the majority were located around start and stop codons of orthologous genes. Specific genes are highlighted when discussing antisense, ribosome and insertion sequence associated TPS. Examples include the cell division gene ftsZ2, whose differential processing signal along growth was detected and correlated with post-transcriptional regulation, and biogenesis of sense overlapping transcripts associated with IS200/IS605. We hereby present the comparative, transcriptomics-based processing site maps with a companion browsing interface.
Collapse
Affiliation(s)
- Amr Galal Abd El-Raheem Ibrahim
- Department of Computation and Mathematics, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto 14040-900, Brazil; (A.G.A.E.-R.I.); (R.Z.N.V.)
| | - Ricardo Z. N. Vêncio
- Department of Computation and Mathematics, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto 14040-900, Brazil; (A.G.A.E.-R.I.); (R.Z.N.V.)
| | - Alan P. R. Lorenzetti
- Department of Biochemistry and Immunology, Ribeirão Preto Medical School, Universidade de São Paulo, Ribeirão Preto 14040-900, Brazil;
| | - Tie Koide
- Department of Biochemistry and Immunology, Ribeirão Preto Medical School, Universidade de São Paulo, Ribeirão Preto 14040-900, Brazil;
- Correspondence: ; Tel.: +55-16-3315-3107
| |
Collapse
|
2
|
Kim S, Kim Y, Suh DH, Lee CH, Yoo SM, Lee SY, Yoon SH. Heat-responsive and time-resolved transcriptome and metabolome analyses of Escherichia coli uncover thermo-tolerant mechanisms. Sci Rep 2020; 10:17715. [PMID: 33077799 PMCID: PMC7572479 DOI: 10.1038/s41598-020-74606-8] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2020] [Accepted: 10/05/2020] [Indexed: 11/27/2022] Open
Abstract
Current understanding of heat shock response has been complicated by the fact that heat stress is inevitably accompanied by changes in specific growth rates and growth stages. In this study, a chemostat culture was successfully performed to avoid the physico-chemical and biological changes that accompany heatshock, which provided a unique opportunity to investigate the full range of cellular responses to thermal stress, ranging from temporary adjustment to phenotypic adaptation at multi-omics levels. Heat-responsive and time-resolved changes in the transcriptome and metabolome of a widely used E. coli strain BL21(DE3) were explored in which the temperature was upshifted from 37 to 42 °C. Omics profiles were categorized into early (2 and 10 min), middle (0.5, 1, and 2 h), and late (4, 8, and 40 h) stages of heat stress, each of which reflected the initiation, adaptation, and phenotypic plasticity steps of the stress response. The continued heat stress modulated global gene expression by controlling the expression levels of sigma factors in different time frames, including unexpected downregulation of the second heatshock sigma factor gene (rpoE) upon the heat stress. Trehalose, cadaverine, and enterobactin showed increased production to deal with the heat-induced oxidative stress. Genes highly expressed at the late stage were experimentally validated to provide thermotolerance. Intriguingly, a cryptic capsular gene cluster showed considerably high expression level only at the late stage, and its expression was essential for cell growth at high temperature. Granule-forming and elongated cells were observed at the late stage, which was morphological plasticity occurred as a result of acclimation to the continued heat stress. Whole process of thermal adaptation along with the genetic and metabolic changes at fine temporal resolution will contribute to far-reaching comprehension of the heat shock response. Further, the identified thermotolerant genes will be useful to rationally engineer thermotolerant microorganisms.
Collapse
Affiliation(s)
- Sinyeon Kim
- Department of Bioscience and Biotechnology, Konkuk University, Seoul, 05029, Republic of Korea
| | - Youngshin Kim
- Department of Bioscience and Biotechnology, Konkuk University, Seoul, 05029, Republic of Korea
| | - Dong Ho Suh
- Department of Bioscience and Biotechnology, Konkuk University, Seoul, 05029, Republic of Korea
| | - Choong Hwan Lee
- Department of Bioscience and Biotechnology, Konkuk University, Seoul, 05029, Republic of Korea
| | - Seung Min Yoo
- School of Integrative Engineering, Chung-Ang University, Seoul, 06974, Republic of Korea
| | - Sang Yup Lee
- Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 Plus Program), BioProcess Engineering Research Center, Center for Systems and Synthetic Biotechnology, Institute for the BioCentury, KAIST, Daejeon, 34141, Republic of Korea
| | - Sung Ho Yoon
- Department of Bioscience and Biotechnology, Konkuk University, Seoul, 05029, Republic of Korea.
| |
Collapse
|
3
|
The Primary Antisense Transcriptome of Halobacterium salinarum NRC-1. Genes (Basel) 2019; 10:genes10040280. [PMID: 30959844 PMCID: PMC6523106 DOI: 10.3390/genes10040280] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2019] [Revised: 04/01/2019] [Accepted: 04/01/2019] [Indexed: 12/17/2022] Open
Abstract
Antisense RNAs (asRNAs) are present in diverse organisms and play important roles in gene regulation. In this work, we mapped the primary antisense transcriptome in the halophilic archaeon Halobacterium salinarum NRC-1. By reanalyzing publicly available data, we mapped antisense transcription start sites (aTSSs) and inferred the probable 3′ ends of these transcripts. We analyzed the resulting asRNAs according to the size, location, function of genes on the opposite strand, expression levels and conservation. We show that at least 21% of the genes contain asRNAs in H. salinarum. Most of these asRNAs are expressed at low levels. They are located antisense to genes related to distinctive characteristics of H. salinarum, such as bacteriorhodopsin, gas vesicles, transposases and other important biological processes such as translation. We provide evidence to support asRNAs in type II toxin–antitoxin systems in archaea. We also analyzed public Ribosome profiling (Ribo-seq) data and found that ~10% of the asRNAs are ribosome-associated non-coding RNAs (rancRNAs), with asRNAs from transposases overrepresented. Using a comparative transcriptomics approach, we found that ~19% of the asRNAs annotated in H. salinarum belong to genes with an ortholog in Haloferax volcanii, in which an aTSS could be identified with positional equivalence. This shows that most asRNAs are not conserved between these halophilic archaea.
Collapse
|
4
|
Ten-Caten F, Vêncio RZN, Lorenzetti APR, Zaramela LS, Santana AC, Koide T. Internal RNAs overlapping coding sequences can drive the production of alternative proteins in archaea. RNA Biol 2018; 15:1119-1132. [PMID: 30175688 PMCID: PMC6161675 DOI: 10.1080/15476286.2018.1509661] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
Abstract
Prokaryotic genomes show a high level of information compaction often with different molecules transcribed from the same locus. Although antisense RNAs have been relatively well studied, RNAs in the same strand, internal RNAs (intraRNAs), are still poorly understood. The question of how common is the translation of overlapping reading frames remains open. We address this question in the model archaeon Halobacterium salinarum. In the present work we used differential RNA-seq (dRNA-seq) in H. salinarum NRC-1 to locate intraRNA signals in subsets of internal transcription start sites (iTSS) and establish the open reading frames associated to them (intraORFs). Using C-terminally flagged proteins, we experimentally observed isoforms accurately predicted by intraRNA translation for kef1, acs3 and orc4 genes. We also recovered from the literature and mass spectrometry databases several instances of protein isoforms consistent with intraRNA translation such as the gas vesicle protein gene gvpC1. We found evidence for intraRNAs in horizontally transferred genes such as the chaperone dnaK and the aerobic respiration related cydA in both H. salinarum and Escherichia coli. Also, intraRNA translation evidence in H. salinarum, E. coli and yeast of a universal elongation factor (aEF-2, fusA and eEF-2) suggests that this is an ancient phenomenon present in all domains of life.
Collapse
Affiliation(s)
- Felipe Ten-Caten
- a Department of Biochemistry and Immunology , Ribeirão Preto Medical School, University of São Paulo , Ribeirão Preto , Brazil
| | - Ricardo Z N Vêncio
- b Department of Computation and Mathematics, Faculdade de Filosofia , Ciências e Letras de Ribeirão Preto, University of São Paulo , Ribeirão Preto , Brazil
| | - Alan Péricles R Lorenzetti
- a Department of Biochemistry and Immunology , Ribeirão Preto Medical School, University of São Paulo , Ribeirão Preto , Brazil
| | - Livia Soares Zaramela
- a Department of Biochemistry and Immunology , Ribeirão Preto Medical School, University of São Paulo , Ribeirão Preto , Brazil
| | - Ana Carolina Santana
- c Department of Cell and Molecular Biology and Pathogenic Bioagents , Ribeirão Preto Medical School, University of São Paulo , Ribeirão Preto , Brazil
| | - Tie Koide
- a Department of Biochemistry and Immunology , Ribeirão Preto Medical School, University of São Paulo , Ribeirão Preto , Brazil
| |
Collapse
|
5
|
Kim S, Jeong H, Kim EY, Kim JF, Lee SY, Yoon SH. Genomic and transcriptomic landscape of Escherichia coli BL21(DE3). Nucleic Acids Res 2017; 45:5285-5293. [PMID: 28379538 PMCID: PMC5435950 DOI: 10.1093/nar/gkx228] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2017] [Accepted: 03/26/2017] [Indexed: 11/23/2022] Open
Abstract
Escherichia coli BL21(DE3) has long served as a model organism for scientific research, as well as a workhorse for biotechnology. Here we present the most current genome annotation of E. coli BL21(DE3) based on the transcriptome structure of the strain that was determined for the first time. The genome was annotated using multiple automated pipelines and compared to the current genome annotation of the closely related strain, E. coli K-12. High-resolution tiling array data of E. coli BL21(DE3) from several different stages of cell growth in rich and minimal media were analyzed to characterize the transcriptome structure and to provide supporting evidence for open reading frames. This new integrated analysis of the genomic and transcriptomic structure of E. coli BL21(DE3) has led to the correction of translation initiation sites for 88 coding DNA sequences and provided updated information for most genes. Additionally, 37 putative genes and 66 putative non-coding RNAs were also identified. The panoramic landscape of the genome and transcriptome of E. coli BL21(DE3) revealed here will allow us to better understand the fundamental biology of the strain and also advance biotechnological applications in industry.
Collapse
Affiliation(s)
- Sinyeon Kim
- Department of Bioscience and Biotechnology, Konkuk University, Seoul 05029, Republic of Korea
| | - Haeyoung Jeong
- Infectious Disease Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon 34141, Republic of Korea
| | - Eun-Youn Kim
- School of Basic Sciences, Hanbat National University, Daejeon 34158, Republic of Korea
| | - Jihyun F Kim
- Department of Systems Biology and Division of Life Sciences, Yonsei University, Seoul 03722, Republic of Korea
| | - Sang Yup Lee
- Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 Plus Program), BioProcess Engineering Research Center, Center for Systems and Synthetic Biotechnology, and Institute for the BioCentury, KAIST, Daejeon 34141, Republic of Korea
| | - Sung Ho Yoon
- Department of Bioscience and Biotechnology, Konkuk University, Seoul 05029, Republic of Korea
| |
Collapse
|
6
|
Lin G, Chai J, Yuan S, Mai C, Cai L, Murphy RW, Zhou W, Luo J. VennPainter: A Tool for the Comparison and Identification of Candidate Genes Based on Venn Diagrams. PLoS One 2016; 11:e0154315. [PMID: 27120465 PMCID: PMC4847855 DOI: 10.1371/journal.pone.0154315] [Citation(s) in RCA: 54] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2015] [Accepted: 04/12/2016] [Indexed: 12/21/2022] Open
Abstract
VennPainter is a program for depicting unique and shared sets of genes lists and generating Venn diagrams, by using the Qt C++ framework. The software produces Classic Venn, Edwards’ Venn and Nested Venn diagrams and allows for eight sets in a graph mode and 31 sets in data processing mode only. In comparison, previous programs produce Classic Venn and Edwards’ Venn diagrams and allow for a maximum of six sets. The software incorporates user-friendly features and works in Windows, Linux and Mac OS. Its graphical interface does not require a user to have programing skills. Users can modify diagram content for up to eight datasets because of the Scalable Vector Graphics output. VennPainter can provide output results in vertical, horizontal and matrix formats, which facilitates sharing datasets as required for further identification of candidate genes. Users can obtain gene lists from shared sets by clicking the numbers on the diagram. Thus, VennPainter is an easy-to-use, highly efficient, cross-platform and powerful program that provides a more comprehensive tool for identifying candidate genes and visualizing the relationships among genes or gene families in comparative analysis.
Collapse
Affiliation(s)
- Guoliang Lin
- Key Laboratory for Animal Genetic Diversity and Evolution of High Education in Yunnan Province, School of Life Sciences, Yunnan University, Kunming, 650091, China
- State Key Laboratory for Conservation and Utilization of Bio-resource, Yunnan University, Kunming, 650091, Yunnan, China
| | - Jing Chai
- Key Laboratory for Animal Genetic Diversity and Evolution of High Education in Yunnan Province, School of Life Sciences, Yunnan University, Kunming, 650091, China
- State Key Laboratory for Conservation and Utilization of Bio-resource, Yunnan University, Kunming, 650091, Yunnan, China
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, The Chinese Academy of Sciences, Kunming, 650223, Yunnan, China
- Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, 650000, China
| | - Shuo Yuan
- School of Software, Yunnan University, Kunming, 650091, Yunnan, China
| | - Chao Mai
- School of Software, Yunnan University, Kunming, 650091, Yunnan, China
| | - Li Cai
- School of Software, Yunnan University, Kunming, 650091, Yunnan, China
- School of Computer and Science, Fudan University, Shanghai, 200433, China
| | - Robert W. Murphy
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, The Chinese Academy of Sciences, Kunming, 650223, Yunnan, China
- Centre for Biodiversity and Conservation Biology, Royal Ontario Museum, Toronto, M5S 2C6, Canada
| | - Wei Zhou
- School of Software, Yunnan University, Kunming, 650091, Yunnan, China
- * E-mail: (WZ); (JL)
| | - Jing Luo
- Key Laboratory for Animal Genetic Diversity and Evolution of High Education in Yunnan Province, School of Life Sciences, Yunnan University, Kunming, 650091, China
- State Key Laboratory for Conservation and Utilization of Bio-resource, Yunnan University, Kunming, 650091, Yunnan, China
- * E-mail: (WZ); (JL)
| |
Collapse
|
7
|
Gomes-Filho JV, Zaramela LS, Italiani VCDS, Baliga NS, Vêncio RZN, Koide T. Sense overlapping transcripts in IS1341-type transposase genes are functional non-coding RNAs in archaea. RNA Biol 2016; 12:490-500. [PMID: 25806405 PMCID: PMC4615843 DOI: 10.1080/15476286.2015.1019998] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
The existence of sense overlapping transcripts that share regulatory and coding information in the same genomic sequence shows an additional level of prokaryotic gene expression complexity. Here we report the discovery of ncRNAs associated with IS1341-type transposase (tnpB) genes, at the 3'-end of such elements, with examples in archaea and bacteria. Focusing on the model haloarchaeon Halobacterium salinarum NRC-1, we show the existence of sense overlapping transcripts (sotRNAs) for all its IS1341-type transposases. Publicly available transcriptome compendium show condition-dependent differential regulation between sotRNAs and their cognate genes. These sotRNAs allowed us to find a UUCA tetraloop motif that is present in other archaea (ncRNA family HgcC) and in a H. salinarum intergenic ncRNA derived from a palindrome associated transposable elements (PATE). Overexpression of one sotRNA and the PATE-derived RNA harboring the tetraloop motif improved H. salinarum growth, indicating that these ncRNAs are functional.
Collapse
Affiliation(s)
- José Vicente Gomes-Filho
- a Department of Biochemistry and Immunology ; Ribeirão Preto Medical School ; University of São Paulo ; Ribeirão Preto , Brazil
| | | | | | | | | | | |
Collapse
|
8
|
Rudrappa D, Yao AI, White D, Pavlik BJ, Singh R, Facciotti MT, Blum P. Identification of an archaeal mercury regulon by chromatin immunoprecipitation. MICROBIOLOGY-SGM 2015; 161:2423-33. [PMID: 26408318 DOI: 10.1099/mic.0.000189] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Abstract
Mercury is a heavy metal and toxic to all forms of life. Metal exposure can invoke a response to improve survival. In archaea, several components of a mercury response system have been identified, but it is not known whether metal transport is a member of this system. To identify such missing components, a peptide-tagged MerR transcription factor was used to localize enriched chromosome regions by chromosome immunoprecipitation combined with DNA sequence analysis. Such regions could serve as secondary regulatory binding sites to control the expression of additional genes associated with mercury detoxification. Among the 31 highly enriched loci, a subset of five was pursued as potential candidates based on their current annotations. Quantitative reverse transcription-PCR analysis of these regions with and without mercury treatment in WT and mutant strains lacking merR indicated significant regulatory responses under these conditions. Of these, a Family 5 extracellular solute-binding protein and the MarR transcription factor shown previously to control responses to oxidation were most strongly affected. Inactivation of the solute-binding protein by gene disruption increased the resistance of mutant cells to mercury challenge. Inductively coupled plasma-MS analysis of the mutant cell line following metal challenge indicated there was less intracellular mercury compared with the isogenic WT strain. Together, these regulated genes comprise new members of the archaeal MerR regulon and reveal a cascade of transcriptional control not previously demonstrated in this model organism.
Collapse
Affiliation(s)
- Deepak Rudrappa
- 1 School of Biological Sciences, University of Nebraska-Lincoln, Lincoln, Nebraska, USA
| | - Andrew I Yao
- 3 Department of Biomedical Engineering and Genome Center, University of California-Davis, Davis, California, USA
| | - Derrick White
- 1 School of Biological Sciences, University of Nebraska-Lincoln, Lincoln, Nebraska, USA
| | - Benjamin J Pavlik
- 2 Department of Chemical and Biomolecular Engineering, University of Nebraska-Lincoln, Lincoln, Nebraska, USA
| | - Raghuveer Singh
- 1 School of Biological Sciences, University of Nebraska-Lincoln, Lincoln, Nebraska, USA
| | - Marc T Facciotti
- 3 Department of Biomedical Engineering and Genome Center, University of California-Davis, Davis, California, USA
| | - Paul Blum
- 1 School of Biological Sciences, University of Nebraska-Lincoln, Lincoln, Nebraska, USA
| |
Collapse
|
9
|
Persinoti GF, de Aguiar Peres NT, Jacob TR, Rossi A, Vêncio RZ, Martinez-Rossi NM. RNA-sequencing analysis of Trichophyton rubrum transcriptome in response to sublethal doses of acriflavine. BMC Genomics 2014; 15 Suppl 7:S1. [PMID: 25573029 PMCID: PMC4243288 DOI: 10.1186/1471-2164-15-s7-s1] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND The dermatophyte Trichophyton rubrum is an anthropophilic filamentous fungus that infects keratinized tissues and is the most common etiologic agent isolated in human dermatophytoses. The clinical treatment of these infections is challenging because only few antifungal drugs are commercially available. To understand the mode of action of cytotoxic drugs against fungi, we evaluated the time-dependent effects of acriflavine on T. rubrum transcriptome using high-throughput RNA-sequencing (RNA-seq) technology. RESULTS RNA-seq analysis generated approximately 200 million short reads that were mapped to the Broad Institute's Dermatophyte Comparative Database before differential gene expression analysis was performed. By employing a stringent cut-off threshold of -1.5 and 1.5 log₂-fold changes in gene expression, a subset of 490 unique genes were found to be modulated in T. rubrum in response to acriflavine exposure. Among the selected genes, 69 genes were modulated at all exposure time points. Functional categorization indicated the putative involvement of these genes in various cellular processes such as oxidation-reduction reaction, transmembrane transport, and metal ion binding. Interestingly, genes putatively involved in the pathogenicity of dermatophytoses were down-regulated suggesting that this drug interferes with the virulence of T. rubrum. Moreover, we identified 159 novel putative transcripts in intergenic regions and two transcripts in intron regions of T. rubrum genome. CONCLUSION The results provide insights into the molecular events underlying the stress responses of T. rubrum to acriflavine, revealing that this drug interfered with important molecular events involved in the establishment and maintenance of fungal infection in the host. In addition, the identification of novel transcripts will further enable the improvement of gene annotation and open reading frame prediction of T. rubrum and other dermatophyte genomes.
Collapse
|
10
|
Zaramela LS, Vêncio RZN, ten-Caten F, Baliga NS, Koide T. Transcription start site associated RNAs (TSSaRNAs) are ubiquitous in all domains of life. PLoS One 2014; 9:e107680. [PMID: 25238539 PMCID: PMC4169567 DOI: 10.1371/journal.pone.0107680] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2014] [Accepted: 08/18/2014] [Indexed: 01/06/2023] Open
Abstract
A plethora of non-coding RNAs has been discovered using high-resolution transcriptomics tools, indicating that transcriptional and post-transcriptional regulation is much more complex than previously appreciated. Small RNAs associated with transcription start sites of annotated coding regions (TSSaRNAs) are pervasive in both eukaryotes and bacteria. Here, we provide evidence for existence of TSSaRNAs in several archaeal transcriptomes including: Halobacterium salinarum, Pyrococcus furiosus, Methanococcus maripaludis, and Sulfolobus solfataricus. We validated TSSaRNAs from the model archaeon Halobacterium salinarum NRC-1 by deep sequencing two independent small-RNA enriched (RNA-seq) and a primary-transcript enriched (dRNA-seq) strand-specific libraries. We identified 652 transcripts, of which 179 were shown to be primary transcripts (∼7% of the annotated genome). Distinct growth-associated expression patterns between TSSaRNAs and their cognate genes were observed, indicating a possible role in environmental responses that may result from RNA polymerase with varying pausing rhythms. This work shows that TSSaRNAs are ubiquitous across all domains of life.
Collapse
Affiliation(s)
- Livia S. Zaramela
- Department Biochemistry and Immunology, Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil
| | - Ricardo Z. N. Vêncio
- Department of Computing and Mathematics, Faculdade de Filosofia Ciências e Letras de Ribeirão Preto, University of São Paulo, Ribeirão Preto, Brazil
| | - Felipe ten-Caten
- Department Biochemistry and Immunology, Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil
| | - Nitin S. Baliga
- Institute for Systems Biology, Seattle, Washington, United States of America
| | - Tie Koide
- Department Biochemistry and Immunology, Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, Brazil
- * E-mail:
| |
Collapse
|
11
|
Mangan ME, Williams JM, Kuhn RM, Lathe WC. The UCSC Genome Browser: What Every Molecular Biologist Should Know. ACTA ACUST UNITED AC 2014; 107:19.9.1-19.9.36. [PMID: 24984850 DOI: 10.1002/0471142727.mb1909s107] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Electronic data resources can enable molecular biologists to quickly get information from around the world that a decade ago would have been buried in papers scattered throughout the library. The ability to access, query, and display these data makes benchwork much more efficient and drives new discoveries. Increasingly, mastery of software resources and corresponding data repositories is required to fully explore the volume of data generated in biomedical and agricultural research, because only small amounts of data are actually found in traditional publications. The UCSC Genome Browser provides a wealth of data and tools that advance understanding of genomic context for many species, enable detailed analysis of data, and provide the ability to interrogate regions of interest across disparate data sets from a wide variety of sources. Researchers can also supplement the standard display with their own data to query and share this with others. Effective use of these resources has become crucial to biological research today, and this unit describes some practical applications of the UCSC Genome Browser.
Collapse
|
12
|
Mader M, Simon R, Kurtz S. FISH Oracle 2: a web server for integrative visualization of genomic data in cancer research. J Clin Bioinforma 2014; 4:5. [PMID: 24684958 PMCID: PMC4230720 DOI: 10.1186/2043-9113-4-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2013] [Accepted: 03/26/2014] [Indexed: 01/21/2023] Open
Abstract
Background A comprehensive view on all relevant genomic data is instrumental for understanding the complex patterns of molecular alterations typically found in cancer cells. One of the most effective ways to rapidly obtain an overview of genomic alterations in large amounts of genomic data is the integrative visualization of genomic events. Results We developed FISH Oracle 2, a web server for the interactive visualization of different kinds of downstream processed genomics data typically available in cancer research. A powerful search interface and a fast visualization engine provide a highly interactive visualization for such data. High quality image export enables the life scientist to easily communicate their results. A comprehensive data administration allows to keep track of the available data sets. We applied FISH Oracle 2 to published data and found evidence that, in colorectal cancer cells, the gene TTC28 may be inactivated in two different ways, a fact that has not been published before. Conclusions The interactive nature of FISH Oracle 2 and the possibility to store, select and visualize large amounts of downstream processed data support life scientists in generating hypotheses. The export of high quality images supports explanatory data visualization, simplifying the communication of new biological findings. A FISH Oracle 2 demo server and the software is available at
http://www.zbh.uni-hamburg.de/fishoracle.
Collapse
Affiliation(s)
| | | | - Stefan Kurtz
- Center for Bioinformatics, University of Hamburg, Bundesstrasse 43, 20146 Hamburg, Germany.
| |
Collapse
|
13
|
Pitassi C, Gonçalves AA, Moreno Júnior VDA. [Factors affecting the adoption of ICT tools in experiments with bioinformatics in biopharmaceutical organizations: a case study in the Brazilian Cancer Institute]. CIENCIA & SAUDE COLETIVA 2014; 19:257-68. [PMID: 24473622 DOI: 10.1590/1413-81232014191.2007] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2012] [Accepted: 10/16/2012] [Indexed: 11/22/2022] Open
Abstract
The scope of this article is to identify and analyze the factors that influence the adoption of ICT tools in experiments with bioinformatics at the Brazilian Cancer Institute (INCA). It involves a descriptive and exploratory qualitative field study. Evidence was collected mainly based on in-depth interviews with the management team at the Research Center and the IT Division. The answers were analyzed using the categorical content method. The categories were selected from the scientific literature and consolidated in the Technology-Organization-Environment (TOE) framework created for this study. The model proposed made it possible to demonstrate how the factors selected impacted INCA´s adoption of bioinformatics systems and tools, contributing to the investigation of two critical areas for the development of the health industry in Brazil, namely technological innovation and bioinformatics. Based on the evidence collected, a research question was posed: to what extent can the alignment of the factors related to the adoption of ICT tools in experiments with bioinformatics increase the innovation capacity of a Brazilian biopharmaceutical organization?
Collapse
|
14
|
Seitzer P, Huynh TA, Facciotti MT. JContextExplorer: a tree-based approach to facilitate cross-species genomic context comparison. BMC Bioinformatics 2013; 14:18. [PMID: 23324080 PMCID: PMC3560190 DOI: 10.1186/1471-2105-14-18] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2012] [Accepted: 12/21/2012] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Cross-species comparisons of gene neighborhoods (also called genomic contexts) in microbes may provide insight into determining functionally related or co-regulated sets of genes, suggest annotations of previously un-annotated genes, and help to identify horizontal gene transfer events across microbial species. Existing tools to investigate genomic contexts, however, lack features for dynamically comparing and exploring genomic regions from multiple species. As DNA sequencing technologies improve and the number of whole sequenced microbial genomes increases, a user-friendly genome context comparison platform designed for use by a broad range of users promises to satisfy a growing need in the biological community. RESULTS Here we present JContextExplorer: a tool that organizes genomic contexts into branching diagrams. We implement several alternative context-comparison and tree rendering algorithms, and allow for easy transitioning between different clustering algorithms. To facilitate genomic context analysis, our tool implements GUI features, such as text search filtering, point-and-click interrogation of individual contexts, and genomic visualization via a multi-genome browser. We demonstrate a use case of our tool by attempting to resolve annotation ambiguities between two highly homologous yet functionally distinct genes in a set of 22 alpha and gamma proteobacteria. CONCLUSIONS JContextExplorer should enable a broad range of users to analyze and explore genomic contexts. The program has been tested on Windows, Mac, and Linux operating systems, and is implemented both as an executable JAR file and java WebStart. Program executables, source code, and documentation is available at http://www.bme.ucdavis.edu/facciotti/resources_data/software/.
Collapse
Affiliation(s)
- Phillip Seitzer
- Department of Biomedical Engineering, One Shields Ave, University of California, Davis, CA 95616, USA
| | | | | |
Collapse
|
15
|
Zhang X, Greenlee MHW, Serb JM. EnRICH: Extraction and Ranking using Integration and Criteria Heuristics. BMC SYSTEMS BIOLOGY 2013; 7:4. [PMID: 23320748 PMCID: PMC3564850 DOI: 10.1186/1752-0509-7-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/11/2012] [Accepted: 01/07/2013] [Indexed: 11/10/2022]
Abstract
Background High throughput screening technologies enable biologists to generate candidate genes at a rate that, due to time and cost constraints, cannot be studied by experimental approaches in the laboratory. Thus, it has become increasingly important to prioritize candidate genes for experiments. To accomplish this, researchers need to apply selection requirements based on their knowledge, which necessitates qualitative integration of heterogeneous data sources and filtration using multiple criteria. A similar approach can also be applied to putative candidate gene relationships. While automation can assist in this routine and imperative procedure, flexibility of data sources and criteria must not be sacrificed. A tool that can optimize the trade-off between automation and flexibility to simultaneously filter and qualitatively integrate data is needed to prioritize candidate genes and generate composite networks from heterogeneous data sources. Results We developed the java application, EnRICH (Extraction and Ranking using Integration and Criteria Heuristics), in order to alleviate this need. Here we present a case study in which we used EnRICH to integrate and filter multiple candidate gene lists in order to identify potential retinal disease genes. As a result of this procedure, a candidate pool of several hundred genes was narrowed down to five candidate genes, of which four are confirmed retinal disease genes and one is associated with a retinal disease state. Conclusions We developed a platform-independent tool that is able to qualitatively integrate multiple heterogeneous datasets and use different selection criteria to filter each of them, provided the datasets are tables that have distinct identifiers (required) and attributes (optional). With the flexibility to specify data sources and filtering criteria, EnRICH automatically prioritizes candidate genes or gene relationships for biologists based on their specific requirements. Here, we also demonstrate that this tool can be effectively and easily used to apply highly specific user-defined criteria and can efficiently identify high quality candidate genes from relatively sparse datasets.
Collapse
Affiliation(s)
- Xia Zhang
- Department of Biomedical Sciences, 2008 Veterinary Medicine, Iowa State University, Ames, IA 50010, USA
| | | | | |
Collapse
|
16
|
Abstract
Understanding biological complexity demands a combination of high-throughput data and interdisciplinary skills. One way to bring to bear the necessary combination of data types and expertise is by encapsulating domain knowledge in software and composing that software to create a customized data analysis environment. To this end, simple flexible strategies are needed for interconnecting heterogeneous software tools and enabling data exchange between them. Drawing on our own work and that of others, we present several strategies for interoperability and their consequences, in particular, a set of simple data structures—list, matrix, network, table and tuple—that have proven sufficient to achieve a high degree of interoperability. We provide a few guidelines for the development of future software that will function as part of an interoperable community of software tools for biological data analysis and visualization.
Collapse
|
17
|
Sharma K, Gillum N, Boyd JL, Schmid A. The RosR transcription factor is required for gene expression dynamics in response to extreme oxidative stress in a hypersaline-adapted archaeon. BMC Genomics 2012; 13:351. [PMID: 22846541 PMCID: PMC3443676 DOI: 10.1186/1471-2164-13-351] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2012] [Accepted: 07/05/2012] [Indexed: 11/10/2022] Open
Abstract
Background Previous work has shown that the hypersaline-adapted archaeon, Halobacterium salinarum NRC-1, is highly resistant to oxidative stress caused by exposure to hydrogen peroxide, UV, and gamma radiation. Dynamic alteration of the gene regulatory network (GRN) has been implicated in such resistance. However, the molecular functions of transcription regulatory proteins involved in this response remain unknown. Results Here we have reanalyzed several existing GRN and systems biology datasets for H. salinarum to identify and characterize a novel winged helix-turn-helix transcription factor, VNG0258H, as a regulator required for reactive oxygen species resistance in this organism. This protein appears to be unique to the haloarchaea at the primary sequence level. High throughput quantitative growth assays in a deletion mutant strain implicate VNG0258H in extreme oxidative stress resistance. According to time course gene expression analyses, this transcription factor is required for the appropriate dynamic response of nearly 300 genes to reactive oxygen species damage from paraquat and hydrogen peroxide. These genes are predicted to function in repair of oxidative damage to proteins and DNA. In vivo DNA binding assays demonstrate that VNG0258H binds DNA to mediate gene regulation. Conclusions Together these results suggest that VNG0258H is a novel archaeal transcription factor that regulates gene expression to enable adaptation to the extremely oxidative, hypersaline niche of H. salinarum. We have therefore renamed VNG0258H as RosR, for reactive oxygen species regulator.
Collapse
Affiliation(s)
- Kriti Sharma
- Center for Systems Biology, Institute for Genome Sciences and Policy, Durham, NC 27710, USA
| | | | | | | |
Collapse
|
18
|
Wilbanks EG, Larsen DJ, Neches RY, Yao AI, Wu CY, Kjolby RAS, Facciotti MT. A workflow for genome-wide mapping of archaeal transcription factors with ChIP-seq. Nucleic Acids Res 2012; 40:e74. [PMID: 22323522 PMCID: PMC3378898 DOI: 10.1093/nar/gks063] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Deciphering the structure of gene regulatory networks across the tree of life remains one of the major challenges in postgenomic biology. We present a novel ChIP-seq workflow for the archaea using the model organism Halobacterium salinarum sp. NRC-1 and demonstrate its application for mapping the genome-wide binding sites of natively expressed transcription factors. This end-to-end pipeline is the first protocol for ChIP-seq in archaea, with methods and tools for each stage from gene tagging to data analysis and biological discovery. Genome-wide binding sites for transcription factors with many binding sites (TfbD) are identified with sensitivity, while retaining specificity in the identification the smaller regulons (bacteriorhodopsin-activator protein). Chromosomal tagging of target proteins with a compact epitope facilitates a standardized and cost-effective workflow that is compatible with high-throughput immunoprecipitation of natively expressed transcription factors. The Pique package, an open-source bioinformatics method, is presented for identification of binding events. Relative to ChIP-Chip and qPCR, this workflow offers a robust catalog of protein–DNA binding events with improved spatial resolution and significantly decreased cost. While this study focuses on the application of ChIP-seq in H. salinarum sp. NRC-1, our workflow can also be adapted for use in other archaea and bacteria with basic genetic tools.
Collapse
Affiliation(s)
- Elizabeth G Wilbanks
- University of California Davis, Department of Biomedical Engineering and Genome Center, One Shields Avenue, Davis, CA 95616, USA.
| | | | | | | | | | | | | |
Collapse
|
19
|
Functional genomic and advanced genetic studies reveal novel insights into the metabolism, regulation, and biology of Haloferax volcanii. ARCHAEA-AN INTERNATIONAL MICROBIOLOGICAL JOURNAL 2011; 2011:602408. [PMID: 22190865 PMCID: PMC3235422 DOI: 10.1155/2011/602408] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/19/2011] [Revised: 07/04/2011] [Accepted: 09/06/2011] [Indexed: 11/18/2022]
Abstract
The genome sequence of Haloferax volcanii is available and several comparative genomic in silico studies were performed that yielded novel insight for example into protein export, RNA modifications, small non-coding RNAs, and ubiquitin-like Small Archaeal Modifier Proteins. The full range of functional genomic methods has been established and results from transcriptomic, proteomic and metabolomic studies are discussed. Notably, Hfx. volcanii is together with Halobacterium salinarum the only prokaryotic species for which a translatome analysis has been performed. The results revealed that the fraction of translationally-regulated genes in haloarchaea is as high as in eukaryotes. A highly efficient genetic system has been established that enables the application of libraries as well as the parallel generation of genomic deletion mutants. Facile mutant generation is complemented by the possibility to culture Hfx. volcanii in microtiter plates, allowing the phenotyping of mutant collections. Genetic approaches are currently used to study diverse biological questions–from replication to posttranslational modification—and selected results are discussed. Taken together, the wealth of functional genomic and genetic tools make Hfx. volcanii a bona fide archaeal model species, which has enabled the generation of important results in recent years and will most likely generate further breakthroughs in the future.
Collapse
|
20
|
Turkarslan S, Reiss DJ, Gibbins G, Su WL, Pan M, Bare JC, Plaisier CL, Baliga NS. Niche adaptation by expansion and reprogramming of general transcription factors. Mol Syst Biol 2011; 7:554. [PMID: 22108796 PMCID: PMC3261711 DOI: 10.1038/msb.2011.87] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2011] [Accepted: 10/25/2011] [Indexed: 02/01/2023] Open
Abstract
Numerous lineage-specific expansions of the transcription factor B (TFB) family in archaea suggests an important role for expanded TFBs in encoding environment-specific gene regulatory programs. Given the characteristics of hypersaline lakes, the unusually large numbers of TFBs in halophilic archaea further suggests that they might be especially important in rapid adaptation to the challenges of a dynamically changing environment. Motivated by these observations, we have investigated the implications of TFB expansions by correlating sequence variations, regulation, and physical interactions of all seven TFBs in Halobacterium salinarum NRC-1 to their fitness landscapes, functional hierarchies, and genetic interactions across 2488 experiments covering combinatorial variations in salt, pH, temperature, and Cu stress. This systems analysis has revealed an elegant scheme in which completely novel fitness landscapes are generated by gene conversion events that introduce subtle changes to the regulation or physical interactions of duplicated TFBs. Based on these insights, we have introduced a synthetically redesigned TFB and altered the regulation of existing TFBs to illustrate how archaea can rapidly generate novel phenotypes by simply reprogramming their TFB regulatory network.
Collapse
Affiliation(s)
| | - David J Reiss
- Baliga Lab, Institute for Systems Biology, Seattle, WA, USA
| | | | - Wan Lin Su
- Baliga Lab, Institute for Systems Biology, Seattle, WA, USA
| | - Min Pan
- Baliga Lab, Institute for Systems Biology, Seattle, WA, USA
| | | | | | - Nitin S Baliga
- Baliga Lab, Institute for Systems Biology, Seattle, WA, USA
- Department of Microbiology, University of Washington, Seattle, WA, USA
- Department of Biology, Molecular and Cellular Biology Program, University of Washington, Seattle, WA, USA
| |
Collapse
|
21
|
Battke F, Symons S, Herbig A, Nieselt K. GaggleBridge: collaborative data analysis. ACTA ACUST UNITED AC 2011; 27:2612-3. [PMID: 21775306 DOI: 10.1093/bioinformatics/btr429] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
MOTIVATION Tools aiding in collaborative data analysis are becoming ever more important as researchers work together over long distances. We present an extension to the Gaggle framework, which has been widely adopted as a tool to enable data exchange between different analysis programs on one computer. RESULTS Our program, GaggleBridge, transparently extends this functionality to allow data exchange between Gaggle users at different geographic locations using network communication. GaggleBridge can automatically set up SSH tunnels to traverse firewalls while adding some security features to the Gaggle communication. AVAILABILITY GaggleBridge is available as open-source software implemented in the Java language at http://it.inf.uni-tuebingen.de/gb. CONTACT florian.battke@uni-tuebingen.de SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Florian Battke
- Integrative Transcriptomics, Center for Bioinformatics, University of Tuebingen, 72076 Tuebingen, Germany.
| | | | | | | |
Collapse
|
22
|
Yoon SH, Reiss DJ, Bare JC, Tenenbaum D, Pan M, Slagel J, Moritz RL, Lim S, Hackett M, Menon AL, Adams MWW, Barnebey A, Yannone SM, Leigh JA, Baliga NS. Parallel evolution of transcriptome architecture during genome reorganization. Genome Res 2011; 21:1892-904. [PMID: 21750103 DOI: 10.1101/gr.122218.111] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Assembly of genes into operons is generally viewed as an important process during the continual adaptation of microbes to changing environmental challenges. However, the genome reorganization events that drive this process are also the roots of instability for existing operons. We have determined that there exists a statistically significant trend that correlates the proportion of genes encoded in operons in archaea to their phylogenetic lineage. We have further characterized how microbes deal with operon instability by mapping and comparing transcriptome architectures of four phylogenetically diverse extremophiles that span the range of operon stabilities observed across archaeal lineages: a photoheterotrophic halophile (Halobacterium salinarum NRC-1), a hydrogenotrophic methanogen (Methanococcus maripaludis S2), an acidophilic and aerobic thermophile (Sulfolobus solfataricus P2), and an anaerobic hyperthermophile (Pyrococcus furiosus DSM 3638). We demonstrate how the evolution of transcriptional elements (promoters and terminators) generates new operons, restores the coordinated regulation of translocated, inverted, and newly acquired genes, and introduces completely novel regulation for even some of the most conserved operonic genes such as those encoding subunits of the ribosome. The inverse correlation (r=-0.92) between the proportion of operons with such internally located transcriptional elements and the fraction of conserved operons in each of the four archaea reveals an unprecedented view into varying stages of operon evolution. Importantly, our integrated analysis has revealed that organisms adapted to higher growth temperatures have lower tolerance for genome reorganization events that disrupt operon structures.
Collapse
Affiliation(s)
- Sung Ho Yoon
- Institute for Systems Biology, Seattle, Washington 98109, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
23
|
|