101
|
Blaise M, Becker HD, Lapointe J, Cambillau C, Giegé R, Kern D. Glu-Q-tRNAAsp synthetase coded by the yadB gene, a new paralog of aminoacyl-tRNA synthetase that glutamylates tRNAAsp anticodon. Biochimie 2005; 87:847-61. [PMID: 16164993 DOI: 10.1016/j.biochi.2005.03.007] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2004] [Revised: 02/18/2005] [Accepted: 03/17/2005] [Indexed: 10/25/2022]
Abstract
Analysis of the completed genome sequences revealed presence in various bacteria of an open reading frame (ORF) encoding a polypeptide chain presenting important similarities with the catalytic domain of glutamyl-tRNA synthetases but deprived of the C-terminal anticodon-binding domain. This paralog of glutamyl-tRNA synthetases, the YadB protein, activates glutamate in the absence of tRNA and transfers the activated glutamate not on tRNA(Glu) but instead on tRNA(Asp). It has been shown that tRNA(Asp) is able to accept two amino acids: aspartate charged by aspartyl-tRNA synthetase and glutamate charged by YadB. The functional properties of YadB contrast with those of the canonical glutamyl-tRNA synthetases, which activate Glu only in presence of the cognate tRNA before aminoacylation of the 3'-end of tRNA. Biochemical approaches and mass spectrometry investigations revealed that YadB transfers the activated glutamate on the cyclopenthene-diol ring of the modified nucleoside queuosine posttranscriptionally inserted at the wobble position of the anticodon-loop to form glutamyl-queuosine. Unstability of the ester bond between the glutamate residue and the cyclopenthene-diol (half-life 7.5 min) explains why until now this modification escaped detection. Among Escherichia coli tRNAs containing queuosine in the wobble position, only tRNA(Asp) is substrate of YadB. Sequence comparison reveals a structural mimicry between the anticodon-stem and loop of tRNA(Asp) and the amino acid acceptor-stem of tRNA(Glu). YadB, renamed glutamyl-Q-tRNA(Asp) synthetase, constitutes the first enzyme structurally related to aminoacyl-tRNA synthetases which catalyzes a hypermodification in tRNA, and whose function seems to be conserved among prokaryotes. The discovery of glutamyl-Q-tRNA(Asp) synthetase breaks down the current paradigm according to which the catalytic domain of aminoacyl-tRNA synthetases recognizes the amino acid acceptor-stem of tRNA and aminoacylates the 3'-terminal ribose. The evolutionary significance of the existence of an aminoacyl-tRNA synthetase paralog dedicated to the hypermodification of a tRNA anticodon will be discussed.
Collapse
Affiliation(s)
- Mickael Blaise
- Département Machineries Traductionnelles, UPR 9002 du CNRS, Institut de Biologie Moléculaire et Cellulaire, 15, rue René Descartes, 67084 Strasbourg cedex, France
| | | | | | | | | | | |
Collapse
|
102
|
Axmann IM, Kensche P, Vogel J, Kohl S, Herzel H, Hess WR. Identification of cyanobacterial non-coding RNAs by comparative genome analysis. Genome Biol 2005; 6:R73. [PMID: 16168080 PMCID: PMC1242208 DOI: 10.1186/gb-2005-6-9-r73] [Citation(s) in RCA: 117] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2005] [Revised: 06/01/2005] [Accepted: 07/20/2005] [Indexed: 01/09/2023] Open
Abstract
The first genome-wide and systematic screen for non-coding RNAs (ncRNAs) in cyanobacteria. Several ncRNAs were computationally predicted and their presence was biochemically verified. These ncRNAs may have regulatory functions, and each shows a distinct phylogenetic distribution. Background Whole genome sequencing of marine cyanobacteria has revealed an unprecedented degree of genomic variation and streamlining. With a size of 1.66 megabase-pairs, Prochlorococcus sp. MED4 has the most compact of these genomes and it is enigmatic how the few identified regulatory proteins efficiently sustain the lifestyle of an ecologically successful marine microorganism. Small non-coding RNAs (ncRNAs) control a plethora of processes in eukaryotes as well as in bacteria; however, systematic searches for ncRNAs are still lacking for most eubacterial phyla outside the enterobacteria. Results Based on a computational prediction we show the presence of several ncRNAs (cyanobacterial functional RNA or Yfr) in several different cyanobacteria of the Prochlorococcus-Synechococcus lineage. Some ncRNA genes are present only in two or three of the four strains investigated, whereas the RNAs Yfr2 through Yfr5 are structurally highly related and are encoded by a rapidly evolving gene family as their genes exist in different copy numbers and at different sites in the four investigated genomes. One ncRNA, Yfr7, is present in at least seven other cyanobacteria. In addition, control elements for several ribosomal operons were predicted as well as riboswitches for thiamine pyrophosphate and cobalamin. Conclusion This is the first genome-wide and systematic screen for ncRNAs in cyanobacteria. Several ncRNAs were both computationally predicted and their presence was biochemically verified. These RNAs may have regulatory functions and each shows a distinct phylogenetic distribution. Our approach can be applied to any group of microorganisms for which more than one total genome sequence is available for comparative analysis.
Collapse
Affiliation(s)
- Ilka M Axmann
- Humboldt-University, Department of Biology/Genetics, Chausseestrasse, D-Berlin, Germany
| | - Philip Kensche
- Humboldt-University, Department of Biology/Genetics, Chausseestrasse, D-Berlin, Germany
- Humboldt-University, Institute for Theoretical Biology, Invalidenstrasse, Berlin, Germany
| | - Jörg Vogel
- Max Planck Institute for Infection Biology, Schumannstrasse, Berlin, Germany
| | - Stefan Kohl
- Humboldt-University, Department of Biology/Genetics, Chausseestrasse, D-Berlin, Germany
| | - Hanspeter Herzel
- Humboldt-University, Institute for Theoretical Biology, Invalidenstrasse, Berlin, Germany
| | - Wolfgang R Hess
- Humboldt-University, Department of Biology/Genetics, Chausseestrasse, D-Berlin, Germany
- University Freiburg, Institute of Biology II/Experimental Bioinformatics, Schänzlestrasse, Freiburg, Germany
| |
Collapse
|
103
|
Vanderpool CK, Gottesman S. Involvement of a novel transcriptional activator and small RNA in post-transcriptional regulation of the glucose phosphoenolpyruvate phosphotransferase system. Mol Microbiol 2005; 54:1076-89. [PMID: 15522088 DOI: 10.1111/j.1365-2958.2004.04348.x] [Citation(s) in RCA: 272] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
RyaA is a small non-coding RNA in Escherichia coli that was identified by its ability to bind tightly to the RNA chaperone Hfq. This study reports the role of RyaA in mediating the cellular response to glucose-specific phosphoenolypyruvate phosphotransferase system (PTS)-dependent phosphosugar stress. Aiba and co-workers have shown that a block in the metabolism of glucose 6-phosphate causes transient growth inhibition and post-transcriptional regulation of ptsG, encoding the glucose-specific PTS transporter. We found that RyaA synthesis was induced by a non-metabolizable glucose phosphate analogue and was necessary for relief of the toxicity of glucose phosphate stress. Expression of RyaA was sufficient to cause a rapid loss of ptsG mRNA, probably reflecting degradation of the message mediated by RyaA:ptsG pairing. The ryaA gene was renamed sgrS, for sugar transport-related sRNA. Expression of sgrS is regulated by a novel transcriptional activator, SgrR (formerly YabN), which has a putative DNA-binding domain and a solute-binding domain similar to those found in certain transport proteins. Our results suggest that under conditions of glucose phosphate accumulation, SgrR activates SgrS synthesis, causing degradation of ptsG mRNA. Decreased ptsG mRNA results in decreased production of glucose transport machinery, thus limiting further accumulation of glucose phosphate.
Collapse
Affiliation(s)
- Carin K Vanderpool
- Laboratory of Molecular Biology, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | | |
Collapse
|
104
|
Livny J, Fogel MA, Davis BM, Waldor MK. sRNAPredict: an integrative computational approach to identify sRNAs in bacterial genomes. Nucleic Acids Res 2005; 33:4096-105. [PMID: 16049021 PMCID: PMC1180744 DOI: 10.1093/nar/gki715] [Citation(s) in RCA: 74] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
Small non-coding bacterial RNAs (sRNAs) play important regulatory roles in a variety of cellular processes. Nearly all known sRNAs have been identified in Escherichia coli and most of these are not conserved in the majority of other bacterial species. Many of the E.coli sRNAs were initially predicted through bioinformatic approaches based on their common features, namely that they are encoded between annotated open reading frames and are flanked by predictable transcription signals. Because promoter consensus sequences are undetermined for most species, the successful use of bioinformatics to identify sRNAs in bacteria other than E.coli has been limited. We have created a program, sRNAPredict, which uses coordinate-based algorithms to integrate the respective positions of individual predictive features of sRNAs and rapidly identify putative intergenic sRNAs. Relying only on sequence conservation and predicted Rho-independent terminators, sRNAPredict was used to search for sRNAs in Vibrio cholerae. This search identified 9 of the 10 known or putative V.cholerae sRNAs and 32 candidates for novel sRNAs. Small transcripts for 6 out of 9 candidate sRNAs were observed by Northern analysis. Our findings suggest that sRNAPredict can be used to efficiently identify novel sRNAs even in bacteria for which promoter consensus sequences are not available.
Collapse
Affiliation(s)
- Jonathan Livny
- To whom correspondence should be addressed. Tel: +1 617 636 2778; Fax: +1 617 636 2723;
| | | | | | | |
Collapse
|
105
|
Nittler MP, Hocking-Murray D, Foo CK, Sil A. Identification of Histoplasma capsulatum transcripts induced in response to reactive nitrogen species. Mol Biol Cell 2005; 16:4792-813. [PMID: 16030248 PMCID: PMC1237084 DOI: 10.1091/mbc.e05-05-0434] [Citation(s) in RCA: 62] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
Abstract
The pathogenic fungus Histoplasma capsulatum escapes innate immune defenses and colonizes host macrophages during infection. After the onset of adaptive immunity, the production of the antimicrobial effector nitric oxide (*NO) restricts H. capsulatum replication. However, H. capsulatum can establish persistent infections, indicating that it survives in the host despite exposure to reactive nitrogen species (RNS). To understand how H. capsulatum responds to RNS, we determined the transcriptional profile of H. capsulatum to *NO-generating compounds using a shotgun genomic microarray. We identified 695 microarray clones that were induced > or = 4-fold upon nitrosative stress. Because our microarray clones were generated from random fragments of genomic DNA, they did not necessarily correspond to H. capsulatum open reading frames. To identify induced genes, we used high-density oligonucleotide tiling arrays to determine the genomic boundaries and coding strand of 153 RNS-induced transcripts. Homologues of these genes in other organisms are involved in iron acquisition, energy production, stress response, protein folding/degradation, DNA repair, and *NO detoxification. Ectopic expression of one of these genes, a P450 nitric oxide reductase homologue, was sufficient to increase resistance of H. capsulatum to RNS in culture. We propose that H. capsulatum uses the pathways identified here to cope with RNS-induced damage during pathogenesis.
Collapse
Affiliation(s)
- M Paige Nittler
- Department of Microbiology and Immunology, University of California-San Francisco, San Francisco, CA 94143-0414, USA
| | | | | | | |
Collapse
|
106
|
Ko JH, Lee Y. RNA-conjugated template-switching RT-PCR method for generating an Escherichia coli cDNA library for small RNAs. J Microbiol Methods 2005; 64:297-304. [PMID: 15987660 DOI: 10.1016/j.mimet.2005.05.011] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2005] [Revised: 05/02/2005] [Accepted: 05/20/2005] [Indexed: 11/16/2022]
Abstract
The cDNA conversion of RNA molecules is a prerequisite for their analysis. In the case of prokaryotic RNAs, cDNA conversion is difficult due to a lack of the long poly(A) tails that are found in eukaryotic mRNAs. The full cDNAs for eukaryotic mRNAs can be amplified by the reverse transcription polymerase chain reaction (RT-PCR) using the template-switching method together with an oligo(dT) primer. To amplify the full cDNAs for prokaryotic RNAs, we modified the template-switching RT-PCR method by adopting an RNA linker at the 3' end of the target RNAs. Using this method, which we named as RNA-conjugated template-switching RT-PCR (RC/TS RT-PCR), we constructed a cDNA library for small RNAs from cold-shock-treated Escherichia coli cells. To confirm that the cDNAs were amplified by RC/TS RT-PCR without a loss of sequence information, clones carrying the 6S RNA sequence were analyzed from the cDNA library for small RNAs ranging from 130 to 350 nt. We found that the 6S RNA sequences were fully converted into the corresponding cDNAs, confirming that RC/TS RT-PCR is a useful method for constructing a cDNA library for small RNAs in E. coli. This method can be also used to construct a cDNA library for non-poly(A)-containing RNAs from eukaryotic cells.
Collapse
Affiliation(s)
- Jae-Hyeong Ko
- Department of Chemistry, and Center for Molecular Design and Synthesis, Korea Advanced Institute of Science and Technology, Daejeon 305-701, Republic of Korea
| | | |
Collapse
|
107
|
Pieterse B, Jellema RH, van der Werf MJ. Quenching of microbial samples for increased reliability of microarray data. J Microbiol Methods 2005; 64:207-16. [PMID: 15982764 DOI: 10.1016/j.mimet.2005.04.035] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2005] [Revised: 04/25/2005] [Accepted: 04/26/2005] [Indexed: 10/25/2022]
Abstract
Messenger RNA levels change on a minutes scale due to both degradation and de novo transcription. Consequently, alterations in the transcript profiles that are not representative for the condition of interest are easily introduced during sample harvesting and work-up. In order to avoid these unwanted changes we have validated a -45 degrees C methanol-based quenching method for obtaining reliable and reproducible 'snapshot' samples of Lactobacillus plantarum cells for transcriptome analyses. Transcript profiles of cells harvested with the quenching method were compared with transcript profiles of cells that were harvested according to two different commonly applied protocols. Significant differences between the transcript profiles of cells harvested by the different methods from the same steady-state culture were observed. In total, 42 genes or operons were identified from which the transcript levels were altered when the cells were not immediately quenched upon harvesting. Among these, several have previously been associated with cold-shock response. Furthermore, the reproducibility of transcript profiles improved, as indicated by the fact that the variation in the data sets obtained from the quenched cells was smaller than in the data sets obtained from the cells that were harvested under non-quenched conditions.
Collapse
Affiliation(s)
- Bart Pieterse
- Wageningen Centre for Food Sciences, Diedenweg 20, 6700 AN Wageningen, The Netherlands
| | | | | |
Collapse
|
108
|
Abstract
Genome-wide analysis of temporal gene expression profiles in Escherichia coli following exposure to cadmium revealed a shift to anaerobic metabolism and induction of several stress response systems. Disruption in the transcription of genes encoding ribosomal proteins and zinc-binding proteins may partially explain the molecular mechanisms of cadmium toxicity.
Collapse
Affiliation(s)
- Anyou Wang
- Department of Environmental Sciences, University of Caliifornia, Riverside, CA 92521, USA
| | | |
Collapse
|
109
|
Havilio M, Levanon EY, Lerman G, Kupiec M, Eisenberg E. Evidence for abundant transcription of non-coding regions in the Saccharomyces cerevisiae genome. BMC Genomics 2005; 6:93. [PMID: 15960846 PMCID: PMC1181813 DOI: 10.1186/1471-2164-6-93] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2004] [Accepted: 06/16/2005] [Indexed: 11/23/2022] Open
Abstract
Background Recent studies in a growing number of organisms have yielded accumulating evidence that a significant portion of the non-coding region in the genome is transcribed. We address this issue in the yeast Saccharomyces cerevisiae. Results Taking into account the absence of a significantly large yeast EST database, we use microarray expression data collected for genomic regions erroneously believed to be coding to study the expression pattern of non-coding regions in the Saccharomyces cerevisiae genome. We find that at least 164 out of 589 (28%) such regions are expressed under specific biological conditions. In particular, looking at the probes that are located opposing other known genes at the same genomic locus, we find that 88 out of 341 (26%) of these genes support antisense transcription. The expression patterns of these antisense genes are positively correlated. We validate these results using RT-PCR on a sample of 6 non-coding transcripts. Conclusion 1. The yeast genome is transcribed on a scale larger than previously assumed. 2. Correlated transcription of antisense genes is abundant in the yeast genome. 3. Antisense genes in yeast are non-coding.
Collapse
Affiliation(s)
- Moshe Havilio
- Compugen Ltd., 72 Pinchas Rosen St., Tel-Aviv 69512, Israel
| | - Erez Y Levanon
- Compugen Ltd., 72 Pinchas Rosen St., Tel-Aviv 69512, Israel
- Department of Pediatric Hematology-Oncology, Chaim Sheba Medical Center and Sackler School of Medicine, Tel Aviv University, Tel Aviv 52621, Israel
| | - Galia Lerman
- Department of Molecular Microbiology and Biotechnology, Tel Aviv University, Ramat Aviv 69978, Israel
| | - Martin Kupiec
- Department of Molecular Microbiology and Biotechnology, Tel Aviv University, Ramat Aviv 69978, Israel
| | - Eli Eisenberg
- Compugen Ltd., 72 Pinchas Rosen St., Tel-Aviv 69512, Israel
- School of Physics and Astronomy, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel Aviv 69978, Israel
| |
Collapse
|
110
|
Saetrom P, Sneve R, Kristiansen KI, Snøve O, Grünfeld T, Rognes T, Seeberg E. Predicting non-coding RNA genes in Escherichia coli with boosted genetic programming. Nucleic Acids Res 2005; 33:3263-70. [PMID: 15942029 PMCID: PMC1143698 DOI: 10.1093/nar/gki644] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
Several methods exist for predicting non-coding RNA (ncRNA) genes in Escherichia coli (E.coli). In addition to about sixty known ncRNA genes excluding tRNAs and rRNAs, various methods have predicted more than thousand ncRNA genes, but only 95 of these candidates were confirmed by more than one study. Here, we introduce a new method that uses automatic discovery of sequence patterns to predict ncRNA genes. The method predicts 135 novel candidates. In addition, the method predicts 152 genes that overlap with predictions in the literature. We test sixteen predictions experimentally, and show that twelve of these are actual ncRNA transcripts. Six of the twelve verified candidates were novel predictions. The relatively high confirmation rate indicates that many of the untested novel predictions are also ncRNAs, and we therefore speculate that E.coli contains more ncRNA genes than previously estimated.
Collapse
Affiliation(s)
- Pål Saetrom
- Interagon AS, Medisinsk teknisk senter NO-7489 Trondheim, Norway.
| | | | | | | | | | | | | |
Collapse
|
111
|
Abstract
Over the past two decades a variety of mechanisms regulating cellular differentiation have been uncovered. These include signaling by morphogens or membrane-associated ligands and asymmetric segregation of cytoplasmic components. Most of these processes are driven by protein coding genes. Here I describe another possible cellular differentiation mechanism that involves asymmetric segregation of microRNAs, a group of recently discovered non-protein coding genes that have been shown to be involved in differentiation.
Collapse
Affiliation(s)
- Isaac Bentwich
- Rosetta Genomics, 10 Plaut Street, Rehovot 76706, Israel.
| |
Collapse
|
112
|
Vogel J, Argaman L, Wagner EGH, Altuvia S. The small RNA IstR inhibits synthesis of an SOS-induced toxic peptide. Curr Biol 2005; 14:2271-6. [PMID: 15620655 DOI: 10.1016/j.cub.2004.12.003] [Citation(s) in RCA: 209] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2004] [Revised: 10/20/2004] [Accepted: 10/22/2004] [Indexed: 11/23/2022]
Abstract
More than 60 small RNAs (sRNA) have been identified in E. coli. The functions of the majority of these sRNAs are still unclear. For the few sRNAs characterized, expression and functional studies indicate that they act under stress conditions. Here, we describe a novel E. coli chromosome locus that is part of the SOS response to DNA damage. This locus encodes two sRNAs, IstR-1 and IstR-2, and a toxic peptide, TisB, encoded by tisAB mRNA. Transcription of tisAB and istR-2 is SOS regulated, whereas IstR-1 is present throughout growth. IstR-1 inhibits toxicity by base-pairing to a short region in the tisAB mRNA. This antisense interaction entails RNase III-dependent cleavage, thereby inactivating the mRNA for translation. In the absence of the SOS response, IstR-1 is present in high excess over its target. However, SOS induction leads to depletion of the IstR-1 pool, concomitant with accumulation of tisAB mRNA. Under such conditions, TisB exerts its toxic effect, slowing down growth. We propose that the inhibitory sRNA prevents inadvertent TisB synthesis during normal growth and, possibly, also limits SOS-induced toxicity. Our study adds the SOS regulon to the growing list of global regulatory circuits controlled by sRNA genes.
Collapse
Affiliation(s)
- Jörg Vogel
- Department of Molecular Genetics and Biotechnology, The Hebrew University Hadassah Medical School, Jerusalem 91120, Israel.
| | | | | | | |
Collapse
|
113
|
Kawano M, Reynolds AA, Miranda-Rios J, Storz G. Detection of 5'- and 3'-UTR-derived small RNAs and cis-encoded antisense RNAs in Escherichia coli. Nucleic Acids Res 2005; 33:1040-50. [PMID: 15718303 PMCID: PMC549416 DOI: 10.1093/nar/gki256] [Citation(s) in RCA: 197] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Evidence is accumulating that small, noncoding RNAs are important regulatory molecules. Computational and experimental searches have led to the identification of approximately 60 small RNA genes in Escherichia coli. However, most of these studies focused on the intergenic regions and assumed that small RNAs were >50 nt. Thus, the previous screens missed small RNAs encoded on the antisense strand of protein-coding genes and small RNAs of <50 nt. To identify additional small RNAs, we carried out a cloning-based screen focused on RNAs of 30-65 nt. In this screen, we identified RNA species corresponding to fragments of rRNAs, tRNAs and known small RNAs. Several of the small RNAs also corresponded to 5'- and 3'-untranslated regions (UTRs) and internal fragments of mRNAs. Four of the 3'-UTR-derived RNAs were highly abundant and two showed expression patterns that differed from the corresponding mRNAs, suggesting independent functions for the 3'-UTR-derived small RNAs. We also detected three previously unidentified RNAs encoded in intergenic regions and RNAs from the long direct repeat and hok/sok elements. In addition, we identified a few small RNAs that are expressed opposite protein-coding genes and could base pair with 5' or 3' ends of the mRNAs with perfect complementarity.
Collapse
MESH Headings
- 3' Untranslated Regions/chemistry
- 5' Untranslated Regions/chemistry
- Cloning, Molecular
- DNA, Intergenic
- Escherichia coli/genetics
- RNA, Antisense/analysis
- RNA, Antisense/genetics
- RNA, Bacterial/analysis
- RNA, Bacterial/genetics
- RNA, Messenger/chemistry
- RNA, Ribosomal/chemistry
- RNA, Transfer/chemistry
- RNA, Untranslated/analysis
- RNA, Untranslated/genetics
- Repetitive Sequences, Nucleic Acid
Collapse
Affiliation(s)
- Mitsuoki Kawano
- Cell Biology and Metabolism Branch, National Institute of Child Health and Human DevelopmentBuilding 18T, Room 101, Bethesda, MD 20892-5430, USA
| | - April A. Reynolds
- Cell Biology and Metabolism Branch, National Institute of Child Health and Human DevelopmentBuilding 18T, Room 101, Bethesda, MD 20892-5430, USA
| | - Juan Miranda-Rios
- Cell Biology and Metabolism Branch, National Institute of Child Health and Human DevelopmentBuilding 18T, Room 101, Bethesda, MD 20892-5430, USA
- Molecular Microbiology Department, Biotechnology Institute, National University of MexicoAv. Universidad #2001, Col. Chamilpa, CP. 62210, Cuernavaca, Morelos, Mexico
| | - Gisela Storz
- Cell Biology and Metabolism Branch, National Institute of Child Health and Human DevelopmentBuilding 18T, Room 101, Bethesda, MD 20892-5430, USA
- To whom correspondence should be addressed. Tel: +1 301 402 0968; Fax: +1 301 402 0078;
| |
Collapse
|
114
|
Abstract
Small noncoding RNAs have been found in all organisms, primarily as regulators of translation and message stability. The most exhaustive searches have taken place in E. coli, resulting in identification of more than 50 small RNAs, or 1%-2% of the number of protein-coding genes. One large class of these small RNAs uses the RNA chaperone Hfq; members of this class act by pairing to target messenger RNAs. Among the members of this class are DsrA and RprA, which positively regulate rpoS translation, OxyS, which negatively regulates rpoS translation and fhlA translation, RyhB, which reapportions iron use in the cell by downregulating translation of many genes that encode Fe-containing proteins, and Spot 42, which changes the polarity of translation in the gal operon. The promoters of these small RNAs are tightly regulated, frequently as part of well-understood regulons. Lessons learned from the study of small RNAs in E. coli can be applied to finding these important regulators in other organisms.
Collapse
Affiliation(s)
- Susan Gottesman
- Laboratory of Molecular Biology, National Cancer Institute, Bethesda, Maryland 20892, USA.
| |
Collapse
|
115
|
Bertone P, Stolc V, Royce TE, Rozowsky JS, Urban AE, Zhu X, Rinn JL, Tongprasit W, Samanta M, Weissman S, Gerstein M, Snyder M. Global identification of human transcribed sequences with genome tiling arrays. Science 2004; 306:2242-6. [PMID: 15539566 DOI: 10.1126/science.1103388] [Citation(s) in RCA: 772] [Impact Index Per Article: 38.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
Elucidating the transcribed regions of the genome constitutes a fundamental aspect of human biology, yet this remains an outstanding problem. To comprehensively identify coding sequences, we constructed a series of high-density oligonucleotide tiling arrays representing sense and antisense strands of the entire nonrepetitive sequence of the human genome. Transcribed sequences were located across the genome via hybridization to complementary DNA samples, reverse-transcribed from polyadenylated RNA obtained from human liver tissue. In addition to identifying many known and predicted genes, we found 10,595 transcribed sequences not detected by other methods. A large fraction of these are located in intergenic regions distal from previously annotated genes and exhibit significant homology to other mammalian proteins.
Collapse
Affiliation(s)
- Paul Bertone
- Department of Molecular, Cellular, and Developmental Biology, Yale University, New Haven, CT 06520-8103, USA
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
116
|
Affiliation(s)
- Shoshy Altuvia
- Department of Molecular Genetics and Biotechnology, The Hebrew University-Hadassah Medical School, Jerusalem 91120, Israel.
| |
Collapse
|
117
|
Holzman T, Kolker E. Statistical analysis of global gene expression data: some practical considerations. Curr Opin Biotechnol 2004; 15:52-7. [PMID: 15102467 DOI: 10.1016/j.copbio.2003.12.004] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Applying appropriate error models and conservative estimates to microarray data helps to reduce the number of false predictions and allows one to focus on biologically relevant observations. Several key conclusions have been drawn from the statistical analysis of global gene expression data: it is worth keeping core information for each experiment, including raw and processed data; biological and technical replicates are needed; careful experimental design makes the analysis simpler and more powerful; the choice of the similarity measure is nontrivial and depends on the goal of an experiment; array information must be complemented with other data; and gene expression studies are 'hypothesis generators'.
Collapse
Affiliation(s)
- Ted Holzman
- BIATECH, 19310 North Creek Parkway, Suite 115, Bothell, WA 98011, USA
| | | |
Collapse
|
118
|
Schadt EE, Edwards SW, GuhaThakurta D, Holder D, Ying L, Svetnik V, Leonardson A, Hart KW, Russell A, Li G, Cavet G, Castle J, McDonagh P, Kan Z, Chen R, Kasarskis A, Margarint M, Caceres RM, Johnson JM, Armour CD, Garrett-Engele PW, Tsinoremas NF, Shoemaker DD. A comprehensive transcript index of the human genome generated using microarrays and computational approaches. Genome Biol 2004; 5:R73. [PMID: 15461792 PMCID: PMC545593 DOI: 10.1186/gb-2004-5-10-r73] [Citation(s) in RCA: 82] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2004] [Revised: 07/07/2004] [Accepted: 08/16/2004] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Computational and microarray-based experimental approaches were used to generate a comprehensive transcript index for the human genome. Oligonucleotide probes designed from approximately 50,000 known and predicted transcript sequences from the human genome were used to survey transcription from a diverse set of 60 tissues and cell lines using ink-jet microarrays. Further, expression activity over at least six conditions was more generally assessed using genomic tiling arrays consisting of probes tiled through a repeat-masked version of the genomic sequence making up chromosomes 20 and 22. RESULTS The combination of microarray data with extensive genome annotations resulted in a set of 28,456 experimentally supported transcripts. This set of high-confidence transcripts represents the first experimentally driven annotation of the human genome. In addition, the results from genomic tiling suggest that a large amount of transcription exists outside of annotated regions of the genome and serves as an example of how this activity could be measured on a genome-wide scale. CONCLUSIONS These data represent one of the most comprehensive assessments of transcriptional activity in the human genome and provide an atlas of human gene expression over a unique set of gene predictions. Before the annotation of the human genome is considered complete, however, the previously unannotated transcriptional activity throughout the genome must be fully characterized.
Collapse
Affiliation(s)
- Eric E Schadt
- Rosetta Inpharmatics LLC, 12040 115th Avenue NE, Kirkland, WA 98034, USA
| | - Stephen W Edwards
- Rosetta Inpharmatics LLC, 12040 115th Avenue NE, Kirkland, WA 98034, USA
| | | | - Dan Holder
- Merck Research Laboratories, W42-213 Sumneytown Pike, POB 4, Westpoint, PA 19846, USA
| | - Lisa Ying
- Merck Research Laboratories, W42-213 Sumneytown Pike, POB 4, Westpoint, PA 19846, USA
| | - Vladimir Svetnik
- Merck Research Laboratories, W42-213 Sumneytown Pike, POB 4, Westpoint, PA 19846, USA
| | - Amy Leonardson
- Rosetta Inpharmatics LLC, 12040 115th Avenue NE, Kirkland, WA 98034, USA
| | - Kyle W Hart
- Rally Scientific, 41 Fayette Street, Suite 1, Watertown, MA 02472, USA
| | - Archie Russell
- Rosetta Inpharmatics LLC, 12040 115th Avenue NE, Kirkland, WA 98034, USA
| | - Guoya Li
- Rosetta Inpharmatics LLC, 12040 115th Avenue NE, Kirkland, WA 98034, USA
| | - Guy Cavet
- Rosetta Inpharmatics LLC, 12040 115th Avenue NE, Kirkland, WA 98034, USA
| | - John Castle
- Rosetta Inpharmatics LLC, 12040 115th Avenue NE, Kirkland, WA 98034, USA
| | - Paul McDonagh
- Amgen Inc, 1201 Amgen Court W, Seattle, WA 98119, USA
| | - Zhengyan Kan
- Rosetta Inpharmatics LLC, 12040 115th Avenue NE, Kirkland, WA 98034, USA
| | - Ronghua Chen
- Rosetta Inpharmatics LLC, 12040 115th Avenue NE, Kirkland, WA 98034, USA
| | - Andrew Kasarskis
- Rosetta Inpharmatics LLC, 12040 115th Avenue NE, Kirkland, WA 98034, USA
| | - Mihai Margarint
- Rosetta Inpharmatics LLC, 12040 115th Avenue NE, Kirkland, WA 98034, USA
| | - Ramon M Caceres
- Rosetta Inpharmatics LLC, 12040 115th Avenue NE, Kirkland, WA 98034, USA
| | - Jason M Johnson
- Rosetta Inpharmatics LLC, 12040 115th Avenue NE, Kirkland, WA 98034, USA
| | | | | | | | - Daniel D Shoemaker
- Rosetta Inpharmatics LLC, 12040 115th Avenue NE, Kirkland, WA 98034, USA
| |
Collapse
|
119
|
Callen BP, Shearwin KE, Egan JB. Transcriptional interference between convergent promoters caused by elongation over the promoter. Mol Cell 2004; 14:647-56. [PMID: 15175159 DOI: 10.1016/j.molcel.2004.05.010] [Citation(s) in RCA: 126] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2003] [Revised: 04/05/2004] [Accepted: 04/12/2004] [Indexed: 01/21/2023]
Abstract
Transcriptional interference with convergent transcription from face-to-face promoters is a potentially important form of gene regulation in all organisms. Using LacZ reporter studies, the mechanism of interference was determined for a pair of face-to-face prokaryotic promoters in which a strong promoter interferes 5.6-fold with a weak promoter, 62 bp away. The promoters were variously rearranged to test different models of interference. Terminating transcription from the strong promoter before it reached the weak promoter dramatically reduced interference, indicating a requirement for the passage of the converging RNAP over the weak promoter. Based on in vitro experiments showing a slow rate of escape for open complexes at the weak promoter and their sensitivity to head-on collisions with elongating RNAP, a "sitting duck" model of interference is proposed and supported with in vivo permanganate footprinting. The model is further supported by the analysis of a second set of prokaryotic face-to-face promoters.
Collapse
MESH Headings
- Gene Expression Regulation, Bacterial/genetics
- Genes, Regulator/genetics
- Models, Biological
- Prokaryotic Cells/metabolism
- Promoter Regions, Genetic/genetics
- Promoter Regions, Genetic/physiology
- RNA Interference/physiology
- RNA, Antisense/genetics
- RNA, Antisense/metabolism
- RNA, Messenger/genetics
- RNA, Messenger/metabolism
- Transcription, Genetic/genetics
- Transcription, Genetic/physiology
- Transcriptional Elongation Factors/genetics
Collapse
Affiliation(s)
- Benjamin P Callen
- School of Molecular and Biomedical Science (Biochemistry), University of Adelaide, South Australia 5005, Australia
| | | | | |
Collapse
|
120
|
Kolker E, Makarova KS, Shabalina S, Picone AF, Purvine S, Holzman T, Cherny T, Armbruster D, Munson RS, Kolesov G, Frishman D, Galperin MY. Identification and functional analysis of 'hypothetical' genes expressed in Haemophilus influenzae. Nucleic Acids Res 2004; 32:2353-61. [PMID: 15121896 PMCID: PMC419445 DOI: 10.1093/nar/gkh555] [Citation(s) in RCA: 66] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The progress in genome sequencing has led to a rapid accumulation in GenBank submissions of uncharacterized 'hypothetical' genes. These genes, which have not been experimentally characterized and whose functions cannot be deduced from simple sequence comparisons alone, now comprise a significant fraction of the public databases. Expression analyses of Haemophilus influenzae cells using a combination of transcriptomic and proteomic approaches resulted in confident identification of 54 'hypothetical' genes that were expressed in cells under normal growth conditions. In an attempt to understand the functions of these proteins, we used a variety of publicly available analysis tools. Close homologs in other species were detected for each of the 54 'hypothetical' genes. For 16 of them, exact functional assignments could be found in one or more public databases. Additionally, we were able to suggest general functional characterization for 27 more genes (comprising approximately 80% total). Findings from this analysis include the identification of a pyruvate-formate lyase-like operon, likely to be expressed not only in H.influenzae but also in several other bacteria. Further, we also observed three genes that are likely to participate in the transport and/or metabolism of sialic acid, an important component of the H.influenzae lipo-oligosaccharide. Accurate functional annotation of uncharacterized genes calls for an integrative approach, combining expression studies with extensive computational analysis and curation, followed by eventual experimental verification of the computational predictions.
Collapse
Affiliation(s)
- Eugene Kolker
- BIATECH, 19310 North Creek Parkway, Suite 115, Bothell, WA 98011, USA.
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
121
|
Zhang A, Wassarman KM, Rosenow C, Tjaden BC, Storz G, Gottesman S. Global analysis of small RNA and mRNA targets of Hfq. Mol Microbiol 2004; 50:1111-24. [PMID: 14622403 DOI: 10.1046/j.1365-2958.2003.03734.x] [Citation(s) in RCA: 418] [Impact Index Per Article: 20.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Hfq, a bacterial member of the Sm family of RNA-binding proteins, is required for the action of many small regulatory RNAs that act by basepairing with target mRNAs. Hfq binds this family of small RNAs efficiently. We have used co-immunoprecipitation with Hfq and direct detection of the bound RNAs on genomic microarrays to identify members of this small RNA family. This approach was extremely sensitive; even Hfq-binding small RNAs expressed at low levels were readily detected. At least 15 of 46 known small RNAs in E. coli interact with Hfq. In addition, high signals in other intergenic regions suggested up to 20 previously unidentified small RNAs bind Hfq; five were confirmed by Northern analysis. Strong signals within genes and operons also were detected, some of which correspond to known Hfq targets. Within the argX-hisR-leuT-proM operon, Hfq appears to compete with RNase E and modulate RNA processing and degradation. Thus Hfq immunoprecipitation followed by microarray analysis is a highly effective method for detecting a major class of small RNAs as well as identifying new Hfq functions.
Collapse
Affiliation(s)
- Aixia Zhang
- Cell Biology and Metabolism Branch, National Institute of Child Health and Development, Bethesda MD 20892, USA
| | | | | | | | | | | |
Collapse
|
122
|
Campanacci V, Dubois DY, Becker HD, Kern D, Spinelli S, Valencia C, Pagot F, Salomoni A, Grisel S, Vincentelli R, Bignon C, Lapointe J, Giegé R, Cambillau C. The Escherichia coli YadB gene product reveals a novel aminoacyl-tRNA synthetase like activity. J Mol Biol 2004; 337:273-83. [PMID: 15003446 DOI: 10.1016/j.jmb.2004.01.027] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2003] [Revised: 01/08/2004] [Accepted: 01/08/2004] [Indexed: 11/23/2022]
Abstract
In the course of a structural genomics program aiming at solving the structures of Escherichia coli open reading frame products of unknown function, we have determined the structure of YadB at 1.5A using molecular replacement. The YadB protein is 298 amino acid residues long and displays 34% sequence identity with E.coli glutamyl-tRNA synthetase (GluRS). It is much shorter than GluRS, which contains 468 residues, and lacks the complete domain interacting with the tRNA anticodon loop. As E.coli GluRS, YadB possesses a Zn2+ located in the putative tRNA acceptor stem-binding domain. The YadB cluster uses cysteine residues as the first three zinc ligands, but has a weaker tyrosine ligand at the fourth position. It shares with canonical amino acid RNA synthetases a major functional feature, namely activation of the amino acid (here glutamate). It differs, however, from GluRSs by the fact that the activation step is tRNA-independent and that it does not catalyze attachment of the activated glutamate to E.coli tRNAGlu, but to another, as yet unknown tRNA. These results suggest thus a novel function, distinct from that of GluRSs, for the yadB gene family.
Collapse
Affiliation(s)
- Valérie Campanacci
- Architecture et Fonction des Macromolécules Biologiques, UMR 6098, CNRS and Universités d'Aix-Marseille I and II, 31 chemin J. Aiguier, F-13402 Marseille Cedex 20, France
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
123
|
Vogel J, Bartels V, Tang TH, Churakov G, Slagter-Jäger JG, Hüttenhofer A, Wagner EGH. RNomics in Escherichia coli detects new sRNA species and indicates parallel transcriptional output in bacteria. Nucleic Acids Res 2004; 31:6435-43. [PMID: 14602901 PMCID: PMC275561 DOI: 10.1093/nar/gkg867] [Citation(s) in RCA: 338] [Impact Index Per Article: 16.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
Recent bioinformatics-aided searches have identified many new small RNAs (sRNAs) in the intergenic regions of the bacterium Escherichia coli. Here, a shot-gun cloning approach (RNomics) was used to generate cDNA libraries of small sized RNAs. Besides many of the known sRNAs, we found new species that were not predicted previously. The present work brings the number of sRNAs in E.coli to 62. Experimental transcription start site mapping showed that some sRNAs were encoded from independent genes, while others were processed from mRNA leaders or trailers, indicative of a parallel transcriptional output generating sRNAs co-expressed with mRNAs. Two of these RNAs (SroA and SroG) consist of known (THI and RFN) riboswitch elements. We also show that two recently identified sRNAs (RyeB and SraC/RyeA) interact, resulting in RNase III-dependent cleavage. To the best of our knowledge, this represents the first case of two non-coding RNAs interacting by a putative antisense mechanism. In addition, intracellular metabolic stabilities of sRNAs were determined, including ones from previous screens. The wide range of half-lives (<2 to >32 min) indicates that sRNAs cannot generally be assumed to be metabolically stable. The experimental characterization of sRNAs analyzed here suggests that the definition of an sRNA is more complex than previously assumed.
Collapse
Affiliation(s)
- Jörg Vogel
- Institute of Cell and Molecular Biology, Biomedical Center, Uppsala University, Box 596, 75124 Uppsala, Sweden.
| | | | | | | | | | | | | |
Collapse
|
124
|
Abstract
Many non-coding sequences transcribed from the mammalian genome are proving to have important regulatory roles, but the functions of the majority remain mysterious. For decades, researchers have focused most of their attention on protein-coding genes and proteins. With the completion of the human and mouse genomes and the accumulation of data on the mammalian transcriptome, the focus now shifts to non-coding DNA sequences, RNA-coding genes and their transcripts. Many non-coding transcribed sequences are proving to have important regulatory roles, but the functions of the majority remain mysterious.
Collapse
Affiliation(s)
- Svetlana A Shabalina
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
| | | |
Collapse
|
125
|
Nagpal S, Karaman MW, Timmerman MM, Ho VV, Pike BL, Hacia JG. Improving the sensitivity and specificity of gene expression analysis in highly related organisms through the use of electronic masks. Nucleic Acids Res 2004; 32:e51. [PMID: 15031318 PMCID: PMC390348 DOI: 10.1093/nar/gnh048] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
DNA microarrays are powerful tools for comparing gene expression profiles from closely related organisms. However, a single microarray design is frequently used in these studies. Therefore, the levels of certain transcripts can be grossly underestimated due to sequence differences between the transcripts and the arrayed DNA probes. Here, we seek to improve the sensitivity and specificity of oligonucleotide microarray-based gene expression analysis by using genomic sequence information to predict the hybridization efficiency of orthologous transcripts to a given microarray. To test our approach, we examine hybridization patterns from three Escherichia coli strains on E.coli K-12 MG1655 gene expression microarrays. We create electronic mask files to discard data from probes predicted to have poor hybridization sensitivity and specificity to cDNA targets from each strain. We increased the accuracy of gene expression analysis and identified genes that cannot be accurately interrogated in each strain using these microarrays. Overall, these studies provide guidelines for designing effective electronic masks for gene expression analysis in organisms where substantial genome sequence information is available.
Collapse
Affiliation(s)
- Shailender Nagpal
- The Institute for Genetic Medicine, University of Southern California, 2250 Alcazar Street, IGM 240, Los Angeles, CA 90089, USA
| | | | | | | | | | | |
Collapse
|
126
|
Lithwick G, Margalit H. Hierarchy of sequence-dependent features associated with prokaryotic translation. Genome Res 2004; 13:2665-73. [PMID: 14656971 PMCID: PMC403808 DOI: 10.1101/gr.1485203] [Citation(s) in RCA: 107] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Protein expression in the cell is affected by various sequence-dependent features. Several such sequence-dependent features have been individually studied,yet they have not been compared quantitatively in terms of their relative influence on protein expression,and a hierarchy of these elements has not been determined. Here we present a quantitative analysis examining sequence-dependent features involved in prokaryotic translation,namely,the base-pairing potential between the mRNA Shine-Dalgarno sequence and the ribosomal RNA,codon bias,and the identity of the stop codon. We analyzed these features both at intra- and intergenomic levels using the Escherichia coli and Haemophilus influenzae genomes. Within each genome,we examined the relationship between each feature and protein expression levels determined by 2D-gel analyses. At the intergenomic level,comparative genomic principles were applied to study the relative preservation of the different sequence-dependent properties between orthologs. From these analyses,we determined that biased codon usage is the property that is most highly associated with protein expression and that is most conserved. The identity of the stop codon and the base-pairing potential of the mRNA Shine-Dalgarno sequence and the rRNA seem to have less of an effect on protein expression.
Collapse
Affiliation(s)
- Gila Lithwick
- Department of Molecular Genetics and Biotechnology, Faculty of Medicine, The Hebrew University, Jerusalem 91120, Israel
| | | |
Collapse
|
127
|
Yamada K, Lim J, Dale JM, Chen H, Shinn P, Palm CJ, Southwick AM, Wu HC, Kim C, Nguyen M, Pham P, Cheuk R, Karlin-Newmann G, Liu SX, Lam B, Sakano H, Wu T, Yu G, Miranda M, Quach HL, Tripp M, Chang CH, Lee JM, Toriumi M, Chan MMH, Tang CC, Onodera CS, Deng JM, Akiyama K, Ansari Y, Arakawa T, Banh J, Banno F, Bowser L, Brooks S, Carninci P, Chao Q, Choy N, Enju A, Goldsmith AD, Gurjal M, Hansen NF, Hayashizaki Y, Johnson-Hopson C, Hsuan VW, Iida K, Karnes M, Khan S, Koesema E, Ishida J, Jiang PX, Jones T, Kawai J, Kamiya A, Meyers C, Nakajima M, Narusaka M, Seki M, Sakurai T, Satou M, Tamse R, Vaysberg M, Wallender EK, Wong C, Yamamura Y, Yuan S, Shinozaki K, Davis RW, Theologis A, Ecker JR. Empirical analysis of transcriptional activity in the Arabidopsis genome. Science 2003; 302:842-6. [PMID: 14593172 DOI: 10.1126/science.1088305] [Citation(s) in RCA: 659] [Impact Index Per Article: 31.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
Functional analysis of a genome requires accurate gene structure information and a complete gene inventory. A dual experimental strategy was used to verify and correct the initial genome sequence annotation of the reference plant Arabidopsis. Sequencing full-length cDNAs and hybridizations using RNA populations from various tissues to a set of high-density oligonucleotide arrays spanning the entire genome allowed the accurate annotation of thousands of gene structures. We identified 5817 novel transcription units, including a substantial amount of antisense gene transcription, and 40 genes within the genetically defined centromeres. This approach resulted in completion of approximately 30% of the Arabidopsis ORFeome as a resource for global functional experimentation of the plant proteome.
Collapse
|
128
|
Allen TE, Herrgård MJ, Liu M, Qiu Y, Glasner JD, Blattner FR, Palsson BØ. Genome-scale analysis of the uses of the Escherichia coli genome: model-driven analysis of heterogeneous data sets. J Bacteriol 2003; 185:6392-9. [PMID: 14563874 PMCID: PMC219383 DOI: 10.1128/jb.185.21.6392-6399.2003] [Citation(s) in RCA: 65] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The recent availability of heterogeneous high-throughput data types has increased the need for scalable in silico methods with which to integrate data related to the processes of regulation, protein synthesis, and metabolism. A sequence-based framework for modeling transcription and translation in prokaryotes has been established and has been extended to study the expression state of the entire Escherichia coli genome. The resulting in silico analysis of the expression state highlighted three facets of gene expression in E. coli: (i) the metabolic resources required for genome expression and protein synthesis were found to be relatively invariant under the conditions tested; (ii) effective promoter strengths were estimated at the genome scale by using global mRNA abundance and half-life data, revealing genes subject to regulation under the experimental conditions tested; and (iii) large-scale genome location-dependent expression patterns with approximately 600-kb periodicity were detected in the E. coli genome based on the 49 expression data sets analyzed. These results support the notion that a structured model-driven analysis of expression data yields additional information that can be subjected to commonly used statistical analyses. The integration of heterogeneous genome-scale data (i.e., sequence, expression data, and mRNA half-life data) is readily achieved in the context of an in silico model.
Collapse
Affiliation(s)
- Timothy E Allen
- Department of Bioengineering, University of California-San Diego, La Jolla, California 92093-0412, USA
| | | | | | | | | | | | | |
Collapse
|
129
|
Kolker E, Purvine S, Picone A, Cherny T, Akerley BJ, Munson RS, Palsson BO, Daines DA, Smith AL. H. influenzae Consortium: integrative study of H. influenzae-human interactions. OMICS : A JOURNAL OF INTEGRATIVE BIOLOGY 2003; 6:341-8. [PMID: 12626093 DOI: 10.1089/153623102321112764] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Developments in high-throughput analysis tools coupled with integrative computational techniques have enabled biological studies to reach new levels. The ability to correlate large volumes of diverse data types into cohesive models of organism function has spawned a new systematic approach to biological investigation. The creation of a new consortium has been proposed to investigate a single organism utilizing these comprehensive approaches. The Haemophilus influenzae Consortium (HIC) would be comprised of five laboratories, each providing separate and complementary areas of expertise in the study of Haemophilus influenzae (HI). The 5-year study proposes to develop coherent models of HI, both as a stand-alone organism, and more importantly, as a human pathogen. Studies in growth condition specificity followed by genomic, metabolic, and proteomic experimentation will be combined and integrated through computational and experimental analyses to form dynamic and predictive models of HI and its responses. Data from the HIC will allow greater understanding of cellular behavior, pathogen-host interactions, bacterial infection, and provide future scientific endeavors with a template for studies of other pathogens.
Collapse
Affiliation(s)
- Eugene Kolker
- BIATECH, 19310 North Creek Parkway, Suite 115, Bothell, WA 98011, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|
130
|
Matte A, Sivaraman J, Ekiel I, Gehring K, Jia Z, Cygler M. Contribution of structural genomics to understanding the biology of Escherichia coli. J Bacteriol 2003; 185:3994-4002. [PMID: 12837772 PMCID: PMC164895 DOI: 10.1128/jb.185.14.3994-4002.2003] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Affiliation(s)
- Allan Matte
- Biotechnology Research Institute, Concordia University Montréal, Québec, Canada.
| | | | | | | | | | | |
Collapse
|
131
|
Li Y, Cole K, Altman S. The effect of a single, temperature-sensitive mutation on global gene expression in Escherichia coli. RNA (NEW YORK, N.Y.) 2003; 9:518-32. [PMID: 12702811 PMCID: PMC1370418 DOI: 10.1261/rna.2198203] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/17/2023]
Abstract
High-density DNA microarrays have been used to explore the genomic profiling of gene expression of a defective Escherichia coli strain with a temperature-sensitive mutation in the protein component of RNase P. A novel gene cluster was discovered in which two of the genes are known substrates of RNase P. The expression pattern of essential genes and gene discovery from intergenic regions, for which other new transcripts are found, are also discussed.
Collapse
Affiliation(s)
- Yong Li
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, Connecticut 06520, USA
| | | | | |
Collapse
|
132
|
Abstract
Microarrays provide a powerful new tool for understanding the regulation of gene expression in bacteria. Many recent publications have used microarrays for identifying regulon members and stimulons that describe the complex organismal responses to environmental perturbations. The use of bioinformatics to identify DNA binding sites of transcription factors greatly facilitates the interpretation of these experiments. Understanding the transcriptome of an organism includes identifying all transcripts and mapping their 5' and 3' ends. High-density oligonucleotide arrays have enabled the identification of many new transcripts, including small RNAs and antisense RNAs.
Collapse
Affiliation(s)
- Virgil A Rhodius
- Department of Microbiology and Immunology, University of California, Genentech Hall, 600 16th Street, San Francisco, CA 94143-2200, USA.
| | | |
Collapse
|
133
|
Hershberg R, Altuvia S, Margalit H. A survey of small RNA-encoding genes in Escherichia coli. Nucleic Acids Res 2003; 31:1813-20. [PMID: 12654996 PMCID: PMC152812 DOI: 10.1093/nar/gkg297] [Citation(s) in RCA: 214] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Small RNA (sRNA) molecules have gained much interest lately, as recent genome-wide studies have shown that they are widespread in a variety of organisms. The relatively small family of 10 known sRNA-encoding genes in Escherichia coli has been significantly expanded during the past two years with the discovery of 45 novel genes. Most of these genes are still uncharacterized and their cellular roles are unknown. In this survey we examined the sequence and genomic features of the 55 currently known sRNA-encoding genes in E.coli, attempting to identify their common characteristics. Such characterization is important for both expanding our understanding of this unique gene family and for improving the methods to predict and identify sRNA-encoding genes based on genomic information.
Collapse
Affiliation(s)
- Ruth Hershberg
- Department of Molecular Genetics and Biotechnology, Faculty of Medicine, The Hebrew University, POB 12272, Jerusalem 91120, Israel
| | | | | |
Collapse
|
134
|
Current Awareness on Comparative and Functional Genomics. Comp Funct Genomics 2003. [PMCID: PMC2447381 DOI: 10.1002/cfg.226] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
|
135
|
Affiliation(s)
- Susan Gottesman
- Laboratory of Molecular Biology, National Cancer Institute, Bethesda, Maryland 20892, USA.
| |
Collapse
|