301
|
Abstract
MYC homeostasis is critical for major cellular and organismal processes. The physiological and pathologic patterns of c-myc transcription are programmed by a large number of cis-elements and transfactors (RNAs and proteins). These elements and factors receive inputs from a multitude of intracellular and extracellular pathways. Because c-myc regulation has customarily been dissected element by element and factor by factor, it has been difficult to appreciate how the c-myc promoter and regulatory sequences operate as a system. A full accounting of the regulation of c-myc transcription will require an understanding of the dynamic interplay of these factors and elements with one another, with chromatin, and with the changes in DNA structure and topology that are inevitably coupled with gene activity.
Collapse
Affiliation(s)
- David Levens
- Laboratory of Pathology, National Cancer Institute, Center for Cancer Research, Bethesda, MD, USA
| |
Collapse
|
302
|
Karginov FV, Cheloufi S, Chong MMW, Stark A, Smith AD, Hannon GJ. Diverse endonucleolytic cleavage sites in the mammalian transcriptome depend upon microRNAs, Drosha, and additional nucleases. Mol Cell 2010; 38:781-8. [PMID: 20620951 DOI: 10.1016/j.molcel.2010.06.001] [Citation(s) in RCA: 150] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2010] [Revised: 05/26/2010] [Accepted: 05/27/2010] [Indexed: 01/22/2023]
Abstract
The life span of a mammalian mRNA is determined, in part, by the binding of regulatory proteins and small RNA-guided complexes. The conserved endonuclease activity of Argonaute2 requires extensive complementarity between a small RNA and its target and is not used by animal microRNAs, which pair with their targets imperfectly. Here we investigate the endonucleolytic function of Ago2 and other nucleases by transcriptome-wide profiling of mRNA cleavage products retaining 5' phosphate groups in mouse embryonic stem cells (mESCs). We detect a prominent signature of Ago2-dependent cleavage events and validate several such targets. Unexpectedly, a broader class of Ago2-independent cleavage sites is also observed, indicating participation of additional nucleases in site-specific mRNA cleavage. Within this class, we identify a cohort of Drosha-dependent mRNA cleavage events that functionally regulate mRNA levels in mESCs, including one in the Dgcr8 mRNA. Together, these results highlight the underappreciated role of endonucleolytic cleavage in controlling mRNA fates in mammals.
Collapse
Affiliation(s)
- Fedor V Karginov
- Watson School of Biological Sciences, Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724, USA
| | | | | | | | | | | |
Collapse
|
303
|
Gowda M, Nunes CC, Sailsbery J, Xue M, Chen F, Nelson CA, Brown DE, Oh Y, Meng S, Mitchell T, Hagedorn CH, Dean RA. Genome-wide characterization of methylguanosine-capped and polyadenylated small RNAs in the rice blast fungus Magnaporthe oryzae. Nucleic Acids Res 2010; 38:7558-69. [PMID: 20660015 PMCID: PMC2995040 DOI: 10.1093/nar/gkq583] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open
Abstract
Small RNAs are well described in higher eukaryotes such as mammals and plants; however, knowledge in simple eukaryotes such as filamentous fungi is limited. In this study, we discovered and characterized methylguanosine-capped and polyadenylated small RNAs (CPA-sRNAs) by using differential RNA selection, full-length cDNA cloning and 454 transcriptome sequencing of the rice blast fungus Magnaporthe oryzae. This fungus causes blast, a devastating disease on rice, the principle food staple for over half the world’s population. CPA-sRNAs mapped primarily to the transcription initiation and termination sites of protein-coding genes and were positively correlated with gene expression, particularly for highly expressed genes including those encoding ribosomal proteins. Numerous CPA-sRNAs also mapped to rRNAs, tRNAs, snRNAs, transposable elements and intergenic regions. Many other 454 sequence reads could not be mapped to the genome; however, inspection revealed evidence for non-template additions and chimeric sequences. CPA-sRNAs were independently confirmed using a high affinity variant of eIF-4E to capture 5′-methylguanosine-capped RNA followed by 3′-RACE sequencing. These results expand the repertoire of small RNAs in filamentous fungi.
Collapse
Affiliation(s)
- Malali Gowda
- Fungal Genomics Laboratory, Center for Integrated Fungal Research, North Carolina State University, Raleigh, NC 27606, USA
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
304
|
|
305
|
Taft RJ, Simons C, Nahkuri S, Oey H, Korbie DJ, Mercer TR, Holst J, Ritchie W, Wong JJL, Rasko JEJ, Rokhsar DS, Degnan BM, Mattick JS. Nuclear-localized tiny RNAs are associated with transcription initiation and splice sites in metazoans. Nat Struct Mol Biol 2010; 17:1030-4. [PMID: 20622877 DOI: 10.1038/nsmb.1841] [Citation(s) in RCA: 134] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2010] [Accepted: 04/27/2010] [Indexed: 12/21/2022]
Abstract
We have recently shown that transcription initiation RNAs (tiRNAs) are derived from sequences immediately downstream of transcription start sites. Here, using cytoplasmic and nuclear small RNA high-throughput sequencing datasets, we report the identification of a second class of nuclear-specific approximately 17- to 18-nucleotide small RNAs whose 3' ends map precisely to the splice donor site of internal exons in animals. These splice-site RNAs (spliRNAs) are associated with highly expressed genes and show evidence of developmental stage- and region-specific expression. We also show that tiRNAs are localized to the nucleus, are enriched at chromatin marks associated with transcription initiation and possess a 3'-nucleotide bias. Additionally, we find that microRNA-offset RNAs (moRNAs), the miR-15/16 cluster previously linked to oncosuppression and most small nucleolar RNA (snoRNA)-derived small RNAs (sdRNAs) are enriched in the nucleus, whereas most miRNAs and two H/ACA sdRNAs are cytoplasmically enriched. We propose that nuclear-localized tiny RNAs are involved in the epigenetic regulation of gene expression.
Collapse
Affiliation(s)
- Ryan J Taft
- Institute for Molecular Bioscience, School of Integrative Biology, University of Queensland, St. Lucia, Australia
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
306
|
Hestand MS, Klingenhoff A, Scherf M, Ariyurek Y, Ramos Y, van Workum W, Suzuki M, Werner T, van Ommen GJB, den Dunnen JT, Harbers M, 't Hoen PAC. Tissue-specific transcript annotation and expression profiling with complementary next-generation sequencing technologies. Nucleic Acids Res 2010; 38:e165. [PMID: 20615900 PMCID: PMC2938216 DOI: 10.1093/nar/gkq602] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
Next-generation sequencing is excellently suited to evaluate the abundance of mRNAs to study gene expression. Here we compare two alternative technologies, cap analysis of gene expression (CAGE) and serial analysis of gene expression (SAGE), for the same RNA samples. Along with quantifying gene expression levels, CAGE can be used to identify tissue-specific transcription start sites, while SAGE monitors 3′-end usage. We used both methods to get more insight into the transcriptional control of myogenesis, studying differential gene expression in differentiated and proliferating C2C12 myoblast cells with statistical evaluation of reproducibility and differential gene expression. Both CAGE and SAGE provided highly reproducible data (Pearson's correlations >0.92 among biological triplicates). With both methods we found around 10 000 genes expressed at levels 2 transcripts per million (0.3 copies per cell), with an overlap of 86%. We identified 4304 and 3846 genes differentially expressed between proliferating and differentiated C2C12 cells by CAGE and SAGE, respectively, with an overlap of 2144. We identified 196 novel regulatory regions with preferential use in proliferating or differentiated cells. Next-generation sequencing of CAGE and SAGE libraries provides consistent expression levels and can enrich current genome annotations with tissue-specific promoters and alternative 3′-UTR usage.
Collapse
Affiliation(s)
- Matthew S Hestand
- The Center for Human and Clinical Genetics, Leiden University Medical Center, 2300 RC Leiden, The Netherlands
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
307
|
Chen LL, Carmichael GG. Long noncoding RNAs in mammalian cells: what, where, and why? WILEY INTERDISCIPLINARY REVIEWS. RNA 2010; 1:2-21. [PMID: 21956903 DOI: 10.1002/wrna.5] [Citation(s) in RCA: 87] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Not all long, polyadenylated cellular RNAs encode polypeptides. In recent years, it has become apparent that a number of organisms express abundant amounts of transcripts that lack open reading frames or that are retained in the nucleus. Rather than accumulating silently in the cell, we now know that many of these long noncoding RNAs (lncRNAs) play important roles in nuclear architecture or in the regulation of gene expression. Here, we discuss some recent progress in our understanding of the functions of a number of important lncRNAs in mammalian cells.
Collapse
Affiliation(s)
- Ling-Ling Chen
- Genetics and Developmental Biology, University of Connecticut Health Center, Farmington, CT 06030, USA.
| | | |
Collapse
|
308
|
Abstract
Integrating results from diverse experiments is an essential process in our effort to understand the logic of complex systems, such as development, homeostasis and responses to the environment. With the advent of high-throughput methods--including genome-wide association (GWA) studies, chromatin immunoprecipitation followed by sequencing (ChIP-seq) and RNA sequencing (RNA-seq)--acquisition of genome-scale data has never been easier. Epigenomics, transcriptomics, proteomics and genomics each provide an insightful, and yet one-dimensional, view of genome function; integrative analysis promises a unified, global view. However, the large amount of information and diverse technology platforms pose multiple challenges for data access and processing. This Review discusses emerging issues and strategies related to data integration in the era of next-generation genomics.
Collapse
Affiliation(s)
- R. David Hawkins
- Ludwig Institute for Cancer Research, Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, CA 92093-0653
| | - Gary C. Hon
- Ludwig Institute for Cancer Research, Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, CA 92093-0653
| | - Bing Ren
- Ludwig Institute for Cancer Research, Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, 9500 Gilman Drive, La Jolla, CA 92093-0653
| |
Collapse
|
309
|
Kanhere A, Viiri K, Araújo CC, Rasaiyaah J, Bouwman RD, Whyte WA, Pereira CF, Brookes E, Walker K, Bell GW, Pombo A, Fisher AG, Young RA, Jenner RG. Short RNAs are transcribed from repressed polycomb target genes and interact with polycomb repressive complex-2. Mol Cell 2010; 38:675-88. [PMID: 20542000 DOI: 10.1016/j.molcel.2010.03.019] [Citation(s) in RCA: 297] [Impact Index Per Article: 21.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2009] [Revised: 01/12/2010] [Accepted: 03/26/2010] [Indexed: 01/13/2023]
Abstract
Polycomb proteins maintain cell identity by repressing the expression of developmental regulators specific for other cell types. Polycomb repressive complex-2 (PRC2) catalyzes trimethylation of histone H3 lysine-27 (H3K27me3). Although repressed, PRC2 targets are generally associated with the transcriptional initiation marker H3K4me3, but the significance of this remains unclear. Here, we identify a class of short RNAs, approximately 50-200 nucleotides in length, transcribed from the 5' end of polycomb target genes in primary T cells and embryonic stem cells. Short RNA transcription is associated with RNA polymerase II and H3K4me3, occurs in the absence of mRNA transcription, and is independent of polycomb activity. Short RNAs form stem-loop structures resembling PRC2 binding sites in Xist, interact with PRC2 through SUZ12, cause gene repression in cis, and are depleted from polycomb target genes activated during cell differentiation. We propose that short RNAs play a role in the association of PRC2 with its target genes.
Collapse
Affiliation(s)
- Aditi Kanhere
- Division of Infection and Immunity, University College London, London W1T 4JF, UK
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
310
|
Plessy C, Bertin N, Takahashi H, Simone R, Salimullah M, Lassmann T, Vitezic M, Severin J, Olivarius S, Lazarevic D, Hornig N, Orlando V, Bell I, Gao H, Dumais J, Kapranov P, Wang H, Davis CA, Gingeras TR, Kawai J, Daub CO, Hayashizaki Y, Gustincich S, Carninci P. Linking promoters to functional transcripts in small samples with nanoCAGE and CAGEscan. Nat Methods 2010; 7:528-34. [PMID: 20543846 PMCID: PMC2906222 DOI: 10.1038/nmeth.1470] [Citation(s) in RCA: 116] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2009] [Accepted: 05/05/2010] [Indexed: 01/18/2023]
Abstract
Large-scale sequencing projects have revealed an unexpected complexity in the origins, structures and functions of mammalian transcripts. Many loci are known to produce overlapping coding and noncoding RNAs with capped 5' ends that vary in size. Methods to identify the 5' ends of transcripts will facilitate the discovery of new promoters and 5' ends derived from secondary capping events. Such methods often require high input amounts of RNA not obtainable from highly refined samples such as tissue microdissections and subcellular fractions. Therefore, we developed nano-cap analysis of gene expression (nanoCAGE), a method that captures the 5' ends of transcripts from as little as 10 ng of total RNA, and CAGEscan, a mate-pair adaptation of nanoCAGE that captures the transcript 5' ends linked to a downstream region. Both of these methods allow further annotation-agnostic studies of the complex human transcriptome.
Collapse
Affiliation(s)
- Charles Plessy
- RIKEN Yokohama Institute, Omics Science Center, Yokohama, Japan.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
311
|
Re-inspection of small RNA sequence datasets reveals several novel human miRNA genes. PLoS One 2010; 5:e10961. [PMID: 20532037 PMCID: PMC2881037 DOI: 10.1371/journal.pone.0010961] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2009] [Accepted: 05/10/2010] [Indexed: 12/29/2022] Open
Abstract
Background miRNAs are key players in gene expression regulation. To fully understand the complex nature of cellular differentiation or initiation and progression of disease, it is important to assess the expression patterns of as many miRNAs as possible. Thereby, identifying novel miRNAs is an essential prerequisite to make possible a comprehensive and coherent understanding of cellular biology. Methodology/Principal Findings Based on two extensive, but previously published, small RNA sequence datasets from human embryonic stem cells and human embroid bodies, respectively [1], we identified 112 novel miRNA-like structures and were able to validate miRNA processing in 12 out of 17 investigated cases. Several miRNA candidates were furthermore substantiated by including additional available small RNA datasets, thereby demonstrating the power of combining datasets to identify miRNAs that otherwise may be assigned as experimental noise. Conclusions/Significance Our analysis highlights that existing datasets are not yet exhaustedly studied and continuous re-analysis of the available data is important to uncover all features of small RNA sequencing.
Collapse
|
312
|
Ni T, Corcoran DL, Rach EA, Song S, Spana EP, Gao Y, Ohler U, Zhu J. A paired-end sequencing strategy to map the complex landscape of transcription initiation. Nat Methods 2010; 7:521-7. [PMID: 20495556 PMCID: PMC3197272 DOI: 10.1038/nmeth.1464] [Citation(s) in RCA: 119] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2009] [Accepted: 04/21/2010] [Indexed: 01/06/2023]
Abstract
Recent high-throughput sequencing protocols have uncovered the complexity of mammalian transcription by RNA polymerase II, helping to define several initiation patterns in which transcription start sites (TSSs) cluster within both narrow and broad genomic windows. Here, we describe a paired-end sequencing strategy, which enables more robust mapping and characterization of capped transcripts. This strategy was applied to explore the transcription initiation landscape in the Drosophila melanogaster embryo. Extending the previous findings in mammals, we found that fly promoters exhibit distinct initiation patterns, which are linked to specific promoter sequence motifs. Furthermore, we identified a large number of 5′ capped transcripts originating from coding exons; analyses support that they are unlikely the result of alternative TSSs, but rather the product of post-transcriptional modifications. Taken together, paired-end TSS analysis is demonstrated to be a powerful method to uncover the transcriptional complexity of eukaryotic genomes.
Collapse
Affiliation(s)
- Ting Ni
- Institute for Genome Sciences and Policy, Duke University Medical Center, Durham, North Carolina, USA
| | | | | | | | | | | | | | | |
Collapse
|
313
|
van Bakel H, Nislow C, Blencowe BJ, Hughes TR. Most "dark matter" transcripts are associated with known genes. PLoS Biol 2010; 8:e1000371. [PMID: 20502517 PMCID: PMC2872640 DOI: 10.1371/journal.pbio.1000371] [Citation(s) in RCA: 330] [Impact Index Per Article: 23.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2009] [Accepted: 04/09/2010] [Indexed: 12/18/2022] Open
Abstract
Short-read RNA sequencing in mouse and human tissues shows that most transcripts are encoded within or nearby known genes and that most of the genome is not transcribed. A series of reports over the last few years have indicated that a much larger portion of the mammalian genome is transcribed than can be accounted for by currently annotated genes, but the quantity and nature of these additional transcripts remains unclear. Here, we have used data from single- and paired-end RNA-Seq and tiling arrays to assess the quantity and composition of transcripts in PolyA+ RNA from human and mouse tissues. Relative to tiling arrays, RNA-Seq identifies many fewer transcribed regions (“seqfrags”) outside known exons and ncRNAs. Most nonexonic seqfrags are in introns, raising the possibility that they are fragments of pre-mRNAs. The chromosomal locations of the majority of intergenic seqfrags in RNA-Seq data are near known genes, consistent with alternative cleavage and polyadenylation site usage, promoter- and terminator-associated transcripts, or new alternative exons; indeed, reads that bridge splice sites identified 4,544 new exons, affecting 3,554 genes. Most of the remaining seqfrags correspond to either single reads that display characteristics of random sampling from a low-level background or several thousand small transcripts (median length = 111 bp) present at higher levels, which also tend to display sequence conservation and originate from regions with open chromatin. We conclude that, while there are bona fide new intergenic transcripts, their number and abundance is generally low in comparison to known exons, and the genome is not as pervasively transcribed as previously reported. The human genome was sequenced a decade ago, but its exact gene composition remains a subject of debate. The number of protein-coding genes is much lower than initially expected, and the number of distinct transcripts is much larger than the number of protein-coding genes. Moreover, the proportion of the genome that is transcribed in any given cell type remains an open question: results from “tiling” microarray analyses suggest that transcription is pervasive and that most of the genome is transcribed, whereas new deep sequencing-based methods suggest that most transcripts originate from known genes. We have addressed this discrepancy by comparing samples from the same tissues using both technologies. Our analyses indicate that RNA sequencing appears more reliable for transcripts with low expression levels, that most transcripts correspond to known genes or are near known genes, and that many transcripts may represent new exons or aberrant products of the transcription process. We also identify several thousand small transcripts that map outside known genes; their sequences are often conserved and are often encoded in regions of open chromatin. We propose that most of these transcripts may be by-products of the activity of enhancers, which associate with promoters as part of their role as long-range gene regulatory sites. Overall, however, we find that most of the genome is not appreciably transcribed.
Collapse
Affiliation(s)
- Harm van Bakel
- Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario, Canada
| | - Corey Nislow
- Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Benjamin J. Blencowe
- Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Timothy R. Hughes
- Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
- * E-mail:
| |
Collapse
|
314
|
Mining mammalian transcript data for functional long non-coding RNAs. PLoS One 2010; 5:e10316. [PMID: 20428234 PMCID: PMC2859052 DOI: 10.1371/journal.pone.0010316] [Citation(s) in RCA: 112] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2009] [Accepted: 03/30/2010] [Indexed: 12/11/2022] Open
Abstract
Background The role of long non-coding RNAs (lncRNAs) in controlling gene expression has garnered increased interest in recent years. Sequencing projects, such as Fantom3 for mouse and H-InvDB for human, have generated abundant data on transcribed components of mammalian cells, the majority of which appear not to be protein-coding. However, much of the non-protein-coding transcriptome could merely be a consequence of ‘transcription noise’. It is therefore essential to use bioinformatic approaches to identify the likely functional candidates in a high throughput manner. Principal Findings We derived a scheme for classifying and annotating likely functional lncRNAs in mammals. Using the available experimental full-length cDNA data sets for human and mouse, we identified 78 lncRNAs that are either syntenically conserved between human and mouse, or that originate from the same protein-coding genes. Of these, 11 have significant sequence homology. We found that these lncRNAs exhibit: (i) patterns of codon substitution typical of non-coding transcripts; (ii) preservation of sequences in distant mammals such as dog and cow, (iii) significant sequence conservation relative to their corresponding flanking regions (in 50% cases, flanking regions do not have homology at all; and in the remaining, the degree of conservation is significantly less); (iv) existence mostly as single-exon forms (8/11); and, (v) presence of conserved and stable secondary structure motifs within them. We further identified orthologous protein-coding genes that are contributing to the pool of lncRNAs; of which, genes implicated in carcinogenesis are significantly over-represented. Conclusion Our comparative mammalian genomics approach coupled with evolutionary analysis identified a small population of conserved long non-protein-coding RNAs (lncRNAs) that are potentially functional across Mammalia. Additionally, our analysis indicates that amongst the orthologous protein-coding genes that produce lncRNAs, those implicated in cancer pathogenesis are significantly over-represented, suggesting that these lncRNAs could play an important role in cancer pathomechanisms.
Collapse
|
315
|
Purvis TL, Hearn T, Spalluto C, Knorz VJ, Hanley KP, Sanchez-Elsner T, Hanley NA, Wilson DI. Transcriptional regulation of the Alström syndrome gene ALMS1 by members of the RFX family and Sp1. Gene 2010; 460:20-9. [PMID: 20381594 PMCID: PMC2913254 DOI: 10.1016/j.gene.2010.03.015] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2009] [Revised: 02/17/2010] [Accepted: 03/31/2010] [Indexed: 01/26/2023]
Abstract
Mutations in the human gene ALMS1 cause Alström syndrome, a disorder characterised by neurosensory degeneration, metabolic defects and cardiomyopathy. ALMS1 encodes a centrosomal protein implicated in the assembly and maintenance of primary cilia. Expression of ALMS1 varies between tissues and recent data suggest that its transcription is modulated during adipogenesis and growth arrest. However the ALMS1 promoter has not been defined. This study focused on identifying and characterising the ALMS1 proximal promoter, initially by using 5' RACE to map transcription start sites. Luciferase reporter assay and EMSA data strongly suggest that ALMS1 transcription is regulated by the ubiquitous factor Sp1. In addition, reporter assay, EMSA, chromatin immunoprecipitation and RNA interference data indicate that ALMS1 transcription is regulated by regulatory factor X (RFX) proteins. These transcription factors are cell-type restricted in their expression profile and known to regulate genes of the ciliogenic pathway. We show binding of RFX proteins to an evolutionarily conserved X-box in the ALMS1 proximal promoter and present evidence that these proteins are responsible for ALMS1 transcription during growth arrest induced by low serum conditions. In summary, this work provides the first data on transcription factors regulating general and context-specific transcription of the disease-associated gene ALMS1.
Collapse
Affiliation(s)
- Tracey L Purvis
- Centre for Human Development, Stem Cells and Regeneration, Human Genetics Division, University of Southampton, UK
| | | | | | | | | | | | | | | |
Collapse
|
316
|
Lau NC. Small RNAs in the animal gonad: guarding genomes and guiding development. Int J Biochem Cell Biol 2010; 42:1334-47. [PMID: 20227517 DOI: 10.1016/j.biocel.2010.03.005] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2009] [Accepted: 03/08/2010] [Indexed: 10/19/2022]
Abstract
Germ cells must safeguard, apportion, package, and deliver their genomes with exquisite precision to ensure proper reproduction and embryonic development. Classical genetic approaches have identified many genes controlling animal germ cell development, but only recently have some of these genes been linked to the RNA interference (RNAi) pathway, a gene silencing mechanism centered on small regulatory RNAs. Germ cells contain microRNAs (miRNAs), endogenous siRNAs (endo-siRNAs), and Piwi-interacting RNAs (piRNAs); these are bound by members of the Piwi/Argonaute protein family. piwi genes were known to specify germ cell development, but we now understand that mutations disrupting germline development can also affect small RNA accumulation. Small RNA studies in germ cells have revealed a surprising diversity of regulatory mechanisms and a unifying function for germline genes in controlling the spread of transposable elements. Future challenges will be to understand the production of germline small RNAs and to identify the full breadth of gene regulation by these RNAs. Progress in this area will likely impact biomedical goals of manipulating stem cells and preventing diseases caused by the transposition of mobile DNA elements.
Collapse
Affiliation(s)
- Nelson C Lau
- Department of Biology, Brandeis University, 415 South Street, Waltham, MA 02454, USA.
| |
Collapse
|
317
|
Abstract
Initial gene discovery efforts through analysis of genome sequences and identification and characterization of expressed RNAs have revealed that only a relatively small portion of the genome is transcribed into protein coding mRNAs in vertebrates. However, in contrast with this paucity of protein coding ‘genes’, there is an enormous complexity in transcription and the protein coding mRNAs contribute to a very small fraction of transcripts in comparison with the different varieties of non-coding RNAs (ncRNAs). This transcriptome complexity may be hypothesized to have a regulatory role that is required for the development and function of organisms as complex as vertebrates. At the same time, it raises the fundamental question of the unequivocal definition of a gene. It is intriguing to postulate that many ncRNAs might finely modulate gene activity by acting as regulatory elements. The emerging hypotheses suggest that the gene regulatory machinery may be deeply interconnected with the world of short RNAs. These RNAs may generally act for fine-tuning of the protein-coding transcriptome.
Collapse
Affiliation(s)
- Piero Carninci
- Omics Science Center, RIKEN Yokohama Institute, Kanagawa, Japan.
| |
Collapse
|
318
|
Taft RJ, Pang KC, Mercer TR, Dinger M, Mattick JS. Non-coding RNAs: regulators of disease. J Pathol 2010; 220:126-39. [PMID: 19882673 DOI: 10.1002/path.2638] [Citation(s) in RCA: 755] [Impact Index Per Article: 53.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
For 50 years the term 'gene' has been synonymous with regions of the genome encoding mRNAs that are translated into protein. However, recent genome-wide studies have shown that the human genome is pervasively transcribed and produces many thousands of regulatory non-protein-coding RNAs (ncRNAs), including microRNAs, small interfering RNAs, PIWI-interacting RNAs and various classes of long ncRNAs. It is now clear that these RNAs fulfil critical roles as transcriptional and post-transcriptional regulators and as guides of chromatin-modifying complexes. Here we review the biology of ncRNAs, focusing on the fundamental mechanisms by which ncRNAs facilitate normal development and physiology and, when dysfunctional, underpin disease. We also discuss evidence that intergenic regions associated with complex diseases express ncRNAs, as well as the potential use of ncRNAs as diagnostic markers and therapeutic targets. Taken together, these observations emphasize the need to move beyond the confines of protein-coding genes and highlight the fact that continued investigation of ncRNA biogenesis and function will be necessary for a comprehensive understanding of human disease.
Collapse
Affiliation(s)
- Ryan J Taft
- Institute for Molecular Bioscience, University of Queensland, Brisbane QLD 4072, Australia
| | | | | | | | | |
Collapse
|
319
|
Dineen DG, Wilm A, Cunningham P, Higgins DG. High DNA melting temperature predicts transcription start site location in human and mouse. Nucleic Acids Res 2010; 37:7360-7. [PMID: 19820114 PMCID: PMC2794178 DOI: 10.1093/nar/gkp821] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
The accurate computational prediction of transcription start sites (TSS) in vertebrate genomes is a difficult problem. The physicochemical properties of DNA can be computed in various ways and a many combinations of DNA features have been tested in the past for use as predictors of transcription. We looked in detail at melting temperature, which measures the temperature, at which two strands of DNA separate, considering the cooperative nature of this process. We find that peaks in melting temperature correspond closely to experimentally determined transcription start sites in human and mouse chromosomes. Using melting temperature alone, and with simple thresholding, we can predict TSS with accuracy that is competitive with the most accurate state-of-the-art TSS prediction methods. Accuracy is measured using both experimentally and manually determined TSS. The method works especially well with CpG island containing promoters, but also works when CpG islands are absent. This result is clear evidence of the important role of the physical properties of DNA in the process of transcription. It also points to the importance for TSS prediction methods to include melting temperature as prior information.
Collapse
Affiliation(s)
- David G Dineen
- Complex and Adaptive Systems Laboratory (CASL), University College Dublin, Belfield, Dublin 4, Ireland.
| | | | | | | |
Collapse
|
320
|
Antisense transcription in gammaretroviruses as a mechanism of insertional activation of host genes. J Virol 2010; 84:3780-8. [PMID: 20130045 DOI: 10.1128/jvi.02088-09] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Transcription of retroviruses is initiated at the U3-R region boundary in the integrated provirus and continues unidirectionally to produce genomic and mRNA products of positive polarity. Several studies have recently demonstrated the existence of naturally occurring protein-encoding transcripts of negative polarity in complex retroviruses. We report here on the identification of transcripts of negative polarity in simple murine leukemia virus (MLV). In T-cell and B-cell lymphomas induced by SL3-3 and Akv MLV, antisense transcripts initiated in the U3 region of the proviral 5' long terminal repeat (LTR) and continued into the cellular proto-oncogenes Jdp2 and Bach2 to create chimeric transcripts consisting of viral and host sequence. The phenomenon was validated in vivo using a knock-in mouse model homozygous for a single LTR at a position known to activate Nras in B-cell lymphomas. A 5' rapid amplification of cDNA ends (RACE) analysis indicated a broad spectrum of initiation sites within the U3 region of the 5' LTR. Our data show for the first time transcriptional activity of negative polarity initiating in the U3 region of simple retroviruses and suggest a novel mechanism of insertional activation of host genes. Elucidation of the nature and potential regulatory role of 5' LTR antisense transcription will be relevant to the design of therapeutic vectors and may contribute to the increasing recognition of pervasive eukaryotic transcription.
Collapse
|
321
|
Hao L, Cai P, Jiang N, Wang H, Chen Q. Identification and characterization of microRNAs and endogenous siRNAs in Schistosoma japonicum. BMC Genomics 2010; 11:55. [PMID: 20092619 PMCID: PMC2820009 DOI: 10.1186/1471-2164-11-55] [Citation(s) in RCA: 71] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2009] [Accepted: 01/21/2010] [Indexed: 01/04/2023] Open
Abstract
BACKGROUND Small endogenous non-coding RNAs (sncRNAs) such as small interfering RNA (siRNA), microRNA and other small RNA transcripts are derived from distinct loci in the genome and play critical roles in RNA-mediated gene silencing mechanisms in plants and metazoa. They are approximately 22 nucleotides long; regulate mRNA stability through perfect or imperfect match to the targets. The biological activities of sncRNAs have been related to many biological events, from resistance to microbe infections to cellular differentiation. The development of the zoonotic parasite Schistosoma japonicum parasite includes multiple steps of morphological alterations and biological differentiations, which provide a unique model for studies on the functions of small RNAs. Characterization of the genome-wide transcription of the sncRNAs will be a major step in understanding of the parasite biology. The objective of this study is to investigate the transcriptional profile and potential function of the small non-coding RNAs in the development of S. japanicum. RESULTS The endogenous siRNAs were found mainly derived from transposable elements (TE) or transposons and the natural antisense transcripts (NAT). In contrast to other organisms, the TE-derived siRNAs in S. japonicum were more predominant than other sncRNAs including microRNAs (miRNAs). Further, there were distinct length and 3'end variations in the sncRNAs, which were associated with the developmental differentiation of the parasite. Among the identified miRNA transcripts, there were 38 unique to S. japonicum and 16 that belonged to 13 miRNA families are common to other metazoan lineages. These miRNAs were either ubiquitously expressed, or they exhibited specific expression patterns related to the developmental stages or sex. Genes that encoded miRNAs are mainly located in clusters within the genome of S. japonicum. However, genes within one cluster could be differentially transcribed, which suggested that individual genes might be regulated by distinct mechanisms during parasite development. CONCLUSIONS Many miRNA and endogenous siRNA transcripts were identified in S. japonicum and the amount of siRNA was at least 4.4 and 1.6 times more than that of miRNA in both schistosomulum and adult worm stages respectively. SiRNAs are mainly derived from transposable elements (or transposons); while natural antisense transcripts (NAT)-derived siRNAs were much less. A majority of miRNA transcripts identified in the parasite were species-specific and the expression of certain miRNAs was found developmentally regulated. Both miRNA and siRNAs are potentially important regulators in the development of schistosomal parasites.
Collapse
Affiliation(s)
- Lili Hao
- Laboratory of Parasitology, Institute of Pathogen Biology/Institute of Basic Medicine, Chinese Academy of Medical Sciences, Beijing 100730, PR China
| | | | | | | | | |
Collapse
|
322
|
Sibley CR, Seow Y, Wood MJA. Novel RNA-based strategies for therapeutic gene silencing. Mol Ther 2010; 18:466-76. [PMID: 20087319 DOI: 10.1038/mt.2009.306] [Citation(s) in RCA: 104] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
The past decade has seen intense scientific interest in non-coding RNAs. In particular, the discovery and subsequent exploitation of gene silencing via RNA interference (RNAi) has revolutionized the way in which gene expression is now studied and understood. It is now well established that post-transcriptional gene silencing (PTGS) by the microRNA (miRNA) and other RNAi-associated pathways represents an essential layer of complexity to gene regulation. Gene silencing using RNAi additionally demonstrates huge potential as a therapeutic strategy for eliminating pathogenic gene expression. Yet despite the early promise and excitement of gene-specific silencing, several critical hurdles remain to be overcome before widespread clinical adoption. These include off-target effects, toxicity due to saturation of the endogenous RNAi functions, limited duration of silencing, and effective targeted delivery. In recent years, a range of novel strategies for producing RNA-mediated silencing have been developed that can circumvent many of these hurdles, including small internally segmented interfering RNAs, tandem hairpin RNAs, and pri-miRNA cluster mimics. This review discusses RNA-mediated silencing in light of this recent research, and highlights the benefits and limitations conferred by these novel gene-silencing strategies.
Collapse
Affiliation(s)
- Christopher R Sibley
- Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, UK
| | | | | |
Collapse
|
323
|
Nechaev S, Fargo DC, dos Santos G, Liu L, Gao Y, Adelman K. Global analysis of short RNAs reveals widespread promoter-proximal stalling and arrest of Pol II in Drosophila. Science 2009; 327:335-8. [PMID: 20007866 DOI: 10.1126/science.1181421] [Citation(s) in RCA: 327] [Impact Index Per Article: 21.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
Emerging evidence indicates that gene expression in higher organisms is regulated by RNA polymerase II stalling during early transcription elongation. To probe the mechanisms responsible for this regulation, we developed methods to isolate and characterize short RNAs derived from stalled RNA polymerase II in Drosophila cells. Significant levels of these short RNAs were generated from more than one-third of all genes, indicating that promoter-proximal stalling is a general feature of early polymerase elongation. Nucleotide composition of the initially transcribed sequence played an important role in promoting transcriptional stalling by rendering polymerase elongation complexes highly susceptible to backtracking and arrest. These results indicate that the intrinsic efficiency of early elongation can greatly affect gene expression.
Collapse
Affiliation(s)
- Sergei Nechaev
- Laboratory of Molecular Carcinogenesis, National Institute of Environmental Health Sciences, National Institutes of Health, Research Triangle Park, NC 27709, USA
| | | | | | | | | | | |
Collapse
|
324
|
Yang JH, Shao P, Zhou H, Chen YQ, Qu LH. deepBase: a database for deeply annotating and mining deep sequencing data. Nucleic Acids Res 2009; 38:D123-30. [PMID: 19966272 PMCID: PMC2808990 DOI: 10.1093/nar/gkp943] [Citation(s) in RCA: 126] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
Advances in high-throughput next-generation sequencing technology have reshaped the transcriptomic research landscape. However, exploration of these massive data remains a daunting challenge. In this study, we describe a novel database, deepBase, which we have developed to facilitate the comprehensive annotation and discovery of small RNAs from transcriptomic data. The current release of deepBase contains deep sequencing data from 185 small RNA libraries from diverse tissues and cell lines of seven organisms: human, mouse, chicken, Ciona intestinalis, Drosophila melanogaster, Caenhorhabditis elegans and Arabidopsis thaliana. By analyzing ∼14.6 million unique reads that perfectly mapped to more than 284 million genomic loci, we annotated and identified ∼380 000 unique ncRNA-associated small RNAs (nasRNAs), ∼1.5 million unique promoter-associated small RNAs (pasRNAs), ∼4.0 million unique exon-associated small RNAs (easRNAs) and ∼6 million unique repeat-associated small RNAs (rasRNAs). Furthermore, 2038 miRNA and 1889 snoRNA candidates were predicted by miRDeep and snoSeeker. All of the mapped reads can be grouped into about 1.2 million RNA clusters. For the purpose of comparative analysis, deepBase provides an integrative, interactive and versatile display. A convenient search option, related publications and other useful information are also provided for further investigation. deepBase is available at: http://deepbase.sysu.edu.cn/.
Collapse
Affiliation(s)
- Jian-Hua Yang
- Key Laboratory of Gene Engineering of the Ministry of Education, State Key Laboratory for Biocontrol, Sun Yat-sen University, Guangzhou 510275, PR China
| | | | | | | | | |
Collapse
|
325
|
|
326
|
Jacquier A. The complex eukaryotic transcriptome: unexpected pervasive transcription and novel small RNAs. Nat Rev Genet 2009; 10:833-44. [PMID: 19920851 DOI: 10.1038/nrg2683] [Citation(s) in RCA: 318] [Impact Index Per Article: 21.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Over the past few years, techniques have been developed that have allowed the study of transcriptomes without bias from previous genome annotations, which has led to the discovery of a plethora of unexpected RNAs that have no obvious coding capacities. There are many different kinds of products that are generated by this pervasive transcription; this Review focuses on small non-coding RNAs (ncRNAs) that have been found to be associated with promoters in eukaryotes from animals to yeast. After comparing the different classes of such ncRNAs described in various studies, the Review discusses how the models proposed for their origins and their possible functions challenge previous views of the basic transcription process and its regulation.
Collapse
Affiliation(s)
- Alain Jacquier
- Unité de Génétique des Interactions Macromoléculaires, Institut Pasteur, Centre National de la Recherche Scientifique URA2171, 25 Rue du Dr Roux, F-75015, Paris, France.
| |
Collapse
|
327
|
Forrest ARR, Abdelhamid RF, Carninci P. Annotating non-coding transcription using functional genomics strategies. BRIEFINGS IN FUNCTIONAL GENOMICS AND PROTEOMICS 2009; 8:437-43. [PMID: 19833699 DOI: 10.1093/bfgp/elp041] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Non-coding RNA (ncRNA) transcripts are RNA molecules that do not code for proteins, but elicit function by other mechanisms. The vast majority of RNA produced in a cell is non-coding ribosomal RNA, produced from relatively few loci, however more recently complementary DNA (cDNA) cloning, tag sequencing, and genome tiling array studies suggest that ncRNAs also account for the majority of RNA species produced by a cell. ncRNA based regulation has been referred to as a 'hidden layer' of signals or 'dark matter' that control gene expression in cellular processes by poorly described mechanisms. These terms have appeared as ncRNAs until recently have been ignored by expression profiling and cDNA annotation projects and their mode of action is diverse (e.g. influencing chromatin structure and epigenetics, translational silencing, transcriptional silencing). Here, we highlight recent functional genomics strategies toward identifying and assigning function to ncRNA transcription.
Collapse
Affiliation(s)
- Alistair R R Forrest
- Omics Science Center, RIKEN Yokohama Institute, Yokohama, Kanagawa 230-0045 Japan
| | | | | |
Collapse
|
328
|
van Bakel H, Hughes TR. Establishing legitimacy and function in the new transcriptome. BRIEFINGS IN FUNCTIONAL GENOMICS AND PROTEOMICS 2009; 8:424-36. [PMID: 19833698 DOI: 10.1093/bfgp/elp037] [Citation(s) in RCA: 60] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
The last decade has seen an explosion of interest in new classes of non-coding RNA. While some are now firmly established as new categories of legitimate functional RNAs, the purpose and even existence of others remain to be solidified. Here, we discuss the challenges associated with discovery and characterization of non-traditional categories of non-coding RNA.
Collapse
Affiliation(s)
- Harm van Bakel
- Banting and Best Department of Medical Research, University of Toronto, Toronto, ON M5S 3E1, Canada
| | | |
Collapse
|
329
|
Affiliation(s)
- Anne-Claude Gingras
- Samuel Lunenfeld Research Institute at Mount Sinai Hospital, Toronto, Ontario, Canada.
| |
Collapse
|
330
|
Hoeppner MP, White S, Jeffares DC, Poole AM. Evolutionarily stable association of intronic snoRNAs and microRNAs with their host genes. Genome Biol Evol 2009; 1:420-8. [PMID: 20333211 PMCID: PMC2817437 DOI: 10.1093/gbe/evp045] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/31/2009] [Indexed: 12/20/2022] Open
Abstract
Small nucleolar RNAs (snoRNAs) and microRNAs (miRNAs) are integral to a range of processes, including ribosome biogenesis and gene regulation. Some are intron encoded, and this organization may facilitate coordinated coexpression of host gene and RNA. However, snoRNAs and miRNAs are known to be mobile, so intron-RNA associations may not be evolutionarily stable. We have used genome alignments across 11 mammals plus chicken to examine positional orthology of snoRNAs and miRNAs and report that 21% of annotated snoRNAs and 11% of miRNAs are positionally conserved across mammals. Among RNAs traceable to the bird–mammal common ancestor, 98% of snoRNAs and 76% of miRNAs are intronic. Comparison of the most evolutionarily stable mammalian intronic snoRNAs with those positionally conserved among primates reveals that the former are more overrepresented among host genes involved in translation or ribosome biogenesis and are more broadly and highly expressed. This stability is likely attributable to a requirement for overlap between host gene and intronic snoRNA expression profiles, consistent with an ancestral role in ribosome biogenesis. In contrast, whereas miRNA positional conservation is comparable to that observed for snoRNAs, intronic miRNAs show no obvious association with host genes of a particular functional category, and no statistically significant differences in host gene expression are found between those traceable to mammalian or primate ancestors. Our results indicate evolutionarily stable associations of numerous intronic snoRNAs and miRNAs and their host genes, with probable continued diversification of snoRNA function from an ancestral role in ribosome biogenesis.
Collapse
Affiliation(s)
- Marc P Hoeppner
- Department of Molecular Biology and Functional Genomics, Stockholm University, SE-106 91 Stockholm, Sweden
| | | | | | | |
Collapse
|
331
|
Dinger ME, Amaral PP, Mercer TR, Mattick JS. Pervasive transcription of the eukaryotic genome: functional indices and conceptual implications. BRIEFINGS IN FUNCTIONAL GENOMICS AND PROTEOMICS 2009; 8:407-23. [PMID: 19770204 DOI: 10.1093/bfgp/elp038] [Citation(s) in RCA: 122] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Genome-wide analyses of the eukaryotic transcriptome have revealed that the majority of the genome is transcribed, producing large numbers of non-protein-coding RNAs (ncRNAs). This surprising observation challenges many assumptions about the genetic programming of higher organisms and how information is stored and organized within the genome. Moreover, the rapid advances in genomics have given little opportunity for biologists to integrate these emerging findings into their intellectual and experimental frameworks. This problem has been compounded by the perception that genome-wide studies often generate more questions than answers, which in turn has led to confusion and controversy. In this article, we address common questions associated with the phenomenon of pervasive transcription and consider the indices that can be used to evaluate the function (or lack thereof) of the resulting ncRNAs. We suggest that many lines of evidence, including expression profiles, conservation signatures, chromatin modification patterns and examination of increasing numbers of individual cases, argue in favour of the widespread functionality of non-coding transcription. We also discuss how informatic and experimental approaches used to analyse protein-coding genes may not be applicable to ncRNAs and how the general perception that protein-coding genes form the main informational output of the genome has resulted in much of the misunderstanding surrounding pervasive transcription and its potential significance. Finally, we present the conceptual implications of the majority of the eukaryotic genome being functional and describe how appreciating this perspective will provide considerable opportunity to further understand the molecular basis of development and complex diseases.
Collapse
Affiliation(s)
- Marcel E Dinger
- Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD 4072, Australia
| | | | | | | |
Collapse
|
332
|
St Laurent G, Savva YA, Reenan R. Enhancing non-coding RNA information content with ADAR editing. Neurosci Lett 2009; 466:89-98. [PMID: 19751800 DOI: 10.1016/j.neulet.2009.09.009] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2009] [Revised: 09/02/2009] [Accepted: 09/05/2009] [Indexed: 10/20/2022]
Abstract
The depth and complexity of the non-coding transcriptome in nervous system tissues provides a rich substrate for adenosine de-amination acting on RNA (ADAR). Non-coding RNAs (ncRNAs) serve diverse regulatory and computational functions, coupling signal flow from the environment to evolutionarily coded analog and digital information elements within the transcriptome. We present a perspective of ADARs interaction with the non-coding transcriptome as a computational matrix, enhancing the information processing power of the cell, adding flexibility, rapid response, and fine tuning to critical pathways. Dramatic increases in ADAR activity during stress response and inflammation result in powerful information processing events that change the functional state of the cell. This review examines the pathways and mechanisms of ADAR interaction with the non-coding transcriptome, and their functional consequences for information processing in nervous system tissues.
Collapse
Affiliation(s)
- Georges St Laurent
- Department of Molecular Biology, Cell Biology, and Biochemistry, Brown University, Providence, RI 02912, USA
| | | | | |
Collapse
|
333
|
Genomic analysis suggests that mRNA destabilization by the microprocessor is specialized for the auto-regulation of Dgcr8. PLoS One 2009; 4:e6971. [PMID: 19759829 PMCID: PMC2736397 DOI: 10.1371/journal.pone.0006971] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2009] [Accepted: 08/11/2009] [Indexed: 01/22/2023] Open
Abstract
BACKGROUND The Microprocessor, containing the RNA binding protein Dgcr8 and RNase III enzyme Drosha, is responsible for processing primary microRNAs to precursor microRNAs. The Microprocessor regulates its own levels by cleaving hairpins in the 5'UTR and coding region of the Dgcr8 mRNA, thereby destabilizing the mature transcript. METHODOLOGY/PRINCIPAL FINDINGS To determine whether the Microprocessor has a broader role in directly regulating other coding mRNA levels, we integrated results from expression profiling and ultra high-throughput deep sequencing of small RNAs. Expression analysis of mRNAs in wild-type, Dgcr8 knockout, and Dicer knockout mouse embryonic stem (ES) cells uncovered mRNAs that were specifically upregulated in the Dgcr8 null background. A number of these transcripts had evolutionarily conserved predicted hairpin targets for the Microprocessor. However, analysis of deep sequencing data of 18 to 200nt small RNAs in mouse ES, HeLa, and HepG2 indicates that exonic sequence reads that map in a pattern consistent with Microprocessor activity are unique to Dgcr8. CONCLUSION/SIGNIFICANCE We conclude that the Microprocessor's role in directly destabilizing coding mRNAs is likely specifically targeted to Dgcr8 itself, suggesting a specialized cellular mechanism for gene auto-regulation.
Collapse
|
334
|
Schoenberg DR, Maquat LE. Re-capping the message. Trends Biochem Sci 2009; 34:435-42. [PMID: 19729311 DOI: 10.1016/j.tibs.2009.05.003] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2009] [Revised: 05/13/2009] [Accepted: 05/13/2009] [Indexed: 10/20/2022]
Abstract
The 5'-cap structure that typifies all polymerase II-transcribed RNAs plays important roles in pre-mRNA processing and mRNA export, translation and quality control. Removal of the cap is a regulated process that is considered to be the first irreversible step in mRNA decay. An emerging view challenges this idea: mRNAs have been identified in mammalian cells that lack sequences from their 5' ends but nevertheless appear to be modified with a cap or cap-like structure. Furthermore, a cytoplasmic form of capping enzyme was recently identified that, together with a novel kinase, generates capped ends from cleaved RNAs. These and other findings provide evidence for re-capping and its possible functions.
Collapse
Affiliation(s)
- Daniel R Schoenberg
- Department of Molecular and Cellular Biochemistry and Center for RNA Biology, The Ohio State University, Columbus, OH 43210, USA.
| | | |
Collapse
|
335
|
The plasticity of the mammalian transcriptome. Genomics 2009; 95:1-6. [PMID: 19716875 DOI: 10.1016/j.ygeno.2009.08.010] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2009] [Revised: 08/05/2009] [Accepted: 08/22/2009] [Indexed: 11/28/2022]
Abstract
The dogmatic view of RNA as a mere necessity in the transfer of information between DNA and proteins has during recent years come into question. Novel approaches and new technology has revealed an unprecedented level of inherent complexity in the mammalian transcriptome. Here, the majority of nucleotides are expressed, in sharp contrast to the approximately 1.2% of the human genome harboring protein coding information. Also, >50% of genomic loci contain antisense and interleaved transcription, a conservative estimate since non-coding RNA is highly regulated between tissues and developmental stages, which has only been investigated to a limited extent. Subsequent focus on RNA with no coding potential has revealed numerous species with novel functions, and deep sequencing studies imply that many remain to be discovered. This review gives an overview of the plasticity and dynamics of the mammalian transcriptome and the prevailing interpretation of its effect on the complexity of species.
Collapse
|
336
|
Abstract
Disruptions in local chromatin structure often indicate features of biological interest such as regulatory regions. We find that sonication of cross-linked chromatin, when combined with a size-selection step and massively parallel short-read sequencing, can be used as a method (Sono-Seq) to map locations of high chromatin accessibility in promoter regions. Sono-Seq sites frequently correspond to actively transcribed promoter regions, as evidenced by their co-association with RNA Polymerase II ChIP regions, transcription start sites, histone H3 lysine 4 trimethylation (H3K4me3) marks, and CpG islands; signals over other sites, such as those bound by the CTCF insulator, are also observed. The pattern of breakage by Sono-Seq overlaps with, but is distinct from, that observed for FAIRE and DNase I hypersensitive sites. Our results demonstrate that Sono-Seq can be a useful and simple method by which to map many local alterations in chromatin structure. Furthermore, our results provide insights into the mapping of binding sites by using ChIP-Seq experiments and the value of reference samples that should be used in such experiments.
Collapse
|
337
|
Hartig JV, Esslinger S, Böttcher R, Saito K, Förstemann K. Endo-siRNAs depend on a new isoform of loquacious and target artificially introduced, high-copy sequences. EMBO J 2009; 28:2932-44. [PMID: 19644447 DOI: 10.1038/emboj.2009.220] [Citation(s) in RCA: 78] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2009] [Accepted: 07/09/2009] [Indexed: 12/20/2022] Open
Abstract
Colonization of genomes by a new selfish genetic element is detrimental to the host species and must lead to an efficient, repressive response. In vertebrates as well as in Drosophila, piRNAs repress transposons in the germ line, whereas endogenous siRNAs take on this role in somatic cells. We show that their biogenesis depends on a new isoform of the Drosophila TRBP homologue loquacious, which arises by alternative polyadenylation and is distinct from the one that functions during the biogenesis of miRNAs. For endo-siRNAs and piRNAs, it is unclear how an efficient response can be initiated de novo. Our experiments establish that the endo-siRNA pathway will target artificially introduced sequences without the need for a pre-existing template in the genome. This response is also triggered in transiently transfected cells, thus genomic integration is not essential. Deep sequencing showed that corresponding endo-siRNAs are generated throughout the sequence, but preferentially from transcribed regions. One strand of the dsRNA precursor can come from spliced mRNA, whereas the opposite strand derives from independent transcripts in antisense orientation.
Collapse
Affiliation(s)
- Julia Verena Hartig
- Department of Chemistry and Biochemistry, Gene Center, Ludwig-Maximilians-Universität München, Munich, Germany
| | | | | | | | | |
Collapse
|
338
|
Xu M, Medvedev S, Yang J, Hecht NB. MIWI-independent small RNAs (MSY-RNAs) bind to the RNA-binding protein, MSY2, in male germ cells. Proc Natl Acad Sci U S A 2009; 106:12371-6. [PMID: 19597149 PMCID: PMC2718373 DOI: 10.1073/pnas.0903944106] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2009] [Indexed: 11/18/2022] Open
Abstract
The germ cell-specific DNA/RNA-binding protein MSY2 binds small RNAs (MSY-RNAs) that are approximately 25-31 nt in length, often initiate with a 5' adenine, and are expressed in both germ cells and somatic cells. MSY-RNA levels do not decrease in Miwi or Msy2 null mice. Most MSY-RNAs map within annotated genes, but some are PIWI-interacting RNA (piRNA)-like and map to piRNA clusters. MSY-RNAs are in both nuclei and cytoplasm. In nuclei, MSY-RNAs are enriched in chromatin, and in the cytoplasm they are detected in both ribonucleoproteins and polysomes.
Collapse
Affiliation(s)
- Mingang Xu
- Center for Research on Reproduction and Women's Health, Department of Obstetrics and Gynecology, University of Pennsylvania School of Medicine, Philadelphia, PA 19104; and
| | - Sergey Medvedev
- Department of Biology, University of Pennsylvania, Philadelphia, PA 19104
| | - Juxiang Yang
- Center for Research on Reproduction and Women's Health, Department of Obstetrics and Gynecology, University of Pennsylvania School of Medicine, Philadelphia, PA 19104; and
| | - Norman B. Hecht
- Center for Research on Reproduction and Women's Health, Department of Obstetrics and Gynecology, University of Pennsylvania School of Medicine, Philadelphia, PA 19104; and
| |
Collapse
|
339
|
Glazov EA, Kongsuwan K, Assavalapsakul W, Horwood PF, Mitter N, Mahony TJ. Repertoire of bovine miRNA and miRNA-like small regulatory RNAs expressed upon viral infection. PLoS One 2009; 4:e6349. [PMID: 19633723 PMCID: PMC2713767 DOI: 10.1371/journal.pone.0006349] [Citation(s) in RCA: 76] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2009] [Accepted: 06/17/2009] [Indexed: 12/21/2022] Open
Abstract
MicroRNA (miRNA) and other types of small regulatory RNAs play a crucial role in the regulation of gene expression in eukaryotes. Several distinct classes of small regulatory RNAs have been discovered in recent years. To extend the repertoire of small RNAs characterized in mammals and to examine relationship between host miRNA expression and viral infection we used Illumina's ultrahigh throughput sequencing approach. We sequenced three small RNA libraries prepared from cell line derived from the adult bovine kidney under normal conditions and upon infection of the cell line with Bovine herpesvirus 1. We used a bioinformatics approach to distinguish authentic mature miRNA sequences from other classes of small RNAs and short RNA fragments represented in the sequencing data. Using this approach we detected 219 out of 356 known bovine miRNAs and 115 respective miRNA* sequences. In addition we identified five new bovine orthologs of known mammalian miRNAs and discovered 268 new cow miRNAs many of which are not identifiable in other mammalian genomes and thus might be specific to the ruminant lineage. In addition we found seven new bovine mirtron candidates. We also discovered 10 small nucleolar RNA (snoRNA) loci that give rise to small RNA with possible miRNA-like function. Results presented in this study extend our knowledge of the biology and evolution of small regulatory RNAs in mammals and illuminate mechanisms of small RNA biogenesis and function. New miRNA sequences and the original sequencing data have been submitted to miRNA repository (miRBase) and NCBI GEO archive respectively. We envisage that these resources will facilitate functional annotation of the bovine genome and promote further functional and comparative genomics studies of small regulatory RNA in mammals.
Collapse
Affiliation(s)
- Evgeny A. Glazov
- Diamantina Institute for Cancer, Immunology and Metabolic Medicine, The University of Queensland, Princess Alexandra Hospital, Woolloongabba, Queensland, Australia
- * E-mail: (EAG); (TJM)
| | - Kritaya Kongsuwan
- CSIRO Livestock Industries, Queensland Bioscience Precinct, St Lucia, Queensland, Australia
| | - Wanchai Assavalapsakul
- Department of Microbiology, Faculty of Science, Chulalongkorn University, Phayathai, Bangkok, Thailand
| | - Paul F. Horwood
- Department of Primary Industries and Fisheries, Ritchie Building, Brisbane, Queensland, Australia
| | - Neena Mitter
- Department of Primary Industries and Fisheries, Ritchie Building, Brisbane, Queensland, Australia
| | - Timothy J. Mahony
- Department of Primary Industries and Fisheries, Ritchie Building, Brisbane, Queensland, Australia
- School of Veterinary Sciences, University of Queensland, St Lucia, Queensland, Australia
- * E-mail: (EAG); (TJM)
| |
Collapse
|
340
|
Camblong J, Beyrouthy N, Guffanti E, Schlaepfer G, Steinmetz LM, Stutz F. Trans-acting antisense RNAs mediate transcriptional gene cosuppression in S. cerevisiae. Genes Dev 2009; 23:1534-45. [PMID: 19571181 DOI: 10.1101/gad.522509] [Citation(s) in RCA: 125] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Homology-dependent gene silencing, a phenomenon described as cosuppression in plants, depends on siRNAs. We provide evidence that in Saccharomyces cerevisiae, which is missing the RNAi machinery, protein coding gene cosuppression exists. Indeed, introduction of an additional copy of PHO84 on a plasmid or within the genome results in the cosilencing of both the transgene and the endogenous gene. This repression is transcriptional and position-independent and requires trans-acting antisense RNAs. Antisense RNAs induce transcriptional gene silencing both in cis and in trans, and the two pathways differ by the implication of the Hda1/2/3 complex. We also show that trans-silencing is influenced by the Set1 histone methyltransferase, which promotes antisense RNA production. Finally we show that although antisense-mediated cis-silencing occurs in other genes, trans-silencing so far depends on features specific to PHO84. All together our data highlight the importance of noncoding RNAs in mediating RNAi-independent transcriptional gene silencing.
Collapse
Affiliation(s)
- Jurgi Camblong
- Department of Cell Biology, University of Geneva, Geneva, Switzerland
| | | | | | | | | | | |
Collapse
|
341
|
Wilusz JE, Sunwoo H, Spector DL. Long noncoding RNAs: functional surprises from the RNA world. Genes Dev 2009; 23:1494-504. [PMID: 19571179 DOI: 10.1101/gad.1800909] [Citation(s) in RCA: 1811] [Impact Index Per Article: 120.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Most of the eukaryotic genome is transcribed, yielding a complex network of transcripts that includes tens of thousands of long noncoding RNAs with little or no protein-coding capacity. Although the vast majority of long noncoding RNAs have yet to be characterized thoroughly, many of these transcripts are unlikely to represent transcriptional "noise" as a significant number have been shown to exhibit cell type-specific expression, localization to subcellular compartments, and association with human diseases. Here, we highlight recent efforts that have identified a myriad of molecular functions for long noncoding RNAs. In some cases, it appears that simply the act of noncoding RNA transcription is sufficient to positively or negatively affect the expression of nearby genes. However, in many cases, the long noncoding RNAs themselves serve key regulatory roles that were assumed previously to be reserved for proteins, such as regulating the activity or localization of proteins and serving as organizational frameworks of subcellular structures. In addition, many long noncoding RNAs are processed to yield small RNAs or, conversely, modulate how other RNAs are processed. It is thus becoming increasingly clear that long noncoding RNAs can function via numerous paradigms and are key regulatory molecules in the cell.
Collapse
Affiliation(s)
- Jeremy E Wilusz
- Watson School of Biological Sciences, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | | | | |
Collapse
|
342
|
Smith AE, Chronis C, Christodoulakis M, Orr SJ, Lea NC, Twine NA, Bhinge A, Mufti GJ, Thomas NSB. Epigenetics of human T cells during the G0-->G1 transition. Genome Res 2009; 19:1325-37. [PMID: 19546172 DOI: 10.1101/gr.085530.108] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
We investigated functional epigenetic changes that occur in primary human T lymphocytes during entry into the cell cycle and mapped these at the single-nucleosome level by ChIP-chip on tiling arrays for chromosomes 1 and 6. We show that nucleosome loss and flanking active histone marks define active transcriptional start sites (TSSs). Moreover, these signatures are already set at many inducible genes in quiescent cells prior to cell stimulation. In contrast, there is a dearth of the inactive histone mark H3K9me3 at the TSS, and under-representation of H3K9me2 and H3K9me3 defines the body of active genes. At the DNA level, cytosine methylation (meC) is enriched for nucleosomes that remain at the TSS, whereas in general there is a dearth of meC at TSSs. Furthermore, a drop in meC also marks 3' transcription termination, and a peak of meC occurs at stop codons. This mimics the 3' nucleosomal distribution in yeast, which we show does not occur in human T cells.
Collapse
Affiliation(s)
- Alexander E Smith
- King's College London, Department of Haematological Medicine, Leukaemia Sciences Laboratories, Rayne Institute, London, United Kingdom
| | | | | | | | | | | | | | | | | |
Collapse
|
343
|
Belostotsky D. Exosome complex and pervasive transcription in eukaryotic genomes. Curr Opin Cell Biol 2009; 21:352-8. [DOI: 10.1016/j.ceb.2009.04.011] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2009] [Revised: 04/19/2009] [Accepted: 04/20/2009] [Indexed: 12/27/2022]
|
344
|
Forte GI, Scola L, Bellavia D, Vaccarino L, Sanacore M, Sisino G, Scazzone C, Caruso C, Barbieri R, Lio D. Characterization of two alternative Interleukin(IL)-10 5'UTR mRNA sequences, induced by lipopolysaccharide (LPS) stimulation of peripheral blood mononuclear cells. Mol Immunol 2009; 46:2161-6. [PMID: 19477525 DOI: 10.1016/j.molimm.2009.04.034] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2009] [Revised: 04/20/2009] [Accepted: 04/29/2009] [Indexed: 11/28/2022]
Abstract
IL-10 production shows a broad-spectrum of individual response, suggesting a genetic component of approximately 75%. Different polymorphisms located close to, or within the IL-10 gene has been demonstrated to influence its transcription rate whereas the post-transcriptional regulation of IL-10 production has not well elucidated. The main responsible elements at this control level are both the 5'- and 3'-untranslated regions (UTR's) of mRNAs, and as the 3'-UTR regions are mainly involved in the stability and decay rate of mRNAs, the 5'-UTR regions mediate the binding rate of the molecule with ribosomal 40S subunit as a cis-acting element. Herein are report data on the identification of two IL10 mRNA that differ by the length of respective 5'UTR regions (160 and 288 nucleotides, respectively; EMBL accession nrs: EU751618 and EU751619) produced after stimulation of human blood samples with bacterial lipopolysaccharide (LPS). The longer 5'UTR is constitutively expressed in unstimulated PBMC cells cultured at 37 degrees C for 24h, while in LPS stimulated cells an additional IL-10 mRNA molecule, containing a shorter 5'UTR, is synthesized. RNADRAW software (http://www.rnadraw.com/) analysis have indicated that the secondary structures of the shorter 5'UTR IL-10 mRNA region is more available for the binding to the 40S ribosomal subunit. In conclusion, our data seem to suggest that LPS could influence the post-transcriptional control of IL-10 production inducing an alternative mRNA immediately available in response to the inflammatory stimulation.
Collapse
Affiliation(s)
- Giusi Irma Forte
- Clinical Pathology, Section of General Pathology, Department of Pathobiology and Biomedical Methodology, University of Palermo, Italy
| | | | | | | | | | | | | | | | | | | |
Collapse
|
345
|
Abstract
A large amount of experimental data collected over the last decade has shown that genomic organization is very complex and has highlighted the fact that the current set of gene annotations does not fully capture this complexity. Much of the RNA detected in a cell is found to originate from outside the exons of annotated genes. Exons of annotated and unannotated transcripts separated by large genomic distances can be joined together in chimeric transcripts. Any given base-pair in a genome could be traversed by many protein-coding and non-coding RNAs. We discuss the implications of these effects for our understanding of disease.
Collapse
Affiliation(s)
- Philipp Kapranov
- Helicos BioSciences Corporation, One Kendall Square Bldg 700, Cambridge, MA 02139, USA
| |
Collapse
|
346
|
The transcriptome of human CD34+ hematopoietic stem-progenitor cells. Proc Natl Acad Sci U S A 2009; 106:8278-83. [PMID: 19416867 DOI: 10.1073/pnas.0903390106] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Studying gene expression at different hematopoietic stages provides insights for understanding the genetic basis of hematopoiesis. We analyzed gene expression in human CD34(+) hematopoietic cells that represent the stem-progenitor population (CD34(+) cells). We collected >459,000 transcript signatures from CD34(+) cells, including the de novo-generated 3' ESTs and the existing sequences of full-length cDNAs, ESTs, and serial analysis of gene expression (SAGE) tags, and performed an extensive annotation on this large set of CD34(+) transcript sequences. We determined the genes expressed in CD34(+) cells, verified the known genes and identified the new genes of different functional categories involved in hematopoiesis, dissected the alternative gene expression including alternative transcription initiation, splicing, and adenylation, identified the antisense and noncoding transcripts, determined the CD34(+) cell-specific gene expression signature, and developed the CD34(+) cell-transcription map in the human genome. Our study provides a current view on gene expression in human CD34(+) cells and reveals that early hematopoiesis is an orchestrated process with the involvement of over half of the human genes distributed in various functions. The data generated from our study provide a comprehensive and uniform resource for studying hematopoiesis and stem cell biology.
Collapse
|
347
|
Layer JH, Weil PA. Ubiquitous antisense transcription in eukaryotes: novel regulatory mechanism or byproduct of opportunistic RNA polymerase? F1000 BIOLOGY REPORTS 2009; 1:33. [PMID: 20948652 PMCID: PMC2924692 DOI: 10.3410/b1-33] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
One of the more notable observations made in the last few years in gene regulation is that eukaryotic genomes appear to be pervasively transcribed. Recent transcriptome mapping studies have shown that much of the genome is transcribed, and in some instances transcripts from both strands of specific genomic loci are detectable. While some of these transcripts map to known RNA polymerase II transcription units [that is, protein encoding open reading frames (ORFs)], many are derived from regions of DNA thought to be non-genic. Parallel chromatin immunoprecipitation studies of template-bound RNA polymerase II have shown that it is indeed resident on those regions found to be transcribed, both ORF and non-ORF. However, the strandedness of these pervasive transcripts has never been measured on a genome-wide basis. Four recent reports have addressed this question and, in the process, have made the startling discovery that many loci of mRNA sense gene transcription are associated with very active antisense or divergent transcription that begins at mapped transcription start sites and proceeds in an upstream direction.
Collapse
Affiliation(s)
- Justin H Layer
- Department of Molecular Physiology and Biophysics, Vanderbilt University, School of Medicine Nashville, TN 37232-0615 USA
| | | |
Collapse
|
348
|
Abstract
The regulation of transcription is a complex process. Recent novel insights concerning the in vivo regulation and expression of protein-coding and non-coding RNAs have added previously unimagined levels of complexity to these processes.
Collapse
Affiliation(s)
- Philipp Kapranov
- Helicos BioSciences Corporation, One Kendall Square Building 700, Cambridge, MA 02139, USA.
| |
Collapse
|
349
|
The regulated retrotransposon transcriptome of mammalian cells. Nat Genet 2009; 41:563-71. [PMID: 19377475 DOI: 10.1038/ng.368] [Citation(s) in RCA: 592] [Impact Index Per Article: 39.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2008] [Accepted: 03/05/2009] [Indexed: 12/14/2022]
Abstract
Although repetitive elements pervade mammalian genomes, their overall contribution to transcriptional activity is poorly defined. Here, as part of the FANTOM4 project, we report that 6-30% of cap-selected mouse and human RNA transcripts initiate within repetitive elements. Analysis of approximately 250,000 retrotransposon-derived transcription start sites shows that the associated transcripts are generally tissue specific, coincide with gene-dense regions and form pronounced clusters when aligned to full-length retrotransposon sequences. Retrotransposons located immediately 5' of protein-coding loci frequently function as alternative promoters and/or express noncoding RNAs. More than a quarter of RefSeqs possess a retrotransposon in their 3' UTR, with strong evidence for the reduced expression of these transcripts relative to retrotransposon-free transcripts. Finally, a genome-wide screen identifies 23,000 candidate regulatory regions derived from retrotransposons, in addition to more than 2,000 examples of bidirectional transcription. We conclude that retrotransposon transcription has a key influence upon the transcriptional output of the mammalian genome.
Collapse
|
350
|
|