4301
|
Raasch P, Schmitz U, Patenge N, Vera J, Kreikemeyer B, Wolkenhauer O. Non-coding RNA detection methods combined to improve usability, reproducibility and precision. BMC Bioinformatics 2010; 11:491. [PMID: 20920260 PMCID: PMC2955705 DOI: 10.1186/1471-2105-11-491] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2010] [Accepted: 09/29/2010] [Indexed: 11/10/2022] Open
Abstract
Background Non-coding RNAs gain more attention as their diverse roles in many cellular processes are discovered. At the same time, the need for efficient computational prediction of ncRNAs increases with the pace of sequencing technology. Existing tools are based on various approaches and techniques, but none of them provides a reliable ncRNA detector yet. Consequently, a natural approach is to combine existing tools. Due to a lack of standard input and output formats combination and comparison of existing tools is difficult. Also, for genomic scans they often need to be incorporated in detection workflows using custom scripts, which decreases transparency and reproducibility. Results We developed a Java-based framework to integrate existing tools and methods for ncRNA detection. This framework enables users to construct transparent detection workflows and to combine and compare different methods efficiently. We demonstrate the effectiveness of combining detection methods in case studies with the small genomes of Escherichia coli, Listeria monocytogenes and Streptococcus pyogenes. With the combined method, we gained 10% to 20% precision for sensitivities from 30% to 80%. Further, we investigated Streptococcus pyogenes for novel ncRNAs. Using multiple methods--integrated by our framework--we determined four highly probable candidates. We verified all four candidates experimentally using RT-PCR. Conclusions We have created an extensible framework for practical, transparent and reproducible combination and comparison of ncRNA detection methods. We have proven the effectiveness of this approach in tests and by guiding experiments to find new ncRNAs. The software is freely available under the GNU General Public License (GPL), version 3 at http://www.sbi.uni-rostock.de/moses along with source code, screen shots, examples and tutorial material.
Collapse
Affiliation(s)
- Peter Raasch
- Systems Biology and Bioinformatics Group, University of Rostock, Rostock, Germany
| | | | | | | | | | | |
Collapse
|
4302
|
Prasanth KV, Camiolo M, Chan G, Tripathi V, Denis L, Nakamura T, Hübner MR, Spector DL. Nuclear organization and dynamics of 7SK RNA in regulating gene expression. Mol Biol Cell 2010; 21:4184-96. [PMID: 20881057 PMCID: PMC2993747 DOI: 10.1091/mbc.e10-02-0105] [Citation(s) in RCA: 60] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
We have identified 7SK RNA to be enriched in nuclear speckles. Knock-down of 7SK results in the mislocalization of nuclear speckle constituents, and the transcriptional up-regulation of a reporter gene locus. 7SK RNA transiently associates with the locus upon transcriptional down-regulation correlating with the displacement of pTEF-b. Noncoding RNAs play important roles in various aspects of gene regulation. We have identified 7SK RNA to be enriched in nuclear speckles or interchromatin granule clusters (IGCs), a subnuclear domain enriched in pre-mRNA processing factors. 7SK RNA, in association with HEXIM 1 and 2, is involved in the inhibition of transcriptional elongation by RNA polymerase II. Inhibition occurs via sequestration of the active P-TEFb kinase complex (CDK 9 and Cyclin T1/T2a/b or K) that is involved in phosphorylating the C-terminal domain of RNA polymerase II. Our results demonstrate that knock-down of 7SK RNA, by specific antisense oligonucleotides, results in the mislocalization of nuclear speckle constituents in a transcription-dependent manner, and the transcriptional up-regulation of a RNA polymerase II transcribed reporter gene locus. Furthermore, 7SK RNA transiently associates with a stably integrated reporter gene locus upon transcriptional down-regulation and its presence correlates with the efficient displacement of P-TEFb constituents from the locus. Our results suggest that 7SK RNA plays a role in modulating the available level of P-TEFb upon transcriptional down-regulation by sequestering its constituents in nuclear speckles.
Collapse
|
4303
|
Chung S, Nakagawa H, Uemura M, Piao L, Ashikawa K, Hosono N, Takata R, Akamatsu S, Kawaguchi T, Morizono T, Tsunoda T, Daigo Y, Matsuda K, Kamatani N, Nakamura Y, Kubo M. Association of a novel long non-coding RNA in 8q24 with prostate cancer susceptibility. Cancer Sci 2010; 102:245-52. [PMID: 20874843 DOI: 10.1111/j.1349-7006.2010.01737.x] [Citation(s) in RCA: 228] [Impact Index Per Article: 15.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Recent genome-wide association studies reported strong and reproducible associations of multiple genetic variants in a large "gene-desert" region of chromosome 8q24 with susceptibility to prostate cancer (PC). However, the causative or functional variants of these 8q24 loci and their biological mechanisms associated with PC susceptibility remain unclear and should be investigated. Here, focusing on its most centromeric region (so-called Region 2: Chr8: 128.14-128.28 Mb) among the multiple PC loci on 8q24, we performed fine mapping and re-sequencing of this critical region and identified SNPs (single nucleotide polymorphisms) between rs1456315 and rs7463708 (chr8: 128,173,119-128,173,237 bp) to be most significantly associated with PC susceptibility (P = 2.00 × 10(-24) , OR = 1.74, 95% confidence interval = 1.56-1.93). Importantly, we show that this region was transcribed as a ∼13 kb intron-less long non-coding RNA (ncRNA), termed PRNCR1 (prostate cancer non-coding RNA 1), and PRNCR1 expression was upregulated in some of the PC cells as well as precursor lesion prostatic intraepithelial neoplasia. Knockdown of PRNCR1 by siRNA attenuated the viability of PC cells and the transactivation activity of androgen receptor, which indicates that PRNCR1 could be involved in prostate carcinogenesis possibly through androgen receptor activity. These findings could provide a new insight in understanding the pathogenesis of genetic factors for PC susceptibility and prostate carcinogenesis.
Collapse
Affiliation(s)
- Suyoun Chung
- Laboratory of Molecular Medicine, Human Genome Center, Institute of Medical Science, The University of Tokyo, Tokyo, Japan
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
4304
|
Cooper DN, Chen JM, Ball EV, Howells K, Mort M, Phillips AD, Chuzhanova N, Krawczak M, Kehrer-Sawatzki H, Stenson PD. Genes, mutations, and human inherited disease at the dawn of the age of personalized genomics. Hum Mutat 2010; 31:631-55. [PMID: 20506564 DOI: 10.1002/humu.21260] [Citation(s) in RCA: 117] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
The number of reported germline mutations in human nuclear genes, either underlying or associated with inherited disease, has now exceeded 100,000 in more than 3,700 different genes. The availability of these data has both revolutionized the study of the morbid anatomy of the human genome and facilitated "personalized genomics." With approximately 300 new "inherited disease genes" (and approximately 10,000 new mutations) being identified annually, it is pertinent to ask how many "inherited disease genes" there are in the human genome, how many mutations reside within them, and where such lesions are likely to be located? To address these questions, it is necessary not only to reconsider how we define human genes but also to explore notions of gene "essentiality" and "dispensability."Answers to these questions are now emerging from recent novel insights into genome structure and function and through complete genome sequence information derived from multiple individual human genomes. However, a change in focus toward screening functional genomic elements as opposed to genes sensu stricto will be required if we are to capitalize fully on recent technical and conceptual advances and identify new types of disease-associated mutation within noncoding regions remote from the genes whose function they disrupt.
Collapse
Affiliation(s)
- David N Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Heath Park, Cardiff CF14 4XN, United Kingdom.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
4305
|
Pereira MJB, Behera V, Walter NG. Nondenaturing purification of co-transcriptionally folded RNA avoids common folding heterogeneity. PLoS One 2010; 5:e12953. [PMID: 20886091 PMCID: PMC2944885 DOI: 10.1371/journal.pone.0012953] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2010] [Accepted: 08/31/2010] [Indexed: 11/18/2022] Open
Abstract
Due to the energetic frustration of RNA folding, tertiary structured RNA is typically characterized by a rugged folding free energy landscape where deep kinetic barriers separate numerous misfolded states from one or more native states. While most in vitro studies of RNA rely on (re)folding chemically and/or enzymatically synthesized RNA in its entirety, which frequently leads into kinetic traps, nature reduces the complexity of the RNA folding problem by segmental, co-transcriptional folding starting from the 5' end. We here have developed a simplified, general, nondenaturing purification protocol for RNA to ask whether avoiding denaturation of a co-transcriptionally folded RNA can reduce commonly observed in vitro folding heterogeneity. Our protocol bypasses the need for large-scale auxiliary protein purification and expensive chromatographic equipment and involves rapid affinity capture with magnetic beads and removal of chemical heterogeneity by cleavage of the target RNA from the beads using the ligand-induced glmS ribozyme. For two disparate model systems, the Varkud satellite (VS) and hepatitis delta virus (HDV) ribozymes, we achieve >95% conformational purity within one hour of enzymatic transcription, without the need for any folding chaperones. We further demonstrate that in vitro refolding introduces severe conformational heterogeneity into the natively-purified VS ribozyme but not into the compact, double-nested pseudoknot fold of the HDV ribozyme. We conclude that conformational heterogeneity in complex RNAs can be avoided by co-transcriptional folding followed by nondenaturing purification, providing rapid access to chemically and conformationally pure RNA for biologically relevant biochemical and biophysical studies.
Collapse
Affiliation(s)
- Miguel J. B. Pereira
- Single Molecule Analysis Group, Department of Chemistry, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Vivek Behera
- Single Molecule Analysis Group, Department of Chemistry, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Nils G. Walter
- Single Molecule Analysis Group, Department of Chemistry, University of Michigan, Ann Arbor, Michigan, United States of America
| |
Collapse
|
4306
|
Corcoran AE. The epigenetic role of non-coding RNA transcription and nuclear organization in immunoglobulin repertoire generation. Semin Immunol 2010; 22:353-61. [PMID: 20863715 DOI: 10.1016/j.smim.2010.08.001] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/28/2010] [Accepted: 08/12/2010] [Indexed: 01/04/2023]
Abstract
Within the lymphocyte lineages, restriction of immunoglobulin V(D)J recombination to B cells and T cell receptor (TCR) recombination to T cells is governed by a myriad of epigenetic mechanisms that control the chromatin accessibility of these loci to the Rag recombinase machinery in a lineage and developmental stage-specific manner. These mechanisms operate both locally at individual gene segments, and globally over large chromatin domains in these enormous multigene loci. In this review we will explore the established and emerging roles of three aspects of epigenetic regulation that contribute to large-scale control of the immunoglobulin heavy chain locus in B cells: non-coding RNA transcription, regulatory elements, and nuclear organization. Recent conceptual and technological advances have produced a paradigm shift in our thinking about how these components regulate gene expression in general and V(D)J recombination in particular.
Collapse
Affiliation(s)
- Anne E Corcoran
- Laboratory of Chromatin and Gene Expression, The Babraham Institute, Babraham Research Campus, Cambridge CB22 3AT, UK.
| |
Collapse
|
4307
|
Hubé F, Velasco G, Rollin J, Furling D, Francastel C. Steroid receptor RNA activator protein binds to and counteracts SRA RNA-mediated activation of MyoD and muscle differentiation. Nucleic Acids Res 2010; 39:513-25. [PMID: 20855289 PMCID: PMC3025577 DOI: 10.1093/nar/gkq833] [Citation(s) in RCA: 112] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
The steroid receptor RNA activator (SRA) has the unusual property to function as both a non-coding RNA (ncRNA) and a protein SRAP. SRA ncRNA is known to increase the activity of a range of nuclear receptors as well as the master regulator of muscle differentiation MyoD. The contribution of SRA to either a ncRNA or a protein is influenced by alternative splicing of the first intron, the retention of which disrupts the SRAP open reading frame. We reported here that the ratio between non-coding and coding SRA isoforms increased during myogenic differentiation of human satellite cells but not myotonic dystrophy patient satellite cells, in which differentiation capacity is affected. Using constructs that exclusively produce SRA ncRNA or SRAP, we demonstrated that whereas SRA ncRNA was indeed an enhancer of myogenic differentiation and myogenic conversion of non-muscle cells through the co-activation of MyoD activity, SRAP prevented this SRA RNA-dependant co-activation. Interestingly, the SRAP inhibitory effect is mediated through the interaction of SRAP with its RNA counterpart via its RRM-like domain interacting with the functional sub-structure of SRA RNA, STR7. This study thus provides a new model for SRA-mediated regulation of MyoD transcriptional activity in the promotion of normal muscle differentiation, which takes into account the nature of SRA molecules present.
Collapse
Affiliation(s)
- Florent Hubé
- Université Paris Diderot-Paris7, CNRS UMR7216, Epigenetics and Cell Fate, Paris, France.
| | | | | | | | | |
Collapse
|
4308
|
|
4309
|
|
4310
|
Qureshi IA, Mehler MF. Impact of nuclear organization and dynamics on epigenetic regulation in the central nervous system: implications for neurological disease states. Ann N Y Acad Sci 2010; 1204 Suppl:E20-37. [PMID: 20840166 PMCID: PMC2946117 DOI: 10.1111/j.1749-6632.2010.05718.x] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Epigenetic mechanisms that are highly responsive to interoceptive and environmental stimuli mediate the proper execution of complex genomic programs, such as cell type-specific gene transcription and posttranscriptional RNA processing, and are increasingly thought to be important for modulating the development, homeostasis, and plasticity of the central nervous system (CNS). These epigenetic processes include DNA methylation, histone modifications, and chromatin remodeling, all of which play roles in neural cellular diversity, connectivity, and plasticity. Further, large-scale transcriptomic analyses have revealed that the eukaryotic genome is pervasively transcribed, forming interleaved protein-coding RNAs and regulatory nonprotein-coding RNAs (ncRNAs), which act through a broad array of molecular mechanisms. Most of these ncRNAs are transcribed in a cell type- and developmental stage-specific manner in the CNS. A broad array of posttranscriptional processes, such as RNA editing and transport, can modulate the functions of both protein-coding RNAs and ncRNAs. Additional studies implicate nuclear organization and dynamics in mediating epigenetic regulation. The compartmentalization of DNA sequences and other molecular machinery into functional nuclear domains, such as transcription factories, Cajal bodies, promyelocytic leukemia nuclear bodies, nuclear speckles, and paraspeckles, some of which are found prominently in neural cells, is associated with regulation of transcriptional activity and posttranscriptional RNA processing. These observations suggest that genomic architecture and RNA biology in the CNS are much more complex and nuanced than previously appreciated. Increasing evidence now suggests that most, if not all, human CNS diseases are associated with either primary or secondary perturbations in one or more aspects of the epigenome. In this review, we provide an update of our emerging understanding of genomic architecture, RNA biology, and nuclear organization and highlight the interconnected roles that deregulation of these factors may play in diverse CNS disorders.
Collapse
Affiliation(s)
- Irfan A. Qureshi
- Rosyln and Leslie Goldstein Laboratory for Stem Cell Biology and Regenerative Medicine, Albert Einstein College of Medicine, Bronx, New York, NY
- Institute for Brain Disorders and Neural Regeneration, Albert Einstein College of Medicine, Bronx, New York, NY
- Department of Neurology, Albert Einstein College of Medicine, Bronx, New York, NY
- Rose F. Kennedy Center for Research on Intellectual and Developmental Disabilities, Albert Einstein College of Medicine, Bronx, New York, NY
| | - Mark F. Mehler
- Rosyln and Leslie Goldstein Laboratory for Stem Cell Biology and Regenerative Medicine, Albert Einstein College of Medicine, Bronx, New York, NY
- Institute for Brain Disorders and Neural Regeneration, Albert Einstein College of Medicine, Bronx, New York, NY
- Department of Neurology, Albert Einstein College of Medicine, Bronx, New York, NY
- Department of Neuroscience, Albert Einstein College of Medicine, Bronx, New York, NY
- Department of Psychiatry and Behavioral Sciences, Albert Einstein College of Medicine, Bronx, New York, NY
- Rose F. Kennedy Center for Research on Intellectual and Developmental Disabilities, Albert Einstein College of Medicine, Bronx, New York, NY
| |
Collapse
|
4311
|
Hung T, Chang HY. Long noncoding RNA in genome regulation: prospects and mechanisms. RNA Biol 2010; 7:582-5. [PMID: 20930520 DOI: 10.4161/rna.7.5.13216] [Citation(s) in RCA: 450] [Impact Index Per Article: 30.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
Long noncoding RNAs (lncRNAs) are pervasively transcribed and critical regulators of the epigenome[1, 2]. These long, polyadenylated RNAs do not code for proteins, but function directly as RNAs, recruiting chromatin modifiers to mediate transcriptional changes in processes ranging from X-inactivation (XIST) to imprinting (H19)[3]. The recent discovery that lncRNA HOTAIR can link chromatin changes to cancer metastasis[4] furthers the relevance of lncRNAs to human disease. Here, we discuss lncRNAs as regulatory modules and explore the implications for disease pathogenesis. Although large-scale analyses of mammalian transcriptomes have revealed that more than 50% of transcripts have no protein coding potential[2, 5, 6], the functions of these putative transcripts are largely unknown. A subset of these noncoding transcripts are termed long noncoding RNAs (lncRNAs), based on an arbitrary minimum length of 200 nucleotides. LncRNAs are roughly classified based on their position relative to protein-coding genes: intergenic (between genes), intragenic/intronic (within genes), and antisense[2]. Initial efforts to characterize these molecules demonstrated that they function in cis, regulating their immediate genomic neighbors. Examples include AIR, XIST, and Kcnq1ot (reviewed in [1, 7, 8]), which recruit chromatin modifying complexes to silence adjacent sites. The scope of lncRNAs in gene regulation was advanced with the finding that lncRNA HOTAIR exhibited trans regulatory capacities. HOTAIR is transcribed at the intersection of opposing chromatin domains in the HOXC locus, but targets Polycomb Repressive Complex 2 (PRC2) to silence 40 kilobases of HOXD[9], a locus involved in developmental patterning. A subsequent study revealed that HOTAIR is overexpressed in approximately one quarter of human breast cancers, directing PRC2 to approximately 800 ectopic sites in the genome, which leads to histone H3 lysine 27 trimethylation and changes in gene expression[4]. The impacts of lncRNA-mediated chromatin changes are noteworthy: not only did HOTAIR drive metastasis in a mouse model, but HOTAIR expression in human breast cancer was found to be an independent prognostic marker for death and metastasis[4]. The fact that HOTAIR drives chromatin reprogramming genome-wide suggests that long-range regulation by lncRNAs may be a widespread mechanism. This is supported by a study showing that > 20% of tested lncRNAs are bound by PRC2 and other chromatin modifiers[10]. Furthermore, this is an underestimate of the total RNAs involved in chromatin modification, as PRC2 target genes also transcribe smaller 50-200 nt RNAs that interact with SUZ12 to mediate gene repression[11]. These findings provoke questions regarding the initial triggers for HOTAIR overexpression and whether understanding of lncRNA mechanics may have clinical relevance.
Collapse
Affiliation(s)
- Tiffany Hung
- Howard Hughes Medical Institute and Program in Epithelial Biology, Stanford University School of Medicine, Stanford, CA, USA
| | | |
Collapse
|
4312
|
A large intergenic noncoding RNA induced by p53 mediates global gene repression in the p53 response. Cell 2010; 142:409-19. [PMID: 20673990 DOI: 10.1016/j.cell.2010.06.040] [Citation(s) in RCA: 1690] [Impact Index Per Article: 112.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2009] [Revised: 04/06/2010] [Accepted: 06/03/2010] [Indexed: 12/15/2022]
Abstract
Recently, more than 1000 large intergenic noncoding RNAs (lincRNAs) have been reported. These RNAs are evolutionarily conserved in mammalian genomes and thus presumably function in diverse biological processes. Here, we report the identification of lincRNAs that are regulated by p53. One of these lincRNAs (lincRNA-p21) serves as a repressor in p53-dependent transcriptional responses. Inhibition of lincRNA-p21 affects the expression of hundreds of gene targets enriched for genes normally repressed by p53. The observed transcriptional repression by lincRNA-p21 is mediated through the physical association with hnRNP-K. This interaction is required for proper genomic localization of hnRNP-K at repressed genes and regulation of p53 mediates apoptosis. We propose a model whereby transcription factors activate lincRNAs that serve as key repressors by physically associating with repressive complexes and modulate their localization to sets of previously active genes.
Collapse
|
4313
|
Ponting CP, Belgard TG. Transcribed dark matter: meaning or myth? Hum Mol Genet 2010; 19:R162-8. [PMID: 20798109 DOI: 10.1093/hmg/ddq362] [Citation(s) in RCA: 220] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Genomic tiling arrays, cDNA sequencing and, more recently, RNA-Seq have provided initial insights into the extent and depth of transcribed sequence across human and other genomes. These methods have led to greatly improved annotations of protein-coding genes, but have also identified transcription outside of annotated exons. One resultant issue that has aroused dispute is the balance of transcription of known exons against transcription outside of known exons. While non-genic 'dark matter' transcription was found by tiling arrays to be pervasive, it was seen to contribute only a small percentage of the polyadenylated transcriptome in some RNA-Seq experiments. This apparent contradiction has been compounded by a lack of clarity about what exactly constitutes a protein-coding gene. It remains unclear, for example, whether or not all transcripts that overlap on either strand within a genomic locus should be assigned to a single gene locus, including those that fail to share promoters, exons and splice junctions. The inability of tiling arrays and RNA-Seq to count transcripts, rather than exons or exon pairs, adds to these difficulties. While there is agreement that thousands of apparently non-coding loci are present outside of protein-coding genes in the human genome, there is vigorous debate of what constitutes evidence for their functionality. These issues will only be resolved upon the demonstration, or otherwise, that organismal or cellular phenotypes frequently result when non-coding RNA loci are disrupted.
Collapse
Affiliation(s)
- Chris P Ponting
- MRC Functional Genomics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, South Parks Road, Oxford, UK.
| | | |
Collapse
|
4314
|
Prasov L, Brown NL, Glaser T. A critical analysis of Atoh7 (Math5) mRNA splicing in the developing mouse retina. PLoS One 2010; 5:e12315. [PMID: 20808762 PMCID: PMC2927423 DOI: 10.1371/journal.pone.0012315] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2010] [Accepted: 06/25/2010] [Indexed: 01/22/2023] Open
Abstract
The Math5 (Atoh7) gene is transiently expressed during retinogenesis by progenitors exiting mitosis, and is essential for ganglion cell (RGC) development. Math5 contains a single exon, and its 1.7 kb mRNA encodes a 149-aa polypeptide. Mouse Math5 mutants have essentially no RGCs or optic nerves. Given the importance of this gene in retinal development, we thoroughly investigated the possibility of Math5 mRNA splicing by Northern blot, 3'RACE, RNase protection assays, and RT-PCR, using RNAs extracted from embryonic eyes and adult cerebellum, or transcribed in vitro from cDNA clones. Because Math5 mRNA contains an elevated G+C content, we used graded concentrations of betaine, an isostabilizing agent that disrupts secondary structure. Although approximately 10% of cerebellar Math5 RNAs are spliced, truncating the polypeptide, our results show few, if any, spliced Math5 transcripts exist in the developing retina (<1%). Rare deleted cDNAs do arise via RT-mediated RNA template switching in vitro, and are selectively amplified during PCR. These data differ starkly from a recent study (Kanadia and Cepko 2010), which concluded that the vast majority of Math5 and other bHLH transcripts are spliced to generate noncoding RNAs. Our findings clarify the architecture of the Math5 gene and its mechanism of action. These results have implications for all members of the bHLH gene family, for any gene that is alternatively spliced, and for the interpretation of all RT-PCR experiments.
Collapse
Affiliation(s)
- Lev Prasov
- Departments of Human Genetics and Internal Medicine, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Nadean L. Brown
- Division of Developmental Biology, Department of Pediatrics and Ophthalmology, Cincinnati Children's Research Foundation, University of Cincinnati School of Medicine, Cincinnati, Ohio, United States of America
| | - Tom Glaser
- Departments of Human Genetics and Internal Medicine, University of Michigan, Ann Arbor, Michigan, United States of America
| |
Collapse
|
4315
|
Bernard D, Prasanth KV, Tripathi V, Colasse S, Nakamura T, Xuan Z, Zhang MQ, Sedel F, Jourdren L, Coulpier F, Triller A, Spector DL, Bessis A. A long nuclear-retained non-coding RNA regulates synaptogenesis by modulating gene expression. EMBO J 2010; 29:3082-93. [PMID: 20729808 DOI: 10.1038/emboj.2010.199] [Citation(s) in RCA: 587] [Impact Index Per Article: 39.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2010] [Accepted: 07/16/2010] [Indexed: 12/19/2022] Open
Abstract
A growing number of long nuclear-retained non-coding RNAs (ncRNAs) have recently been described. However, few functions have been elucidated for these ncRNAs. Here, we have characterized the function of one such ncRNA, identified as metastasis-associated lung adenocarcinoma transcript 1 (Malat1). Malat1 RNA is expressed in numerous tissues and is highly abundant in neurons. It is enriched in nuclear speckles only when RNA polymerase II-dependent transcription is active. Knock-down studies revealed that Malat1 modulates the recruitment of SR family pre-mRNA-splicing factors to the transcription site of a transgene array. DNA microarray analysis in Malat1-depleted neuroblastoma cells indicates that Malat1 controls the expression of genes involved not only in nuclear processes, but also in synapse function. In cultured hippocampal neurons, knock-down of Malat1 decreases synaptic density, whereas its over-expression results in a cell-autonomous increase in synaptic density. Our results suggest that Malat1 regulates synapse formation by modulating the expression of genes involved in synapse formation and/or maintenance.
Collapse
Affiliation(s)
- Delphine Bernard
- Laboratoire de Biologie Cellulaire de la Synapse, Inserm 1024/CNRS 8197, Institut de Biologie de l'Ecole Normale Supérieure, Paris, France
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
4316
|
Abstract
Cellular homeostasis is achieved by the proper balance of regulatory networks that if disrupted can lead to cellular transformation. These cell circuits are fine-tuned and maintained by the coordinated function of proteins and non-coding RNAs (ncRNAs). In addition to the well-characterized protein coding and microRNAs constituents, large ncRNAs are also emerging as important regulatory molecules in tumor-suppressor and oncogenic pathways. Recent studies have revealed mechanistic insight of large ncRNAs regulating key cancer pathways at a transcriptional, post-transcriptional and epigenetic level. Here we synthesize these latest advances within the context of their mechanistic roles in regulating and maintaining cellular equilibrium. We posit that similar to protein-coding genes, large ncRNAs are a newly emerging class of oncogenic and tumor-suppressor genes. Our growing knowledge of the role of large ncRNAs in cellular transformation is pointing towards their potential use as biomarkers and targets for novel therapeutic approaches in the future.
Collapse
Affiliation(s)
- Maite Huarte
- The Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | | |
Collapse
|
4317
|
Berretta R, Moscato P. Cancer biomarker discovery: the entropic hallmark. PLoS One 2010; 5:e12262. [PMID: 20805891 PMCID: PMC2923618 DOI: 10.1371/journal.pone.0012262] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2009] [Accepted: 06/26/2010] [Indexed: 12/29/2022] Open
Abstract
Background It is a commonly accepted belief that cancer cells modify their transcriptional state during the progression of the disease. We propose that the progression of cancer cells towards malignant phenotypes can be efficiently tracked using high-throughput technologies that follow the gradual changes observed in the gene expression profiles by employing Shannon's mathematical theory of communication. Methods based on Information Theory can then quantify the divergence of cancer cells' transcriptional profiles from those of normally appearing cells of the originating tissues. The relevance of the proposed methods can be evaluated using microarray datasets available in the public domain but the method is in principle applicable to other high-throughput methods. Methodology/Principal Findings Using melanoma and prostate cancer datasets we illustrate how it is possible to employ Shannon Entropy and the Jensen-Shannon divergence to trace the transcriptional changes progression of the disease. We establish how the variations of these two measures correlate with established biomarkers of cancer progression. The Information Theory measures allow us to identify novel biomarkers for both progressive and relatively more sudden transcriptional changes leading to malignant phenotypes. At the same time, the methodology was able to validate a large number of genes and processes that seem to be implicated in the progression of melanoma and prostate cancer. Conclusions/Significance We thus present a quantitative guiding rule, a new unifying hallmark of cancer: the cancer cell's transcriptome changes lead to measurable observed transitions of Normalized Shannon Entropy values (as measured by high-througput technologies). At the same time, tumor cells increment their divergence from the normal tissue profile increasing their disorder via creation of states that we might not directly measure. This unifying hallmark allows, via the the Jensen-Shannon divergence, to identify the arrow of time of the processes from the gene expression profiles, and helps to map the phenotypical and molecular hallmarks of specific cancer subtypes. The deep mathematical basis of the approach allows us to suggest that this principle is, hopefully, of general applicability for other diseases.
Collapse
Affiliation(s)
- Regina Berretta
- Centre for Bioinformatics, Biomarker Discovery and Information-Based Medicine, The University of Newcastle, Callaghan, New South Wales, Australia
- Information Based Medicine Program, Hunter Medical Research Institute, John Hunter Hospital, New Lambton Heights, New South Wales, Australia
| | - Pablo Moscato
- Centre for Bioinformatics, Biomarker Discovery and Information-Based Medicine, The University of Newcastle, Callaghan, New South Wales, Australia
- Information Based Medicine Program, Hunter Medical Research Institute, John Hunter Hospital, New Lambton Heights, New South Wales, Australia
- Australian Research Council Centre of Excellence in Bioinformatics, Callaghan, New South Wales, Australia
- * E-mail:
| |
Collapse
|
4318
|
|
4319
|
Buschdorf JP, Lee PKM, Lim VKJ, Kappeler L, Sng JCG. Conference Scene: Epigenetics goes translational. Epigenomics 2010; 2:509-12. [DOI: 10.2217/epi.10.34] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Keystone Symposia held one of its five global conferences outside of North America in Singapore on 26–30 April 2010, entitled ‘Developmental Origins and Epigenesis in Human Health and Disease’. This was organized by Judith Swain, Peter Gluckman, Michael Meaney and Anne Ferguson-Smith from the Singapore Institute for Clinical Sciences (SICS, Singapore), with 23 renowned international speakers and more than 100 delegates attending this symposium.
Collapse
Affiliation(s)
- Jan Paul Buschdorf
- Growth, Development & Metabolism Programme, Singapore Institute for Clinical Sciences, A*Star, 30 Medical Drive, 117609, Singapore
| | - Patrick KM Lee
- Growth, Development & Metabolism Programme, Singapore Institute for Clinical Sciences, A*Star, 30 Medical Drive, 117609, Singapore
| | - Vania KJ Lim
- Growth, Development & Metabolism Programme, Singapore Institute for Clinical Sciences, A*Star, 30 Medical Drive, 117609, Singapore
| | - Laurent Kappeler
- Sackler Program for Epigenetics & Psychobiology at McGill University, Douglas Mental Health Institute, Montreal, QC, Canada
- Centre de Recherche St-Antoine, UPMC-Inserm UMRS 938, Paris, France
| | | |
Collapse
|
4320
|
Abstract
Ever since the pre-molecular era, the birth of new genes with novel functions has been considered to be a major contributor to adaptive evolutionary innovation. Here, I review the origin and evolution of new genes and their functions in eukaryotes, an area of research that has made rapid progress in the past decade thanks to the genomics revolution. Indeed, recent work has provided initial whole-genome views of the different types of new genes for a large number of different organisms. The array of mechanisms underlying the origin of new genes is compelling, extending way beyond the traditionally well-studied source of gene duplication. Thus, it was shown that novel genes also regularly arose from messenger RNAs of ancestral genes, protein-coding genes metamorphosed into new RNA genes, genomic parasites were co-opted as new genes, and that both protein and RNA genes were composed from scratch (i.e., from previously nonfunctional sequences). These mechanisms then also contributed to the formation of numerous novel chimeric gene structures. Detailed functional investigations uncovered different evolutionary pathways that led to the emergence of novel functions from these newly minted sequences and, with respect to animals, attributed a potentially important role to one specific tissue--the testis--in the process of gene birth. Remarkably, these studies also demonstrated that novel genes of the various types significantly impacted the evolution of cellular, physiological, morphological, behavioral, and reproductive phenotypic traits. Consequently, it is now firmly established that new genes have indeed been major contributors to the origin of adaptive evolutionary novelties.
Collapse
Affiliation(s)
- Henrik Kaessmann
- Center for Integrative Genomics, University of Lausanne, CH-1015 Lausanne, Switzerland.
| |
Collapse
|
4321
|
Gowda M, Nunes CC, Sailsbery J, Xue M, Chen F, Nelson CA, Brown DE, Oh Y, Meng S, Mitchell T, Hagedorn CH, Dean RA. Genome-wide characterization of methylguanosine-capped and polyadenylated small RNAs in the rice blast fungus Magnaporthe oryzae. Nucleic Acids Res 2010; 38:7558-69. [PMID: 20660015 PMCID: PMC2995040 DOI: 10.1093/nar/gkq583] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open
Abstract
Small RNAs are well described in higher eukaryotes such as mammals and plants; however, knowledge in simple eukaryotes such as filamentous fungi is limited. In this study, we discovered and characterized methylguanosine-capped and polyadenylated small RNAs (CPA-sRNAs) by using differential RNA selection, full-length cDNA cloning and 454 transcriptome sequencing of the rice blast fungus Magnaporthe oryzae. This fungus causes blast, a devastating disease on rice, the principle food staple for over half the world’s population. CPA-sRNAs mapped primarily to the transcription initiation and termination sites of protein-coding genes and were positively correlated with gene expression, particularly for highly expressed genes including those encoding ribosomal proteins. Numerous CPA-sRNAs also mapped to rRNAs, tRNAs, snRNAs, transposable elements and intergenic regions. Many other 454 sequence reads could not be mapped to the genome; however, inspection revealed evidence for non-template additions and chimeric sequences. CPA-sRNAs were independently confirmed using a high affinity variant of eIF-4E to capture 5′-methylguanosine-capped RNA followed by 3′-RACE sequencing. These results expand the repertoire of small RNAs in filamentous fungi.
Collapse
Affiliation(s)
- Malali Gowda
- Fungal Genomics Laboratory, Center for Integrated Fungal Research, North Carolina State University, Raleigh, NC 27606, USA
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
4322
|
Guohua Xu A, He L, Li Z, Xu Y, Li M, Fu X, Yan Z, Yuan Y, Menzel C, Li N, Somel M, Hu H, Chen W, Pääbo S, Khaitovich P. Intergenic and repeat transcription in human, chimpanzee and macaque brains measured by RNA-Seq. PLoS Comput Biol 2010; 6:e1000843. [PMID: 20617162 PMCID: PMC2895644 DOI: 10.1371/journal.pcbi.1000843] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2009] [Accepted: 05/28/2010] [Indexed: 01/22/2023] Open
Abstract
Transcription is the first step connecting genetic information with an organism's phenotype. While expression of annotated genes in the human brain has been characterized extensively, our knowledge about the scope and the conservation of transcripts located outside of the known genes' boundaries is limited. Here, we use high-throughput transcriptome sequencing (RNA-Seq) to characterize the total non-ribosomal transcriptome of human, chimpanzee, and rhesus macaque brain. In all species, only 20-28% of non-ribosomal transcripts correspond to annotated exons and 20-23% to introns. By contrast, transcripts originating within intronic and intergenic repetitive sequences constitute 40-48% of the total brain transcriptome. Notably, some repeat families show elevated transcription. In non-repetitive intergenic regions, we identify and characterize 1,093 distinct regions highly expressed in the human brain. These regions are conserved at the RNA expression level across primates studied and at the DNA sequence level across mammals. A large proportion of these transcripts (20%) represents 3'UTR extensions of known genes and may play roles in alternative microRNA-directed regulation. Finally, we show that while transcriptome divergence between species increases with evolutionary time, intergenic transcripts show more expression differences among species and exons show less. Our results show that many yet uncharacterized evolutionary conserved transcripts exist in the human brain. Some of these transcripts may play roles in transcriptional regulation and contribute to evolution of human-specific phenotypic traits.
Collapse
Affiliation(s)
- Augix Guohua Xu
- Partner Institute for Computational Biology, Shanghai, China
- Max-Planck-Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Liu He
- Partner Institute for Computational Biology, Shanghai, China
- Graduate School of the Chinese Academy of Sciences, Shanghai, China
| | - Zhongshan Li
- Partner Institute for Computational Biology, Shanghai, China
- Graduate School of the Chinese Academy of Sciences, Shanghai, China
| | - Ying Xu
- Partner Institute for Computational Biology, Shanghai, China
| | - Mingfeng Li
- Partner Institute for Computational Biology, Shanghai, China
| | - Xing Fu
- Partner Institute for Computational Biology, Shanghai, China
| | - Zheng Yan
- Partner Institute for Computational Biology, Shanghai, China
| | - Yuan Yuan
- Partner Institute for Computational Biology, Shanghai, China
| | - Corinna Menzel
- Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Na Li
- Max Delbrück Centrum für Molekulare Medizin, Berlin Institute for Medical Systems Biology, Berlin-Buch, Germany
| | - Mehmet Somel
- Partner Institute for Computational Biology, Shanghai, China
- Max-Planck-Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Hao Hu
- Partner Institute for Computational Biology, Shanghai, China
- Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Wei Chen
- Max Planck Institute for Molecular Genetics, Berlin, Germany
- Max Delbrück Centrum für Molekulare Medizin, Berlin Institute for Medical Systems Biology, Berlin-Buch, Germany
- * E-mail: (WC); (PK)
| | - Svante Pääbo
- Max-Planck-Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Philipp Khaitovich
- Partner Institute for Computational Biology, Shanghai, China
- Max-Planck-Institute for Evolutionary Anthropology, Leipzig, Germany
- * E-mail: (WC); (PK)
| |
Collapse
|
4323
|
Charon C, Moreno AB, Bardou F, Crespi M. Non-protein-coding RNAs and their interacting RNA-binding proteins in the plant cell nucleus. MOLECULAR PLANT 2010; 3:729-739. [PMID: 20603381 DOI: 10.1093/mp/ssq037] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
The complex responses of eukaryotic cells to external factors are governed by several transcriptional and post-transcriptional processes. Several of them occur in the nucleus and have been linked to the action of non-protein-coding RNAs (or npcRNAs), both long and small npcRNAs, that recently emerged as major regulators of gene expression. Regulatory npcRNAs acting in the nucleus include silencing-related RNAs, intergenic npcRNAs, natural antisense RNAs, and other aberrant RNAs resulting from the interplay between global transcription and RNA processing activities (such as Dicers and RNA-dependent polymerases). Generally, the resulting npcRNAs exert their regulatory effects through interactions with RNA-binding proteins (or RBPs) within ribonucleoprotein particles (or RNPs). A large group of RBPs are implicated in the silencing machinery through small interfering RNAs (siRNAs) and their localization suggests that several act in the nucleus to trigger epigenetic and chromatin changes at a whole-genome scale. Other nuclear RBPs interact with npcRNAs and change their localization. In the fission yeast, the RNA-binding Mei2p protein, playing pivotal roles in meiosis, interact with a meiotic npcRNA involved in its nuclear re-localization. Related processes have been identified in plants and the ENOD40 npcRNA was shown to re-localize a nuclear-speckle RBP from the nucleus to the cytoplasm in Medicago truncatula. Plant RBPs have been also implicated in RNA-mediated chromatin silencing in the FLC locus through interaction with specific antisense transcripts. In this review, we discuss the interactions between RBPs and npcRNAs in the context of nuclear-related processes and their implication in plant development and stress responses. We propose that these interactions may add a regulatory layer that modulates the interactions between the nuclear genome and the environment and, consequently, control plant developmental plasticity.
Collapse
Affiliation(s)
- Celine Charon
- University Paris-Sud, Institut de Biologie des Plantes, UMR8618, Orsay Cedex, F-91405, France
| | | | | | | |
Collapse
|
4324
|
Mohammad F, Mondal T, Guseva N, Pandey GK, Kanduri C. Kcnq1ot1 noncoding RNA mediates transcriptional gene silencing by interacting with Dnmt1. Development 2010; 137:2493-9. [PMID: 20573698 DOI: 10.1242/dev.048181] [Citation(s) in RCA: 213] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
A long noncoding RNA, Kcnq1ot1, regulates the expression of both ubiquitously and tissue-specific imprinted genes within the Kcnq1 domain. However, the functional sequences of the Kcnq1ot1 RNA that mediate lineage-specific imprinting are unknown. Here, we have generated a knockout mouse with a deletion encompassing an 890-bp silencing domain (Delta890) downstream of the Kcnq1ot1 promoter. Maternal transmission of the Delta890 allele has no effect on imprinting, whereas paternal inheritance of the deletion leads to selective relaxation of the imprinting of ubiquitously imprinted genes to a variable extent in a tissue-specific manner. Interestingly, the deletion affects DNA methylation at somatically acquired differentially methylated regions (DMRs), but does not affect the histone modifications of the ubiquitously imprinted genes. Importantly, we found that Kcnq1ot1 recruits Dnmt1 to somatic DMRs by interacting with Dnmt1, and that this interaction was significantly reduced in the Delta890 mice. Thus, the ubiquitous and placental-specific imprinting of genes within the Kcnq1 domain might be mediated by distinct mechanisms, and Kcnq1ot1 RNA might mediate the silencing of ubiquitously imprinted genes by maintaining allele-specific methylation through its interactions with Dnmt1.
Collapse
Affiliation(s)
- Faizaan Mohammad
- Department of Genetics and Pathology, Dag Hammarskjölds Väg 20, 75185 Rudbeck Laboratory, Uppsala University, Uppsala 75185, Sweden
| | | | | | | | | |
Collapse
|
4325
|
Abstract
Paraspeckles are a relatively new class of subnuclear bodies found in the interchromatin space of mammalian cells. They are RNA-protein structures formed by the interaction between a long nonprotein-coding RNA species, NEAT1/Men epsilon/beta, and members of the DBHS (Drosophila Behavior Human Splicing) family of proteins: P54NRB/NONO, PSPC1, and PSF/SFPQ. Paraspeckles are critical to the control of gene expression through the nuclear retention of RNA containing double-stranded RNA regions that have been subject to adenosine-to-inosine editing. Through this mechanism paraspeckles and their components may ultimately have a role in controlling gene expression during many cellular processes including differentiation, viral infection, and stress responses.
Collapse
Affiliation(s)
- Archa H Fox
- Western Australian Institute for Medical Research and Centre For Medical Research, University of Western Australia, Crawley, 6009 Western Australia, Australia.
| | | |
Collapse
|
4326
|
Babik W, Stuglik M, Qi W, Kuenzli M, Kuduk K, Koteja P, Radwan J. Heart transcriptome of the bank vole (Myodes glareolus): towards understanding the evolutionary variation in metabolic rate. BMC Genomics 2010; 11:390. [PMID: 20565972 PMCID: PMC2996923 DOI: 10.1186/1471-2164-11-390] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2010] [Accepted: 06/21/2010] [Indexed: 03/13/2023] Open
Abstract
BACKGROUND Understanding the genetic basis of adaptive changes has been a major goal of evolutionary biology. In complex organisms without sequenced genomes, de novo transcriptome assembly using a longer read sequencing technology followed by expression profiling using short reads is likely to provide comprehensive identification of adaptive variation at the expression level and sequence polymorphisms in coding regions. We performed sequencing and de novo assembly of the bank vole heart transcriptome in lines selected for high metabolism and unselected controls. RESULTS A single 454 Titanium run produced over million reads, which were assembled into 63,581 contigs. Searches against the SwissProt protein database and the ENSEMBL collection of mouse transcripts detected similarity to 11,181 and 14,051 genes, respectively. As judged by the representation of genes from the heart-related Gene Ontology categories and UniGenes detected in the mouse heart, our detection of the genes expressed in the heart was nearly complete (> 95% and almost 90% respectively). On average, 38.7% of the transcript length was covered by our sequences, with notably higher (45.0%) coverage of coding regions than of untranslated regions (24.5% of 5' and 32.7% of 3'UTRs). Lower sequence conservation between mouse and bank vole in untranslated regions was found to be partially responsible for poorer UTR representation. Our data might suggest a widespread transcription from noncoding genomic regions, a finding not reported in previous studies regarding transcriptomes in non-model organisms. We also identified over 19 thousand putative single nucleotide polymorphisms (SNPs). A much higher fraction of the SNPs than expected by chance exhibited variant frequency differences between selection regimes. CONCLUSION Longer reads and higher sequence yield per run provided by the 454 Titanium technology in comparison to earlier generations of pyrosequencing proved beneficial for the quality of assembly. An almost full representation of genes known to be expressed in the mouse heart was identified. Usage of the extensive genomic resources available for the house mouse, a moderately (20-40 mln years) divergent relative of the voles, enabled a comprehensive assessment of the transcript completeness. Transcript sequences generated in the present study allowed the identification of candidate SNPs associated with divergence of selection lines and constitute a valuable permanent resource forming a foundation for RNAseq experiments aiming at detection of adaptive changes both at the level of gene expression and sequence variants, that would facilitate studies of the genetic basis of evolutionary divergence.
Collapse
Affiliation(s)
- Wiesław Babik
- Institute of Environmental Sciences, Jagiellonian University, 30-387 Krakow, Poland.
| | | | | | | | | | | | | |
Collapse
|
4327
|
Rouquette J, Cremer C, Cremer T, Fakan S. Functional nuclear architecture studied by microscopy: present and future. INTERNATIONAL REVIEW OF CELL AND MOLECULAR BIOLOGY 2010; 282:1-90. [PMID: 20630466 DOI: 10.1016/s1937-6448(10)82001-5] [Citation(s) in RCA: 83] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
In this review we describe major contributions of light and electron microscopic approaches to the present understanding of functional nuclear architecture. The large gap of knowledge, which must still be bridged from the molecular level to the level of higher order structure, is emphasized by differences of currently discussed models of nuclear architecture. Molecular biological tools represent new means for the multicolor visualization of various nuclear components in living cells. New achievements offer the possibility to surpass the resolution limit of conventional light microscopy down to the nanometer scale and require improved bioinformatics tools able to handle the analysis of large amounts of data. In combination with the much higher resolution of electron microscopic methods, including ultrastructural cytochemistry, correlative microscopy of the same cells in their living and fixed state is the approach of choice to combine the advantages of different techniques. This will make possible future analyses of cell type- and species-specific differences of nuclear architecture in more detail and to put different models to critical tests.
Collapse
Affiliation(s)
- Jacques Rouquette
- Biocenter, Ludwig Maximilians University (LMU), Martinsried, Germany
| | | | | | | |
Collapse
|
4328
|
Pasmant É, Laurendeau I, Sabbagh A, Parfait B, Vidaud M, Vidaud D, Bièche I. ANRILou l’étrange histoire d’un grand ARN non codant. Med Sci (Paris) 2010; 26:564-6. [DOI: 10.1051/medsci/2010266-7564] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
|
4329
|
Ferdin J, Kunej T, Calin GA. Non-coding RNAs: identification of cancer-associated microRNAs by gene profiling. Technol Cancer Res Treat 2010; 9:123-38. [PMID: 20218735 DOI: 10.1177/153303461000900202] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
MicroRNAs (miRNAs) belong to the heterogeneous class of non-coding RNAs (ncRNAs), which are by definition RNA molecules that do not encode for proteins, but have instead important structural, catalytic or regulatory functions. In this review we first provide an overview of the different ncRNA families, focusing in particular on miRNAs and their relevance in tumour development and progression. Second we shortly describe the available ncRNA expression profiling methods, which comprise microarray, bead-based hybridization methods, in situ hybridization, quantitative real-time polymerase chain reaction, cloning and deep sequencing methods. Finally, we used the PubMed database to perform an extensive literature search for miRNA expression profiling research articles in cancer and identified 58 studies that were published between 2004 and 2009; we identified 70 miRNAs that were reported in at least five studies as being either up- or downregulated, depending on the type of cancer, and 192 miRNAs that were reported to be up- or downregulated in at least two reports. MiRNA expression profiling of human tumours has identified signatures associated with diagnosis, staging, progression, prognosis, and response to treatment. Based on the most important findings we discuss the possible use of miRNAs as clinical biomarkers in the management of cancer patients for diagnosis, prognosis, and response to therapy.
Collapse
Affiliation(s)
- Jana Ferdin
- Department of Animal Science Biotechnical Faculty, University of Ljubljana, Domzale, Slovenia.
| | | | | |
Collapse
|
4330
|
Role for the MOV10 RNA helicase in polycomb-mediated repression of the INK4a tumor suppressor. Nat Struct Mol Biol 2010; 17:862-8. [PMID: 20543829 PMCID: PMC2929459 DOI: 10.1038/nsmb.1824] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2009] [Accepted: 04/02/2010] [Indexed: 12/31/2022]
Abstract
Several lines of evidence point to a role for non-coding RNA in transcriptional repression by Polycomb group (PcG) proteins but the precise mechanism remains unclear. Here we show that MOV10, a putative RNA helicase previously implicated in post-transcriptional gene silencing, co-purifies and interacts with components of Polycomb-repressive complex 1 (PRC1) from human cells. Endogenous MOV10 is mostly nuclear and a proportion associates with chromatin in an RNA-dependent manner. ShRNA-mediated knockdown of MOV10 in human fibroblasts leads to up-regulation of the INK4a tumor suppressor, a known target of PcG-mediated repression, accompanied by dissociation of PRC1 proteins from the locus and a reduction in trimethylation of histone H3 on lysine 27 (H3K27me3). As well as prompting reassessment of MOV10's role in other settings, our findings suggest that it is directly involved in transcriptional silencing by PcG complexes.
Collapse
|
4331
|
Matrajt M. Non-coding RNA in apicomplexan parasites. Mol Biochem Parasitol 2010; 174:1-7. [PMID: 20566348 DOI: 10.1016/j.molbiopara.2010.06.002] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2009] [Revised: 05/29/2010] [Accepted: 06/01/2010] [Indexed: 11/28/2022]
Abstract
In recent years it has became evident that the transcriptome of most species has little protein-coding capacity and that the abundance of non-coding RNA was previously overlooked. Non-coding RNAs were initially thought to be transcriptional noise, however, a growing number of studies is showing that many of these RNAs have important regulatory functions. Here, we review the progress done in apicomplexan parasites in this rapidly growing field.
Collapse
Affiliation(s)
- Mariana Matrajt
- Department of Microbiology and Molecular Genetics, University of Vermont, Stafford Hall, Room 306, 95 Carrigan Drive, Burlington, VT 05405, United States.
| |
Collapse
|
4332
|
Mazar J, Sinha S, Dinger ME, Mattick JS, Perera RJ. Protein-coding and non-coding gene expression analysis in differentiating human keratinocytes using a three-dimensional epidermal equivalent. Mol Genet Genomics 2010; 284:1-9. [PMID: 20499100 DOI: 10.1007/s00438-010-0543-6] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2010] [Accepted: 05/02/2010] [Indexed: 01/27/2023]
Abstract
The epidermal compartment is complex and organized into several strata composed of keratinocytes (KCs), including basal, spinous, granular, and cornified layers. The continuous process of self-renewal and barrier formation is dependent on a homeostatic balance achieved amongst KCs involving proliferation, differentiation, and cell death. To determine genes responsible for initiating and maintaining a cornified epidermis, organotypic cultures comprised entirely of stratified KCs creating epidermal equivalents (EE) were raised from a submerged state to an air/liquid (A/L) interface. Compared to the array profile of submerged cultures containing KCs predominantly in a proliferative (relatively undifferentiated) state, EEs raised to an A/L interface displayed a remarkably consistent and distinct profile of mRNAs. Cultures lifted to an A/L interface triggered the induction of gene groups that regulate proliferation, differentiation, and cell death. Next, differentially expressed microRNAs (miRNAs) and long non-coding (lncRNA) RNAs were identified in EEs. Several differentially expressed miRNAs were validated by qRT-PCR and Northern blots. miRNAs 203, 205 and Let-7b were up-regulated at early time points (6, 18 and 24 h) but down-regulated by 120 h. To study the lncRNA regulation in EEs, we profiled lncRNA expression by microarray and validated the results by qRT-PCR. Although the differential expression of several lncRNAs is suggestive of a role in epidermal differentiation, their biological functions remain to be elucidated. The current studies lay the foundation for relevant model systems to address such fundamentally important biological aspects of epidermal structure and function in normal and diseased human skin.
Collapse
Affiliation(s)
- Joseph Mazar
- Sanford Burnham Medical Research Institute, Orlando, FL 32827, USA
| | | | | | | | | |
Collapse
|
4333
|
Moroz LL, Kohn AB. Do different neurons age differently? Direct genome-wide analysis of aging in single identified cholinergic neurons. Front Aging Neurosci 2010; 2. [PMID: 20725513 PMCID: PMC2910937 DOI: 10.3389/neuro.24.006.2010] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2009] [Accepted: 01/27/2010] [Indexed: 11/24/2022] Open
Abstract
Aplysia californica is a powerful experimental system to study the entire scope of genomic and epigenomic regulation at the resolution of single functionally characterized neurons and is an emerging model in the neurobiology of aging. First, we have identified and cloned a number of evolutionarily conserved genes that are age-related, including components of apoptosis and chromatin remodeling. Second, we performed gene expression profiling of different identified cholinergic neurons between young and aged animals. Our initial analysis indicates that two cholinergic neurons (R2 and LPl1) revealed highly differential genome-wide changes following aging suggesting that on the molecular scale different neurons indeed age differently. Each of the neurons tested has a unique subset of genes differentially expressed in older animals, and the majority of differently expressed genes (including those related to apoptosis and Alzheimer's disease) are found in aging neurons of one but not another type. The performed analysis allows us to implicate (i) cell specific changes in histones, (ii) DNA methylation and (iii) regional relocation of RNAs as key processes underlying age-related changes in neuronal functions and synaptic plasticity. These mechanisms can fine-tune the dynamics of long-term chromatin remodeling, or control weakening and the loss of synaptic connections in aging. At the same time our genomic tests revealed evolutionarily conserved gene clusters associated with aging (e.g., apoptosis-, telomere- and redox-dependent processes, insulin and estrogen signaling and water channels).
Collapse
Affiliation(s)
- Leonid L Moroz
- Department of Neuroscience, Evelyn F and William L. McKnight Brain Institute, University of Florida Gainesville, FL, USA
| | | |
Collapse
|
4334
|
van Bakel H, Nislow C, Blencowe BJ, Hughes TR. Most "dark matter" transcripts are associated with known genes. PLoS Biol 2010; 8:e1000371. [PMID: 20502517 PMCID: PMC2872640 DOI: 10.1371/journal.pbio.1000371] [Citation(s) in RCA: 340] [Impact Index Per Article: 22.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2009] [Accepted: 04/09/2010] [Indexed: 12/18/2022] Open
Abstract
Short-read RNA sequencing in mouse and human tissues shows that most transcripts are encoded within or nearby known genes and that most of the genome is not transcribed. A series of reports over the last few years have indicated that a much larger portion of the mammalian genome is transcribed than can be accounted for by currently annotated genes, but the quantity and nature of these additional transcripts remains unclear. Here, we have used data from single- and paired-end RNA-Seq and tiling arrays to assess the quantity and composition of transcripts in PolyA+ RNA from human and mouse tissues. Relative to tiling arrays, RNA-Seq identifies many fewer transcribed regions (“seqfrags”) outside known exons and ncRNAs. Most nonexonic seqfrags are in introns, raising the possibility that they are fragments of pre-mRNAs. The chromosomal locations of the majority of intergenic seqfrags in RNA-Seq data are near known genes, consistent with alternative cleavage and polyadenylation site usage, promoter- and terminator-associated transcripts, or new alternative exons; indeed, reads that bridge splice sites identified 4,544 new exons, affecting 3,554 genes. Most of the remaining seqfrags correspond to either single reads that display characteristics of random sampling from a low-level background or several thousand small transcripts (median length = 111 bp) present at higher levels, which also tend to display sequence conservation and originate from regions with open chromatin. We conclude that, while there are bona fide new intergenic transcripts, their number and abundance is generally low in comparison to known exons, and the genome is not as pervasively transcribed as previously reported. The human genome was sequenced a decade ago, but its exact gene composition remains a subject of debate. The number of protein-coding genes is much lower than initially expected, and the number of distinct transcripts is much larger than the number of protein-coding genes. Moreover, the proportion of the genome that is transcribed in any given cell type remains an open question: results from “tiling” microarray analyses suggest that transcription is pervasive and that most of the genome is transcribed, whereas new deep sequencing-based methods suggest that most transcripts originate from known genes. We have addressed this discrepancy by comparing samples from the same tissues using both technologies. Our analyses indicate that RNA sequencing appears more reliable for transcripts with low expression levels, that most transcripts correspond to known genes or are near known genes, and that many transcripts may represent new exons or aberrant products of the transcription process. We also identify several thousand small transcripts that map outside known genes; their sequences are often conserved and are often encoded in regions of open chromatin. We propose that most of these transcripts may be by-products of the activity of enhancers, which associate with promoters as part of their role as long-range gene regulatory sites. Overall, however, we find that most of the genome is not appreciably transcribed.
Collapse
Affiliation(s)
- Harm van Bakel
- Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario, Canada
| | - Corey Nislow
- Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Benjamin J. Blencowe
- Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Timothy R. Hughes
- Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
- * E-mail:
| |
Collapse
|
4335
|
De Santa F, Barozzi I, Mietton F, Ghisletti S, Polletti S, Tusi BK, Muller H, Ragoussis J, Wei CL, Natoli G. A large fraction of extragenic RNA pol II transcription sites overlap enhancers. PLoS Biol 2010; 8:e1000384. [PMID: 20485488 PMCID: PMC2867938 DOI: 10.1371/journal.pbio.1000384] [Citation(s) in RCA: 659] [Impact Index Per Article: 43.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2009] [Accepted: 04/21/2010] [Indexed: 01/17/2023] Open
Abstract
A substantial fraction of extragenic Pol II transcription sites coincides with transcriptional enhancers, which may be relevant for functional annotation of mammalian genomes. Mammalian genomes are pervasively transcribed outside mapped protein-coding genes. One class of extragenic transcription products is represented by long non-coding RNAs (lncRNAs), some of which result from Pol_II transcription of bona-fide RNA genes. Whether all lncRNAs described insofar are products of RNA genes, however, is still unclear. Here we have characterized transcription sites located outside protein-coding genes in a highly regulated response, macrophage activation by endotoxin. Using chromatin signatures, we could unambiguously classify extragenic Pol_II binding sites as belonging to either canonical RNA genes or transcribed enhancers. Unexpectedly, 70% of extragenic Pol_II peaks were associated with genomic regions with a canonical chromatin signature of enhancers. Enhancer-associated extragenic transcription was frequently adjacent to inducible inflammatory genes, was regulated in response to endotoxin stimulation, and generated very low abundance transcripts. Moreover, transcribed enhancers were under purifying selection and contained binding sites for inflammatory transcription factors, thus suggesting their functionality. These data demonstrate that a large fraction of extragenic Pol_II transcription sites can be ascribed to cis-regulatory genomic regions. Discrimination between lncRNAs generated by canonical RNA genes and products of transcribed enhancers will provide a framework for experimental approaches to lncRNAs and help complete the annotation of mammalian genomes. Mammalian genomes contain vast intergenic regions that are extensively transcribed and generate various types of short and long non-coding RNAs (ncRNAs). Although in some cases specific functions have been assigned to intergenic transcripts, the functional significance of this transcriptional output remains largely unknown, and the possibility exists that part of this transcription reflects noise generated by random collisions of the transcriptional machinery with the genome to generate meaningless transcription. In this study we used chromatin signatures to characterize extragenic transcription sites targeted by RNA Polymerase II (RNA Pol II) in a highly regulated response—endotoxin activation of macrophages. We found that a significant portion of extragenic transcription sites are associated with the chromatin signature characteristic of enhancers. Consistent with their chromatin signature, we found that these extragenic transcription sites are under purifying selection and contain binding sites for inflammatory transcription factors, as well as for PU.1, a hematopoietic transcription factor that marks enhancers in macrophages. Moreover, much of this extragenic transcription is regulated by stimulation. We also identified hundreds of transcribed regions with a signature of canonical RNA genes. Our data indicate that extragenic transcription sites can be efficiently classified using chromatin signatures, which will be relevant for functional annotation of mammalian genomes.
Collapse
Affiliation(s)
- Francesca De Santa
- Department of Experimental Oncology, European Institute of Oncology (IEO) Campus IFOM-IEO, Milan, Italy
| | - Iros Barozzi
- Department of Experimental Oncology, European Institute of Oncology (IEO) Campus IFOM-IEO, Milan, Italy
| | - Flore Mietton
- Department of Experimental Oncology, European Institute of Oncology (IEO) Campus IFOM-IEO, Milan, Italy
| | - Serena Ghisletti
- Department of Experimental Oncology, European Institute of Oncology (IEO) Campus IFOM-IEO, Milan, Italy
| | - Sara Polletti
- Department of Experimental Oncology, European Institute of Oncology (IEO) Campus IFOM-IEO, Milan, Italy
| | - Betsabeh Khoramian Tusi
- Department of Experimental Oncology, European Institute of Oncology (IEO) Campus IFOM-IEO, Milan, Italy
| | - Heiko Muller
- Department of Experimental Oncology, European Institute of Oncology (IEO) Campus IFOM-IEO, Milan, Italy
| | - Jiannis Ragoussis
- Genomics Laboratory, Wellcome Trust Centre for Human Genetics (WTCHG), University of Oxford, Oxford, United Kingdom
| | - Chia-Lin Wei
- Genome Technology and Biology Group, Genome Institute of Singapore, Singapore
| | - Gioacchino Natoli
- Department of Experimental Oncology, European Institute of Oncology (IEO) Campus IFOM-IEO, Milan, Italy
- * E-mail:
| |
Collapse
|
4336
|
Edwards JK, Pasqualini R, Arap W, Calin GA. MicroRNAs and ultraconserved genes as diagnostic markers and therapeutic targets in cancer and cardiovascular diseases. J Cardiovasc Transl Res 2010; 3:271-9. [PMID: 20560048 DOI: 10.1007/s12265-010-9179-5] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/12/2010] [Accepted: 03/01/2010] [Indexed: 12/19/2022]
Abstract
MicroRNAs (miRNAs), approximately 19-25 nucleotides in length, are posttranscriptional regulators of protein expression that target and inhibit translation of messenger (m) RNAs. Recent research on miRNAs has produced a plethora of new material on the role of miRNAs in disease. Deregulation or ablation of miRNA expression has led to major pathologies including heart disease and cancer. Signatures of differential miRNA expression have been uncovered for nearly every disease. Recent research has focused on exploitation of the selectivity of these signatures as markers of disease and for therapeutic applications. The significance of additional mechanisms of abnormal posttranscriptional regulation, such as ultraconserved genes (UCGs), has recently been recognized. This review focuses on the identification of aberrant posttranscriptional regulators (miRNAs and UCGs) in cancer and cardiovascular disease and addresses the applications of this work towards diagnosis and therapy.
Collapse
Affiliation(s)
- Julianna K Edwards
- David H. Koch Center, University of Texas M. D. Anderson Cancer Center, Houston, TX, USA
| | | | | | | |
Collapse
|
4337
|
Calderwood SK, Xie Y, Wang X, Khaleque MA, Chou SD, Murshid A, Prince T, Zhang Y. Signal Transduction Pathways Leading to Heat Shock Transcription. ACTA ACUST UNITED AC 2010; 2:13-24. [PMID: 21687820 DOI: 10.4137/sti.s3994] [Citation(s) in RCA: 63] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Heat shock proteins (HSP) are essential for intracellular protein folding during stress and protect cells from denaturation and aggregation cascades that can lead to cell death. HSP genes are regulated at the transcriptional level by heat shock transcription factor 1 (HSF1) that is activated by stress and binds to heat shock elements in HSP genes. The activation of HSF1 during heat shock involves conversion from an inert monomer to a DNA binding trimer through a series of intramolecular folding rearrangements. However, the trigger for HSF1 at the molecular level is unclear and hypotheses for this process include reversal of feedback inhibition of HSF1 by molecular chaperones and heat-induced binding to large non-coding RNAs. Heat shock also causes a profound modulation in cell signaling pathways that lead to protein kinase activation and phosphorylation of HSF1 at a number of regulatory serine residues. HSP genes themselves exist in an accessible chromatin conformation already bound to RNA polymerase II. The RNA polymerase II is paused on HSP promoters after transcribing a short RNA sequence proximal to the promoter. Activation by heat shock involves HSF1 binding to the promoter and release of the paused RNA polymerase II followed by further rounds of transcriptional initiation and elongation. HSF1 is thus involved in both initiation and elongation of HSP RNA transcripts. Recent studies indicate important roles for histone modifications on HSP genes during heat shock. Histone modification occurs rapidly after stress and may be involved in promoting nucleosome remodeling on HSP promoters and in the open reading frames of HSP genes. Understanding these processes may be key to evaluating mechanisms of deregulated HSP expression that plays a key role in neurodegeneration and cancer.
Collapse
Affiliation(s)
- S K Calderwood
- Division of Molecular and Cellular Radiation Oncology, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA 02215, USA
| | | | | | | | | | | | | | | |
Collapse
|
4338
|
Waldman YY, Tuller T, Shlomi T, Sharan R, Ruppin E. Translation efficiency in humans: tissue specificity, global optimization and differences between developmental stages. Nucleic Acids Res 2010; 38:2964-74. [PMID: 20097653 PMCID: PMC2875035 DOI: 10.1093/nar/gkq009] [Citation(s) in RCA: 56] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2009] [Revised: 01/01/2010] [Accepted: 01/05/2010] [Indexed: 01/22/2023] Open
Abstract
Various studies in unicellular and multicellular organisms have shown that codon bias plays a significant role in translation efficiency (TE) by co-adaptation to the tRNA pool. Yet, in humans and other mammals the role of codon bias is still an open question, with contradictory results from different studies. Here we address this question, performing a large-scale tissue-specific analysis of TE in humans, using the tRNA Adaptation Index (tAI) as a direct measure for TE. We find tAI to significantly correlate with expression levels both in tissue-specific and in global expression measures, testifying to the TE of human tissues. Interestingly, we find significantly higher correlations in adult tissues as opposed to fetal tissues, suggesting that the tRNA pool is more adjusted to the adult period. Optimization based analysis suggests that the tRNA pool-codon bias co-adaptation is globally (and not tissue-specific) driven. Additionally, we find that tAI correlates with several measures related to the protein functionally importance, including gene essentiality. Using inferred tissue-specific tRNA pools lead to similar results and shows that tissue-specific genes are more adapted to their tRNA pool than other genes and that related sets of functional gene groups are translated efficiently in each tissue. Similar results are obtained for other mammals. Taken together, these results demonstrate the role of codon bias in TE in humans, and pave the way for future studies of tissue-specific TE in multicellular organisms.
Collapse
Affiliation(s)
- Yedael Y. Waldman
- Blavatnik School of Computer Science, Department of Molecular Microbiology and Biotechnology and School of Medicine, Tel Aviv University, Ramat Aviv 69978, Israel, Computer Science Department, Technion – Israel Institute of Technology, Haifa 32000, Israel
| | - Tamir Tuller
- Blavatnik School of Computer Science, Department of Molecular Microbiology and Biotechnology and School of Medicine, Tel Aviv University, Ramat Aviv 69978, Israel, Computer Science Department, Technion – Israel Institute of Technology, Haifa 32000, Israel
| | - Tomer Shlomi
- Blavatnik School of Computer Science, Department of Molecular Microbiology and Biotechnology and School of Medicine, Tel Aviv University, Ramat Aviv 69978, Israel, Computer Science Department, Technion – Israel Institute of Technology, Haifa 32000, Israel
| | - Roded Sharan
- Blavatnik School of Computer Science, Department of Molecular Microbiology and Biotechnology and School of Medicine, Tel Aviv University, Ramat Aviv 69978, Israel, Computer Science Department, Technion – Israel Institute of Technology, Haifa 32000, Israel
| | - Eytan Ruppin
- Blavatnik School of Computer Science, Department of Molecular Microbiology and Biotechnology and School of Medicine, Tel Aviv University, Ramat Aviv 69978, Israel, Computer Science Department, Technion – Israel Institute of Technology, Haifa 32000, Israel
| |
Collapse
|
4339
|
Risueño A, Fontanillo C, Dinger ME, De Las Rivas J. GATExplorer: genomic and transcriptomic explorer; mapping expression probes to gene loci, transcripts, exons and ncRNAs. BMC Bioinformatics 2010; 11:221. [PMID: 20429936 PMCID: PMC2875241 DOI: 10.1186/1471-2105-11-221] [Citation(s) in RCA: 71] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2009] [Accepted: 04/29/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Genome-wide expression studies have developed exponentially in recent years as a result of extensive use of microarray technology. However, expression signals are typically calculated using the assignment of "probesets" to genes, without addressing the problem of "gene" definition or proper consideration of the location of the measuring probes in the context of the currently known genomes and transcriptomes. Moreover, as our knowledge of metazoan genomes improves, the number of both protein-coding and noncoding genes, as well as their associated isoforms, continues to increase. Consequently, there is a need for new databases that combine genomic and transcriptomic information and provide updated mapping of expression probes to current genomic annotations. RESULTS GATExplorer (Genomic and Transcriptomic Explorer) is a database and web platform that integrates a gene loci browser with nucleotide level mappings of oligo probes from expression microarrays. It allows interactive exploration of gene loci, transcripts and exons of human, mouse and rat genomes, and shows the specific location of all mappable Affymetrix microarray probes and their respective expression levels in a broad set of biological samples. The web site allows visualization of probes in their genomic context together with any associated protein-coding or noncoding transcripts. In the case of all-exon arrays, this provides a means by which the expression of the individual exons within a gene can be compared, thereby facilitating the identification and analysis of alternatively spliced exons. The application integrates data from four major source databases: Ensembl, RNAdb, Affymetrix and GeneAtlas; and it provides the users with a series of files and packages (R CDFs) to analyze particular query expression datasets. The maps cover both the widely used Affymetrix GeneChip microarrays based on 3' expression (e.g. human HG U133 series) and the all-exon expression microarrays (Gene 1.0 and Exon 1.0). CONCLUSIONS GATExplorer is an integrated database that combines genomic/transcriptomic visualization with nucleotide-level probe mapping. By considering expression at the nucleotide level rather than the gene level, it shows that the arrays detect expression signals from entities that most researchers do not contemplate or discriminate. This approach provides the means to undertake a higher resolution analysis of microarray data and potentially extract considerably more detailed and biologically accurate information from existing and future microarray experiments.
Collapse
Affiliation(s)
- Alberto Risueño
- Bioinformatics and Functional Genomics Research Group, Cancer Research Center (CiC-IBMCC, CSIC/USAL), Salamanca, Spain
| | | | | | | |
Collapse
|
4340
|
Mining mammalian transcript data for functional long non-coding RNAs. PLoS One 2010; 5:e10316. [PMID: 20428234 PMCID: PMC2859052 DOI: 10.1371/journal.pone.0010316] [Citation(s) in RCA: 112] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2009] [Accepted: 03/30/2010] [Indexed: 12/11/2022] Open
Abstract
Background The role of long non-coding RNAs (lncRNAs) in controlling gene expression has garnered increased interest in recent years. Sequencing projects, such as Fantom3 for mouse and H-InvDB for human, have generated abundant data on transcribed components of mammalian cells, the majority of which appear not to be protein-coding. However, much of the non-protein-coding transcriptome could merely be a consequence of ‘transcription noise’. It is therefore essential to use bioinformatic approaches to identify the likely functional candidates in a high throughput manner. Principal Findings We derived a scheme for classifying and annotating likely functional lncRNAs in mammals. Using the available experimental full-length cDNA data sets for human and mouse, we identified 78 lncRNAs that are either syntenically conserved between human and mouse, or that originate from the same protein-coding genes. Of these, 11 have significant sequence homology. We found that these lncRNAs exhibit: (i) patterns of codon substitution typical of non-coding transcripts; (ii) preservation of sequences in distant mammals such as dog and cow, (iii) significant sequence conservation relative to their corresponding flanking regions (in 50% cases, flanking regions do not have homology at all; and in the remaining, the degree of conservation is significantly less); (iv) existence mostly as single-exon forms (8/11); and, (v) presence of conserved and stable secondary structure motifs within them. We further identified orthologous protein-coding genes that are contributing to the pool of lncRNAs; of which, genes implicated in carcinogenesis are significantly over-represented. Conclusion Our comparative mammalian genomics approach coupled with evolutionary analysis identified a small population of conserved long non-protein-coding RNAs (lncRNAs) that are potentially functional across Mammalia. Additionally, our analysis indicates that amongst the orthologous protein-coding genes that produce lncRNAs, those implicated in cancer pathogenesis are significantly over-represented, suggesting that these lncRNAs could play an important role in cancer pathomechanisms.
Collapse
|
4341
|
Yan MSC, Matouk CC, Marsden PA. Epigenetics of the vascular endothelium. J Appl Physiol (1985) 2010; 109:916-26. [PMID: 20413423 DOI: 10.1152/japplphysiol.00131.2010] [Citation(s) in RCA: 61] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Classical models of transcription in vascular endothelial cells, specifically the cis/trans paradigm, have limitations. For instance, how does the environment have chronic effects on gene expression in endothelial cells after weeks or years? When an endothelial cell divides, how is this information transmitted to daughter cells? Epigenetics refers to chromatin-based pathways important in the regulation of gene expression and includes three distinct, but highly interrelated, mechanisms: DNA methylation, histone density and posttranslational modifications, and RNA-based mechanisms. Together they offer a newer perspective on transcriptional control paradigms in vascular endothelial cells and provide a molecular basis for understanding how the environment impacts the genome to modify disease susceptibility. This alternative viewpoint for transcriptional regulation allows a reassessment of the cis/trans model and even helps explain some of its limitations. This review provides an introduction to epigenetic concepts for vascular biologists and uses topical examples in cell biology to provide insight into how cell types or even whole organisms, such as monozygotic human twins with the same DNA sequence, can exhibit heterogeneous patterns of gene expression, phenotype, or diseases prevalence. Using endothelial nitric oxide synthase (NOS3) as an example, we examine the growing body of evidence implicating epigenetic pathways in the control of vascular endothelial gene expression in health and disease.
Collapse
Affiliation(s)
- Matthew Shu-Ching Yan
- Department of Medical Biophysics, Rm 7358, Medical Sciences Bldg., University of Toronto, and Department of Medicine, St. Michael's Hospital, 1 King's College Circle, Toronto, Ontario M5S 1A8, Canada
| | | | | |
Collapse
|
4342
|
Mondal T, Rasmussen M, Pandey GK, Isaksson A, Kanduri C. Characterization of the RNA content of chromatin. Genome Res 2010; 20:899-907. [PMID: 20404130 DOI: 10.1101/gr.103473.109] [Citation(s) in RCA: 157] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Noncoding RNA (ncRNA) constitutes a significant portion of the mammalian transcriptome. Emerging evidence suggests that it regulates gene expression in cis or trans by modulating the chromatin structure. To uncover the functional role of ncRNA in chromatin organization, we deep sequenced chromatin-associated RNAs (CARs) from human fibroblast (HF) cells. This resulted in the identification of 141 intronic regions and 74 intergenic regions harboring CARs. The intronic and intergenic CARs show significant conservation across 44 species of placental mammals. Functional characterization of one of the intergenic CARs, Intergenic10, revealed that it regulates gene expression of neighboring genes through modulating the chromatin structure in cis. Our data suggest that ncRNA is an integral component of chromatin and that it may regulate various biological functions through fine-tuning of the chromatin architecture.
Collapse
Affiliation(s)
- Tanmoy Mondal
- Department of Genetics and Pathology, Uppsala University, Uppsala S-75185, Sweden
| | | | | | | | | |
Collapse
|
4343
|
Majer A, Booth SA. Computational methodologies for studying non-coding RNAs relevant to central nervous system function and dysfunction. Brain Res 2010; 1338:131-45. [PMID: 20381467 DOI: 10.1016/j.brainres.2010.03.095] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2009] [Revised: 03/19/2010] [Accepted: 03/26/2010] [Indexed: 12/21/2022]
Abstract
Non-coding RNAs (ncRNAs) are a large and diverse group of transcripts that span the eukaryotic genome, of which less than 2% encodes proteins. Several distinct families of ncRNAs have been described and implicated in many aspects of central nervous system (CNS) function including translation, RNA metabolism, gene regulation, and development. The need to distinguish ncRNAs from sequence data, as well as potentially uncovering novel ncRNA families, has ignited the development of customized computational approaches and bioinformatic resources to handle these tasks. In this review, we provide an overview of the numerous procedures developed to predict ncRNAs based on their primary sequence and predicted secondary structure. These methodologies are broadly grouped into genome scanning algorithms, mixed approaches, and machine learning algorithms. Regulatory ncRNAs, particularly microRNAs (miRNAs), are a major focus of current research efforts and this review will therefore center on the prediction of miRNAs and the putative gene targets they act upon. With the advent of ultra high-throughput sequencing technologies 'deep sequencing' has emerged as the cutting-edge method for ncRNA identification and we will also touch on some computational resources that play a key role in analysis of this type of data.
Collapse
Affiliation(s)
- Anna Majer
- Department of Medical Microbiology and Infectious Diseases, Faculty of Medicine, University of Manitoba, Manitoba, Canada
| | | |
Collapse
|
4344
|
Cunnington MS, Santibanez Koref M, Mayosi BM, Burn J, Keavney B. Chromosome 9p21 SNPs Associated with Multiple Disease Phenotypes Correlate with ANRIL Expression. PLoS Genet 2010; 6:e1000899. [PMID: 20386740 PMCID: PMC2851566 DOI: 10.1371/journal.pgen.1000899] [Citation(s) in RCA: 299] [Impact Index Per Article: 19.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2009] [Accepted: 03/08/2010] [Indexed: 12/13/2022] Open
Abstract
Single nucleotide polymorphisms (SNPs) on chromosome 9p21 are associated with coronary artery disease, diabetes, and multiple cancers. Risk SNPs are mainly non-coding, suggesting that they influence expression and may act in cis. We examined the association between 56 SNPs in this region and peripheral blood expression of the three nearest genes CDKN2A, CDKN2B, and ANRIL using total and allelic expression in two populations of healthy volunteers: 177 British Caucasians and 310 mixed-ancestry South Africans. Total expression of the three genes was correlated (P<0.05), suggesting that they are co-regulated. SNP associations mapped by allelic and total expression were similar (r = 0.97, P = 4.8×10−99), but the power to detect effects was greater for allelic expression. The proportion of expression variance attributable to cis-acting effects was 8% for CDKN2A, 5% for CDKN2B, and 20% for ANRIL. SNP associations were similar in the two populations (r = 0.94, P = 10−72). Multiple SNPs were independently associated with expression of each gene (P<0.05 after correction for multiple testing), suggesting that several sites may modulate disease susceptibility. Individual SNPs correlated with changes in expression up to 1.4-fold for CDKN2A, 1.3-fold for CDKN2B, and 2-fold for ANRIL. Risk SNPs for coronary disease, stroke, diabetes, melanoma, and glioma were all associated with allelic expression of ANRIL (all P<0.05 after correction for multiple testing), while association with the other two genes was only detectable for some risk SNPs. SNPs had an inverse effect on ANRIL and CDKN2B expression, supporting a role of antisense transcription in CDKN2B regulation. Our study suggests that modulation of ANRIL expression mediates susceptibility to several important human diseases. Genetic variants on chromosome 9p21 have been associated with several important diseases including coronary artery disease, diabetes, and multiple cancers. Most of the risk variants in this region do not alter any protein sequence and are therefore likely to act by influencing the expression of nearby genes. We investigated whether chromosome 9p21 variants are correlated with expression of the three nearest genes (CDKN2A, CDKN2B, and ANRIL) which might mediate the association with disease. Using two different techniques to study effects on expression in blood from two separate populations of healthy volunteers, we show that variants associated with disease are all correlated with ANRIL expression, but associations with the other two genes are weaker and less consistent. Multiple genetic variants are independently associated with expression of all three genes. Although total expression levels of CDKN2A, CDKN2B, and ANRIL are positively correlated, individual genetic variants influence ANRIL and CDKN2B expression in opposite directions, suggesting a possible role of ANRIL in CDKN2B regulation. Our study suggests that modulation of ANRIL expression mediates susceptibility to several important human diseases.
Collapse
Affiliation(s)
- Michael S Cunnington
- Institute of Human Genetics, Newcastle University, Newcastle upon Tyne, United Kingdom
| | | | | | | | | |
Collapse
|
4345
|
Qureshi IA, Mattick JS, Mehler MF. Long non-coding RNAs in nervous system function and disease. Brain Res 2010; 1338:20-35. [PMID: 20380817 DOI: 10.1016/j.brainres.2010.03.110] [Citation(s) in RCA: 357] [Impact Index Per Article: 23.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2010] [Revised: 03/26/2010] [Accepted: 03/30/2010] [Indexed: 12/14/2022]
Abstract
Central nervous system (CNS) development, homeostasis, stress responses, and plasticity are all mediated by epigenetic mechanisms that modulate gene expression and promote selective deployment of functional gene networks in response to complex profiles of interoceptive and environmental signals. Thus, not surprisingly, disruptions of these epigenetic processes are implicated in the pathogenesis of a spectrum of neurological and psychiatric diseases. Epigenetic mechanisms involve chromatin remodeling by relatively generic complexes that catalyze DNA methylation and various types of histone modifications. There is increasing evidence that these complexes are directed to their sites of action by long non-protein-coding RNAs (lncRNAs), of which there are tens if not hundreds of thousands specified in the genome. LncRNAs are transcribed in complex intergenic, overlapping and antisense patterns relative to adjacent protein-coding genes, suggesting that many lncRNAs regulate the expression of these genes. LncRNAs also participate in a wide array of subcellular processes, including the formation and function of cellular organelles. Most lncRNAs are transcribed in a developmentally regulated and cell type specific manner, particularly in the CNS, wherein over half of all lncRNAs are expressed. While the numerous biological functions of lncRNAs are yet to be characterized fully, a number of recent studies suggest that lnRNAs are important for mediating cell identity. This function seems to be especially important for generating the enormous array of regional neuronal and glial cell subtypes that are present in the CNS. Further studies have also begun to elucidate additional roles played by lncRNAs in CNS processes, including homeostasis, stress responses and plasticity. Herein, we review emerging evidence that highlights the expression and function of lncRNAs in the CNS and suggests that lncRNA deregulation is an important factor in various CNS pathologies including neurodevelopmental, neurodegenerative and neuroimmunological disorders, primary brain tumors, and psychiatric diseases.
Collapse
Affiliation(s)
- Irfan A Qureshi
- Rosyln and Leslie Goldstein Laboratory for Stem Cell Biology and Regenerative Medicine, Albert Einstein College of Medicine, 1410 Pelham Parkway South, Bronx, NY 10461, USA
| | | | | |
Collapse
|
4346
|
Koerner MV, Barlow DP. Genomic imprinting-an epigenetic gene-regulatory model. Curr Opin Genet Dev 2010; 20:164-70. [PMID: 20153958 PMCID: PMC2860637 DOI: 10.1016/j.gde.2010.01.009] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2009] [Revised: 01/15/2010] [Accepted: 01/24/2010] [Indexed: 11/24/2022]
Abstract
Epigenetic mechanisms (Box 1) are considered to play major gene-regulatory roles in development, differentiation and disease. However, the relative importance of epigenetics in defining the mammalian transcriptome in normal and disease states is unknown. The mammalian genome contains only a few model systems where epigenetic gene regulation has been shown to play a major role in transcriptional control. These model systems are important not only to investigate the biological function of known epigenetic modifications but also to identify new and unexpected epigenetic mechanisms in the mammalian genome. Here we review recent progress in understanding how epigenetic mechanisms control imprinted gene expression.
Collapse
Affiliation(s)
- Martha V Koerner
- Research Center for Molecular Medicine of the Austrian Academy of Sciences, Vienna Biocenter, Austria
| | | |
Collapse
|
4347
|
Abstract
The X-linked region now known as the "X-inactivation center" (Xic) was once dominated by protein-coding genes but, with the rise of Eutherian mammals some 150-200 million years ago, became infiltrated by genes that produce long noncoding RNA (ncRNA). Some of the noncoding genes have been shown to play crucial roles during X-chromosome inactivation (XCI), including the targeting of chromatin modifiers to the X. The rapid establishment of ncRNA hints at a possible preference for long transcripts in some aspects of epigenetic regulation. This article discusses the role of RNA in XCI and considers the advantages RNA offers in delivering allelic, cis-limited, and locus-specific control. Unlike proteins and small RNAs, long ncRNAs are tethered to the site of transcription and effectively tag the allele of origin. Furthermore, long ncRNAs are drawn from larger sequence space than proteins and can mark a unique region in a complex genome. Thus, like their small RNA cousins, long ncRNAs may emerge as versatile and powerful regulators of the epigenome.
Collapse
|
4348
|
Deep sequencing of coding and non-coding RNA in the CNS. Brain Res 2010; 1338:146-54. [PMID: 20307502 DOI: 10.1016/j.brainres.2010.03.039] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2009] [Revised: 03/10/2010] [Accepted: 03/15/2010] [Indexed: 01/23/2023]
Abstract
Several methods now exist for identifying and quantifying many biological events in parallel and in a relatively unbiased fashion. For gene expression experiments, cloning approaches have been supplemented with microarray platforms over the past few years. The focus of this review is on deep sequencing, a new set of techniques that can be used to both identify RNA species and quantify them in a massively parallel fashion. Deep sequencing has some advantages over other methods, driven largely by the high depth of coverage for any library of nucleic acids. This allows, for example, estimates of alternative splicing and untranslated region utilization. We will discuss how deep sequencing methods are being applied to characterization of gene expression in the brain and how these technologies might develop over the next few years.
Collapse
|
4349
|
Abstract
Initial gene discovery efforts through analysis of genome sequences and identification and characterization of expressed RNAs have revealed that only a relatively small portion of the genome is transcribed into protein coding mRNAs in vertebrates. However, in contrast with this paucity of protein coding ‘genes’, there is an enormous complexity in transcription and the protein coding mRNAs contribute to a very small fraction of transcripts in comparison with the different varieties of non-coding RNAs (ncRNAs). This transcriptome complexity may be hypothesized to have a regulatory role that is required for the development and function of organisms as complex as vertebrates. At the same time, it raises the fundamental question of the unequivocal definition of a gene. It is intriguing to postulate that many ncRNAs might finely modulate gene activity by acting as regulatory elements. The emerging hypotheses suggest that the gene regulatory machinery may be deeply interconnected with the world of short RNAs. These RNAs may generally act for fine-tuning of the protein-coding transcriptome.
Collapse
Affiliation(s)
- Piero Carninci
- Omics Science Center, RIKEN Yokohama Institute, Kanagawa, Japan.
| |
Collapse
|
4350
|
Licastro D, Gennarino VA, Petrera F, Sanges R, Banfi S, Stupka E. Promiscuity of enhancer, coding and non-coding transcription functions in ultraconserved elements. BMC Genomics 2010; 11:151. [PMID: 20202189 PMCID: PMC2847969 DOI: 10.1186/1471-2164-11-151] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2009] [Accepted: 03/04/2010] [Indexed: 11/10/2022] Open
Abstract
Background Ultraconserved elements (UCEs) are highly constrained elements of mammalian genomes, whose functional role has not been completely elucidated yet. Previous studies have shown that some of them act as enhancers in mouse, while some others are expressed in both normal and cancer-derived human tissues. Only one UCE element so far was shown to present these two functions concomitantly, as had been observed in other isolated instances of single, non ultraconserved enhancer elements. Results We used a custom microarray to assess the levels of UCE transcription during mouse development and integrated these data with published microarray and next-generation sequencing datasets as well as with newly produced PCR validation experiments. We show that a large fraction of non-exonic UCEs is transcribed across all developmental stages examined from only one DNA strand. Although the nature of these transcripts remains a mistery, our meta-analysis of RNA-Seq datasets indicates that they are unlikely to be short RNAs and that some of them might encode nuclear transcripts. In the majority of cases this function overlaps with the already established enhancer function of these elements during mouse development. Utilizing several next-generation sequencing datasets, we were further able to show that the level of expression observed in non-exonic UCEs is significantly higher than in random regions of the genome and that this is also seen in other regions which act as enhancers. Conclusion Our data shows that the concurrent presence of enhancer and transcript function in non-exonic UCE elements is more widespread than previously shown. Moreover through our own experiments as well as the use of next-generation sequencing datasets, we were able to show that the RNAs encoded by non-exonic UCEs are likely to be long RNAs transcribed from only one DNA strand.
Collapse
Affiliation(s)
- Danilo Licastro
- Telethon Institute of Genetics and Medicine (TIGEM), via Pietro Castellino 111, 80131 Napoli, Italy
| | | | | | | | | | | |
Collapse
|