Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Numata K, Kanai A, Saito R, Kondo S, Adachi J, Wilming LG, Hume DA, Hayashizaki Y, Tomita M. Identification of putative noncoding RNAs among the RIKEN mouse full-length cDNA collection. Genome Res 2003;13:1301-6. [PMID: 12819127 PMCID: PMC403720 DOI: 10.1101/gr.1011603] [Citation(s) in RCA: 115] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

For:	Numata K, Kanai A, Saito R, Kondo S, Adachi J, Wilming LG, Hume DA, Hayashizaki Y, Tomita M. Identification of putative noncoding RNAs among the RIKEN mouse full-length cDNA collection. Genome Res 2003;13:1301-6. [PMID: 12819127 PMCID: PMC403720 DOI: 10.1101/gr.1011603] [Citation(s) in RCA: 115] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

Number

Cited by Other Article(s)

Louro R, El-Jundi T, Nakaya HI, Reis EM, Verjovski-Almeida S. Conserved tissue expression signatures of intronic noncoding RNAs transcribed from human and mouse loci. Genomics 2008;92:18-25. [DOI: 10.1016/j.ygeno.2008.03.013] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2008] [Revised: 03/25/2008] [Accepted: 03/28/2008] [Indexed: 12/15/2022]

Kim M, Patel B, Schroeder KE, Raza A, Dejong J. Organization and transcriptional output of a novel mRNA-like piRNA gene (mpiR) located on mouse chromosome 10. RNA (NEW YORK, N.Y.) 2008;14:1005-1011. [PMID: 18441047 PMCID: PMC2390792 DOI: 10.1261/rna.974608] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2007] [Accepted: 02/21/2008] [Indexed: 05/26/2023]

He S, Su H, Liu C, Skogerbø G, He H, He D, Zhu X, Liu T, Zhao Y, Chen R. MicroRNA-encoding long non-coding RNAs. BMC Genomics 2008;9:236. [PMID: 18492288 PMCID: PMC2410135 DOI: 10.1186/1471-2164-9-236] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2007] [Accepted: 05/21/2008] [Indexed: 01/12/2023] Open

Riboregulators in plant development. Biochem Soc Trans 2008;35:1638-42. [PMID: 18031282 DOI: 10.1042/bst0351638] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]

Abstract

npcRNA (non-protein-coding RNAs) are an emerging class of regulators, so-called riboregulators, and include a large diversity of small RNAs [miRNAs (microRNAs)/siRNAs (small interfering RNAs)] that are involved in various developmental processes in plants and animals. In addition, several other npcRNAs encompassing various transcript sizes (up to several kilobases) have been identified using different genomic approaches. Much less is known about the mechanism of action of these other classes of riboregulators also present in the cell. The organogenesis of nitrogen-fixing nodules in legume plants is initiated in specific root cortical cells that express the npcRNA MtENOD40 (Medicago truncatula early nodulin 40). We have identified a novel RBP (RNA-binding protein), MtRBP1 (M. truncatula RBP 1), which interacts with the MtENOD40 RNA, and is exported into the cytoplasm during legume nodule development in the region expressing MtENOD40. A direct involvement of the MtENOD40 RNA in the relocalization of this RBP into cytoplasmic granules could be demonstrated, revealing a new RNA function in the cell. To extend these results, we searched for npcRNAs in the model plant Arabidopsis thaliana whose genome is completely known. We have identified 86 novel npcRNAs from which 27 corresponded to antisense RNAs of known coding regions. Using a dedicated 'macroarray' containing these npcRNAs and a collection of RBPs, we characterized their regulation in different tissues and plants subjected to environmental stresses. Most of the npcRNAs showed high variations in gene expression in contrast with the RBP genes. Recent large-scale analysis of the sRNA component of the transcriptome revealed an enormous diversity of siRNAs/miRNAs in the Arabidopsis genome. Bioinformatic analysis revealed that 34 large npcRNAs are precursors of siRNAs/miRNAs. npcRNAs, which are a sensitive component of the transcriptome, may reveal novel riboregulatory mechanisms involved in post-transcriptional control of differentiation or environmental responses.

Collapse

Roshan U, Chikkagoudar S, Livesay DR. Searching for evolutionary distant RNA homologs within genomic sequences using partition function posterior probabilities. BMC Bioinformatics 2008;9:61. [PMID: 18226231 PMCID: PMC2248559 DOI: 10.1186/1471-2105-9-61] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2007] [Accepted: 01/28/2008] [Indexed: 11/11/2022] Open

Abstract

Background

Identification of RNA homologs within genomic stretches is difficult when pairwise sequence identity is low or unalignable flanking residues are present. In both cases structure-sequence or profile/family-sequence alignment programs become difficult to apply because of unreliable RNA structures or family alignments. As such, local sequence-sequence alignment programs are frequently used instead. We have recently demonstrated that maximal expected accuracy alignments using partition function match probabilities (implemented in Probalign) are significantly better than contemporary methods on heterogeneous length protein sequence datasets, thus suggesting an affinity for local alignment.

Results

We create a pairwise RNA-genome alignment benchmark from RFAM families with average pairwise sequence identity up to 60%. Each dataset contains a query RNA aligned to a target RNA (of the same family) embedded in a genomic sequence at least 5K nucleotides long. To simulate common conditions when exact ends of an ncRNA are unknown, each query RNA has 5' and 3' genomic flanks of size 50, 100, and 150 nucleotides. We subsequently compare the error of the Probalign program (adjusted for local alignment) to the commonly used local alignment programs HMMER, SSEARCH, and BLAST, and the popular ClustalW program with zero end-gap penalties. Parameters were optimized for each program on a small subset of the benchmark. Probalign has overall highest accuracies on the full benchmark. It leads by 10% accuracy over SSEARCH (the next best method) on 5 out of 22 families. On datasets restricted to maximum of 30% sequence identity, Probalign's overall median error is 71.2% vs. 83.4% for SSEARCH (P-value < 0.05). Furthermore, on these datasets Probalign leads SSEARCH by at least 10% on five families; SSEARCH leads Probalign by the same margin on two of the fourteen families. We also demonstrate that the Probalign mean posterior probability, compared to the normalized SSEARCH Z-score, is a better discriminator of alignment quality. All datasets and software are available online.

Conclusion

We demonstrate, for the first time, that partition function match probabilities used for expected accuracy alignment, as done in Probalign, provide statistically significant improvement over current approaches for identifying distantly related RNA sequences in larger genomic segments.

Collapse

Carninci P. Constructing the landscape of the mammalian transcriptome. ACTA ACUST UNITED AC 2008;210:1497-506. [PMID: 17449815 DOI: 10.1242/jeb.000406] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]

Xin Y, Quarta G, Gan HH, Schlick T. Estimating the Fraction of Non-Coding RNAs in Mammalian Transcriptomes. Bioinform Biol Insights 2008;2:75-94. [PMID: 19812767 PMCID: PMC2735967 DOI: 10.4137/bbi.s443] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open

Fast pairwise structural RNA alignments by pruning of the dynamical programming matrix. PLoS Comput Biol 2007;3:1896-908. [PMID: 17937495 PMCID: PMC2014794 DOI: 10.1371/journal.pcbi.0030193] [Citation(s) in RCA: 98] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2007] [Accepted: 08/20/2007] [Indexed: 11/19/2022] Open

Abstract

It has become clear that noncoding RNAs (ncRNA) play important roles in cells, and emerging studies indicate that there might be a large number of unknown ncRNAs in mammalian genomes. There exist computational methods that can be used to search for ncRNAs by comparing sequences from different genomes. One main problem with these methods is their computational complexity, and heuristics are therefore employed. Two heuristics are currently very popular: pre-folding and pre-aligning. However, these heuristics are not ideal, as pre-aligning is dependent on sequence similarity that may not be present and pre-folding ignores the comparative information. Here, pruning of the dynamical programming matrix is presented as an alternative novel heuristic constraint. All subalignments that do not exceed a length-dependent minimum score are discarded as the matrix is filled out, thus giving the advantage of providing the constraints dynamically. This has been included in a new implementation of the FOLDALIGN algorithm for pairwise local or global structural alignment of RNA sequences. It is shown that time and memory requirements are dramatically lowered while overall performance is maintained. Furthermore, a new divide and conquer method is introduced to limit the memory requirement during global alignment and backtrack of local alignment. All branch points in the computed RNA structure are found and used to divide the structure into smaller unbranched segments. Each segment is then realigned and backtracked in a normal fashion. Finally, the FOLDALIGN algorithm has also been updated with a better memory implementation and an improved energy model. With these improvements in the algorithm, the FOLDALIGN software package provides the molecular biologist with an efficient and user-friendly tool for searching for new ncRNAs. The software package is available for download at http://foldalign.ku.dk.

FOLDALIGN is an algorithm for making pairwise structural alignments of RNA sequences. It uses a lightweight energy model and sequence similarity to simultaneously fold and align the sequences. The algorithm can make local and global alignments. The power of structural alignment methods is that they can align sequences where the primary sequences have diverged too much for normal alignment methods to be useful. The structures predicted by structural alignment methods are usually better than the structures predicted by single-sequence folding methods since they can take comparative information into account. The main problem for most structural alignment methods is that they are too computationally expensive. In this paper we introduce the dynamical pruning heuristic that makes the FOLDALIGN method significantly faster without lowering the predictive performance. The memory requirements are also significantly lowered, allowing for the analysis of longer sequences. A user-friendly (still command-line based, though) implementation of the algorithm is available at the Web site: http://foldalign.ku.dk

Collapse

Sakakibara Y, Irie T, Suzuki Y, Yamashita R, Wakaguri H, Kanai A, Chiba J, Takagi T, Mizushima-Sugano J, Hashimoto SI, Nakai K, Sugano S. Intrinsic promoter activities of primary DNA sequences in the human genome. DNA Res 2007;14:71-7. [PMID: 17522093 PMCID: PMC2779894 DOI: 10.1093/dnares/dsm006] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Affiliation(s)

Yuta Sakakibara Graduate School of Frontier Sciences, the University of Tokyo, 4-6-1 Shirokanedai, Minatoku, Tokyo 108-8639, Japan Faculty of Industrial Science and Technology, Tokyo University of Science, 2641 Yamazaki, Noda-shi, Chiba 278-8510, Japan
Takuma Irie Graduate School of Frontier Sciences, the University of Tokyo, 4-6-1 Shirokanedai, Minatoku, Tokyo 108-8639, Japan
Yutaka Suzuki Graduate School of Frontier Sciences, the University of Tokyo, 4-6-1 Shirokanedai, Minatoku, Tokyo 108-8639, Japan To whom correspondence should be addressed. Tel/Fax. +81 4-7136-3607. E-mail:
Riu Yamashita Human Genome Center, The Institute of Medical Science, the University of Tokyo, 4-6-1 Shirokanedai, Minatoku, Tokyo 108-8639, Japan
Hiroyuki Wakaguri Graduate School of Frontier Sciences, the University of Tokyo, 4-6-1 Shirokanedai, Minatoku, Tokyo 108-8639, Japan
Akinori Kanai Graduate School of Frontier Sciences, the University of Tokyo, 4-6-1 Shirokanedai, Minatoku, Tokyo 108-8639, Japan
Joe Chiba Faculty of Industrial Science and Technology, Tokyo University of Science, 2641 Yamazaki, Noda-shi, Chiba 278-8510, Japan
Toshihisa Takagi Graduate School of Frontier Sciences, the University of Tokyo, 4-6-1 Shirokanedai, Minatoku, Tokyo 108-8639, Japan
Junko Mizushima-Sugano Graduate School of Frontier Sciences, the University of Tokyo, 4-6-1 Shirokanedai, Minatoku, Tokyo 108-8639, Japan Laboratory of Viral Infection II, Kitasato Institute for Life Sciences, Kitasato University, 5-9-1 Sirokane Minato-ku, Tokyo 108-8641, Japan
Shin-ichi Hashimoto School of Medicine, the University of Tokyo, 7-3-1 Hongo, Bunkyoku, Tokyo 113-0033, Japan
Kenta Nakai Human Genome Center, The Institute of Medical Science, the University of Tokyo, 4-6-1 Shirokanedai, Minatoku, Tokyo 108-8639, Japan
Sumio Sugano Graduate School of Frontier Sciences, the University of Tokyo, 4-6-1 Shirokanedai, Minatoku, Tokyo 108-8639, Japan

Collapse

Ponjavic J, Ponting CP, Lunter G. Functionality or transcriptional noise? Evidence for selection within long noncoding RNAs. Genome Res 2007;17:556-65. [PMID: 17387145 PMCID: PMC1855172 DOI: 10.1101/gr.6036807] [Citation(s) in RCA: 529] [Impact Index Per Article: 31.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Zhang Z, Pang AWC, Gerstein M. Comparative analysis of genome tiling array data reveals many novel primate-specific functional RNAs in human. BMC Evol Biol 2007;7 Suppl 1:S14. [PMID: 17288572 PMCID: PMC1796608 DOI: 10.1186/1471-2148-7-s1-s14] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open

Abstract

Background

Widespread transcription activities in the human genome were recently observed in high-resolution tiling array experiments, which revealed many novel transcripts that are outside of the boundaries of known protein or RNA genes. Termed as "TARs" (Transcriptionally Active Regions), these novel transcribed regions represent "dark matter" in the genome, and their origin and functionality need to be explained. Many of these transcripts are thought to code for novel proteins or non-protein-coding RNAs. We have applied an integrated bioinformatics approach to investigate the properties of these TARs, including cross-species conservation, and the ability to form stable secondary structures. The goal of this study is to identify a list of potential candidate sequences that are likely to code for functional non-protein-coding RNAs. We are particularly interested in the discovery of those functional RNA candidates that are primate-specific, i.e. those that do not have homologs in the mouse or dog genomes but in rhesus.

Results

Using sequence conservation and the probability of forming stable secondary structures, we have identified ~300 possible candidates for primate-specific noncoding RNAs. We are currently in the process of sequencing the orthologous regions of these candidate sequences in several other primate species. We will then be able to apply a "phylogenetic shadowing" approach to analyze the functionality of these ncRNA candidates.

Conclusion

The existence of potential primate-specific functional transcripts has demonstrated the limitation of previous genome comparison studies, which put too much emphasis on conservation between human and rodents. It also argues for the necessity of sequencing additional primate species to gain a better and more comprehensive understanding of the human genome.

Collapse

Prasanth KV, Spector DL. Eukaryotic regulatory RNAs: an answer to the 'genome complexity' conundrum. Genes Dev 2007;21:11-42. [PMID: 17210785 DOI: 10.1101/gad.1484207] [Citation(s) in RCA: 301] [Impact Index Per Article: 17.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

Geng X, Lavado A, Lagutin OV, Liu W, Oliver G. Expression of Six3 Opposite Strand (Six3OS) during mouse embryonic development. Gene Expr Patterns 2007;7:252-7. [PMID: 17084678 PMCID: PMC1986792 DOI: 10.1016/j.modgep.2006.09.007] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2006] [Revised: 09/18/2006] [Accepted: 09/19/2006] [Indexed: 10/24/2022]

Numata K, Okada Y, Saito R, Kiyosawa H, Kanai A, Tomita M. Comparative analysis of cis-encoded antisense RNAs in eukaryotes. Gene 2006;392:134-41. [PMID: 17250976 DOI: 10.1016/j.gene.2006.12.005] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2006] [Revised: 11/17/2006] [Accepted: 12/06/2006] [Indexed: 10/23/2022]

Navarro P, Page DR, Avner P, Rougeulle C. Tsix-mediated epigenetic switch of a CTCF-flanked region of the Xist promoter determines the Xist transcription program. Genes Dev 2006;20:2787-92. [PMID: 17043308 PMCID: PMC1619945 DOI: 10.1101/gad.389006] [Citation(s) in RCA: 107] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Michalak P. RNA world - the dark matter of evolutionary genomics. J Evol Biol 2006;19:1768-74. [PMID: 17040373 DOI: 10.1111/j.1420-9101.2006.01141.x] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]

Nordström KJV, Mirza MAI, Larsson TP, Gloriam DEI, Fredriksson R, Schiöth HB. Comprehensive comparisons of the current human, mouse, and rat RefSeq, Ensembl, EST, and FANTOM3 datasets: Identification of new human genes with specific tissue expression profile. Biochem Biophys Res Commun 2006;348:1063-74. [PMID: 16904064 DOI: 10.1016/j.bbrc.2006.07.153] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2006] [Accepted: 07/25/2006] [Indexed: 10/24/2022]

Hamada M, Tsuda K, Kudo T, Kin T, Asai K. Mining frequent stem patterns from unaligned RNA sequences. Bioinformatics 2006;22:2480-7. [PMID: 16908501 DOI: 10.1093/bioinformatics/btl431] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Lin R, Maeda S, Liu C, Karin M, Edgington TS. A large noncoding RNA is a marker for murine hepatocellular carcinomas and a spectrum of human carcinomas. Oncogene 2006;26:851-8. [PMID: 16878148 DOI: 10.1038/sj.onc.1209846] [Citation(s) in RCA: 432] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]

Szymanski M, Barciszewski J. RNA regulation in mammals. Ann N Y Acad Sci 2006;1067:461-8. [PMID: 16804027 DOI: 10.1196/annals.1354.066] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]

Angeloni D, ter Elst A, Wei MH, van der Veen AY, Braga EA, Klimov EA, Timmer T, Korobeinikova L, Lerman MI, Buys CHCM. Analysis of a new homozygous deletion in the tumor suppressor region at 3p12.3 reveals two novel intronic noncoding RNA genes. Genes Chromosomes Cancer 2006;45:676-91. [PMID: 16607615 DOI: 10.1002/gcc.20332] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open

Bickel KS, Morris DR. Silencing the transcriptome's dark matter: mechanisms for suppressing translation of intergenic transcripts. Mol Cell 2006;22:309-16. [PMID: 16678103 DOI: 10.1016/j.molcel.2006.04.010] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]

Costain WJ, Rasquinha I, Graber T, Luebbert C, Preston E, Slinn J, Xie X, MacManus JP. Cerebral ischemia induces neuronal expression of novel VL30 mouse retrotransposons bound to polyribosomes. Brain Res 2006;1094:24-37. [PMID: 16730676 DOI: 10.1016/j.brainres.2006.03.120] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2005] [Revised: 03/14/2006] [Accepted: 03/23/2006] [Indexed: 01/27/2023]

Furuno M, Pang KC, Ninomiya N, Fukuda S, Frith MC, Bult C, Kai C, Kawai J, Carninci P, Hayashizaki Y, Mattick JS, Suzuki H. Clusters of internally primed transcripts reveal novel long noncoding RNAs. PLoS Genet 2006;2:e37. [PMID: 16683026 PMCID: PMC1449886 DOI: 10.1371/journal.pgen.0020037] [Citation(s) in RCA: 133] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2005] [Accepted: 02/01/2006] [Indexed: 02/07/2023] Open

Abstract

Non-protein-coding RNAs (ncRNAs) are increasingly being recognized as having important regulatory roles. Although much recent attention has focused on tiny 22- to 25-nucleotide microRNAs, several functional ncRNAs are orders of magnitude larger in size. Examples of such macro ncRNAs include Xist and Air, which in mouse are 18 and 108 kilobases (Kb), respectively. We surveyed the 102,801 FANTOM3 mouse cDNA clones and found that Air and Xist were present not as single, full-length transcripts but as a cluster of multiple, shorter cDNAs, which were unspliced, had little coding potential, and were most likely primed from internal adenine-rich regions within longer parental transcripts. We therefore conducted a genome-wide search for regional clusters of such cDNAs to find novel macro ncRNA candidates. Sixty-six regions were identified, each of which mapped outside known protein-coding loci and which had a mean length of 92 Kb. We detected several known long ncRNAs within these regions, supporting the basic rationale of our approach. In silico analysis showed that many regions had evidence of imprinting and/or antisense transcription. These regions were significantly associated with microRNAs and transcripts from the central nervous system. We selected eight novel regions for experimental validation by northern blot and RT-PCR and found that the majority represent previously unrecognized noncoding transcripts that are at least 10 Kb in size and predominantly localized in the nucleus. Taken together, the data not only identify multiple new ncRNAs but also suggest the existence of many more macro ncRNAs like Xist and Air.

The human genome has been sequenced, and, intriguingly, less than 2% specifies the information for the basic protein building blocks of our bodies. So, what does the other 98% do? It now appears that the mammalian genome also specifies the instructions for many previously undiscovered “non protein-coding RNA” (ncRNA) genes. However, what these ncRNAs do is largely unknown. In recent years, strategies have been designed that have successfully identified hundreds of short ncRNAs—termed microRNAs—many of which have since been shown to act as genetic regulators. Also known to be functionally important are a handful of ncRNAs orders of magnitude larger in size than microRNAs. The availability of complete genome and comprehensive transcript sequences allows for the systematic discovery of more large ncRNAs. The authors developed a computational strategy to screen the mouse genome and identify large ncRNAs. They detected existing large ncRNAs, thus validating their approach, but, more importantly, discovered more than 60 other candidates, some of which were subsequently confirmed experimentally. This work opens the door to a virtually unexplored world of large ncRNAs and beckons future experimental work to define the cellular functions of these molecules.

Collapse

Affiliation(s)

Masaaki Furuno Mouse Genome Informatics Consortium, The Jackson Laboratory, Bar Harbor, Maine, United States of America
Ken C Pang Australian Research Council Special Research Centre for Functional and Applied Genomics, Institute for Molecular Bioscience, University of Queensland, Brisbane, Australia T Cell laboratory, Ludwig Institute for Cancer Research, Austin Health, Heidelberg, Victoria, Australia
Noriko Ninomiya Genome Exploration Research Group (Genome Network Project Core Group), RIKEN Genomic Sciences Center, RIKEN Yokohama Institute, Yokohama, Japan
Shiro Fukuda Genome Exploration Research Group (Genome Network Project Core Group), RIKEN Genomic Sciences Center, RIKEN Yokohama Institute, Yokohama, Japan
Martin C Frith Australian Research Council Special Research Centre for Functional and Applied Genomics, Institute for Molecular Bioscience, University of Queensland, Brisbane, Australia Genome Exploration Research Group (Genome Network Project Core Group), RIKEN Genomic Sciences Center, RIKEN Yokohama Institute, Yokohama, Japan
Carol Bult Mouse Genome Informatics Consortium, The Jackson Laboratory, Bar Harbor, Maine, United States of America
Chikatoshi Kai Genome Exploration Research Group (Genome Network Project Core Group), RIKEN Genomic Sciences Center, RIKEN Yokohama Institute, Yokohama, Japan
Jun Kawai Genome Exploration Research Group (Genome Network Project Core Group), RIKEN Genomic Sciences Center, RIKEN Yokohama Institute, Yokohama, Japan Genome Science Laboratory, Discovery Research Institute, RIKEN Wako Institute, Wako, Japan
Piero Carninci Genome Exploration Research Group (Genome Network Project Core Group), RIKEN Genomic Sciences Center, RIKEN Yokohama Institute, Yokohama, Japan Genome Science Laboratory, Discovery Research Institute, RIKEN Wako Institute, Wako, Japan
Yoshihide Hayashizaki Genome Exploration Research Group (Genome Network Project Core Group), RIKEN Genomic Sciences Center, RIKEN Yokohama Institute, Yokohama, Japan Genome Science Laboratory, Discovery Research Institute, RIKEN Wako Institute, Wako, Japan
John S Mattick Australian Research Council Special Research Centre for Functional and Applied Genomics, Institute for Molecular Bioscience, University of Queensland, Brisbane, Australia
Harukazu Suzuki Genome Exploration Research Group (Genome Network Project Core Group), RIKEN Genomic Sciences Center, RIKEN Yokohama Institute, Yokohama, Japan * To whom correspondence should be addressed. E-mail:

Collapse

Inagaki S, Numata K, Kondo T, Tomita M, Yasuda K, Kanai A, Kageyama Y. Identification and expression analysis of putative mRNA-like non-coding RNA in Drosophila. Genes Cells 2006;10:1163-73. [PMID: 16324153 DOI: 10.1111/j.1365-2443.2005.00910.x] [Citation(s) in RCA: 70] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]

Mattick JS, Makunin IV. Non-coding RNA. Hum Mol Genet 2006;15 Spec No 1:R17-29. [PMID: 16651366 DOI: 10.1093/hmg/ddl046] [Citation(s) in RCA: 1701] [Impact Index Per Article: 94.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open

Ginger MR, Shore AN, Contreras A, Rijnkels M, Miller J, Gonzalez-Rimbau MF, Rosen JM. A noncoding RNA is a potential marker of cell fate during mammary gland development. Proc Natl Acad Sci U S A 2006;103:5781-6. [PMID: 16574773 PMCID: PMC1420634 DOI: 10.1073/pnas.0600745103] [Citation(s) in RCA: 146] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2005] [Indexed: 12/26/2022] Open

Hirsch J, Lefort V, Vankersschaver M, Boualem A, Lucas A, Thermes C, d'Aubenton-Carafa Y, Crespi M. Characterization of 43 non-protein-coding mRNA genes in Arabidopsis, including the MIR162a-derived transcripts. PLANT PHYSIOLOGY 2006;140:1192-204. [PMID: 16500993 PMCID: PMC1435803 DOI: 10.1104/pp.105.073817] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]

Eddy SR. Computational analysis of RNAs. COLD SPRING HARBOR SYMPOSIA ON QUANTITATIVE BIOLOGY 2006;71:117-28. [PMID: 17381287 DOI: 10.1101/sqb.2006.71.003] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/14/2023]

Ravasi T, Suzuki H, Pang KC, Katayama S, Furuno M, Okunishi R, Fukuda S, Ru K, Frith MC, Gongora MM, Grimmond SM, Hume DA, Hayashizaki Y, Mattick JS. Experimental validation of the regulated expression of large numbers of non-coding RNAs from the mouse genome. Genome Res 2005;16:11-9. [PMID: 16344565 PMCID: PMC1356124 DOI: 10.1101/gr.4200206] [Citation(s) in RCA: 394] [Impact Index Per Article: 20.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]

Lipovich L, King MC. Abundant novel transcriptional units and unconventional gene pairs on human chromosome 22. Genome Res 2005;16:45-54. [PMID: 16344557 PMCID: PMC1356128 DOI: 10.1101/gr.3883606] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]

Arvestad L, Visa N, Lundeberg J, Wieslander L, Savolainen P. Expressed sequence tags from the midgut and an epithelial cell line of Chironomus tentans: annotation, bioinformatic classification of unknown transcripts and analysis of expression levels. INSECT MOLECULAR BIOLOGY 2005;14:689-95. [PMID: 16313569 DOI: 10.1111/j.1365-2583.2005.00600.x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]

Savolainen P, Fitzsimmons C, Arvestad L, Andersson L, Lundeberg J. ESTs from brain and testis of White Leghorn and red junglefowl: annotation, bioinformatic classification of unknown transcripts and analysis of expression levels. Cytogenet Genome Res 2005;111:79-87. [PMID: 16093725 DOI: 10.1159/000085674] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2004] [Accepted: 11/30/2004] [Indexed: 11/19/2022] Open

Abstract

We report the generation, assembly and annotation of expressed sequence tags (ESTs) from four chicken cDNA libraries, constructed from brain and testis tissue dissected from red junglefowl and White Leghorn. 21,285 5'-end ESTs were generated and assembled into 2,813 contigs and 9,737 singletons, giving 12,549 tentative unique transcripts. The transcripts were annotated using BLAST by matching to known chicken genes or to putative homologues in other species using the major gene/protein databases. The results for these similarity searches are available on www.sbc.su.se/~arve/chicken. 4,129 (32.9%) of the transcripts remained without a significant match to gene/protein databases, a proportion of unmatched transcripts similar to earlier non-mammalian EST studies. To estimate how many of these transcripts may represent novel genes, they were studied for the presence of coding sequence. It was shown that most of the unique chicken transcripts do not contain coding parts of genes, but it was estimated that at least 400 of the transcripts contain coding sequence, indicating that 3.2% of avian genes belong to previously unknown gene families. Further BLAST search against dbEST left 1,649 (13.1%) of the transcripts unmatched to any library. The number of completely unmatched transcripts containing coding sequence was estimated at 180, giving a measure of the number of putative novel chicken genes identified in this study. 84.3% of the identified transcripts were found only in testis tissue, which has been poorly studied in earlier chicken EST studies. Large differences in expression levels were found between the brain and testis libraries for a large number of transcripts, and among the 525 most frequently represented transcripts, there were at least 20 transcripts with significant difference in expression levels between red junglefowl and White Leghorn.

Collapse

Pang KC, Frith MC, Mattick JS. Rapid evolution of noncoding RNAs: lack of conservation does not mean lack of function. Trends Genet 2005;22:1-5. [PMID: 16290135 DOI: 10.1016/j.tig.2005.10.003] [Citation(s) in RCA: 481] [Impact Index Per Article: 25.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2005] [Revised: 09/02/2005] [Accepted: 10/14/2005] [Indexed: 01/05/2023]

Szymanski M, Barciszewska MZ, Erdmann VA, Barciszewski J. A new frontier for molecular medicine: noncoding RNAs. Biochim Biophys Acta Rev Cancer 2005;1756:65-75. [PMID: 16125325 DOI: 10.1016/j.bbcan.2005.07.005] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2005] [Revised: 07/27/2005] [Accepted: 07/28/2005] [Indexed: 02/06/2023]

Kawano M, Storz G, Rao BS, Rosner JL, Martin RG. Detection of low-level promoter activity within open reading frame sequences of Escherichia coli. Nucleic Acids Res 2005;33:6268-76. [PMID: 16260475 PMCID: PMC1275588 DOI: 10.1093/nar/gki928] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Brosius J. Echoes from the past--are we still in an RNP world? Cytogenet Genome Res 2005;110:8-24. [PMID: 16093654 DOI: 10.1159/000084934] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2004] [Accepted: 05/04/2004] [Indexed: 11/19/2022] Open

Laserson U, Gan HH, Schlick T. Predicting candidate genomic sequences that correspond to synthetic functional RNA motifs. Nucleic Acids Res 2005;33:6057-69. [PMID: 16254081 PMCID: PMC1270951 DOI: 10.1093/nar/gki911] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Willingham AT, Orth AP, Batalov S, Peters EC, Wen BG, Aza-Blanc P, Hogenesch JB, Schultz PG. A strategy for probing the function of noncoding RNAs finds a repressor of NFAT. Science 2005;309:1570-3. [PMID: 16141075 DOI: 10.1126/science.1115901] [Citation(s) in RCA: 592] [Impact Index Per Article: 31.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]

Mattick JS. The functional genomics of noncoding RNA. Science 2005;309:1527-8. [PMID: 16141063 DOI: 10.1126/science.1117806] [Citation(s) in RCA: 223] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]

Costa FF. Non-coding RNAs: New players in eukaryotic biology. Gene 2005;357:83-94. [PMID: 16111837 DOI: 10.1016/j.gene.2005.06.019] [Citation(s) in RCA: 253] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2005] [Revised: 04/28/2005] [Accepted: 06/02/2005] [Indexed: 11/21/2022]

Babak T, Blencowe BJ, Hughes TR. A systematic search for new mammalian noncoding RNAs indicates little conserved intergenic transcription. BMC Genomics 2005;6:104. [PMID: 16083503 PMCID: PMC1199595 DOI: 10.1186/1471-2164-6-104] [Citation(s) in RCA: 65] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2005] [Accepted: 08/05/2005] [Indexed: 11/10/2022] Open

Abstract

Background

Systematic identification and functional characterization of novel types of noncoding (nc)RNA in genomes is more difficult than it is for protein coding mRNAs, since ncRNAs typically do not possess sequence features such as splicing or translation signals, or long open reading frames. Recent "tiling" microarray studies have reported that a surprisingly larger proportion of mammalian genomes is transcribed than was previously anticipated. However, these non-genic transcripts often appear to be low in abundance, and their functional significance is not known.

Results

To systematically search for functional ncRNAs, we designed microarrays to detect 3,478 intergenic and intronic sequences that are conserved between the human, mouse, and rat genomes, and that score highly by other criteria that characterize ncRNAs. We probed these arrays with total RNA isolated from 16 wild-type mouse tissues. Among 55 candidates for highly-expressed novel ncRNAs tested by northern blotting, eight were confirmed as small, highly-and ubiquitously-expressed RNAs in mouse. Of the eight, five were also detected in rat tissues, but none were detected at appreciable levels in human tissues or cultured cells.

Conclusion

Since the sequence and expression of most known coding transcripts and functional ncRNAs is conserved between human and mouse, the lack of northern-detectable expression in human cells and tissues of the novel mouse and rat ncRNAs that we identified suggests that they are not functional or possibly have rodent-specific functions. Our results confirm that relatively little of the intergenic sequence conserved between human, mouse and rat is transcribed at high levels in mammalian tissues, possibly suggesting a limited role for transcribed intergenic and intronic sequences as independent functional elements.

Collapse

Shearstone JR, Wang YE, Clement A, Allaire NE, Yang C, Worley DS, Carulli JP, Perrin S. Application of functional genomic technologies in a mouse model of retinal degeneration. Genomics 2005;85:309-21. [PMID: 15718098 DOI: 10.1016/j.ygeno.2004.11.001] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2004] [Accepted: 11/01/2004] [Indexed: 02/03/2023]

Hüttenhofer A, Schattner P, Polacek N. Non-coding RNAs: hope or hype? Trends Genet 2005;21:289-97. [PMID: 15851066 DOI: 10.1016/j.tig.2005.03.007] [Citation(s) in RCA: 288] [Impact Index Per Article: 15.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]

Brosius J. Waste not, want not – transcript excess in multicellular eukaryotes. Trends Genet 2005;21:287-8. [PMID: 15851065 DOI: 10.1016/j.tig.2005.02.014] [Citation(s) in RCA: 101] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Tupy JL, Bailey AM, Dailey G, Evans-Holm M, Siebel CW, Misra S, Celniker SE, Rubin GM. Identification of putative noncoding polyadenylated transcripts in Drosophila melanogaster. Proc Natl Acad Sci U S A 2005;102:5495-500. [PMID: 15809421 PMCID: PMC555963 DOI: 10.1073/pnas.0501422102] [Citation(s) in RCA: 107] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Hubbard SJ, Grafham DV, Beattie KJ, Overton IM, McLaren SR, Croning MDR, Boardman PE, Bonfield JK, Burnside J, Davies RM, Farrell ER, Francis MD, Griffiths-Jones S, Humphray SJ, Hyland C, Scott CE, Tang H, Taylor RG, Tickle C, Brown WRA, Birney E, Rogers J, Wilson SA. Transcriptome analysis for the chicken based on 19,626 finished cDNA sequences and 485,337 expressed sequence tags. Genome Res 2005;15:174-83. [PMID: 15590942 PMCID: PMC540287 DOI: 10.1101/gr.3011405] [Citation(s) in RCA: 74] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2004] [Accepted: 10/04/2004] [Indexed: 12/22/2022]

Integration with the human genome of peptide sequences obtained by high-throughput mass spectrometry. Genome Biol 2004;6:R9. [PMID: 15642101 PMCID: PMC549070 DOI: 10.1186/gb-2004-6-1-r9] [Citation(s) in RCA: 228] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2004] [Revised: 10/21/2004] [Accepted: 11/17/2004] [Indexed: 11/21/2022] Open

Kasukawa T, Katayama S, Kawaji H, Suzuki H, Hume DA, Hayashizaki Y. Construction of representative transcript and protein sets of human, mouse, and rat as a platform for their transcriptome and proteome analysis. Genomics 2004;84:913-21. [PMID: 15533708 DOI: 10.1016/j.ygeno.2004.08.011] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2004] [Accepted: 08/16/2004] [Indexed: 10/26/2022]

100

Neutral evolution of ‘non-coding’ complementary DNAs. Nature 2004. [DOI: 10.1038/nature03016] [Citation(s) in RCA: 100] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]