1
|
Chen X, Wang T, Guo W, Yan X, Kou H, Yu Y, Liu C, Gao W, Wang W, Wang R. Transcriptome reveals the roles and potential mechanisms of lncRNAs in the regulation of albendazole resistance in Haemonchus contortus. BMC Genomics 2024; 25:188. [PMID: 38368335 PMCID: PMC10873934 DOI: 10.1186/s12864-024-10096-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Accepted: 02/07/2024] [Indexed: 02/19/2024] Open
Abstract
BACKGROUND Haemonchus contortus (H. contortus) is the most common parasitic nematode in ruminants and is prevalent worldwide. H. contortus resistance to albendazole (ABZ) hinders the efficacy of anthelmintic drugs, but little is known about the molecular mechanisms that regulate this of drug resistance. Recent research has demonstrated that long noncoding RNAs (lncRNAs) can exert significant influence as pivotal regulators of the emergence of drug resistance. RESULTS In this study, transcriptome sequencing was conducted on both albendazole-sensitive (ABZ-sensitive) and albendazole-resistant (ABZ-resistant) H. contortus strains, with three biological replicates for each group. The analysis of lncRNA in the transcriptomic data revealed that there were 276 differentially expressed lncRNA (DElncRNA) between strains with ABZ-sensitive and ABZ-resistant according to the criteria of |log2Foldchange|≥ 1 and FDR < 0.05. Notably, MSTRG.12969.2 and MSTRG.9827.1 exhibited the most significant upregulation and downregulation, respectively, in the resistant strains. The potential roles of the DElncRNAs included catalytic activity, stimulus response, regulation of drug metabolism, and modulation of the immune response. Moreover, we investigated the interactions between DElncRNAs and other RNAs, specifically MSTRG.12741.1, MSTRG.11848.1, MSTRG.5895.1, and MSTRG.14070.1, involved in regulating drug stimulation through cis/trans/antisense/lncRNA‒miRNA-mRNA interaction networks. This regulation leads to a decrease (or increase) in the expression of relevant genes, consequently enhancing the resistance of H. contortus to albendazole. Furthermore, through comprehensive analysis of competitive endogenous RNAs (ceRNAs) involved in drug resistance-related pathways, such as the mTOR signalling pathway and ABC transporter signalling pathway, the relevance of the MSTRG.2499.1-novel-m0062-3p-HCON_00099610 interaction was identified to mainly involve the regulation of catalytic activity, metabolism, ubiquitination and transcriptional regulation of gene promoters. Additionally, quantitative real-time polymerase chain reaction (qRT-PCR) validation indicated that the transcription profiles of six DElncRNAs and six DEmRNAs were consistent with those obtained by RNA-seq. CONCLUSIONS The results of the present study allowed us to better understand the changes in the lncRNA expression profile of ABZ-resistant H. contortus. In total, these results suggest that the lncRNAs MSTRG.963.1, MSTRG.12741.1, MSTRG.11848.1 and MSTRG.2499.1 play important roles in the development of ABZ resistance and can serve as promising biomarkers for further study.
Collapse
Affiliation(s)
- Xindi Chen
- Key Laboratory of Animal Disease Clinical Diagnosis and Treatment Technology, College of Veterinary Medicine, Inner Mongolia Agricultural University, Ordos Street, Hohhot, 010018, Inner Mongolia Municipality, China
| | - Tengyu Wang
- Key Laboratory of Animal Disease Clinical Diagnosis and Treatment Technology, College of Veterinary Medicine, Inner Mongolia Agricultural University, Ordos Street, Hohhot, 010018, Inner Mongolia Municipality, China
| | - Wenrui Guo
- Key Laboratory of Animal Disease Clinical Diagnosis and Treatment Technology, College of Veterinary Medicine, Inner Mongolia Agricultural University, Ordos Street, Hohhot, 010018, Inner Mongolia Municipality, China
| | - Xu Yan
- Key Laboratory of Animal Disease Clinical Diagnosis and Treatment Technology, College of Veterinary Medicine, Inner Mongolia Agricultural University, Ordos Street, Hohhot, 010018, Inner Mongolia Municipality, China
| | - Huilin Kou
- Key Laboratory of Animal Disease Clinical Diagnosis and Treatment Technology, College of Veterinary Medicine, Inner Mongolia Agricultural University, Ordos Street, Hohhot, 010018, Inner Mongolia Municipality, China
| | - Yu Yu
- Key Laboratory of Animal Disease Clinical Diagnosis and Treatment Technology, College of Veterinary Medicine, Inner Mongolia Agricultural University, Ordos Street, Hohhot, 010018, Inner Mongolia Municipality, China
| | - Chunxia Liu
- Key Laboratory of Animal Disease Clinical Diagnosis and Treatment Technology, College of Life Science, Inner Mongolia Agricultural University, Hohhot, 010018, Inner Mongolia Municipality, China
| | - Wa Gao
- Inner Mongolia Key Laboratory of Tick-Borne Zoonotic Infectious Disease, Department of Medicine, Hetao College, Bayan Nur, 015000, Inner Mongolia Autonomous Region, China
| | - Wenlong Wang
- Key Laboratory of Animal Disease Clinical Diagnosis and Treatment Technology, College of Veterinary Medicine, Inner Mongolia Agricultural University, Ordos Street, Hohhot, 010018, Inner Mongolia Municipality, China.
| | - Rui Wang
- Key Laboratory of Animal Disease Clinical Diagnosis and Treatment Technology, College of Veterinary Medicine, Inner Mongolia Agricultural University, Ordos Street, Hohhot, 010018, Inner Mongolia Municipality, China.
| |
Collapse
|
2
|
Sanbonmatsu K. Towards Molecular Mechanism in Long Non-coding RNAs: Linking Structure and Function. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2022; 1363:23-32. [DOI: 10.1007/978-3-030-92034-0_3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
3
|
Rostami Azar A, Maroufi A. Identification of Long Non-coding RNA Transcripts in Glycyrrhiza uralensis. IRANIAN JOURNAL OF BIOTECHNOLOGY 2022; 20:e2607. [PMID: 35891954 PMCID: PMC9284242 DOI: 10.30498/ijb.2021.205469.2607] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Background: Chinese liquorice (Glycyrrhiza uralensis), an important medicinal plant, contains various valuable secondary metabolites. Secondary metabolites biosynthesis is
very tightly regulated; therefore, elucidation and manipulation of the biosynthetic pathways are of great interest. Recent studies have shown that lncRNAs play important
regulatory roles in many biological processes, thus identification and modification of their expression is essential to metabolic pathways for biosynthesis of secondary metabolites. Objectives: In this study we attempted to identify non-coding RNA transcripts (lncRNAs) that may act as important regulators of diverse biological processes, including stress responses
and developmental programs in Glycyrrhiza uralensis. Materials and Methods: Identification of potential lncRNAs in Chinese liquorice was performed using a bioinformatics pipeline from the available EST dataset of G. uralensis. Results: Bioinformatics analysis revealed that 1365 identical sequences in the range of 200 to 1286 base pair are putative lncRNAs. Only less than one percent of the
predicted lncRNAs display sequence conservation with lncRNAs from other species. Moreover, 13 lncRNAs were detected as the potential precursors of 16 miRNAs.
From this analysis, we also detected possible target genes of 16 known miRNA genes. The majority of the predicted miRNA target genes have important role in response
to plant disease and a couple of them contribute to signalling and metabolic pathways. Conclusion: This study demonstrates the existence of lncRNAs in G. uralensis which has not been found before and provides valuable resources for further understanding and characterizing
of lncRNAs and also a basis for additional investigation to reveal specific roles of lncRNAs in various biological processes and particularly in response to plant diseases.
Collapse
Affiliation(s)
- Arash Rostami Azar
- Department of Plant Production and Genetics, University of Kurdistan, Sanandaj, Iran
| | - Asad Maroufi
- Department of Plant Production and Genetics, University of Kurdistan, Sanandaj, Iran.,Research Center for Medicinal Plant Breeding and Development, University of Kurdistan, Sanandaj, Iran
| |
Collapse
|
4
|
Sanbonmatsu K. Getting to the bottom of lncRNA mechanism: structure-function relationships. Mamm Genome 2021; 33:343-353. [PMID: 34642784 PMCID: PMC8509902 DOI: 10.1007/s00335-021-09924-x] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2021] [Accepted: 09/28/2021] [Indexed: 12/14/2022]
Abstract
While long non-coding RNAs are known to play key roles in disease and development, relatively few structural studies have been performed for this important class of RNAs. Here, we review functional studies of long non-coding RNAs and expose the need for high-resolution 3-D structural studies, discussing the roles of long non-coding RNAs in the cell and how structure–function relationships might be used to elucidate further understanding. We then describe structural studies of other classes of RNAs using chemical probing, nuclear magnetic resonance, small-angle X-ray scattering, X-ray crystallography, and cryogenic electron microscopy (cryo-EM). Next, we review early structural studies of long non-coding RNAs to date and describe the way forward for the structural biology of long non-coding RNAs in terms of cryo-EM.
Collapse
|
5
|
Sergiev PV, Rubtsova MP. Little but Loud. The Diversity of Functions of Small Proteins and Peptides - Translational Products of Short Reading Frames. BIOCHEMISTRY (MOSCOW) 2021; 86:1139-1150. [PMID: 34565317 DOI: 10.1134/s0006297921090091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Cell functioning is tightly regulated process. For many years, research in the fields of proteomics and functional genomics has been focused on the role of proteins in cell functioning. The advances in science have led to the uncovering that short open reading frames, previously considered non-functional, serve a variety of functions. Short reading frames in polycistronic mRNAs often regulate their stability and translational efficiency of the main reading frame. The improvement of proteomic analysis methods has made it possible to identify the products of translation of short open reading frames in quantities that suggest the existence of functional role of those peptides and short proteins. Studies demonstrating their role unravel a new level of the regulation of cell functioning and its adaptation to changing conditions. This review is devoted to the analysis of functions of recently discovered peptides and short proteins.
Collapse
Affiliation(s)
- Petr V Sergiev
- Faculty of Chemistry, Lomonosov Moscow State University, Moscow, 119991, Russia. .,Skoltech Center of Life Sciences, Skolkovo Institute of Science and Technology, Skolkovo, 143025, Russia.,Institute of Functional Genomics, Lomonosov Moscow State University, Moscow, 119991, Russia
| | - Maria P Rubtsova
- Faculty of Chemistry, Lomonosov Moscow State University, Moscow, 119991, Russia.
| |
Collapse
|
6
|
Thepsuwan T, Rungrassamee W, Sangket U, Whankaew S, Sathapondecha P. Long non-coding RNA profile in banana shrimp, Fenneropenaeus merguiensis and the potential role of lncPV13 in vitellogenesis. Comp Biochem Physiol A Mol Integr Physiol 2021; 261:111045. [PMID: 34358684 DOI: 10.1016/j.cbpa.2021.111045] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2021] [Revised: 07/30/2021] [Accepted: 07/30/2021] [Indexed: 01/04/2023]
Abstract
The long non-coding RNAs (lncRNAs) have been known to play important roles in several biological processes as well as in reproduction. This study aimed to identify lncRNA in ovary female banana shrimp, Fenneropenaeus merguiensis, and investigate the potential role of lncPV13 in the vitellogenesis. After the in silico identification of the ovarian transcriptome, a total of 24,733 putative lncRNAs were obtained, and only 147 putative lncRNAs were significantly differentially expressed among the ovarian development stages. To validate the in silico identification of lncRNAs, the 16 lncRNAs with the highest differential expression in the transcriptome analysis were evaluated by RT-qPCR. The 6 lncRNAs showed higher expression levels in the mature stage than in the previtellogenic stage and were found in several tissues such as in eyestalks, brains, thoracic ganglia, gills, and muscle. Furthermore, most candidate lncRNAs were amplifiable in Litopenaeus vannamei's and Penaeus monodon's DNA but not in Macrobrachium rosenbergii's DNA, suggesting some lncRNAs are expressed in a species-specific manner among penaeid shrimp. In this study, the lncPV13 was investigated for its vitellogenin regulating function by RNA interference. The result indicates that the lncPV13 expression was suppressed in the ovary on day 7 after the injection of double-stranded RNA specific to lncPV13 (dslncPV13), while vitellogenin (Vg) expression was significantly decreased. In contrast, the gonad inhibiting hormone (GIH) expression was significantly increased in the lncPV13 knockdown shrimp. However, the oocyte proliferation was not significantly different between control and lncPV13 knockdown shrimp. This suggests that lncPV13 regulate Vg synthesis through GIH inhibition. Finally, our findings provide lncRNA information and potential lncRNAs involved in the vitellogenesis of female banana shrimp.
Collapse
Affiliation(s)
- Timpika Thepsuwan
- Center for Genomics and Bioinformatics Research, Division of Biological Science, Faculty of Science, Prince of Songkla University, Hat Yai, Songkhla 90112, Thailand
| | - Wanilada Rungrassamee
- National Center for Genetic Engineering and Biotechnology, 113 Thailand Science Park, Phahonyothin Rd., Khlong Luang, Pathum Thani 12120, Thailand
| | - Unitsa Sangket
- Center for Genomics and Bioinformatics Research, Division of Biological Science, Faculty of Science, Prince of Songkla University, Hat Yai, Songkhla 90112, Thailand
| | - Sukhuman Whankaew
- Department of Plant Science, Faculty of Technology and Community Development, Thaksin University, Phatthalung Campus, Phatthalung 93210, Thailand
| | - Ponsit Sathapondecha
- Center for Genomics and Bioinformatics Research, Division of Biological Science, Faculty of Science, Prince of Songkla University, Hat Yai, Songkhla 90112, Thailand.
| |
Collapse
|
7
|
Liu XF, Ding XB, Li X, Jin CF, Yue YW, Li GP, Guo H. An atlas and analysis of bovine skeletal muscle long noncoding RNAs. Anim Genet 2017; 48:278-286. [DOI: 10.1111/age.12539] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/21/2016] [Indexed: 12/29/2022]
Affiliation(s)
- X. F. Liu
- College of Animal Science and Veterinary Medicine; Tianjin Agricultural University; Tianjin 300384 China
- The Key Laboratory of Mammalian Reproductive Biology and Biotechnology of the Ministry of Education; Inner Mongolia University; Hohhot 010071 China
| | - X. B. Ding
- College of Animal Science and Veterinary Medicine; Tianjin Agricultural University; Tianjin 300384 China
| | - X. Li
- College of Animal Science and Veterinary Medicine; Tianjin Agricultural University; Tianjin 300384 China
| | - C. F. Jin
- College of Animal Science and Veterinary Medicine; Tianjin Agricultural University; Tianjin 300384 China
| | - Y. W. Yue
- College of Animal Science and Veterinary Medicine; Tianjin Agricultural University; Tianjin 300384 China
| | - G. P. Li
- The Key Laboratory of Mammalian Reproductive Biology and Biotechnology of the Ministry of Education; Inner Mongolia University; Hohhot 010071 China
| | - H. Guo
- College of Animal Science and Veterinary Medicine; Tianjin Agricultural University; Tianjin 300384 China
| |
Collapse
|
8
|
A Review of Computational Methods for Finding Non-Coding RNA Genes. Genes (Basel) 2016; 7:genes7120113. [PMID: 27918472 PMCID: PMC5192489 DOI: 10.3390/genes7120113] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2016] [Revised: 11/04/2016] [Accepted: 11/17/2016] [Indexed: 12/19/2022] Open
Abstract
Finding non-coding RNA (ncRNA) genes has emerged over the past few years as a cutting-edge trend in bioinformatics. There are numerous computational intelligence (CI) challenges in the annotation and interpretation of ncRNAs because it requires a domain-related expert knowledge in CI techniques. Moreover, there are many classes predicted yet not experimentally verified by researchers. Recently, researchers have applied many CI methods to predict the classes of ncRNAs. However, the diverse CI approaches lack a definitive classification framework to take advantage of past studies. A few review papers have attempted to summarize CI approaches, but focused on the particular methodological viewpoints. Accordingly, in this article, we summarize in greater detail than previously available, the CI techniques for finding ncRNAs genes. We differentiate from the existing bodies of research and discuss concisely the technical merits of various techniques. Lastly, we review the limitations of ncRNA gene-finding CI methods with a point-of-view towards the development of new computational tools.
Collapse
|
9
|
Kumar M, DeVaux R, Herschkowitz J. Molecular and Cellular Changes in Breast Cancer and New Roles of lncRNAs in Breast Cancer Initiation and Progression. PROGRESS IN MOLECULAR BIOLOGY AND TRANSLATIONAL SCIENCE 2016; 144:563-586. [DOI: 10.1016/bs.pmbts.2016.09.011] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
|
10
|
Towards structural classification of long non-coding RNAs. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2015; 1859:41-5. [PMID: 26537437 DOI: 10.1016/j.bbagrm.2015.09.011] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/08/2015] [Revised: 09/10/2015] [Accepted: 09/28/2015] [Indexed: 01/16/2023]
Abstract
While long non-coding RNAs play key roles in disease and development, few structural studies have been performed to date for this emerging class of RNAs. Previous structural studies are reviewed, and a pipeline is presented to determine secondary structures of long non-coding RNAs. Similar to riboswitches, experimentally determined secondary structures of long non-coding RNAs for one species, may be used to improve sequence/structure alignments for other species. As riboswitches have been classified according to their secondary structure, a similar scheme could be used to classify long non-coding RNAs. This article is part of a Special Issue titled: Clues to long noncoding RNA taxonomy1, edited by Dr. Tetsuro Hirose and Dr. Shinichi Nakagawa.
Collapse
|
11
|
de Hoon M, Shin JW, Carninci P. Paradigm shifts in genomics through the FANTOM projects. Mamm Genome 2015; 26:391-402. [PMID: 26253466 PMCID: PMC4602071 DOI: 10.1007/s00335-015-9593-8] [Citation(s) in RCA: 75] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2015] [Accepted: 07/08/2015] [Indexed: 12/18/2022]
Abstract
Big leaps in science happen when scientists from different backgrounds interact. In the past 15 years, the FANTOM Consortium has brought together scientists from different fields to analyze and interpret genomic data produced with novel technologies, including mouse full-length cDNAs and, more recently, expression profiling at single-nucleotide resolution by cap-analysis gene expression. The FANTOM Consortium has provided the most comprehensive mouse cDNA collection for functional studies and extensive maps of the human and mouse transcriptome comprising promoters, enhancers, as well as the network of their regulatory interactions. More importantly, serendipitous observations of the FANTOM dataset led us to realize that the mammalian genome is pervasively transcribed, even from retrotransposon elements, which were previously considered junk DNA. The majority of products from the mammalian genome are long non-coding RNAs (lncRNAs), including sense-antisense, intergenic, and enhancer RNAs. While the biological function has been elucidated for some lncRNAs, more than 98 % of them remain without a known function. We argue that large-scale studies are urgently needed to address the functional role of lncRNAs.
Collapse
Affiliation(s)
- Michiel de Hoon
- Division of Genomic Technologies, RIKEN Center for Life Science Technologies, Yokohama, 230-0045, Japan.
| | - Jay W Shin
- Division of Genomic Technologies, RIKEN Center for Life Science Technologies, Yokohama, 230-0045, Japan.
| | - Piero Carninci
- Division of Genomic Technologies, RIKEN Center for Life Science Technologies, Yokohama, 230-0045, Japan.
| |
Collapse
|
12
|
Shahryari A, Jazi MS, Samaei NM, Mowla SJ. Long non-coding RNA SOX2OT: expression signature, splicing patterns, and emerging roles in pluripotency and tumorigenesis. Front Genet 2015; 6:196. [PMID: 26136768 PMCID: PMC4469893 DOI: 10.3389/fgene.2015.00196] [Citation(s) in RCA: 80] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2015] [Accepted: 05/18/2015] [Indexed: 12/18/2022] Open
Abstract
SOX2 overlapping transcript (SOX2OT) is a long non-coding RNA which harbors one of the major regulators of pluripotency, SOX2 gene, in its intronic region. SOX2OT gene is mapped to human chromosome 3q26.3 (Chr3q26.3) locus and is extended in a high conserved region of over 700 kb. Little is known about the exact role of SOX2OT; however, recent studies have demonstrated a positive role for it in transcription regulation of SOX2 gene. Similar to SOX2, SOX2OT is highly expressed in embryonic stem cells and down-regulated upon the induction of differentiation. SOX2OT is dynamically regulated during the embryogenesis of vertebrates, and delimited to the brain in adult mice and human. Recently, the disregulation of SOX2OT expression and its concomitant expression with SOX2 have become highlighted in some somatic cancers including esophageal squamous cell carcinoma, lung squamous cell carcinoma, and breast cancer. Interestingly, SOX2OT is differentially spliced into multiple mRNA-like transcripts in stem and cancer cells. In this review, we are describing the structural and functional features of SOX2OT, with an emphasis on its expression signature, its splicing patterns and its critical function in the regulation of SOX2 expression during development and tumorigenesis.
Collapse
Affiliation(s)
- Alireza Shahryari
- Stem Cell Research Center, Golestan University of Medical Sciences , Gorgan, Iran
| | - Marie Saghaeian Jazi
- Department of Molecular Medicine, Faculty of Advanced Medical Technologies, Golestan University of Medical Sciences , Gorgan, Iran
| | - Nader M Samaei
- Department of Medical Genetics, Faculty of Advanced Medical Technologies, Golestan University of Medical Sciences , Gorgan, Iran
| | - Seyed J Mowla
- Department of Molecular Genetics, Faculty of Biological Sciences, Tarbiat Modares University , Tehran, Iran
| |
Collapse
|
13
|
Liu X, Hao L, Li D, Zhu L, Hu S. Long non-coding RNAs and their biological roles in plants. GENOMICS PROTEOMICS & BIOINFORMATICS 2015; 13:137-47. [PMID: 25936895 PMCID: PMC4563214 DOI: 10.1016/j.gpb.2015.02.003] [Citation(s) in RCA: 147] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/31/2014] [Revised: 02/06/2015] [Accepted: 02/09/2015] [Indexed: 12/31/2022]
Abstract
With the development of genomics and bioinformatics, especially the extensive applications of high-throughput sequencing technology, more transcriptional units with little or no protein-coding potential have been discovered. Such RNA molecules are called non-protein-coding RNAs (npcRNAs or ncRNAs). Among them, long npcRNAs or ncRNAs (lnpcRNAs or lncRNAs) represent diverse classes of transcripts longer than 200 nucleotides. In recent years, the lncRNAs have been considered as important regulators in many essential biological processes. In plants, although a large number of lncRNA transcripts have been predicted and identified in few species, our current knowledge of their biological functions is still limited. Here, we have summarized recent studies on their identification, characteristics, classification, bioinformatics, resources, and current exploration of their biological functions in plants.
Collapse
Affiliation(s)
- Xue Liu
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China; State Key Laboratory of Plant Genomics and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
| | - Lili Hao
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Dayong Li
- State Key Laboratory of Plant Genomics and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
| | - Lihuang Zhu
- State Key Laboratory of Plant Genomics and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China.
| | - Songnian Hu
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.
| |
Collapse
|
14
|
Wang J, Song YX, Wang ZN. Non-coding RNAs in gastric cancer. Gene 2015; 560:1-8. [PMID: 25659765 DOI: 10.1016/j.gene.2015.02.004] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2014] [Revised: 01/31/2015] [Accepted: 02/04/2015] [Indexed: 12/12/2022]
Abstract
Non-coding RNAs (ncRNAs) have recently become increasingly important in the study of cellular metabolism and regulation such as development, proliferation, differentiation and apoptosis. However, the functions of most ncRNAs have remained largely unknown. Recently, studies have begun to characterize the aberrant regulation of ncRNAs in gastric cancer (GC) cells and tissues. These ncRNAs have a close relationship with drug resistance, and with the occurrence, development, invasion and metastasis of tumors, so they could possibly become new therapeutic targets and treatment tools for GC in the future. The present review summarized current advances in our knowledge of the roles of ncRNAs in GC.
Collapse
Affiliation(s)
- Jun Wang
- Department of Surgical Oncology and General Surgery, First Hospital of China Medical University, 155 North Nanjing Street, Heping District, Shenyang City 110001, China.
| | - Yong-Xi Song
- Department of Surgical Oncology and General Surgery, First Hospital of China Medical University, 155 North Nanjing Street, Heping District, Shenyang City 110001, China.
| | - Zhen-Ning Wang
- Department of Surgical Oncology and General Surgery, First Hospital of China Medical University, 155 North Nanjing Street, Heping District, Shenyang City 110001, China.
| |
Collapse
|
15
|
Inoue H, Yoshimura J, Iwabuchi K. Gene expression of protein-coding and non-coding RNAs related to polyembryogenesis in the parasitic wasp, Copidosoma floridanum. PLoS One 2014; 9:e114372. [PMID: 25469914 PMCID: PMC4255003 DOI: 10.1371/journal.pone.0114372] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2014] [Accepted: 11/06/2014] [Indexed: 11/18/2022] Open
Abstract
Polyembryony is a unique form of development in which many embryos are clonally produced from a single egg. Polyembryony is known to occur in many animals, but the underlying genetic mechanism responsible is unknown. In a parasitic wasp, Copidosoma floridanum, polyembryogenesis is initiated during the formation and division of the morula. In the present study, cDNA libraries were constructed from embryos at the cleavage and subsequent primary morula stages, times when polyembryogenesis is likely to be controlled genetically. Of 182 and 263 cDNA clones isolated from these embryos, 38% and 70%, respectively, were very similar to protein-coding genes obtained from BLAST analysis and 55 and 65 clones, respectively, were stage-specific. In our libraries we also detected a high frequency of long non-coding RNA. Some of these showed stage-specific expression patterns in reverse transcription quantitative polymerase chain reaction (RT-qPCR) analysis. The stage-specificity of expression implies that these protein-coding and non-coding genes are related to polyembryogenesis in C. floridanum. The non-coding genes are not similar to any known non-coding RNAs and so are good candidates as regulators of polyembryogenesis.
Collapse
Affiliation(s)
- Hiroki Inoue
- Faculty of Agriculture, Tokyo University of Agriculture and Technology, Fuchu, Tokyo, Japan
| | - Jin Yoshimura
- Graduate School of Science and Technology, and Department of Mathematical and Systems Engineering, Shizuoka University, Hamamatsu, Shizuoka, Japan
- Department of Environmental and Forest Biology, State University of New York College of Environmental Science and Forestry, Syracuse, New York, United States of America
- Marine Biosystems Research Center, Chiba University, Kamogawa, Chiba, Japan
| | - Kikuo Iwabuchi
- Faculty of Agriculture, Tokyo University of Agriculture and Technology, Fuchu, Tokyo, Japan
- * E-mail:
| |
Collapse
|
16
|
Expression of a non-coding RNA in ectromelia virus is required for normal plaque formation. Virus Genes 2014; 48:38-47. [PMID: 24078045 DOI: 10.1007/s11262-013-0983-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2013] [Accepted: 09/14/2013] [Indexed: 01/26/2023]
Abstract
Poxviruses are dsDNA viruses with large genomes. Many genes in the genome remain uncharacterized, and recent studies have demonstrated that the poxvirus transcriptome includes numerous so-called anomalous transcripts not associated with open reading frames. Here, we characterize the expression and role of an apparently non-coding RNA in orthopoxviruses, which we call viral hairpin RNA (vhRNA). Using a bioinformatics approach, we predicted expression of a transcript not associated with an open reading frame that is likely to form a stem-loop structure due to the presence of a 21 nt palindromic sequence. Expression of the transcript as early as 2 h post-infection was confirmed by northern blot and analysis of publicly available vaccinia virus infected cell transcriptomes. The transcription start site was determined by RACE PCE and transcriptome analysis, and early and late promoter sequences were identified. Finally, to test the function of the transcript we generated an ectromelia virus knockout, which failed to form plaques in cell culture. The important role of the transcript in viral replication was further demonstrated using siRNA. Although the function of the transcript remains unknown, our work contributes to evidence of an increasingly complex poxvirus transcriptome, suggesting that transcripts such as vhRNA not associated with an annotated open reading frame can play an important role in viral replication.
Collapse
|
17
|
Meyer KD, Jaffrey SR. The dynamic epitranscriptome: N6-methyladenosine and gene expression control. Nat Rev Mol Cell Biol 2014; 15:313-26. [PMID: 24713629 DOI: 10.1038/nrm3785] [Citation(s) in RCA: 707] [Impact Index Per Article: 70.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
N(6)-methyladenosine (m(6)A) is a modified base that has long been known to be present in non-coding RNAs, ribosomal RNA, polyadenylated RNA and at least one mammalian mRNA. However, our understanding of the prevalence of this modification has been fundamentally redefined by transcriptome-wide m(6)A mapping studies, which have shown that m(6)A is present in a large subset of the transcriptome in specific regions of mRNA. This suggests that mRNA may undergo post-transcriptional methylation to regulate its fate and function, which is analogous to methyl modifications in DNA. Thus, the pattern of methylation constitutes an mRNA 'epitranscriptome'. The identification of adenosine methyltransferases ('writers'), m(6)A demethylating enzymes ('erasers') and m(6)A-binding proteins ('readers') is helping to define cellular pathways for the post-transcriptional regulation of mRNAs.
Collapse
Affiliation(s)
- Kate D Meyer
- Department of Pharmacology, Weill Cornell Medical College, Cornell University, New York City, New York 10065, USA
| | - Samie R Jaffrey
- Department of Pharmacology, Weill Cornell Medical College, Cornell University, New York City, New York 10065, USA
| |
Collapse
|
18
|
Ma X, Zhu Y, Li C, Xue P, Zhao Y, Chen S, Yang F, Miao L. Characterisation of Caenorhabditis elegans sperm transcriptome and proteome. BMC Genomics 2014; 15:168. [PMID: 24581041 PMCID: PMC4028957 DOI: 10.1186/1471-2164-15-168] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2013] [Accepted: 02/13/2014] [Indexed: 12/18/2022] Open
Abstract
BACKGROUND Although sperm is transcriptionally and translationally quiescent, complex populations of RNAs, including mRNAs and non-coding RNAs, exist in sperm. Previous microarray analysis of germ cell mutants identified hundreds of sperm genes in Caenorhabditis elegans. To take a more comprehensive view on C. elegans sperm genes, here, we isolate highly pure sperm cells and employ high-throughput technologies to obtain sperm transcriptome and proteome. RESULTS First, sperm transcriptome consists of considerable amounts of non-coding RNAs, many of which have not been annotated and may play functional roles during spermatogenesis. Second, apart from kinases/phosphatases as previously reported, ion binding proteins are also enriched in sperm, underlying the crucial roles of intracellular ions in post-translational regulation in sperm. Third, while the majority of sperm genes/proteins have low abundance, a small number of sperm genes/proteins are hugely enriched in sperm, implying that sperm only rely on a small set of proteins for post-translational regulation. Lastly, by extensive RNAi screening of sperm enriched genes, we identified a few genes that control fertility. Our further analysis reveals a tight correlation between sperm transcriptome and sperm small RNAome, suggesting that the endogenous siRNAs strongly repress sperm genes. This leads to an idea that the inefficient RNAi screening of sperm genes, a phenomenon currently with unknown causes, might result from the competition between the endogenous RNAi pathway and the exogenous RNAi pathway. CONCLUSIONS Together, the obtained sperm transcriptome and proteome serve as valuable resources to systematically study spermatogenesis in C. elegans.
Collapse
Affiliation(s)
- Xuan Ma
- Laboratory of Non-coding RNA, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Yingjie Zhu
- Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences, Beijing 100094, China
| | - Chunfang Li
- Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences, Beijing 100094, China
| | - Peng Xue
- Key Laboratory of Protein and Peptide Pharmaceuticals & Laboratory of Proteomics, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Yanmei Zhao
- Laboratory of Non-coding RNA, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Shilin Chen
- Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences, Beijing 100094, China
| | - Fuquan Yang
- Key Laboratory of Protein and Peptide Pharmaceuticals & Laboratory of Proteomics, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Long Miao
- Laboratory of Non-coding RNA, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| |
Collapse
|
19
|
Livyatan I, Harikumar A, Nissim-Rafinia M, Duttagupta R, Gingeras TR, Meshorer E. Non-polyadenylated transcription in embryonic stem cells reveals novel non-coding RNA related to pluripotency and differentiation. Nucleic Acids Res 2013; 41:6300-15. [PMID: 23630323 PMCID: PMC3695530 DOI: 10.1093/nar/gkt316] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The transcriptional landscape in embryonic stem cells (ESCs) and during ESC differentiation has received considerable attention, albeit mostly confined to the polyadenylated fraction of RNA, whereas the non-polyadenylated (NPA) fraction remained largely unexplored. Notwithstanding, the NPA RNA super-family has every potential to participate in the regulation of pluripotency and stem cell fate. We conducted a comprehensive analysis of NPA RNA in ESCs using a combination of whole-genome tiling arrays and deep sequencing technologies. In addition to identifying previously characterized and new non-coding RNA members, we describe a group of novel conserved RNAs (snacRNAs: small NPA conserved), some of which are differentially expressed between ESC and neuronal progenitor cells, providing the first evidence of a novel group of potentially functional NPA RNA involved in the regulation of pluripotency and stem cell fate. We further show that minor spliceosomal small nuclear RNAs, which are NPA, are almost completely absent in ESCs and are upregulated in differentiation. Finally, we show differential processing of the minor intron of the polycomb group gene Eed. Our data suggest that NPA RNA, both known and novel, play important roles in ESCs.
Collapse
|
20
|
Identification and comparative analysis of ncRNAs in human, mouse and zebrafish indicate a conserved role in regulation of genes expressed in brain. PLoS One 2012; 7:e52275. [PMID: 23284966 PMCID: PMC3527520 DOI: 10.1371/journal.pone.0052275] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2012] [Accepted: 11/12/2012] [Indexed: 12/20/2022] Open
Abstract
ncRNAs (non-coding RNAs), in particular long ncRNAs, represent a significant proportion of the vertebrate transcriptome and probably regulate many biological processes. We used publically available ESTs (Expressed Sequence Tags) from human, mouse and zebrafish and a previously published analysis pipeline to annotate and analyze the vertebrate non-protein-coding transcriptome. Comparative analysis confirmed some previously described features of intergenic ncRNAs, such as a positionally biased distribution with respect to regulatory or development related protein-coding genes, and weak but clear sequence conservation across species. Significantly, comparative analysis of developmental and regulatory genes proximate to long ncRNAs indicated that the only conserved relationship of these genes to neighbor long ncRNAs was with respect to genes expressed in human brain, suggesting a conserved, ncRNA cis-regulatory network in vertebrate nervous system development. Most of the relationships between long ncRNAs and proximate coding genes were not conserved, providing evidence for the rapid evolution of species-specific gene associated long ncRNAs. We have reconstructed and annotated over 130,000 long ncRNAs in these three species, providing a significantly expanded number of candidates for functional testing by the research community.
Collapse
|
21
|
Abstract
Thousands of long noncoding RNAs (lncRNAs) have been found in vertebrate animals, a few of which have known biological roles. To better understand the genomics and features of lncRNAs in invertebrates, we used available RNA-seq, poly(A)-site, and ribosome-mapping data to identify lncRNAs of Caenorhabditis elegans. We found 170 long intervening ncRNAs (lincRNAs), which had single- or multiexonic structures that did not overlap protein-coding transcripts, and about sixty antisense lncRNAs (ancRNAs), which were complementary to protein-coding transcripts. Compared to protein-coding genes, the lncRNA genes tended to be expressed in a stage-dependent manner. Approximately 25% of the newly identified lincRNAs showed little signal for sequence conservation and mapped antisense to clusters of endogenous siRNAs, as would be expected if they serve as templates and targets for these siRNAs. The other 75% tended to be more conserved and included lincRNAs with intriguing expression and sequence features associating them with processes such as dauer formation, male identity, sperm formation, and interaction with sperm-specific mRNAs. Our study provides a glimpse into the lncRNA content of a nonvertebrate animal and a resource for future studies of lncRNA function.
Collapse
Affiliation(s)
- Jin-Wu Nam
- Whitehead Institute for Biomedical Research, Cambridge, Massachusetts 02142, USA
| | | |
Collapse
|
22
|
Wang S, Zhang H, Wiltshire T, Sealock R, Faber JE. Genetic dissection of the Canq1 locus governing variation in extent of the collateral circulation. PLoS One 2012; 7:e31910. [PMID: 22412848 PMCID: PMC3295810 DOI: 10.1371/journal.pone.0031910] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2011] [Accepted: 01/15/2012] [Indexed: 11/18/2022] Open
Abstract
Background Native (pre-existing) collaterals are arteriole-to-arteriole anastomoses that interconnect adjacent arterial trees and serve as endogenous bypass vessels that limit tissue injury in ischemic stroke, myocardial infarction, coronary and peripheral artery disease. Their extent (number and diameter) varies widely among mouse strains and healthy humans. We previously identified a major quantitative trait locus on chromosome 7 (Canq1, LOD = 29) responsible for 37% of the heritable variation in collateral extent between C57BL/6 and BALB/c mice. We sought to identify candidate genes in Canq1 responsible for collateral variation in the cerebral pial circulation, a tissue whose strain-dependent variation is shared by similar variation in other tissues. Methods and Findings Collateral extent was intermediate in a recombinant inbred line that splits Canq1 between the C57BL/6 and BALB/c strains. Phenotyping and SNP-mapping of an expanded panel of twenty-one informative inbred strains narrowed the Canq1 locus, and genome-wide linkage analysis of a SWRxSJL-F2 cross confirmed its haplotype structure. Collateral extent, infarct volume after cerebral artery occlusion, bleeding time, and re-bleeding time did not differ in knockout mice for two vascular-related genes located in Canq1, IL4ra and Itgal. Transcript abundance of 6 out of 116 genes within the 95% confidence interval of Canq1 were differentially expressed >2-fold (p-value<0.05÷150) in the cortical pia mater from C57BL/6 and BALB/c embryos at E14.5, E16.5 and E18.5 time-points that span the period of collateral formation. Conclusions These findings refine the Canq1 locus and identify several genes as high-priority candidates important in specifying native collateral formation and its wide variation.
Collapse
Affiliation(s)
- Shiliang Wang
- Department of Physiology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- McAllister Heart Institute, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Hua Zhang
- Department of Physiology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- McAllister Heart Institute, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Tim Wiltshire
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Robert Sealock
- Department of Physiology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- McAllister Heart Institute, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - James E. Faber
- Department of Physiology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- McAllister Heart Institute, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- * E-mail:
| |
Collapse
|
23
|
Li T, Wang S, Wu R, Zhou X, Zhu D, Zhang Y. Identification of long non-protein coding RNAs in chicken skeletal muscle using next generation sequencing. Genomics 2012; 99:292-8. [PMID: 22374175 DOI: 10.1016/j.ygeno.2012.02.003] [Citation(s) in RCA: 125] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2011] [Revised: 02/01/2012] [Accepted: 02/10/2012] [Indexed: 11/18/2022]
Abstract
Vertebrate genomes encode thousands of non-coding RNAs including short non-coding RNAs (such as microRNAs) and long non-coding RNAs (lncRNAs). Chicken (Gallus gallus) is an important model organism for developmental biology, and the recently assembled genome sequences for chicken will facilitate the understanding of the functional roles of non-coding RNA genes during development. The present study concerns the first systematic identification of lncRNAs using RNA-Seq to sample the transcriptome during chicken muscle development. A computational approach was used to identify 281 new intergenic lncRNAs in the chicken genome. Novel lncRNAs in general are less conserved than protein-coding genes and slightly more conserved than random non-coding sequences. The present study has provided an initial chicken lncRNA catalog and greatly increased the number of chicken ncRNAs in the non-protein coding RNA database. Furthermore, the computational pipeline presented in the current work will be useful for characterizing lncRNAs obtained from deep sequencing data.
Collapse
Affiliation(s)
- Tingting Li
- Department of Biomedical Informatics, School of Basic Medical Sciences, Peking University Health Science Center, Beijing 100191, China
| | | | | | | | | | | |
Collapse
|
24
|
Wu B, Li Y, Yan H, Ma Y, Luo H, Yuan L, Chen S, Lu S. Comprehensive transcriptome analysis reveals novel genes involved in cardiac glycoside biosynthesis and mlncRNAs associated with secondary metabolism and stress response in Digitalis purpurea. BMC Genomics 2012; 13:15. [PMID: 22233149 PMCID: PMC3269984 DOI: 10.1186/1471-2164-13-15] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2011] [Accepted: 01/10/2012] [Indexed: 11/10/2022] Open
Abstract
Abstract Conclusions Through comprehensive transcriptome analysis, we not only identified 29 novel gene families potentially involved in the biosynthesis of cardiac glycosides but also characterized a large number of mlncRNAs. Our results suggest the importance of mlncRNAs in secondary metabolism and stress response in D. purpurea.
Collapse
Affiliation(s)
- Bin Wu
- Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, No,151, Malianwa North Road, Haidian District, Beijing 100193, China
| | | | | | | | | | | | | | | |
Collapse
|
25
|
Comparative transcriptome sequencing of germline and somatic tissues of the Ascaris suum gonad. BMC Genomics 2011; 12:481. [PMID: 21962222 PMCID: PMC3203103 DOI: 10.1186/1471-2164-12-481] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2011] [Accepted: 10/01/2011] [Indexed: 11/16/2022] Open
Abstract
Background Ascaris suum (large roundworm of pigs) is a parasitic nematode that causes substantial losses to the meat industry. This nematode is suitable for biochemical studies because, unlike C. elegans, homogeneous tissue samples can be obtained by dissection. It has large sperm, produced in great numbers that permit biochemical studies of sperm motility. Widespread study of A. suum would be facilitated by more comprehensive genome resources and, to this end, we have produced a gonad transcriptome of A. suum. Results Two 454 pyrosequencing runs generated 572,982 and 588,651 reads for germline (TES) and somatic (VAS) tissues of the A. suum gonad, respectively. 86% of the high-quality (HQ) reads were assembled into 9,955 contigs and 69,791 HQ reads remained as singletons. 2.4 million bp of unique sequences were obtained with a coverage that reached 16.1-fold. 4,877 contigs and 14,339 singletons were annotated according to the C. elegans protein and the Kyoto Encyclopedia of Genes and Genomes (KEGG) protein databases. Comparison of TES and VAS transcriptomes demonstrated that genes participating in DNA replication, RNA transcription and ubiquitin-proteasome pathways are expressed at significantly higher levels in TES tissues than in VAS tissues. Comparison of the A. suum TES transcriptome with the C. elegans microarray dataset identified 165 A. suum germline-enriched genes (83% are spermatogenesis-enriched). Many of these genes encode serine/threonine kinases and phosphatases (KPs) as well as tyrosine KPs. Immunoblot analysis further suggested a critical role of phosphorylation in both testis development and spermatogenesis. A total of 2,681 A. suum genes were identified to have associated RNAi phenotypes in C. elegans, the majority of which display embryonic lethality, slow growth, larval arrest or sterility. Conclusions Using deep sequencing technology, this study has produced a gonad transcriptome of A. suum. By comparison with C. elegans datasets, we identified sets of genes associated with spermatogenesis and gonad development in A. suum. The newly identified genes encoding KPs may help determine signaling pathways that operate during spermatogenesis. A large portion of A. suum gonadal genes have related RNAi phenotypes in C. elegans and, thus, might be RNAi targets for parasite control.
Collapse
|
26
|
Jung S, Swart EC, Minx PJ, Magrini V, Mardis ER, Landweber LF, Eddy SR. Exploiting Oxytricha trifallax nanochromosomes to screen for non-coding RNA genes. Nucleic Acids Res 2011; 39:7529-47. [PMID: 21715380 PMCID: PMC3177221 DOI: 10.1093/nar/gkr501] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
We took advantage of the unusual genomic organization of the ciliate Oxytricha trifallax to screen for eukaryotic non-coding RNA (ncRNA) genes. Ciliates have two types of nuclei: a germ line micronucleus that is usually transcriptionally inactive, and a somatic macronucleus that contains a reduced, fragmented and rearranged genome that expresses all genes required for growth and asexual reproduction. In some ciliates including Oxytricha, the macronuclear genome is particularly extreme, consisting of thousands of tiny 'nanochromosomes', each of which usually contains only a single gene. Because the organism itself identifies and isolates most of its genes on single-gene nanochromosomes, nanochromosome structure could facilitate the discovery of unusual genes or gene classes, such as ncRNA genes. Using a draft Oxytricha genome assembly and a custom-written protein-coding genefinding program, we identified a subset of nanochromosomes that lack any detectable protein-coding gene, thereby strongly enriching for nanochromosomes that carry ncRNA genes. We found only a small proportion of non-coding nanochromosomes, suggesting that Oxytricha has few independent ncRNA genes besides homologs of already known RNAs. Other than new members of known ncRNA classes including C/D and H/ACA snoRNAs, our screen identified one new family of small RNA genes, named the Arisong RNAs, which share some of the features of small nuclear RNAs.
Collapse
Affiliation(s)
- Seolkyoung Jung
- Janelia Farm Research Campus, Howard Hughes Medical Institute, Ashburn VA 20147, USA
| | | | | | | | | | | | | |
Collapse
|
27
|
Kanai A. [Virus, phage, transposon and their regulatory small non-coding RNAs]. Uirusu 2011; 61:25-34. [PMID: 21972553 DOI: 10.2222/jsv.61.25] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Many reports have been accumulated describing not a few microRNAs (miRNAs) in eukaryotes target viral genomes, whereas a number of viruses also encode miRNA genes. These small RNAs play important roles on viral infection and their replication. In germ cells, another small RNA, piRNA is reported to repress endogenous transposons. Furthermore, CRISPR RNA target virus/phage genomes in both archaea and bacteria. Therefore, small RNA is deeply involved in a broad range of biological defense systems. This system may be applied not only to control replication of viruses or phages but also provide implication on regulating the growth of microorganisms including pathogenic bacteria.
Collapse
Affiliation(s)
- Akio Kanai
- Institute for Advanced Biosciences, Keio University Tsuruoka, Yamagata 997-0017, Japan.
| |
Collapse
|
28
|
Saito R, Kohno K, Okada Y, Osada Y, Numata K, Kohama C, Watanabe K, Nakaoka H, Yamamoto N, Kanai A, Yasue H, Murata S, Abe K, Tomita M, Ohkohchi N, Kiyosawa H. Comprehensive expressional analyses of antisense transcripts in colon cancer tissues using artificial antisense probes. BMC Med Genomics 2011; 4:42. [PMID: 21575255 PMCID: PMC3125192 DOI: 10.1186/1755-8794-4-42] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2010] [Accepted: 05/16/2011] [Indexed: 11/10/2022] Open
Abstract
Background Recent studies have identified thousands of sense-antisense gene pairs across different genomes by computational mapping of cDNA sequences. These studies have shown that approximately 25% of all transcriptional units in the human and mouse genomes are involved in cis-sense-antisense pairs. However, the number of known sense-antisense pairs remains limited because currently available cDNA sequences represent only a fraction of the total number of transcripts comprising the transcriptome of each cell type. Methods To discover novel antisense transcripts encoded in the antisense strand of important genes, such as cancer-related genes, we conducted expression analyses of antisense transcripts using our custom microarray platform along with 2376 probes designed specifically to detect the potential antisense transcripts of 501 well-known genes suitable for cancer research. Results Using colon cancer tissue and normal tissue surrounding the cancer tissue obtained from 6 patients, we found that antisense transcripts without poly(A) tails are expressed from approximately 80% of these well-known genes. This observation is consistent with our previous finding that many antisense transcripts expressed in a cell are poly(A)-. We also identified 101 and 71 antisense probes displaying a high level of expression specifically in normal and cancer tissues respectively. Conclusion Our microarray analysis identified novel antisense transcripts with expression profiles specific to cancer tissue, some of which might play a role in the regulatory networks underlying oncogenesis and thus are potential targets for further experimental validation. Our microarray data are available at http://www.brc.riken.go.jp/ncrna2007/viewer-Saito-01/index.html.
Collapse
Affiliation(s)
- Rintaro Saito
- Institute for Advanced Biosciences, Keio University, Tsuruoka 997-0017, Japan
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
29
|
Mattick JS. The central role of RNA in human development and cognition. FEBS Lett 2011; 585:1600-16. [DOI: 10.1016/j.febslet.2011.05.001] [Citation(s) in RCA: 149] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2011] [Accepted: 05/03/2011] [Indexed: 12/22/2022]
|
30
|
Xin M, Wang Y, Yao Y, Song N, Hu Z, Qin D, Xie C, Peng H, Ni Z, Sun Q. Identification and characterization of wheat long non-protein coding RNAs responsive to powdery mildew infection and heat stress by using microarray analysis and SBS sequencing. BMC PLANT BIOLOGY 2011; 11:61. [PMID: 21473757 PMCID: PMC3079642 DOI: 10.1186/1471-2229-11-61] [Citation(s) in RCA: 235] [Impact Index Per Article: 18.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/21/2010] [Accepted: 04/07/2011] [Indexed: 05/05/2023]
Abstract
BACKGROUND Biotic and abiotic stresses, such as powdery mildew infection and high temperature, are important limiting factors for yield and grain quality in wheat production. Emerging evidences suggest that long non-protein coding RNAs (npcRNAs) are developmentally regulated and play roles in development and stress responses of plants. However, identification of long npcRNAs is limited to a few plant species, such as Arabidopsis, rice and maize, no systematic identification of long npcRNAs and their responses to abiotic and biotic stresses is reported in wheat. RESULTS In this study, by using computational analysis and experimental approach we identified 125 putative wheat stress responsive long npcRNAs, which are not conserved among plant species. Among them, some were precursors of small RNAs such as microRNAs and siRNAs, two long npcRNAs were identified as signal recognition particle (SRP) 7S RNA variants, and three were characterized as U3 snoRNAs. We found that wheat long npcRNAs showed tissue dependent expression patterns and were responsive to powdery mildew infection and heat stress. CONCLUSION Our results indicated that diverse sets of wheat long npcRNAs were responsive to powdery mildew infection and heat stress, and could function in wheat responses to both biotic and abiotic stresses, which provided a starting point to understand their functions and regulatory mechanisms in the future.
Collapse
Affiliation(s)
- Mingming Xin
- State Key Laboratory for Agrobiotechnology and Key Laboratory of Crop Heterosis and Utilization (MOE) and Key Laboratory of Crop Genomics and Genetic Improvement (MOA), Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing, 100094, PR China
- National Plant Gene Research Centre (Beijing), Beijing 100094, PR China
| | - Yu Wang
- State Key Laboratory for Agrobiotechnology and Key Laboratory of Crop Heterosis and Utilization (MOE) and Key Laboratory of Crop Genomics and Genetic Improvement (MOA), Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing, 100094, PR China
- National Plant Gene Research Centre (Beijing), Beijing 100094, PR China
| | - Yingyin Yao
- State Key Laboratory for Agrobiotechnology and Key Laboratory of Crop Heterosis and Utilization (MOE) and Key Laboratory of Crop Genomics and Genetic Improvement (MOA), Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing, 100094, PR China
- National Plant Gene Research Centre (Beijing), Beijing 100094, PR China
| | - Na Song
- State Key Laboratory for Agrobiotechnology and Key Laboratory of Crop Heterosis and Utilization (MOE) and Key Laboratory of Crop Genomics and Genetic Improvement (MOA), Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing, 100094, PR China
- National Plant Gene Research Centre (Beijing), Beijing 100094, PR China
| | - Zhaorong Hu
- State Key Laboratory for Agrobiotechnology and Key Laboratory of Crop Heterosis and Utilization (MOE) and Key Laboratory of Crop Genomics and Genetic Improvement (MOA), Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing, 100094, PR China
- National Plant Gene Research Centre (Beijing), Beijing 100094, PR China
| | - Dandan Qin
- State Key Laboratory for Agrobiotechnology and Key Laboratory of Crop Heterosis and Utilization (MOE) and Key Laboratory of Crop Genomics and Genetic Improvement (MOA), Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing, 100094, PR China
- National Plant Gene Research Centre (Beijing), Beijing 100094, PR China
| | - Chaojie Xie
- State Key Laboratory for Agrobiotechnology and Key Laboratory of Crop Heterosis and Utilization (MOE) and Key Laboratory of Crop Genomics and Genetic Improvement (MOA), Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing, 100094, PR China
- National Plant Gene Research Centre (Beijing), Beijing 100094, PR China
| | - Huiru Peng
- State Key Laboratory for Agrobiotechnology and Key Laboratory of Crop Heterosis and Utilization (MOE) and Key Laboratory of Crop Genomics and Genetic Improvement (MOA), Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing, 100094, PR China
- National Plant Gene Research Centre (Beijing), Beijing 100094, PR China
| | - Zhongfu Ni
- State Key Laboratory for Agrobiotechnology and Key Laboratory of Crop Heterosis and Utilization (MOE) and Key Laboratory of Crop Genomics and Genetic Improvement (MOA), Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing, 100094, PR China
- National Plant Gene Research Centre (Beijing), Beijing 100094, PR China
| | - Qixin Sun
- State Key Laboratory for Agrobiotechnology and Key Laboratory of Crop Heterosis and Utilization (MOE) and Key Laboratory of Crop Genomics and Genetic Improvement (MOA), Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing, 100094, PR China
- National Plant Gene Research Centre (Beijing), Beijing 100094, PR China
- Department of Plant Genetics & Breeding, China Agricultural University, Yuanmingyuan Xi Road No. 2, Haidian District, Beijing, 100193, PR China
| |
Collapse
|
31
|
van Bakel H, Nislow C, Blencowe BJ, Hughes TR. Most "dark matter" transcripts are associated with known genes. PLoS Biol 2010; 8:e1000371. [PMID: 20502517 PMCID: PMC2872640 DOI: 10.1371/journal.pbio.1000371] [Citation(s) in RCA: 330] [Impact Index Per Article: 23.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2009] [Accepted: 04/09/2010] [Indexed: 12/18/2022] Open
Abstract
Short-read RNA sequencing in mouse and human tissues shows that most transcripts are encoded within or nearby known genes and that most of the genome is not transcribed. A series of reports over the last few years have indicated that a much larger portion of the mammalian genome is transcribed than can be accounted for by currently annotated genes, but the quantity and nature of these additional transcripts remains unclear. Here, we have used data from single- and paired-end RNA-Seq and tiling arrays to assess the quantity and composition of transcripts in PolyA+ RNA from human and mouse tissues. Relative to tiling arrays, RNA-Seq identifies many fewer transcribed regions (“seqfrags”) outside known exons and ncRNAs. Most nonexonic seqfrags are in introns, raising the possibility that they are fragments of pre-mRNAs. The chromosomal locations of the majority of intergenic seqfrags in RNA-Seq data are near known genes, consistent with alternative cleavage and polyadenylation site usage, promoter- and terminator-associated transcripts, or new alternative exons; indeed, reads that bridge splice sites identified 4,544 new exons, affecting 3,554 genes. Most of the remaining seqfrags correspond to either single reads that display characteristics of random sampling from a low-level background or several thousand small transcripts (median length = 111 bp) present at higher levels, which also tend to display sequence conservation and originate from regions with open chromatin. We conclude that, while there are bona fide new intergenic transcripts, their number and abundance is generally low in comparison to known exons, and the genome is not as pervasively transcribed as previously reported. The human genome was sequenced a decade ago, but its exact gene composition remains a subject of debate. The number of protein-coding genes is much lower than initially expected, and the number of distinct transcripts is much larger than the number of protein-coding genes. Moreover, the proportion of the genome that is transcribed in any given cell type remains an open question: results from “tiling” microarray analyses suggest that transcription is pervasive and that most of the genome is transcribed, whereas new deep sequencing-based methods suggest that most transcripts originate from known genes. We have addressed this discrepancy by comparing samples from the same tissues using both technologies. Our analyses indicate that RNA sequencing appears more reliable for transcripts with low expression levels, that most transcripts correspond to known genes or are near known genes, and that many transcripts may represent new exons or aberrant products of the transcription process. We also identify several thousand small transcripts that map outside known genes; their sequences are often conserved and are often encoded in regions of open chromatin. We propose that most of these transcripts may be by-products of the activity of enhancers, which associate with promoters as part of their role as long-range gene regulatory sites. Overall, however, we find that most of the genome is not appreciably transcribed.
Collapse
Affiliation(s)
- Harm van Bakel
- Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario, Canada
| | - Corey Nislow
- Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Benjamin J. Blencowe
- Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Timothy R. Hughes
- Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
- * E-mail:
| |
Collapse
|
32
|
Sathira N, Yamashita R, Tanimoto K, Kanai A, Arauchi T, Kanematsu S, Nakai K, Suzuki Y, Sugano S. Characterization of transcription start sites of putative non-coding RNAs by multifaceted use of massively paralleled sequencer. DNA Res 2010; 17:169-83. [PMID: 20400770 PMCID: PMC2885271 DOI: 10.1093/dnares/dsq007] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Abstract
On the basis of integrated transcriptome analysis, we show that not all transcriptional start site clusters (TSCs) in the intergenic regions (iTSCs) have the same properties; thus, it is possible to discriminate the iTSCs that are likely to have biological relevance from the other noise-level iTSCs. We used a total of 251 933 381 short-read sequence tags generated from various types of transcriptome analyses in order to characterize 6039 iTSCs, which have significant expression levels. We analyzed and found that 23% of these iTSCs were located in the proximal regions of the RefSeq genes. These RefSeq-linked iTSCs showed similar expression patterns with the neighboring RefSeq genes, had widely fluctuating transcription start sites and lacked ordered nucleosome positioning. These iTSCs seemed not to form independent transcriptional units, simply representing the by-products of the neighboring RefSeq genes, in spite of their significant expression levels. Similar features were also observed for the TSCs located in the antisense regions of the RefSeq genes. Furthermore, for the remaining iTSCs that were not associated with any RefSeq genes, we demonstrate that integrative interpretation of the transcriptome data provides essential information to specify their biological functions in the hypoxic responses of the cells.
Collapse
Affiliation(s)
- Nuankanya Sathira
- Department of Medical Genome Sciences, Graduate School of Frontier Sciences, The University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa-shi, Chiba 277-8568, Japan
| | | | | | | | | | | | | | | | | |
Collapse
|
33
|
Lebenthal I, Unger R. Computational evidence for functionality of noncoding mouse transcripts. Genomics 2010; 96:10-6. [PMID: 20347031 DOI: 10.1016/j.ygeno.2010.03.008] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2009] [Revised: 02/18/2010] [Accepted: 03/19/2010] [Indexed: 11/29/2022]
Abstract
Large-scale studies of mammalian genome transcription reveal that a large proportion of the genome is transcribed. It remains an open question whether the identified transcripts are functional. Here, we searched for computational evidence to support the functionality of 34,030 noncoding RNA (ncRNA) transcripts reported by the Fantom3 project. We show that compared to control sets, the Fantom ncRNA transcripts set is more conserved with human and rat. We also demonstrate that homologs of the Fantom ncRNA sequences in human and rat have more matches to ESTs. The conserved subgroup of sequences exhibits elevated expression levels in brain tissues. Finally, on average, the Fantom ncRNA sequences have lower minimal free energy of folding than the control sets. Taken together, these observations suggest that, as a group, the Fantom ncRNA set has properties that are different from random sets. Therefore, many of these transcripts may indeed have biological function.
Collapse
Affiliation(s)
- Ilana Lebenthal
- The Mina & Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat-Gan, Israel.
| | | |
Collapse
|
34
|
Zhang Y, Liu J, Jia C, Li T, Wu R, Wang J, Chen Y, Zou X, Chen R, Wang XJ, Zhu D. Systematic identification and evolutionary features of rhesus monkey small nucleolar RNAs. BMC Genomics 2010; 11:61. [PMID: 20100322 PMCID: PMC2832892 DOI: 10.1186/1471-2164-11-61] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2009] [Accepted: 01/25/2010] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND Recent studies have demonstrated that non-protein-coding RNAs (npcRNAs/ncRNAs) play important roles during eukaryotic development, species evolution, and in the etiology of disease. Rhesus macaques are the most widely used primate model in both biomedical research and primate evolutionary studies. However, most reports on these animals focus on the functional roles of protein-coding sequences, whereas very little is known about macaque ncRNAs. RESULTS In the present study, we performed the first systematic profiling of intermediate-size ncRNAs (50 to 500 nt) from the rhesus monkey by constructing a cDNA library. We identified 117 rhesus monkey ncRNAs, including 80 small nucleolar RNAs (snoRNAs), 29 other types of known RNAs (snRNAs, Y RNA, and others), and eight unclassified ncRNAs. Comparative genomic analysis and northern blot hybridizations demonstrated that some snoRNAs were lineage- or species-specific. Paralogous sequences were found for most rhesus monkey snoRNAs, the expression of which might be attributable to extensive duplication within the rhesus monkey genome. Further investigation of snoRNA flanking sequences showed that some rhesus monkey snoRNAs are retrogenes derived from L1-mediated integration. Finally, phylogenetic analysis demonstrated that birds and primates share some snoRNAs and host genes thereof, suggesting that both the relevant host genes and the snoRNAs contained therein may be inherited from a common ancestor. However, some rhesus monkey snoRNAs hosted by non-ribosome-related genes appeared after the evolutionary divergence between birds and mammals. CONCLUSIONS We provide the first experimentally-derived catalog of rhesus monkey ncRNAs and uncover some interesting genomic and evolutionary features. These findings provide important information for future functional characterization of snoRNAs during primate evolution.
Collapse
Affiliation(s)
- Yong Zhang
- National Laboratory of Medical Molecular Biology, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences, School of Basic Medicine, Peking Union Medical College, Beijing, PR China
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
35
|
Jacquier A. The complex eukaryotic transcriptome: unexpected pervasive transcription and novel small RNAs. Nat Rev Genet 2009; 10:833-44. [PMID: 19920851 DOI: 10.1038/nrg2683] [Citation(s) in RCA: 318] [Impact Index Per Article: 21.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Over the past few years, techniques have been developed that have allowed the study of transcriptomes without bias from previous genome annotations, which has led to the discovery of a plethora of unexpected RNAs that have no obvious coding capacities. There are many different kinds of products that are generated by this pervasive transcription; this Review focuses on small non-coding RNAs (ncRNAs) that have been found to be associated with promoters in eukaryotes from animals to yeast. After comparing the different classes of such ncRNAs described in various studies, the Review discusses how the models proposed for their origins and their possible functions challenge previous views of the basic transcription process and its regulation.
Collapse
Affiliation(s)
- Alain Jacquier
- Unité de Génétique des Interactions Macromoléculaires, Institut Pasteur, Centre National de la Recherche Scientifique URA2171, 25 Rue du Dr Roux, F-75015, Paris, France.
| |
Collapse
|
36
|
Forrest ARR, Abdelhamid RF, Carninci P. Annotating non-coding transcription using functional genomics strategies. BRIEFINGS IN FUNCTIONAL GENOMICS AND PROTEOMICS 2009; 8:437-43. [PMID: 19833699 DOI: 10.1093/bfgp/elp041] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Non-coding RNA (ncRNA) transcripts are RNA molecules that do not code for proteins, but elicit function by other mechanisms. The vast majority of RNA produced in a cell is non-coding ribosomal RNA, produced from relatively few loci, however more recently complementary DNA (cDNA) cloning, tag sequencing, and genome tiling array studies suggest that ncRNAs also account for the majority of RNA species produced by a cell. ncRNA based regulation has been referred to as a 'hidden layer' of signals or 'dark matter' that control gene expression in cellular processes by poorly described mechanisms. These terms have appeared as ncRNAs until recently have been ignored by expression profiling and cDNA annotation projects and their mode of action is diverse (e.g. influencing chromatin structure and epigenetics, translational silencing, transcriptional silencing). Here, we highlight recent functional genomics strategies toward identifying and assigning function to ncRNA transcription.
Collapse
Affiliation(s)
- Alistair R R Forrest
- Omics Science Center, RIKEN Yokohama Institute, Yokohama, Kanagawa 230-0045 Japan
| | | | | |
Collapse
|
37
|
van Bakel H, Hughes TR. Establishing legitimacy and function in the new transcriptome. BRIEFINGS IN FUNCTIONAL GENOMICS AND PROTEOMICS 2009; 8:424-36. [PMID: 19833698 DOI: 10.1093/bfgp/elp037] [Citation(s) in RCA: 60] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
The last decade has seen an explosion of interest in new classes of non-coding RNA. While some are now firmly established as new categories of legitimate functional RNAs, the purpose and even existence of others remain to be solidified. Here, we discuss the challenges associated with discovery and characterization of non-traditional categories of non-coding RNA.
Collapse
Affiliation(s)
- Harm van Bakel
- Banting and Best Department of Medical Research, University of Toronto, Toronto, ON M5S 3E1, Canada
| | | |
Collapse
|
38
|
Amaral PP, Neyt C, Wilkins SJ, Askarian-Amiri ME, Sunkin SM, Perkins AC, Mattick JS. Complex architecture and regulated expression of the Sox2ot locus during vertebrate development. RNA (NEW YORK, N.Y.) 2009; 15:2013-2027. [PMID: 19767420 PMCID: PMC2764477 DOI: 10.1261/rna.1705309] [Citation(s) in RCA: 167] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/24/2009] [Accepted: 08/18/2009] [Indexed: 05/28/2023]
Abstract
The Sox2 gene is a key regulator of pluripotency embedded within an intron of a long noncoding RNA (ncRNA), termed Sox2 overlapping transcript (Sox2ot), which is transcribed in the same orientation. However, this ncRNA remains uncharacterized. Here we show that Sox2ot has multiple transcription start sites associated with genomic features that indicate regulated expression, including highly conserved elements (HCEs) and chromatin marks characteristic of gene promoters. To identify biological processes in which Sox2ot may be involved, we analyzed its expression in several developmental systems, compared to expression of Sox2. We show that Sox2ot is a stable transcript expressed in mouse embryonic stem cells, which, like Sox2, is down-regulated upon induction of embryoid body (EB) differentiation. However, in contrast to Sox2, Sox2ot is up-regulated during EB mesoderm-lineage differentiation. In adult mouse, Sox2ot isoforms were detected in tissues where Sox2 is expressed, as well as in different tissues, supporting independent regulation of expression of the ncRNA. Sox2dot, an isoform of Sox2ot transcribed from a distal HCE located >500 kb upstream of Sox2, was detected exclusively in the mouse brain, with enrichment in regions of adult neurogenesis. In addition, Sox2ot isoforms are transcribed from HCEs upstream of Sox2 in other vertebrates, including in several regions of the human brain. We also show that Sox2ot is dynamically regulated during chicken and zebrafish embryogenesis, consistently associated with central nervous system structures. These observations provide insight into the structure and regulation of the Sox2ot gene, and suggest conserved roles for Sox2ot orthologs during vertebrate development.
Collapse
Affiliation(s)
- Paulo P Amaral
- ARC Special Research Centre for Functional and Applied Genomics, Institute for Molecular Bioscience, The University of Queensland, St Lucia,QLD 4072, Australia
| | | | | | | | | | | | | |
Collapse
|
39
|
Nagano T, Fraser P. Emerging similarities in epigenetic gene silencing by long noncoding RNAs. Mamm Genome 2009; 20:557-62. [PMID: 19727951 DOI: 10.1007/s00335-009-9218-1] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2009] [Accepted: 07/20/2009] [Indexed: 10/20/2022]
Abstract
Long noncoding RNAs (lncRNAs) such as Xist, Air, and Kcnq1ot1 are required for epigenetic silencing of multiple genes in cis within large chromosomal domains, including distant genes located hundreds of kilobase pairs away. Recent evidence suggests that all three of these lncRNAs are functional and that they silence gene expression, in part, through an intimate interaction with chromatin. Here we provide an overview of lncRNA-dependent gene silencing, focusing on recent findings for the Air and Kcnq1ot1 lncRNAs. We review molecular evidence indicating that these lncRNAs interact with chromatin and correlate their presence with specific histone modifications associated with gene silencing. A general model for a lncRNA-dependent gene-silencing mechanism is presented based on the apparent ability of lncRNAs to recruit histone-modifying activities to chromatin. However, alternate mechanisms may be required to explain the silencing of some lncRNA-dependent genes. Finally, we discuss unanswered questions and future perspectives associated with these enigmatic lncRNA molecules.
Collapse
Affiliation(s)
- Takashi Nagano
- Laboratory of Chromatin and Gene Expression, The Babraham Institute, Babraham Research Campus, Cambridge CB223AT, UK.
| | | |
Collapse
|
40
|
Zhang Y, Wang J, Huang S, Zhu X, Liu J, Yang N, Song D, Wu R, Deng W, Skogerbø G, Wang XJ, Chen R, Zhu D. Systematic identification and characterization of chicken (Gallus gallus) ncRNAs. Nucleic Acids Res 2009; 37:6562-74. [PMID: 19720738 PMCID: PMC2770669 DOI: 10.1093/nar/gkp704] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Recent studies have demonstrated that non-coding RNAs (ncRNAs) play important roles during development and evolution. Chicken, the first genome-sequenced non-mammalian amniote, possesses unique features for developmental and evolutionary studies. However, apart from microRNAs, information on chicken ncRNAs has mainly been obtained from computational predictions without experimental validation. In the present study, we performed a systematic identification of intermediate size ncRNAs (50–500 nt) by ncRNA library construction and identified 125 chicken ncRNAs. Importantly, through the bioinformatics and expression analysis, we found the chicken ncRNAs has several novel features: (i) comparative genomic analysis against 18 sequenced vertebrate genomes revealed that the majority of the newly identified ncRNA candidates is not conserved and most are potentially bird/chicken specific, suggesting that ncRNAs play roles in lineage/species specification during evolution. (ii) The expression pattern analysis of intronic snoRNAs and their host genes suggested the coordinated expression between snoRNAs and their host genes. (iii) Several spatio-temporal specific expression patterns suggest involvement of ncRNAs in tissue development. Together, these findings provide new clues for future functional study of ncRNAs during development and evolution.
Collapse
Affiliation(s)
- Yong Zhang
- National Laboratory of Medical Molecular Biology, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 100005, China
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
41
|
Arrial RT, Togawa RC, Brigido MDM. Screening non-coding RNAs in transcriptomes from neglected species using PORTRAIT: case study of the pathogenic fungus Paracoccidioides brasiliensis. BMC Bioinformatics 2009; 10:239. [PMID: 19653905 PMCID: PMC2731755 DOI: 10.1186/1471-2105-10-239] [Citation(s) in RCA: 78] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2008] [Accepted: 08/04/2009] [Indexed: 12/02/2022] Open
Abstract
Background Transcriptome sequences provide a complement to structural genomic information and provide snapshots of an organism's transcriptional profile. Such sequences also represent an alternative method for characterizing neglected species that are not expected to undergo whole-genome sequencing. One difficulty for transcriptome sequencing of these organisms is the low quality of reads and incomplete coverage of transcripts, both of which compromise further bioinformatics analyses. Another complicating factor is the lack of known protein homologs, which frustrates searches against established protein databases. This lack of homologs may be caused by divergence from well-characterized and over-represented model organisms. Another explanation is that non-coding RNAs (ncRNAs) may be caught during sequencing. NcRNAs are RNA sequences that, unlike messenger RNAs, do not code for protein products and instead perform unique functions by folding into higher order structural conformations. There is ncRNA screening software available that is specific for transcriptome sequences, but their analyses are optimized for those transcriptomes that are well represented in protein databases, and also assume that input ESTs are full-length and high quality. Results We propose an algorithm called PORTRAIT, which is suitable for ncRNA analysis of transcriptomes from poorly characterized species. Sequences are translated by software that is resistant to sequencing errors, and the predicted putative proteins, along with their source transcripts, are evaluated for coding potential by a support vector machine (SVM). Either of two SVM models may be employed: if a putative protein is found, a protein-dependent SVM model is used; if it is not found, a protein-independent SVM model is used instead. Only ab initio features are extracted, so that no homology information is needed. We illustrate the use of PORTRAIT by predicting ncRNAs from the transcriptome of the pathogenic fungus Paracoccidoides brasiliensis and five other related fungi. Conclusion PORTRAIT can be integrated into pipelines, and provides a low computational cost solution for ncRNA detection in transcriptome sequencing projects.
Collapse
|
42
|
Nichols M, Steinman RA. A recombinase-based palindrome generator capable of producing randomized shRNA libraries. J Biotechnol 2009; 143:79-84. [PMID: 19539675 DOI: 10.1016/j.jbiotec.2009.06.010] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2009] [Revised: 06/04/2009] [Accepted: 06/09/2009] [Indexed: 01/05/2023]
|
43
|
Nordström KJV, Mirza MAI, Almén MS, Gloriam DE, Fredriksson R, Schiöth HB. Critical evaluation of the FANTOM3 non-coding RNA transcripts. Genomics 2009; 94:169-76. [PMID: 19505569 DOI: 10.1016/j.ygeno.2009.05.012] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2007] [Revised: 05/25/2009] [Accepted: 05/26/2009] [Indexed: 01/15/2023]
Abstract
We studied the genomic positions of 38,129 putative ncRNAs from the RIKEN dataset in relation to protein-coding genes. We found that the dataset has 41% sense, 6% antisense, 24% intronic and 29% intergenic transcripts. Interestingly, 17,678 (47%) of the FANTOM3 transcripts were found to potentially be internally primed from longer transcripts. The highest fraction of these transcripts was found among the intronic transcripts and as many as 77% or 6929 intronic transcripts were both internally primed and unspliced. We defined a filtered subset of 8535 transcripts that did not overlap with protein-coding genes, did not contain ORFs longer than 100 residues and were not internally primed. This dataset contains 53% of the FANTOM3 transcripts associated to known ncRNA in RNAdb and expands previous similar efforts with 6523 novel transcripts. This bioinformatic filtering of the FANTOM3 non-coding dataset has generated a lead dataset of transcripts without signs of being artefacts, providing a suitable dataset for investigation with hybridization-based techniques.
Collapse
|
44
|
Mello BP, Abrantes EF, Torres CH, Machado-Lima A, Fonseca RDS, Carraro DM, Brentani RR, Reis LFL, Brentani H. No-match ORESTES explored as tumor markers. Nucleic Acids Res 2009; 37:2607-17. [PMID: 19270067 PMCID: PMC2677862 DOI: 10.1093/nar/gkp074] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
Sequencing technologies and new bioinformatics tools have led to the complete sequencing of various genomes. However, information regarding the human transcriptome and its annotation is yet to be completed. The Human Cancer Genome Project, using ORESTES (open reading frame EST sequences) methodology, contributed to this objective by generating data from about 1.2 million expressed sequence tags. Approximately 30% of these sequences did not align to ESTs in the public databases and were considered no-match ORESTES. On the basis that a set of these ESTs could represent new transcripts, we constructed a cDNA microarray. This platform was used to hybridize against 12 different normal or tumor tissues. We identified 3421 transcribed regions not associated with annotated transcripts, representing 83.3% of the platform. The total number of differentially expressed sequences was 1007. Also, 28% of analyzed sequences could represent noncoding RNAs. Our data reinforces the knowledge of the human genome being pervasively transcribed, and point out molecular marker candidates for different cancers. To reinforce our data, we confirmed, by real-time PCR, the differential expression of three out of eight potentially tumor markers in prostate tissues. Lists of 1007 differentially expressed sequences, and the 291 potentially noncoding tumor markers were provided.
Collapse
Affiliation(s)
- Barbara P Mello
- Hospital A. C. Camargo, Rua Prof. Antônio Prudente 211, São Paulo, SP, Brazil
| | | | | | | | | | | | | | | | | |
Collapse
|
45
|
Abstract
Eukaryotes using trans-splicing for transcript processing incorporate a taxon-specific sequence tag (the spliced leader, SL) to a proportion (either all or a fraction) of their mRNAs. This feature may be exploited for the preparation of full-length-enriched cDNA libraries from these organisms (a diverse group including euglenozoa and dinoflagellates, as well as members from five metazoan phyla: Cnidaria, Rotifera, Nematoda, Platyhelminths and Chordata). The strategy has indeed been widely used to construct cDNA libraries for the generation of ESTs, mainly from parasitic euglenozoa and helminths.We describe a set of optimised protocols to prepare directional SL-cDNA libraries; the method involves PCR-amplification of SL-cDNA and its subsequent cloning in a plasmid vector under a specific orientation. It uses small amounts of total RNA as starting material and may be applied to a variety of samples. The approach permits the selective cloning of mRNAs tagged with a particular SL from mixtures including large amounts of non-trans-spliced mRNAs. Thus, it allows exclusion of host contamination when isolating SL-cDNAs from parasitic organisms, and has other potential applications, such as the characterisation of the trans-spliced transcriptome from organisms in mixed pools of species.
Collapse
Affiliation(s)
- Cecilia Fernández
- Facultad de Química, Universidad de la República, Montevideo, Uruguay
| | | |
Collapse
|
46
|
Transcriptional profiling of hematopoietic stem cells by high-throughput sequencing. Int J Hematol 2008; 89:24-33. [PMID: 19050837 DOI: 10.1007/s12185-008-0212-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2008] [Revised: 10/01/2008] [Accepted: 10/23/2008] [Indexed: 10/21/2022]
Abstract
Microarray analysis has made it feasible to carry out extensive gene expression profiling in a single assay. Various hematopoietic stem cell (HSC) populations have been subjected to microarray analyses and their profiles of gene expression have been reported. However, this approach is not suitable to identify novel transcripts or for profiling of genes with low expression levels. To obtain a detailed gene expression profile of CD34(-)c-Kit(+)Sca-1(+)lineage marker-negative (Lin(-)) (CD34(-)KSL) HSCs, we constructed a CD34(-)KSL cDNA library, performed high-throughput sequencing, and compared the generated profile with that of another HSC fraction, side population (SP) Lin(-) (SP Lin(-)) cells. Sequencing of the 5'-termini of about 9,500 cDNAs from each HSC library identified 1,424 and 2,078 different genes from the CD34(-)KSL and SP Lin(-) libraries, respectively. To exclude ubiquitously expressed genes including housekeeping genes, digital subtraction was successfully performed against EST databases of other organs, leaving 25 HSC-specific genes including five novel genes. Among 4,450 transcripts from the CD34(-)KSL cDNA library that showed no homology to the presumable protein-coding genes, 29 were identified as strong candidates for mRNA-like non-coding RNAs by in silico analyses. Our cyclopedic approaches may contribute to understanding of novel molecular aspects of HSC function.
Collapse
|
47
|
Non-coding RNAs revealed during identification of genes involved in chicken immune responses. Immunogenetics 2008; 61:55-70. [PMID: 19009289 DOI: 10.1007/s00251-008-0337-8] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2008] [Accepted: 10/13/2008] [Indexed: 12/12/2022]
Abstract
Recent large-scale cDNA cloning studies have shown that a significant proportion of the transcripts expressed from vertebrate genomes do not appear to encode protein. Moreover, it was reported in mammals (human and mice) that these non-coding transcripts are expressed and regulated by mechanisms similar to those involved in the control of protein-coding genes. We have produced a collection of cDNA sequences from immunologically active tissues with the aim of discovering chicken genes involved in immune mechanisms, and we decided to explore the non-coding component of these immune-related libraries. After finding known non-coding RNAs (miRNA, snRNA, snoRNA), we identified new putative mRNA-like non-coding RNAs. We characterised their expression profiles in immune-related samples. Some of them showed changes in expression following viral infections. As they exhibit patterns of expression that parallel the behaviour of protein-coding RNAs in immune tissues, our study suggests that they could play an active role in the immune response.
Collapse
|
48
|
Ben Amor B, Wirth S, Merchan F, Laporte P, d'Aubenton-Carafa Y, Hirsch J, Maizel A, Mallory A, Lucas A, Deragon JM, Vaucheret H, Thermes C, Crespi M. Novel long non-protein coding RNAs involved in Arabidopsis differentiation and stress responses. Genome Res 2008; 19:57-69. [PMID: 18997003 DOI: 10.1101/gr.080275.108] [Citation(s) in RCA: 269] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
Long non-protein coding RNAs (npcRNA) represent an emerging class of riboregulators, which either act directly in this long form or are processed to shorter miRNA and siRNA. Genome-wide bioinformatic analysis of full-length cDNA databases identified 76 Arabidopsis npcRNAs. Fourteen npcRNAs were antisense to protein-coding mRNAs, suggesting cis-regulatory roles. Numerous 24-nt siRNA matched to five different npcRNAs, suggesting that these npcRNAs are precursors of this type of siRNA. Expression analyses of the 76 npcRNAs identified a novel npcRNA that accumulates in a dcl1 mutant but does not appear to produce trans-acting siRNA or miRNA. Additionally, another npcRNA was the precursor of miR869 and shown to be up-regulated in dcl4 but not in dcl1 mutants, indicative of a young miRNA gene. Abiotic stress altered the accumulation of 22 npcRNAs among the 76, a fraction significantly higher than that observed for the RNA binding protein-coding fraction of the transcriptome. Overexpression analyses in Arabidopsis identified two npcRNAs as regulators of root growth during salt stress and leaf morphology, respectively. Hence, together with small RNAs, long npcRNAs encompass a sensitive component of the transcriptome that have diverse roles during growth and differentiation.
Collapse
Affiliation(s)
- Besma Ben Amor
- Institut des Sciences du Végétal (ISV), CNRS, 91198 Gif-sur-Yvette, France
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
49
|
Abstract
Non-protein-coding sequences increasingly dominate the genomes of multicellular organisms as their complexity increases, in contrast to protein-coding genes, which remain relatively static. Most of the mammalian genome and indeed that of all eukaryotes is expressed in a cell- and tissue-specific manner, and there is mounting evidence that much of this transcription is involved in the regulation of differentiation and development. Different classes of small and large noncoding RNAs (ncRNAs) have been shown to regulate almost every level of gene expression, including the activation and repression of homeotic genes and the targeting of chromatin-remodeling complexes. ncRNAs are involved in developmental processes in both simple and complex eukaryotes, and we illustrate this in the latter by focusing on the animal germline, brain, and eye. While most have yet to be systematically studied, the emerging evidence suggests that there is a vast hidden layer of regulatory ncRNAs that constitutes the majority of the genomic programming of multicellular organisms and plays a major role in controlling the epigenetic trajectories that underlie their ontogeny.
Collapse
|
50
|
Silva SS, Paes HC, Soares CMA, Fernandes L, Felipe MSS. Insights into the pathobiology of Paracoccidioides brasiliensis from transcriptome analysis--advances and perspectives. Mycopathologia 2008; 165:249-58. [PMID: 18777632 DOI: 10.1007/s11046-007-9071-2] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Paracoccidioiddes brasiliensis is a thermodimorphic fungus endemic to Latin America, where it causes the most prevalent systemic mycosis, paracoccidioidomycosis (PCM). DNA microarray technology has been used to identify patterns of gene expression when a microbe is confronted with conditions of interest, such as in vitro and/or ex vivo interaction with specific cells. P. brasiliensis is one organism that has benefited from this approach. Even though its genome has not been sequenced yet, much has been discovered from its transcriptome and DNA array analyses. In this review, we will outline the current knowledge in P. brasiliensis transcriptome, with focus on differential expression analysis in vitro and on the discussion of the genes that are controlled during the host-pathogen interaction ex vivo in order to give insights into the pathobiology of this fungus. In vitro experiments enabled the delineation of whole metabolic pathways; the description of differential metabolism between mycelium and yeast cells and of the mainly signaling pathways controlling dimorphism, high temperature growth, thermal and oxidative stress, and virulence/ pathogenicity. Recent ex vivo experiments provided advances on the comprehension of the plasticity of response and indicate that P. brasiliensis is not only able to undergo fast and dramatic expression profile changes but can also discern subtle differences, such as whether it is being attacked by a macrophage or submitted to the bloodstream route conditions.
Collapse
Affiliation(s)
- Simoneide S Silva
- Laboratório de Biologia Molecular, Departamento de Biologia Celular, Universidade de Brasília, Brasilia, DF 70910-900, Brazil
| | | | | | | | | |
Collapse
|