1
|
Xu H, Li C, Xu C, Zhang J. Chance promoter activities illuminate the origins of eukaryotic intergenic transcriptions. Nat Commun 2023; 14:1826. [PMID: 37005399 PMCID: PMC10067814 DOI: 10.1038/s41467-023-37610-w] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2022] [Accepted: 03/23/2023] [Indexed: 04/04/2023] Open
Abstract
It is debated whether the pervasive intergenic transcription from eukaryotic genomes has functional significance or simply reflects the promiscuity of RNA polymerases. We approach this question by comparing chance promoter activities with the expression levels of intergenic regions in the model eukaryote Saccharomyces cerevisiae. We build a library of over 105 strains, each carrying a 120-nucleotide, chromosomally integrated, completely random sequence driving the potential transcription of a barcode. Quantifying the RNA concentration of each barcode in two environments reveals that 41-63% of random sequences have significant, albeit usually low, promoter activities. Therefore, even in eukaryotes, where the presence of chromatin is thought to repress transcription, chance transcription is prevalent. We find that only 1-5% of yeast intergenic transcriptions are unattributable to chance promoter activities or neighboring gene expressions, and these transcriptions exhibit higher-than-expected environment-specificity. These findings suggest that only a minute fraction of intergenic transcription is functional in yeast.
Collapse
Affiliation(s)
- Haiqing Xu
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, USA
- Department of Biology, Stanford University, Stanford, CA, USA
| | - Chuan Li
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, USA
- Microsoft, Redmond, WA, USA
| | - Chuan Xu
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, USA
- Bio-X Institutes, Shanghai Jiao Tong University, Shanghai, China
| | - Jianzhi Zhang
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, USA.
| |
Collapse
|
2
|
Activity-Dependent Non-Coding RNA MAPK Interactome of the Human Epileptic Brain. Noncoding RNA 2023; 9:ncrna9010003. [PMID: 36649033 PMCID: PMC9844323 DOI: 10.3390/ncrna9010003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2022] [Revised: 12/07/2022] [Accepted: 12/16/2022] [Indexed: 01/06/2023] Open
Abstract
The human brain has evolved to have extraordinary capabilities, enabling complex behaviors. The uniqueness of the human brain is increasingly posited to be due in part to the functions of primate-specific, including human-specific, long non-coding RNA (lncRNA) genes, systemically less conserved than protein-coding genes in evolution. Patients who have surgery for drug-resistant epilepsy are subjected to extensive electrical recordings of the brain tissue that is subsequently removed in order to treat their epilepsy. Precise localization of brain tissues with distinct electrical properties offers a rare opportunity to explore the effects of brain activity on gene expression. Here, we identified 231 co-regulated, activity-dependent lncRNAs within the human MAPK signaling cascade. Six lncRNAs, four of which were antisense to known protein-coding genes, were further examined because of their high expression and potential impact on the disease phenotype. Using a model of repeated depolarizations in human neuronal-like cells (Sh-SY5Y), we show that five out of six lncRNAs were electrical activity-dependent, with three of four antisense lncRNAs having reciprocal expression patterns relative to their protein-coding gene partners. Some were directly regulated by MAPK signaling, while others effectively downregulated the expression of the protein-coding genes encoded on the opposite strands of their genomic loci. These lncRNAs, therefore, likely contribute to highly evolved and primate-specific human brain regulatory functions that could be therapeutically modulated to treat epilepsy.
Collapse
|
3
|
Wang C, Duan M, Lin J, Wang G, Gao H, Yan M, Chen L, He J, Liu W, Yang F, Zhu S. LncRNA and mRNA expression profiles in brown adipose tissue of obesity-prone and obesity-resistant mice. iScience 2022; 25:104809. [PMID: 35992072 PMCID: PMC9382264 DOI: 10.1016/j.isci.2022.104809] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Revised: 06/10/2022] [Accepted: 07/15/2022] [Indexed: 11/28/2022] Open
Abstract
Obesity-prone or obesity-resistant phenotypes can exist in individuals who consume the same diet type. Brown adipose tissue functions to dissipate energy in response to cold exposure or overfeeding. Long noncoding RNAs play important roles in a wide range of biological processes. However, systematic examination of lncRNAs in phenotypically divergent mice has not yet been reported. Here, the lncRNA expression profiles in BAT of HFD-induced C57BL/6J mice were investigated by high-throughput RNA sequencing. Genes that play roles in thermogenesis and related pathways were identified. We found lncRNA (Gm44502) may play a thermogenic role in obesity resistance by interacting with six mRNAs. Our results also indicated that seven differentially expressed lncRNAs (4930528G23Rik, Gm39490, Gm5627, Gm15551, Gm16083, Gm36860, Gm42002) may play roles in reducing heat production in obesity susceptibility by interacting with seven differentially expressed mRNAs. The screened lncRNAs may participate in the pathogenesis of weight regulation and provide insight into obesity therapy. First lncRNA profiles in BAT of OR and OP mice via bioinformatic analysis Gm44502 may play a thermogenic role by interacting with 6 mRNAs 7 DElncRNAs may reduce thermogenesis by interacting with 7 DEmRNAs Validation of expression changes of candidate genes in BAT by in vivo or in vitro
Collapse
Affiliation(s)
- Congcong Wang
- Chronic Disease Research Institute, The Children’s Hospital, and National Clinical Research Center for Child Health, School of Public Health, School of Medicine, Zhejiang University, Hangzhou, Zhejiang 310058, China
- Department of Nutrition and Food Hygiene, School of Public Health, School of Medicine, Zhejiang University, 866 Yu-hang-tang Road, Hangzhou, Zhejiang 310058, China
| | - Meng Duan
- Chronic Disease Research Institute, The Children’s Hospital, and National Clinical Research Center for Child Health, School of Public Health, School of Medicine, Zhejiang University, Hangzhou, Zhejiang 310058, China
- Department of Nutrition and Food Hygiene, School of Public Health, School of Medicine, Zhejiang University, 866 Yu-hang-tang Road, Hangzhou, Zhejiang 310058, China
| | - Jinhua Lin
- Chronic Disease Research Institute, The Children’s Hospital, and National Clinical Research Center for Child Health, School of Public Health, School of Medicine, Zhejiang University, Hangzhou, Zhejiang 310058, China
- Department of Nutrition and Food Hygiene, School of Public Health, School of Medicine, Zhejiang University, 866 Yu-hang-tang Road, Hangzhou, Zhejiang 310058, China
| | - Guowei Wang
- Chronic Disease Research Institute, The Children’s Hospital, and National Clinical Research Center for Child Health, School of Public Health, School of Medicine, Zhejiang University, Hangzhou, Zhejiang 310058, China
- Department of Nutrition and Food Hygiene, School of Public Health, School of Medicine, Zhejiang University, 866 Yu-hang-tang Road, Hangzhou, Zhejiang 310058, China
| | - He Gao
- Chronic Disease Research Institute, The Children’s Hospital, and National Clinical Research Center for Child Health, School of Public Health, School of Medicine, Zhejiang University, Hangzhou, Zhejiang 310058, China
- Department of Nutrition and Food Hygiene, School of Public Health, School of Medicine, Zhejiang University, 866 Yu-hang-tang Road, Hangzhou, Zhejiang 310058, China
| | - Mengsha Yan
- Chronic Disease Research Institute, The Children’s Hospital, and National Clinical Research Center for Child Health, School of Public Health, School of Medicine, Zhejiang University, Hangzhou, Zhejiang 310058, China
- Department of Nutrition and Food Hygiene, School of Public Health, School of Medicine, Zhejiang University, 866 Yu-hang-tang Road, Hangzhou, Zhejiang 310058, China
| | - Lin Chen
- Chronic Disease Research Institute, The Children’s Hospital, and National Clinical Research Center for Child Health, School of Public Health, School of Medicine, Zhejiang University, Hangzhou, Zhejiang 310058, China
- Department of Nutrition and Food Hygiene, School of Public Health, School of Medicine, Zhejiang University, 866 Yu-hang-tang Road, Hangzhou, Zhejiang 310058, China
| | - Jialing He
- Chronic Disease Research Institute, The Children’s Hospital, and National Clinical Research Center for Child Health, School of Public Health, School of Medicine, Zhejiang University, Hangzhou, Zhejiang 310058, China
- Department of Nutrition and Food Hygiene, School of Public Health, School of Medicine, Zhejiang University, 866 Yu-hang-tang Road, Hangzhou, Zhejiang 310058, China
| | - Wei Liu
- Department of Biochemistry, School of Medicine, Zhejiang University, Hangzhou, Zhejiang 310058, China
| | - Fei Yang
- Chronic Disease Research Institute, The Children’s Hospital, and National Clinical Research Center for Child Health, School of Public Health, School of Medicine, Zhejiang University, Hangzhou, Zhejiang 310058, China
- Department of Nutrition and Food Hygiene, School of Public Health, School of Medicine, Zhejiang University, 866 Yu-hang-tang Road, Hangzhou, Zhejiang 310058, China
- Corresponding author
| | - Shankuan Zhu
- Chronic Disease Research Institute, The Children’s Hospital, and National Clinical Research Center for Child Health, School of Public Health, School of Medicine, Zhejiang University, Hangzhou, Zhejiang 310058, China
- Department of Nutrition and Food Hygiene, School of Public Health, School of Medicine, Zhejiang University, 866 Yu-hang-tang Road, Hangzhou, Zhejiang 310058, China
- Corresponding author
| |
Collapse
|
4
|
Exosomal RNAs: Novel Potential Biomarkers for Diseases-A Review. Int J Mol Sci 2022; 23:ijms23052461. [PMID: 35269604 PMCID: PMC8910301 DOI: 10.3390/ijms23052461] [Citation(s) in RCA: 28] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Revised: 02/17/2022] [Accepted: 02/18/2022] [Indexed: 01/25/2023] Open
Abstract
Exosomes are a subset of nano-sized extracellular vesicles originating from endosomes. Exosomes mediate cell-to-cell communication with their cargos, which includes mRNAs, miRNAs, lncRNAs, and circRNAs. Exosomal RNAs have cell specificity and reflect the conditions of their donor cells. Notably, their detection in biofluids can be used as a diagnostic marker for various diseases. Exosomal RNAs are ideal biomarkers because their surrounding membranes confer stability and they are detectable in almost all biofluids, which helps to reduce trauma and avoid invasive examinations. However, knowledge of exosomal biomarkers remains scarce. The present review summarizes the biogenesis, secretion, and uptake of exosomes, the current researches exploring exosomal mRNAs, miRNAs, lncRNAs, and circRNAs as potential biomarkers for the diagnosis of human diseases, as well as recent techniques of exosome isolation.
Collapse
|
5
|
Ramírez-Colmenero A, Oktaba K, Fernandez-Valverde SL. Evolution of Genome-Organizing Long Non-coding RNAs in Metazoans. Front Genet 2020; 11:589697. [PMID: 33329735 PMCID: PMC7734150 DOI: 10.3389/fgene.2020.589697] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2020] [Accepted: 11/09/2020] [Indexed: 12/28/2022] Open
Abstract
Long non-coding RNAs (lncRNAs) have important regulatory functions across eukarya. It is now clear that many of these functions are related to gene expression regulation through their capacity to recruit epigenetic modifiers and establish chromatin interactions. Several lncRNAs have been recently shown to participate in modulating chromatin within the spatial organization of the genome in the three-dimensional space of the nucleus. The identification of lncRNA candidates is challenging, as it is their functional characterization. Conservation signatures of lncRNAs are different from those of protein-coding genes, making identifying lncRNAs under selection a difficult task, and the homology between lncRNAs may not be readily apparent. Here, we review the evidence for these higher-order genome organization functions of lncRNAs in animals and the evolutionary signatures they display.
Collapse
Affiliation(s)
- América Ramírez-Colmenero
- Unidad de Genómica Avanzada (Langebio), Centro de Investigación y de Estudios Avanzados del IPN, Irapuato, México
| | - Katarzyna Oktaba
- Unidad Irapuato, Centro de Investigación y de Estudios Avanzados del IPN, Irapuato, México
| | - Selene L Fernandez-Valverde
- Unidad de Genómica Avanzada (Langebio), Centro de Investigación y de Estudios Avanzados del IPN, Irapuato, México
| |
Collapse
|
6
|
Squillaro T, Peluso G, Galderisi U, Di Bernardo G. Long non-coding RNAs in regulation of adipogenesis and adipose tissue function. eLife 2020; 9:59053. [PMID: 32730204 PMCID: PMC7392603 DOI: 10.7554/elife.59053] [Citation(s) in RCA: 52] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2020] [Accepted: 07/23/2020] [Indexed: 12/12/2022] Open
Abstract
Complex interaction between genetics, epigenetics, environment, and nutrition affect the physiological activities of adipose tissues and their dysfunctions, which lead to several metabolic diseases including obesity or type 2 diabetes. Here, adipogenesis appears to be a process characterized by an intricate network that involves many transcription factors and long noncoding RNAs (lncRNAs) that regulate gene expression. LncRNAs are being investigated to determine their contribution to adipose tissue development and function. LncRNAs possess multiple cellular functions, and they regulate chromatin remodeling, along with transcriptional and post-transcriptional events; in this way, they affect gene expression. New investigations have demonstrated the pivotal role of these molecules in modulating white and brown/beige adipogenic tissue development and activity. This review aims to provide an update on the role of lncRNAs in adipogenesis and adipose tissue function to promote identification of new drug targets for treating obesity and related metabolic diseases.
Collapse
Affiliation(s)
- Tiziana Squillaro
- Department of Experimental Medicine, Biotechnology, and Molecular Biology Section, University of Campania Luigi Vanvitelli, Naples, Italy
| | | | - Umberto Galderisi
- Department of Experimental Medicine, Biotechnology, and Molecular Biology Section, University of Campania Luigi Vanvitelli, Naples, Italy
| | - Giovanni Di Bernardo
- Department of Experimental Medicine, Biotechnology, and Molecular Biology Section, University of Campania Luigi Vanvitelli, Naples, Italy
| |
Collapse
|
7
|
Genome-wide discovery and characterization of long noncoding RNAs in patients with multiple myeloma. BMC Med Genomics 2019; 12:135. [PMID: 31619233 PMCID: PMC6794882 DOI: 10.1186/s12920-019-0577-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2019] [Accepted: 08/30/2019] [Indexed: 12/12/2022] Open
Abstract
Background Long noncoding RNAs (lncRNAs) are involved in a wide range of biological processes in tumorigenesis. However, the role of lncRNA expression in the biology, prognosis, and molecular classification of human multiple myeloma (MM) remains unclear, especially the biological functions of the vast majority of lncRNAs. Recently, lncRNAs have been identified in neoplastic hematologic disorders. Evidence has accumulated on the molecular mechanisms of action of lncRNAs, providing insight into their functional roles in tumorigenesis. This study aimed to characterize potential lncRNAs in patients with MM. Methods In this study, the whole-transcriptome strand-specific RNA sequencing of samples from three newly diagnosed patients with MM was performed. The whole transcriptome, including lncRNAs, microRNAs, and mRNAs, was analyzed. Using these data, MM lncRNAs were systematically analyzed, and the lncRNAs involved in the occurrence of MM were identified. Results The results revealed that MM lncRNAs had distinctive characteristics different from those of other malignant tumors. Further, the functions of a set of lncRNAs preferentially expressed in MM were verified, and several lncRNAs were identified as competing endogenous RNAs. More importantly, the aberrant expression of certain lncRNAs, including maternally expressed gene3, colon cancer–associated transcript1, and coiled-coil domain-containing 26, as well as some novel lncRNAs involved in the occurrence of MM was established. Further, lncRNAs were related to some microRNAs, regulated each other, and participated in MM development. Conclusions Genome-wide screening and functional analysis enabled the identification of a set of lncRNAs involved in the occurrence of MM. The interaction exists among microRNAs and lncRNAs.
Collapse
|
8
|
LncRNA HOTAIR in Tumor Microenvironment: What Role? Int J Mol Sci 2019; 20:ijms20092279. [PMID: 31072041 PMCID: PMC6539022 DOI: 10.3390/ijms20092279] [Citation(s) in RCA: 52] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2019] [Revised: 05/03/2019] [Accepted: 05/08/2019] [Indexed: 12/19/2022] Open
Abstract
lncRNAs participate in many cellular processes, including regulation of gene expression at the transcriptional and post-transcriptional levels. In addition, many lncRNAs can contribute to the development of different human diseases including cancer. The tumor microenvironment (TME) plays an important role during tumor growth and metastatic progression, and most of these lncRNAs have a key function in TME intracellular signaling. Among the numerous identified lncRNAs, several experimental evidences have shown the fundamental role of the lncRNA HOTAIR in carcinogenesis, also highlighting its use as a circulating biomarker. In this review we described the contribution of HOTAIR in the TME modulation, highlighting its relation with cellular and non-cellular components during tumor evolution and progression.
Collapse
|
9
|
Field AR, Jacobs FMJ, Fiddes IT, Phillips APR, Reyes-Ortiz AM, LaMontagne E, Whitehead L, Meng V, Rosenkrantz JL, Olsen M, Hauessler M, Katzman S, Salama SR, Haussler D. Structurally Conserved Primate LncRNAs Are Transiently Expressed during Human Cortical Differentiation and Influence Cell-Type-Specific Genes. Stem Cell Reports 2019; 12:245-257. [PMID: 30639214 PMCID: PMC6372947 DOI: 10.1016/j.stemcr.2018.12.006] [Citation(s) in RCA: 45] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2018] [Revised: 12/10/2018] [Accepted: 12/11/2018] [Indexed: 01/30/2023] Open
Abstract
The cerebral cortex has expanded in size and complexity in primates, yet the molecular innovations that enabled primate-specific brain attributes remain obscure. We generated cerebral cortex organoids from human, chimpanzee, orangutan, and rhesus pluripotent stem cells and sequenced their transcriptomes at weekly time points for comparative analysis. We used transcript structure and expression conservation to discover gene regulatory long non-coding RNAs (lncRNAs). Of 2,975 human, multi-exonic lncRNAs, 2,472 were structurally conserved in at least one other species and 920 were conserved in all. Three hundred eighty-six human lncRNAs were transiently expressed (TrEx) and many were also TrEx in great apes (46%) and rhesus (31%). Many TrEx lncRNAs are expressed in specific cell types by single-cell RNA sequencing. Four TrEx lncRNAs selected based on cell-type specificity, gene structure, and expression pattern conservation were ectopically expressed in HEK293 cells by CRISPRa. All induced trans gene expression changes were consistent with neural gene regulatory activity.
Collapse
Affiliation(s)
- Andrew R Field
- Molecular, Cell, and Developmental Biology, University of California, Santa Cruz, Santa Cruz, CA 95064, USA; Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Frank M J Jacobs
- Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| | - Ian T Fiddes
- Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA; Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| | - Alex P R Phillips
- Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| | - Andrea M Reyes-Ortiz
- Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| | - Erin LaMontagne
- Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| | - Lila Whitehead
- Molecular, Cell, and Developmental Biology, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| | - Vincent Meng
- Molecular, Cell, and Developmental Biology, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| | - Jimi L Rosenkrantz
- Howard Hughes Medical Institute, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| | - Mari Olsen
- Howard Hughes Medical Institute, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| | - Max Hauessler
- Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA; Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| | - Sol Katzman
- Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Sofie R Salama
- Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA; Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, CA 95064, USA; Howard Hughes Medical Institute, University of California, Santa Cruz, Santa Cruz, CA 95064, USA.
| | - David Haussler
- Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA; Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, CA 95064, USA; Howard Hughes Medical Institute, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| |
Collapse
|
10
|
Abstract
Brite/brown adipose tissue (BAT) is a thermogenic tissue able to dissipate energy via non-shivering thermogenesis. It is naturally activated by cold and has been demonstrated to increase thermogenic capacity, elevate energy expenditure, and to ultimately contribute to fat mass reduction. Thus, it emerges as novel therapeutic concept for pharmacological intervention in obesity and other metabolic disorders. Therefore, the comprehensive understanding of the regulatory network in thermogenic adipocytes is in demand.The surprising findings that (1) all human protein-coding genes make up not more than 2% of our genome, (2) organismal complexity goes well along with the percentage of nonprotein-coding sequences, and that (3) three quarters of our genome are pervasively transcribed, provide evidence that noncoding RNAs (ncRNAs) are not junk, but a significant and even predominant part of our transcriptome representing a treasure chest worth retrieving regulatory determinants in biological processes and diseases.In this chapter, the impact of regulatory small and long ncRNAs (lncRNAs) in particular microRNAs and lncRNAs on BAT formation and metabolic function and their involvement in physiological and pathological conditions has been reviewed.
Collapse
|
11
|
Long Non-Coding RNAs in Metabolic Organs and Energy Homeostasis. Int J Mol Sci 2017; 18:ijms18122578. [PMID: 29189723 PMCID: PMC5751181 DOI: 10.3390/ijms18122578] [Citation(s) in RCA: 49] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2017] [Revised: 11/20/2017] [Accepted: 11/24/2017] [Indexed: 12/15/2022] Open
Abstract
Single cell organisms can surprisingly exceed the number of human protein-coding genes, which are thus not at the origin of the complexity of an organism. In contrast, the relative amount of non-protein-coding sequences increases consistently with organismal complexity. Moreover, the mammalian transcriptome predominantly comprises non-(protein)-coding RNAs (ncRNA), of which the long ncRNAs (lncRNAs) constitute the most abundant part. lncRNAs are highly species- and tissue-specific with very versatile modes of action in accordance with their binding to a large spectrum of molecules and their diverse localization. lncRNAs are transcriptional regulators adding an additional regulatory layer in biological processes and pathophysiological conditions. Here, we review lncRNAs affecting metabolic organs with a focus on the liver, pancreas, skeletal muscle, cardiac muscle, brain, and adipose organ. In addition, we will discuss the impact of lncRNAs on metabolic diseases such as obesity and diabetes. In contrast to the substantial number of lncRNA loci in the human genome, the functionally characterized lncRNAs are just the tip of the iceberg. So far, our knowledge concerning lncRNAs in energy homeostasis is still in its infancy, meaning that the rest of the iceberg is a treasure chest yet to be discovered.
Collapse
|
12
|
The New RNA World: Growing Evidence for Long Noncoding RNA Functionality. Trends Genet 2017; 33:665-676. [DOI: 10.1016/j.tig.2017.08.002] [Citation(s) in RCA: 155] [Impact Index Per Article: 22.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2017] [Revised: 08/01/2017] [Accepted: 08/02/2017] [Indexed: 12/18/2022]
|
13
|
Long non-coding RNAs: Mechanism of action and functional utility. Noncoding RNA Res 2016; 1:43-50. [PMID: 30159410 PMCID: PMC6096411 DOI: 10.1016/j.ncrna.2016.11.002] [Citation(s) in RCA: 181] [Impact Index Per Article: 22.6] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2016] [Revised: 11/09/2016] [Accepted: 11/10/2016] [Indexed: 12/22/2022] Open
Abstract
Recent RNA sequencing studies have revealed that most of the human genome is transcribed, but very little of the total transcriptomes has the ability to encode proteins. Long non-coding RNAs (lncRNAs) are non-coding transcripts longer than 200 nucleotides. Members of the non-coding genome include microRNA (miRNA), small regulatory RNAs and other short RNAs. Most of long non-coding RNA (lncRNAs) are poorly annotated. Recent recognition about lncRNAs highlights their effects in many biological and pathological processes. LncRNAs are dysfunctional in a variety of human diseases varying from cancerous to non-cancerous diseases. Characterization of these lncRNA genes and their modes of action may allow their use for diagnosis, monitoring of progression and targeted therapies in various diseases. In this review, we summarize the functional perspectives as well as the mechanism of action of lncRNAs.
Collapse
|
14
|
Abstract
The number of long noncoding RNAs (lncRNAs) has grown rapidly; however, our understanding of their function remains limited. Although cultured cells have facilitated investigations of lncRNA function at the molecular level, the use of animal models provides a rich context in which to investigate the phenotypic impact of these molecules. Promising initial studies using animal models demonstrated that lncRNAs influence a diverse number of phenotypes, ranging from subtle dysmorphia to viability. Here, we highlight the diversity of animal models and their unique advantages, discuss the use of animal models to profile lncRNA expression, evaluate experimental strategies to manipulate lncRNA function in vivo, and review the phenotypes attributable to lncRNAs. Despite a limited number of studies leveraging animal models, lncRNAs are already recognized as a notable class of molecules with important implications for health and disease.
Collapse
|
15
|
Tripathi R, Patel S, Kumari V, Chakraborty P, Varadwaj PK. DeepLNC, a long non-coding RNA prediction tool using deep neural network. ACTA ACUST UNITED AC 2016. [DOI: 10.1007/s13721-016-0129-2] [Citation(s) in RCA: 47] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
16
|
Quinn JJ, Zhang QC, Georgiev P, Ilik IA, Akhtar A, Chang HY. Rapid evolutionary turnover underlies conserved lncRNA-genome interactions. Genes Dev 2016; 30:191-207. [PMID: 26773003 PMCID: PMC4719309 DOI: 10.1101/gad.272187.115] [Citation(s) in RCA: 135] [Impact Index Per Article: 16.9] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Many long noncoding RNAs (lncRNAs) can regulate chromatin states, but the evolutionary origin and dynamics driving lncRNA-genome interactions are unclear. We adapted an integrative strategy that identifies lncRNA orthologs in different species despite limited sequence similarity, which is applicable to mammalian and insect lncRNAs. Analysis of the roX lncRNAs, which are essential for dosage compensation of the single X chromosome in Drosophila males, revealed 47 new roX orthologs in diverse Drosophilid species across ∼40 million years of evolution. Genetic rescue by roX orthologs and engineered synthetic lncRNAs showed that altering the number of focal, repetitive RNA structures determines roX ortholog function. Genomic occupancy maps of roX RNAs in four species revealed conserved targeting of X chromosome neighborhoods but rapid turnover of individual binding sites. Many new roX-binding sites evolved from DNA encoding a pre-existing RNA splicing signal, effectively linking dosage compensation to transcribed genes. Thus, dynamic change in lncRNAs and their genomic targets underlies conserved and essential lncRNA-genome interactions.
Collapse
Affiliation(s)
- Jeffrey J Quinn
- Center for Personal Dynamic Regulomes, Stanford University School of Medicine, Stanford, California 94305, USA; Department of Bioengineering, Stanford University School of Medicine and School of Engineering, Stanford, California 94305, USA
| | - Qiangfeng C Zhang
- Center for Personal Dynamic Regulomes, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Plamen Georgiev
- Max Planck Institute of Immunobiology and Epigenetics, 79108 Freiburg im Breisgau, Germany
| | - Ibrahim A Ilik
- Max Planck Institute of Immunobiology and Epigenetics, 79108 Freiburg im Breisgau, Germany
| | - Asifa Akhtar
- Max Planck Institute of Immunobiology and Epigenetics, 79108 Freiburg im Breisgau, Germany
| | - Howard Y Chang
- Center for Personal Dynamic Regulomes, Stanford University School of Medicine, Stanford, California 94305, USA
| |
Collapse
|
17
|
Wei S, Du M, Jiang Z, Hausman GJ, Zhang L, Dodson MV. Long noncoding RNAs in regulating adipogenesis: new RNAs shed lights on obesity. Cell Mol Life Sci 2016; 73:2079-87. [PMID: 26943803 PMCID: PMC5737903 DOI: 10.1007/s00018-016-2169-2] [Citation(s) in RCA: 82] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2015] [Revised: 02/13/2016] [Accepted: 02/23/2016] [Indexed: 10/22/2022]
Abstract
Long noncoding RNAs (lncRNAs) are an emerging class of regulators involved in a myriad of biological processes. Recent studies have revealed that many lncRNAs play pivotal roles in regulating adipocyte development. Due to the prevalence of obesity and the serious effects of adiposity on human health and society development, it is necessary to summarize functions and recent advances of lncRNAs in adipogenesis. In this review, we highlight functional lncRNAs contributed to the regulation of adipogenesis, discussing their potential use as therapeutic targets to combat human obesity.
Collapse
Affiliation(s)
- Shengjuan Wei
- College of Animal Science and Technology, Nanjing Agricultural University, Nanjing, 210095, People's Republic of China
| | - Min Du
- Department of Animal Sciences, Washington State University, Pullman, WA, 99164, USA
| | - Zhihua Jiang
- Department of Animal Sciences, Washington State University, Pullman, WA, 99164, USA
| | - Gary J Hausman
- Animal and Dairy Science, University of Georgia, Athens, GA, 30602-2771, USA
| | - Lifan Zhang
- College of Animal Science and Technology, Nanjing Agricultural University, Nanjing, 210095, People's Republic of China.
| | - Michael V Dodson
- Department of Animal Sciences, Washington State University, Pullman, WA, 99164, USA.
| |
Collapse
|
18
|
Chen X, Yan CC, Luo C, Ji W, Zhang Y, Dai Q. Constructing lncRNA functional similarity network based on lncRNA-disease associations and disease semantic similarity. Sci Rep 2015; 5:11338. [PMID: 26061969 PMCID: PMC4462156 DOI: 10.1038/srep11338] [Citation(s) in RCA: 150] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2015] [Accepted: 05/21/2015] [Indexed: 12/28/2022] Open
Abstract
Increasing evidence has indicated that plenty of lncRNAs play important roles in many critical biological processes. Developing powerful computational models to construct lncRNA functional similarity network based on heterogeneous biological datasets is one of the most important and popular topics in the fields of both lncRNAs and complex diseases. Functional similarity network consturction could benefit the model development for both lncRNA function inference and lncRNA-disease association identification. However, little effort has been attempted to analysis and calculate lncRNA functional similarity on a large scale. In this study, based on the assumption that functionally similar lncRNAs tend to be associated with similar diseases, we developed two novel lncRNA functional similarity calculation models (LNCSIM). LNCSIM was evaluated by introducing similarity scores into the model of Laplacian Regularized Least Squares for LncRNA–Disease Association (LRLSLDA) for lncRNA-disease association prediction. As a result, new predictive models improved the performance of LRLSLDA in the leave-one-out cross validation of various known lncRNA-disease associations datasets. Furthermore, some of the predictive results for colorectal cancer and lung cancer were verified by independent biological experimental studies. It is anticipated that LNCSIM could be a useful and important biological tool for human disease diagnosis, treatment, and prevention.
Collapse
Affiliation(s)
- Xing Chen
- 1] National Center for Mathematics and Interdisciplinary Sciences, Chinese Academy of Sciences, Beijing, 100190, China [2] Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, 100190, China
| | | | - Cai Luo
- Department of Automation, Tsinghua University, Beijing, 100084, China
| | - Wen Ji
- Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190, China
| | - Yongdong Zhang
- Key Lab of Intelligent Information Processing of Chinese Academy of Sciences, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190, China
| | - Qionghai Dai
- Department of Automation, Tsinghua University, Beijing, 100084, China
| |
Collapse
|
19
|
Ge P, Zhang S. Computational analysis of RNA structures with chemical probing data. Methods 2015; 79-80:60-6. [PMID: 25687190 DOI: 10.1016/j.ymeth.2015.02.003] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2014] [Revised: 01/16/2015] [Accepted: 02/09/2015] [Indexed: 11/28/2022] Open
Abstract
RNAs play various roles, not only as the genetic codes to synthesize proteins, but also as the direct participants of biological functions determined by their underlying high-order structures. Although many computational methods have been proposed for analyzing RNA structures, their accuracy and efficiency are limited, especially when applied to the large RNAs and the genome-wide data sets. Recently, advances in parallel sequencing and high-throughput chemical probing technologies have prompted the development of numerous new algorithms, which can incorporate the auxiliary structural information obtained from those experiments. Their potential has been revealed by the secondary structure prediction of ribosomal RNAs and the genome-wide ncRNA function annotation. In this review, the existing probing-directed computational methods for RNA secondary and tertiary structure analysis are discussed.
Collapse
Affiliation(s)
- Ping Ge
- Department of Electrical Engineering and Computer Science, University of Central Florida, Orlando, FL 32816-2362, USA
| | - Shaojie Zhang
- Department of Electrical Engineering and Computer Science, University of Central Florida, Orlando, FL 32816-2362, USA.
| |
Collapse
|
20
|
Kunej T, Obsteter J, Pogacar Z, Horvat S, Calin GA. The decalog of long non-coding RNA involvement in cancer diagnosis and monitoring. Crit Rev Clin Lab Sci 2014; 51:344-57. [PMID: 25123609 DOI: 10.3109/10408363.2014.944299] [Citation(s) in RCA: 81] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Long non-coding RNAs (lncRNAs) are transcripts without protein-coding capacity; initially regarded as "transcriptional noise", lately they have emerged as essential factors in both cell biology and mechanisms of disease. In this article, we present basic knowledge of lncRNA molecular mechanisms, associated physiological processes and cancer association, as well as their diagnostic and therapeutic value in the form of a decalog: (1) Non-coding RNAs (ncRNAs) are transcripts without protein-coding capacity divided by size (short and long ncRNAs), function (housekeeping RNA and regulatory RNA) and direction of transcription (sense/antisense, bidirectional, intronic and intergenic), containing a broad range of molecules with diverse properties and functions, such as messenger RNA, transfer RNA, microRNA and long non-coding RNAs. (2) Long non-coding RNAs are implicated in many molecular mechanisms, such as transcriptional regulation, post-transcriptional regulation and processing of other short ncRNAs. (3) Long non-coding RNAs play an important role in many physiological processes such as X-chromosome inactivation, cell differentiation, immune response and apoptosis. (4) Long non-coding RNAs have been linked to hallmarks of cancer: (a) sustaining proliferative signaling; (b) evading growth suppressors; (c) enabling replicative immortality; (d) activating invasion and metastasis; (e) inducing angiogenesis; (f) resisting cell death; and (g) reprogramming energy metabolism. (5) Regarding their impact on cancer cells, lncRNAs are divided into two groups: oncogenic and tumor-suppressor lncRNAs. (6) Studies of lncRNA involvement in cancer usually analyze deregulated expression patterns at the RNA level as well as the effects of single nucleotide polymorphisms and copy number variations at the DNA level. (7) Long non-coding RNAs have potential as novel biomarkers due to tissue-specific expression patterns, efficient detection in body fluids and high stability. (8) LncRNAs serve as novel biomarkers for diagnostic, prognostic and monitoring purposes. (9) Tissue specificity of lncRNAs enables the development of selective therapeutic options. (10) Long non-coding RNAs are emerging as commercial biomarkers and therapeutic agents.
Collapse
Affiliation(s)
- Tanja Kunej
- Department of Animal Science, Biotechnical Faculty, University of Ljubljana , Domzale , Slovenia
| | | | | | | | | |
Collapse
|
21
|
Mehta SL, Dharap A, Vemuganti R. Expression of transcribed ultraconserved regions of genome in rat cerebral cortex. Neurochem Int 2014; 77:86-93. [PMID: 24953281 DOI: 10.1016/j.neuint.2014.06.006] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2014] [Revised: 06/09/2014] [Accepted: 06/10/2014] [Indexed: 11/29/2022]
Abstract
Emerging evidence indicates that 481 regions of the genome (>200 bp) that actively transcribe noncoding RNAs shows 100% homology between humans, rats and mice. These transcribed ultraconserved regions (T-UCRs) are thought to control the essential regulatory functions basic for life in rodents and mammals. Using microarray analysis, we presently show that 107 T-UCRs are actively expressed in adult rat cerebral cortex. They are grouped into intragenic (61) and intergenic (46) based on their genic location. Interestingly, 10 T-UCRs are expressed at unusually high levels in cerebral cortex. Additionally, many T-UCRs also showed cogenic expression. We further analyzed the correlation of intragenic T-UCRs with their host protein coding genes. Surprisingly, most of the expressed intragenic T-UCRs (54 out of 61) displayed a negative correlation with their host gene expression. T-UCRs are thought to control the splicing and transcription of the protein-coding genes that host them and flank them. Bioinformatics analysis indicated that the protein products of majority of these genes are nuclear in localization, share protein domains and are involved in the regulation of diverse biological and molecular functions including metabolism, development, cell cycle, binding and transcription factor regulation. In conclusion, this is the first study to shows that many T-UCRs are expressed in rodent brain and they might play a role in physiological brain functions.
Collapse
Affiliation(s)
- Suresh L Mehta
- Department of Neurological Surgery, University of Wisconsin, Madison, WI, USA
| | - Ashutosh Dharap
- Department of Neurological Surgery, University of Wisconsin, Madison, WI, USA; Theoretical Biology and Biophysics (T-6), Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Raghu Vemuganti
- Department of Neurological Surgery, University of Wisconsin, Madison, WI, USA.
| |
Collapse
|
22
|
Billerey C, Boussaha M, Esquerré D, Rebours E, Djari A, Meersseman C, Klopp C, Gautheret D, Rocha D. Identification of large intergenic non-coding RNAs in bovine muscle using next-generation transcriptomic sequencing. BMC Genomics 2014; 15:499. [PMID: 24948191 PMCID: PMC4073507 DOI: 10.1186/1471-2164-15-499] [Citation(s) in RCA: 100] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2014] [Accepted: 06/13/2014] [Indexed: 01/24/2023] Open
Abstract
Background The advent of large-scale gene expression technologies has helped to reveal in eukaryotic cells, the existence of thousands of non-coding transcripts, whose function and significance remain mostly poorly understood. Among these non-coding transcripts, long non-coding RNAs (lncRNAs) are the least well-studied but are emerging as key regulators of diverse cellular processes. In the present study, we performed a survey in bovine Longissimus thoraci of lincRNAs (long intergenic non-coding RNAs not overlapping protein-coding transcripts). To our knowledge, this represents the first such study in bovine muscle. Results To identify lincRNAs, we used paired-end RNA sequencing (RNA-Seq) to explore the transcriptomes of Longissimus thoraci from nine Limousin bull calves. Approximately 14–45 million paired-end reads were obtained per library. A total of 30,548 different transcripts were identified. Using a computational pipeline, we defined a stringent set of 584 different lincRNAs with 418 lincRNAs found in all nine muscle samples. Bovine lincRNAs share characteristics seen in their mammalian counterparts: relatively short transcript and gene lengths, low exon number and significantly lower expression, compared to protein-encoding genes. As for the first time, our study identified lincRNAs from nine different samples from the same tissue, it is possible to analyse the inter-individual variability of the gene expression level of the identified lincRNAs. Interestingly, there was a significant difference when we compared the expression variation of the 418 lincRNAs with the 10,775 known selected protein-encoding genes found in all muscle samples. In addition, we found 2,083 pairs of lincRNA/protein-encoding genes showing a highly significant correlated expression. Fourteen lincRNAs were selected and 13 were validated by RT-PCR. Some of the lincRNAs expressed in muscle are located within quantitative trait loci for meat quality traits. Conclusions Our study provides a glimpse into the lincRNA content of bovine muscle and will facilitate future experimental studies to unravel the function of these molecules. It may prove useful to elucidate their effect on mechanisms underlying the genetic variability of meat quality traits. This catalog will complement the list of lincRNAs already discovered in cattle and therefore will help to better annotate the bovine genome. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-499) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | - Dominique Rocha
- INRA, UMR1313, Unité Génétique Animale et Biologie Intégrative, Domaine de Vilvert, F-78352 Jouy-en-Josas, France.
| |
Collapse
|
23
|
Affiliation(s)
- Alexander F. Palazzo
- University of Toronto, Department of Biochemistry, Toronto, Ontario, Canada
- * E-mail: (AP); (TG)
| | - T. Ryan Gregory
- University of Guelph, Department of Integrative Biology, Guelph, Ontario, Canada
- * E-mail: (AP); (TG)
| |
Collapse
|
24
|
Abstract
Transcriptomics experiments and computational predictions both enable systematic discovery of new functional RNAs. However, many putative noncoding transcripts arise instead from artifacts and biological noise, and current computational prediction methods have high false positive rates. I discuss prospects for improving computational methods for analyzing and identifying functional RNAs, with a focus on detecting signatures of conserved RNA secondary structure. An interesting new front is the application of chemical and enzymatic experiments that probe RNA structure on a transcriptome-wide scale. I review several proposed approaches for incorporating structure probing data into the computational prediction of RNA secondary structure. Using probabilistic inference formalisms, I show how all these approaches can be unified in a well-principled framework, which in turn allows RNA probing data to be easily integrated into a wide range of analyses that depend on RNA secondary structure inference. Such analyses include homology search and genome-wide detection of new structural RNAs.
Collapse
Affiliation(s)
- Sean R Eddy
- Howard Hughes Medical Institute Janelia Farm Research Campus, Ashburn, Virginia 20147;
| |
Collapse
|
25
|
Chen X, Yan GY. Novel human lncRNA-disease association inference based on lncRNA expression profiles. ACTA ACUST UNITED AC 2013; 29:2617-24. [PMID: 24002109 DOI: 10.1093/bioinformatics/btt426] [Citation(s) in RCA: 429] [Impact Index Per Article: 39.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
MOTIVATION More and more evidences have indicated that long-non-coding RNAs (lncRNAs) play critical roles in many important biological processes. Therefore, mutations and dysregulations of these lncRNAs would contribute to the development of various complex diseases. Developing powerful computational models for potential disease-related lncRNAs identification would benefit biomarker identification and drug discovery for human disease diagnosis, treatment, prognosis and prevention. RESULTS In this article, we proposed the assumption that similar diseases tend to be associated with functionally similar lncRNAs. Then, we further developed the method of Laplacian Regularized Least Squares for LncRNA-Disease Association (LRLSLDA) in the semisupervised learning framework. Although known disease-lncRNA associations in the database are rare, LRLSLDA still obtained an AUC of 0.7760 in the leave-one-out cross validation, significantly improving the performance of previous methods. We also illustrated the performance of LRLSLDA is not sensitive (even robust) to the parameters selection and it can obtain a reliable performance in all the test classes. Plenty of potential disease-lncRNA associations were publicly released and some of them have been confirmed by recent results in biological experiments. It is anticipated that LRLSLDA could be an effective and important biological tool for biomedical research. AVAILABILITY The code of LRLSLDA is freely available at http://asdcd.amss.ac.cn/Software/Details/2.
Collapse
Affiliation(s)
- Xing Chen
- National Center for Mathematics and Interdisciplinary Sciences and Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, P.R. China
| | | |
Collapse
|
26
|
Abstract
It is now clear that eukaryotic cells produce many thousands of non-coding RNAs. The least well-studied of these are longer than 200 nt and are known as lncRNAs (long non-coding RNAs). These loci are of particular interest as their biological relevance remains uncertain. Sequencing projects have identified thousands of these loci in a variety of species, from flies to humans. Genome-wide scans for functionality, such as evolutionary and expression analyses, suggest that many of these molecules have functional roles to play in the cell. Nevertheless, only a handful of lncRNAs have been experimentally investigated, and most of these appear to possess roles in regulating gene expression at a variety of different levels. Several lncRNAs have also been implicated in cancer. This evidence suggests that lncRNAs represent a new class of non-coding gene whose importance should become clearer upon further experimental investigation.
Collapse
Affiliation(s)
- Robert S. Young
- MRC Human Genetics Unit, Western General Hospital, Crewe Road, Edinburgh EH4 2XU, U.K
| | - Chris P. Ponting
- MRC Functional Genomics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford OX1 3PT, U.K
| |
Collapse
|
27
|
Ma H, Hao Y, Dong X, Gong Q, Chen J, Zhang J, Tian W. Molecular mechanisms and function prediction of long noncoding RNA. ScientificWorldJournal 2012; 2012:541786. [PMID: 23319885 PMCID: PMC3540756 DOI: 10.1100/2012/541786] [Citation(s) in RCA: 84] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2012] [Accepted: 11/21/2012] [Indexed: 12/25/2022] Open
Abstract
The central dogma of gene expression considers RNA as the carrier of genetic information from DNA to protein. However, it has become more and more clear that RNA plays more important roles than simply being the information carrier. Recently, whole genome transcriptomic analyses have identified large numbers of dynamically expressed long noncoding RNAs (lncRNAs), many of which are involved in a variety of biological functions. Even so, the functions and molecular mechanisms of most lncRNAs still remain elusive. Therefore, it is necessary to develop computational methods to predict the function of lncRNAs in order to accelerate the study of lncRNAs. Here, we review the recent progress in the identification of lncRNAs, the molecular functions and mechanisms of lncRNAs, and the computational methods for predicting the function of lncRNAs.
Collapse
Affiliation(s)
- Handong Ma
- Institute of Biostatistics, School of Life Science, Fudan University, 220 Handan Road, Shanghai 2004333, China
| | | | | | | | | | | | | |
Collapse
|
28
|
Mitra SA, Mitra AP, Triche TJ. A central role for long non-coding RNA in cancer. Front Genet 2012; 3:17. [PMID: 22363342 PMCID: PMC3279698 DOI: 10.3389/fgene.2012.00017] [Citation(s) in RCA: 132] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2011] [Accepted: 01/28/2012] [Indexed: 01/29/2023] Open
Abstract
Long non-coding RNAs (ncRNAs) have been shown to regulate important biological processes that support normal cellular functions. Aberrant regulation of these essential functions can promote tumor development. In this review, we underscore the importance of the regulatory role played by this distinct class of ncRNAs in cancer-associated pathways that govern mechanisms such as cell growth, invasion, and metastasis. We also highlight the possibility of using these unique RNAs as diagnostic and prognostic biomarkers in malignancies.
Collapse
Affiliation(s)
- Sheetal A Mitra
- Department of Pathology and Laboratory Medicine, Children's Hospital Los Angeles Los Angeles, CA, USA
| | | | | |
Collapse
|
29
|
Nuclear export as a key arbiter of "mRNA identity" in eukaryotes. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2012; 1819:566-77. [PMID: 22248619 DOI: 10.1016/j.bbagrm.2011.12.012] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/04/2011] [Revised: 12/23/2011] [Accepted: 12/29/2011] [Indexed: 01/15/2023]
Abstract
Over the past decade, various studies have indicated that most of the eukaryotic genome is transcribed at some level. The pervasiveness of transcription might seem surprising when one considers that only a quarter of the human genome comprises genes (including exons and introns) and less than 2% codes for protein. This conundrum is partially explained by the unique evolutionary pressures that are imposed on species with small population sizes, such as eukaryotes. These conditions promote the expansion of introns and non-functional intergenic DNA, and the accumulation of cryptic transcriptional start sites. As a result, the eukaryotic gene expression machinery must effectively evaluate whether or not a transcript has all the hallmarks of a protein-coding mRNA. If a transcript contains these features, then positive feedback loops are activated to further stimulate its transcription, processing, nuclear export and ultimately, translation. However if a transcript lacks features associated with "mRNA identity", then the RNA is degraded and/or used to inhibit further transcription and translation of the gene. Here we discuss how mRNA identity is assessed by the nuclear export machinery in order to extract meaningful information from the eukaryotic genome. In the process, we provide an explanation of why certain sequences that are enriched in protein-coding genes, such as the signal sequence coding region, promote mRNA nuclear export in vertebrates. This article is part of a Special Issue entitled: Nuclear Transport and RNA Processing.
Collapse
|
30
|
Gibb EA, Enfield KSS, Stewart GL, Lonergan KM, Chari R, Ng RT, Zhang L, MacAulay CE, Rosin MP, Lam WL. Long non-coding RNAs are expressed in oral mucosa and altered in oral premalignant lesions. Oral Oncol 2011; 47:1055-61. [PMID: 21835683 DOI: 10.1016/j.oraloncology.2011.07.008] [Citation(s) in RCA: 66] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2011] [Revised: 06/17/2011] [Accepted: 07/09/2011] [Indexed: 01/01/2023]
Abstract
Oral epithelial dysplasias are believed to progress through a series of histopathological stages; from mild to severe dysplasia, to carcinoma in situ, and finally to invasive OSCC. Underlying this change in histopathological grade are gross chromosome alterations and changes in gene expression of both protein-coding genes and non-coding RNAs. Recent papers have described associations of aberrant expression of microRNAs, one class of non-coding RNAs, with oral cancer. However, expression profiling of long non-coding RNAs (lncRNAs) has not been reported. Long non-coding RNAs are a novel class of mRNA-like transcripts with no protein coding capacity, but with a variety of functions including roles in epigenetics and gene regulation. In recent reports, the aberrant expression of lncRNAs has been associated with human cancers, suggesting a critical role in tumorigenesis. Here, we present the first long non-coding RNA expression map for the human oral mucosa. We describe the expression of 325 long non-coding RNAs, suggesting lncRNA expression contributes significantly to the oral transcriptome. Intriguingly, ∼60% of the detected lncRNAs show aberrant expression in oral premalignant lesions. A number of these lncRNAs have been previously associated with other human cancers.
Collapse
Affiliation(s)
- Ewan A Gibb
- British Columbia Cancer Agency Research Centre, Vancouver, Canada V5Z 1L3.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
31
|
Knowling S, Morris KV. Non-coding RNA and antisense RNA. Nature's trash or treasure? Biochimie 2011; 93:1922-7. [PMID: 21843589 DOI: 10.1016/j.biochi.2011.07.031] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2011] [Accepted: 07/29/2011] [Indexed: 01/13/2023]
Abstract
Although control of cellular function has classically been considered the responsibility of proteins, research over the last decade has elucidated many roles for RNA in regulation of not only the proteins that control cellular functions but also for the cellular functions themselves. In parallel to this advancement in knowledge about the regulatory roles of RNA there has been an explosion of knowledge about the role that epigenetics plays in controlling not only long-term cellular fate but also the short-term regulatory control of genes. Of particular interest is the crossover between these two worlds, a world where RNA can act out its part and subsequently elicit chromatin modifications that alter cellular function. Two main categories of RNA are examined here, non-coding RNA and antisense RNA both of which perform vital functions in controlling numerous genes, proteins and RNA itself. As the activities of non-coding and antisense RNA in both normal and aberrant cellular function are elucidated, so does the number of possible targets for pharmacopeic intervention.
Collapse
Affiliation(s)
- Stuart Knowling
- Department of Molecular and Experimental Medicine, The Scripps Research Institute, 10550 N. Torrey Pines Road, La Jolla, CA 92037, USA.
| | | |
Collapse
|
32
|
Jung S, Swart EC, Minx PJ, Magrini V, Mardis ER, Landweber LF, Eddy SR. Exploiting Oxytricha trifallax nanochromosomes to screen for non-coding RNA genes. Nucleic Acids Res 2011; 39:7529-47. [PMID: 21715380 PMCID: PMC3177221 DOI: 10.1093/nar/gkr501] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
We took advantage of the unusual genomic organization of the ciliate Oxytricha trifallax to screen for eukaryotic non-coding RNA (ncRNA) genes. Ciliates have two types of nuclei: a germ line micronucleus that is usually transcriptionally inactive, and a somatic macronucleus that contains a reduced, fragmented and rearranged genome that expresses all genes required for growth and asexual reproduction. In some ciliates including Oxytricha, the macronuclear genome is particularly extreme, consisting of thousands of tiny 'nanochromosomes', each of which usually contains only a single gene. Because the organism itself identifies and isolates most of its genes on single-gene nanochromosomes, nanochromosome structure could facilitate the discovery of unusual genes or gene classes, such as ncRNA genes. Using a draft Oxytricha genome assembly and a custom-written protein-coding genefinding program, we identified a subset of nanochromosomes that lack any detectable protein-coding gene, thereby strongly enriching for nanochromosomes that carry ncRNA genes. We found only a small proportion of non-coding nanochromosomes, suggesting that Oxytricha has few independent ncRNA genes besides homologs of already known RNAs. Other than new members of known ncRNA classes including C/D and H/ACA snoRNAs, our screen identified one new family of small RNA genes, named the Arisong RNAs, which share some of the features of small nuclear RNAs.
Collapse
Affiliation(s)
- Seolkyoung Jung
- Janelia Farm Research Campus, Howard Hughes Medical Institute, Ashburn VA 20147, USA
| | | | | | | | | | | | | |
Collapse
|
33
|
Morris KV, Vogt PK. Long antisense non-coding RNAs and their role in transcription and oncogenesis. Cell Cycle 2011; 9:2544-7. [PMID: 20581457 DOI: 10.4161/cc.9.13.12145] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
Long non-coding RNAs are estimated to qualitatively represent ~98% of expressed transcripts in human cells, a large proportion of which is antisense to protein-coding and non-coding transcripts. Here we review evidence from several experimental systems that suggests long antisense non-coding RNAs are involved in the transcriptional regulation of gene expression by altering epigenetic states at both adjacent and distal loci. We also review the initial evidence for a role of endogenous long antisense non-coding RNAs in oncogenic cellular transformation.
Collapse
Affiliation(s)
- Kevin V Morris
- Department of Molecular and Experimental Medicine, The Scripps Research Institute, La Jolla, CA, USA.
| | | |
Collapse
|
34
|
Askarian-Amiri ME, Crawford J, French JD, Smart CE, Smith MA, Clark MB, Ru K, Mercer TR, Thompson ER, Lakhani SR, Vargas AC, Campbell IG, Brown MA, Dinger ME, Mattick JS. SNORD-host RNA Zfas1 is a regulator of mammary development and a potential marker for breast cancer. RNA (NEW YORK, N.Y.) 2011; 17:878-891. [PMID: 21460236 PMCID: PMC3078737 DOI: 10.1261/rna.2528811] [Citation(s) in RCA: 284] [Impact Index Per Article: 21.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/07/2010] [Accepted: 02/15/2011] [Indexed: 05/30/2023]
Abstract
Long noncoding RNAs (lncRNAs) are increasingly recognized to play major regulatory roles in development and disease. To identify novel regulators in breast biology, we identified differentially regulated lncRNAs during mouse mammary development. Among the highest and most differentially expressed was a transcript (Zfas1) antisense to the 5' end of the protein-coding gene Znfx1. In vivo, Zfas1 RNA is localized within the ducts and alveoli of the mammary gland. Zfas1 intronically hosts three previously undescribed C/D box snoRNAs (SNORDs): Snord12, Snord12b, and Snord12c. In contrast to the general assumption that noncoding SNORD-host transcripts function only as vehicles to generate snoRNAs, knockdown of Zfas1 in a mammary epithelial cell line resulted in increased cellular proliferation and differentiation, while not substantially altering the levels of the SNORDs. In support of an independent function, we also found that Zfas1 is extremely stable, with a half-life >16 h. Expression analysis of the SNORDs revealed these were expressed at different levels, likely a result of distinct structures conferring differential stability. While there is relatively low primary sequence conservation between Zfas1 and its syntenic human ortholog ZFAS1, their predicted secondary structures have similar features. Like Zfas1, ZFAS1 is highly expressed in the mammary gland and is down-regulated in breast tumors compared to normal tissue. We propose a functional role for Zfas1/ ZFAS1 in the regulation of alveolar development and epithelial cell differentiation in the mammary gland, which, together with its dysregulation in human breast cancer, suggests ZFAS1 as a putative tumor suppressor gene.
Collapse
|
35
|
Spitale RC, Tsai MC, Chang HY. RNA templating the epigenome: long noncoding RNAs as molecular scaffolds. Epigenetics 2011; 6:539-43. [PMID: 21393997 DOI: 10.4161/epi.6.5.15221] [Citation(s) in RCA: 152] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Cellular pathways must be synergized, controlled and organized to manage homeostasis. To achieve high selectivity within the crowded cellular milieu the cell utilizes scaffolding complexes whose role is to bring molecules in proximity thereby controlling and enhancing intermolecular interactions and signaling events. To date, scaffolds have been shown to be composed of proteinaceous units; however, recent evidence has supported the idea that non-coding RNAs may also play a similar role. In this point of view article we discuss recent data on ncRNA scaffolds, with particular focus on ncRNA HOTAIR. Using our current knowledge of signaling networks we discuss the role that RNA may play in writing and regulating histone modifications and the information needed for correct gene expression. Further, we speculate on additional, yet undiscovered roles that ncRNAs may be playing as molecular scaffolds.
Collapse
Affiliation(s)
- Robert C Spitale
- Howard Hughes Medical Institute and Program in Epithelial Biology, Stanford University School of Medicine, Stanford, CA, USA
| | | | | |
Collapse
|
36
|
Gibb EA, Brown CJ, Lam WL. The functional role of long non-coding RNA in human carcinomas. Mol Cancer 2011; 10:38. [PMID: 21489289 PMCID: PMC3098824 DOI: 10.1186/1476-4598-10-38] [Citation(s) in RCA: 1314] [Impact Index Per Article: 101.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2011] [Accepted: 04/13/2011] [Indexed: 12/15/2022] Open
Abstract
Long non-coding RNAs (lncRNAs) are emerging as new players in the cancer paradigm demonstrating potential roles in both oncogenic and tumor suppressive pathways. These novel genes are frequently aberrantly expressed in a variety of human cancers, however the biological functions of the vast majority remain unknown. Recently, evidence has begun to accumulate describing the molecular mechanisms by which these RNA species function, providing insight into the functional roles they may play in tumorigenesis. In this review, we highlight the emerging functional role of lncRNAs in human cancer.
Collapse
Affiliation(s)
- Ewan A Gibb
- British Columbia Cancer Agency Research Centre, Vancouver, Canada.
| | | | | |
Collapse
|
37
|
Chodroff RA, Goodstadt L, Sirey TM, Oliver PL, Davies KE, Green ED, Molnár Z, Ponting CP. Long noncoding RNA genes: conservation of sequence and brain expression among diverse amniotes. Genome Biol 2010; 11:R72. [PMID: 20624288 PMCID: PMC2926783 DOI: 10.1186/gb-2010-11-7-r72] [Citation(s) in RCA: 194] [Impact Index Per Article: 13.9] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2010] [Revised: 05/17/2010] [Accepted: 07/12/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Long considered to be the building block of life, it is now apparent that protein is only one of many functional products generated by the eukaryotic genome. Indeed, more of the human genome is transcribed into noncoding sequence than into protein-coding sequence. Nevertheless, whilst we have developed a deep understanding of the relationships between evolutionary constraint and function for protein-coding sequence, little is known about these relationships for non-coding transcribed sequence. This dearth of information is partially attributable to a lack of established non-protein-coding RNA (ncRNA) orthologs among birds and mammals within sequence and expression databases. RESULTS Here, we performed a multi-disciplinary study of four highly conserved and brain-expressed transcripts selected from a list of mouse long intergenic noncoding RNA (lncRNA) loci that generally show pronounced evolutionary constraint within their putative promoter regions and across exon-intron boundaries. We identify some of the first lncRNA orthologs present in birds (chicken), marsupial (opossum), and eutherian mammals (mouse), and investigate whether they exhibit conservation of brain expression. In contrast to conventional protein-coding genes, the sequences, transcriptional start sites, exon structures, and lengths for these non-coding genes are all highly variable. CONCLUSIONS The biological relevance of lncRNAs would be highly questionable if they were limited to closely related phyla. Instead, their preservation across diverse amniotes, their apparent conservation in exon structure, and similarities in their pattern of brain expression during embryonic and early postnatal stages together indicate that these are functional RNA molecules, of which some have roles in vertebrate brain development.
Collapse
Affiliation(s)
- Rebecca A Chodroff
- Department of Physiology, Anatomy, and Genetics, Le Gros Clark Building South Parks Road, University of Oxford, Oxford OX1 3QX, UK
| | | | | | | | | | | | | | | |
Collapse
|
38
|
Sathira N, Yamashita R, Tanimoto K, Kanai A, Arauchi T, Kanematsu S, Nakai K, Suzuki Y, Sugano S. Characterization of transcription start sites of putative non-coding RNAs by multifaceted use of massively paralleled sequencer. DNA Res 2010; 17:169-83. [PMID: 20400770 PMCID: PMC2885271 DOI: 10.1093/dnares/dsq007] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Abstract
On the basis of integrated transcriptome analysis, we show that not all transcriptional start site clusters (TSCs) in the intergenic regions (iTSCs) have the same properties; thus, it is possible to discriminate the iTSCs that are likely to have biological relevance from the other noise-level iTSCs. We used a total of 251 933 381 short-read sequence tags generated from various types of transcriptome analyses in order to characterize 6039 iTSCs, which have significant expression levels. We analyzed and found that 23% of these iTSCs were located in the proximal regions of the RefSeq genes. These RefSeq-linked iTSCs showed similar expression patterns with the neighboring RefSeq genes, had widely fluctuating transcription start sites and lacked ordered nucleosome positioning. These iTSCs seemed not to form independent transcriptional units, simply representing the by-products of the neighboring RefSeq genes, in spite of their significant expression levels. Similar features were also observed for the TSCs located in the antisense regions of the RefSeq genes. Furthermore, for the remaining iTSCs that were not associated with any RefSeq genes, we demonstrate that integrative interpretation of the transcriptome data provides essential information to specify their biological functions in the hypoxic responses of the cells.
Collapse
Affiliation(s)
- Nuankanya Sathira
- Department of Medical Genome Sciences, Graduate School of Frontier Sciences, The University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa-shi, Chiba 277-8568, Japan
| | | | | | | | | | | | | | | | | |
Collapse
|
39
|
Sharov AA, Piao Y, Ko MSH. Gene expression profiling of mouse embryos with microarrays. Methods Enzymol 2010; 477:511-41. [PMID: 20699157 DOI: 10.1016/s0076-6879(10)77025-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Global expression profiling by DNA microarrays provides a snapshot of cell and tissue status and becomes an essential tool in biological and medical sciences. Typical questions that can be addressed by microarray analysis in developmental biology include: (1) to find a set of genes expressed in a specific cell type; (2) to identify genes expressed commonly in multiple cell types; (3) to follow the time-course changes of gene expression patterns; (4) to demonstrate cell's identity by showing similarities or differences among two or multiple cell types; (5) to find regulatory pathways and/or networks affected by gene manipulations, such as overexpression or repression of gene expression; (6) to find downstream target genes of transcription factors; (7) to find downstream target genes of cell signaling; (8) to examine the effects of environmental manipulation of cells on gene expression patterns; and (9) to find the effects of genetic manipulation in embryos and adults. Here, we describe strategies for executing these experiments and monitoring changes of cell state with gene expression microarrays in application to mouse embryology. Both statistical assessment and interpretation of data are discussed. We also present a protocol for performing microarray analysis on a small amount of embryonic materials.
Collapse
Affiliation(s)
- Alexei A Sharov
- Developmental Genomics and Aging Section, Laboratory of Genetics, National Institute on Aging, NIH, Baltimore, Maryland, USA
| | | | | |
Collapse
|
40
|
van Bakel H, Hughes TR. Establishing legitimacy and function in the new transcriptome. BRIEFINGS IN FUNCTIONAL GENOMICS AND PROTEOMICS 2009; 8:424-36. [PMID: 19833698 DOI: 10.1093/bfgp/elp037] [Citation(s) in RCA: 60] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
The last decade has seen an explosion of interest in new classes of non-coding RNA. While some are now firmly established as new categories of legitimate functional RNAs, the purpose and even existence of others remain to be solidified. Here, we discuss the challenges associated with discovery and characterization of non-traditional categories of non-coding RNA.
Collapse
Affiliation(s)
- Harm van Bakel
- Banting and Best Department of Medical Research, University of Toronto, Toronto, ON M5S 3E1, Canada
| | | |
Collapse
|
41
|
Dinger ME, Amaral PP, Mercer TR, Mattick JS. Pervasive transcription of the eukaryotic genome: functional indices and conceptual implications. BRIEFINGS IN FUNCTIONAL GENOMICS AND PROTEOMICS 2009; 8:407-23. [PMID: 19770204 DOI: 10.1093/bfgp/elp038] [Citation(s) in RCA: 122] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Genome-wide analyses of the eukaryotic transcriptome have revealed that the majority of the genome is transcribed, producing large numbers of non-protein-coding RNAs (ncRNAs). This surprising observation challenges many assumptions about the genetic programming of higher organisms and how information is stored and organized within the genome. Moreover, the rapid advances in genomics have given little opportunity for biologists to integrate these emerging findings into their intellectual and experimental frameworks. This problem has been compounded by the perception that genome-wide studies often generate more questions than answers, which in turn has led to confusion and controversy. In this article, we address common questions associated with the phenomenon of pervasive transcription and consider the indices that can be used to evaluate the function (or lack thereof) of the resulting ncRNAs. We suggest that many lines of evidence, including expression profiles, conservation signatures, chromatin modification patterns and examination of increasing numbers of individual cases, argue in favour of the widespread functionality of non-coding transcription. We also discuss how informatic and experimental approaches used to analyse protein-coding genes may not be applicable to ncRNAs and how the general perception that protein-coding genes form the main informational output of the genome has resulted in much of the misunderstanding surrounding pervasive transcription and its potential significance. Finally, we present the conceptual implications of the majority of the eukaryotic genome being functional and describe how appreciating this perspective will provide considerable opportunity to further understand the molecular basis of development and complex diseases.
Collapse
Affiliation(s)
- Marcel E Dinger
- Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD 4072, Australia
| | | | | | | |
Collapse
|
42
|
Rapicavoli NA, Blackshaw S. New meaning in the message: Noncoding RNAs and their role in retinal development. Dev Dyn 2009; 238:2103-14. [DOI: 10.1002/dvdy.21844] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
|
43
|
Abstract
Over the last few years it has become increasingly apparent that RNA is involved in various forms of gene regulation. While much emphasis has been placed on the role of small non-coding RNAs in post-transcriptional modes of gene regulation it has become apparent that a far more complex scenario exists. Recent observations insinuate a paradigm whereby non-coding RNAs are operative effector molecules in the transcriptional regulation of endogenous gene expression. These observations support a route for how epigenetic gene silencing is directed, maintained and passed on as epigenetic memory in human cells. This perspective will highlight the endogenous effector RNAs and mechanism of action whereby non-coding RNAs transcriptionally regulate gene expression in human cells and discuss these recent observations in the context of human evolution.
Collapse
Affiliation(s)
- Kevin V Morris
- Department of Molecular and Experimental Medicine, The Scripps Research Institute, La Jolla, CA 92037, USA.
| |
Collapse
|
44
|
Mello BP, Abrantes EF, Torres CH, Machado-Lima A, Fonseca RDS, Carraro DM, Brentani RR, Reis LFL, Brentani H. No-match ORESTES explored as tumor markers. Nucleic Acids Res 2009; 37:2607-17. [PMID: 19270067 PMCID: PMC2677862 DOI: 10.1093/nar/gkp074] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
Sequencing technologies and new bioinformatics tools have led to the complete sequencing of various genomes. However, information regarding the human transcriptome and its annotation is yet to be completed. The Human Cancer Genome Project, using ORESTES (open reading frame EST sequences) methodology, contributed to this objective by generating data from about 1.2 million expressed sequence tags. Approximately 30% of these sequences did not align to ESTs in the public databases and were considered no-match ORESTES. On the basis that a set of these ESTs could represent new transcripts, we constructed a cDNA microarray. This platform was used to hybridize against 12 different normal or tumor tissues. We identified 3421 transcribed regions not associated with annotated transcripts, representing 83.3% of the platform. The total number of differentially expressed sequences was 1007. Also, 28% of analyzed sequences could represent noncoding RNAs. Our data reinforces the knowledge of the human genome being pervasively transcribed, and point out molecular marker candidates for different cancers. To reinforce our data, we confirmed, by real-time PCR, the differential expression of three out of eight potentially tumor markers in prostate tissues. Lists of 1007 differentially expressed sequences, and the 291 potentially noncoding tumor markers were provided.
Collapse
Affiliation(s)
- Barbara P Mello
- Hospital A. C. Camargo, Rua Prof. Antônio Prudente 211, São Paulo, SP, Brazil
| | | | | | | | | | | | | | | | | |
Collapse
|
45
|
Guimarães AO, Motta FL, Alves VS, Castilho BA, Pesquero JB. Multiple RNAs from the mouse carboxypeptidase M locus: functional RNAs or transcription noise? BMC Mol Biol 2009; 10:7. [PMID: 19200403 PMCID: PMC2644694 DOI: 10.1186/1471-2199-10-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2008] [Accepted: 02/08/2009] [Indexed: 12/02/2022] Open
Abstract
BACKGROUND A major effort of the scientific community has been to obtain complete pictures of the genomes of many organisms. This has been accomplished mainly by annotation of structural and functional elements in the genome sequence, a process that has been centred in the gene concept and, as a consequence, biased toward protein coding sequences. Recently, the explosion of transcriptome data generated and the discovery of many functional non-protein coding RNAs have painted a more detailed and complex scenario for the genome. Here we analyzed the mouse carboxypeptidase M locus in this broader perspective in order to define the mouse CPM gene structure and evaluate the existence of other transcripts from the same genomic region. RESULTS Bioinformatic analysis of nucleotide sequences that map to the mouse CPM locus suggests that, in addition to the mouse CPM mRNA, it expresses at least 33 different transcripts, many of which seem to be non-coding RNAs. We randomly chose to evaluate experimentally four of these extra transcripts. They are expressed in a tissue specific manner, indicating that they are not artefacts or transcriptional noise. Furthermore, one of these four extra transcripts shows expression patterns that differed considerably from the other ones and from the mouse CPM gene, suggesting that there may be more than one transcriptional unit in this locus. In addition, we have confirmed the mouse CPM gene RefSeq sequence by rapid amplification of cDNA ends (RACE) and directional cloning. CONCLUSION This study supports the recent view that the majority of the genome is transcribed and that many of the resulting transcripts seem to be non-coding RNAs from introns of genes or from independent transcriptional units. Although some of the information on the transcriptome of many organisms may actually be artefacts or transcriptional noise, we argue that it can be experimentally evaluated and used to find and define biological functional elements on the genome. Furthermore, the transcription of other functional RNAs besides the protein coding RNA from a specific genomic locus imposes extra care when designing and interpreting experiments involving genetic manipulations or expression detection and quantification.
Collapse
Affiliation(s)
| | - Fabiana L Motta
- Departamento de Biofísica, Universidade Federal de São Paulo, São Paulo, Brazil
| | - Viviane S Alves
- Departamento de Microbiologia e Imunologia, Universidade Federal de São Paulo, São Paulo, Brazil
| | - Beatriz A Castilho
- Departamento de Microbiologia e Imunologia, Universidade Federal de São Paulo, São Paulo, Brazil
| | - João B Pesquero
- Departamento de Biofísica, Universidade Federal de São Paulo, São Paulo, Brazil
| |
Collapse
|
46
|
A myelopoiesis-associated regulatory intergenic noncoding RNA transcript within the human HOXA cluster. Blood 2009; 113:2526-34. [PMID: 19144990 DOI: 10.1182/blood-2008-06-162164] [Citation(s) in RCA: 283] [Impact Index Per Article: 18.9] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
We have identified an intergenic transcriptional activity that is located between the human HOXA1 and HOXA2 genes, shows myeloid-specific expression, and is up-regulated during granulocytic differentiation. The novel gene, termed HOTAIRM1 (HOX antisense intergenic RNA myeloid 1), is transcribed antisense to the HOXA genes and originates from the same CpG island that embeds the start site of HOXA1. The transcript appears to be a noncoding RNA containing no long open-reading frame; sucrose gradient analysis shows no association with polyribosomal fractions. HOTAIRM1 is the most prominent intergenic transcript expressed and up-regulated during induced granulocytic differentiation of NB4 promyelocytic leukemia and normal human hematopoietic cells; its expression is specific to the myeloid lineage. Its induction during retinoic acid (RA)-driven granulocytic differentiation is through RA receptor and may depend on the expression of myeloid cell development factors targeted by RA signaling. Knockdown of HOTAIRM1 quantitatively blunted RA-induced expression of HOXA1 and HOXA4 during the myeloid differentiation of NB4 cells, and selectively attenuated induction of transcripts for the myeloid differentiation genes CD11b and CD18, but did not noticeably impact the more distal HOXA genes. These findings suggest that HOTAIRM1 plays a role in the myelopoiesis through modulation of gene expression in the HOXA cluster.
Collapse
|
47
|
Kavanaugh LA, Dietrich FS. Non-coding RNA prediction and verification in Saccharomyces cerevisiae. PLoS Genet 2009; 5:e1000321. [PMID: 19119416 PMCID: PMC2603021 DOI: 10.1371/journal.pgen.1000321] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2008] [Accepted: 12/01/2008] [Indexed: 11/18/2022] Open
Abstract
Non-coding RNA (ncRNA) play an important and varied role in cellular function. A significant amount of research has been devoted to computational prediction of these genes from genomic sequence, but the ability to do so has remained elusive due to a lack of apparent genomic features. In this work, thermodynamic stability of ncRNA structural elements, as summarized in a Z-score, is used to predict ncRNA in the yeast Saccharomyces cerevisiae. This analysis was coupled with comparative genomics to search for ncRNA genes on chromosome six of S. cerevisiae and S. bayanus. Sets of positive and negative control genes were evaluated to determine the efficacy of thermodynamic stability for discriminating ncRNA from background sequence. The effect of window sizes and step sizes on the sensitivity of ncRNA identification was also explored. Non-coding RNA gene candidates, common to both S. cerevisiae and S. bayanus, were verified using northern blot analysis, rapid amplification of cDNA ends (RACE), and publicly available cDNA library data. Four ncRNA transcripts are well supported by experimental data (RUF10, RUF11, RUF12, RUF13), while one additional putative ncRNA transcript is well supported but the data are not entirely conclusive. Six candidates appear to be structural elements in 5′ or 3′ untranslated regions of annotated protein-coding genes. This work shows that thermodynamic stability, coupled with comparative genomics, can be used to predict ncRNA with significant structural elements. Recent advances in DNA sequence technology have made it possible to sequence entire genomes. Once a genome is sequenced, it becomes necessary to identify the set of genes and other functional elements within the genome. This is particularly challenging as much of the genomic sequence does not appear to perform any function and is loosely referred to as “junk.” Identifying functional elements among the “junk” is difficult. Experimental methods have been developed for this purpose but they are time-consuming, expensive, and often provide an incomplete picture. Thus, it is important to develop the ability to identify these functional elements using computational methods. Protein-coding genes are relatively easy to identify computationally, but other categories of functional elements present a significantly greater challenge. In this work, we used a computational approach to identify genes that do not encode for a protein but rather function as an RNA molecule. We then used experimental methods to verify our predictions and thereby validate the computational method.
Collapse
Affiliation(s)
- Laura A. Kavanaugh
- Department of Molecular Genetics and Microbiology, Institute for Genome Sciences and Policy, Duke University Medical Center, Durham, North Carolina, United States of America
| | - Fred S. Dietrich
- Department of Molecular Genetics and Microbiology, Institute for Genome Sciences and Policy, Duke University Medical Center, Durham, North Carolina, United States of America
- * E-mail:
| |
Collapse
|
48
|
|
49
|
Chen L, Cai L, Skogerbø G, Zhao Y, Chen R. Assessing TF regulatory relationships of divergently transcribed genes. Genomics 2008; 92:316-21. [DOI: 10.1016/j.ygeno.2008.07.007] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2007] [Revised: 07/23/2008] [Accepted: 07/24/2008] [Indexed: 10/21/2022]
|
50
|
Li L, Zhu Q, He X, Sinha S, Halfon MS. Large-scale analysis of transcriptional cis-regulatory modules reveals both common features and distinct subclasses. Genome Biol 2008; 8:R101. [PMID: 17550599 PMCID: PMC2394749 DOI: 10.1186/gb-2007-8-6-r101] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2007] [Revised: 05/23/2007] [Accepted: 06/05/2007] [Indexed: 02/01/2023] Open
Abstract
Analysis of 280 experimentally-verified cis-regulatory modules from Drosophila reveal features both common to all and unique to distinct subclasses of modules. Background Transcriptional cis-regulatory modules (for example, enhancers) play a critical role in regulating gene expression. While many individual regulatory elements have been characterized, they have never been analyzed as a class. Results We have performed the first such large-scale study of cis-regulatory modules in order to determine whether they have common properties that might aid in their identification and contribute to our understanding of the mechanisms by which they function. A total of 280 individual, experimentally verified cis-regulatory modules from Drosophila were analyzed for a range of sequence-level and functional properties. We report here that regulatory modules do indeed share common properties, among them an elevated GC content, an increased level of interspecific sequence conservation, and a tendency to be transcribed into RNA. However, we find that dense clustering of transcription factor binding sites, especially homotypic clustering, which is commonly believed to be a general characteristic of regulatory modules, is rather a feature that belongs chiefly to a specific subclass. This has important implications for current computational approaches, many of which are biased toward this subset. We explore two new strategies to assess binding site clustering and gauge their performances with respect to their ability to detect all 280 modules and various functionally coherent subsets. Conclusion Our findings demonstrate that cis-regulatory modules share common features that help to define them as a class and that may lead to new insights into mechanisms of gene regulation. However, these properties alone may not be sufficient to reliably distinguish regulatory from non-regulatory sequences. We also demonstrate that there are distinct subclasses of cis-regulatory modules that are more amenable to in silico detection than others and that these differences must be taken into account when attempting genome-wide regulatory element discovery.
Collapse
Affiliation(s)
- Long Li
- Department of Biochemistry, State University of New York at Buffalo, Buffalo, NY 14214, USA
| | - Qianqian Zhu
- Department of Biochemistry, State University of New York at Buffalo, Buffalo, NY 14214, USA
| | - Xin He
- Department of Computer Science, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA
| | - Saurabh Sinha
- Department of Computer Science, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA
| | - Marc S Halfon
- Department of Biochemistry, State University of New York at Buffalo, Buffalo, NY 14214, USA
- Department of Biological Sciences, State University of New York at Buffalo, Buffalo, NY 14214, USA
- New York State Center of Excellence in Bioinformatics and the Life Sciences, Buffalo, NY 14203, USA
- Department of Molecular and Cellular Biology, Roswell Park Cancer Institute, Buffalo, NY 14263, USA
| |
Collapse
|