1. The long non-coding RNA landscape of Candida yeast pathogens. Nat Commun 2021; 12:7317. [PMID: 34916523 PMCID: PMC8677757 DOI: 10.1038/s41467-021-27635-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 05/04/2021] [Accepted: 11/30/2021] [Indexed: 12/29/2022]
Abstract
Long non-coding RNAs (lncRNAs) constitute a poorly studied class of transcripts with emerging roles in key cellular processes. Despite efforts to characterize lncRNAs across a wide range of species, these molecules remain largely unexplored in most eukaryotic microbes, including yeast pathogens of the Candida clade. Here, we analyze thousands of publicly available sequencing datasets to infer and characterize the lncRNA repertoires of five major Candida pathogens: Candida albicans, Candida tropicalis, Candida parapsilosis, Candida auris and Candida glabrata. Our results indicate that genomes of these species encode hundreds of lncRNAs that show levels of evolutionary constraint intermediate between those of intergenic genomic regions and protein-coding genes. Despite their low sequence conservation across the studied species, some lncRNAs are syntenic and are enriched in shared sequence motifs. We find co-expression of lncRNAs with certain protein-coding transcripts, hinting at potential functional associations. Finally, we identify lncRNAs that are differentially expressed during infection of human epithelial cells for four of the studied species. Our comprehensive bioinformatic analyses of Candida lncRNAs pave the way for future functional characterization of these transcripts. Long non-coding RNAs (lncRNAs) play roles in key cellular processes, but remain largely unexplored in fungal pathogens such as Candida. Here, Hovhannisyan and Gabaldón analyze thousands of sequencing datasets to infer and characterize the lncRNA repertoires of five Candida species, paving the way for their future functional characterization.
2. Long Non-Coding RNAs Associated with Ribosomes in Human Adipose-Derived Stem Cells: From RNAs to Microproteins. Biomolecules 2021; 11:biom11111673. [PMID: 34827671 PMCID: PMC8615451 DOI: 10.3390/biom11111673] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 09/06/2021] [Revised: 10/15/2021] [Accepted: 10/25/2021] [Indexed: 12/12/2022]
Abstract
Ribosome profiling reveals the translational dynamics of mRNAs by capturing a ribosomal footprint snapshot. Growing evidence shows that several long non-coding RNAs (lncRNAs) contain small open reading frames (smORFs) that are translated into functional peptides. The difficulty in identifying bona fide translated smORFs is a constant challenge in experimental and bioinformatics fields due to their unconventional characteristics. This motivated us to isolate human adipose-derived stem cells (hASC) from adipose tissue and perform ribosome profiling followed by bioinformatics analysis of the transcriptome, translatome, and ribosome-protected fragments of lncRNAs. Here, we demonstrated that 222 lncRNAs were associated with the translational machinery in hASC, including lncRNAs already shown to encode microproteins. The ribosomal occupancy of some transcripts was consistent with the translation of smORFs. In conclusion, we were able to identify a subset of 15 lncRNAs containing 35 smORFs that likely encode functional microproteins, including four previously demonstrated smORF-derived microproteins, suggesting a possible dual role of these lncRNAs in hASC self-renewal.
3. A Novel Regulatory Player in the Innate Immune System: Long Non-Coding RNAs. Int J Mol Sci 2021; 22:ijms22179535. [PMID: 34502451 PMCID: PMC8430513 DOI: 10.3390/ijms22179535] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 08/01/2021] [Revised: 08/30/2021] [Accepted: 08/31/2021] [Indexed: 12/12/2022]
Abstract
Long non-coding RNAs (lncRNAs) represent crucial transcriptional and post-transcriptional gene regulators during antimicrobial responses in the host innate immune system. Studies have shown that lncRNAs are expressed in a highly tissue- and cell-specific manner and are involved in the differentiation and function of innate immune cells, as well as inflammatory and antiviral processes, through versatile molecular mechanisms. These lncRNAs function through interactions with DNA, RNA, or protein, in either cis or trans, relying on their specific sequences or on their transcription and processing. The dysregulation of lncRNA function is associated with various human non-infectious diseases, such as inflammatory bowel disease, cardiovascular diseases, and diabetes mellitus. Here, we provide an overview of the regulation and mechanisms of lncRNA function in the development and differentiation of innate immune cells, and during the activation or repression of innate immune responses. These elucidations might be beneficial for the development of therapeutic strategies targeting inflammatory and innate immune-mediated diseases.
4. Subramaniam N, Nair R, Marsden PA. Epigenetic Regulation of the Vascular Endothelium by Angiogenic LncRNAs. Front Genet 2021; 12:668313. [PMID: 34512715 PMCID: PMC8427604 DOI: 10.3389/fgene.2021.668313] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 02/16/2021] [Accepted: 05/17/2021] [Indexed: 12/15/2022]
Abstract
The functional properties of the vascular endothelium are diverse and heterogeneous between vascular beds. This is especially evident when new blood vessels develop from a pre-existing closed cardiovascular system, a process termed angiogenesis. Endothelial cells are key drivers of angiogenesis as they undergo a highly choreographed cascade of events that has both exogenous (e.g., hypoxia and VEGF) and endogenous regulatory inputs. Not surprisingly, angiogenesis is critical in health and disease. Diverse therapeutics target proteins involved in coordinating angiogenesis with varying degrees of efficacy. It is of great interest that recent work on non-coding RNAs, especially long non-coding RNAs (lncRNAs), indicates that they are also important regulators of the gene expression paradigms that underpin this cellular cascade. The protean effects of lncRNAs are dependent, in part, on their subcellular localization. For instance, lncRNAs enriched in the nucleus can act as epigenetic modifiers of gene expression in the vascular endothelium. Of great interest to genetic disease, they are undergoing rapid evolution and show extensive inter- and intra-species heterogeneity. In this review, we describe endothelial-enriched lncRNAs that have robust effects in angiogenesis.
Affiliation(s)
- Noeline Subramaniam
  - Marsden Lab, Institute of Medical Sciences, University of Toronto, Toronto, ON, Canada
  - Marsden Lab, Keenan Research Centre in the Li Ka Shing Knowledge Institute, St. Michael’s Hospital, Toronto, ON, Canada
- Ranju Nair
  - Marsden Lab, Keenan Research Centre in the Li Ka Shing Knowledge Institute, St. Michael’s Hospital, Toronto, ON, Canada
  - Marsden Lab, Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, ON, Canada
- Philip A. Marsden
  - Marsden Lab, Institute of Medical Sciences, University of Toronto, Toronto, ON, Canada
  - Marsden Lab, Keenan Research Centre in the Li Ka Shing Knowledge Institute, St. Michael’s Hospital, Toronto, ON, Canada
  - Marsden Lab, Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, ON, Canada
  - Department of Medicine, University of Toronto, Toronto, ON, Canada
5. Comparative genomics in the search for conserved long noncoding RNAs. Essays Biochem 2021; 65:741-749. [PMID: 33885137 PMCID: PMC8564735 DOI: 10.1042/ebc20200069] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 12/18/2020] [Revised: 02/15/2021] [Accepted: 03/15/2021] [Indexed: 12/23/2022]
Abstract
Long noncoding RNAs (lncRNAs) have emerged as prominent regulators of gene expression in eukaryotes. The identification of lncRNA orthologs is essential in efforts to decipher their roles across model organisms, as homologous genes tend to have similar molecular and biological functions. The relatively high sequence plasticity of lncRNA genes compared with protein-coding genes makes the identification of their orthologs a challenging task. This is why comparative genomics of lncRNAs requires the development of specific and, sometimes, complex approaches. Here, we briefly review current advancements and challenges associated with four levels of lncRNA conservation: genomic sequences, splicing signals, secondary structures and syntenic transcription.
6. Specificity of RNA Folding and Its Association with Evolutionarily Adaptive mRNA Secondary Structures. Genomics Proteomics & Bioinformatics 2021; 19:882-900. [PMID: 33607297 PMCID: PMC9403030 DOI: 10.1016/j.gpb.2019.11.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/17/2018] [Revised: 08/03/2019] [Accepted: 11/08/2019] [Indexed: 11/23/2022]
Abstract
The secondary structure is a fundamental feature of both noncoding and messenger RNAs. However, our understanding of the secondary structure of mRNA, especially that of the coding regions, remains elusive, likely due to translation and the lack of RNA-binding proteins that sustain the consensus structure, such as those that bind to noncoding RNA. Indeed, mRNA has recently been found to adopt diverse alternative structures, the overall functional significance of which remains untested. We hereby approached this problem by estimating the folding specificity, i.e., the probability that a fragment of RNA folds back to the same partner once refolded. We showed that the folding specificity of mRNA is lower than that of noncoding RNA and exhibits moderate evolutionary conservation. Notably, we found that specific rather than alternative folding is likely evolutionarily adaptive since specific folding is frequently associated with functionally important genes or sites within a gene. Additional analysis in combination with ribosome density suggests the ability to modulate ribosome movement as one potential functional advantage provided by specific folding. Our findings revealed a novel facet of the RNA structurome with important functional and evolutionary implications and indicated a potential method for distinguishing the mRNA secondary structures maintained by natural selection from molecular noise.
7. De Novo Profiling of Long Non-Coding RNAs Involved in MC-LR-Induced Liver Injury in Whitefish: Discovery and Perspectives. Int J Mol Sci 2021; 22:ijms22020941. [PMID: 33477898 PMCID: PMC7833382 DOI: 10.3390/ijms22020941] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 10/30/2020] [Revised: 01/08/2021] [Accepted: 01/15/2021] [Indexed: 12/18/2022]
Abstract
Microcystin-LR (MC-LR) is a potent hepatotoxin for which a substantial gap in knowledge persists regarding the underlying molecular mechanisms of liver toxicity and injury. Although long non-coding RNAs (lncRNAs) have been extensively studied in model organisms, our knowledge concerning the role of lncRNAs in liver injury is limited. Given that lncRNAs show low levels of sequence conservation, their role becomes even more unclear in non-model organisms without an annotated genome, like whitefish (Coregonus lavaretus). The objective of this study was to discover and profile aberrantly expressed polyadenylated lncRNAs that are involved in MC-LR-induced liver injury in whitefish. Using RNA sequencing (RNA-Seq) data, we de novo assembled a high-quality whitefish liver transcriptome. This enabled us to find 94 differentially expressed (DE) putative evolutionary conserved lncRNAs, such as MALAT1, HOTTIP, HOTAIR or HULC, and 4429 DE putative novel whitefish lncRNAs, which differed from annotated protein-coding transcripts (PCTs) in terms of minimum free energy, guanine-cytosine (GC) base-pair content and length. Additionally, we identified DE non-coding transcripts that might be 3′ autonomous untranslated regions (3′UTRs) of mRNAs. We found both evolutionary conserved lncRNAs as well as novel whitefish lncRNAs that could serve as biomarkers of liver injury.
8. Jones AN, Pisignano G, Pavelitz T, White J, Kinisu M, Forino N, Albin D, Varani G. An evolutionarily conserved RNA structure in the functional core of the lincRNA Cyrano. RNA (New York, N.Y.) 2020; 26:1234-1246. [PMID: 32457084 PMCID: PMC7430676 DOI: 10.1261/rna.076117.120] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Received: 04/27/2020] [Accepted: 05/18/2020] [Indexed: 05/08/2023]
Abstract
The wide prevalence and regulated expression of long noncoding RNAs (lncRNAs) highlight their functional roles, but the molecular basis for their activities and structure-function relationships remains to be investigated, with few exceptions. Among the relatively few lncRNAs conserved over significant evolutionary distances is the long intergenic noncoding RNA (lincRNA) Cyrano (orthologous to human OIP5-AS1), which contains a region of 300 highly conserved nucleotides within tetrapods, which in turn contains a functional stretch of 26 nt of deep conservation. This region binds to and facilitates the degradation of the microRNA miR-7, a short ncRNA with multiple cellular functions, including modulation of oncogenic expression. We probed the secondary structure of Cyrano in vitro and in cells using chemical and enzymatic probing, and validated the results using comparative sequence analysis. At the center of the functional core of Cyrano is a cloverleaf structure maintained over the >400 million years of divergent evolution that separates fish and primates. This strikingly conserved motif provides interaction sites for several RNA-binding proteins and masks a conserved recognition site for miR-7. Conservation in this region strongly suggests that the function of Cyrano depends on the formation of this RNA structure, which could modulate the rate and efficiency of degradation of miR-7.
Affiliation(s)
- Alisha N Jones
  - Department of Chemistry, University of Washington, Box 351700, Seattle, Washington 98195, USA
- Giuseppina Pisignano
  - Department of Chemistry, University of Washington, Box 351700, Seattle, Washington 98195, USA
  - Tumor Biology and Experimental Therapeutics Program, Institute of Oncology Research (IOR) and Oncology Institute of Southern Switzerland (IOSI), Bellinzona CH-6500, Switzerland
  - Department of Biology and Biochemistry, University of Bath, Claverton Down, Bath, BA2 7AY, United Kingdom
- Thomas Pavelitz
  - Department of Chemistry, University of Washington, Box 351700, Seattle, Washington 98195, USA
- Jessica White
  - Department of Chemistry, University of Washington, Box 351700, Seattle, Washington 98195, USA
- Martin Kinisu
  - Department of Chemistry, University of Washington, Box 351700, Seattle, Washington 98195, USA
- Nicholas Forino
  - Department of Chemistry, University of Washington, Box 351700, Seattle, Washington 98195, USA
- Dreycey Albin
  - Department of Chemistry, University of Washington, Box 351700, Seattle, Washington 98195, USA
- Gabriele Varani
  - Department of Chemistry, University of Washington, Box 351700, Seattle, Washington 98195, USA
9. Budak H, Kaya SB, Cagirici HB. Long Non-coding RNA in Plants in the Era of Reference Sequences. Frontiers in Plant Science 2020; 11:276. [PMID: 32226437 PMCID: PMC7080850 DOI: 10.3389/fpls.2020.00276] [Citation(s) in RCA: 72] [Impact Index Per Article: 18.0] [Received: 12/06/2018] [Accepted: 02/21/2020] [Indexed: 05/04/2023]
Abstract
The discovery of non-coding RNAs (ncRNAs), and the subsequent elucidation of their functional roles, was largely delayed due to the misidentification of non-protein-coding parts of DNA as "junk DNA," which forced ncRNAs into the shadows of their protein-coding counterparts. However, over the past decade, insight into the important regulatory roles of ncRNAs has led to rapid progress in their identification and characterization. Of the different types of ncRNAs, long non-coding RNAs (lncRNAs) have attracted considerable attention due to their mRNA-like structures and gene regulatory functions in plant stress responses. While RNA sequencing has been commonly used for mining lncRNAs, a lack of widespread conservation at the sequence level, in addition to relatively low and highly tissue-specific expression patterns, challenges high-throughput in silico identification approaches. The complex folding characteristics of lncRNA molecules also complicate target predictions, as the knowledge about the interaction interfaces between lncRNAs and potential targets is insufficient. Progress in characterizing lncRNAs and their targets from different species may hold the key to efficient identification of this class of ncRNAs from transcriptomic and potentially genomic resources. In wheat and barley, two of the most important crops, the knowledge about lncRNAs is very limited. However, recently published high-quality genomes of these crops are considered promising resources for the identification of not only lncRNAs, but any class of molecules. Considering the increasing demand for food, these resources should be used efficiently to discover the molecular mechanisms underlying development and abiotic and biotic stress responses. As our understanding of lncRNAs expands, interactions among ncRNA classes, as well as interactions with the coding sequences, will likely define novel functional networks that may be modulated for crop improvement.
Affiliation(s)
- Hikmet Budak
  - Montana BioAgriculture, Inc., Bozeman, MT, United States
  - *Correspondence: Hikmet Budak,
- Sezgi Biyiklioglu Kaya
  - Engineering and Natural Sciences, Molecular Biology, Genetics and Bioengineering Program, Sabancı University, Istanbul, Turkey
- Halise Busra Cagirici
  - Engineering and Natural Sciences, Molecular Biology, Genetics and Bioengineering Program, Sabancı University, Istanbul, Turkey
10. Sanchez de Groot N, Armaos A, Graña-Montes R, Alriquet M, Calloni G, Vabulas RM, Tartaglia GG. RNA structure drives interaction with proteins. Nat Commun 2019; 10:3246. [PMID: 31324771 PMCID: PMC6642211 DOI: 10.1038/s41467-019-10923-5] [Citation(s) in RCA: 96] [Impact Index Per Article: 19.2] [Received: 08/13/2018] [Accepted: 06/10/2019] [Indexed: 12/12/2022]
Abstract
The combination of high-throughput sequencing and in vivo crosslinking approaches leads to the progressive uncovering of the complex interdependence between cellular transcriptome and proteome. Yet, the molecular determinants governing interactions in protein-RNA networks are not well understood. Here we investigated the relationship between the structure of an RNA and its ability to interact with proteins. Analysing in silico, in vitro and in vivo experiments, we find that the amount of double-stranded regions in an RNA correlates with the number of protein contacts. This relationship -which we call structure-driven protein interactivity- allows classification of RNA types, plays a role in gene regulation and could have implications for the formation of phase-separated ribonucleoprotein assemblies. We validate our hypothesis by showing that a highly structured RNA can rearrange the composition of a protein aggregate. We report that the tendency of proteins to phase-separate is reduced by interactions with specific RNAs.
Affiliation(s)
- Natalia Sanchez de Groot
  - Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, Dr. Aiguader 88, 08003, Barcelona, Spain
- Alexandros Armaos
  - Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, Dr. Aiguader 88, 08003, Barcelona, Spain
- Ricardo Graña-Montes
  - Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, Dr. Aiguader 88, 08003, Barcelona, Spain
  - Department of Biochemistry, University of Zürich, Winterthurerstrasse 190, 8057, Zürich, Switzerland
- Marion Alriquet
  - Buchmann Institute for Molecular Life Sciences, Goethe University Frankfurt, 60438, Frankfurt am Main, Germany
  - Institute of Biophysical Chemistry, Goethe University Frankfurt, 60438, Frankfurt am Main, Germany
- Giulia Calloni
  - Buchmann Institute for Molecular Life Sciences, Goethe University Frankfurt, 60438, Frankfurt am Main, Germany
  - Institute of Biophysical Chemistry, Goethe University Frankfurt, 60438, Frankfurt am Main, Germany
- R Martin Vabulas
  - Buchmann Institute for Molecular Life Sciences, Goethe University Frankfurt, 60438, Frankfurt am Main, Germany
  - Institute of Biophysical Chemistry, Goethe University Frankfurt, 60438, Frankfurt am Main, Germany
- Gian Gaetano Tartaglia
  - Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, Dr. Aiguader 88, 08003, Barcelona, Spain
  - ICREA 23 Passeig Lluis Companys 08010 and Universitat Pompeu Fabra (UPF), 08003, Barcelona, Spain
  - Department of Biology 'Charles Darwin', Sapienza University of Rome, P.le A. Moro 5, Rome, 00185, Italy
  - Department of Neuroscience and Brain Technologies, Istituto Italiano di Tecnologia, Via Morego 30, 16163, Genoa, Italy
11. Distribution, Characteristics, and Regulatory Potential of Long Noncoding RNAs in Brown-Rot Fungi. Int J Genomics 2019; 2019:9702342. [PMID: 31192251 PMCID: PMC6525899 DOI: 10.1155/2019/9702342] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Received: 01/13/2019] [Accepted: 03/07/2019] [Indexed: 12/22/2022]
Abstract
Long noncoding RNAs have been thoroughly studied in plants, animals, and yeasts, where they play important roles as regulators of transcription. Nevertheless, almost nothing is known about their presence and characteristics in filamentous fungi, especially in basidiomycetes. In the present study, we have carried out an exhaustive annotation and characterization of lncRNAs in two lignin degrader basidiomycetes, Coniophora puteana and Serpula lacrymans. We identified 2,712 putative lncRNAs in the former and 2,242 in the latter, mainly originating from intergenic locations of transposon-sparse genomic regions. The lncRNA length, GC content, expression levels, and stability of the secondary structure differ from coding transcripts but are similar in these two species and resemble that of other eukaryotes. Nevertheless, they lack sequence conservation. Also, we found that lncRNAs are transcriptionally regulated in the same proportion as genes when the fungus actively decomposes soil organic matter. Finally, up to 7% of the upstream gene regions of Coniophora puteana and Serpula lacrymans are transcribed and produce lncRNAs. The study of expression trends in these gene-lncRNA pairs uncovered groups with similar and opposite transcriptional profiles which may be the result of cis-transcriptional regulation.
12. Pegueroles C, Iraola-Guzmán S, Chorostecki U, Ksiezopolska E, Saus E, Gabaldón T. Transcriptomic analyses reveal groups of co-expressed, syntenic lncRNAs in four species of the genus Caenorhabditis. RNA Biol 2019; 16:320-329. [PMID: 30691342 PMCID: PMC6380332 DOI: 10.1080/15476286.2019.1572438] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Received: 10/29/2018] [Revised: 12/18/2018] [Accepted: 01/13/2019] [Indexed: 01/24/2023]
Abstract
Long non-coding RNAs (lncRNAs) are a heterogeneous class of genes that do not code for proteins. Since lncRNAs (or a fraction thereof) are expected to be functional, many efforts have been dedicated to cataloging lncRNAs in numerous organisms, but our knowledge of lncRNAs in non-vertebrate species remains very limited. Here, we annotated lncRNAs using transcriptomic data from the same larval stage of four Caenorhabditis species. The number of annotated lncRNAs in self-fertile nematodes was lower than in out-crossing species. We used a combination of approaches to identify putatively homologous lncRNAs: synteny, sequence conservation, and structural conservation. We classified a total of 1,532 out of 7,635 genes from the four species into families of lncRNAs with conserved synteny and expression at the larval stage, suggesting that a large fraction of the predicted lncRNAs may be species specific. Although both sequence and local secondary structure seem to be poorly conserved, sequences within families frequently shared BLASTn hits and short sequence motifs, which were more likely to be unpaired in the predicted structures. We provide the first multi-species catalog of lncRNAs in nematodes and identify groups of lncRNAs with conserved synteny and expression that share exposed motifs.
Affiliation(s)
- Cinta Pegueroles
  - Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
  - Universitat Pompeu Fabra, Barcelona, Spain
- Susana Iraola-Guzmán
  - Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
  - Universitat Pompeu Fabra, Barcelona, Spain
- Uciel Chorostecki
  - Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
  - Universitat Pompeu Fabra, Barcelona, Spain
- Ewa Ksiezopolska
  - Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
  - Universitat Pompeu Fabra, Barcelona, Spain
- Ester Saus
  - Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
  - Universitat Pompeu Fabra, Barcelona, Spain
- Toni Gabaldón
  - Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
  - Universitat Pompeu Fabra, Barcelona, Spain
  - Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain
13. Nohata N, Abba MC, Gutkind JS. Unraveling the oral cancer lncRNAome: Identification of novel lncRNAs associated with malignant progression and HPV infection. Oral Oncol 2018; 59:58-66. [PMID: 27424183 DOI: 10.1016/j.oraloncology.2016.05.014] [Citation(s) in RCA: 71] [Impact Index Per Article: 11.8] [Received: 03/31/2016] [Revised: 04/25/2016] [Accepted: 05/20/2016] [Indexed: 12/31/2022]
Abstract
OBJECTIVES: The role of long non-coding RNA (lncRNA) expression in human head and neck squamous cell carcinoma (HNSCC) is still poorly understood. In this study, we aimed to establish the onco-lncRNAome profile of HNSCC and to identify lncRNAs correlating with prognosis and patient survival. MATERIALS AND METHODS: The Atlas of Noncoding RNAs in Cancer (TANRIC) database was employed to retrieve the lncRNA expression information generated from The Cancer Genome Atlas (TCGA) HNSCC RNA-sequencing data. RNA-sequencing data from HNSCC cell lines were also considered for this study. Bioinformatics approaches, such as differential gene expression analysis, survival analysis, principal component analysis, and Co-LncRNA enrichment analysis, were performed. RESULTS: Using TCGA HNSCC RNA-sequencing data from 426 HNSCC and 42 adjacent normal tissues, we found 728 lncRNA transcripts significantly and differentially expressed in HNSCC. Among the 728 lncRNAs, 55 lncRNAs were significantly associated with poor prognosis in terms of overall survival and/or disease-free survival. Next, we found 140 lncRNA transcripts significantly and differentially expressed between Human Papilloma Virus (HPV) positive tumors and HPV negative tumors. Thirty lncRNA transcripts were differentially expressed between TP53 mutated and TP53 wild type tumors. Co-LncRNA analysis suggested that protein-coding genes that are co-expressed with these deregulated lncRNAs might be involved in cancer associated molecular events. Considering the differential expression of lncRNAs in a panel of HNSCC cell lines (n=22), we found several lncRNAs that may represent potential targets for diagnosis, therapy and prevention of HNSCC. CONCLUSION: LncRNA profiling could provide novel insights into the potential mechanisms of HNSCC oncogenesis.
Affiliation(s)
- Nijiro Nohata
  - Moores Cancer Center, University of California, San Diego, La Jolla, CA, United States
- Martin C Abba
  - CINIBA, CONICET, School of Medical Sciences, National University of La Plata, La Plata, Argentina
- J Silvio Gutkind
  - Moores Cancer Center, University of California, San Diego, La Jolla, CA, United States
  - Department of Pharmacology, University of California, San Diego, La Jolla, CA, United States
14. Yang JR. Does mRNA structure contain genetic information for regulating co-translational protein folding? Zool Res 2018; 38:36-43. [PMID: 28271668 PMCID: PMC5368379 DOI: 10.13918/j.issn.2095-8137.2017.004] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Indexed: 01/25/2023]
Abstract
Currently, many facets of genetic information are ill-defined. In particular, how protein folding is genetically regulated has been a long-standing issue for genetics and protein biology, and a generic mechanistic model supported by genomic data is still lacking. Recent technological advances have enabled much-needed genome-wide experiments. While opening the effect of codon optimality to debate, these studies have supplied mounting evidence suggesting a role of mRNA structure in the regulation of protein folding by modulating translational elongation rate. In conjunction with previous theories, this mechanistic model of protein folding guided by mRNA structure should expand our understanding of genetic information and offer new insights into various biomedical puzzles.
Affiliation(s)
- Jian-Rong Yang
  - Department of Biology, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 510080, China
15. Harris ZN, Kovacs LG, Londo JP. RNA-seq-based genome annotation and identification of long-noncoding RNAs in the grapevine cultivar 'Riesling'. BMC Genomics 2017; 18:937. [PMID: 29197332 PMCID: PMC5712117 DOI: 10.1186/s12864-017-4346-6] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Received: 08/25/2017] [Accepted: 11/22/2017] [Indexed: 12/31/2022]
Abstract
BACKGROUND: The technological advances of RNA-seq and de novo transcriptome assembly have enabled genome annotation and transcriptome profiling in highly heterozygous species such as grapevine (Vitis vinifera L.). This work is an attempt to utilize a de novo-assembled transcriptome of the V. vinifera cultivar 'Riesling' to improve annotation of the grapevine reference genome sequence. RESULTS: Here we show that the transcriptome assembly of a single V. vinifera cultivar is insufficient for a complete genome annotation of the grapevine reference genome constructed from V. vinifera PN40024. Further, we provide evidence that the gene models we identified cannot be completely anchored to the previously published V. vinifera PN40024 gene models. In addition to these findings, we present a computational pipeline for the de novo identification of lncRNAs. Our results demonstrate that, in grapevine, lncRNAs are significantly different from protein-coding transcripts in such metrics as length, GC-content, minimum free energy, and length-corrected minimum free energy. CONCLUSIONS: In grapevine, high-level heterozygosity necessitates that transcriptome characterization be based on cultivar-specific reference genome sequences. Our results strengthen the hypothesis that lncRNAs have thermodynamic properties different from those of protein-coding RNAs. The analyses of both coding and non-coding RNAs will be instrumental in uncovering inter-cultivar variation in wild and cultivated grapevine species.
Affiliation(s)
- Zachary N. Harris
- Missouri State University, Biology Department, 901 S. National Ave, Springfield, MO USA
- Present address: Saint Louis University, Department of Biology, 1 N. Grand Blvd, Saint Louis, MO USA
- Laszlo G. Kovacs
- Missouri State University, Biology Department, 901 S. National Ave, Springfield, MO USA
- Jason P. Londo
- United States Department of Agriculture, Agricultural Research Service, Grape Genetics Research Unit, 630 W. North Street, Geneva, NY USA
16
Tarifeño-Saldivia E, Valenzuela-Miranda D, Gallardo-Escárate C. In the shadow: The emerging role of long non-coding RNAs in the immune response of Atlantic salmon. Developmental and Comparative Immunology 2017; 73:193-205. [PMID: 28373064 DOI: 10.1016/j.dci.2017.03.024] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Received: 02/01/2017] [Revised: 03/28/2017] [Accepted: 03/29/2017] [Indexed: 06/07/2023]
Abstract
The genomic era has increased the research effort to uncover how the genome of an organism, and specifically the transcriptome, is modulated after interacting with pathogenic microorganisms and ectoparasites. However, the ever-increasing accessibility of sequencing technology has also revealed regulatory roles of long non-coding RNAs (lncRNAs) in several biological processes, including the immune response. This study reports a high-confidence annotation and a comparative transcriptome analysis of lncRNAs from several tissues of Salmo salar infected with the most prevalent pathogens in Chilean salmon aquaculture: the infectious salmon anemia (ISA) virus, the intracellular bacterium Piscirickettsia salmonis and the ectoparasitic copepod Caligus rogercresseyi. Our analyses showed that lncRNAs are widely modulated during infection. However, this modulation is pathogen-specific and highly correlated with immune-related genes associated with the innate immune response. These findings represent the first report of widespread differential expression of lncRNAs in response to infections with different types of pathogens in Atlantic salmon, suggesting that lncRNAs are pivotal players during the fish immune response.
Affiliation(s)
- E Tarifeño-Saldivia
- Laboratory of Biotechnology and Aquatic Genomics, Interdisciplinary Center for Aquaculture Research (INCAR), University of Concepción, Concepción, Chile
- D Valenzuela-Miranda
- Laboratory of Biotechnology and Aquatic Genomics, Interdisciplinary Center for Aquaculture Research (INCAR), University of Concepción, Concepción, Chile
- C Gallardo-Escárate
- Laboratory of Biotechnology and Aquatic Genomics, Interdisciplinary Center for Aquaculture Research (INCAR), University of Concepción, Concepción, Chile.
17
Lopez-Ezquerra A, Harrison MC, Bornberg-Bauer E. Comparative analysis of lincRNA in insect species. BMC Evol Biol 2017; 17:155. [PMID: 28673235 PMCID: PMC5494802 DOI: 10.1186/s12862-017-0985-0] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Received: 02/14/2017] [Accepted: 06/02/2017] [Indexed: 01/19/2023] Open
Abstract
BACKGROUND: The ever-increasing availability of genomes makes it possible to investigate and compare not only the genomic complements of genes and proteins, but also of RNAs. One class of RNAs, the long noncoding RNAs (lncRNAs), and in particular their subclass of long intergenic noncoding RNAs (lincRNAs), have recently gained much attention because of their roles in the regulation of important biological processes such as immune response or cell differentiation, and as possible evolutionary precursors for protein-coding genes. lincRNAs seem to be poorly conserved at the sequence level, but at least some lincRNAs have conserved structural elements and syntenic genomic positions. Previous studies showed that transposable elements are a main contributor to the evolution of lincRNAs in mammals. In contrast, plant lincRNA emergence and evolution have been linked with local duplication events. However, little is known about their evolutionary dynamics in general and in insect genomes in particular. RESULTS: Here we compared lincRNAs between seven insect genomes and investigated possible evolutionary changes and functional roles. We find very low sequence conservation between different species and that similarities within a species are mostly due to their association with transposable elements (TEs) and simple repeats. Furthermore, we find that TEs are less frequent in lincRNA exons than in their introns, indicating that TEs may have been removed by selection. When we analysed the predicted thermodynamic stabilities of lincRNAs, we found that they are more stable than their randomized controls, which might indicate some selection pressure to maintain certain structural elements. We list several of the most stable lincRNAs, which could serve as prime candidates for future functional studies. We also discuss the possibility of de novo protein-coding genes emerging from lincRNAs, because lincRNAs with high GC content and potentially with longer open reading frames (ORFs) are candidate loci where de novo gene emergence might occur. CONCLUSION: The processes responsible for the emergence and diversification of lincRNAs in insects remain unclear. Both duplication and transposable elements may be important for the creation of new lincRNAs in insects.
Affiliation(s)
- Alberto Lopez-Ezquerra
- Institute of Evolution and Biodiversity, University of Münster, Hüfferstrasse,1, Münster, Münster, Germany
- Mark C Harrison
- Institute of Evolution and Biodiversity, University of Münster, Hüfferstrasse,1, Münster, Münster, Germany
- Erich Bornberg-Bauer
- Institute of Evolution and Biodiversity, University of Münster, Hüfferstrasse,1, Münster, Münster, Germany.
18
Long noncoding RNAs: Unexplored players in the drug response of the sea louse Caligus rogercresseyi. Agri Gene 2017. [DOI: 10.1016/j.aggene.2017.03.001] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Indexed: 12/21/2022]
19
Beermann J, Piccoli MT, Viereck J, Thum T. Non-coding RNAs in Development and Disease: Background, Mechanisms, and Therapeutic Approaches. Physiol Rev 2017; 96:1297-325. [PMID: 27535639 DOI: 10.1152/physrev.00041.2015] [Citation(s) in RCA: 1244] [Impact Index Per Article: 177.7] [Indexed: 12/24/2022] Open
Abstract
Advances in RNA-sequencing techniques have led to the discovery of thousands of non-coding transcripts with unknown function. There are several types of non-coding linear RNAs, such as microRNAs (miRNAs) and long non-coding RNAs (lncRNAs), as well as circular RNAs (circRNAs) consisting of a closed continuous loop. This review guides the reader through important aspects of non-coding RNA biology. These include their biogenesis, modes of action and physiological function, as well as their role in the disease context (such as in cancer or the cardiovascular system). We specifically focus on non-coding RNAs as potential therapeutic targets and diagnostic biomarkers.
Affiliation(s)
- Julia Beermann
- Institute of Molecular and Translational Therapeutic Strategies, Hannover Medical School, Hannover, Germany; and National Heart and Lung Institute, Imperial College London, London, United Kingdom
- Maria-Teresa Piccoli
- Institute of Molecular and Translational Therapeutic Strategies, Hannover Medical School, Hannover, Germany; and National Heart and Lung Institute, Imperial College London, London, United Kingdom
- Janika Viereck
- Institute of Molecular and Translational Therapeutic Strategies, Hannover Medical School, Hannover, Germany; and National Heart and Lung Institute, Imperial College London, London, United Kingdom
- Thomas Thum
- Institute of Molecular and Translational Therapeutic Strategies, Hannover Medical School, Hannover, Germany; and National Heart and Lung Institute, Imperial College London, London, United Kingdom
20
Integrative classification of human coding and noncoding genes through RNA metabolism profiles. Nat Struct Mol Biol 2016; 24:86-96. [PMID: 27870833 DOI: 10.1038/nsmb.3325] [Citation(s) in RCA: 117] [Impact Index Per Article: 14.6] [Received: 05/17/2016] [Accepted: 10/18/2016] [Indexed: 12/26/2022]
Abstract
Pervasive transcription of the human genome results in a heterogeneous mix of coding RNAs and long noncoding RNAs (lncRNAs). Only a small fraction of lncRNAs have demonstrated regulatory functions, thus making functional lncRNAs difficult to distinguish from nonfunctional transcriptional byproducts. This difficulty has resulted in numerous competing human lncRNA classifications that are complicated by a steady increase in the number of annotated lncRNAs. To address these challenges, we quantitatively examined transcription, splicing, degradation, localization and translation for coding and noncoding human genes. We observed that annotated lncRNAs had lower synthesis and higher degradation rates than mRNAs and discovered mechanistic differences explaining slower lncRNA splicing. We grouped genes into classes with similar RNA metabolism profiles, containing both mRNAs and lncRNAs to varying extents. These classes exhibited distinct RNA metabolism, different evolutionary patterns and differential sensitivity to cellular RNA-regulatory pathways. Our classification provides an alternative to genomic context-driven annotations of lncRNAs.
21
Ulitsky I. Evolution to the rescue: using comparative genomics to understand long non-coding RNAs. Nat Rev Genet 2016; 17:601-14. [DOI: 10.1038/nrg.2016.85] [Citation(s) in RCA: 373] [Impact Index Per Article: 46.6] [Indexed: 12/13/2022]
22
Secondary structure impacts patterns of selection in human lncRNAs. BMC Biol 2016; 14:60. [PMID: 27457204 PMCID: PMC4960838 DOI: 10.1186/s12915-016-0283-0] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.6] [Received: 05/24/2016] [Accepted: 07/04/2016] [Indexed: 02/04/2023] Open
Abstract
Background: Metazoans transcribe many long non-coding RNAs (lncRNAs) that are poorly conserved and whose function remains unknown. This has raised the questions of what fraction of the predicted lncRNAs is actually functional, and whether selection can effectively constrain lncRNAs in species with small effective population sizes such as human populations. Results: Here we evaluate signatures of selection in human lncRNAs using inter-specific data and intra-specific comparisons from five major populations, as well as by assessing relationships between sequence variation and predictions of secondary structure. In all analyses we included a reference of functionally characterized lncRNAs. Altogether, our results show compelling evidence of recent purifying selection acting on both characterized and predicted lncRNAs. We found that RNA secondary structure constrains sequence variation in lncRNAs, so that polymorphisms are depleted in paired regions with low accessibility and tend to be neutral with respect to structural stability. Conclusions: Important implications of our results are that secondary structure plays a role in the functionality of lncRNAs, and that the set of predicted lncRNAs contains a large fraction of functional ones that may play key roles that remain to be discovered.
23
Ruminski DJ, Watson PY, Mahen EM, Fedor MJ. A DEAD-box RNA helicase promotes thermodynamic equilibration of kinetically trapped RNA structures in vivo. RNA (New York, N.Y.) 2016; 22:416-27. [PMID: 26759451 PMCID: PMC4748819 DOI: 10.1261/rna.055178.115] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Received: 11/04/2015] [Accepted: 12/05/2015] [Indexed: 05/24/2023]
Abstract
RNAs must assemble into specific structures in order to carry out their biological functions, but in vitro RNA folding reactions produce multiple misfolded structures that fail to exchange with functional structures on biological time scales. We used carefully designed self-cleaving mRNAs that assemble through well-defined folding pathways to identify factors that differentiate intracellular and in vitro folding reactions. Our previous work showed that simple base-paired RNA helices form and dissociate with the same rate and equilibrium constants in vivo and in vitro. However, exchange between adjacent secondary structures occurs much faster in vivo, enabling RNAs to quickly adopt structures with the lowest free energy. We have now used this approach to probe the effects of an extensively characterized DEAD-box RNA helicase, Mss116p, on a series of well-defined RNA folding steps in yeast. Mss116p overexpression had no detectable effect on helix formation or dissociation kinetics or on the stability of interdomain tertiary interactions, consistent with previous evidence that intracellular factors do not affect these folding parameters. However, Mss116p overexpression did accelerate exchange between adjacent helices. The nonprocessive nature of RNA duplex unwinding by DEAD-box RNA helicases is consistent with a branch migration mechanism in which Mss116p lowers barriers to exchange between otherwise stable helices by the melting and annealing of one or two base pairs at interhelical junctions. These results suggest that the helicase activity of DEAD-box proteins like Mss116p distinguishes intracellular RNA folding pathways from nonproductive RNA folding reactions in vitro and allows RNA structures to overcome kinetic barriers to thermodynamic equilibration in vivo.
Affiliation(s)
- Dana J Ruminski
- Department of Chemical Physiology, Department of Cell and Molecular Biology, and The Skaggs Institute for Chemical Biology, The Scripps Research Institute, La Jolla, California 92037, USA
- Peter Y Watson
- Department of Chemical Physiology, Department of Cell and Molecular Biology, and The Skaggs Institute for Chemical Biology, The Scripps Research Institute, La Jolla, California 92037, USA
- Elisabeth M Mahen
- Department of Chemical Physiology, Department of Cell and Molecular Biology, and The Skaggs Institute for Chemical Biology, The Scripps Research Institute, La Jolla, California 92037, USA
- Martha J Fedor
- Department of Chemical Physiology, Department of Cell and Molecular Biology, and The Skaggs Institute for Chemical Biology, The Scripps Research Institute, La Jolla, California 92037, USA
24
Wiberg RAW, Halligan DL, Ness RW, Necsulea A, Kaessmann H, Keightley PD. Assessing Recent Selection and Functionality at Long Noncoding RNA Loci in the Mouse Genome. Genome Biol Evol 2015; 7:2432-44. [PMID: 26272717 PMCID: PMC4558870 DOI: 10.1093/gbe/evv155] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Accepted: 08/03/2015] [Indexed: 12/27/2022] Open
Abstract
Long noncoding RNAs (lncRNAs) are one of the most intensively studied groups of noncoding elements. Debate continues over what proportion of lncRNAs are functional or merely represent transcriptional noise. Although characterization of individual lncRNAs has identified approximately 200 functional loci across the Eukarya, general surveys have found only modest or no evidence of long-term evolutionary conservation. Although this lack of conservation suggests that most lncRNAs are nonfunctional, the possibility remains that some represent recent evolutionary innovations. We examine recent selection pressures acting on lncRNAs in mouse populations. We compare patterns of within-species nucleotide variation at approximately 10,000 lncRNA loci in a cohort of the wild house mouse, Mus musculus castaneus, with between-species nucleotide divergence from the rat (Rattus norvegicus). Loci under selective constraint are expected to show reduced nucleotide diversity and divergence. We find limited evidence of sequence conservation compared with putatively neutrally evolving ancestral repeats (ARs). Comparisons of sequence diversity and divergence between ARs, protein-coding (PC) exons and lncRNAs, and the associated flanking regions, show weak, but significantly lower levels of sequence diversity and divergence at lncRNAs compared with ARs. lncRNAs conserved deep in the vertebrate phylogeny show lower within-species sequence diversity than lncRNAs in general. A set of 74 functionally characterized lncRNAs show levels of diversity and divergence comparable to PC exons, suggesting that these lncRNAs are under substantial selective constraints. Our results suggest that, in mouse populations, most lncRNA loci evolve at rates similar to ARs, whereas older lncRNAs tend to show signals of selection similar to PC genes.
Affiliation(s)
- R Axel W Wiberg
- Institute of Evolutionary Biology, University of Edinburgh, United Kingdom
- Present address: Centre for Biological Diversity, School of Biology, University of St. Andrews, United Kingdom
- Daniel L Halligan
- Institute of Evolutionary Biology, University of Edinburgh, United Kingdom
- Rob W Ness
- Institute of Evolutionary Biology, University of Edinburgh, United Kingdom
- Anamaria Necsulea
- School of Life Sciences, Ecole Polytechnique Fédérale Lausanne, Lausanne, Switzerland
- Henrik Kaessmann
- Center for Integrative Genomics, University of Lausanne, Switzerland
- Peter D Keightley
- Institute of Evolutionary Biology, University of Edinburgh, United Kingdom
25
Housman G, Ulitsky I. Methods for distinguishing between protein-coding and long noncoding RNAs and the elusive biological purpose of translation of long noncoding RNAs. Biochimica et Biophysica Acta-Gene Regulatory Mechanisms 2015; 1859:31-40. [PMID: 26265145 DOI: 10.1016/j.bbagrm.2015.07.017] [Citation(s) in RCA: 67] [Impact Index Per Article: 7.4] [Received: 03/31/2015] [Revised: 06/18/2015] [Accepted: 07/19/2015] [Indexed: 12/12/2022]
Abstract
Long noncoding RNAs (lncRNAs) are a diverse class of RNAs with increasingly appreciated functions in vertebrates, yet much of their biology remains poorly understood. In particular, it is unclear to what extent the current catalog of over 10,000 annotated lncRNAs is indeed devoid of genes coding for proteins. Here we review the available computational and experimental schemes for distinguishing between coding and noncoding transcripts and assess the conclusions from their recent genome-wide applications. We conclude that the model most consistent with the available data is that a large number of mammalian lncRNAs undergo translation, but only a very small minority of such translation events results in stable and functional peptides. The outcomes of the majority of the translation events and their potential biological purposes remain an intriguing topic for future investigation. This article is part of a Special Issue entitled: Clues to long noncoding RNA taxonomy, edited by Dr. Tetsuro Hirose and Dr. Shinichi Nakagawa.
Affiliation(s)
- Gali Housman
- Department of Biological Regulation, Weizmann Institute of Science, Rehovot 76100, Israel
- Igor Ulitsky
- Department of Biological Regulation, Weizmann Institute of Science, Rehovot 76100, Israel.