1
|
Fafard-Couture É, Labialle S, Scott MS. The regulatory roles of small nucleolar RNAs within their host locus. RNA Biol 2024; 21:1-11. [PMID: 38626213 PMCID: PMC11028025 DOI: 10.1080/15476286.2024.2342685] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/08/2024] [Indexed: 04/18/2024] Open
Abstract
Small nucleolar RNAs (snoRNAs) are a class of conserved noncoding RNAs forming complexes with proteins to catalyse site-specific modifications on ribosomal RNA. Besides this canonical role, several snoRNAs are now known to regulate diverse levels of gene expression. While these functions are carried out in trans by mature snoRNAs, evidence has also been emerging of regulatory roles of snoRNAs in cis, either within their genomic locus or as longer transcription intermediates during their maturation. Herein, we review recent findings that snoRNAs can interact in cis with their intron to regulate the expression of their host gene. We also explore the ever-growing diversity of longer host-derived snoRNA extensions and their functional impact across the transcriptome. Finally, we discuss the role of snoRNA duplications into forging these new layers of snoRNA-mediated regulation, as well as their involvement in the genomic imprinting of their host locus.
Collapse
Affiliation(s)
- Étienne Fafard-Couture
- Département de biochimie et de génomique fonctionnelle, Faculté de médecine et des sciences de la santé, Université de Sherbrooke, Sherbrooke, Québec, Canada
| | | | - Michelle S Scott
- Département de biochimie et de génomique fonctionnelle, Faculté de médecine et des sciences de la santé, Université de Sherbrooke, Sherbrooke, Québec, Canada
| |
Collapse
|
2
|
Lis B, Domagała PJ, Lis JA. Tribe Acalyptaini ( Hemiptera: Tingidae: Tinginae) Revisited: Can Apomorphies in Secondary and Tertiary Structures of 18S rRNA Length-Variable Regions (LVRs) Support Tribe Validity? INSECTS 2023; 14:600. [PMID: 37504606 PMCID: PMC10380217 DOI: 10.3390/insects14070600] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Revised: 06/25/2023] [Accepted: 06/28/2023] [Indexed: 07/29/2023]
Abstract
The lace bug tribe Acalyptaini (Tingidae: Tinginae) includes five genera, Acalypta, Derephysia, Dictyonota, Kalama, and Recaredus, and it was recently resurrected based on morphological and karyological characters. We aimed to validate the distinctiveness of this tribe using 18S rDNA sequences, which have not been used in previous Tingidae phylogenomic studies. Our results confirmed the monophyly of the tribe. Moreover, the monophyly of the subfamily Cantacaderinae and its basal position within the family Tingidae were indicated, as well as the position of the tribe Litadeini as sister to all other Tinginae. In addition, we attempted to determine the apomorphic morpho-molecular characters in the secondary and tertiary structures of length-variable regions of the 18S rRNA sequences of the analysed species. The results showed that two LVRs (LVR X and LVR L) of the hypervariable region V4 exhibited significant variability in the number of nucleotides and could be considered for apomorphic recognition.
Collapse
Affiliation(s)
- Barbara Lis
- Institute of Biology, University of Opole, 45-052 Opole, Oleska 22, Poland
| | - Paweł J Domagała
- Institute of Biology, University of Opole, 45-052 Opole, Oleska 22, Poland
| | - Jerzy A Lis
- Institute of Biology, University of Opole, 45-052 Opole, Oleska 22, Poland
| |
Collapse
|
3
|
Lis JA. Molecular Apomorphies in the Secondary and Tertiary Structures of Length-Variable Regions (LVRs) of 18S rRNA Shed Light on the Systematic Position of the Family Thaumastellidae (Hemiptera: Heteroptera: Pentatomoidea). Int J Mol Sci 2023; 24:ijms24097758. [PMID: 37175465 PMCID: PMC10178826 DOI: 10.3390/ijms24097758] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2023] [Revised: 04/17/2023] [Accepted: 04/19/2023] [Indexed: 05/15/2023] Open
Abstract
The SSU nrDNA, a small subunit of the nuclear ribosomal DNA (coding 18S rRNA), is one of the most frequently sequenced genes in molecular studies in Hexapoda. In insects, including true bugs (Hemiptera: Heteroptera), only its primary structures (i.e., aligned sequences) are predominantly used in phylogenetic reconstructions. It is known that including RNA secondary structures in the alignment procedure is essential for improving accuracy and robustness in phylogenetic tree reconstruction. Moreover, local plasticity in rRNAs might impact their tertiary structures and corresponding functions. To determine the systematic position of Thaumastellidae within the superfamily Pentatomoidea, the secondary and-for the first time among all Hexapoda-tertiary structures of 18S rRNAs in twelve pentatomoid families were compared and analysed. Results indicate that the shapes of the secondary and tertiary structures of the length-variable regions (LVRs) in the 18S rRNA are phylogenetically highly informative. Based on these results, it is suggested that the Thaumastellidae is maintained as an independent family within the superfamily Pentatomoidea, rather than as a part of the family Cydnidae. Moreover, the analyses indicate a close relationship between Sehirinae and Parastrachiidae, expressed in morpho-molecular synapomorphies in the predicted secondary and tertiary structures of the length-variable region L (LVR L).
Collapse
Affiliation(s)
- Jerzy A Lis
- Institute of Biology, University of Opole, Oleska 22, 45-052 Opole, Poland
| |
Collapse
|
4
|
Bergeron D, Laforest C, Carpentier S, Calvé A, Fafard-Couture É, Deschamps-Francoeur G, Scott MS. SnoRNA copy regulation affects family size, genomic location and family abundance levels. BMC Genomics 2021; 22:414. [PMID: 34090325 PMCID: PMC8178906 DOI: 10.1186/s12864-021-07757-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2021] [Accepted: 05/26/2021] [Indexed: 12/03/2022] Open
Abstract
Background Small nucleolar RNAs (snoRNAs) are an abundant class of noncoding RNAs present in all eukaryotes and best known for their involvement in ribosome biogenesis. In mammalian genomes, many snoRNAs exist in multiple copies, resulting from recombination and retrotransposition from an ancestral snoRNA. To gain insight into snoRNA copy regulation, we used Rfam classification and normal human tissue expression datasets generated using low structure bias RNA-seq to characterize snoRNA families. Results We found that although box H/ACA families are on average larger than box C/D families, the number of expressed members is similar for both types. Family members can cover a wide range of average abundance values, but importantly, expression variability of individual members of a family is preferred over the total variability of the family, especially for box H/ACA snoRNAs, suggesting that while members are likely differentially regulated, mechanisms exist to ensure uniformity of the total family abundance across tissues. Box C/D snoRNA family members are mostly embedded in the same host gene while box H/ACA family members tend to be encoded in more than one different host, supporting a model in which box C/D snoRNA duplication occurred mostly by cis recombination while box H/ACA snoRNA families have gained copy members through retrotransposition. And unexpectedly, snoRNAs encoded in the same host gene can be regulated independently, as some snoRNAs within the same family vary in abundance in a divergent way between tissues. Conclusions SnoRNA copy regulation affects family sizes, genomic location of the members and controls simultaneously member and total family abundance to respond to the needs of individual tissues. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-021-07757-1.
Collapse
Affiliation(s)
- Danny Bergeron
- Département de biochimie et de génomique fonctionnelle, Faculté de médecine et des sciences de la santé, Université de Sherbrooke, Sherbrooke, Québec, J1E 4K8, Canada
| | - Cédric Laforest
- Département de biochimie et de génomique fonctionnelle, Faculté de médecine et des sciences de la santé, Université de Sherbrooke, Sherbrooke, Québec, J1E 4K8, Canada
| | - Stacey Carpentier
- Département de biochimie et de génomique fonctionnelle, Faculté de médecine et des sciences de la santé, Université de Sherbrooke, Sherbrooke, Québec, J1E 4K8, Canada
| | - Annabelle Calvé
- Département de biochimie et de génomique fonctionnelle, Faculté de médecine et des sciences de la santé, Université de Sherbrooke, Sherbrooke, Québec, J1E 4K8, Canada
| | - Étienne Fafard-Couture
- Département de biochimie et de génomique fonctionnelle, Faculté de médecine et des sciences de la santé, Université de Sherbrooke, Sherbrooke, Québec, J1E 4K8, Canada
| | - Gabrielle Deschamps-Francoeur
- Département de biochimie et de génomique fonctionnelle, Faculté de médecine et des sciences de la santé, Université de Sherbrooke, Sherbrooke, Québec, J1E 4K8, Canada
| | - Michelle S Scott
- Département de biochimie et de génomique fonctionnelle, Faculté de médecine et des sciences de la santé, Université de Sherbrooke, Sherbrooke, Québec, J1E 4K8, Canada.
| |
Collapse
|
5
|
Lim CS, Weinstein BN, Roy SW, Brown CM. Analysis of fungal genomes reveals commonalities of intron gain or loss and functions in intron-poor species. Mol Biol Evol 2021; 38:4166-4186. [PMID: 33772558 PMCID: PMC8476143 DOI: 10.1093/molbev/msab094] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Previous evolutionary reconstructions have concluded that early eukaryotic ancestors including both the last common ancestor of eukaryotes and of all fungi had intron-rich genomes. By contrast, some extant eukaryotes have few introns, underscoring the complex histories of intron–exon structures, and raising the question as to why these few introns are retained. Here, we have used recently available fungal genomes to address a variety of questions related to intron evolution. Evolutionary reconstruction of intron presence and absence using 263 diverse fungal species supports the idea that massive intron reduction through intron loss has occurred in multiple clades. The intron densities estimated in various fungal ancestors differ from zero to 7.6 introns per 1 kb of protein-coding sequence. Massive intron loss has occurred not only in microsporidian parasites and saccharomycetous yeasts, but also in diverse smuts and allies. To investigate the roles of the remaining introns in highly-reduced species, we have searched for their special characteristics in eight intron-poor fungi. Notably, the introns of ribosome-associated genes RPL7 and NOG2 have conserved positions; both intron-containing genes encoding snoRNAs. Furthermore, both the proteins and snoRNAs are involved in ribosome biogenesis, suggesting that the expression of the protein-coding genes and noncoding snoRNAs may be functionally coordinated. Indeed, these introns are also conserved in three-quarters of fungi species. Our study shows that fungal introns have a complex evolutionary history and underappreciated roles in gene expression.
Collapse
Affiliation(s)
- Chun Shen Lim
- Department of Biochemistry, School of Biomedical Sciences, University of Otago, Dunedin, New Zealand
| | - Brooke N Weinstein
- Quantitative & Systems Biology, School of Natural Sciences, University of California-Merced, Merced, CA, USA.,Department of Biology, San Francisco State University, San Francisco, CA, USA
| | - Scott W Roy
- Quantitative & Systems Biology, School of Natural Sciences, University of California-Merced, Merced, CA, USA.,Department of Biology, San Francisco State University, San Francisco, CA, USA
| | - Chris M Brown
- Department of Biochemistry, School of Biomedical Sciences, University of Otago, Dunedin, New Zealand
| |
Collapse
|
6
|
Small nucleolar RNAs: continuing identification of novel members and increasing diversity of their molecular mechanisms of action. Biochem Soc Trans 2021; 48:645-656. [PMID: 32267490 PMCID: PMC7200641 DOI: 10.1042/bst20191046] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2020] [Revised: 03/17/2020] [Accepted: 03/19/2020] [Indexed: 12/11/2022]
Abstract
Identified five decades ago amongst the most abundant cellular RNAs, small nucleolar RNAs (snoRNAs) were initially described as serving as guides for the methylation and pseudouridylation of ribosomal RNA through direct base pairing. In recent years, however, increasingly powerful high-throughput genomic approaches and strategies have led to the discovery of many new members of the family and surprising diversity in snoRNA functionality and mechanisms of action. SnoRNAs are now known to target RNAs of many biotypes for a wider range of modifications, interact with diverse binding partners, compete with other binders for functional interactions, recruit diverse players to targets and affect protein function and accessibility through direct interaction. This mini-review presents the continuing characterization of the snoRNome through the identification of new snoRNA members and the discovery of their mechanisms of action, revealing a highly versatile noncoding family playing central regulatory roles and connecting the main cellular processes.
Collapse
|
7
|
Deschamps-Francoeur G, Simoneau J, Scott MS. Handling multi-mapped reads in RNA-seq. Comput Struct Biotechnol J 2020; 18:1569-1576. [PMID: 32637053 PMCID: PMC7330433 DOI: 10.1016/j.csbj.2020.06.014] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2020] [Revised: 06/06/2020] [Accepted: 06/07/2020] [Indexed: 11/07/2022] Open
Abstract
Many eukaryotic genomes harbour large numbers of duplicated sequences, of diverse biotypes, resulting from several mechanisms including recombination, whole genome duplication and retro-transposition. Such repeated sequences complicate gene/transcript quantification during RNA-seq analysis due to reads mapping to more than one locus, sometimes involving genes embedded in other genes. Genes of different biotypes have dissimilar levels of sequence duplication, with long-noncoding RNAs and messenger RNAs sharing less sequence similarity to other genes than biotypes encoding shorter RNAs. Many strategies have been elaborated to handle these multi-mapped reads, resulting in increased accuracy in gene/transcript quantification, although separate tools are typically used to estimate the abundance of short and long genes due to their dissimilar characteristics. This review discusses the mechanisms leading to sequence duplication, the biotypes affected, the computational strategies employed to deal with multi-mapped reads and the challenges that still remain to be overcome.
Collapse
Affiliation(s)
- Gabrielle Deschamps-Francoeur
- Département de Biochimie et Génomique Fonctionnelle, Faculté de médecine et des sciences de la santé, Université de Sherbrooke, Sherbrooke, QC J1E 4K8, Canada
| | - Joël Simoneau
- Département de Biochimie et Génomique Fonctionnelle, Faculté de médecine et des sciences de la santé, Université de Sherbrooke, Sherbrooke, QC J1E 4K8, Canada
| | - Michelle S. Scott
- Département de Biochimie et Génomique Fonctionnelle, Faculté de médecine et des sciences de la santé, Université de Sherbrooke, Sherbrooke, QC J1E 4K8, Canada
| |
Collapse
|
8
|
Kojima KK. Hagfish genome reveals parallel evolution of 7SL RNA-derived SINEs. Mob DNA 2020; 11:18. [PMID: 32489435 PMCID: PMC7245038 DOI: 10.1186/s13100-020-00210-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2020] [Accepted: 03/27/2020] [Indexed: 11/26/2022] Open
Abstract
Background Short interspersed elements (SINEs) are ubiquitous components of eukaryotic genomes. SINEs are composite transposable elements that are mobilized by non-long terminal repeat (non-LTR) retrotransposons, also called long interspersed elements (LINEs). The 3′ part of SINEs usually originated from that of counterpart non-LTR retrotransposons. The 5′ part of SINEs mostly originated from small RNA genes. SINE1 is a group of SINEs whose 5′ part originated from 7SL RNA, and is represented by primate Alu and murine B1. Well-defined SINE1 has been found only from Euarchontoglires, a group of mammals, in contrast to the wide distribution of SINE2, which has a tRNA-derived sequence, from animals to plants to protists. Both Alu and B1 are mobilized by L1-type non-LTR retrotransposons, which are the only lineage of autonomous non-LTR retrotransposons active in these mammalian lineages. Results Here a new lineage of SINE1 is characterized from the seashore hagfish Eptatretus burgeri genome. This SINE1 family, designated SINE1-1_EBu, is young, and is transposed by RTE-type non-LTR retrotransposon, not L1-type. Comparison with other SINE families from hagfish indicated the birth of SINE1-1_EBu through chimera formation of a 7SL RNA-derived sequence and an older tRNA-derived SINE family. It reveals parallel evolution of SINE1 in two vertebrate lineages with different autonomous non-LTR retrotransposon partners. The comparison between two SINE1 lineages supports that the RNA secondary structure of the Alu domain of 7SL RNA is required for the efficient retrotransposition. Conclusions The hagfish SINE1 is the first evident SINE1 family found outside of Euarchontoglires. Independent evolution of SINE1 with similar RNA secondary structure originated in 7SL RNA indicates the functional importance of 7SL RNA-derived sequence in the proliferation of SINEs.
Collapse
Affiliation(s)
- Kenji K Kojima
- Genetic Information Research Institute, Cupertino, CA 95014 USA
| |
Collapse
|
9
|
Zhang D, Zhou J, Gao J, Wu RY, Huang YL, Jin QW, Chen JS, Tang WZ, Yan LH. Targeting snoRNAs as an emerging method of therapeutic development for cancer. Am J Cancer Res 2019; 9:1504-1516. [PMID: 31497339 PMCID: PMC6726984] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2019] [Accepted: 05/18/2019] [Indexed: 06/10/2023] Open
Abstract
The relevance of the dysregulation of snoRNAs in human cancer has been widely investigated and has challenged the view that snoRNAs merely function as house-keeping genes for the posttranscriptional modification of rRNAs. Accumulating evidence has shown the intimate connection between snoRNAs and proliferation, apoptosis, invasion and migration of tumor cells via manual intervention patterns of snoRNA expression. In this review, we focused on how snoRNAs are dysregulated and its regulation of the formation and development of cancer. We summarized the non-classical functions of snoRNAs in the context of their regulations of the signaling pathways involving PI3K-AKT and K-Ras and p53-dependant manner. Under these novel functions and characteristics, snoRNAs can act as potential and feasible biomarkers for diagnosis. Simultaneously, these promising therapeutic strategies should be considered to counteract the perturbations of snoRNAs.
Collapse
Affiliation(s)
- Di Zhang
- Department of Gastrointestinal Surgery, Affiliated Tumor Hospital of Guangxi Medical UniversityNanning 530021, Guangxi Zhuang Autonomous Region, China
- Guangxi Clinical Research Center for Colorectal CancerNanning 530021, Guangxi Zhuang Autonomous Region, China
| | - Juan Zhou
- Department of Gynecological Surgery, Affiliated Tumor Hospital of Guangxi Medical UniversityNanning 530021, Guangxi Zhuang Autonomous Region, China
| | - Jie Gao
- Department of Gastrointestinal Surgery, Affiliated Tumor Hospital of Guangxi Medical UniversityNanning 530021, Guangxi Zhuang Autonomous Region, China
- Guangxi Clinical Research Center for Colorectal CancerNanning 530021, Guangxi Zhuang Autonomous Region, China
| | - Ri-Ying Wu
- Department of Gastrointestinal Surgery, Affiliated Tumor Hospital of Guangxi Medical UniversityNanning 530021, Guangxi Zhuang Autonomous Region, China
- Guangxi Clinical Research Center for Colorectal CancerNanning 530021, Guangxi Zhuang Autonomous Region, China
| | - Ying-Long Huang
- Department of Gastrointestinal Surgery, Affiliated Tumor Hospital of Guangxi Medical UniversityNanning 530021, Guangxi Zhuang Autonomous Region, China
- Guangxi Clinical Research Center for Colorectal CancerNanning 530021, Guangxi Zhuang Autonomous Region, China
| | - Qin-Wen Jin
- Department of Gastrointestinal Surgery, Affiliated Tumor Hospital of Guangxi Medical UniversityNanning 530021, Guangxi Zhuang Autonomous Region, China
- Guangxi Clinical Research Center for Colorectal CancerNanning 530021, Guangxi Zhuang Autonomous Region, China
| | - Jian-Si Chen
- Department of Gastrointestinal Surgery, Affiliated Tumor Hospital of Guangxi Medical UniversityNanning 530021, Guangxi Zhuang Autonomous Region, China
- Guangxi Clinical Research Center for Colorectal CancerNanning 530021, Guangxi Zhuang Autonomous Region, China
| | - Wei-Zhong Tang
- Department of Gastrointestinal Surgery, Affiliated Tumor Hospital of Guangxi Medical UniversityNanning 530021, Guangxi Zhuang Autonomous Region, China
- Guangxi Clinical Research Center for Colorectal CancerNanning 530021, Guangxi Zhuang Autonomous Region, China
| | - Lin-Hai Yan
- Department of Gastrointestinal Surgery, Affiliated Tumor Hospital of Guangxi Medical UniversityNanning 530021, Guangxi Zhuang Autonomous Region, China
- Guangxi Clinical Research Center for Colorectal CancerNanning 530021, Guangxi Zhuang Autonomous Region, China
| |
Collapse
|
10
|
Lui LM, Uzilov AV, Bernick DL, Corredor A, Lowe TM, Dennis PP. Methylation guide RNA evolution in archaea: structure, function and genomic organization of 110 C/D box sRNA families across six Pyrobaculum species. Nucleic Acids Res 2019; 46:5678-5691. [PMID: 29771354 PMCID: PMC6009581 DOI: 10.1093/nar/gky284] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2017] [Accepted: 04/23/2018] [Indexed: 12/21/2022] Open
Abstract
Archaeal homologs of eukaryotic C/D box small nucleolar RNAs (C/D box sRNAs) guide precise 2′-O-methyl modification of ribosomal and transfer RNAs. Although C/D box sRNA genes constitute one of the largest RNA gene families in archaeal thermophiles, most genomes have incomplete sRNA gene annotation because reliable, fully automated detection methods are not available. We expanded and curated a comprehensive gene set across six species of the crenarchaeal genus Pyrobaculum, particularly rich in C/D box sRNA genes. Using high-throughput small RNA sequencing, specialized computational searches and comparative genomics, we analyzed 526 Pyrobaculum C/D box sRNAs, organizing them into 110 families based on synteny and conservation of guide sequences which determine methylation targets. We examined gene duplications and rearrangements, including one family that has expanded in a pattern similar to retrotransposed repetitive elements in eukaryotes. New training data and inclusion of kink-turn secondary structural features enabled creation of an improved search model. Our analyses provide the most comprehensive, dynamic view of C/D box sRNA evolutionary history within a genus, in terms of modification function, feature plasticity, and gene mobility.
Collapse
Affiliation(s)
- Lauren M Lui
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | - Andrew V Uzilov
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | - David L Bernick
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | - Andrea Corredor
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | - Todd M Lowe
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | - Patrick P Dennis
- Department of Biology, Whitman College, Walla Walla, WA 99362, USA
| |
Collapse
|
11
|
Hoeppner MP, Denisenko E, Gardner PP, Schmeier S, Poole AM. An Evaluation of Function of Multicopy Noncoding RNAs in Mammals Using ENCODE/FANTOM Data and Comparative Genomics. Mol Biol Evol 2019; 35:1451-1462. [PMID: 29617896 PMCID: PMC5967550 DOI: 10.1093/molbev/msy046] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
Mammalian diversification has coincided with a rapid proliferation of various types of noncoding RNAs, including members of both snRNAs and snoRNAs. The significance of this expansion however remains obscure. While some ncRNA copy-number expansions have been linked to functionally tractable effects, such events may equally likely be neutral, perhaps as a result of random retrotransposition. Hindering progress in our understanding of such observations is the difficulty in establishing function for the diverse features that have been identified in our own genome. Projects such as ENCODE and FANTOM have revealed a hidden world of genomic expression patterns, as well as a host of other potential indicators of biological function. However, such projects have been criticized, particularly from practitioners in the field of molecular evolution, where many suspect these data provide limited insight into biological function. The molecular evolution community has largely taken a skeptical view, thus it is important to establish tests of function. We use a range of data, including data drawn from ENCODE and FANTOM, to examine the case for function for the recent copy number expansion in mammals of six evolutionarily ancient RNA families involved in splicing and rRNA maturation. We use several criteria to assess evidence for function: conservation of sequence and structure, genomic synteny, evidence for transposition, and evidence for species-specific expression. Applying these criteria, we find that only a minority of loci show strong evidence for function and that, for the majority, we cannot reject the null hypothesis of no function.
Collapse
Affiliation(s)
- Marc P Hoeppner
- Institute of Clinical Molecular Biology, Christian-Albrechts-University of Kiel, Kiel, Germany
| | - Elena Denisenko
- Institute of Natural and Mathematical Sciences, Massey University, Auckland, New Zealand
| | - Paul P Gardner
- Biomolecular Interaction Centre, School of Biological Sciences, University of Canterbury, Christchurch, New Zealand
| | - Sebastian Schmeier
- Institute of Natural and Mathematical Sciences, Massey University, Auckland, New Zealand
| | - Anthony M Poole
- Bioinformatics Institute, School of Biological Sciences, University of Auckland, Auckland, New Zealand
| |
Collapse
|
12
|
Schrader L, Schmitz J. The impact of transposable elements in adaptive evolution. Mol Ecol 2018; 28:1537-1549. [PMID: 30003608 DOI: 10.1111/mec.14794] [Citation(s) in RCA: 151] [Impact Index Per Article: 25.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2018] [Accepted: 07/06/2018] [Indexed: 12/16/2022]
Abstract
The growing knowledge about the influence of transposable elements (TEs) on (a) long-term genome and transcriptome evolution; (b) genomic, transcriptomic and epigenetic variation within populations; and (c) patterns of somatic genetic differences in individuals continues to spur the interest of evolutionary biologists in the role of TEs in adaptive evolution. As TEs can trigger a broad range of molecular variation in a population with potentially severe fitness and phenotypic consequences for individuals, different mechanisms evolved to keep TE activity in check, allowing for a dynamic interplay between the host, its TEs and the environment in evolution. Here, we review evidence for adaptive phenotypic changes associated with TEs and the basic molecular mechanisms by which the underlying genetic changes arise: (a) domestication, (b) exaptation, (c) host gene regulation, (d) TE-mediated formation of intronless gene copies-so-called retrogenes and (e) overall increased genome plasticity. Furthermore, we review and discuss how the stress-dependent incapacitation of defence mechanisms against the activity of TEs might facilitate adaptive responses to environmental challenges and how such mechanisms might be particularly relevant in species frequently facing novel environments, such as invasive, pathogenic or parasitic species.
Collapse
Affiliation(s)
- Lukas Schrader
- Institute for Evolution and Biodiversity (IEB), University of Münster, Münster, Germany
| | - Jürgen Schmitz
- Institute of Experimental Pathology, University of Münster, Münster, Germany
| |
Collapse
|
13
|
Luchetti A, Plazzi F, Mantovani B. Evolution of Two Short Interspersed Elements in Callorhinchus milii (Chondrichthyes, Holocephali) and Related Elements in Sharks and the Coelacanth. Genome Biol Evol 2017; 9:3824762. [PMID: 28505260 PMCID: PMC5499810 DOI: 10.1093/gbe/evx094] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/11/2017] [Indexed: 12/11/2022] Open
Abstract
Short interspersed elements (SINEs) are non-autonomous retrotransposons. Although they usually show fast evolutionary rates, in some instances highly conserved domains (HCDs) have been observed in elements with otherwise divergent sequences and from distantly related species. Here, we document the life history of two HCD-SINE families in the elephant shark Callorhinchus milii, one specific to the holocephalan lineage (CmiSINEs) and another one (SacSINE1-CM) with homologous elements in sharks and the coelacanth (SacSINE1s, LmeSINE1s). The analyses of their relationships indicated that these elements share the same 3′-tail, which would have allowed both elements to rise to high copy number by exploiting the C. milii L2-2_CM long interspersed element (LINE) enzymes. Molecular clock analysis on SINE activity in C. milii genome evidenced two replication bursts occurring right after two major events in the holocephalan evolution: the end-Permian mass extinction and the radiation of modern Holocephali. Accordingly, the same analysis on the coelacanth homologous elements, LmeSINE1, identified a replication wave close to the split age of the two extant Latimeria species. The genomic distribution of the studied SINEs pointed out contrasting results: some elements were preferentially sorted out from gene regions, but accumulated in flanking regions, while others appear more conserved within genes. Moreover, data from the C. milii transcriptome suggest that these SINEs could be involved in miRNA biogenesis and may be targets for miRNA-based regulation.
Collapse
Affiliation(s)
- Andrea Luchetti
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali - Università di Bologna, Italy
| | - Federico Plazzi
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali - Università di Bologna, Italy
| | - Barbara Mantovani
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali - Università di Bologna, Italy
| |
Collapse
|
14
|
Panni S, Prakash A, Bateman A, Orchard S. The yeast noncoding RNA interaction network. RNA (NEW YORK, N.Y.) 2017; 23:1479-1492. [PMID: 28701522 PMCID: PMC5602107 DOI: 10.1261/rna.060996.117] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/06/2017] [Accepted: 06/26/2017] [Indexed: 06/07/2023]
Abstract
This article describes the creation of the first expert manually curated noncoding RNA interaction networks for S. cerevisiae The RNA-RNA and RNA-protein interaction networks have been carefully extracted from the experimental literature and made available through the IntAct database (www.ebi.ac.uk/intact). We provide an initial network analysis and compare their properties to the much larger protein-protein interaction network. We find that the proteins that bind to ncRNAs in the network contain only a small proportion of classical RNA binding domains. We also see an enrichment of WD40 domains suggesting their direct involvement in ncRNA interactions. We discuss the challenges in collecting noncoding RNA interaction data and the opportunities for worldwide collaboration to fill the unmet need for this data.
Collapse
Affiliation(s)
- Simona Panni
- Università della Calabria, Dipartimento di Biologia, Ecologia e Scienze della Terra, Rende 87036, Italy
| | - Ananth Prakash
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Alex Bateman
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Sandra Orchard
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| |
Collapse
|
15
|
Sotero-Caio CG, Platt RN, Suh A, Ray DA. Evolution and Diversity of Transposable Elements in Vertebrate Genomes. Genome Biol Evol 2017; 9:161-177. [PMID: 28158585 PMCID: PMC5381603 DOI: 10.1093/gbe/evw264] [Citation(s) in RCA: 161] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/06/2016] [Indexed: 12/21/2022] Open
Abstract
Transposable elements (TEs) are selfish genetic elements that mobilize in genomes via transposition or retrotransposition and often make up large fractions of vertebrate genomes. Here, we review the current understanding of vertebrate TE diversity and evolution in the context of recent advances in genome sequencing and assembly techniques. TEs make up 4-60% of assembled vertebrate genomes, and deeply branching lineages such as ray-finned fishes and amphibians generally exhibit a higher TE diversity than the more recent radiations of birds and mammals. Furthermore, the list of taxa with exceptional TE landscapes is growing. We emphasize that the current bottleneck in genome analyses lies in the proper annotation of TEs and provide examples where superficial analyses led to misleading conclusions about genome evolution. Finally, recent advances in long-read sequencing will soon permit access to TE-rich genomic regions that previously resisted assembly including the gigantic, TE-rich genomes of salamanders and lungfishes.
Collapse
Affiliation(s)
| | - Roy N. Platt
- Department of Biological Sciences, Texas Tech University, Lubbock, TX
| | - Alexander Suh
- Department of Evolutionary Biology (EBC), Uppsala University, Uppsala, Sweden
| | - David A. Ray
- Department of Biological Sciences, Texas Tech University, Lubbock, TX
| |
Collapse
|
16
|
de Araujo Oliveira JV, Costa F, Backofen R, Stadler PF, Machado Telles Walter ME, Hertel J. SnoReport 2.0: new features and a refined Support Vector Machine to improve snoRNA identification. BMC Bioinformatics 2016; 17:464. [PMID: 28105919 PMCID: PMC5249026 DOI: 10.1186/s12859-016-1345-6] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/12/2024] Open
Abstract
Background
snoReport uses RNA secondary structure prediction combined with machine learning as the basis to identify the two main classes of small nucleolar RNAs, the box H/ACA snoRNAs and the box C/D snoRNAs. Here, we present snoReport 2.0, which substantially improves and extends in the original method by: extracting new features for both box C/D and H/ACA box snoRNAs; developing a more sophisticated technique in the SVM training phase with recent data from vertebrate organisms and a careful choice of the SVM parameters C and γ; and using updated versions of tools and databases used for the construction of the original version of snoReport. To validate the new version and to demonstrate its improved performance, we tested snoReport 2.0 in different organisms. Results Results of the training and test phases of boxes H/ACA and C/D snoRNAs, in both versions of snoReport, are discussed. Validation on real data was performed to evaluate the predictions of snoReport 2.0. Our program was applied to a set of previously annotated sequences, some of them experimentally confirmed, of humans, nematodes, drosophilids, platypus, chickens and leishmania. We significantly improved the predictions for vertebrates, since the training phase used information of these organisms, but H/ACA box snoRNAs identification was improved for the other ones. Conclusion We presented snoReport 2.0, to predict H/ACA box and C/D box snoRNAs, an efficient method to find true positives and avoid false positives in vertebrate organisms. H/ACA box snoRNA classifier showed an F-score of 93 % (an improvement of 10 % regarding the previous version), while C/D box snoRNA classifier, an F-Score of 94 % (improvement of 14 %). Besides, both classifiers exhibited performance measures above 90 %. These results show that snoReport 2.0 avoid false positives and false negatives, allowing to predict snoRNAs with high quality. In the validation phase, snoReport 2.0 predicted 67.43 % of vertebrate organisms for both classes. For Nematodes and Drosophilids, 69 % and 76.67 %, for H/ACA box snoRNAs were predicted, respectively, showing that snoReport 2.0 is good to identify snoRNAs in vertebrates and also H/ACA box snoRNAs in invertebrates organisms. Electronic supplementary material The online version of this article (doi:10.1186/s12859-016-1345-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | - Fabrizio Costa
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, Georges-Köhler-Allee 106, Freiburg, 79110, Germany
| | - Rolf Backofen
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, Georges-Köhler-Allee 106, Freiburg, 79110, Germany
| | - Peter Florian Stadler
- Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, University of Leipzig, Haertelstraße 16-18, Leipzig, D-04107, Germany.,German Centre for Integrative Biodiversity Research (iDiv), Halle-Jena-Leipzig, Germany.,Institute for Theoretical Chemistry, University of Vienna, Währingerstraße 17, Vienna, A-1090, Austria.,Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, Frederiksberg, DK-1870, Denmark.,Max Planck Institute for Mathematics in the Sciences, Inselstraße 22, Leipzig, D-04103, Germany.,RNomics Group, Fraunhofer Institut for Cell Therapy and Immunology, Perlickstraße 1, Leipzig, D-04103, Germany.,Santa Fe Institute, 1399 Hyde Park Rd., Santa Fe, NM87501, USA.,Young Investigators Group Bioinformatics & Transcriptomics, Helmholtz Centre for Environmental Research - UFZ, Permoserstraße 15, Leipzig, D-04318, Germany
| | | | - Jana Hertel
- Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, University of Leipzig, Haertelstraße 16-18, Leipzig, D-04107, Germany
| |
Collapse
|
17
|
Patra Bhattacharya D, Canzler S, Kehr S, Hertel J, Grosse I, Stadler PF. Phylogenetic distribution of plant snoRNA families. BMC Genomics 2016; 17:969. [PMID: 27881081 PMCID: PMC5122169 DOI: 10.1186/s12864-016-3301-2] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2016] [Accepted: 11/15/2016] [Indexed: 12/11/2022] Open
Abstract
Background Small nucleolar RNAs (snoRNAs) are one of the most ancient families amongst non-protein-coding RNAs. They are ubiquitous in Archaea and Eukarya but absent in bacteria. Their main function is to target chemical modifications of ribosomal RNAs. They fall into two classes, box C/D snoRNAs and box H/ACA snoRNAs, which are clearly distinguished by conserved sequence motifs and the type of chemical modification that they govern. Similarly to microRNAs, snoRNAs appear in distinct families of homologs that affect homologous targets. In animals, snoRNAs and their evolution have been studied in much detail. In plants, however, their evolution has attracted comparably little attention. Results In order to chart the phylogenetic distribution of individual snoRNA families in plants, we applied a sophisticated approach for identifying homologs of known plant snoRNAs across the plant kingdom. In response to the relatively fast evolution of snoRNAs, information on conserved sequence boxes, target sequences, and secondary structure is combined to identify additional snoRNAs. We identified 296 families of snoRNAs in 24 species and traced their evolution throughout the plant kingdom. Many of the plant snoRNA families comprise paralogs. We also found that targets are well-conserved for most snoRNA families. Conclusions The sequence conservation of snoRNAs is sufficient to establish homologies between phyla. The degree of this conservation tapers off, however, between land plants and algae. Plant snoRNAs are frequently organized in highly conserved spatial clusters. As a resource for further investigations we provide carefully curated and annotated alignments for each snoRNA family under investigation. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-3301-2) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Deblina Patra Bhattacharya
- Bioinformatics Group, Dept. Computer Science, and artin-Luther-Universität Halle-Wittenberg, Leipzig, D-04107, Germany.,Institut für Informatik, Halle (Saale), D-06120, Germany
| | - Sebastian Canzler
- Bioinformatics Group, Dept. Computer Science, and artin-Luther-Universität Halle-Wittenberg, Leipzig, D-04107, Germany
| | - Stephanie Kehr
- Bioinformatics Group, Dept. Computer Science, and artin-Luther-Universität Halle-Wittenberg, Leipzig, D-04107, Germany
| | - Jana Hertel
- Young Investigators Group Bioinformatics & Transcriptomics, Helmholtz Centre for Environmental Research - UFZ, Permoserstrasse 15, Leipzig, D-04318, Germany
| | - Ivo Grosse
- Institut für Informatik, Halle (Saale), D-06120, Germany.,German Centre for Integrative Biodiversity Research (iDiv), Halle-Jena-Leipzig, Leipzig, Germany
| | - Peter F Stadler
- Bioinformatics Group, Dept. Computer Science, and artin-Luther-Universität Halle-Wittenberg, Leipzig, D-04107, Germany. .,Max Planck Institute for Mathematics in the Sciences, Inselstraße 22, Leipzig, D-04103, Germany. .,Fraunhofer Institute for Cell Therapy and Immunology, Perlickstrasse 1, Leipzig, D-04103, Germany. .,Department of Theoretical Chemistry of the University of Vienna, Währingerstrasse 17, Leipzig, A-1090, Germany. .,Center for RNA in Technology and Health, Univ. Copenhagen, Grønnegårdsvej 3, Frederiksberg C, Copenhagen, Denmark. .,Santa Fe Institute, 1399 Hyde Park Road, Santa Fe, NM 87501, USA. .,German Centre for Integrative Biodiversity Research (iDiv), Halle-Jena-Leipzig, Leipzig, Germany.
| |
Collapse
|
18
|
Nitsche A, Stadler PF. Evolutionary clues in lncRNAs. WILEY INTERDISCIPLINARY REVIEWS-RNA 2016; 8. [PMID: 27436689 DOI: 10.1002/wrna.1376] [Citation(s) in RCA: 43] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/24/2016] [Revised: 06/06/2016] [Accepted: 06/09/2016] [Indexed: 12/13/2022]
Abstract
The diversity of long non-coding RNAs (lncRNAs) in the human transcriptome is in stark contrast to the sparse exploration of their functions concomitant with their conservation and evolution. The pervasive transcription of the largely non-coding human genome makes the evolutionary age and conservation patterns of lncRNAs to a topic of interest. Yet it is a fairly unexplored field and not that easy to determine as for protein-coding genes. Although there are a few experimentally studied cases, which are conserved at the sequence level, most lncRNAs exhibit weak or untraceable primary sequence conservation. Recent studies shed light on the interspecies conservation of secondary structures among lncRNA homologs by using diverse computational methods. This highlights the importance of structure on functionality of lncRNAs as opposed to the poor impact of primary sequence changes. Further clues in the evolution of lncRNAs are given by selective constraints on non-coding gene structures (e.g., promoters or splice sites) as well as the conservation of prevalent spatio-temporal expression patterns. However, a rapid evolutionary turnover is observable throughout the heterogeneous group of lncRNAs. This still gives rise to questions about its functional meaning. WIREs RNA 2017, 8:e1376. doi: 10.1002/wrna.1376 For further resources related to this article, please visit the WIREs website.
Collapse
Affiliation(s)
- Anne Nitsche
- Bioinformatics Group, Department of Computer Science, University Leipzig, Leipzig, Germany.,Institute de Biologie Moléculaire et Cellulaire, Université de Strasbourg, Cedex, France
| | - Peter F Stadler
- Bioinformatics Group, Department of Computer Science, University Leipzig, Leipzig, Germany.,Interdisciplinary Center for Bioinformatics, University Leipzig, Leipzig, Germany.,Max Planck Institute for Mathematics in the Sciences, Leipzig, Germany.,Department of Diagnostics, Fraunhofer Institute for Cell Therapy and Immunology - IZI, Leipzig, Germany.,Center for Non-Coding RNA in Technology and Health, University of Copenhagen, Frederiksberg, Denmark.,Department of Theoretical Chemistry, University of Vienna, Wien, Austria.,Santa Fe Institute, Santa Fe, NM, USA
| |
Collapse
|
19
|
Godakova SA, Sevast'yanova GA, Semyenova SK. [STRUCTURE AND DISTRIBUTION OF THE RETROTRANSPOSON BOV-B LINE]. MOLECULAR GENETICS MICROBIOLOGY AND VIROLOGY 2016; 34:9-12. [PMID: 27183715 DOI: 10.3103/s0891416816010043] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
The classification of mobile elements was discussed. Special attention was devoted to the retroelement of the LINE group: retrotransposon Bov-B LINE. The history of its origin and distribution in the nature was considered. The results of the phenomenon of horizontal transition of the retrotransposon Bov-B LINE between evolutionally distant classes were discussed.
Collapse
|
20
|
Pignatelli M, Vilella AJ, Muffato M, Gordon L, White S, Flicek P, Herrero J. ncRNA orthologies in the vertebrate lineage. Database (Oxford) 2016; 2016:bav127. [PMID: 26980512 PMCID: PMC4792531 DOI: 10.1093/database/bav127] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2015] [Revised: 12/23/2015] [Accepted: 12/23/2015] [Indexed: 01/07/2023]
Abstract
Annotation of orthologous and paralogous genes is necessary for many aspects of evolutionary analysis. Methods to infer these homology relationships have traditionally focused on protein-coding genes and evolutionary models used by these methods normally assume the positions in the protein evolve independently. However, as our appreciation for the roles of non-coding RNA genes has increased, consistently annotated sets of orthologous and paralogous ncRNA genes are increasingly needed. At the same time, methods such as PHASE or RAxML have implemented substitution models that consider pairs of sites to enable proper modelling of the loops and other features of RNA secondary structure. Here, we present a comprehensive analysis pipeline for the automatic detection of orthologues and paralogues for ncRNA genes. We focus on gene families represented in Rfam and for which a specific covariance model is provided. For each family ncRNA genes found in all Ensembl species are aligned using Infernal, and several trees are built using different substitution models. In parallel, a genomic alignment that includes the ncRNA genes and their flanking sequence regions is built with PRANK. This alignment is used to create two additional phylogenetic trees using the neighbour-joining (NJ) and maximum-likelihood (ML) methods. The trees arising from both the ncRNA and genomic alignments are merged using TreeBeST, which reconciles them with the species tree in order to identify speciation and duplication events. The final tree is used to infer the orthologues and paralogues following Fitch's definition. We also determine gene gain and loss events for each family using CAFE. All data are accessible through the Ensembl Comparative Genomics ('Compara') API, on our FTP site and are fully integrated in the Ensembl genome browser, where they can be accessed in a user-friendly manner. Database URL: http://www.ensembl.org.
Collapse
Affiliation(s)
- Miguel Pignatelli
- European Molecular Biology Laboratory, European Bioinformatics Institute
| | - Albert J Vilella
- European Molecular Biology Laboratory, European Bioinformatics Institute
| | - Matthieu Muffato
- European Molecular Biology Laboratory, European Bioinformatics Institute
| | - Leo Gordon
- European Molecular Biology Laboratory, European Bioinformatics Institute
| | - Simon White
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Javier Herrero
- European Molecular Biology Laboratory, European Bioinformatics Institute UCL Cancer Institute, University College London, London WC1E 6BT, UK
| |
Collapse
|
21
|
Dupuis-Sandoval F, Poirier M, Scott MS. The emerging landscape of small nucleolar RNAs in cell biology. WILEY INTERDISCIPLINARY REVIEWS-RNA 2015; 6:381-97. [PMID: 25879954 PMCID: PMC4696412 DOI: 10.1002/wrna.1284] [Citation(s) in RCA: 155] [Impact Index Per Article: 17.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/16/2015] [Revised: 03/18/2015] [Accepted: 03/20/2015] [Indexed: 01/07/2023]
Abstract
Small nucleolar RNAs (snoRNAs) are a large class of small noncoding RNAs present in all eukaryotes sequenced thus far. As a family, they have been well characterized as playing a central role in ribosome biogenesis, guiding either the sequence-specific chemical modification of pre-rRNA (ribosomal RNA) or its processing. However, in higher eukaryotes, numerous orphan snoRNAs were described over a decade ago, with no known target or ascribed function, suggesting the possibility of alternative cellular functionality. In recent years, thanks in great part to advances in sequencing methodologies, we have seen many examples of the diversity that exists in the snoRNA family on multiple levels. In this review, we discuss the identification of novel snoRNA members, of unexpected binding partners, as well as the clarification and extension of the snoRNA target space and the characterization of diverse new noncanonical functions, painting a new and extended picture of the snoRNA landscape. Under the deluge of novel features and functions that have recently come to light, snoRNAs emerge as a central, dynamic, and highly versatile group of small regulatory RNAs. WIREs RNA 2015, 6:381–397. doi: 10.1002/wrna.1284
Collapse
Affiliation(s)
- Fabien Dupuis-Sandoval
- Biochemistry Department, Faculty of Medicine and Health Sciences, University of Sherbrooke, Sherbrooke, Canada
| | - Mikaël Poirier
- Biochemistry Department, Faculty of Medicine and Health Sciences, University of Sherbrooke, Sherbrooke, Canada
| | - Michelle S Scott
- Biochemistry Department and RNA Group, Faculty of Medicine and Health Sciences, University of Sherbrooke, Sherbrooke, Canada
| |
Collapse
|
22
|
Gardner PP, Fasold M, Burge SW, Ninova M, Hertel J, Kehr S, Steeves TE, Griffiths-Jones S, Stadler PF. Conservation and losses of non-coding RNAs in avian genomes. PLoS One 2015; 10:e0121797. [PMID: 25822729 PMCID: PMC4378963 DOI: 10.1371/journal.pone.0121797] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2014] [Accepted: 02/03/2015] [Indexed: 11/21/2022] Open
Abstract
Here we present the results of a large-scale bioinformatics annotation of non-coding RNA loci in 48 avian genomes. Our approach uses probabilistic models of hand-curated families from the Rfam database to infer conserved RNA families within each avian genome. We supplement these annotations with predictions from the tRNA annotation tool, tRNAscan-SE and microRNAs from miRBase. We identify 34 lncRNA-associated loci that are conserved between birds and mammals and validate 12 of these in chicken. We report several intriguing cases where a reported mammalian lncRNA, but not its function, is conserved. We also demonstrate extensive conservation of classical ncRNAs (e.g., tRNAs) and more recently discovered ncRNAs (e.g., snoRNAs and miRNAs) in birds. Furthermore, we describe numerous “losses” of several RNA families, and attribute these to either genuine loss, divergence or missing data. In particular, we show that many of these losses are due to the challenges associated with assembling avian microchromosomes. These combined results illustrate the utility of applying homology-based methods for annotating novel vertebrate genomes.
Collapse
Affiliation(s)
- Paul P. Gardner
- School of Biological Sciences, University of Canterbury, Christchurch, New Zealand
- Biomolecular Interaction Centre, University of Canterbury, Christchurch, New Zealand
- * E-mail:
| | - Mario Fasold
- Bioinformatics Group, Department of Computer Science; and Interdisciplinary Center for Bioinformatics, University of Leipzig, Härtelstrasse 16-18, D-04107 Leipzig, Germany
- ecSeq Bioinformatics, Brandvorwerkstr.43, D-04275 Leipzig, Germany
| | - Sarah W. Burge
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, CB10 1SD, UK
| | - Maria Ninova
- Faculty of Life Sciences, University of Manchester, Manchester, United Kingdom
| | - Jana Hertel
- Bioinformatics Group, Department of Computer Science; and Interdisciplinary Center for Bioinformatics, University of Leipzig, Härtelstrasse 16-18, D-04107 Leipzig, Germany
| | - Stephanie Kehr
- Bioinformatics Group, Department of Computer Science; and Interdisciplinary Center for Bioinformatics, University of Leipzig, Härtelstrasse 16-18, D-04107 Leipzig, Germany
| | - Tammy E. Steeves
- School of Biological Sciences, University of Canterbury, Christchurch, New Zealand
| | - Sam Griffiths-Jones
- Faculty of Life Sciences, University of Manchester, Manchester, United Kingdom
| | - Peter F. Stadler
- Bioinformatics Group, Department of Computer Science; and Interdisciplinary Center for Bioinformatics, University of Leipzig, Härtelstrasse 16-18, D-04107 Leipzig, Germany
- Max Planck Institute for Mathematics in the Sciences, Inselstraße 22, D-04103 Leipzig, Germany
- Fraunhofer Institute for Cell Therapy and Immunology, Perlickstrasse 1, D-04103 Leipzig, Germany
- Department of Theoretical Chemistry of the University of Vienna, Währingerstrasse 17, A-1090 Vienna, Austria
- Center for RNA in Technology and Health, Univ. Copenhagen, Grønnegårdsvej 3, Frederiksberg C, Denmark
- Santa Fe Institute, 1399 Hyde Park Road, Santa Fe NM 87501, USA
- German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Germany
| |
Collapse
|
23
|
Noll A, Raabe CA, Churakov G, Brosius J, Schmitz J. Ancient traces of tailless retropseudogenes in therian genomes. Genome Biol Evol 2015; 7:889-900. [PMID: 25724209 PMCID: PMC5322556 DOI: 10.1093/gbe/evv040] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Transposable elements, once described by Barbara McClintock as controlling genetic units, not only occupy the largest part of our genome but are also a prominent moving force of genomic plasticity and innovation. They usually replicate and reintegrate into genomes silently, sometimes causing malfunctions or misregulations, but occasionally millions of years later, a few may evolve into new functional units. Retrotransposons make their way into the genome following reverse transcription of RNA molecules and chromosomal insertion. In therian mammals, long interspersed elements 1 (LINE1s) self-propagate but also coretropose many RNAs, including mRNAs and small RNAs that usually exhibit an oligo(A) tail. The revitalization of specific LINE1 elements in the mammalian lineage about 150 Ma parallels the rise of many other nonautonomous mobilized genomic elements. We previously identified and described hundreds of tRNA-derived retropseudogenes missing characteristic oligo(A) tails consequently termed tailless retropseudogenes. Additional analyses now revealed hundreds of thousands of tailless retropseudogenes derived from nearly all types of RNAs. We extracted 2,402 perfect tailless sequences (with discernible flanking target site duplications) originating from tRNAs, spliceosomal RNAs, 5S rRNAs, 7SK RNAs, mRNAs, and others. Interestingly, all are truncated at one or more defined positions that coincide with internal single-stranded regions. 5S ribosomal and U2 spliceosomal RNAs were analyzed in the context of mammalian phylogeny to discern the origin of the therian LINE1 retropositional system that evolved in our 150-Myr-old ancestor.
Collapse
Affiliation(s)
- Angela Noll
- Institute of Experimental Pathology, ZMBE, University of Münster, Germany
| | - Carsten A Raabe
- Institute of Experimental Pathology, ZMBE, University of Münster, Germany
| | - Gennady Churakov
- Institute of Experimental Pathology, ZMBE, University of Münster, Germany Institute of Evolution and Biodiversity, University of Münster, Germany
| | - Jürgen Brosius
- Institute of Experimental Pathology, ZMBE, University of Münster, Germany Institute of Evolutionary and Medical Genomics, Brandenburg Medical School, Neuruppin, Germany
| | - Jürgen Schmitz
- Institute of Experimental Pathology, ZMBE, University of Münster, Germany
| |
Collapse
|
24
|
Machyna M, Kehr S, Straube K, Kappei D, Buchholz F, Butter F, Ule J, Hertel J, Stadler PF, Neugebauer KM. The coilin interactome identifies hundreds of small noncoding RNAs that traffic through Cajal bodies. Mol Cell 2014; 56:389-399. [PMID: 25514182 DOI: 10.1016/j.molcel.2014.10.004] [Citation(s) in RCA: 75] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2014] [Revised: 08/25/2014] [Accepted: 10/02/2014] [Indexed: 12/21/2022]
Abstract
Coilin protein scaffolds Cajal bodies (CBs)-subnuclear compartments enriched in small nuclear RNAs (snRNAs)-and promotes efficient spliceosomal snRNP assembly. The molecular function of coilin, which is intrinsically disordered with no defined motifs, is poorly understood. We use UV crosslinking and immunoprecipitation (iCLIP) to determine whether mammalian coilin binds RNA in vivo and to identify targets. Robust detection of snRNA transcripts correlated with coilin ChIP-seq peaks on snRNA genes, indicating that coilin binding to nascent snRNAs is a site-specific CB nucleator. Surprisingly, several hundred small nucleolar RNAs (snoRNAs) were identified as coilin interactors, including numerous unannotated mouse and human snoRNAs. We show that all classes of snoRNAs concentrate in CBs. Moreover, snoRNAs lacking specific CB retention signals traffic through CBs en route to nucleoli, consistent with the role of CBs in small RNP assembly. Thus, coilin couples snRNA and snoRNA biogenesis, making CBs the cellular hub of small ncRNA metabolism.
Collapse
Affiliation(s)
- Martin Machyna
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307 Dresden, Germany; Department of Molecular Biophysics & Biochemistry, Yale University, 333 Cedar Street, New Haven, CT 06520, USA
| | - Stephanie Kehr
- Bioinformatics Group, Department of Computer Science and Interdisciplinary Center for Bioinformatics, University of Leipzig, Haertelstrasse 16-18, 04107 Leipzig, Germany
| | - Korinna Straube
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307 Dresden, Germany; Department of Molecular Biophysics & Biochemistry, Yale University, 333 Cedar Street, New Haven, CT 06520, USA
| | - Dennis Kappei
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307 Dresden, Germany
| | - Frank Buchholz
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307 Dresden, Germany
| | - Falk Butter
- Institute of Molecular Biology (IMB), Ackermannweg 4, 55128 Mainz, Germany
| | - Jernej Ule
- Department of Molecular Neuroscience, UCL Institute of Neurology, Queen Square, London WC1N 3BG, UK
| | - Jana Hertel
- Bioinformatics Group, Department of Computer Science and Interdisciplinary Center for Bioinformatics, University of Leipzig, Haertelstrasse 16-18, 04107 Leipzig, Germany
| | - Peter F Stadler
- Bioinformatics Group, Department of Computer Science and Interdisciplinary Center for Bioinformatics, University of Leipzig, Haertelstrasse 16-18, 04107 Leipzig, Germany
| | - Karla M Neugebauer
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307 Dresden, Germany; Department of Molecular Biophysics & Biochemistry, Yale University, 333 Cedar Street, New Haven, CT 06520, USA.
| |
Collapse
|
25
|
Mo D, Raabe CA, Reinhardt R, Brosius J, Rozhdestvensky TS. Alternative processing as evolutionary mechanism for the origin of novel nonprotein coding RNAs. Genome Biol Evol 2014; 5:2061-71. [PMID: 24132753 PMCID: PMC3845636 DOI: 10.1093/gbe/evt155] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
The evolution of new genes can ensue through either gene duplication and the neofunctionalization of one of the copies or the formation of a de novo gene from hitherto nonfunctional, neutrally evolving intergenic or intronic genomic sequences. Only very rarely are entire genes created de novo. Mostly, nonfunctional sequences are coopted as novel parts of existing genes, such as in the process of exonization whereby introns become exons through changes in splicing. Here, we report a case in which a novel nonprotein coding RNA evolved by intron-sequence recruitment into its structure. cDNAs derived from rat brain small RNAs, revealed a novel small nucleolar RNA (snoRNA) originating from one of the Snord115 copies in the rat Prader–Willi syndrome locus. We suggest that a single-point substitution in the Snord115 region led to the expression of a longer snoRNA variant, designated as L-Snord115. Cell culture and footprinting experiments confirmed that a single nucleotide substitution at Snord115 position 67 destabilized the kink-turn motif within the canonical snoRNA, while distal intronic sequences provided an alternate D-box region. The exapted sequence displays putative base pairing to 28S rRNA and mRNA targets.
Collapse
Affiliation(s)
- Dingding Mo
- Institute of Experimental Pathology, ZMBE, University of Muenster, Muenster, Germany
| | | | | | | | | |
Collapse
|
26
|
Noll A, Grundmann N, Churakov G, Brosius J, Makałowski W, Schmitz J. GPAC-genome presence/absence compiler: a web application to comparatively visualize multiple genome-level changes. Mol Biol Evol 2014; 32:275-86. [PMID: 25261406 DOI: 10.1093/molbev/msu276] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
Our understanding of genome-wide and comparative sequence information has been broadened considerably by the databases available from the University of California Santa Cruz (UCSC) Genome Bioinformatics Department. In particular, the identification and visualization of genomic sequences, present in some species but absent in others, led to fundamental insights into gene and genome evolution. However, the UCSC tools currently enable one to visualize orthologous genomic loci for a range of species in only a single locus. For large-scale comparative analyses of such presence/absence patterns a multilocus view would be more desirable. Such a tool would enable us to compare thousands of relevant loci simultaneously and to resolve many different questions about, for example, phylogeny, specific aspects of genome and gene evolution, such as the gain or loss of exons and introns, the emergence of novel transposed elements, nonprotein-coding RNAs, and viral genomic particles. Here, we present the first tool to facilitate the parallel analysis of thousands of genomic loci for cross-species presence/absence patterns based on multiway genome alignments. This genome presence/absence compiler uses annotated or other compilations of coordinates of genomic locations and compiles all presence/absence patterns in a flexible, color-coded table linked to the individual UCSC Genome Browser alignments. We provide examples of the versatile information content of such a screening system especially for 7SL-derived transposed elements, nuclear mitochondrial DNA, DNA transposons, and miRNAs in primates (http://www.bioinformatics.uni-muenster.de/tools/gpac, last accessed October 1, 2014).
Collapse
Affiliation(s)
- Angela Noll
- Institute of Experimental Pathology, ZMBE, University of Münster, Münster, Germany
| | - Norbert Grundmann
- Institute of Bioinformatics, Faculty of Medicine, University of Münster, Münster, Germany
| | - Gennady Churakov
- Institute of Experimental Pathology, ZMBE, University of Münster, Münster, Germany
| | - Jürgen Brosius
- Institute of Experimental Pathology, ZMBE, University of Münster, Münster, Germany
| | - Wojciech Makałowski
- Institute of Bioinformatics, Faculty of Medicine, University of Münster, Münster, Germany
| | - Jürgen Schmitz
- Institute of Experimental Pathology, ZMBE, University of Münster, Münster, Germany
| |
Collapse
|
27
|
Matylla-Kulinska K, Tafer H, Weiss A, Schroeder R. Functional repeat-derived RNAs often originate from retrotransposon-propagated ncRNAs. WILEY INTERDISCIPLINARY REVIEWS-RNA 2014; 5:591-600. [PMID: 25045147 PMCID: PMC4233971 DOI: 10.1002/wrna.1243] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/20/2013] [Revised: 04/15/2014] [Accepted: 04/22/2014] [Indexed: 12/19/2022]
Abstract
The human genome is scattered with repetitive sequences, and the ENCODE project revealed that 60–70% of the genomic DNA is transcribed into RNA. As a consequence, the human transcriptome contains a large portion of repeat-derived RNAs (repRNAs). Here, we present a hypothesis for the evolution of novel functional repeat-derived RNAs from non-coding RNAs (ncRNAs) by retrotransposition. Upon amplification, the ncRNAs can diversify in sequence and subsequently evolve new activities, which can result in novel functions. Non-coding transcripts derived from highly repetitive regions can therefore serve as a reservoir for the evolution of novel functional RNAs. We base our hypothetical model on observations reported for short interspersed nuclear elements derived from 7SL RNA and tRNAs, α satellites derived from snoRNAs and SL RNAs derived from U1 small nuclear RNA. Furthermore, we present novel putative human repeat-derived ncRNAs obtained by the comparison of the Dfam and Rfam databases, as well as several examples in other species. We hypothesize that novel functional ncRNAs can derive also from other repetitive regions and propose Genomic SELEX as a tool for their identification.
Collapse
Affiliation(s)
- Katarzyna Matylla-Kulinska
- Department of Biochemistry and Cell Biology, Max F. Perutz Laboratories, University of Vienna, Vienna, Austria
| | | | | | | |
Collapse
|
28
|
Zhang YJ, Yang JH, Shi QS, Zheng LL, Liu J, Zhou H, Zhang H, Qu LH. Rapid birth-and-death evolution of imprinted snoRNAs in the Prader-Willi syndrome locus: implications for neural development in Euarchontoglires. PLoS One 2014; 9:e100329. [PMID: 24945811 PMCID: PMC4063771 DOI: 10.1371/journal.pone.0100329] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2014] [Accepted: 05/23/2014] [Indexed: 11/24/2022] Open
Abstract
Imprinted small nucleolar RNAs (snoRNAs) are only found in eutherian genomes and closely related to brain functions. A complex human neurological disease, Prader-Willi syndrome (PWS), is primarily attributed to the deletion of imprinted snoRNAs in chromosome 15q11-q13. Here we investigated the snoRNA repertoires in the PWS locus of 12 mammalian genomes and their evolution processes. A total of 613 imprinted snoRNAs were identified in the PWS homologous loci and the gene number was highly variable across lineages, with a peak in Euarchontoglires. Lineage-specific gene gain and loss events account for most extant genes of the HBII-52 (SNORD115) and the HBII-85 (SNORD116) gene family, and remarkable high gene-birth rates were observed in the primates and the rodents. Meanwhile, rapid sequence substitution occurred only in imprinted snoRNA genes, rather than their flanking sequences or the protein-coding genes located in the same imprinted locus. Strong selective constraints on the functional elements of these imprinted snoRNAs further suggest that they are subjected to birth-and-death evolution. Our data suggest that the regulatory role of HBII-52 on 5-HT2CR pre-mRNA might originate in the Euarchontoglires through adaptive process. We propose that the rapid evolution of PWS-related imprinted snoRNAs has contributed to the neural development of Euarchontoglires.
Collapse
Affiliation(s)
- Yi-Jun Zhang
- Key Laboratory of Gene Engineering of the Ministry of Education, State Key Laboratory of Biocontrol, and School of Life Sciences, Sun Yat-sen University, Guangzhou, P. R. China
| | - Jian-Hua Yang
- Key Laboratory of Gene Engineering of the Ministry of Education, State Key Laboratory of Biocontrol, and School of Life Sciences, Sun Yat-sen University, Guangzhou, P. R. China
| | - Qiao-Su Shi
- Laboratory of Liver Disease Hospital, The Third Affiliated Hospital, Sun Yat-sen University, Guangzhou, P. R. China
| | - Ling-Ling Zheng
- Key Laboratory of Gene Engineering of the Ministry of Education, State Key Laboratory of Biocontrol, and School of Life Sciences, Sun Yat-sen University, Guangzhou, P. R. China
| | - Jun Liu
- Institute of Human Virology, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, P. R. China
| | - Hui Zhou
- Key Laboratory of Gene Engineering of the Ministry of Education, State Key Laboratory of Biocontrol, and School of Life Sciences, Sun Yat-sen University, Guangzhou, P. R. China
| | - Hui Zhang
- Institute of Human Virology, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, P. R. China
| | - Liang-Hu Qu
- Key Laboratory of Gene Engineering of the Ministry of Education, State Key Laboratory of Biocontrol, and School of Life Sciences, Sun Yat-sen University, Guangzhou, P. R. China
- * E-mail:
| |
Collapse
|
29
|
Kehr S, Bartschat S, Tafer H, Stadler PF, Hertel J. Matching of Soulmates: coevolution of snoRNAs and their targets. Mol Biol Evol 2013; 31:455-67. [PMID: 24162733 DOI: 10.1093/molbev/mst209] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
Ribosomal and small nuclear RNAs (snRNAs) comprise numerous modified nucleotides. The modification patterns are retained during evolution, making it even possible to project them from yeast onto human. The stringent conservation of modification sites and the slow evolution of rRNAs and snRNAs contradicts the rapid evolution of small nucleolar RNA (snoRNA) sequences. To explain this discrepancy, we investigated the coevolution of snoRNAs and their targeted sites throughout vertebrates. To measure and evaluate the conservation of RNA-RNA interactions, we defined the interaction conservation index (ICI). It combines the quality of individual interaction with the scope of its conservation in a set of species and serves as an efficient measure to evaluate the conservation of the interaction of snoRNA and target. We show that functions of homologous snoRNAs are evolutionarily stable, thus, members of the same snoRNA family guide equivalent modifications. The conservation of snoRNA sequences is high at target binding regions while the remaining sequence varies significantly. In addition to elucidating principles of correlated evolution, we were able, with the help of the ICI measure, to assign functions to previously orphan snoRNAs and to associate snoRNAs as partners to known chemical modifications unassigned to a given snoRNA. Furthermore, we used predictions of snoRNA functions in conjunction with sequence conservation to identify distant homologies. Because of the high overall entropy of snoRNA sequences, such relationships are hard to detect by means of sequence homology search methods alone.
Collapse
Affiliation(s)
- Stephanie Kehr
- Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, University of Leipzig, Leipzig, Germany
| | | | | | | | | |
Collapse
|
30
|
RNA-Mediated Gene Duplication and Retroposons: Retrogenes, LINEs, SINEs, and Sequence Specificity. INTERNATIONAL JOURNAL OF EVOLUTIONARY BIOLOGY 2013; 2013:424726. [PMID: 23984183 PMCID: PMC3747384 DOI: 10.1155/2013/424726] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/14/2013] [Accepted: 07/01/2013] [Indexed: 11/18/2022]
Abstract
A substantial number of “retrogenes” that are derived from the mRNA of various intron-containing genes have been reported. A class of mammalian retroposons, long interspersed element-1 (LINE1, L1), has been shown to be involved in the reverse transcription of retrogenes (or processed pseudogenes) and non-autonomous short interspersed elements (SINEs). The 3′-end sequences of various SINEs originated from a corresponding LINE. As the 3′-untranslated regions of several LINEs are essential for retroposition, these LINEs presumably require “stringent” recognition of the 3′-end sequence of the RNA template. However, the 3′-ends of mammalian L1s do not exhibit any similarity to SINEs, except for the presence of 3′-poly(A) repeats. Since the 3′-poly(A) repeats of L1 and Alu SINE are critical for their retroposition, L1 probably recognizes the poly(A) repeats, thereby mobilizing not only Alu SINE but also cytosolic mRNA. Many flowering plants only harbor L1-clade LINEs and a significant number of SINEs with poly(A) repeats, but no homology to the LINEs. Moreover, processed pseudogenes have also been found in flowering plants. I propose that the ancestral L1-clade LINE in the common ancestor of green plants may have recognized a specific RNA template, with stringent recognition then becoming relaxed during the course of plant evolution.
Collapse
|
31
|
Hahn Y. Evidence for the dissemination of cryptic non-coding RNAs transcribed from intronic and intergenic segments by retroposition. Bioinformatics 2013; 29:1593-9. [PMID: 23652427 DOI: 10.1093/bioinformatics/btt258] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION Insertion of DNA segments is one mechanism by which genomes evolve. The bulk of genomic segments are now known to be transcribed into long and short non-coding RNAs (ncRNAs), promoter-associated transcripts and enhancer-templated transcripts. These various cryptic ncRNAs are thought to be dispersed in the human and other genomes by retroposition. RESULTS In this study, I report clear evidence for dissemination of cryptic ncRNAs transcribed from intronic and intergenic segments by retroposition. I used highly stringent conditions to find recently retroposed ncRNAs that had a poly(A) tract and were flanked by target site duplication. I identified 73 instances of retroposition in the human, mouse, and rat genomes (12, 36 and 25 instances, respectively). The inserted segments, in some cases, served as a novel exon or promoter for the associated gene, resulting in novel transcript variants. Some disseminated sequences showed sequence conservation across animals, implying a possible regulatory role. My results indicate that retroposition is one of the mechanisms for dispersion of ncRNAs. I propose that these newly inserted segments may play a role in genome evolution by potentially functioning as novel exons, promoters or enhancers. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Yoonsoo Hahn
- Department of Life Science, Research Center for Biomolecules and Biosystems, Chung-Ang University, Seoul 156-756, Korea.
| |
Collapse
|
32
|
Comparative genomics of eukaryotic small nucleolar RNAs reveals deep evolutionary ancestry amidst ongoing intragenomic mobility. BMC Evol Biol 2012; 12:183. [PMID: 22978381 PMCID: PMC3511168 DOI: 10.1186/1471-2148-12-183] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2012] [Accepted: 09/04/2012] [Indexed: 01/28/2023] Open
Abstract
BACKGROUND Small nucleolar (sno)RNAs are required for posttranscriptional processing and modification of ribosomal, spliceosomal and messenger RNAs. Their presence in both eukaryotes and archaea indicates that snoRNAs are evolutionarily ancient. The location of some snoRNAs within the introns of ribosomal protein genes has been suggested to belie an RNA world origin, with the exons of the earliest protein-coding genes having evolved around snoRNAs after the advent of templated protein synthesis. Alternatively, this intronic location may reflect more recent selection for coexpression of snoRNAs and ribosomal components, ensuring rRNA modification by snoRNAs during ribosome synthesis. To gain insight into the evolutionary origins of this genetic organization, we examined the antiquity of snoRNA families and the stability of their genomic location across 44 eukaryote genomes. RESULTS We report that dozens of snoRNA families are traceable to the Last Eukaryotic Common Ancestor (LECA), but find only weak similarities between the oldest eukaryotic snoRNAs and archaeal snoRNA-like genes. Moreover, many of these LECA snoRNAs are located within the introns of host genes independently traceable to the LECA. Comparative genomic analyses reveal the intronic location of LECA snoRNAs is not ancestral however, suggesting the pattern we observe is the result of ongoing intragenomic mobility. Analysis of human transcriptome data indicates that the primary requirement for hosting intronic snoRNAs is a broad expression profile. Consistent with ongoing mobility across broadly-expressed genes, we report a case of recent migration of a non-LECA snoRNA from the intron of a ubiquitously expressed non-LECA host gene into the introns of two LECA genes during the evolution of primates. CONCLUSIONS Our analyses show that snoRNAs were a well-established family of RNAs at the time when eukaryotes began to diversify. While many are intronic, this association is not evolutionarily stable across the eukaryote tree; ongoing intragenomic mobility has erased signal of their ancestral gene organization, and neither introns-first nor evolved co-expression adequately explain our results. We therefore present a third model - constrained drift - whereby individual snoRNAs are intragenomically mobile and may occupy any genomic location from which expression satisfies phenotype.
Collapse
|
33
|
Perina D, Korolija M, Mikoč A, Roller M, Pleše B, Imešek M, Morrow C, Batel R, Ćetković H. Structural and functional characterization of ribosomal protein gene introns in sponges. PLoS One 2012; 7:e42523. [PMID: 22880015 PMCID: PMC3412847 DOI: 10.1371/journal.pone.0042523] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2011] [Accepted: 07/10/2012] [Indexed: 11/25/2022] Open
Abstract
Ribosomal protein genes (RPGs) are a powerful tool for studying intron evolution. They exist in all three domains of life and are much conserved. Accumulating genomic data suggest that RPG introns in many organisms abound with non-protein-coding-RNAs (ncRNAs). These ancient ncRNAs are small nucleolar RNAs (snoRNAs) essential for ribosome assembly. They are also mobile genetic elements and therefore probably important in diversification and enrichment of transcriptomes through various mechanisms such as intron/exon gain/loss. snoRNAs in basal metazoans are poorly characterized. We examined 449 RPG introns, in total, from four demosponges: Amphimedon queenslandica, Suberites domuncula, Suberites ficus and Suberites pagurorum and showed that RPG introns from A. queenslandica share position conservancy and some structural similarity with "higher" metazoans. Moreover, our study indicates that mobile element insertions play an important role in the evolution of their size. In four sponges 51 snoRNAs were identified. The analysis showed discrepancies between the snoRNA pools of orthologous RPG introns between S. domuncula and A. queenslandica. Furthermore, these two sponges show as much conservancy of RPG intron positions between each other as between themselves and human. Sponges from the Suberites genus show consistency in RPG intron position conservation. However, significant differences in some of the orthologous RPG introns of closely related sponges were observed. This indicates that RPG introns are dynamic even on these shorter evolutionary time scales.
Collapse
Affiliation(s)
- Drago Perina
- Department of Molecular Biology, Rudjer Boskovic Institute, Zagreb, Croatia
| | - Marina Korolija
- Department of Molecular Medicine, Rudjer Boskovic Institute, Zagreb, Croatia
| | - Andreja Mikoč
- Department of Molecular Biology, Rudjer Boskovic Institute, Zagreb, Croatia
| | - Maša Roller
- Department of Molecular Biology, Faculty of Science University of Zagreb, Zagreb, Croatia
| | - Bruna Pleše
- Department of Molecular Biology, Rudjer Boskovic Institute, Zagreb, Croatia
| | - Mirna Imešek
- Department of Molecular Biology, Rudjer Boskovic Institute, Zagreb, Croatia
| | - Christine Morrow
- School of Biological Sciences, Queen's University, Belfast, United Kingdom
| | - Renato Batel
- Center for Marine Research, Rudjer Boskovic Institute, Rovinj, Croatia
| | - Helena Ćetković
- Department of Molecular Biology, Rudjer Boskovic Institute, Zagreb, Croatia
| |
Collapse
|
34
|
Makarova JA, Kramerov DA. SNOntology: Myriads of novel snoRNAs or just a mirage? BMC Genomics 2011; 12:543. [PMID: 22047601 PMCID: PMC3349704 DOI: 10.1186/1471-2164-12-543] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2011] [Accepted: 11/03/2011] [Indexed: 12/16/2022] Open
Abstract
Background Small nucleolar RNAs (snoRNAs) are a large group of non-coding RNAs (ncRNAs) that mainly guide 2'-O-methylation (C/D RNAs) and pseudouridylation (H/ACA RNAs) of ribosomal RNAs. The pattern of rRNA modifications and the set of snoRNAs that guide these modifications are conserved in vertebrates. Nearly all snoRNA genes in vertebrates are localized in introns of other genes and are processed from pre-mRNAs. Thus, the same promoter is used for the transcription of snoRNAs and host genes. Results The series of studies by Dahai Zhu and coworkers on snoRNAs and their genes were critically considered. We present evidence that dozens of species-specific snoRNAs that they described in vertebrates are experimental artifacts resulting from the improper use of Northern hybridization. The snoRNA genes with putative intrinsic promoters that were supposed to be transcribed independently proved to contain numerous substitutions and are, most likely, pseudogenes. In some cases, they are localized within introns of overlooked host genes. Finally, an increased number of snoRNA genes in mammalian genomes described by Zhu and coworkers is also an artifact resulting from two mistakes. First, numerous mammalian snoRNA pseudogenes were considered as genes, whereas most of them are localized outside of host genes and contain substitutions that question their functionality. Second, Zhu and coworkers failed to identify many snoRNA genes in non-mammalian species. As an illustration, we present 1352 C/D snoRNA genes that we have identified and annotated in vertebrates. Conclusions Our results demonstrate that conclusions based only on databases with automatically annotated ncRNAs can be erroneous. Special investigations aimed to distinguish true RNA genes from their pseudogenes should be done. Zhu and coworkers, as well as most other groups studying vertebrate snoRNAs, give new names to newly described homologs of human snoRNAs, which significantly complicates comparison between different species. It seems necessary to develop a uniform nomenclature for homologs of human snoRNAs in other vertebrates, e.g., human gene names prefixed with several-letter code denoting the vertebrate species.
Collapse
|
35
|
Marz M, Gruber AR, Höner Zu Siederdissen C, Amman F, Badelt S, Bartschat S, Bernhart SH, Beyer W, Kehr S, Lorenz R, Tanzer A, Yusuf D, Tafer H, Hofacker IL, Stadler PF. Animal snoRNAs and scaRNAs with exceptional structures. RNA Biol 2011; 8:938-46. [PMID: 21955586 DOI: 10.4161/rna.8.6.16603] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
The overwhelming majority of small nucleolar RNAs (snoRNAs) fall into two clearly defined classes characterized by distinctive secondary structures and sequence motifs. A small group of diverse ncRNAs, however, shares the hallmarks of one or both classes of snoRNAs but differs substantially from the norm in some respects. Here, we compile the available information on these exceptional cases, conduct a thorough homology search throughout the available metazoan genomes, provide improved and expanded alignments, and investigate the evolutionary histories of these ncRNA families as well as their mutual relationships.
Collapse
Affiliation(s)
- Manja Marz
- RNA Bioinformatik Gruppe, Institut f¨ur Pharmazeutische Chemie, Philipps Universit¨at Marburg, Marburg, Germany
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
36
|
Scott MS, Ono M. From snoRNA to miRNA: Dual function regulatory non-coding RNAs. Biochimie 2011; 93:1987-92. [PMID: 21664409 PMCID: PMC3476530 DOI: 10.1016/j.biochi.2011.05.026] [Citation(s) in RCA: 178] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2011] [Accepted: 05/19/2011] [Indexed: 11/03/2022]
Abstract
Small nucleolar RNAs (snoRNAs) are an ancient class of small non-coding RNAs present in all eukaryotes and a subset of archaea that carry out a fundamental role in the modification and processing of ribosomal RNA. In recent years, however, a large proportion of snoRNAs have been found to be further processed into smaller molecules, some of which display different functionality. In parallel, several studies have uncovered extensive similarities between snoRNAs and other types of small non-coding RNAs, and in particular microRNAs. Here, we explore the extent of the relationship between these types of non-coding RNA and the possible underlying evolutionary forces that shaped this subset of the current non-coding RNA landscape.
Collapse
Affiliation(s)
- Michelle S Scott
- Division of Biological Chemistry and Drug Discovery, College of Life Sciences, University of Dundee, Dow Street, Dundee DD1 5EH, UK.
| | | |
Collapse
|
37
|
Mattick JS. The central role of RNA in human development and cognition. FEBS Lett 2011; 585:1600-16. [DOI: 10.1016/j.febslet.2011.05.001] [Citation(s) in RCA: 149] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2011] [Accepted: 05/03/2011] [Indexed: 12/22/2022]
|
38
|
Langenberger D, Bartschat S, Hertel J, Hoffmann S, Tafer H, Stadler PF. MicroRNA or Not MicroRNA? ADVANCES IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY 2011. [DOI: 10.1007/978-3-642-22825-4_1] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
|
39
|
Churakov G, Grundmann N, Kuritzin A, Brosius J, Makałowski W, Schmitz J. A novel web-based TinT application and the chronology of the Primate Alu retroposon activity. BMC Evol Biol 2010; 10:376. [PMID: 21126360 PMCID: PMC3014933 DOI: 10.1186/1471-2148-10-376] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2010] [Accepted: 12/02/2010] [Indexed: 01/30/2023] Open
Abstract
BACKGROUND DNA sequences afford access to the evolutionary pathways of life. Particularly mobile elements that constantly co-evolve in genomes encrypt recent and ancient information of their host's history. In mammals there is an extraordinarily abundant activity of mobile elements that occurs in a dynamic succession of active families, subfamilies, types, and subtypes of retroposed elements. The high frequency of retroposons in mammals implies that, by chance, such elements also insert into each other. While inactive elements are no longer able to retropose, active elements retropose by chance into other active and inactive elements. Thousands of such directional, element-in-element insertions are found in present-day genomes. To help analyze these events, we developed a computational algorithm (Transpositions in Transpositions, or TinT) that examines the different frequencies of nested transpositions and reconstructs the chronological order of retroposon activities. RESULTS By examining the different frequencies of such nested transpositions, the TinT application reconstructs the chronological order of retroposon activities. We use such activity patterns as a comparative tool to (1) delineate the historical rise and fall of retroposons and their relations to each other, (2) understand the retroposon-induced complexity of recent genomes, and (3) find selective informative homoplasy-free markers of phylogeny. The efficiency of the new application is demonstrated by applying it to dimeric Alu Short INterspersed Elements (SINE) to derive a complete chronology of such elements in primates. CONCLUSION The user-friendly, web-based TinT interface presented here affords an easy, automated screening for nested transpositions from genome assemblies or trace data, assembles them in a frequency-matrix, and schematically displays their chronological activity history.
Collapse
Affiliation(s)
- Gennady Churakov
- Institute of Experimental Pathology, ZMBE, University of Münster, Von-Esmarch-Str. 56, 48149 Münster, Germany
| | - Norbert Grundmann
- Institute of Bioinformatics, Faculty of Medicine, University of Münster, Niels Stensen Str. 14, 48149 Münster, Germany
| | - Andrej Kuritzin
- Department of Physics and Mathematics, Saint Petersburg State Institute of Technology, 26 Moskovsky av., St.-Petersburg 198013, Russia
| | - Jürgen Brosius
- Institute of Experimental Pathology, ZMBE, University of Münster, Von-Esmarch-Str. 56, 48149 Münster, Germany
| | - Wojciech Makałowski
- Institute of Bioinformatics, Faculty of Medicine, University of Münster, Niels Stensen Str. 14, 48149 Münster, Germany
| | - Jürgen Schmitz
- Institute of Experimental Pathology, ZMBE, University of Münster, Von-Esmarch-Str. 56, 48149 Münster, Germany
| |
Collapse
|
40
|
Hancks DC, Kazazian H. SVA retrotransposons: Evolution and genetic instability. Semin Cancer Biol 2010; 20:234-45. [PMID: 20416380 PMCID: PMC2945828 DOI: 10.1016/j.semcancer.2010.04.001] [Citation(s) in RCA: 129] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2010] [Revised: 04/01/2010] [Accepted: 04/14/2010] [Indexed: 01/21/2023]
Abstract
SINE-VNTR-Alus (SVA) are non-autonomous hominid specific retrotransposons that are associated with disease in humans. SVAs are evolutionarily young and presumably mobilized by the LINE-1 reverse transcriptase in trans. SVAs are currently active and may impact the host through a variety of mechanisms including insertional mutagenesis, exon shuffling, alternative splicing, and the generation of differentially methylated regions (DMR). Here we review SVA biology, including SVA insertions associated with known diseases. Further, we discuss a model describing the initial formation of SVA and the mechanisms by which SVA may impact the host.
Collapse
Affiliation(s)
- Dustin C. Hancks
- Department of Genetics, The University of Pennsylvania School of Medicine
| | - Haig Kazazian
- Department of Genetics, The University of Pennsylvania School of Medicine
| |
Collapse
|
41
|
Zhang Y, Liu J, Jia C, Li T, Wu R, Wang J, Chen Y, Zou X, Chen R, Wang XJ, Zhu D. Systematic identification and evolutionary features of rhesus monkey small nucleolar RNAs. BMC Genomics 2010; 11:61. [PMID: 20100322 PMCID: PMC2832892 DOI: 10.1186/1471-2164-11-61] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2009] [Accepted: 01/25/2010] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND Recent studies have demonstrated that non-protein-coding RNAs (npcRNAs/ncRNAs) play important roles during eukaryotic development, species evolution, and in the etiology of disease. Rhesus macaques are the most widely used primate model in both biomedical research and primate evolutionary studies. However, most reports on these animals focus on the functional roles of protein-coding sequences, whereas very little is known about macaque ncRNAs. RESULTS In the present study, we performed the first systematic profiling of intermediate-size ncRNAs (50 to 500 nt) from the rhesus monkey by constructing a cDNA library. We identified 117 rhesus monkey ncRNAs, including 80 small nucleolar RNAs (snoRNAs), 29 other types of known RNAs (snRNAs, Y RNA, and others), and eight unclassified ncRNAs. Comparative genomic analysis and northern blot hybridizations demonstrated that some snoRNAs were lineage- or species-specific. Paralogous sequences were found for most rhesus monkey snoRNAs, the expression of which might be attributable to extensive duplication within the rhesus monkey genome. Further investigation of snoRNA flanking sequences showed that some rhesus monkey snoRNAs are retrogenes derived from L1-mediated integration. Finally, phylogenetic analysis demonstrated that birds and primates share some snoRNAs and host genes thereof, suggesting that both the relevant host genes and the snoRNAs contained therein may be inherited from a common ancestor. However, some rhesus monkey snoRNAs hosted by non-ribosome-related genes appeared after the evolutionary divergence between birds and mammals. CONCLUSIONS We provide the first experimentally-derived catalog of rhesus monkey ncRNAs and uncover some interesting genomic and evolutionary features. These findings provide important information for future functional characterization of snoRNAs during primate evolution.
Collapse
Affiliation(s)
- Yong Zhang
- National Laboratory of Medical Molecular Biology, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences, School of Basic Medicine, Peking Union Medical College, Beijing, PR China
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
42
|
Mattick JS. Deconstructing the dogma: a new view of the evolution and genetic programming of complex organisms. Ann N Y Acad Sci 2009; 1178:29-46. [PMID: 19845626 DOI: 10.1111/j.1749-6632.2009.04991.x] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Since the birth of molecular biology it has been generally assumed that most genetic information is transacted by proteins, and that RNA plays an intermediary role. This led to the subsidiary assumption that the vast tracts of noncoding sequences in the genomes of higher organisms are largely nonfunctional, despite the fact that they are transcribed. These assumptions have since become articles of faith, but they are not necessarily correct. I propose an alternative evolutionary history whereby developmental and cognitive complexity has arisen by constructing sophisticated RNA-based regulatory networks that interact with generic effector complexes to control gene expression patterns and the epigenetic trajectories of differentiation and development. Environmental information can also be conveyed into this regulatory system via RNA editing, especially in the brain. Moreover, the observations that RNA-directed epigenetic changes can be inherited raises the intriguing question: has evolution learnt how to learn?
Collapse
Affiliation(s)
- John S Mattick
- Institute for Molecular Bioscience, The University of Queensland, St Lucia QLD 4072, Australia.
| |
Collapse
|
43
|
Hoeppner MP, White S, Jeffares DC, Poole AM. Evolutionarily stable association of intronic snoRNAs and microRNAs with their host genes. Genome Biol Evol 2009; 1:420-8. [PMID: 20333211 PMCID: PMC2817437 DOI: 10.1093/gbe/evp045] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/31/2009] [Indexed: 12/20/2022] Open
Abstract
Small nucleolar RNAs (snoRNAs) and microRNAs (miRNAs) are integral to a range of processes, including ribosome biogenesis and gene regulation. Some are intron encoded, and this organization may facilitate coordinated coexpression of host gene and RNA. However, snoRNAs and miRNAs are known to be mobile, so intron-RNA associations may not be evolutionarily stable. We have used genome alignments across 11 mammals plus chicken to examine positional orthology of snoRNAs and miRNAs and report that 21% of annotated snoRNAs and 11% of miRNAs are positionally conserved across mammals. Among RNAs traceable to the bird–mammal common ancestor, 98% of snoRNAs and 76% of miRNAs are intronic. Comparison of the most evolutionarily stable mammalian intronic snoRNAs with those positionally conserved among primates reveals that the former are more overrepresented among host genes involved in translation or ribosome biogenesis and are more broadly and highly expressed. This stability is likely attributable to a requirement for overlap between host gene and intronic snoRNA expression profiles, consistent with an ancestral role in ribosome biogenesis. In contrast, whereas miRNA positional conservation is comparable to that observed for snoRNAs, intronic miRNAs show no obvious association with host genes of a particular functional category, and no statistically significant differences in host gene expression are found between those traceable to mammalian or primate ancestors. Our results indicate evolutionarily stable associations of numerous intronic snoRNAs and miRNAs and their host genes, with probable continued diversification of snoRNA function from an ancestral role in ribosome biogenesis.
Collapse
Affiliation(s)
- Marc P Hoeppner
- Department of Molecular Biology and Functional Genomics, Stockholm University, SE-106 91 Stockholm, Sweden
| | | | | | | |
Collapse
|
44
|
Dieci G, Preti M, Montanini B. Eukaryotic snoRNAs: a paradigm for gene expression flexibility. Genomics 2009; 94:83-8. [PMID: 19446021 DOI: 10.1016/j.ygeno.2009.05.002] [Citation(s) in RCA: 229] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2009] [Revised: 04/30/2009] [Accepted: 05/07/2009] [Indexed: 11/26/2022]
Abstract
Small nucleolar RNAs (snoRNAs) are one of the most ancient and numerous families of non-protein-coding RNAs (ncRNAs). The main function of snoRNAs - to guide site-specific rRNA modification - is the same in Archaea and all eukaryotic lineages. In contrast, as revealed by recent genomic and RNomic studies, their genomic organization and expression strategies are the most varied. Seemingly snoRNA coding units have adopted, in the course of evolution, all the possible ways of being transcribed, thus providing a unique paradigm of gene expression flexibility. By focusing on representative fungal, plant and animal genomes, we review here all the documented types of snoRNA gene organization and expression, and we provide a comprehensive account of snoRNA expressional freedom by precisely estimating the frequency, in each genome, of each type of genomic organization. We finally discuss the relevance of snoRNA genomic studies for our general understanding of ncRNA family evolution and expression in eukaryotes.
Collapse
Affiliation(s)
- Giorgio Dieci
- Dipartimento di Biochimica e Biologia Molecolare, Università degli Studi di Parma, Parma, Italy.
| | | | | |
Collapse
|
45
|
Marz M, Kirsten T, Stadler PF. Evolution of spliceosomal snRNA genes in metazoan animals. J Mol Evol 2009; 67:594-607. [PMID: 19030770 DOI: 10.1007/s00239-008-9149-6] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2008] [Accepted: 07/14/2008] [Indexed: 11/28/2022]
Abstract
While studies of the evolutionary histories of protein families are commonplace, little is known on noncoding RNAs beyond microRNAs and some snoRNAs. Here we investigate in detail the evolutionary history of the nine spliceosomal snRNA families (U1, U2, U4, U5, U6, U11, U12, U4atac, and U6atac) across the completely or partially sequenced genomes of metazoan animals. Representatives of the five major spliceosomal snRNAs were found in all genomes. None of the minor splicesomal snRNAs were detected in nematodes or in the shotgun traces of Oikopleura dioica, while in all other animal genomes at most one of them is missing. Although snRNAs are present in multiple copies in most genomes, distinguishable paralogue groups are not stable over long evolutionary times, although they appear independently in several clades. In general, animal snRNA secondary structures are highly conserved, albeit, in particular, U11 and U12 in insects exhibit dramatic variations. An analysis of genomic context of snRNAs reveals that they behave like mobile elements, exhibiting very little syntenic conservation.
Collapse
Affiliation(s)
- Manuela Marz
- Bioinformatics Group, Department of Computer Science, University of Leipzig, Härtelstrasse 16-18, 04107 Leipzig, Germany.
| | | | | |
Collapse
|
46
|
Makarova JA, Kramerov DA. Analysis of C/D box snoRNA genes in vertebrates: The number of copies decreases in placental mammals. Genomics 2009; 94:11-9. [PMID: 19272437 DOI: 10.1016/j.ygeno.2009.02.003] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2008] [Revised: 02/26/2009] [Accepted: 02/27/2009] [Indexed: 12/31/2022]
Abstract
C/D box small nucleolar RNAs (snoRNAs) guide site-specific 2'-O-methylation of RNAs. Nearly all C/D box snoRNAs with known targets are involved in rRNA modification. In vertebrates, snoRNAs are encoded in introns of various genes and their processing is coupled with splicing of host gene pre-mRNA. Here, the genes encoding C/D box snoRNAs that guide 2'-O-methylation of rRNA were identified and analyzed in vertebrate genomes. The number of copies of most C/D box snoRNA genes proved to be lower in placental mammals compared to other vertebrates. This can be due to smaller oocytes and accordingly lower number of ribosomes in them in eutherians. The targets of snoRNAs encoded by single-copy and multiple-copy genes proved to have different distribution in rRNAs. The causes of this difference are discussed. In some cases, the transcripts of homologous C/D box RNA genes were shown to guide the modification of neighboring nucleotides in rRNA. C/D box snoRNA pseudogenes were found in all vertebrate classes. Three novel C/D box snoRNAs were found in Xenopus tropicalis that may guide 2'-O-methylation of Xenopus-specific rRNA sites. A list of 922 annotated C/D box snoRNA genes is presented.
Collapse
Affiliation(s)
- Julia A Makarova
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, 32 Vavilov St., Moscow 119991, Russia
| | | |
Collapse
|
47
|
Genome-wide analysis of chicken snoRNAs provides unique implications for the evolution of vertebrate snoRNAs. BMC Genomics 2009; 10:86. [PMID: 19232134 PMCID: PMC2653536 DOI: 10.1186/1471-2164-10-86] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2008] [Accepted: 02/22/2009] [Indexed: 01/02/2023] Open
Abstract
Background Small nucleolar RNAs (snoRNAs) represent one of the largest groups of functionally diverse trans-acting non-protein-coding (npc) RNAs currently known in eukaryotic cells. Chicken snoRNAs have been very poorly characterized when compared to other vertebrate snoRNAs. A genome-wide analysis of chicken snoRNAs is therefore of great importance to further understand the functional evolution of snoRNAs in vertebrates. Results Two hundred and one gene variants encoding 93 box C/D and 62 box H/ACA snoRNAs were identified in the chicken genome and are predicted to guide 86 2'-O-ribose methylations and 69 pseudouridylations of rRNAs and spliceosomal RNAs. Forty-four snoRNA clusters were grouped into four categories based on synteny characteristics of the clustered snoRNAs between chicken and human. Comparative analyses of chicken snoRNAs revealed extensive recombination and separation of guiding function, with cooperative evolution between the guiding duplexes and modification sites. The gas5-like snoRNA host gene appears to be a hotspot of snoRNA gene expansion in vertebrates. Our results suggest that the chicken is a good model for the prediction of functional snoRNAs, and that intragenic duplication and divergence might be the major driving forces responsible for expansion of novel snoRNA genes in the chicken genome. Conclusion We have provided a detailed catalog of chicken snoRNAs that aids in understanding snoRNA gene repertoire differences between avians and other vertebrates. Our genome-wide analysis of chicken snoRNAs improves annotation of the 'darkness matter' in the npcRNA world and provides a unique perspective into snoRNA evolution in vertebrates.
Collapse
|
48
|
Abstract
Over two centuries after the first platypus specimen stirred the scientific community in Europe, the whole-genome sequence of the duck-billed platypus has been completed and is publicly available. After publication of eutherian and marsupial genomes, this is the first genome of a monotreme filling an important evolutionary gap between the divergence of birds more that 300 million years ago and marsupials more than 140 million years ago. Monotremes represent the most basal surviving branch of mammals and the platypus genome sequence allows unprecedented insights into the evolution of mammals and the fascinating biology of the egg-laying mammals. Here, we discuss some of the key findings of the analysis of the platypus genome and point to new findings and future research directions, which illustrate the broad impact of the platypus genome project for understanding monotreme biology and mammalian genome evolution.
Collapse
|
49
|
Sela N, Stern A, Makalowski W, Pupko T, Ast G. Transduplication resulted in the incorporation of two protein-coding sequences into the turmoil-1 transposable element of C. elegans. Biol Direct 2008; 3:41. [PMID: 18842128 PMCID: PMC2572040 DOI: 10.1186/1745-6150-3-41] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2008] [Accepted: 10/08/2008] [Indexed: 02/04/2023] Open
Abstract
Transposable elements may acquire unrelated gene fragments into their sequences in a process called transduplication. Transduplication of protein-coding genes is common in plants, but is unknown of in animals. Here, we report that the Turmoil-1 transposable element in C. elegans has incorporated two protein-coding sequences into its inverted terminal repeat (ITR) sequences. The ITRs of Turmoil-1 contain a conserved RNA recognition motif (RRM) that originated from the rsp-2 gene and a fragment from the protein-coding region of the cpg-3 gene. We further report that an open reading frame specific to C. elegans may have been created as a result of a Turmoil-1 insertion. Mutations at the 5' splice site of this open reading frame may have reactivated the transduplicated RRM motif.
Collapse
Affiliation(s)
- Noa Sela
- Department of Human Molecular Genetics and Biochemistry, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv 69978, Israel.
| | | | | | | | | |
Collapse
|
50
|
Abstract
Retrotransposons, mainly LINEs, SINEs, and endogenous retroviruses, make up roughly 40% of the mammalian genome and have played an important role in genome evolution. Their prevalence in genomes reflects a delicate balance between their further expansion and the restraint imposed by the host. In any human genome only a small number of LINE1s (L1s) are active, moving their own and SINE sequences into new genomic locations and occasionally causing disease. Recent insights and new technologies promise answers to fundamental questions about the biology of transposable elements.
Collapse
Affiliation(s)
- John L Goodier
- Department of Genetics, University of Pennsylvania School of Medicine, 415 Curie Boulevard, Philadelphia, PA 19104, USA.
| | | |
Collapse
|