1
|
Discovery of alternatively spliced isoforms and long non-coding RNA in full length brain transcriptomes of anadromous Hilsa shad, Tenualosa ilisha (Hamilton, 1822). Mol Biol Rep 2021; 48:7333-7342. [PMID: 34643921 DOI: 10.1007/s11033-021-06735-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2021] [Accepted: 09/30/2021] [Indexed: 12/07/2022]
Abstract
BACKGROUND Full length transcriptomes, achieved through long-read sequencing, along with the isoform analysis can reveal complexities in the gene expression profiles, as well as annotate the transcriptomes of non-model organisms. METHODS AND RESULT Full length transcripts of brain transcriptome of Tenualosa ilisha, Hilsa shad, were generated through PacBio single molecule real-time sequencing and were characterized. A total of 8.30 Gb clean reads were generated, with PacBio RSII, which resulted in 57,651 high quality consensus transcripts. After removing redundant reads, a total of 19,220 high-quality non-redundant transcripts and 17,341 full length ORF transcripts were classified to 7522 putative ortholog groups. Genes involved in various neural pathways were identified. In addition, isoform clusters and lncRNAs were discovered, along with Hilsa specific transcripts with coding frames and 29,147 SSRs in 944 transcripts (1141 annotated). CONCLUSION The present study provided, for the first time, a comprehensive view of the alternative isoforms of genes and transcriptome complexity in Hilsa shad brain and forms a rich resource for functional studies in brain of this anadromous fish.
Collapse
|
2
|
Abstract
Systematics is described for annotation of variations in RNA molecules. The conceptual framework is part of Variation Ontology (VariO) and facilitates depiction of types of variations, their functional and structural effects and other consequences in any RNA molecule in any organism. There are more than 150 RNA related VariO terms in seven levels, which can be further combined to generate even more complicated and detailed annotations. The terms are described together with examples, usually for variations and effects in human and in diseases. RNA variation type has two subcategories: variation classification and origin with subterms. Altogether six terms are available for function description. Several terms are available for affected RNA properties. The ontology contains also terms for structural description for affected RNA type, post-transcriptional RNA modifications, secondary and tertiary structure effects and RNA sugar variations. Together with the DNA and protein concepts and annotations, RNA terms allow comprehensive description of variations of genetic and non-genetic origin at all possible levels. The VariO annotations are readable both for humans and computer programs for advanced data integration and mining.
Collapse
Affiliation(s)
- Mauno Vihinen
- Department of Experimental Medical Science, Lund University, Lund, Sweden
| |
Collapse
|
3
|
Seibt KM, Schmidt T, Heitkam T. The conserved 3' Angio-domain defines a superfamily of short interspersed nuclear elements (SINEs) in higher plants. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2020; 101:681-699. [PMID: 31610059 DOI: 10.1111/tpj.14567] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/16/2019] [Revised: 09/13/2019] [Accepted: 09/17/2019] [Indexed: 06/10/2023]
Abstract
Repetitive sequences are ubiquitous components of eukaryotic genomes affecting genome size and evolution as well as gene regulation. Among them, short interspersed nuclear elements (SINEs) are non-coding retrotransposons usually shorter than 1000 bp. They contain only few short conserved structural motifs, in particular an internal promoter derived from cellular RNAs and a mostly AT-rich 3' tail, whereas the remaining regions are highly variable. SINEs emerge and vanish during evolution, and often diversify into numerous families and subfamilies that are usually specific for only a limited number of species. In contrast, at the 3' end of multiple plant SINEs we detected the highly conserved 'Angio-domain'. This 37 bp segment defines the Angio-SINE superfamily, which encompasses 24 plant SINE families widely distributed across 13 orders within the plant kingdom. We retrieved 28 433 full-length Angio-SINE copies from genome assemblies of 46 plant species, frequently located in genes. Compensatory mutations in and adjacent to the Angio-domain imply selective restraints maintaining its RNA structure. Angio-SINE families share segmental sequence similarities, indicating a modular evolution with strong Angio-domain preservation. We suggest that the conserved domain contributes to the evolutionary success of Angio-SINEs through either structural interactions between SINE RNA and proteins increasing their transpositional efficiency, or by enhancing their accumulation in genes.
Collapse
Affiliation(s)
- Kathrin M Seibt
- Faculty of Biology, Technische Universität Dresden, Zellescher Weg 20b, Dresden, 01217, Germany
| | - Thomas Schmidt
- Faculty of Biology, Technische Universität Dresden, Zellescher Weg 20b, Dresden, 01217, Germany
| | - Tony Heitkam
- Faculty of Biology, Technische Universität Dresden, Zellescher Weg 20b, Dresden, 01217, Germany
| |
Collapse
|
4
|
Shein A, Zaikin A, Poptsova M. Recognition of 3'-end L1, Alu, processed pseudogenes, and mRNA stem-loops in the human genome using sequence-based and structure-based machine-learning models. Sci Rep 2019; 9:7211. [PMID: 31076573 PMCID: PMC6510757 DOI: 10.1038/s41598-019-43403-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2019] [Accepted: 04/24/2019] [Indexed: 11/09/2022] Open
Abstract
The role of 3′-end stem-loops in retrotransposition was experimentally demonstrated for transposons of various species, where LINE-SINE retrotransposons share the same 3′-end sequences, containing a stem-loop. We have discovered that 62–68% of processed pseduogenes and mRNAs also have 3′-end stem-loops. We investigated the properties of 3′-end stem-loops of human L1s, Alus, processed pseudogenes and mRNAs that do not share the same sequences, but all have 3′-end stem-loops. We have built sequence-based and structure-based machine-learning models that are able to recognize 3′-end L1, Alu, processed pseudogene and mRNA stem-loops with high performance. The sequence-based models use only sequence information and capture compositional bias in 3′-ends. The structure-based models consider physical, chemical and geometrical properties of dinucleotides composing a stem and position-specific nucleotide content of a loop and a bulge. The most important parameters include shift, tilt, rise, and hydrophilicity. The obtained results clearly point to the existence of structural constrains for 3′-end stem-loops of L1 and Alu, which are probably important for transposition, and reveal the potential of mRNAs to be recognized by the L1 machinery. The proposed approach is applicable to a broader task of recognizing RNA (DNA) secondary structures. The constructed models are freely available at github (https://github.com/AlexShein/transposons/).
Collapse
Affiliation(s)
- Alexander Shein
- Laboratory of Bioinformatics, Big Data and Information Retrieval School, Faculty of Computer Science, National Research University Higher School of Economics, Moscow, Russia
| | - Anton Zaikin
- Laboratory of Bioinformatics, Big Data and Information Retrieval School, Faculty of Computer Science, National Research University Higher School of Economics, Moscow, Russia
| | - Maria Poptsova
- Laboratory of Bioinformatics, Big Data and Information Retrieval School, Faculty of Computer Science, National Research University Higher School of Economics, Moscow, Russia.
| |
Collapse
|
5
|
Otsu M, Kawai G. Distinct RNA recognition mechanisms in closely related LINEs from zebrafish. NUCLEOSIDES NUCLEOTIDES & NUCLEIC ACIDS 2019; 38:294-304. [PMID: 30942141 DOI: 10.1080/15257770.2018.1527348] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Long interspersed nuclear element (LINE) is known to be transposed by the reverse transcription using its RNA transcript. Recognition of the 3' stem-loop of LINE RNA by its reverse transcriptase (RT) is an important step of the retrotransposition. We focused on the RNA recognition by RT from two related LINEs, ZfL2-1 and ZfL2-2, from zebrafish. Previous study showed that RT from ZfL2-2 recognizes a single residue in the specific position of the RNA loop. In the present study, it was found that RT from ZfL2-1 recognizes the inserted stem-loop of ZfL2-1 RNA. Thus, these related RTs recognize the same region of LINE RNAs but discriminate them by different mechanism.
Collapse
Affiliation(s)
- Maina Otsu
- a Department of Life and Environmental Sciences, Faculty of Engineering , Chiba Institute of Technology , Narashino , Japan
| | - Gota Kawai
- a Department of Life and Environmental Sciences, Faculty of Engineering , Chiba Institute of Technology , Narashino , Japan
| |
Collapse
|
6
|
Otsu M, Kajikawa M, Okada N, Kawai G. Solution structure of a reverse transcriptase recognition site of a LINE RNA from zebrafish. J Biochem 2017; 162:279-285. [PMID: 28431120 DOI: 10.1093/jb/mvx026] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2017] [Accepted: 03/23/2017] [Indexed: 11/14/2022] Open
Abstract
Long interspersed nuclear element (LINE) is known to be transposed by reverse transcription using its RNA transcript. Recognition of the 3' stem-loop of LINE RNA by its reverse transcriptase (RT) is an important step of the retrotransposition. Our previous study revealed that the second G residue (G8) in the GGAUA loop of a 17mer LINE RNA from eel, UnaL2-17, is recognized by its RT and the U residue (U10) in the same loop is required to maintain the loop structure (Baba S, Kajikawa M, Okada N, Kawai G. Solution structure of an RNA stem-loop derived from the 3' conserved region of eel LINE UnaL2. RNA 2004;10:1380-1387). ZfL2-2, a LINE from zebrafish, has the same 3' stem-loop with UnaL2 and ZfL2-1 has similar but distinct 3' stem-loop with an insertion which can form an additional stem-loop. Here, we determined the solution structure of the 34mer RT recognition site of the LINE RNA (ZfL2-1-34). It was found that ZfL2-1-34 forms a hairpin with an internal loop, the tertiary structure of which is superimposed with that of ZfL2-2. It is noted that A10 and the inserted stem-loop, starting with A12, in ZfL2-1-34 located at the positions corresponding to those of G8 and U10, respectively, in UnaL2-17. These results strongly suggest that the two LINEs share the similar recognition mechanism and the A10 in ZfL2-1-34 is the determinant recognized by its RT.
Collapse
Affiliation(s)
- Maina Otsu
- Department of Life and Environmental Sciences, Faculty of Engineering, Chiba Institute of Technology, 2-17-1 Tsudanuma, Narashino, Chiba 275-0016, Japan
| | - Masaki Kajikawa
- Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, 4259-B-15 Nagatsuta-cho, Midori-ku, Yokohama, Kanagawa 226-8501, Japan
| | - Norihiro Okada
- Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, 4259-B-15 Nagatsuta-cho, Midori-ku, Yokohama, Kanagawa 226-8501, Japan.,Department of Life Sciences, National Cheng Kung University, Tainan 701, Taiwan.,Foundation for Advancement of International Science, Tsukuba 305-0821, Japan
| | - Gota Kawai
- Department of Life and Environmental Sciences, Faculty of Engineering, Chiba Institute of Technology, 2-17-1 Tsudanuma, Narashino, Chiba 275-0016, Japan
| |
Collapse
|
7
|
Warden MS, Tonelli M, Cornilescu G, Liu D, Hopersberger LJ, Ponniah K, Pascal SM. Structure of RNA Stem Loop B from the Picornavirus Replication Platform. Biochemistry 2017; 56:2549-2557. [PMID: 28459542 DOI: 10.1021/acs.biochem.7b00141] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
The presumptive RNA cloverleaf at the start of the 5'-untranslated region of the picornavirus genome is an essential element in replication. Stem loop B (SLB) of the cloverleaf is a recognition site for the host polyC-binding protein, which initiates a switch from translation to replication. Here we present the solution structure of human rhinovirus isotype 14 SLB using nuclear magnetic resonance spectroscopy. SLB adopts a predominantly A-form helical structure. The stem contains five Watson-Crick base pairs and one wobble base pair and is capped by an eight-nucleotide loop. The wobble base pair introduces perturbations into the helical parameters but does not appear to introduce flexibility. However, the helix major groove appears to be accessible. Flexibility is seen throughout the loop and in the terminal nucleotides. The pyrimidine-rich region of the loop, the apparent recognition site for the polyC-binding protein, is the most disordered region of the structure.
Collapse
Affiliation(s)
- Meghan S Warden
- Department of Chemistry and Biochemistry, Old Dominion University , Norfolk, Virginia 23529, United States
| | - Marco Tonelli
- National Magnetic Resonance Facility at Madison (NMRFAM), University of Wisconsin-Madison , Madison, Wisconsin 53706, United States
| | - Gabriel Cornilescu
- National Magnetic Resonance Facility at Madison (NMRFAM), University of Wisconsin-Madison , Madison, Wisconsin 53706, United States
| | - Dong Liu
- Department of Chemistry and Biochemistry, Old Dominion University , Norfolk, Virginia 23529, United States
| | - Lorelei J Hopersberger
- Department of Chemistry and Biochemistry, Old Dominion University , Norfolk, Virginia 23529, United States
| | - Komala Ponniah
- Department of Chemistry and Biochemistry, Old Dominion University , Norfolk, Virginia 23529, United States
| | - Steven M Pascal
- Department of Chemistry and Biochemistry, Old Dominion University , Norfolk, Virginia 23529, United States
| |
Collapse
|
8
|
Conserved 3' UTR stem-loop structure in L1 and Alu transposons in human genome: possible role in retrotransposition. BMC Genomics 2016; 17:992. [PMID: 27914481 PMCID: PMC5135761 DOI: 10.1186/s12864-016-3344-4] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2016] [Accepted: 11/25/2016] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND In the process of retrotransposition LINEs use their own machinery for copying and inserting themselves into new genomic locations, while SINEs are parasitic and require the machinery of LINEs. The exact mechanism of how a LINE-encoded reverse transcriptase (RT) recognizes its own and SINE RNA remains unclear. However it was shown for the stringent-type LINEs that recognition of a stem-loop at the 3'UTR by RT is essential for retrotransposition. For the relaxed-type LINEs it is believed that the poly-A tail is a common recognition element between LINE and SINE RNA. However polyadenylation is a property of any messenger RNA, and how the LINE RT recognizes transposon and non-transposon RNAs remains an open question. It is likely that RNA secondary structures play an important role in RNA recognition by LINE encoded proteins. RESULTS Here we selected a set of L1 and Alu elements from the human genome and investigated their sequences for the presence of position-specific stem-loop structures. We found highly conserved stem-loop positions at the 3'UTR. Comparative structural analyses of a human L1 3'UTR stem-loop showed a similarity to 3'UTR stem-loops of the stringent-type LINEs, which were experimentally shown to be recognized by LINE RT. The consensus stem-loop structure consists of 5-7 bp loop, 8-10 bp stem with a bulge at a distance of 4-6 bp from the loop. The results show that a stem loop with a bulge exists at the 3'-end of Alu. We also found conserved stem-loop positions at 5'UTR and at the end of ORF2 and discuss their possible role. CONCLUSIONS Here we presented an evidence for the presence of a highly conserved 3'UTR stem-loop structure in L1 and Alu retrotransposons in the human genome. Both stem-loops show structural similarity to the stem-loops of the stringent-type LINEs experimentally confirmed as essential for retrotransposition. Here we hypothesize that both L1 and Alu RNA are recognized by L1 RT via the 3'-end RNA stem-loop structure. Other conserved stem-loop positions in L1 suggest their possible functions in protein-RNA interactions but to date no experimental evidence has been reported.
Collapse
|
9
|
Okui S, Kawai G. In NMR tube transcription for rapid screening of RNA conformation. NUCLEOSIDES NUCLEOTIDES & NUCLEIC ACIDS 2015; 34:103-13. [PMID: 25621704 DOI: 10.1080/15257770.2014.964412] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Abstract
A simple method for rapid structure screening for RNA by NMR is proposed. Target RNA is transcribed in a NMR tube and its spectra are measured without purification. The proposed method, in NMR tube transcription or INTT, was applied for three RNAs for which NMR spectra have been measured by using the conventionally purified samples. By the real-time measuring, increase of imino proton signals and decrease of NTP signals can be observed. It was confirmed that INTT spectra are in general similar to those obtained by the conventional method. INTT can be used for the first-step screening of RNA folding.
Collapse
Affiliation(s)
- Saya Okui
- a Department of Life and Environmental Sciences, Faculty of Engineering , Chiba Institute of Technology , Narashino , Chiba , Japan
| | | |
Collapse
|
10
|
Hayashi Y, Kajikawa M, Matsumoto T, Okada N. Mechanism by which a LINE protein recognizes its 3' tail RNA. Nucleic Acids Res 2014; 42:10605-17. [PMID: 25143533 PMCID: PMC4176376 DOI: 10.1093/nar/gku753] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
LINEs mobilize their own copies via retrotransposition. LINEs can be divided into two types. One is a stringent type, which constitutes a majority of LINEs. The other is a relaxed type. To elucidate the molecular mechanism of retrotransposition, we used here two different zebrafish LINEs belonging to the stringent type. By using retrotransposition assays, we demonstrated that proteins (ORF2) encoded by an individual LINE recognize the cognate 3′ tail sequence of the LINE RNA strictly. By conducting in vitro binding assays with a variety of ORF2 proteins, we demonstrated that the region between the endonuclease and reverse transcriptase domains in ORF2 is the site at which the proteins bind the stem-loop structure of the 3′ tail RNA, showing that the strict recognition of the stem-loop structure by the cognate ORF2 protein is an important step in retrotransposition. This recognition can be bipartite, involving the general recognition of the stem by cTBR (conserved tail-binding region) of ORF2 and the specific recognition of the loop by vTBR (variable tail-binding region). This is the first report that clearly characterized the RNA-binding region in ORF2, providing the generality for the recognition mechanism of the RNA tail by the ORF2 protein encoded by LINEs.
Collapse
Affiliation(s)
- Yoshinori Hayashi
- Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, 4259-B-21 Nagatsuta-cho, Midori-ku, Yokohama, Kanagawa 226-8501, Japan
| | - Masaki Kajikawa
- Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, 4259-B-21 Nagatsuta-cho, Midori-ku, Yokohama, Kanagawa 226-8501, Japan
| | - Takuma Matsumoto
- Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, 4259-B-21 Nagatsuta-cho, Midori-ku, Yokohama, Kanagawa 226-8501, Japan
| | - Norihiro Okada
- Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, 4259-B-21 Nagatsuta-cho, Midori-ku, Yokohama, Kanagawa 226-8501, Japan Department of Life Sciences, National Cheng Kung University, Tainan 701, Taiwan Foundation for Advancement of International Science, Tsukuba 305-0821, Japan
| |
Collapse
|
11
|
Gardner DP, Ren P, Ozer S, Gutell RR. Statistical potentials for hairpin and internal loops improve the accuracy of the predicted RNA structure. J Mol Biol 2011; 413:473-83. [PMID: 21889515 DOI: 10.1016/j.jmb.2011.08.033] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2011] [Revised: 08/12/2011] [Accepted: 08/16/2011] [Indexed: 01/19/2023]
Abstract
RNA is directly associated with a growing number of functions within the cell. The accurate prediction of different RNA higher-order structures from their nucleic acid sequences will provide insight into their functions and molecular mechanics. We have been determining statistical potentials for a collection of structural elements that is larger than the number of structural elements determined with experimentally determined energy values. The experimentally derived free energies and the statistical potentials for canonical base-pair stacks are analogous, demonstrating that statistical potentials derived from comparative data can be used as an alternative energetic parameter. A new computational infrastructure-RNA Comparative Analysis Database (rCAD)-that utilizes a relational database was developed to manipulate and analyze very large sequence alignments and secondary-structure data sets. Using rCAD, we determined a richer set of energetic parameters for RNA fundamental structural elements including hairpin and internal loops. A new version of RNAfold was developed to utilize these statistical potentials. Overall, these new statistical potentials for hairpin and internal loops integrated into the new version of RNAfold demonstrated significant improvements in the prediction accuracy of RNA secondary structure.
Collapse
Affiliation(s)
- David P Gardner
- Center for Computational Biology and Bioinformatics, Section of Integrative Biology in the School of Biological Sciences, University of Texas at Austin, Austin, TX 78712, USA
| | | | | | | |
Collapse
|
12
|
Tong C, Guo B, He S. Bead-probe complex capture a couple of SINE and LINE family from genomes of two closely related species of East Asian cyprinid directly using magnetic separation. BMC Genomics 2009; 10:83. [PMID: 19224649 PMCID: PMC2653535 DOI: 10.1186/1471-2164-10-83] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2008] [Accepted: 02/19/2009] [Indexed: 11/16/2022] Open
Abstract
Background Short and long interspersed elements (SINEs and LINEs, respectively), two types of retroposons, are active in shaping the architecture of genomes and powerful tools for studies of phylogeny and population biology. Here we developed special protocol to apply biotin-streptavidin bead system into isolation of interspersed repeated sequences rapidly and efficiently, in which SINEs and LINEs were captured directly from digested genomic DNA by hybridization to bead-probe complex in solution instead of traditional strategy including genomic library construction and screening. Results A new couple of SINEs and LINEs that shared an almost identical 3'tail was isolated and characterized in silver carp and bighead carp of two closely related species. These SINEs (34 members), designated HAmo SINE family, were little divergent in sequence and flanked by obvious TSD indicated that HAmo SINE was very young family. The copy numbers of this family was estimated to 2 × 105 and 1.7 × 105 per haploid genome by Real-Time qPCR, respectively. The LINEs, identified as the homologs of LINE2 in other fishes, had a conserved primary sequence and secondary structures of the 3'tail region that was almost identical to that of HAmo SINE. These evidences suggest that HAmo SINEs are active and amplified recently utilizing the enzymatic machinery for retroposition of HAmoL2 through the recognition of higher-order structures of the conserved 42-tail region. We analyzed the possible structures of HAmo SINE that lead to successful amplification in genome and then deduced that HAmo SINE, SmaI SINE and FokI SINE that were similar in sequence each other, were probably generated independently and created by LINE family within the same lineage of a LINE phylogeny in the genomes of different hosts. Conclusion The presented results show the advantage of the novel method for retroposons isolation and a pair of young SINE family and its partner LINE family in two carp fishes, which strengthened the hypotheses containing the slippage model for initiation of reverse transcription, retropositional parasitism of SINEs on LINEs, the formation of the stem loop structure in 3'tail region of some SINEs and LINEs and the mechanism of template switching in generating new SINE family.
Collapse
Affiliation(s)
- Chaobo Tong
- Laboratory of Fish Phylogenetics and Biogeography, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, PR China.
| | | | | |
Collapse
|
13
|
Matveev V, Okada N. Retroposons of salmonoid fishes (Actinopterygii: Salmonoidei) and their evolution. Gene 2008; 434:16-28. [PMID: 18590946 DOI: 10.1016/j.gene.2008.04.022] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2008] [Revised: 04/28/2008] [Accepted: 04/29/2008] [Indexed: 11/27/2022]
Abstract
Short and long retroposons, or non-LTR retrotransposons (SINEs and LINEs, respectively) are two groups of interspersed repetitive elements amplifying in the genome via RNA and cDNA-mediated reverse transcription. In this process, SINEs entirely depend on the enzymatic machinery of autonomous LINEs. The impact of retroposons on the host genome is difficult to overestimate: their sequences account for significant portion of the eukaryotic genome, while propagation of their active copies gradually reshapes it. In this way, the retropositional activity plays a role of important evolutionary factor. More than 100 LINE and nearly 100 SINE families have been described to date from the genomes of various eukaryotes, and it is salmonoid fishes (Actinopterygii: Salmonoidei) that are particularly noticeable for the diversity of transposons they host in their genomes, including two LINE and seven SINE families. Moreover, this group of ray-finned fish represents an excellent opportunity to study such a rare evolutionary phenomenon as lateral gene transfer, due to a great variety of transposons and other sequences salmons share with a blood fluke, Schistosoma japonicum (Trematoda: Strigeiformes)--a parasitic helminth infecting various vertebrates. The aim of the present review is to structure all knowledge accumulated about salmonoid retroposons by now, as well as to complement it with the new data pertaining to the distribution of some SINE families.
Collapse
Affiliation(s)
- Vitaliy Matveev
- Faculty of Bioscience and Biotechnology, Tokyo Institute of Technology, Yokohama, Japan
| | | |
Collapse
|
14
|
Matveev V, Nishihara H, Okada N. Novel SINE families from salmons validate Parahucho (Salmonidae) as a distinct genus and give evidence that SINEs can incorporate LINE-related 3'-tails of other SINEs. Mol Biol Evol 2007; 24:1656-66. [PMID: 17470437 DOI: 10.1093/molbev/msm083] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Short interspersed elements (SINEs) constitute a group of retroposons propagating in the genome via a mechanism of reverse transcription, in which they depend on the enzymatic machinery of long retroposons (LINEs). Over 70 SINE families have been described to date from the genomes of various eukaryotes. Here, we characterize two novel SINEs from salmons (Actinopterygii: Salmonoidei). The first family, termed SlmI, was shown to be widespread among all genera of the suborder. These SINEs have a tRNA(Leu)-related promoter region at their 5'-end, a unique central conserved domain with a subfamily-specific region, and an end with RSg-1-LINE-derived 3'-terminus preceding the A/T-rich tail. The same LINE-related segment is also shared by two other salmonid SINEs: HpaI and OS-SINE1. The structural peculiarities and overall sequence identity of the SlmI 3'-terminus suggest that it has been acquired from HpaI SINEs but not directly from the partner LINE. This region plays a crucial role in the process of retrotransposition of short interspersed elements, and the case of its SINE-to-SINE transmission is the first recorded to date. Possible scenarios and potential evolutionary implications of the observed interaction between short retroposons are discussed. Apart from the above, we found a copy of the SlmI SINE in the GenBank entry for the blood fluke, Schistosoma japonicum (Trematoda: Strigeiformes) -- a trematode causing one of the most important human helminth infections, with its genome known to host other groups of salmonoid retroposons. In the present article, we suggest our views with regard to possible ways in which such an intensive horizontal transfer of salmonoid retroposons to the schistosomal genome occurs. The second novel SINE family, termed SlmII, originates from one of the SlmI subfamilies, with which it shares the same tRNA-related region, central domain, and a part of RSg-1-derived segment, but has a different 3'-tail of unidentified origin. Its distribution among salmonids validates Parahucho (Japanese huchen) as a distinct monotypic genus.
Collapse
Affiliation(s)
- Vitaliy Matveev
- Faculty of Bioscience and Biotechnology, Department of Biological Sciences, Tokyo Institute of Technology, Yokohama, Japan
| | | | | |
Collapse
|
15
|
Nomura Y, Kajikawa M, Baba S, Nakazato S, Imai T, Sakamoto T, Okada N, Kawai G. Solution structure and functional importance of a conserved RNA hairpin of eel LINE UnaL2. Nucleic Acids Res 2006; 34:5184-93. [PMID: 17000640 PMCID: PMC1636434 DOI: 10.1093/nar/gkl664] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
The eel long interspersed element (LINE) UnaL2 and its partner short interspersed element (SINE) share a conserved 3′ tail that is critical for their retrotransposition. The predicted secondary structure of the conserved 3′ tail of UnaL2 RNA contains a stem region with a putative internal loop. Deletion of the putative internal loop region abolishes UnaL2 mobilization, indicating that this putative internal loop is required for UnaL2 retrotransposition; the exact role of the putative internal loop in retrotransposition, however, has not been elucidated. To establish a structure-based foundation on which to address the issue of the putative internal loop function in retrotransposition, we used NMR to determine the solution structure of a 36 nt RNA derived from the 3′ conserved tail of UnaL2. The region forms a compact structure containing a single bulged cytidine and a U–U mismatch. The bulge and mismatch region have conformational flexibility and molecular dynamics simulation indicate that the entire stem of the 3′ conserved tail RNA can anisotropically fluctuate at the bulge and mismatch region. Our structural and mutational analyses suggest that stem flexibility contributes to UnaL2 function and that the bulged cytidine and the U–U mismatch are required for efficient retrotransposition.
Collapse
Affiliation(s)
| | - Masaki Kajikawa
- Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology4259-B-21 Nagatsuta-cho, Midori-ku, Yokohama, Kanagawa 226-8501, Japan
| | | | | | | | | | - Norihiro Okada
- Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology4259-B-21 Nagatsuta-cho, Midori-ku, Yokohama, Kanagawa 226-8501, Japan
- Department of Evolutionary Biology and Biodiversity, National Institute for Basic Biology38 Nishigonaka, Myodaiji-cho, Okazaki, Aichi 444-8585, Japan
| | - Gota Kawai
- To whom correspondence should be addressed. Tel/Fax: +81 47 478 0425;
| |
Collapse
|
16
|
Piskurek O, Austin CC, Okada N. Sauria SINEs: Novel short interspersed retroposable elements that are widespread in reptile genomes. J Mol Evol 2006; 62:630-44. [PMID: 16612539 DOI: 10.1007/s00239-005-0201-5] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2005] [Accepted: 12/16/2005] [Indexed: 10/24/2022]
Abstract
SINEs are short interspersed retrotransposable elements that invade new genomic sites. Their retrotransposition depends on reverse transcriptase and endonuclease activities encoded by partner LINEs (long interspersed elements). Recent genomic research has demonstrated that retroposons account for at least 40% of the human genome. Hitherto, more than 30 families of SINEs have been characterized in mammalian genomes, comprising approximately 4600 extant species; the distribution and extent of SINEs in reptilian genomes, however, are poorly documented. With more than 7400 species of lizards and snakes, Squamata constitutes the largest and most diverse group of living reptiles. We have discovered and characterized a novel SINE family, Sauria SINEs, whose members are widely distributed among genomes of lizards, snakes, and tuataras. Sauria SINEs comprise a 5' tRNA-related region, a tRNA-unrelated region, and a 3' tail region (containing short tandem repeats) derived from LINEs. We distinguished eight Sauria SINE subfamilies in genomes of four major squamate lineages and investigated their evolutionary relationships. Our data illustrate the overall efficacy of Sauria SINEs as novel retrotransposable markers for elucidation of squamate evolutionary history. We show that all Sauria SINEs share an identical 3' sequence with Bov-B LINEs and propose that they utilize the enzymatic machinery of Bov-B LINEs for their own retrotransposition. This finding, along with the ubiquity of Bov-B LINEs previously demonstrated in squamate genomes, suggests that these LINEs have been an active partner of Sauria SINEs since this SINE family was generated more than 200 million years ago.
Collapse
Affiliation(s)
- Oliver Piskurek
- Faculty of Bioscience and Biotechnology, Department of Biological Sciences, Tokyo Institute of Technology, 4259 Nagatsuta-cho, Yokohama, 226-8501, Japan
| | | | | |
Collapse
|
17
|
Kawagoe-Takaki H, Nameki N, Kajikawa M, Okada N. Probing the secondary structure of salmon SmaI SINE RNA. Gene 2006; 365:67-73. [PMID: 16356664 DOI: 10.1016/j.gene.2005.09.027] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2005] [Revised: 08/11/2005] [Accepted: 09/07/2005] [Indexed: 11/24/2022]
Abstract
SmaI is a short interspersed element (SINE) of the salmon genome, and is derived from tRNA(Lys). We probed the secondary structure of SmaI SINE RNA by enzymatic cleavage and found that the RNA structure comprises three separate domains. The 5'-terminal region (the 5' domain) forms a tRNA-like cloverleaf structure, whereas the 3'-terminal region (the 3' domain) forms an extended stem-loop. The loop region is thought to be recognized by the reverse transcriptase (RT) encoded by the long interspersed element (LINE). The two structural domains are linked by a single-stranded region (the linker domain). Our melting profile analyses indicated the presence of two structural domains having different thermal stabilities, thus supporting the domain composition described above. Based on these results, we discuss the structural generality and evolutionary advantage of the domain composition of SINE RNA.
Collapse
Affiliation(s)
- Hiroko Kawagoe-Takaki
- Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, Yokohama, Kanagawa 226-8501, Japan
| | | | | | | |
Collapse
|
18
|
Grajcar L, El Amri C, Ghomi M, Fermandjian S, Huteau V, Mandel R, Lecomte S, Baron MH. Assessment of adenyl residue reactivity within model nucleic acids by surface enhanced Raman spectroscopy. Biopolymers 2006; 82:6-28. [PMID: 16425174 DOI: 10.1002/bip.20455] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
We rank the reactivity of the adenyl residues (A) of model DNA and RNA molecules with electropositive subnano size [Ag]n+ sites as a function of nucleic acid primary sequences and secondary structures and in the presence of biological amounts of Cl- and Na+ or Mg2+ ions. In these conditions A is markedly more reactive than any other nucleic acid bases. A reactivity is higher in ribo (r) than in deoxyribo (d) species [pA>pdA and (pA)n>>(pdA)n]. Base pairing decreases A reactivity in corresponding duplexes but much less in r than in d. In linear single and paired dCAG or dGAC loci, base stacking inhibits A reactivity even if A is bulged or mispaired (A.A). dA tracts are highly reactive only when dilution prevents self-association and duplex structures. In d hairpins the solvent-exposed A residues are reactive in CAG and GAC triloops and even more in ATC loops. Among the eight rG1N2R3A4 loops, those bearing a single A (A4) are the least reactive. The solvent-exposed A2 is reactive, but synergistic structural transitions make the initially stacked A residues of any rGNAA loop much more reactive. Mg2+ cross-bridging single strands via phosphates may screen A reactivity. In contrast d duplexes cross-bridging enables "A flipping" much more in rA.U pairs than in dA.T. Mg2+ promotes A reactivity in unpaired strands. For hairpins Mg2+ binding stabilizes the stems, but according to A position in the loops, A reactivity may be abolished, reduced, or enhanced. It is emphasized that not only accessibility but also local flexibility, concerted docking, and cation and anion binding control A reactivity.
Collapse
Affiliation(s)
- Lydie Grajcar
- Laboratoire de Dynamique Interactions et Réactivité, UMR 7075, Université Paris 6 CNRS, 2 rue Henri Dunant, 94320, Thiais, France
| | | | | | | | | | | | | | | |
Collapse
|
19
|
Sugano T, Kajikawa M, Okada N. Isolation and characterization of retrotransposition-competent LINEs from zebrafish. Gene 2005; 365:74-82. [PMID: 16356661 DOI: 10.1016/j.gene.2005.09.037] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2005] [Revised: 09/01/2005] [Accepted: 09/27/2005] [Indexed: 11/30/2022]
Abstract
Long interspersed elements (LINEs) are a type of retroposon and are widely distributed in most eukaryotic genomes. LINEs are classified into two groups, the stringent type and relaxed type, based on the recognition of the 3' tail of their own RNA by reverse transcriptase (RT) during retrotransposition. Although most LINEs are thought to belong to the stringent type, retrotransposition studies of the stringent type LINEs are relatively limited compared with those of the relaxed type. We have now isolated two retrotransposition-competent LINEs (ZfL2-1 and ZfL2-2) from the zebrafish genome. Both ZfL2-1 and ZfL2-2 are members of the L2 clade; ZfL2-1 encodes two open reading frames (ORFs) and ZfL2-2 encodes one ORF, and each of the ORFs is required for retrotransposition. Using a retrotransposition assay in HeLa cells, we established that both ZfL2-1 and Zfl2-2 belong to the stringent type. We also demonstrated that an esterase (ES) domain encoded by ZfL2-1 ORF1 strongly enhances its own retrotransposition. The ES domain is encoded only in ORF1 of LINEs classified in the CR1 and L2 clades, although its function or significance in retrotransposition has not been elucidated. Thus, this is the first experimental evidence that the ES domain has an enhancing function during retrotransposition. These zebrafish LINEs will be useful for determining the function of ORF1 and the retrotransposition mechanism of stringent-type LINEs.
Collapse
Affiliation(s)
- Tomohiro Sugano
- Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, Yokohama, Kanagawa 226-8501, Japan
| | | | | |
Collapse
|
20
|
Kajikawa M, Ichiyanagi K, Tanaka N, Okada N. Isolation and Characterization of Active LINE and SINEs from the Eel. Mol Biol Evol 2004; 22:673-82. [PMID: 15548748 DOI: 10.1093/molbev/msi054] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Long interspersed elements (LINEs) and short interspersed elements (SINEs) are retrotransposons. These elements can mobilize by the "copy-and-paste" mechanism, in which their own RNA is reverse-transcribed into complementary DNA (cDNA). LINEs and SINEs not only are components of eukaryotic genomes but also drivers of genomic evolution. Thus, studies of the amplification mechanism of LINEs and SINEs are important for understanding eukaryotic genome evolution. Here we report the characterization of one LINE family (UnaL2) and two SINE families (UnaSINE1 and UnaSINE2) from the eel (Anguilla japonica) genome. UnaL2 is approximately 3.6 kilobases (kb) and encodes only one open reading frame (ORF). UnaL2 belongs to the stringent type--thought to be a major group of LINEs--and can mobilize in HeLa cells. We also show that UnaL2 and the two UnaSINEs have similar 3' tails, and that both UnaSINE1 and UnaSINE2 can be mobilized by UnaL2 in HeLa cells. These elements are thus useful for delineating the amplification mechanism of stringent type LINEs as well as that of SINEs.
Collapse
Affiliation(s)
- Masaki Kajikawa
- Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, Midori-ku, Yokohama, Kanagawa 226-8501, Japan
| | | | | | | |
Collapse
|