1
|
Noto JJ, Schmidt CA, Matera AG. Engineering and expressing circular RNAs via tRNA splicing. RNA Biol 2017; 14:978-984. [PMID: 28402213 DOI: 10.1080/15476286.2017.1317911] [Citation(s) in RCA: 71] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
Circular (circ)RNAs have recently become a subject of great biologic interest. It is now clear that they represent a diverse and abundant class of RNAs with regulated expression and evolutionarily conserved functions. There are several mechanisms by which RNA circularization can occur in vivo. Here, we focus on the biogenesis of tRNA intronic circular RNAs (tricRNAs) in archaea and animals, and we detail their use as research tools for orthogonal, directed circRNA expression in vivo.
Collapse
Affiliation(s)
- John J Noto
- a Curriculum in Genetics and Molecular Biology , The University of North Carolina at Chapel Hill , Chapel Hill , NC , USA.,b Integrative Program for Biological and Genome Sciences , The University of North Carolina at Chapel Hill , Chapel Hill , NC , USA
| | - Casey A Schmidt
- a Curriculum in Genetics and Molecular Biology , The University of North Carolina at Chapel Hill , Chapel Hill , NC , USA.,b Integrative Program for Biological and Genome Sciences , The University of North Carolina at Chapel Hill , Chapel Hill , NC , USA
| | - A Gregory Matera
- a Curriculum in Genetics and Molecular Biology , The University of North Carolina at Chapel Hill , Chapel Hill , NC , USA.,b Integrative Program for Biological and Genome Sciences , The University of North Carolina at Chapel Hill , Chapel Hill , NC , USA.,c Lineberger Comprehensive Cancer Center , The University of North Carolina at Chapel Hill , Chapel Hill , NC , USA.,d Department of Biology , The University of North Carolina at Chapel Hill , Chapel Hill , NC , USA.,e Department of Genetics , The University of North Carolina at Chapel Hill , Chapel Hill , NC , USA
| |
Collapse
|
2
|
Incorporating significant amino acid pairs and protein domains to predict RNA splicing-related proteins with functional roles. J Comput Aided Mol Des 2014; 28:49-60. [PMID: 24442949 DOI: 10.1007/s10822-014-9706-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2013] [Accepted: 01/07/2014] [Indexed: 12/20/2022]
Abstract
Machinery of pre-mRNA splicing is carried out through the interaction of RNA sequence elements and a variety of RNA splicing-related proteins (SRPs) (e.g. spliceosome and splicing factors). Alternative splicing, which is an important post-transcriptional regulation in eukaryotes, gives rise to multiple mature mRNA isoforms, which encodes proteins with functional diversities. However, the regulation of RNA splicing is not yet fully elucidated, partly because SRPs have not yet been exhaustively identified and the experimental identification is labor-intensive. Therefore, we are motivated to design a new method for identifying SRPs with their functional roles in the regulation of RNA splicing. The experimentally verified SRPs were manually curated from research articles. According to the functional annotation of Splicing Related Gene Database, the collected SRPs were further categorized into four functional groups including small nuclear Ribonucleoprotein, Splicing Factor, Splicing Regulation Factor and Novel Spliceosome Protein. The composition of amino acid pairs indicates that there are remarkable differences among four functional groups of SRPs. Then, support vector machines (SVMs) were utilized to learn the predictive models for identifying SRPs as well as their functional roles. The cross-validation evaluation presents that the SVM models trained with significant amino acid pairs and functional domains could provide a better predictive performance. In addition, the independent testing demonstrates that the proposed method could accurately identify SRPs in mammals/plants as well as effectively distinguish between SRPs and RNA-binding proteins. This investigation provides a practical means to identifying potential SRPs and a perspective for exploring the regulation of RNA splicing.
Collapse
|
3
|
Hsu JBK, Bretaña NA, Lee TY, Huang HD. Incorporating evolutionary information and functional domains for identifying RNA splicing factors in humans. PLoS One 2011; 6:e27567. [PMID: 22110674 PMCID: PMC3217973 DOI: 10.1371/journal.pone.0027567] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2011] [Accepted: 10/19/2011] [Indexed: 11/19/2022] Open
Abstract
Regulation of pre-mRNA splicing is achieved through the interaction of RNA sequence elements and a variety of RNA-splicing related proteins (splicing factors). The splicing machinery in humans is not yet fully elucidated, partly because splicing factors in humans have not been exhaustively identified. Furthermore, experimental methods for splicing factor identification are time-consuming and lab-intensive. Although many computational methods have been proposed for the identification of RNA-binding proteins, there exists no development that focuses on the identification of RNA-splicing related proteins so far. Therefore, we are motivated to design a method that focuses on the identification of human splicing factors using experimentally verified splicing factors. The investigation of amino acid composition reveals that there are remarkable differences between splicing factors and non-splicing proteins. A support vector machine (SVM) is utilized to construct a predictive model, and the five-fold cross-validation evaluation indicates that the SVM model trained with amino acid composition could provide a promising accuracy (80.22%). Another basic feature, amino acid dipeptide composition, is also examined to yield a similar predictive performance to amino acid composition. In addition, this work presents that the incorporation of evolutionary information and domain information could improve the predictive performance. The constructed models have been demonstrated to effectively classify (73.65% accuracy) an independent data set of human splicing factors. The result of independent testing indicates that in silico identification could be a feasible means of conducting preliminary analyses of splicing factors and significantly reducing the number of potential targets that require further in vivo or in vitro confirmation.
Collapse
Affiliation(s)
- Justin Bo-Kai Hsu
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsin-Chu, Taiwan
| | - Neil Arvin Bretaña
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, Taiwan
| | - Tzong-Yi Lee
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, Taiwan
- * E-mail: (T-YL); (H-DH)
| | - Hsien-Da Huang
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsin-Chu, Taiwan
- Department of Biological Science and Technology, National Chiao Tung University, Hsin-Chu, Taiwan
- Core Facility for Structural Bioinformatics, National Chiao Tung University, Hsin-Chu, Taiwan
- * E-mail: (T-YL); (H-DH)
| |
Collapse
|
4
|
Kamikawa R, Inagaki Y, Hashimoto T. A novel spliceosome-mediated trans-splicing can change our view on genome complexity of the divergent eukaryote Giardia intestinalis. Biophys Rev 2011; 3:193-197. [PMID: 28510047 DOI: 10.1007/s12551-011-0058-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2011] [Accepted: 09/29/2011] [Indexed: 11/30/2022] Open
Abstract
Although spliceosomal introns are an abundant landmark in eukaryotic genomes, the nuclear genome of the divergent eukaryote Giardia intestinalis, the causative agent of giardiasis, has been considered as "intron-poor" with only five canonical (cis-spliced) introns. However, three research groups (including ours) have independently reported a novel class of spliceosomal introns in the G. intestinalis genome. Three protein-coding genes are split into pieces in the G. intestinalis genome, and each of the partial coding regions was independently transcribed into polyadenylated premature mRNAs (pre-mRNAs). The two pre-mRNAs directly interact with each other by an intermolecular-stem structure formed between their non-coding portions, and are then processed into mature mRNAs by spliceosome-mediated trans-splicing. Here, we summarize the recently published works on split introns ("splintrons") in the G. intestinalis genome, and then provide our speculation on the functional property of the Giardia spliceosomes based on the putative ratio of splintrons to canonical introns. Finally, we discuss a scenario for the transition from typical GT-AG boundaries to non-typical AT-AC boundaries in a particular splintron of Giardia.
Collapse
Affiliation(s)
- Ryoma Kamikawa
- Center for Computational Sciences and Institute of Biological Sciences, University of Tsukuba, 1-1-1 Tennoudai, Tsukuba, Ibaraki, 305-8577, Japan.
| | - Yuji Inagaki
- Center for Computational Sciences and Institute of Biological Sciences, University of Tsukuba, 1-1-1 Tennoudai, Tsukuba, Ibaraki, 305-8577, Japan
| | - Tetsuo Hashimoto
- Center for Computational Sciences and Institute of Biological Sciences, University of Tsukuba, 1-1-1 Tennoudai, Tsukuba, Ibaraki, 305-8577, Japan
| |
Collapse
|
5
|
Gromoll J, Lahrmann L, Godmann M, Müller T, Michel C, Stamm S, Simoni M. Genomic checkpoints for exon 10 usage in the luteinizing hormone receptor type 1 and type 2. Mol Endocrinol 2007; 21:1984-96. [PMID: 17505059 DOI: 10.1210/me.2006-0506] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Alternative splicing is a hallmark of glycoprotein hormone receptor gene regulation, but its molecular mechanism is unknown. The LH receptor (LHR) gene possesses 11 exons, but exon 10 is constitutively skipped in the New World monkey lineage (LHR type 2), whereas it is constitutively spliced in the human (LHR type 1). This study identifies the regulatory elements of exon 10 usage. Sequencing of genomic marmoset DNA revealed that the cryptic LHR exon 10 is highly homologous to exon 10 from other species and displays intact splice sites. Functional studies using a minigene approach excluded the contribution of intronic, marmoset-specific long interspersed nucleotide-1 elements to exon 10 skipping. Sequencing of the genomic regions surrounding exon 10 from several primate lineages, sequence comparisons including the human and mouse LHR gene, revealed the presence of unique nucleotides at 3'-intronic position -19 and -10 and at position +26 within exon 10 of the marmoset LHR. Exon trap experiments and in vitro mutagenesis of these nucleotides resulted in the identification of a composite regulatory element of splicing consisting of cis-acting elements represented by two polypyrimidine tracts and a trans-acting element within exon 10, which affect the secondary RNA structure. Changes within this complex resulted either in constitutive exon inclusion, constitutive skipping, or alternative splicing of exon 10. This work delineates the molecular pathway leading to intronization of exon 10 in the LHR type 2 and reveals, for the first time, the essential function of regulatory and structural elements involved in glycoprotein hormone receptor splicing.
Collapse
Affiliation(s)
- Jörg Gromoll
- Institute of Reproductive Medicine, University Hospital, D-48129 Münster, Germany.
| | | | | | | | | | | | | |
Collapse
|
6
|
Abstract
Research into the origins of introns is at a critical juncture in the resolution of theories on the evolution of early life (which came first, RNA or DNA?), the identity of LUCA (the last universal common ancestor, was it prokaryotic- or eukaryotic-like?), and the significance of noncoding nucleotide variation. One early notion was that introns would have evolved as a component of an efficient mechanism for the origin of genes. But alternative theories emerged as well. From the debate between the "introns-early" and "introns-late" theories came the proposal that introns arose before the origin of genetically encoded proteins and DNA, and the more recent "introns-first" theory, which postulates the presence of introns at that early evolutionary stage from a reconstruction of the "RNA world." Here we review seminal and recent ideas about intron origins. Recent discoveries about the patterns and causes of intron evolution make this one of the most hotly debated and exciting topics in molecular evolutionary biology today.
Collapse
Affiliation(s)
- Francisco Rodríguez-Trelles
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697-2525, USA.
| | | | | |
Collapse
|
7
|
Barbosa-Morais NL, Carmo-Fonseca M, Aparício S. Systematic genome-wide annotation of spliceosomal proteins reveals differential gene family expansion. Genome Res 2005; 16:66-77. [PMID: 16344558 PMCID: PMC1356130 DOI: 10.1101/gr.3936206] [Citation(s) in RCA: 84] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
Although more than 200 human spliceosomal and splicing-associated proteins are known, the evolution of the splicing machinery has not been studied extensively. The recent near-complete sequencing and annotation of distant vertebrate and chordate genomes provides the opportunity for an exhaustive comparative analysis of splicing factors across eukaryotes. We describe here our semiautomated computational pipeline to identify and annotate splicing factors in representative species of eukaryotes. We focused on protein families whose role in splicing is confirmed by experimental evidence. We visually inspected 1894 proteins and manually curated 224 of them. Our analysis shows a general conservation of the core spliceosomal proteins across the eukaryotic lineage, contrasting with selective expansions of protein families known to play a role in the regulation of splicing, most notably of SR proteins in metazoans and of heterogeneous nuclear ribonucleoproteins (hnRNP) in vertebrates. We also observed vertebrate-specific expansion of the CLK and SRPK kinases (which phosphorylate SR proteins), and the CUG-BP/CELF family of splicing regulators. Furthermore, we report several intronless genes amongst splicing proteins in mammals, suggesting that retrotransposition contributed to the complexity of the mammalian splicing apparatus.
Collapse
Affiliation(s)
- Nuno L Barbosa-Morais
- University of Cambridge, Department of Oncology, Hutchison-MRC Research Centre, Cambridge CB2 2XZ, United Kingdom
| | | | | |
Collapse
|
8
|
Russell AG, Shutt TE, Watkins RF, Gray MW. An ancient spliceosomal intron in the ribosomal protein L7a gene (Rpl7a) of Giardia lamblia. BMC Evol Biol 2005; 5:45. [PMID: 16109161 PMCID: PMC1201135 DOI: 10.1186/1471-2148-5-45] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2005] [Accepted: 08/18/2005] [Indexed: 11/16/2022] Open
Abstract
Background Only one spliceosomal-type intron has previously been identified in the unicellular eukaryotic parasite, Giardia lamblia (a diplomonad). This intron is only 35 nucleotides in length and is unusual in possessing a non-canonical 5' intron boundary sequence, CT, instead of GT. Results We have identified a second spliceosomal-type intron in G. lamblia, in the ribosomal protein L7a gene (Rpl7a), that possesses a canonical GT 5' intron boundary sequence. A comparison of the two known Giardia intron sequences revealed extensive nucleotide identity at both the 5' and 3' intron boundaries, similar to the conserved sequence motifs recently identified at the boundaries of spliceosomal-type introns in Trichomonas vaginalis (a parabasalid). Based on these observations, we searched the partial G. lamblia genome sequence for these conserved features and identified a third spliceosomal intron, in an unassigned open reading frame. Our comprehensive analysis of the Rpl7a intron in other eukaryotic taxa demonstrates that it is evolutionarily conserved and is an ancient eukaryotic intron. Conclusion An analysis of the phylogenetic distribution and properties of the Rpl7a intron suggests its utility as a phylogenetic marker to evaluate particular eukaryotic groupings. Additionally, analysis of the G. lamblia introns has provided further insight into some of the conserved and unique features possessed by the recently identified spliceosomal introns in related organisms such as T. vaginalis and Carpediemonas membranifera.
Collapse
Affiliation(s)
- Anthony G Russell
- Program in Evolutionary Biology, Canadian Institute for Advanced Research, Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia B3H 1X5, Canada
| | - Timothy E Shutt
- Program in Evolutionary Biology, Canadian Institute for Advanced Research, Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia B3H 1X5, Canada
| | - Russell F Watkins
- Program in Evolutionary Biology, Canadian Institute for Advanced Research, Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia B3H 1X5, Canada
| | - Michael W Gray
- Program in Evolutionary Biology, Canadian Institute for Advanced Research, Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia B3H 1X5, Canada
| |
Collapse
|
9
|
Vanácová S, Yan W, Carlton JM, Johnson PJ. Spliceosomal introns in the deep-branching eukaryote Trichomonas vaginalis. Proc Natl Acad Sci U S A 2005; 102:4430-5. [PMID: 15764705 PMCID: PMC554003 DOI: 10.1073/pnas.0407500102] [Citation(s) in RCA: 76] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Eukaryotes have evolved elaborate splicing mechanisms to remove introns that would otherwise destroy the protein-coding capacity of genes. Nuclear premRNA splicing requires sequence motifs in the intron and is mediated by a ribonucleoprotein complex, the spliceosome. Here we demonstrate the presence of a splicing apparatus in the protist Trichomonas vaginalis and show that RNA motifs found in yeast and metazoan introns are required for splicing. We also describe the first introns in this deep-branching lineage. The positions of these introns are often conserved in orthologous genes, indicating they were present in a common ancestor of trichomonads, yeast, and metazoa. All examined T. vaginalis introns have a highly conserved 12-nt 3' splice-site motif that encompasses the branch point and is necessary for splicing. This motif is also found in the only described intron in a gene from another deep-branching eukaryote, Giardia intestinalis. These studies demonstrate the conservation of intron splicing signals across large evolutionary distances, reveal unexpected motif conservation in deep-branching lineages that suggest a simplified mechanism of splicing in primitive unicellular eukaryotes, and support the presence of introns in the earliest eukaryote.
Collapse
Affiliation(s)
- Stepánka Vanácová
- Department of Microbiology, Immunology, and Molecular Genetics, University of California, Los Angeles, CA 90095, USA
| | | | | | | |
Collapse
|
10
|
Abstract
Spliceosomal introns, one of the hallmarks of eukaryotic genomes, were thought to have originated late in evolution and were assumed not to exist in eukaryotes that diverged early -- until the discovery of a single intron with an aberrant splice boundary in the primitive 'protozoan' Giardia. Here we describe introns from a close relative of Giardia, Carpediemonas membranifera, that have boundary sequences of the normal eukaryotic type, indicating that canonical introns are likely to have arisen very early in eukaryotic evolution.
Collapse
Affiliation(s)
- Alastair G B Simpson
- Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia, Canada.
| | | | | |
Collapse
|