1
Frenkel-Morgenstern M. Identification of Chimeric RNAs Using RNA-Seq Reads and Protein-Protein Interactions of Translated Chimeras. Methods Mol Biol 2020; 2079:27-40. [PMID: 31728960 DOI: 10.1007/978-1-4939-9904-0_3]
Abstract
Chimeric RNA moieties typically consist of exons from two genes expressed from different genomic locations and produced by chromosomal translocations, trans-splicing or transcription errors. Recent advances in next-generation sequencing procedures have opened new horizons for identification of novel chimeric transcripts in various diseases in a personalized manner. Here we describe the detailed computational procedures to identify chimeric transcripts using RNA-seq reads. Moreover, we elaborate on the domain-domain co-occurrence method to detect alterations in chimeric protein-protein interaction (ChiPPI) networks produced by chimeric RNA that are translated to chimeric proteins.
2
Mayer MG, Floeter-Winter LM. Identification of SL addition trans-splicing acceptor sites in the internal transcribed spacer I region of pre-rRNA in Leishmania (Leishmania) amazonensis. Mem Inst Oswaldo Cruz 2013; 107:1070-2. [PMID: 23295762 DOI: 10.1590/s0074-02762012000800019] [Received: 04/20/2012] [Accepted: 07/28/2012]
Abstract
Trypanosomatidae is a family of early branching eukaryotes harbouring a distinctive repertoire of gene expression strategies. Functional mature messenger RNA is generated via the trans-splicing and polyadenylation processing of constitutively transcribed polycistronic units. Recently, trans-splicing of pre-small subunit ribosomal RNA in the 5' external transcribed spacer region and of precursor tRNAsec has been described. Here, we used a previously validated semi-nested reverse transcription-polymerase chain reaction strategy to investigate internal transcribed spacer (ITS) I acceptor sites in total RNA from Leishmania (Leishmania) amazonensis. Two distinct spliced leader-containing RNAs were detected, indicating that trans-splicing reactions occur at two AG acceptor sites mapped in this ITS region. These data provide further evidence of the wide spectrum of RNA molecules that act as trans-splicing acceptors in trypanosomatids.
3
Bompfünewerer AF, Flamm C, Fried C, Fritzsch G, Hofacker IL, Lehmann J, Missal K, Mosig A, Müller B, Prohaska SJ, Stadler BMR, Stadler PF, Tanzer A, Washietl S, Witwer C. Evolutionary patterns of non-coding RNAs. Theory Biosci 2005; 123:301-69. [PMID: 18202870 DOI: 10.1016/j.thbio.2005.01.002] [Received: 12/22/2004] [Accepted: 01/24/2005]
Abstract
A plethora of new functions of non-coding RNAs (ncRNAs) have been discovered in the past few years. In fact, RNA is emerging as the central player in cellular regulation, taking on active roles in multiple regulatory layers from transcription, RNA maturation, and RNA modification to translational regulation. Nevertheless, very little is known about the evolution of this "Modern RNA World" and its components. In this contribution, we attempt to provide at least a cursory overview of the diversity of ncRNAs and functional RNA motifs in non-translated regions of regular messenger RNAs (mRNAs) with an emphasis on evolutionary questions. This survey is complemented by an in-depth analysis of examples from different classes of RNAs focusing mostly on their evolution in the vertebrate lineage. We present a survey of Y RNA genes in vertebrates and study the molecular evolution of the U7 snRNA, the snoRNAs E1/U17, E2, and E3, the Y RNA family, the let-7 microRNA (miRNA) family, and the mRNA-like evf-1 gene. We furthermore discuss the statistical distribution of miRNAs in metazoans, which suggests an explosive increase in the miRNA repertoire in vertebrates. The analysis of the transcription of ncRNAs suggests that small RNAs in general are genetically mobile in the sense that their association with a host gene (e.g. when transcribed from introns of an mRNA) can change on evolutionary time scales. The let-7 family demonstrates that even the mode of transcription (as intron or as exon) can change among paralogous ncRNAs.
4
Frenkel-Morgenstern M, Lacroix V, Ezkurdia I, Levin Y, Gabashvili A, Prilusky J, Del Pozo A, Tress M, Johnson R, Guigo R, Valencia A. Chimeras taking shape: potential functions of proteins encoded by chimeric RNA transcripts. Genome Res 2012; 22:1231-42. [PMID: 22588898 PMCID: PMC3396365 DOI: 10.1101/gr.130062.111]
Abstract
Chimeric RNAs comprise exons from two or more different genes and have the potential to encode novel proteins that alter cellular phenotypes. To date, numerous putative chimeric transcripts have been identified among the ESTs isolated from several organisms and using high throughput RNA sequencing. The few corresponding protein products that have been characterized mostly result from chromosomal translocations and are associated with cancer. Here, we systematically establish that some of the putative chimeric transcripts are genuinely expressed in human cells. Using high throughput RNA sequencing, mass spectrometry experimental data, and functional annotation, we studied 7424 putative human chimeric RNAs. We confirmed the expression of 175 chimeric RNAs in 16 human tissues, with an abundance varying from 0.06 to 17 RPKM (Reads Per Kilobase per Million mapped reads). We show that these chimeric RNAs are significantly more tissue-specific than non-chimeric transcripts. Moreover, we present evidence that chimeras tend to incorporate highly expressed genes. Despite the low expression level of most chimeric RNAs, we show that 12 novel chimeras are translated into proteins detectable in multiple shotgun mass spectrometry experiments. Furthermore, we confirm the expression of three novel chimeric proteins using targeted mass spectrometry. Finally, based on our functional annotation of exon organization and preserved domains, we discuss the potential features of chimeric proteins with illustrative examples and suggest that chimeras significantly exploit signal peptides and transmembrane domains, which can alter the cellular localization of cognate proteins. Taken together, these findings establish that some chimeric RNAs are translated into potentially functional proteins in humans.
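The RPKM unit quoted in this abstract can be illustrated with a minimal sketch. The function name, its parameters, and the example numbers below are illustrative assumptions, not values or code from the paper; only the definition itself (reads per kilobase of transcript per million mapped reads) comes from the abstract.

```python
def rpkm(read_count: int, transcript_length_bp: int, total_mapped_reads: int) -> float:
    """Reads Per Kilobase of transcript per Million mapped reads.

    RPKM = reads / ((length_bp / 1e3) * (total_mapped_reads / 1e6))
         = reads * 1e9 / (length_bp * total_mapped_reads)
    """
    if transcript_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("length and total mapped reads must be positive")
    return read_count * 1e9 / (transcript_length_bp * total_mapped_reads)


# Hypothetical example: 34 reads on a 2,000 bp transcript in a library
# of 10 million mapped reads gives roughly 1.7 RPKM.
example_value = rpkm(34, 2000, 10_000_000)
print(f"{example_value:.2f} RPKM")
```

Normalizing by both transcript length and library size is what allows abundances such as the 0.06 to 17 RPKM range above to be compared across transcripts and tissues.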
5
Abstract
In eukaryotes, RNA trans-splicing is an important form of RNA processing for the end-to-end ligation of primary transcripts that are derived from separately transcribed exons. So far, three different categories of RNA trans-splicing have been found in organisms ranging from algae to man. Here, we review one of these categories: the trans-splicing of discontinuous group II introns, which occurs in chloroplasts and mitochondria of lower eukaryotes and plants. Trans-spliced exons can be predicted from DNA sequences derived from a large number of sequenced organelle genomes. Further molecular genetic analysis of mutants has unravelled proteins, some of which are part of high-molecular-weight complexes that promote the splicing process. Based on data derived from the alga Chlamydomonas reinhardtii, a model is provided which defines the composition of an organelle spliceosome. This will have a general relevance for understanding the function of RNA-processing machineries in eukaryotic organelles.
Affiliation(s)
- Stephanie Glanz
- Lehrstuhl für Allgemeine und Molekulare Botanik, Ruhr-Universität Bochum, Bochum, Germany
6
Abstract
Current knowledge about the variety and complexity of the processes that allow regulated gene expression in living organisms calls for a new understanding of genes. A 'postgenomic' understanding of genes as entities constituted during genome expression is outlined and illustrated with specific examples that formed part of a survey research instrument developed by two of the authors for an ongoing empirical study of conceptual change in contemporary biology.
Affiliation(s)
- Karola C Stotz
- Cognitive Science Program, Indiana University Bloomington, Bloomington, IN 47406-7512, USA.
7
Zeng XC, Luo F, Li WX. Characterization of a novel cDNA encoding a short venom peptide derived from venom gland of scorpion Buthus martensii Karsch: trans-splicing may play an important role in the diversification of scorpion venom peptides. Peptides 2006; 27:675-81. [PMID: 16150513 DOI: 10.1016/j.peptides.2005.07.016] [Received: 03/30/2005] [Revised: 07/29/2005] [Accepted: 07/29/2005]
Abstract
A novel cDNA clone (named BmKT-u) which is a hybrid molecule of the 5'-terminal region of BmKT' cDNA and the 3'-terminal region of an undocumented cDNA (named BmKu), was isolated from a cDNA library made from the venom gland of scorpion Buthus martensii Karsch. BmKT-u codes for a 30 amino acid residue precursor peptide composed of a 20-residue signal sequence, and a putative 10-residue novel mature peptide. Northern blot hybridization showed BmKT-u cDNA is generated from a transcript. RT-PCR experiments excluded the possibility that BmKT-u cDNA is an artifact generated during reverse transcription. Genomic amplifications performed with three pairs of BmKT-u gene-specific primers showed the BmKT-u gene does not exist in the genome of the scorpion as a single transcriptional unit. Genomic cloning for BmKT' showed that the BmKT' gene contains an intron of 509 bp inserted into the region encoding the C-terminal region of the signal peptide. A sequence alignment comparison of the cDNA of BmKT-u with genomic BmKT' revealed that the junction site of the hybrid molecule is located at the 5'-splicing site of the intron. The data suggest that the BmKT-u transcript is a naturally occurring mature mRNA that is generated by trans-splicing. Trans-splicing may contribute to the diversity of venom peptides from venomous animals.
Affiliation(s)
- Xian-Chun Zeng
- State Key Laboratory of Virology, Institute of Virology, Department of Biotechnology, College of Life Sciences, Wuhan University, Wuhan 430072, PR China.
8
Taniguchi G, Yamamoto N, Tsuchida H, Umino A, Shimazu D, Sakurai SI, Takebayashi H, Nishikawa T. Cloning of a D-serine-regulated transcript dsr-2 from rat cerebral neocortex. J Neurochem 2005; 95:1541-9. [PMID: 16277602 DOI: 10.1111/j.1471-4159.2005.03535.x]
Abstract
D-serine is now considered to be an endogenous co-agonist of the NMDA receptor in mammalian brain. To obtain insight into the molecular mechanisms underlying D-serine metabolism and function, we explored transcripts that are responsive to D-serine in the neocortex of the 8-day-old infant rat by a differential cloning technique, RNA arbitrarily primed PCR. We isolated a novel D-serine inducible transcript, D-serine-responsive transcript-2 (dsr-2), that was exclusively expressed in the brain. Sequence analysis of the corresponding cDNAs to the transcript revealed that the dsr-2 mRNA consists of 7199 nucleotides with an open reading frame encoding 111 amino acids. The dsr-2 gene was located on the reverse strand within an intron of the neurexin-3alpha gene, mapped to rat chromosome 6q24-31. The regional distribution of the basal expression of dsr-2 and its ontogenic changes in the brain closely correlated with those of free D-serine and of NMDA receptor R2B subunit mRNA, but were somewhat different from those of the neurexin-3alpha transcript. These findings suggest that dsr-2 may be involved in D-serine metabolism and/or function, and in the interactions between D-serine, NMDA receptor and neurexin-3alpha, in mammalian brain.
Affiliation(s)
- Go Taniguchi
- Section of Psychiatry and Behavioral Sciences, Tokyo Medical and Dental University Graduate School, Tokyo, Japan
9
Zhang Z, Zhang Y, Shi B, Deng W, Zhao Y, Chen R. Detecting chimeric 5′/3′UTRs with cross-chromosomal splicing by bioinformatics. Chinese Science Bulletin 2004. [DOI: 10.1007/bf03184036]
10
Krauss V, Dorn R. Evolution of the trans-splicing Drosophila locus mod(mdg4) in several species of Diptera and Lepidoptera. Gene 2004; 331:165-76. [PMID: 15094203 DOI: 10.1016/j.gene.2004.02.019] [Received: 10/30/2003] [Revised: 02/03/2004] [Accepted: 02/09/2004]
Abstract
The modifier of mdg4 (mod[mdg4]) locus of Drosophila melanogaster (Dme) encodes chromatin proteins which are involved in position effect variegation, establishment of chromatin boundaries, nerve pathfinding, meiotic chromosome pairing and apoptosis. It was recently shown that mRNA trans-splicing is involved in the generation of at least 26 different mod(mdg4) transcripts. Here, we show that a similarly complex mod(mdg4) locus exists in Drosophila pseudoobscura (Dps), Drosophila virilis (Dvi), Anopheles gambiae (Aga) and Bombyx mori (Bmo). As in D. melanogaster, most isoforms of these species contain a strongly conserved BTB/POZ domain (hereafter referred to as BTB domain) within the common N-terminal part and a Cys(2)His(2) motif-containing FLYWCH domain within the isoform-specific C-terminal parts. By sequence comparison, we identified six novel isoforms in D. melanogaster and show that altogether 31 isoforms are perfectly conserved by sequence and position in the mod(mdg4) locus of the Drosophila species analyzed. We found significant differences in evolutionary speed of synonymous/nonsynonymous divergence between the various isoform-specific exons. These results were extended by tree reconstruction analysis based on the evolved FLYWCH domains of predicted Mod(mdg4) proteins in Drosophila and Anopheles. Comparative analysis of mod(mdg4) gene structure in species of dipterans implies that several internal inversions occurred, making the mRNA trans-splicing mechanism indispensable for mod(mdg4) expression. Finally, we propose a model for the evolution of trans-splicing implementing effective regulation of many alternative gene products in a composite gene structure.
Affiliation(s)
- Veiko Krauss
- Department of Genetics, University of Leipzig, Johannisallee 21-23, 04103 Leipzig, Germany
11
Mattick JS. Challenging the dogma: the hidden layer of non-protein-coding RNAs in complex organisms. Bioessays 2003; 25:930-9. [PMID: 14505360 DOI: 10.1002/bies.10332]
Abstract
The central dogma of biology holds that genetic information normally flows from DNA to RNA to protein. As a consequence, it has been generally assumed that genes code for proteins, and that proteins fulfil not only most structural and catalytic but also most regulatory functions, in all cells, from microbes to mammals. However, the latter may not be the case in complex organisms. A number of startling observations about the extent of non-protein-coding RNA (ncRNA) transcription in the higher eukaryotes and the range of genetic and epigenetic phenomena that are RNA-directed suggest that the traditional view of the structure of genetic regulatory systems in animals and plants may be incorrect. ncRNA dominates the genomic output of the higher organisms and has been shown to control chromosome architecture, mRNA turnover and the developmental timing of protein expression, and may also regulate transcription and alternative splicing. This paper re-examines the available evidence and suggests a new framework for considering and understanding the genomic programming of biological complexity, autopoietic development and phenotypic variation.
Affiliation(s)
- John S Mattick
- ARC Special Research Centre for Functional and Applied Genomics, Institute for Molecular Bioscience, University of Queensland, St. Lucia QLD 4072, Australia.
12
Labrador M, Corces VG. Extensive exon reshuffling over evolutionary time coupled to trans-splicing in Drosophila. Genome Res 2003; 13:2220-8. [PMID: 14525924 PMCID: PMC403680 DOI: 10.1101/gr.1440703] [Received: 04/15/2003] [Accepted: 08/04/2003]
Abstract
The relative position of exons in genes can be altered only after large structural mutations. These mutations are frequently deleterious, impairing transcription, splicing, RNA stability, or protein function, as well as imposing strong inflexibility on protein evolution. Alternative cis- or trans-splicing may overcome the need for genomic structural stability, allowing genes to encode new proteins without the need to maintain a specific exon order. Trans-splicing in the Drosophila melanogaster modifier of mdg4 (mod[mdg4]) gene is the best documented example in which this process plays a major role in the maturation of mRNAs. Comparison of the genomic organization of this locus among several insect species suggests that the divergence between the lineages of the mosquito Anopheles gambiae and D. melanogaster involved an extensive exon rearrangement, requiring >11 breakpoints within the mod(mdg4) gene. The massive reorganization of the locus also included the deletion or addition of a new function as well as exon duplications. Whereas both DNA strands are sense strands in the Drosophila gene, the coding region in mosquito lies on a single strand, suggesting that trans-splicing may have originated in the Drosophila lineage and might have been the triggering factor for such a dramatic reorganization.
Affiliation(s)
- Mariano Labrador
- Department of Biology, Johns Hopkins University, Baltimore, Maryland 21218, USA