1
|
Singh S, Shi X, Haddox S, Elfman J, Ahmad SB, Lynch S, Manley T, Piczak C, Phung C, Sun Y, Sharma A, Li H. RTCpredictor: identification of read-through chimeric RNAs from RNA sequencing data. Brief Bioinform 2024; 25:bbae251. [PMID: 38796690 PMCID: PMC11128028 DOI: 10.1093/bib/bbae251] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2023] [Revised: 03/30/2024] [Accepted: 05/09/2024] [Indexed: 05/28/2024] Open
Abstract
Read-through chimeric RNAs are being recognized as a means to expand the functional transcriptome and contribute to cancer tumorigenesis when mis-regulated. However, current software tools often fail to predict them. We have developed RTCpredictor, utilizing a fast ripgrep tool to search for all possible exon-exon combinations of parental gene pairs. We also added exonic variants allowing searches containing common SNPs. To our knowledge, it is the first read-through chimeric RNA specific prediction method that also provides breakpoint coordinates. Compared with 10 other popular tools, RTCpredictor achieved high sensitivity on a simulated and three real datasets. In addition, RTCpredictor has less memory requirements and faster execution time, making it ideal for applying on large datasets.
Collapse
Affiliation(s)
- Sandeep Singh
- Department of Pathology, School of Medicine, University of Virginia, Charlottesville, VA 22908, United States
| | - Xinrui Shi
- Department of Pathology, School of Medicine, University of Virginia, Charlottesville, VA 22908, United States
- Department of Biochemistry and Molecular Genetics, School of Medicine, University of Virginia, Charlottesville, VA 22908, United States
| | - Samuel Haddox
- Department of Biochemistry and Molecular Genetics, School of Medicine, University of Virginia, Charlottesville, VA 22908, United States
| | - Justin Elfman
- Department of Biochemistry and Molecular Genetics, School of Medicine, University of Virginia, Charlottesville, VA 22908, United States
| | - Syed Basil Ahmad
- Department of Pathology, School of Medicine, University of Virginia, Charlottesville, VA 22908, United States
| | - Sarah Lynch
- Department of Pathology, School of Medicine, University of Virginia, Charlottesville, VA 22908, United States
| | - Tommy Manley
- Department of Pathology, School of Medicine, University of Virginia, Charlottesville, VA 22908, United States
| | - Claire Piczak
- Department of Pathology, School of Medicine, University of Virginia, Charlottesville, VA 22908, United States
| | - Christopher Phung
- Department of Pathology, School of Medicine, University of Virginia, Charlottesville, VA 22908, United States
| | - Yunan Sun
- Department of Pathology, School of Medicine, University of Virginia, Charlottesville, VA 22908, United States
| | - Aadi Sharma
- Department of Pathology, School of Medicine, University of Virginia, Charlottesville, VA 22908, United States
| | - Hui Li
- Department of Pathology, School of Medicine, University of Virginia, Charlottesville, VA 22908, United States
- Department of Biochemistry and Molecular Genetics, School of Medicine, University of Virginia, Charlottesville, VA 22908, United States
| |
Collapse
|
2
|
Singh S, Shi X, Ahmad SB, Manley T, Piczak C, Phung C, Sun Y, Lynch S, Sharma A, Li H. RTCpredictor: Identification of Read-Through Chimeric RNAs from RNA Sequencing Data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.02.526869. [PMID: 36778443 PMCID: PMC9915620 DOI: 10.1101/2023.02.02.526869] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Read-through chimeric RNAs are gaining attention in cancer and other research fields, yet current tools often fail in predicting them. We have thus developed the first read-through chimeric RNA specific prediction method, RTCpredictor, utilizing a fast ripgrep algorithm to search for all possible exon-exon combinations of parental gene pairs. Compared with other ten popular tools, RTCpredictor achieved top performance on both simulated and real datasets. We randomly selected up to 30 candidate read-through chimeras predicted from each software method and experimentally validated a total of 109 read-throughs and on this set, RTCpredictor outperformed all the other methods. In addition, RTCpredictor ( https://github.com/sandybioteck/RTCpredictor ) has less memory requirements and faster execution time.
Collapse
|
3
|
Zhou J, Guan X, Xu E, Zhou J, Xiong R, Yang Q. Chimeric RNA RRM2-C2orf48 plays an oncogenic role in the development of NNK-induced lung cancer. iScience 2022; 26:105708. [PMID: 36570773 PMCID: PMC9771722 DOI: 10.1016/j.isci.2022.105708] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2022] [Revised: 10/24/2022] [Accepted: 11/29/2022] [Indexed: 12/03/2022] Open
Abstract
Chimeric RNAs have been used as biomarkers and therapeutic targets for multiple types of cancers. However, less attention has been paid to their mechanism of action in neoplasia. Here, we reported that high-expressed chimeric RNA RRM2-C2orf48 was found in malignantly transformed BEAS-2B cells induced by 4-(methyl nitrosamine)-1-(3-pyridinyl)-1-butanone (NNK) in 74 lung cancer patients and several lung cancer cell lines. The expression level of RRM2-C2orf48 was significantly correlated with lymph node metastasis, distant metastasis, tumor-lymph node-metastasis (TNM) stage, and smoking. Overexpressing RRM2-C2orf48 promoted cell growth and accelerated the process of NNK-induced lung cancer. RRM2-C2orf48 knockdown inhibited the growth of RRM2-C2orf48-overexpressing BEAS-2B cells. Finally, we identified miR-219a-2-3p as a potential target of RRM2-C2orf48 in lung cancer. In summary, chimeric RNA RRM2-C2orf48 accelerated the process of NNK-induced lung cancer, and miR-219a-2-3p may be involved in this process.
Collapse
Affiliation(s)
- Jiazhen Zhou
- The Institute for Chemical Carcinogenesis, School of Public Health, Guangzhou Medical University, Xinzao, Panyu District, Guangzhou 511436, China
| | - Xinchao Guan
- The Institute for Chemical Carcinogenesis, School of Public Health, Guangzhou Medical University, Xinzao, Panyu District, Guangzhou 511436, China
| | - Enwu Xu
- Department of Thoracic Surgery, General Hospital of Southern Theater Command, PLA, Guangzhou 510010, PR China
| | - Jiaxin Zhou
- The Institute for Chemical Carcinogenesis, School of Public Health, Guangzhou Medical University, Xinzao, Panyu District, Guangzhou 511436, China
| | - Rui Xiong
- The Institute for Chemical Carcinogenesis, School of Public Health, Guangzhou Medical University, Xinzao, Panyu District, Guangzhou 511436, China
| | - Qiaoyuan Yang
- The Institute for Chemical Carcinogenesis, School of Public Health, Guangzhou Medical University, Xinzao, Panyu District, Guangzhou 511436, China,Corresponding author
| |
Collapse
|
4
|
Friedrich S, Sonnhammer ELL. Fusion transcript detection using spatial transcriptomics. BMC Med Genomics 2020; 13:110. [PMID: 32753032 PMCID: PMC7437936 DOI: 10.1186/s12920-020-00738-5] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2019] [Accepted: 06/11/2020] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND Fusion transcripts are involved in tumourigenesis and play a crucial role in tumour heterogeneity, tumour evolution and cancer treatment resistance. However, fusion transcripts have not been studied at high spatial resolution in tissue sections due to the lack of full-length transcripts with spatial information. New high-throughput technologies like spatial transcriptomics measure the transcriptome of tissue sections on almost single-cell level. While this technique does not allow for direct detection of fusion transcripts, we show that they can be inferred using the relative poly(A) tail abundance of the involved parental genes. METHOD We present a new method STfusion, which uses spatial transcriptomics to infer the presence and absence of poly(A) tails. A fusion transcript lacks a poly(A) tail for the 5' gene and has an elevated number of poly(A) tails for the 3' gene. Its expression level is defined by the upstream promoter of the 5' gene. STfusion measures the difference between the observed and expected number of poly(A) tails with a novel C-score. RESULTS We verified the STfusion ability to predict fusion transcripts on HeLa cells with known fusions. STfusion and C-score applied to clinical prostate cancer data revealed the spatial distribution of the cis-SAGe SLC45A3-ELK4 in 12 tissue sections with almost single-cell resolution. The cis-SAGe occurred in disease areas, e.g. inflamed, prostatic intraepithelial neoplastic, or cancerous areas, and occasionally in normal glands. CONCLUSIONS STfusion detects fusion transcripts in cancer cell line and clinical tissue data, and distinguishes chimeric transcripts from chimeras caused by trans-splicing events. With STfusion and the use of C-scores, fusion transcripts can be spatially localised in clinical tissue sections on almost single cell level.
Collapse
Affiliation(s)
- Stefanie Friedrich
- Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Box 1031, 17121, Solna, Sweden.
| | - Erik L L Sonnhammer
- Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Box 1031, 17121, Solna, Sweden
| |
Collapse
|
5
|
Interfering Expression of Chimeric Transcript SEPT7P2-PSPH Promotes Cell Proliferation in Patients with Nasopharyngeal Carcinoma. JOURNAL OF ONCOLOGY 2019; 2019:1654724. [PMID: 31057610 PMCID: PMC6463592 DOI: 10.1155/2019/1654724] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/29/2018] [Revised: 01/09/2019] [Accepted: 02/03/2019] [Indexed: 01/09/2023]
Abstract
Introduction Nasopharyngeal carcinoma (NPC) is a distinct type of head and neck cancer which is mostly prevalent in southern China. The development of NPC involves accumulation of multiple genetic changes. Chromosomal translocation is always thought to be accompanied with the fusion chimeric products. To data, the role of the fusion chimeric transcript remains obscure. Materials and Methods We performed RNA sequencing to detect the fusion genes in ten NPC tissues. Sanger sequencing and quantitative RT-PCR were used to measure the level of the fusion chimeric transcript in NPC tissues and cell lines. The functional experiments such as CCK8 assay, colony formation, and migration/invasion were conducted to analyze the role of this transcript in NPC in vitro. Results We demonstrated that the chimeric transcript SEPT7P2-PSPH was formed by trans-splicing of adjacent genes in the absence of chromosomal rearrangement and observed in both NPC patients and cell lines in parallel. Low-expression of the SEPT7P2-PSPH chimeric transcript induced the protein expression of PSPH and promoted cell proliferation, metastasis/invasion, and transforming ability in vitro. Conclusions Our findings indicate that the chimeric transcript SEPT7P2-PSPH is a product of trans-splicing of two adjacent genes and might be a tumor suppressor gene, potentially having the role of anticancer activity.
Collapse
|
6
|
Novel chimeric transcript RRM2-c2orf48 promotes metastasis in nasopharyngeal carcinoma. Cell Death Dis 2017; 8:e3047. [PMID: 28906488 PMCID: PMC5636969 DOI: 10.1038/cddis.2017.402] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2016] [Revised: 06/29/2017] [Accepted: 07/14/2017] [Indexed: 12/13/2022]
Abstract
Recently, chimeric transcripts have been found to be associated with the pathogenesis and poor prognosis of malignant tumors. Through our preliminary experiment, a novel chimeric transcript called chimeric transcript RRM2-c2orf48 was detected in C666-1, a classical cell line of human nasopharyngeal carcinoma (NPC). Therefore, the objective of this study was to demonstrate the existence and expression of novel chimeric transcript RRM2-c2orf48 and to explore the main functions and mechanisms of RRM2-c2orf48 in NPC. In this study, the expression of RRM2-c2orf48 was evaluated in NPC cells and specimens. Effects of RRM2-c2orf48 on migration and invasive capacities were detected invivo and vitro. Moreover, ways in which RRM2-c2orf48 increases the invasive capacities of NPC were explored. As a result, the presence of novel chimeric transcript RRM2-c2orf48 was confirmed in C666-1 by RT-PCR and sequencing, and it was a read-through between RRM2 and c2orf48 through the transcription of interchromosome. Higher expressions of novel RRM2-c2orf48 were detected in NPC cell lines and NPC tissue specimens relative to the controls and its expression was be statistically relevant to TNM staging. High level of RRM2-c2orf48 could increase the migration and invasive capacities of NPC cells, potentially as a result of NPC cell epithelial–mesenchymal transition. RRM2-c2orf48 could also enhance resistance of chemotherapy. In vivo, RRM2-c2orf48 could enhance lung and lymph node metastasis in nude mice. These results demonstrate that high levels of RRM2-c2orf48 expression may be a useful predictor of NPC patients of metastatic potency, presenting potential implications for NPC diagnosis and therapy.
Collapse
|
7
|
It Is Imperative to Establish a Pellucid Definition of Chimeric RNA and to Clear Up a Lot of Confusion in the Relevant Research. Int J Mol Sci 2017; 18:ijms18040714. [PMID: 28350330 PMCID: PMC5412300 DOI: 10.3390/ijms18040714] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2017] [Revised: 03/15/2017] [Accepted: 03/17/2017] [Indexed: 12/27/2022] Open
Abstract
There have been tens of thousands of RNAs deposited in different databases that contain sequences of two genes and are coined chimeric RNAs, or chimeras. However, "chimeric RNA" has never been lucidly defined, partly because "gene" itself is still ill-defined and because the means of production for many RNAs is unclear. Since the number of putative chimeras is soaring, it is imperative to establish a pellucid definition for it, in order to differentiate chimeras from regular RNAs. Otherwise, not only will chimeric RNA studies be misled but also characterization of fusion genes and unannotated genes will be hindered. We propose that only those RNAs that are formed by joining two RNA transcripts together without a fusion gene as a genomic basis should be regarded as authentic chimeras, whereas those RNAs transcribed as, and cis-spliced from, single transcripts should not be deemed as chimeras. Many RNAs containing sequences of two neighboring genes may be transcribed via a readthrough mechanism, and thus are actually RNAs of unannotated genes or RNA variants of known genes, but not chimeras. In today's chimeric RNA research, there are still several key flaws, technical constraints and understudied tasks, which are also described in this perspective essay.
Collapse
|
8
|
Xie B, Yang W, Ouyang Y, Chen L, Jiang H, Liao Y, Liao DJ. Two RNAs or DNAs May Artificially Fuse Together at a Short Homologous Sequence (SHS) during Reverse Transcription or Polymerase Chain Reactions, and Thus Reporting an SHS-Containing Chimeric RNA Requires Extra Caution. PLoS One 2016; 11:e0154855. [PMID: 27148738 PMCID: PMC4858267 DOI: 10.1371/journal.pone.0154855] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2015] [Accepted: 04/20/2016] [Indexed: 11/18/2022] Open
Abstract
Tens of thousands of chimeric RNAs have been reported. Most of them contain a short homologous sequence (SHS) at the joining site of the two partner genes but are not associated with a fusion gene. We hypothesize that many of these chimeras may be technical artifacts derived from SHS-caused mis-priming in reverse transcription (RT) or polymerase chain reactions (PCR). We cloned six chimeric complementary DNAs (cDNAs) formed by human mitochondrial (mt) 16S rRNA sequences at an SHS, which were similar to several expression sequence tags (ESTs).These chimeras, which could not be detected with cDNA protection assay, were likely formed because some regions of the 16S rRNA are reversely complementary to another region to form an SHS, which allows the downstream sequence to loop back and anneal at the SHS to prime the synthesis of its complementary strand, yielding a palindromic sequence that can form a hairpin-like structure.We identified a 16S rRNA that ended at the 4th nucleotide(nt) of the mt-tRNA-leu was dominant and thus should be the wild type. We also cloned a mouse Bcl2-Nek9 chimeric cDNA that contained a 5-nt unmatchable sequence between the two partners, contained two copies of the reverse primer in the same direction but did not contain the forward primer, making it unclear how this Bcl2-Nek9 was formed and amplified. Moreover, a cDNA was amplified because one primer has 4 nts matched to the template, suggesting that there may be many more artificial cDNAs than we have realized, because the nuclear and mt genomes have many more 4-nt than 5-nt or longer homologues. Altogether, the chimeric cDNAs we cloned are good examples suggesting that many cDNAs may be artifacts due to SHS-caused mis-priming and thus greater caution should be taken when new sequence is obtained from a technique involving DNA polymerization.
Collapse
Affiliation(s)
- Bingkun Xie
- Guangxi Institute of Animal Sciences, Guangxi Key Laboratory of Livestock Genetic Improvement, Nanning, Guangxi, 530001, P.R. China
- * E-mail: (BKX); (HSJ); (DJL)
| | - Wei Yang
- Guangxi Veterinary Research Institute, Nanning, Guangxi, P.R. China
| | - Yongchang Ouyang
- Hormel Institute, University of Minnesota, Austin, Minnesota, 55912, United States of America
| | - Lichan Chen
- Hormel Institute, University of Minnesota, Austin, Minnesota, 55912, United States of America
| | - Hesheng Jiang
- College of Animal Science and Technology, Guangxi University, Nanning, 530004, P.R. China
- * E-mail: (BKX); (HSJ); (DJL)
| | - Yuying Liao
- Guangxi Institute of Animal Sciences, Guangxi Key Laboratory of Livestock Genetic Improvement, Nanning, Guangxi, 530001, P.R. China
| | - D. Joshua Liao
- Department of Pathology, Guizhou Medical University Hospital, Guizhou, Guiyang, 550004, P.R. China
- * E-mail: (BKX); (HSJ); (DJL)
| |
Collapse
|
9
|
Lei Q, Li C, Zuo Z, Huang C, Cheng H, Zhou R. Evolutionary Insights into RNA trans-Splicing in Vertebrates. Genome Biol Evol 2016; 8:562-77. [PMID: 26966239 PMCID: PMC4824033 DOI: 10.1093/gbe/evw025] [Citation(s) in RCA: 65] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Pre-RNA splicing is an essential step in generating mature mRNA. RNA trans-splicing combines two separate pre-mRNA molecules to form a chimeric non-co-linear RNA, which may exert a function distinct from its original molecules. Trans-spliced RNAs may encode novel proteins or serve as noncoding or regulatory RNAs. These novel RNAs not only increase the complexity of the proteome but also provide new regulatory mechanisms for gene expression. An increasing amount of evidence indicates that trans-splicing occurs frequently in both physiological and pathological processes. In addition, mRNA reprogramming based on trans-splicing has been successfully applied in RNA-based therapies for human genetic diseases. Nevertheless, clarifying the extent and evolution of trans-splicing in vertebrates and developing detection methods for trans-splicing remain challenging. In this review, we summarize previous research, highlight recent advances in trans-splicing, and discuss possible splicing mechanisms and functions from an evolutionary viewpoint.
Collapse
Affiliation(s)
- Quan Lei
- Department of Genetics, College of Life Sciences, Wuhan University, P.R. China
| | - Cong Li
- Department of Genetics, College of Life Sciences, Wuhan University, P.R. China
| | - Zhixiang Zuo
- Department of Genetics, College of Life Sciences, Wuhan University, P.R. China
| | - Chunhua Huang
- Department of Cell Biology, College of Life Sciences, Wuhan University, P.R. China
| | - Hanhua Cheng
- Department of Cell Biology, College of Life Sciences, Wuhan University, P.R. China
| | - Rongjia Zhou
- Department of Genetics, College of Life Sciences, Wuhan University, P.R. China
| |
Collapse
|
10
|
Verigos J, Magklara A. Revealing the Complexity of Breast Cancer by Next Generation Sequencing. Cancers (Basel) 2015; 7:2183-200. [PMID: 26561834 PMCID: PMC4695885 DOI: 10.3390/cancers7040885] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2015] [Revised: 10/18/2015] [Accepted: 10/26/2015] [Indexed: 02/06/2023] Open
Abstract
Over the last few years the increasing usage of "-omic" platforms, supported by next-generation sequencing, in the analysis of breast cancer samples has tremendously advanced our understanding of the disease. New driver and passenger mutations, rare chromosomal rearrangements and other genomic aberrations identified by whole genome and exome sequencing are providing missing pieces of the genomic architecture of breast cancer. High resolution maps of breast cancer methylomes and sequencing of the miRNA microworld are beginning to paint the epigenomic landscape of the disease. Transcriptomic profiling is giving us a glimpse into the gene regulatory networks that govern the fate of the breast cancer cell. At the same time, integrative analysis of sequencing data confirms an extensive intertumor and intratumor heterogeneity and plasticity in breast cancer arguing for a new approach to the problem. In this review, we report on the latest findings on the molecular characterization of breast cancer using NGS technologies, and we discuss their potential implications for the improvement of existing therapies.
Collapse
Affiliation(s)
- John Verigos
- Laboratory of Clinical Chemistry, Faculty of Medicine, School of Health Sciences, University of Ioannina, Ioannina 45110, Greece.
- Department of Biomedical Research, Institute of Molecular Biology & Biotechnology,Foundation for Research & Technology-Hellas, Ioannina 45110, Greece.
| | - Angeliki Magklara
- Laboratory of Clinical Chemistry, Faculty of Medicine, School of Health Sciences, University of Ioannina, Ioannina 45110, Greece.
- Department of Biomedical Research, Institute of Molecular Biology & Biotechnology,Foundation for Research & Technology-Hellas, Ioannina 45110, Greece.
| |
Collapse
|
11
|
Chuang TJ, Wu CS, Chen CY, Hung LY, Chiang TW, Yang MY. NCLscan: accurate identification of non-co-linear transcripts (fusion, trans-splicing and circular RNA) with a good balance between sensitivity and precision. Nucleic Acids Res 2015; 44:e29. [PMID: 26442529 PMCID: PMC4756807 DOI: 10.1093/nar/gkv1013] [Citation(s) in RCA: 87] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2015] [Accepted: 09/24/2015] [Indexed: 12/19/2022] Open
Abstract
Analysis of RNA-seq data often detects numerous ‘non-co-linear’ (NCL) transcripts, which comprised sequence segments that are topologically inconsistent with their corresponding DNA sequences in the reference genome. However, detection of NCL transcripts involves two major challenges: removal of false positives arising from alignment artifacts and discrimination between different types of NCL transcripts (trans-spliced, circular or fusion transcripts). Here, we developed a new NCL-transcript-detecting method (‘NCLscan’), which utilized a stepwise alignment strategy to almost completely eliminate false calls (>98% precision) without sacrificing true positives, enabling NCLscan outperform 18 other publicly-available tools (including fusion- and circular-RNA-detecting tools) in terms of sensitivity and precision, regardless of the generation strategy of simulated dataset, type of intragenic or intergenic NCL event, read depth of coverage, read length or expression level of NCL transcript. With the high accuracy, NCLscan was applied to distinguishing between trans-spliced, circular and fusion transcripts on the basis of poly(A)- and nonpoly(A)-selected RNA-seq data. We showed that circular RNAs were expressed more ubiquitously, more abundantly and less cell type-specifically than trans-spliced and fusion transcripts. Our study thus describes a robust pipeline for the discovery of NCL transcripts, and sheds light on the fundamental biology of these non-canonical RNA events in human transcriptome.
Collapse
Affiliation(s)
- Trees-Juen Chuang
- Division of Physical and Computational Genomics, Genomics Research Center, Academia Sinica, Taipei 11529, Taiwan
| | - Chan-Shuo Wu
- Division of Physical and Computational Genomics, Genomics Research Center, Academia Sinica, Taipei 11529, Taiwan
| | - Chia-Ying Chen
- Division of Physical and Computational Genomics, Genomics Research Center, Academia Sinica, Taipei 11529, Taiwan
| | - Li-Yuan Hung
- Division of Physical and Computational Genomics, Genomics Research Center, Academia Sinica, Taipei 11529, Taiwan
| | - Tai-Wei Chiang
- Division of Physical and Computational Genomics, Genomics Research Center, Academia Sinica, Taipei 11529, Taiwan
| | - Min-Yu Yang
- Division of Physical and Computational Genomics, Genomics Research Center, Academia Sinica, Taipei 11529, Taiwan
| |
Collapse
|
12
|
Varley KE, Gertz J, Roberts BS, Davis NS, Bowling KM, Kirby MK, Nesmith AS, Oliver PG, Grizzle WE, Forero A, Buchsbaum DJ, LoBuglio AF, Myers RM. Recurrent read-through fusion transcripts in breast cancer. Breast Cancer Res Treat 2014; 146:287-97. [PMID: 24929677 PMCID: PMC4085473 DOI: 10.1007/s10549-014-3019-2] [Citation(s) in RCA: 122] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2013] [Accepted: 05/31/2014] [Indexed: 11/25/2022]
Abstract
Read-through fusion transcripts that result from the splicing of two adjacent genes in the same coding orientation are a recently discovered type of chimeric RNA. We sought to determine if read-through fusion transcripts exist in breast cancer. We performed paired-end RNA-seq of 168 breast samples, including 28 breast cancer cell lines, 42 triple negative breast cancer primary tumors, 42 estrogen receptor positive (ER+) breast cancer primary tumors, and 56 non-malignant breast tissue samples. We analyzed the sequencing data to identify breast cancer associated read-through fusion transcripts. We discovered two recurrent read-through fusion transcripts that were identified in breast cancer cell lines, confirmed across breast cancer primary tumors, and were not detected in normal tissues (SCNN1A-TNFRSF1A and CTSD-IFITM10). Both fusion transcripts use canonical splice sites to join the last splice donor of the 5′ gene to the first splice acceptor of the 3′ gene, creating an in-frame fusion transcript. Western blots indicated that the fusion transcripts are translated into fusion proteins in breast cancer cells. Custom small interfering RNAs targeting the CTSD-IFITM10 fusion junction reduced expression of the fusion transcript and reduced breast cancer cell proliferation. Read-through fusion transcripts between adjacent genes with different biochemical functions represent a new type of recurrent molecular defect in breast cancer that warrant further investigation as potential biomarkers and therapeutic targets. Both breast cancer associated fusion transcripts identified in this study involve membrane proteins (SCNN1A-TNFRSF1A and CTSD-IFITM10), which raises the possibility that they could be breast cancer-specific cell surface markers.
Collapse
Affiliation(s)
- Katherine E Varley
- HudsonAlpha Institute for Biotechnology, 601 Genome Way, Huntsville, AL, 35806, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
13
|
Abstract
Rice is a monocot gramineous crop, and one of the most important staple foods. Rice is considered a model species for most gramineous crops. Extensive research on rice has provided critical guidance for other crops, such as maize and wheat. In recent years, climate change and exacerbated soil degradation have resulted in a variety of abiotic stresses, such as greenhouse effects, lower temperatures, drought, floods, soil salinization and heavy metal pollution. As such, there is an extremely high demand for additional research, in order to address these negative factors. Studies have shown that the alternative splicing of many genes in rice is affected by stress conditions, suggesting that manipulation of the alternative splicing of specific genes may be an effective approach for rice to adapt to abiotic stress. With the advancement of microarrays, and more recently, next generation sequencing technology, several studies have shown that more than half of the genes in the rice genome undergo alternative splicing. This mini-review summarizes the latest progress in the research of splicing and alternative splicing in rice, compared to splicing in humans. Furthermore, we discuss how additional studies may change the landscape of investigation of rice functional genomics and genetically improved rice. [BMB Reports 2013; 46(9): 439-447]
Collapse
Affiliation(s)
- Zhiguo E
- Nantong University, Nantong 226001, P.R. China ;
| | | | | |
Collapse
|
14
|
Kalender Atak Z, Gianfelici V, Hulselmans G, De Keersmaecker K, Devasia AG, Geerdens E, Mentens N, Chiaretti S, Durinck K, Uyttebroeck A, Vandenberghe P, Wlodarska I, Cloos J, Foà R, Speleman F, Cools J, Aerts S. Comprehensive analysis of transcriptome variation uncovers known and novel driver events in T-cell acute lymphoblastic leukemia. PLoS Genet 2013; 9:e1003997. [PMID: 24367274 PMCID: PMC3868543 DOI: 10.1371/journal.pgen.1003997] [Citation(s) in RCA: 101] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2013] [Accepted: 10/16/2013] [Indexed: 12/22/2022] Open
Abstract
RNA-seq is a promising technology to re-sequence protein coding genes for the identification of single nucleotide variants (SNV), while simultaneously obtaining information on structural variations and gene expression perturbations. We asked whether RNA-seq is suitable for the detection of driver mutations in T-cell acute lymphoblastic leukemia (T-ALL). These leukemias are caused by a combination of gene fusions, over-expression of transcription factors and cooperative point mutations in oncogenes and tumor suppressor genes. We analyzed 31 T-ALL patient samples and 18 T-ALL cell lines by high-coverage paired-end RNA-seq. First, we optimized the detection of SNVs in RNA-seq data by comparing the results with exome re-sequencing data. We identified known driver genes with recurrent protein altering variations, as well as several new candidates including H3F3A, PTK2B, and STAT5B. Next, we determined accurate gene expression levels from the RNA-seq data through normalizations and batch effect removal, and used these to classify patients into T-ALL subtypes. Finally, we detected gene fusions, of which several can explain the over-expression of key driver genes such as TLX1, PLAG1, LMO1, or NKX2-1; and others result in novel fusion transcripts encoding activated kinases (SSBP2-FER and TPM3-JAK2) or involving MLLT10. In conclusion, we present novel analysis pipelines for variant calling, variant filtering, and expression normalization on RNA-seq data, and successfully applied these for the detection of translocations, point mutations, INDELs, exon-skipping events, and expression perturbations in T-ALL. The quest for somatic mutations underlying oncogenic processes is a central theme in today's cancer research. High-throughput genomics approaches including amplicon re-sequencing, exome re-sequencing, full genome re-sequencing, and SNP arrays have contributed to cataloguing driver genes across cancer types. Thus far transcriptome sequencing by RNA-seq has been mainly used for the detection of fusion genes, while few studies have assessed its value for the combined detection of SNPs, INDELs, fusions, gene expression changes, and alternative transcript events. Here we apply RNA-seq to 49 T-ALL samples and perform a critical assessment of the bioinformatics pipelines and filters to identify each type of aberration. By comparing to exome re-sequencing, and by exploiting the catalogues of known cancer drivers, we identified many known and several novel driver genes in T-ALL. We also determined an optimal normalization strategy to obtain accurate gene expression levels and used these to identify over-expressed transcription factors that characterize different T-ALL subtypes. Finally, by PCR, cloning, and in vitro cellular assays we uncover new fusion genes that have consequences at the level of gene expression, oncogenic chimaeras, and tumor suppressor inactivation. In conclusion, we present the first RNA-seq data set across T-ALL patients and identify new driver events.
Collapse
Affiliation(s)
- Zeynep Kalender Atak
- Laboratory of Computational Biology, Center for Human Genetics, KU Leuven, Leuven, Belgium
| | - Valentina Gianfelici
- Laboratory for the Molecular Biology of Leukemia, Center for Human Genetics, KU Leuven and Center for the Biology of Disease, VIB, Leuven, Belgium
- Division of Hematology, Department of Cellular Biotechnologies and Hematology, ‘Sapienza’ University of Rome, Rome, Italy
| | - Gert Hulselmans
- Laboratory of Computational Biology, Center for Human Genetics, KU Leuven, Leuven, Belgium
| | - Kim De Keersmaecker
- Laboratory for the Molecular Biology of Leukemia, Center for Human Genetics, KU Leuven and Center for the Biology of Disease, VIB, Leuven, Belgium
| | - Arun George Devasia
- Laboratory of Computational Biology, Center for Human Genetics, KU Leuven, Leuven, Belgium
- Laboratory for the Molecular Biology of Leukemia, Center for Human Genetics, KU Leuven and Center for the Biology of Disease, VIB, Leuven, Belgium
| | - Ellen Geerdens
- Laboratory for the Molecular Biology of Leukemia, Center for Human Genetics, KU Leuven and Center for the Biology of Disease, VIB, Leuven, Belgium
| | - Nicole Mentens
- Laboratory for the Molecular Biology of Leukemia, Center for Human Genetics, KU Leuven and Center for the Biology of Disease, VIB, Leuven, Belgium
| | - Sabina Chiaretti
- Division of Hematology, Department of Cellular Biotechnologies and Hematology, ‘Sapienza’ University of Rome, Rome, Italy
| | - Kaat Durinck
- Center for Medical Genetics, Ghent University, Ghent, Belgium
| | - Anne Uyttebroeck
- Pediatric Hemato-Oncology, University Hospitals Leuven, Leuven, Belgium
| | - Peter Vandenberghe
- Laboratory for the Molecular Biology of Leukemia, Center for Human Genetics, KU Leuven and Center for the Biology of Disease, VIB, Leuven, Belgium
| | - Iwona Wlodarska
- Laboratory for the Molecular Biology of Leukemia, Center for Human Genetics, KU Leuven and Center for the Biology of Disease, VIB, Leuven, Belgium
| | - Jacqueline Cloos
- Pediatric Oncology/Hematology and Hematology, VU Medical Center, Amsterdam, The Netherlands
| | - Robin Foà
- Division of Hematology, Department of Cellular Biotechnologies and Hematology, ‘Sapienza’ University of Rome, Rome, Italy
| | - Frank Speleman
- Center for Medical Genetics, Ghent University, Ghent, Belgium
| | - Jan Cools
- Laboratory for the Molecular Biology of Leukemia, Center for Human Genetics, KU Leuven and Center for the Biology of Disease, VIB, Leuven, Belgium
- * E-mail: (JC); (SA)
| | - Stein Aerts
- Laboratory of Computational Biology, Center for Human Genetics, KU Leuven, Leuven, Belgium
- * E-mail: (JC); (SA)
| |
Collapse
|
15
|
Kumar-Sinha C, Kalyana-Sundaram S, Chinnaiyan AM. SLC45A3-ELK4 chimera in prostate cancer: spotlight on cis-splicing. Cancer Discov 2012; 2:582-5. [PMID: 22787087 DOI: 10.1158/2159-8290.cd-12-0212] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Using a series of detailed experiments, Zhang and colleagues establish that the prostate cancer RNA chimera SLC45A3-ELK4 is generated by cis-splicing between the 2 adjacent genes and does not involve DNA rearrangements or trans-splicing. The chimera expression is induced by androgen treatment likely by overcoming the read-through block imposed by the intergenic CCCTC insulators bound by CCCTC-binding factor repressor protein. The chimeric transcript, but not wild-type ELK4, is shown to augment prostate cancer cell proliferation.
Collapse
Affiliation(s)
- Chandan Kumar-Sinha
- Michigan Center for Translational Pathology, University of Michigan Medical School, Ann Arbor, Michigan 48109, USA
| | | | | |
Collapse
|