1
|
Wang TY, Yang R. Detecting Medium and Large Insertions and Deletions with transIndel. Methods Mol Biol 2022; 2493:67-75. [PMID: 35751809 DOI: 10.1007/978-1-0716-2293-3_5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Insertions and deletions (indels) are primarily detected from DNA sequencing (DNA-seq) data, but their transcriptional consequences remain unexplored due to challenges in distinguishing medium- and large-sized indels from RNA splicing events in RNA-seq data. We introduce transIndel, a splice-aware algorithm that parses the chimeric alignments predicted by a short read aligner and reconstructs the mid-sized insertions and large deletions based on the linear alignments of split reads from DNA-seq or RNA-seq data. Here, we describe the method and provide a tutorial on the installation and application of transIndel.
Collapse
Affiliation(s)
- Ting-You Wang
- The Hormel Institute, University of Minnesota, Austin, MN, USA
| | - Rendong Yang
- The Hormel Institute, University of Minnesota, Austin, MN, USA.
| |
Collapse
|
2
|
Hoogstrate Y, Komor MA, Böttcher R, van Riet J, van de Werken HJG, van Lieshout S, Hoffmann R, van den Broek E, Bolijn AS, Dits N, Sie D, van der Meer D, Pepers F, Bangma CH, van Leenders GJLH, Smid M, French PJ, Martens JWM, van Workum W, van der Spek PJ, Janssen B, Caldenhoven E, Rausch C, de Jong M, Stubbs AP, Meijer GA, Fijneman RJA, Jenster GW. Fusion transcripts and their genomic breakpoints in polyadenylated and ribosomal RNA-minus RNA sequencing data. Gigascience 2021; 10:6458609. [PMID: 34891161 PMCID: PMC8673554 DOI: 10.1093/gigascience/giab080] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2021] [Revised: 10/08/2021] [Accepted: 11/16/2021] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Fusion genes are typically identified by RNA sequencing (RNA-seq) without elucidating the causal genomic breakpoints. However, non-poly(A)-enriched RNA-seq contains large proportions of intronic reads that also span genomic breakpoints. RESULTS We have developed an algorithm, Dr. Disco, that searches for fusion transcripts by taking an entire reference genome into account as search space. This includes exons but also introns, intergenic regions, and sequences that do not meet splice junction motifs. Using 1,275 RNA-seq samples, we investigated to what extent genomic breakpoints can be extracted from RNA-seq data and their implications regarding poly(A)-enriched and ribosomal RNA-minus RNA-seq data. Comparison with whole-genome sequencing data revealed that most genomic breakpoints are not, or minimally, transcribed while, in contrast, the genomic breakpoints of all 32 TMPRSS2-ERG-positive tumours were present at RNA level. We also revealed tumours in which the ERG breakpoint was located before ERG, which co-existed with additional deletions and messenger RNA that incorporated intergenic cryptic exons. In breast cancer we identified rearrangement hot spots near CCND1 and in glioma near CDK4 and MDM2 and could directly associate this with increased expression. Furthermore, in all datasets we find fusions to intergenic regions, often spanning multiple cryptic exons that potentially encode neo-antigens. Thus, fusion transcripts other than classical gene-to-gene fusions are prominently present and can be identified using RNA-seq. CONCLUSION By using the full potential of non-poly(A)-enriched RNA-seq data, sophisticated analysis can reliably identify expressed genomic breakpoints and their transcriptional effects.
Collapse
Affiliation(s)
- Youri Hoogstrate
- Department of Urology, Erasmus Medical Center Cancer Institute, Wytemaweg 80, Rotterdam 3015GD, The Netherlands.,Department of Neurology, Erasmus Medical Center Cancer Institute, Wytemaweg 80, Rotterdam 3015GD, The Netherlands
| | - Malgorzata A Komor
- Department of Pathology, Netherlands Cancer Institute, Amsterdam 3015GD, The Netherlands
| | - René Böttcher
- Department of Urology, Erasmus Medical Center Cancer Institute, Wytemaweg 80, Rotterdam 3015GD, The Netherlands.,Department of Life Sciences, Barcelona Supercomputing Center, Barcelona 08034, Spain
| | - Job van Riet
- Department of Medical Oncology, Erasmus Medical Center, Rotterdam 3015GD, The Netherlands
| | - Harmen J G van de Werken
- Department of Urology, Erasmus Medical Center Cancer Institute, Wytemaweg 80, Rotterdam 3015GD, The Netherlands.,Cancer Computational Biology Center, Erasmus Medical Center, Rotterdam 3015GD, The Netherlands
| | | | | | - Evert van den Broek
- Department of Pathology, Netherlands Cancer Institute, Amsterdam 3015GD, The Netherlands.,Department of Pathology and Medical Biology, University Medical Center Groningen, Groningen 9713GZ, The Netherlands
| | - Anne S Bolijn
- Department of Pathology, Netherlands Cancer Institute, Amsterdam 3015GD, The Netherlands
| | - Natasja Dits
- Department of Urology, Erasmus Medical Center Cancer Institute, Wytemaweg 80, Rotterdam 3015GD, The Netherlands
| | - Daoud Sie
- Department of Pathology, Netherlands Cancer Institute, Amsterdam 3015GD, The Netherlands
| | | | | | - Chris H Bangma
- Department of Urology, Erasmus Medical Center Cancer Institute, Wytemaweg 80, Rotterdam 3015GD, The Netherlands
| | | | - Marcel Smid
- Department of Medical Oncology, Erasmus Medical Center, Rotterdam 3015GD, The Netherlands
| | - Pim J French
- Department of Neurology, Erasmus Medical Center Cancer Institute, Wytemaweg 80, Rotterdam 3015GD, The Netherlands
| | - John W M Martens
- Department of Medical Oncology, Erasmus Medical Center, Rotterdam 3015GD, The Netherlands
| | | | - Peter J van der Spek
- Department of Pathology, Erasmus Medical Center, Rotterdam 3015GD, The Netherlands
| | | | | | | | | | - Andrew P Stubbs
- Department of Pathology, Erasmus Medical Center, Rotterdam 3015GD, The Netherlands
| | - Gerrit A Meijer
- Department of Pathology, Netherlands Cancer Institute, Amsterdam 3015GD, The Netherlands
| | - Remond J A Fijneman
- Department of Pathology, Netherlands Cancer Institute, Amsterdam 3015GD, The Netherlands
| | - Guido W Jenster
- Department of Urology, Erasmus Medical Center Cancer Institute, Wytemaweg 80, Rotterdam 3015GD, The Netherlands
| |
Collapse
|
3
|
Lu B, Jiang R, Xie B, Wu W, Zhao Y. Fusion genes in gynecologic tumors: the occurrence, molecular mechanism and prospect for therapy. Cell Death Dis 2021; 12:783. [PMID: 34381020 PMCID: PMC8357806 DOI: 10.1038/s41419-021-04065-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2021] [Revised: 07/21/2021] [Accepted: 07/23/2021] [Indexed: 12/12/2022]
Abstract
Gene fusions are thought to be driver mutations in multiple cancers and are an important factor for poor patient prognosis. Most of them appear in specific cancers, thus satisfactory strategies can be developed for the precise treatment of these types of cancer. Currently, there are few targeted drugs to treat gynecologic tumors, and patients with gynecologic cancer often have a poor prognosis because of tumor progression or recurrence. With the application of massively parallel sequencing, a large number of fusion genes have been discovered in gynecologic tumors, and some fusions have been confirmed to be involved in the biological process of tumor progression. To this end, the present article reviews the current research status of all confirmed fusion genes in gynecologic tumors, including their rearrangement mechanism and frequency in ovarian cancer, endometrial cancer, endometrial stromal sarcoma, and other types of uterine tumors. We also describe the mechanisms by which fusion genes are generated and their oncogenic mechanism. Finally, we discuss the prospect of fusion genes as therapeutic targets in gynecologic tumors.
Collapse
Affiliation(s)
- Bingfeng Lu
- Department of Obstetrics and Gynecology, Department of Gynecologic Oncology Research Office, Key Laboratory for Major Obstetric Diseases of Guangdong Province, The Third Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
| | - Ruqi Jiang
- Department of Obstetrics and Gynecology, Department of Gynecologic Oncology Research Office, Key Laboratory for Major Obstetric Diseases of Guangdong Province, The Third Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
| | - Bumin Xie
- Department of Obstetrics and Gynecology, Department of Gynecologic Oncology Research Office, Key Laboratory for Major Obstetric Diseases of Guangdong Province, The Third Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
| | - Wu Wu
- Department of Obstetrics and Gynecology, Department of Gynecologic Oncology Research Office, Key Laboratory for Major Obstetric Diseases of Guangdong Province, The Third Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
| | - Yang Zhao
- Department of Obstetrics and Gynecology, Department of Gynecologic Oncology Research Office, Key Laboratory for Major Obstetric Diseases of Guangdong Province, The Third Affiliated Hospital of Guangzhou Medical University, Guangzhou, China.
| |
Collapse
|
4
|
Chen CY, Chuang TJ. NCLcomparator: systematically post-screening non-co-linear transcripts (circular, trans-spliced, or fusion RNAs) identified from various detectors. BMC Bioinformatics 2019; 20:3. [PMID: 30606103 PMCID: PMC6318855 DOI: 10.1186/s12859-018-2589-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2017] [Accepted: 12/21/2018] [Indexed: 01/05/2023] Open
Abstract
BACKGROUND Non-co-linear (NCL) transcripts consist of exonic sequences that are topologically inconsistent with the reference genome in an intragenic fashion (circular or intragenic trans-spliced RNAs) or in an intergenic fashion (fusion or intergenic trans-spliced RNAs). On the basis of RNA-seq data, numerous NCL event detectors have been developed and detected thousands of NCL events in diverse species. However, there are great discrepancies in the identification results among detectors, indicating a considerable proportion of false positives in the detected NCL events. Although several helpful guidelines for evaluating the performance of NCL event detectors have been provided, a systematic guideline for measurement of NCL events identified by existing tools has not been available. RESULTS We develop a software, NCLcomparator, for systematically post-screening the intragenic or intergenic NCL events identified by various NCL detectors. NCLcomparator first examine whether the input NCL events are potentially false positives derived from ambiguous alignments (i.e., the NCL events have an alternative co-linear explanation or multiple matches against the reference genome). To evaluate the reliability of the identified NCL events, we define the NCL score (NCLscore) based on the variation in the number of supporting NCL junction reads identified by the tools examined. Of the input NCL events, we show that the ambiguous alignment-derived events have relatively lower NCLscore values than the other events, indicating that an NCL event with a higher NCLscore has a higher level of reliability. To help selecting highly expressed NCL events, NCLcomparator also provides a series of useful measurements such as the expression levels of the detected NCL events and their corresponding host genes and the junction usage of the co-linear splice junctions at both NCL donor and acceptor sites. CONCLUSION NCLcomparator provides useful guidelines, with the input of identified NCL events from various detectors and the corresponding paired-end RNA-seq data only, to help users selecting potentially high-confidence NCL events for further functional investigation. The software thus helps to facilitate future studies into NCL events, shedding light on the fundamental biology of this important but understudied class of transcripts. NCLcomparator is freely accessible at https://github.com/TreesLab/NCLcomparator .
Collapse
Affiliation(s)
- Chia-Ying Chen
- Genomics Research Center, Academia Sinica, Taipei, 11529 Taiwan
| | | |
Collapse
|
5
|
Adamopoulos PG, Theodoropoulou MC, Scorilas A. Alternative Splicing Detection Tool-a novel PERL algorithm for sensitive detection of splicing events, based on next-generation sequencing data analysis. ANNALS OF TRANSLATIONAL MEDICINE 2018; 6:244. [PMID: 30069446 DOI: 10.21037/atm.2018.06.32] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Next-generation sequencing (NGS) can provide researchers with high impact information regarding alternative splice variants or transcript identifications. However, the enormous amount of data acquired from NGS platforms make the analysis of alternative splicing events hard to accomplish. For this reason, we designed the "Alternative Splicing Detection Tool" (ASDT), an algorithm that is capable of identifying alternative splicing events, including novel ones from high-throughput NGS data. ASDT is available as a PERL script at http://aias.biol.uoa.gr/~mtheo and can be executed on any system with PERL installed. In addition to the detection of annotated and novel alternative splicing events from high-throughput NGS data, ASDT can also analyze the intronic regions of genes, thus enabling the detection of novel cryptic exons residing in annotated introns, extensions of previously annotated exons, or even intron retentions. Consequently, ASDT demonstrates many innovative and unique features that can efficiently contribute to alternative splicing analysis of NGS data.
Collapse
Affiliation(s)
- Panagiotis G Adamopoulos
- Department of Biochemistry and Molecular Biology, National and Kapodistrian University of Athens, Panepistimiopolis, Athens, Greece
| | - Margarita C Theodoropoulou
- Department of Computer Science and Biomedical Informatics, University of Thessaly, Papasiopoulou, Lamia, Greece
| | - Andreas Scorilas
- Department of Biochemistry and Molecular Biology, National and Kapodistrian University of Athens, Panepistimiopolis, Athens, Greece
| |
Collapse
|
6
|
Lin YY, Gawronski A, Hach F, Li S, Numanagić I, Sarrafi I, Mishra S, McPherson A, Collins CC, Radovich M, Tang H, Sahinalp SC. Computational identification of micro-structural variations and their proteogenomic consequences in cancer. Bioinformatics 2018; 34:1672-1681. [PMID: 29267878 PMCID: PMC5946953 DOI: 10.1093/bioinformatics/btx807] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2017] [Revised: 11/24/2017] [Accepted: 12/15/2017] [Indexed: 12/18/2022] Open
Abstract
Motivation Rapid advancement in high throughput genome and transcriptome sequencing (HTS) and mass spectrometry (MS) technologies has enabled the acquisition of the genomic, transcriptomic and proteomic data from the same tissue sample. We introduce a computational framework, ProTIE, to integratively analyze all three types of omics data for a complete molecular profile of a tissue sample. Our framework features MiStrVar, a novel algorithmic method to identify micro structural variants (microSVs) on genomic HTS data. Coupled with deFuse, a popular gene fusion detection method we developed earlier, MiStrVar can accurately profile structurally aberrant transcripts in tumors. Given the breakpoints obtained by MiStrVar and deFuse, our framework can then identify all relevant peptides that span the breakpoint junctions and match them with unique proteomic signatures. Observing structural aberrations in all three types of omics data validates their presence in the tumor samples. Results We have applied our framework to all The Cancer Genome Atlas (TCGA) breast cancer Whole Genome Sequencing (WGS) and/or RNA-Seq datasets, spanning all four major subtypes, for which proteomics data from Clinical Proteomic Tumor Analysis Consortium (CPTAC) have been released. A recent study on this dataset focusing on SNVs has reported many that lead to novel peptides. Complementing and significantly broadening this study, we detected 244 novel peptides from 432 candidate genomic or transcriptomic sequence aberrations. Many of the fusions and microSVs we discovered have not been reported in the literature. Interestingly, the vast majority of these translated aberrations, fusions in particular, were private, demonstrating the extensive inter-genomic heterogeneity present in breast cancer. Many of these aberrations also have matching out-of-frame downstream peptides, potentially indicating novel protein sequence and structure. Availability and implementation MiStrVar is available for download at https://bitbucket.org/compbio/mistrvar, and ProTIE is available at https://bitbucket.org/compbio/protie. Contact cenksahi@indiana.edu. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Yen-Yi Lin
- School of Computing Science, Simon Fraser University, Burnaby, BC, Canada
- Vancouver Prostate Centre, Vancouver, BC, Canada
| | | | - Faraz Hach
- School of Computing Science, Simon Fraser University, Burnaby, BC, Canada
- Vancouver Prostate Centre, Vancouver, BC, Canada
- Department of Urologic Sciences, University of British Columbia, Vancouver, BC, Canada
| | - Sujun Li
- Department of Computer Science, Indiana University, Bloomington, IN, USA
| | - Ibrahim Numanagić
- School of Computing Science, Simon Fraser University, Burnaby, BC, Canada
| | - Iman Sarrafi
- School of Computing Science, Simon Fraser University, Burnaby, BC, Canada
- Vancouver Prostate Centre, Vancouver, BC, Canada
| | - Swati Mishra
- Department of Surgery, Indiana University, School of Medicine, Indianapolis, IN, USA
| | - Andrew McPherson
- School of Computing Science, Simon Fraser University, Burnaby, BC, Canada
| | - Colin C Collins
- Vancouver Prostate Centre, Vancouver, BC, Canada
- Department of Urologic Sciences, University of British Columbia, Vancouver, BC, Canada
| | - Milan Radovich
- Department of Surgery, Indiana University, School of Medicine, Indianapolis, IN, USA
| | - Haixu Tang
- Department of Computer Science, Indiana University, Bloomington, IN, USA
| | - S Cenk Sahinalp
- Vancouver Prostate Centre, Vancouver, BC, Canada
- Department of Computer Science, Indiana University, Bloomington, IN, USA
| |
Collapse
|
7
|
Yang R, Van Etten JL, Dehm SM. Indel detection from DNA and RNA sequencing data with transIndel. BMC Genomics 2018; 19:270. [PMID: 29673323 PMCID: PMC5909256 DOI: 10.1186/s12864-018-4671-4] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2017] [Accepted: 04/13/2018] [Indexed: 12/18/2022] Open
Abstract
Background Insertions and deletions (indels) are a major class of genomic variation associated with human disease. Indels are primarily detected from DNA sequencing (DNA-seq) data but their transcriptional consequences remain unexplored due to challenges in discriminating medium-sized and large indels from splicing events in RNA-seq data. Results Here, we developed transIndel, a splice-aware algorithm that parses the chimeric alignments predicted by a short read aligner and reconstructs the mid-sized insertions and large deletions based on the linear alignments of split reads from DNA-seq or RNA-seq data. TransIndel exhibits competitive or superior performance over eight state-of-the-art indel detection tools on benchmarks using both synthetic and real DNA-seq data. Additionally, we applied transIndel to DNA-seq and RNA-seq datasets from 333 primary prostate cancer patients from The Cancer Genome Atlas (TCGA) and 59 metastatic prostate cancer patients from AACR-PCF Stand-Up- To-Cancer (SU2C) studies. TransIndel enhanced the taxonomy of DNA- and RNA-level alterations in prostate cancer by identifying recurrent FOXA1 indels as well as exitron splicing in genes implicated in disease progression. Conclusions Our study demonstrates that transIndel is a robust tool for elucidation of medium- and large-sized indels from DNA-seq and RNA-seq data. Including RNA-seq in indel discovery efforts leads to significant improvements in sensitivity for identification of med-sized and large indels missed by DNA-seq, and reveals non-canonical RNA-splicing events in genes associated with disease pathology. Electronic supplementary material The online version of this article (10.1186/s12864-018-4671-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Rendong Yang
- The Hormel Institute, University of Minnesota, 801 16th AVE NE, Austin, MN, 55912, USA. .,Masonic Cancer Center, University of Minnesota, 420 Delaware St SE, Minneapolis, MN, 55455, USA.
| | - Jamie L Van Etten
- Masonic Cancer Center, University of Minnesota, 420 Delaware St SE, Minneapolis, MN, 55455, USA
| | - Scott M Dehm
- Masonic Cancer Center, University of Minnesota, 420 Delaware St SE, Minneapolis, MN, 55455, USA. .,Department of Laboratory Medicine and Pathology, University of Minnesota, Minneapolis, MN, 55455, USA.
| |
Collapse
|
8
|
Abstract
BACKGROUND Gene fusions are known in many cancers as driver or passenger mutations. They play an important role in both the etiology and pathogenesis of cancer and are considered as potential diagnostic and prognostic markers and possible therapeutic targets. The spectrum and prevalence of gene fusions in thyroid cancer ranges from single cases up to 80%, depending on the specific type of cancer. During last three years, massive parallel sequencing technologies have revealed new fusions and allowed detailed characteristics of fusions in different types of thyroid cancer. SUMMARY This article reviews all known fusions and their prevalence in papillary, poorly differentiated and anaplastic, follicular, and medullary carcinomas. The mechanisms of fusion formation are described. In addition, the mechanisms of oncogenic transformation, such as altered gene expression, forced oligomerization, and subcellular localization, are given. CONCLUSION The prognostic value and perspectives of the utilization of gene fusions as therapeutic targets are discussed.
Collapse
Affiliation(s)
- Valentina D Yakushina
- 1 Research Centre for Medical Genetics , Moscow, Russian Federation
- 2 Moscow Institute of Physics and Technology , Moscow, Russian Federation
| | | | - Alexander V Lavrov
- 1 Research Centre for Medical Genetics , Moscow, Russian Federation
- 4 Russian National Research Medical University , Moscow, Russian Federation
| |
Collapse
|
9
|
Henzler C, Li Y, Yang R, McBride T, Ho Y, Sprenger C, Liu G, Coleman I, Lakely B, Li R, Ma S, Landman SR, Kumar V, Hwang TH, Raj GV, Higano CS, Morrissey C, Nelson PS, Plymate SR, Dehm SM. Truncation and constitutive activation of the androgen receptor by diverse genomic rearrangements in prostate cancer. Nat Commun 2016; 7:13668. [PMID: 27897170 PMCID: PMC5141345 DOI: 10.1038/ncomms13668] [Citation(s) in RCA: 129] [Impact Index Per Article: 16.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2016] [Accepted: 10/23/2016] [Indexed: 01/04/2023] Open
Abstract
Molecularly targeted therapies for advanced prostate cancer include castration modalities that suppress ligand-dependent transcriptional activity of the androgen receptor (AR). However, persistent AR signalling undermines therapeutic efficacy and promotes progression to lethal castration-resistant prostate cancer (CRPC), even when patients are treated with potent second-generation AR-targeted therapies abiraterone and enzalutamide. Here we define diverse AR genomic structural rearrangements (AR-GSRs) as a class of molecular alterations occurring in one third of CRPC-stage tumours. AR-GSRs occur in the context of copy-neutral and amplified AR and display heterogeneity in breakpoint location, rearrangement class and sub-clonal enrichment in tumours within and between patients. Despite this heterogeneity, one common outcome in tumours with high sub-clonal enrichment of AR-GSRs is outlier expression of diverse AR variant species lacking the ligand-binding domain and possessing ligand-independent transcriptional activity. Collectively, these findings reveal AR-GSRs as important drivers of persistent AR signalling in CRPC. Castration-resistant prostate cancer frequently presents with persistent androgen receptor signalling. Here, the authors find that the androgen receptor is subject to genetic rearrangements, resulting in variants with ligand-independent activity.
Collapse
Affiliation(s)
- Christine Henzler
- Minnesota Supercomputing Institute, University of Minnesota, 117 Pleasant Street Southeast, Minneapolis, Minnesota 55455, USA
| | - Yingming Li
- Masonic Cancer Center, University of Minnesota, Mayo Mail Code 806, 420 Delaware Street Southeast, Minneapolis, Minnesota 55455, USA
| | - Rendong Yang
- Minnesota Supercomputing Institute, University of Minnesota, 117 Pleasant Street Southeast, Minneapolis, Minnesota 55455, USA
| | - Terri McBride
- Masonic Cancer Center, University of Minnesota, Mayo Mail Code 806, 420 Delaware Street Southeast, Minneapolis, Minnesota 55455, USA.,Medical Scientist Training Program, University of Minnesota, Minneapolis, Minnesota 55455, USA
| | - Yeung Ho
- Masonic Cancer Center, University of Minnesota, Mayo Mail Code 806, 420 Delaware Street Southeast, Minneapolis, Minnesota 55455, USA
| | - Cynthia Sprenger
- Division of Gerontology and Geriatric Medicine, University of Washington, Seattle, Washington 98104, USA
| | - Gang Liu
- Division of Gerontology and Geriatric Medicine, University of Washington, Seattle, Washington 98104, USA
| | - Ilsa Coleman
- Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, Washington 98109, USA
| | - Bryce Lakely
- Department of Urology, University of Washington, 1959 Northeast Pacific Street, Box 356510, Seattle, Washington 98195, USA
| | - Rui Li
- Department of Urology, University of Texas Southwestern Medical Center, 5323 Harry Hines Boulevard, Dallas, Texas 75390, USA
| | - Shihong Ma
- Department of Urology, University of Texas Southwestern Medical Center, 5323 Harry Hines Boulevard, Dallas, Texas 75390, USA
| | - Sean R Landman
- Department of Computer Science and Engineering, University of Minnesota, 200 Union Street Southeast, Minneapolis, Minnesota 55455, USA
| | - Vipin Kumar
- Department of Computer Science and Engineering, University of Minnesota, 200 Union Street Southeast, Minneapolis, Minnesota 55455, USA
| | - Tae Hyun Hwang
- Quantitative Biomedical Research Center, University of Texas Southwestern Medical Center, 2201 Inwood Road, Dallas, Texas 75390, USA
| | - Ganesh V Raj
- Department of Urology, University of Texas Southwestern Medical Center, 5323 Harry Hines Boulevard, Dallas, Texas 75390, USA
| | - Celestia S Higano
- Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, Washington 98109, USA.,Department of Urology, University of Washington, 1959 Northeast Pacific Street, Box 356510, Seattle, Washington 98195, USA.,Department of Medicine, University of Washington, 825 Eastlake Avenue East, Seattle, Washington 98109, USA
| | - Colm Morrissey
- Department of Urology, University of Washington, 1959 Northeast Pacific Street, Box 356510, Seattle, Washington 98195, USA
| | - Peter S Nelson
- Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, Washington 98109, USA
| | - Stephen R Plymate
- Division of Gerontology and Geriatric Medicine, University of Washington, Seattle, Washington 98104, USA.,Department of Urology, University of Washington, 1959 Northeast Pacific Street, Box 356510, Seattle, Washington 98195, USA.,Geriatric Research Education and Clinical Centers, VA Puget Sound Health Care System, 325 9th Avenue, Box 359625, Seattle, Washington 98104, USA
| | - Scott M Dehm
- Masonic Cancer Center, University of Minnesota, Mayo Mail Code 806, 420 Delaware Street Southeast, Minneapolis, Minnesota 55455, USA.,Departments of Laboratory Medicine and Pathology and Urology, University of Minnesota, Minneapolis, Minnesota 55455, USA
| |
Collapse
|
10
|
Kumar S, Razzaq SK, Vo AD, Gautam M, Li H. Identifying fusion transcripts using next generation sequencing. WILEY INTERDISCIPLINARY REVIEWS. RNA 2016; 7:811-823. [PMID: 27485475 PMCID: PMC5065767 DOI: 10.1002/wrna.1382] [Citation(s) in RCA: 65] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/20/2016] [Revised: 07/05/2016] [Accepted: 07/07/2016] [Indexed: 01/14/2023]
Abstract
Fusion transcripts (i.e., chimeric RNAs) resulting from gene fusions have been used successfully for cancer diagnosis, prognosis, and therapeutic applications. In addition, many fusion transcripts are found in normal human cell lines and tissues, with some data supporting their role in normal physiology. Besides chromosomal rearrangement, intergenic splicing can generate them. Global identification of fusion transcripts becomes possible with the help of next generation sequencing technology like RNA-Seq. In the past decade, major advancements have been made for chimeric RNA discovery due to the development of advanced sequencing platform and software packages. However, current software tools behave differently in terms of specificity, sensitivity, time, and computational memory usage. Recent benchmarking studies showed that none of the tools are inclusive. The development of high performance (accurate and fast), and user-friendly fusion detection tool/pipeline is still an open quest. In this article, we review the existing software packages for fusion detection. We explain the methods of the tools, and discuss various factors that affect fusion detection. We summarize conclusions drawn from several comparative studies, and then discuss some of the pitfalls of these studies. We also describe the limitations of current tools, and suggest directions for future development. WIREs RNA 2016, 7:811-823. doi: 10.1002/wrna.1382 For further resources related to this article, please visit the WIREs website.
Collapse
Affiliation(s)
- Shailesh Kumar
- Department of Pathology, School of Medicine, University of Virginia, Charlottesville, VA, USA
| | - Sundus Khalid Razzaq
- Department of Pathology, School of Medicine, University of Virginia, Charlottesville, VA, USA
| | - Angie Duy Vo
- Department of Pathology, School of Medicine, University of Virginia, Charlottesville, VA, USA
| | - Mamta Gautam
- Department of Pathology, School of Medicine, University of Virginia, Charlottesville, VA, USA
| | - Hui Li
- Department of Pathology, School of Medicine, University of Virginia, Charlottesville, VA, USA.
- Department of Biochemistry and Molecular Genetics, School of Medicine, University of Virginia, Charlottesville, VA, USA.
| |
Collapse
|
11
|
Veeraraghavan J, Ma J, Hu Y, Wang XS. Recurrent and pathological gene fusions in breast cancer: current advances in genomic discovery and clinical implications. Breast Cancer Res Treat 2016; 158:219-32. [PMID: 27372070 DOI: 10.1007/s10549-016-3876-y] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2016] [Accepted: 06/18/2016] [Indexed: 12/22/2022]
Abstract
Gene fusions have long been considered principally as the oncogenic events of hematologic malignancies, but have recently gained wide attention in solid tumors due to several milestone discoveries and the advancement of deep sequencing technologies. With the progress in deep sequencing studies of breast cancer transcriptomes and genomes, the discovery of recurrent and pathological gene fusions in breast cancer is on the focus. Recently, driven by new deep sequencing studies, several recurrent or pathological gene fusions have been identified in breast cancer, including ESR1-CCDC170, SEC16A-NOTCH1, SEC22B-NOTCH2, and ESR1-YAP1 etc. More important, most of these gene fusions are preferentially identified in the more aggressive breast cancers, such as luminal B, basal-like, or endocrine-resistant breast cancer, suggesting recurrent gene fusions as additional key driver events in these tumors other than the known drivers such as the estrogen receptor. In this paper, we have comprehensively summarized the newly identified recurrent or pathological gene fusion events in breast cancer, reviewed the contributions of new genomic and deep sequencing technologies to new fusion discovery and the integrative bioinformatics tools to analyze these data, highlighted the biological relevance and clinical implications of these fusion discoveries, and discussed future directions of gene fusion research in breast cancer.
Collapse
Affiliation(s)
- Jamunarani Veeraraghavan
- Lester & Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA.,Dan L. Duncan Cancer Center, Baylor College of Medicine, Houston, TX, 77030, USA.,Department of Medicine, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Jiacheng Ma
- Lester & Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA.,Dan L. Duncan Cancer Center, Baylor College of Medicine, Houston, TX, 77030, USA.,Department of Medicine, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Yiheng Hu
- Lester & Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA.,Dan L. Duncan Cancer Center, Baylor College of Medicine, Houston, TX, 77030, USA.,Department of Medicine, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Xiao-Song Wang
- Lester & Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, 77030, USA. .,Dan L. Duncan Cancer Center, Baylor College of Medicine, Houston, TX, 77030, USA. .,Department of Medicine, Baylor College of Medicine, Houston, TX, 77030, USA. .,Department of Molecular and Cellular Biology, Baylor College of Medicine, Houston, TX, 77030, USA. .,University of Pittsburgh Cancer Institute, University of Pittsburgh, Pittsburgh, PA, 15232, USA. .,Department of Pathology, University of Pittsburgh, Pittsburgh, PA, 15232, USA. .,Hillman Cancer Center, Research Pavilion, University of Pittsburgh Cancer Institute, 5117 Centre Avenue, Room G.5a, Pittsburgh, PA, 15213, USA.
| |
Collapse
|
12
|
Latysheva NS, Babu MM. Discovering and understanding oncogenic gene fusions through data intensive computational approaches. Nucleic Acids Res 2016; 44:4487-503. [PMID: 27105842 PMCID: PMC4889949 DOI: 10.1093/nar/gkw282] [Citation(s) in RCA: 111] [Impact Index Per Article: 13.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2016] [Accepted: 03/24/2016] [Indexed: 12/21/2022] Open
Abstract
Although gene fusions have been recognized as important drivers of cancer for decades, our understanding of the prevalence and function of gene fusions has been revolutionized by the rise of next-generation sequencing, advances in bioinformatics theory and an increasing capacity for large-scale computational biology. The computational work on gene fusions has been vastly diverse, and the present state of the literature is fragmented. It will be fruitful to merge three camps of gene fusion bioinformatics that appear to rarely cross over: (i) data-intensive computational work characterizing the molecular biology of gene fusions; (ii) development research on fusion detection tools, candidate fusion prioritization algorithms and dedicated fusion databases and (iii) clinical research that seeks to either therapeutically target fusion transcripts and proteins or leverages advances in detection tools to perform large-scale surveys of gene fusion landscapes in specific cancer types. In this review, we unify these different-yet highly complementary and symbiotic-approaches with the view that increased synergy will catalyze advancements in gene fusion identification, characterization and significance evaluation.
Collapse
Affiliation(s)
- Natasha S Latysheva
- MRC Laboratory of Molecular Biology, Francis Crick Ave, Cambridge CB2 0QH, United Kingdom
| | - M Madan Babu
- MRC Laboratory of Molecular Biology, Francis Crick Ave, Cambridge CB2 0QH, United Kingdom
| |
Collapse
|
13
|
Comparative assessment of methods for the fusion transcripts detection from RNA-Seq data. Sci Rep 2016; 6:21597. [PMID: 26862001 PMCID: PMC4748267 DOI: 10.1038/srep21597] [Citation(s) in RCA: 115] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2015] [Accepted: 01/27/2016] [Indexed: 12/12/2022] Open
Abstract
RNA-Seq made possible the global identification of fusion transcripts, i.e. "chimeric RNAs". Even though various software packages have been developed to serve this purpose, they behave differently in different datasets provided by different developers. It is important for both users, and developers to have an unbiased assessment of the performance of existing fusion detection tools. Toward this goal, we compared the performance of 12 well-known fusion detection software packages. We evaluated the sensitivity, false discovery rate, computing time, and memory usage of these tools in four different datasets (positive, negative, mixed, and test). We conclude that some tools are better than others in terms of sensitivity, positive prediction value, time consumption and memory usage. We also observed small overlaps of the fusions detected by different tools in the real dataset (test dataset). This could be due to false discoveries by various tools, but could also be due to the reason that none of the tools are inclusive. We have found that the performance of the tools depends on the quality, read length, and number of reads of the RNA-Seq data. We recommend that users choose the proper tools for their purpose based on the properties of their RNA-Seq data.
Collapse
|
14
|
Conesa A, Madrigal P, Tarazona S, Gomez-Cabrero D, Cervera A, McPherson A, Szcześniak MW, Gaffney DJ, Elo LL, Zhang X, Mortazavi A. A survey of best practices for RNA-seq data analysis. Genome Biol 2016; 17:13. [PMID: 26813401 PMCID: PMC4728800 DOI: 10.1186/s13059-016-0881-8] [Citation(s) in RCA: 1405] [Impact Index Per Article: 175.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
RNA-sequencing (RNA-seq) has a wide variety of applications, but no single analysis pipeline can be used in all cases. We review all of the major steps in RNA-seq data analysis, including experimental design, quality control, read alignment, quantification of gene and transcript levels, visualization, differential gene expression, alternative splicing, functional analysis, gene fusion detection and eQTL mapping. We highlight the challenges associated with each step. We discuss the analysis of small RNAs and the integration of RNA-seq with other functional genomics techniques. Finally, we discuss the outlook for novel technologies that are changing the state of the art in transcriptomics.
Collapse
Affiliation(s)
- Ana Conesa
- Institute for Food and Agricultural Sciences, Department of Microbiology and Cell Science, University of Florida, Gainesville, FL, 32603, USA. .,Centro de Investigación Príncipe Felipe, Genomics of Gene Expression Laboratory, 46012, Valencia, Spain.
| | - Pedro Madrigal
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK. .,Wellcome Trust-Medical Research Council Cambridge Stem Cell Institute, Anne McLaren Laboratory for Regenerative Medicine, Department of Surgery, University of Cambridge, Cambridge, CB2 0SZ, UK.
| | - Sonia Tarazona
- Centro de Investigación Príncipe Felipe, Genomics of Gene Expression Laboratory, 46012, Valencia, Spain.,Department of Applied Statistics, Operations Research and Quality, Universidad Politécnica de Valencia, 46020, Valencia, Spain
| | - David Gomez-Cabrero
- Unit of Computational Medicine, Department of Medicine, Karolinska Institutet, Karolinska University Hospital, 171 77, Stockholm, Sweden.,Center for Molecular Medicine, Karolinska Institutet, 17177, Stockholm, Sweden.,Unit of Clinical Epidemiology, Department of Medicine, Karolinska University Hospital, L8, 17176, Stockholm, Sweden.,Science for Life Laboratory, 17121, Solna, Sweden
| | - Alejandra Cervera
- Systems Biology Laboratory, Institute of Biomedicine and Genome-Scale Biology Research Program, University of Helsinki, 00014, Helsinki, Finland
| | - Andrew McPherson
- School of Computing Science, Simon Fraser University, Burnaby, V5A 1S6, BC, Canada
| | - Michał Wojciech Szcześniak
- Department of Bioinformatics, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University in Poznań, 61-614, Poznań, Poland
| | - Daniel J Gaffney
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Laura L Elo
- Turku Centre for Biotechnology, University of Turku and Åbo Akademi University, FI-20520, Turku, Finland
| | - Xuegong Zhang
- Key Lab of Bioinformatics/Bioinformatics Division, TNLIST and Department of Automation, Tsinghua University, Beijing, 100084, China.,School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Ali Mortazavi
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine, CA, 92697-2300, USA. .,Center for Complex Biological Systems, University of California, Irvine, Irvine, CA, 92697, USA.
| |
Collapse
|
15
|
Arsenijevic V, Davis-Dusenbery BN. Reproducible, Scalable Fusion Gene Detection from RNA-Seq. Methods Mol Biol 2016; 1381:223-37. [PMID: 26667464 DOI: 10.1007/978-1-4939-3204-7_13] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
Chromosomal rearrangements resulting in the creation of novel gene products, termed fusion genes, have been identified as driving events in the development of multiple types of cancer. As these gene products typically do not exist in normal cells, they represent valuable prognostic and therapeutic targets. Advances in next-generation sequencing and computational approaches have greatly improved our ability to detect and identify fusion genes. Nevertheless, these approaches require significant computational resources. Here we describe an approach which leverages cloud computing technologies to perform fusion gene detection from RNA sequencing data at any scale. We additionally highlight methods to enhance reproducibility of bioinformatics analyses which may be applied to any next-generation sequencing experiment.
Collapse
Affiliation(s)
- Vladan Arsenijevic
- Department of Bioinformatics, Seven Bridges Genomics, One Broadway, 14th Floor, Cambridge, MA, 02142, USA
| | - Brandi N Davis-Dusenbery
- Department of Bioinformatics, Seven Bridges Genomics, One Broadway, 14th Floor, Cambridge, MA, 02142, USA.
| |
Collapse
|
16
|
Zhang J, White NM, Schmidt HK, Fulton RS, Tomlinson C, Warren WC, Wilson RK, Maher CA. INTEGRATE: gene fusion discovery using whole genome and transcriptome data. Genome Res 2015; 26:108-18. [PMID: 26556708 PMCID: PMC4691743 DOI: 10.1101/gr.186114.114] [Citation(s) in RCA: 90] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2014] [Accepted: 11/09/2015] [Indexed: 12/13/2022]
Abstract
While next-generation sequencing (NGS) has become the primary technology for discovering gene fusions, we are still faced with the challenge of ensuring that causative mutations are not missed while minimizing false positives. Currently, there are many computational tools that predict structural variations (SV) and gene fusions using whole genome (WGS) and transcriptome sequencing (RNA-seq) data separately. However, as both WGS and RNA-seq have their limitations when used independently, we hypothesize that the orthogonal validation from integrating both data could generate a sensitive and specific approach for detecting high-confidence gene fusion predictions. Fortunately, decreasing NGS costs have resulted in a growing quantity of patients with both data available. Therefore, we developed a gene fusion discovery tool, INTEGRATE, that leverages both RNA-seq and WGS data to reconstruct gene fusion junctions and genomic breakpoints by split-read mapping. To evaluate INTEGRATE, we compared it with eight additional gene fusion discovery tools using the well-characterized breast cell line HCC1395 and peripheral blood lymphocytes derived from the same patient (HCC1395BL). The predictions subsequently underwent a targeted validation leading to the discovery of 131 novel fusions in addition to the seven previously reported fusions. Overall, INTEGRATE only missed six out of the 138 validated fusions and had the highest accuracy of the nine tools evaluated. Additionally, we applied INTEGRATE to 62 breast cancer patients from The Cancer Genome Atlas (TCGA) and found multiple recurrent gene fusions including a subset involving estrogen receptor. Taken together, INTEGRATE is a highly sensitive and accurate tool that is freely available for academic use.
Collapse
Affiliation(s)
- Jin Zhang
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63110, USA; Department of Internal Medicine, Division of Oncology, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | - Nicole M White
- Department of Internal Medicine, Division of Oncology, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | - Heather K Schmidt
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | - Robert S Fulton
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63110, USA; Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | - Chad Tomlinson
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | - Wesley C Warren
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | - Richard K Wilson
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63110, USA; Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | - Christopher A Maher
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63110, USA; Department of Internal Medicine, Division of Oncology, Washington University School of Medicine, St. Louis, Missouri 63110, USA; Alvin J. Siteman Cancer Center, Washington University School of Medicine, St. Louis, Missouri 63110, USA; Department of Biomedical Engineering, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| |
Collapse
|
17
|
Chen I, Chen CY, Chuang TJ. Biogenesis, identification, and function of exonic circular RNAs. WILEY INTERDISCIPLINARY REVIEWS-RNA 2015; 6:563-79. [PMID: 26230526 PMCID: PMC5042038 DOI: 10.1002/wrna.1294] [Citation(s) in RCA: 300] [Impact Index Per Article: 33.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/29/2015] [Revised: 06/11/2015] [Accepted: 06/16/2015] [Indexed: 01/20/2023]
Abstract
Circular RNAs (circRNAs) arise during post-transcriptional processes, in which a single-stranded RNA molecule forms a circle through covalent binding. Previously, circRNA products were often regarded to be splicing intermediates, by-products, or products of aberrant splicing. But recently, rapid advances in high-throughput RNA sequencing (RNA-seq) for global investigation of nonco-linear (NCL) RNAs, which comprised sequence segments that are topologically inconsistent with the reference genome, leads to renewed interest in this type of NCL RNA (i.e., circRNA), especially exonic circRNAs (ecircRNAs). Although the biogenesis and function of ecircRNAs are mostly unknown, some ecircRNAs are abundant, highly expressed, or evolutionarily conserved. Some ecircRNAs have been shown to affect microRNA regulation, and probably play roles in regulating parental gene transcription, cell proliferation, and RNA-binding proteins, indicating their functional potential for development as diagnostic tools. To date, thousands of ecircRNAs have been identified in multiple tissues/cell types from diverse species, through analyses of RNA-seq data. However, the detection of ecircRNA candidates involves several major challenges, including discrimination between ecircRNAs and other types of NCL RNAs (e.g., trans-spliced RNAs and genetic rearrangements); removal of sequencing errors, alignment errors, and in vitro artifacts; and the reconciliation of heterogeneous results arising from the use of different bioinformatics methods or sequencing data generated under different treatments. Such challenges may severely hamper the understanding of ecircRNAs. Herein, we review the biogenesis, identification, properties, and function of ecircRNAs, and discuss some unanswered questions regarding ecircRNAs. We also evaluate the accuracy (in terms of sensitivity and precision) of some well-known circRNA-detecting methods.
Collapse
Affiliation(s)
- Iju Chen
- Genomics Research Center, Academia Sinica, Taipei, Taiwan
| | - Chia-Ying Chen
- Genomics Research Center, Academia Sinica, Taipei, Taiwan
| | | |
Collapse
|
18
|
Davare MA, Tognon CE. Detecting and targetting oncogenic fusion proteins in the genomic era. Biol Cell 2015; 107:111-29. [PMID: 25631473 DOI: 10.1111/boc.201400096] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2014] [Accepted: 01/23/2015] [Indexed: 12/15/2022]
Abstract
The advent of widespread cancer genome sequencing has accelerated our understanding of the molecular aberrations underlying malignant disease at an unprecedented rate. Coupling the large number of bioinformatic methods developed to locate genomic breakpoints with increased sequence read length and a deeper understanding of coding region function has enabled rapid identification of novel actionable oncogenic fusion genes. Using examples of kinase fusions found in liquid and solid tumours, this review highlights major concepts that have arisen in our understanding of cancer pathogenesis through the study of fusion proteins. We provide an overview of recently developed methods to identify potential fusion proteins from next-generation sequencing data, describe the validation of their oncogenic potential and discuss the role of targetted therapies in treating cancers driven by fusion oncoproteins.
Collapse
Affiliation(s)
- Monika A Davare
- Knight Cancer Institute, Oregon Health & Science University, Portland, OR, 97239, U.S.A; Department of Pediatrics, Oregon Health & Science University, Portland, OR, 97239, U.S.A
| | | |
Collapse
|
19
|
O'Meara E, Stack D, Phelan S, McDonagh N, Kelly L, Sciot R, Debiec-Rychter M, Morris T, Cochrane D, Sorensen P, O'Sullivan MJ. Identification of anMLL4-GPS2fusion as an oncogenic driver of undifferentiated spindle cell sarcoma in a child. Genes Chromosomes Cancer 2014; 53:991-8. [DOI: 10.1002/gcc.22208] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2014] [Revised: 07/23/2014] [Accepted: 07/24/2014] [Indexed: 02/03/2023] Open
Affiliation(s)
- Elaine O'Meara
- Cancer Genetics Program, The National Children's Research Centre; Crumlin Dublin 12 Ireland
| | - Deirdre Stack
- Cancer Genetics Program, The National Children's Research Centre; Crumlin Dublin 12 Ireland
| | - Susan Phelan
- Cancer Genetics Program, The National Children's Research Centre; Crumlin Dublin 12 Ireland
| | - Naomi McDonagh
- Cancer Genetics Program, The National Children's Research Centre; Crumlin Dublin 12 Ireland
| | - Lorna Kelly
- Cancer Genetics Program, The National Children's Research Centre; Crumlin Dublin 12 Ireland
| | - Raf Sciot
- Department of Pathology; K.U. Leuven, University Hospital Gasthuisberg; Leuven Belgium
| | - Maria Debiec-Rychter
- Department of Genetics; K.U. Leuven, University Hospital Gasthuisberg; Leuven Belgium
| | - Thomas Morris
- Cytogenetics Laboratory, National Centre for Medical Genetics, Our Lady's Children's Hospital; Crumlin Dublin 12 Ireland
| | - Doug Cochrane
- Division of Neurosurgery; British Columbia's Children's Hospital; Vancouver V6H 3V4 Canada
| | - Poul Sorensen
- Department of Molecular Oncology; British Columbia Cancer Center; Vancouver V5Z 1L3 Canada
| | - Maureen J. O'Sullivan
- Cancer Genetics Program, The National Children's Research Centre; Crumlin Dublin 12 Ireland
- Department of Pathology; Our Lady's Children's Hospital; Crumlin Dublin 12 Ireland
- Department of Histopathology; School of Medicine, University of Dublin; Trinity College Dublin 2 Ireland
| |
Collapse
|
20
|
Abstract
High-throughput DNA sequencing has revolutionized the study of cancer genomics with numerous discoveries that are relevant to cancer diagnosis and treatment. The latest sequencing and analysis methods have successfully identified somatic alterations, including single-nucleotide variants, insertions and deletions, copy-number aberrations, structural variants and gene fusions. Additional computational techniques have proved useful for defining the mutations, genes and molecular networks that drive diverse cancer phenotypes and that determine clonal architectures in tumour samples. Collectively, these tools have advanced the study of genomic, transcriptomic and epigenomic alterations in cancer, and their association to clinical properties. Here, we review cancer genomics software and the insights that have been gained from their application.
Collapse
|
21
|
Wang Y, Wang Y, Liu Q, Xu G, Mao F, Qin T, Teng H, Cai W, Yu P, Cai T, Zhao M, Sun ZS, Xie C. Comparative RNA-seq analysis reveals potential mechanisms mediating the conversion to androgen independence in an LNCaP progression cell model. Cancer Lett 2014; 342:130-8. [DOI: 10.1016/j.canlet.2013.08.044] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2013] [Revised: 08/16/2013] [Accepted: 08/28/2013] [Indexed: 01/14/2023]
|
22
|
Annala MJ, Parker BC, Zhang W, Nykter M. Fusion genes and their discovery using high throughput sequencing. Cancer Lett 2013; 340:192-200. [PMID: 23376639 PMCID: PMC3675181 DOI: 10.1016/j.canlet.2013.01.011] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2012] [Revised: 12/28/2012] [Accepted: 01/04/2013] [Indexed: 01/25/2023]
Abstract
Fusion genes are hybrid genes that combine parts of two or more original genes. They can form as a result of chromosomal rearrangements or abnormal transcription, and have been shown to act as drivers of malignant transformation and progression in many human cancers. The biological significance of fusion genes together with their specificity to cancer cells has made them into excellent targets for molecular therapy. Fusion genes are also used as diagnostic and prognostic markers to confirm cancer diagnosis and monitor response to molecular therapies. High-throughput sequencing has enabled the systematic discovery of fusion genes in a wide variety of cancer types. In this review, we describe the history of fusion genes in cancer and the ways in which fusion genes form and affect cellular function. We also describe computational methodologies for detecting fusion genes from high-throughput sequencing experiments, and the most common sources of error that lead to false discovery of fusion genes.
Collapse
Affiliation(s)
- M J Annala
- Tampere University of Technology, Tampere, Finland.
| | | | | | | |
Collapse
|
23
|
Chen K, Navin NE, Wang Y, Schmidt HK, Wallis JW, Niu B, Fan X, Zhao H, McLellan MD, Hoadley KA, Mardis ER, Ley TJ, Perou CM, Wilson RK, Ding L. BreakTrans: uncovering the genomic architecture of gene fusions. Genome Biol 2013; 14:R87. [PMID: 23972288 PMCID: PMC4054677 DOI: 10.1186/gb-2013-14-8-r87] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2013] [Accepted: 08/23/2013] [Indexed: 01/18/2023] Open
Abstract
Producing gene fusions through genomic structural rearrangements is a major mechanism for tumor evolution. Therefore, accurately detecting gene fusions and the originating rearrangements is of great importance for personalized cancer diagnosis and targeted therapy. We present a tool, BreakTrans, that systematically maps predicted gene fusions to structural rearrangements. Thus, BreakTrans not only validates both types of predictions, but also provides mechanistic interpretations. BreakTrans effectively validates known fusions and discovers novel events in a breast cancer cell line. Applying BreakTrans to 43 breast cancer samples in The Cancer Genome Atlas identifies 90 genomically validated gene fusions. BreakTrans is available at http://bioinformatics.mdanderson.org/main/BreakTrans.
Collapse
|
24
|
Wu CC, Kannan K, Lin S, Yen L, Milosavljevic A. Identification of cancer fusion drivers using network fusion centrality. ACTA ACUST UNITED AC 2013; 29:1174-81. [PMID: 23505294 DOI: 10.1093/bioinformatics/btt131] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
SUMMARY Gene fusions are being discovered at an increasing rate using massively parallel sequencing technologies. Prioritization of cancer fusion drivers for validation cannot be performed using traditional single-gene based methods because fusions involve portions of two partner genes. To address this problem, we propose a novel network analysis method called fusion centrality that is specifically tailored for prioritizing gene fusions. We first propose a domain-based fusion model built on the theory of exon/domain shuffling. The model leads to a hypothesis that a fusion is more likely to be an oncogenic driver if its partner genes act like hubs in a network because the fusion mutation can deregulate normal functions of many other genes and their pathways. The hypothesis is supported by the observation that for most known cancer fusion genes, at least one of the fusion partners appears to be a hub in a network, and even for many fusions both partners appear to be hubs. Based on this model, we construct fusion centrality, a multi-gene-based network metric, and use it to score fusion drivers. We show that the fusion centrality outperforms other single gene-based methods. Specifically, the method successfully predicts most of 38 newly discovered fusions that had validated oncogenic importance. To our best knowledge, this is the first network-based approach for identifying fusion drivers. AVAILABILITY Matlab code implementing the fusion centrality method is available upon request from the corresponding authors.
Collapse
Affiliation(s)
- Chia-Chin Wu
- Department of Genomic Medicine, UT MD Anderson Cancer Center, Houston, TX 77030, USA.
| | | | | | | | | |
Collapse
|
25
|
Yorukoglu D, Hach F, Swanson L, Collins CC, Birol I, Sahinalp SC. Dissect: detection and characterization of novel structural alterations in transcribed sequences. Bioinformatics 2013; 28:i179-87. [PMID: 22689759 PMCID: PMC3371846 DOI: 10.1093/bioinformatics/bts214] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Abstract
Motivation: Computational identification of genomic structural variants via high-throughput sequencing is an important problem for which a number of highly sophisticated solutions have been recently developed. With the advent of high-throughput transcriptome sequencing (RNA-Seq), the problem of identifying structural alterations in the transcriptome is now attracting significant attention. In this article, we introduce two novel algorithmic formulations for identifying transcriptomic structural variants through aligning transcripts to the reference genome under the consideration of such variation. The first formulation is based on a nucleotide-level alignment model; a second, potentially faster formulation is based on chaining fragments shared between each transcript and the reference genome. Based on these formulations, we introduce a novel transcriptome-to-genome alignment tool, Dissect (DIScovery of Structural Alteration Event Containing Transcripts), which can identify and characterize transcriptomic events such as duplications, inversions, rearrangements and fusions. Dissect is suitable for whole transcriptome structural variation discovery problems involving sufficiently long reads or accurately assembled contigs. Results: We tested Dissect on simulated transcripts altered via structural events, as well as assembled RNA-Seq contigs from human prostate cancer cell line C4-2. Our results indicate that Dissect has high sensitivity and specificity in identifying structural alteration events in simulated transcripts as well as uncovering novel structural alterations in cancer transcriptomes. Availability: Dissect is available for public use at: http://dissect-trans.sourceforge.net Contact:denizy@mit.edu; fhach@cs.sfu.ca; cenk@cs.sfu.ca
Collapse
Affiliation(s)
- Deniz Yorukoglu
- School of Computing Science, Simon Fraser University, Burnaby, V5A 1S6 BC, Canada.
| | | | | | | | | | | |
Collapse
|
26
|
Wu C, Wyatt AW, McPherson A, Lin D, McConeghy BJ, Mo F, Shukin R, Lapuk AV, Jones SJM, Zhao Y, Marra MA, Gleave ME, Volik SV, Wang Y, Sahinalp SC, Collins CC. Poly-gene fusion transcripts and chromothripsis in prostate cancer. Genes Chromosomes Cancer 2012; 51:1144-53. [PMID: 22927308 DOI: 10.1002/gcc.21999] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2012] [Accepted: 07/26/2012] [Indexed: 01/12/2023] Open
Abstract
Complex genome rearrangements are frequently observed in cancer but their impact on tumor molecular biology is largely unknown. Recent studies have identified a new phenomenon involving the simultaneous generation of tens to hundreds of genomic rearrangements, called chromothripsis. To understand the molecular consequences of these events, we sequenced the genomes and transcriptomes of two prostate tumors exhibiting evidence of chromothripsis. We identified several complex fusion transcripts, each containing sequence from three different genes, originating from different parts of the genome. One such poly-gene fusion transcript appeared to be expressed from a chain of small genomic fragments. Furthermore, we detected poly-gene fusion transcripts in the prostate cancer cell line LNCaP, suggesting they may represent a common phenomenon. Finally in one tumor with chromothripsis, we identified multiple mutations in the p53 signaling pathway, expanding on recent work associating aberrant DNA damage response mechanisms with chromothripsis. Overall, our data show that chromothripsis can manifest as massively rearranged transcriptomes. The implication that multigenic changes can give rise to poly-gene fusion transcripts is potentially of great significance to cancer genetics.
Collapse
Affiliation(s)
- Chunxiao Wu
- Vancouver Prostate Centre and Department of Urologic Sciences, University of British Columbia, Vancouver, BC, Canada
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
27
|
Lapuk AV, Wu C, Wyatt AW, McPherson A, McConeghy BJ, Brahmbhatt S, Mo F, Zoubeidi A, Anderson S, Bell RH, Haegert A, Shukin R, Wang Y, Fazli L, Hurtado-Coll A, Jones EC, Hach F, Hormozdiari F, Hajirasouliha I, Boutros PC, Bristow RG, Zhao Y, Marra MA, Fanjul A, Maher CA, Chinnaiyan AM, Rubin MA, Beltran H, Sahinalp SC, Gleave ME, Volik SV, Collins CC. From sequence to molecular pathology, and a mechanism driving the neuroendocrine phenotype in prostate cancer. J Pathol 2012; 227:286-97. [PMID: 22553170 DOI: 10.1002/path.4047] [Citation(s) in RCA: 147] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
The current paradigm of cancer care relies on predictive nomograms which integrate detailed histopathology with clinical data. However, when predictions fail, the consequences for patients are often catastrophic, especially in prostate cancer where nomograms influence the decision to therapeutically intervene. We hypothesized that the high dimensional data afforded by massively parallel sequencing (MPS) is not only capable of providing biological insights, but may aid molecular pathology of prostate tumours. We assembled a cohort of six patients with high-risk disease, and performed deep RNA and shallow DNA sequencing in primary tumours and matched metastases where available. Our analysis identified copy number abnormalities, accurately profiled gene expression levels, and detected both differential splicing and expressed fusion genes. We revealed occult and potentially dormant metastases, unambiguously supporting the patients' clinical history, and implicated the REST transcriptional complex in the development of neuroendocrine prostate cancer, validating this finding in a large independent cohort. We massively expand on the number of novel fusion genes described in prostate cancer; provide fresh evidence for the growing link between fusion gene aetiology and gene expression profiles; and show the utility of fusion genes for molecular pathology. Finally, we identified chromothripsis in a patient with chronic prostatitis. Our results provide a strong foundation for further development of MPS-based molecular pathology.
Collapse
MESH Headings
- Adenocarcinoma/genetics
- Adenocarcinoma/metabolism
- Adenocarcinoma/secondary
- Adenocarcinoma/therapy
- Aged
- Alternative Splicing
- Biomarkers, Tumor/blood
- Biomarkers, Tumor/genetics
- British Columbia
- Cell Line, Tumor
- Cell Transformation, Neoplastic/genetics
- Cell Transformation, Neoplastic/metabolism
- Cell Transformation, Neoplastic/pathology
- Cluster Analysis
- Decision Support Techniques
- Gene Dosage
- Gene Expression Profiling/methods
- Gene Expression Regulation, Neoplastic
- Gene Fusion
- Genetic Predisposition to Disease
- Humans
- Lymphatic Metastasis
- Male
- Middle Aged
- Neoplasm Grading
- Neoplasms, Hormone-Dependent/genetics
- Neoplasms, Hormone-Dependent/metabolism
- Neoplasms, Hormone-Dependent/pathology
- Neoplasms, Hormone-Dependent/therapy
- Neuroendocrine Cells/metabolism
- Neuroendocrine Cells/pathology
- Nomograms
- Oligonucleotide Array Sequence Analysis
- Patient Selection
- Phenotype
- Precision Medicine
- Prognosis
- Prostate-Specific Antigen/blood
- Prostatic Neoplasms/genetics
- Prostatic Neoplasms/metabolism
- Prostatic Neoplasms/pathology
- Prostatic Neoplasms/therapy
- RNA Interference
- Transfection
Collapse
Affiliation(s)
- Anna V Lapuk
- Vancouver Prostate Centre and Department of Urologic Sciences, University of British Columbia, Vancouver, BC, Canada
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
28
|
Wang Q, Xia J, Jia P, Pao W, Zhao Z. Application of next generation sequencing to human gene fusion detection: computational tools, features and perspectives. Brief Bioinform 2012; 14:506-19. [PMID: 22877769 DOI: 10.1093/bib/bbs044] [Citation(s) in RCA: 86] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
Gene fusions are important genomic events in human cancer because their fusion gene products can drive the development of cancer and thus are potential prognostic tools or therapeutic targets in anti-cancer treatment. Major advancements have been made in computational approaches for fusion gene discovery over the past 3 years due to improvements and widespread applications of high-throughput next generation sequencing (NGS) technologies. To identify fusions from NGS data, existing methods typically leverage the strengths of both sequencing technologies and computational strategies. In this article, we review the NGS and computational features of existing methods for fusion gene detection and suggest directions for future development.
Collapse
|
29
|
McPherson A, Wu C, Wyatt AW, Shah S, Collins C, Sahinalp SC. nFuse: discovery of complex genomic rearrangements in cancer using high-throughput sequencing. Genome Res 2012; 22:2250-61. [PMID: 22745232 PMCID: PMC3483554 DOI: 10.1101/gr.136572.111] [Citation(s) in RCA: 61] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Complex genomic rearrangements (CGRs) are emerging as a new feature of cancer genomes. CGRs are characterized by multiple genomic breakpoints and thus have the potential to simultaneously affect multiple genes, fusing some genes and interrupting other genes. Analysis of high-throughput whole-genome shotgun sequencing (WGSS) is beginning to facilitate the discovery and characterization of CGRs, but further development of computational methods is required. We have developed an algorithmic method for identifying CGRs in WGSS data based on shortest alternating paths in breakpoint graphs. Aiming for a method with the highest possible sensitivity, we use breakpoint graphs built from all WGSS data, including sequences with ambiguous genomic origin. Since the majority of cell function is encoded by the transcriptome, we target our search to find CGRs that underlie fusion transcripts predicted from matched high-throughput cDNA sequencing (RNA-seq). We have applied our method, nFuse, to the discovery of CGRs in publicly available data from the well-studied breast cancer cell line HCC1954 and primary prostate tumor sample 963. We first establish the sensitivity and specificity of the nFuse breakpoint prediction and scoring method using breakpoints previously discovered in HCC1954. We then validate five out of six CGRs in HCC1954 and two out of two CGRs in 963. We show examples of gene fusions that would be difficult to discover using methods that do not account for the existence of CGRs, including one important event that was missed in a previous study of the HCC1954 genome. Finally, we illustrate how CGRs may be used to infer the gene expression history of a tumor.
Collapse
Affiliation(s)
- Andrew McPherson
- School of Computing Science, Simon Fraser University, Vancouver, British Columbia V5A 1S6, Canada.
| | | | | | | | | | | |
Collapse
|
30
|
Wu C, Wyatt AW, Lapuk AV, McPherson A, McConeghy BJ, Bell RH, Anderson S, Haegert A, Brahmbhatt S, Shukin R, Mo F, Li E, Fazli L, Hurtado-Coll A, Jones EC, Butterfield YS, Hach F, Hormozdiari F, Hajirasouliha I, Boutros PC, Bristow RG, Jones SJ, Hirst M, Marra MA, Maher CA, Chinnaiyan AM, Sahinalp SC, Gleave ME, Volik SV, Collins CC. Integrated genome and transcriptome sequencing identifies a novel form of hybrid and aggressive prostate cancer. J Pathol 2012; 227:53-61. [PMID: 22294438 DOI: 10.1002/path.3987] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2011] [Revised: 12/23/2011] [Accepted: 12/29/2011] [Indexed: 01/07/2023]
Abstract
Next-generation sequencing is making sequence-based molecular pathology and personalized oncology viable. We selected an individual initially diagnosed with conventional but aggressive prostate adenocarcinoma and sequenced the genome and transcriptome from primary and metastatic tissues collected prior to hormone therapy. The histology-pathology and copy number profiles were remarkably homogeneous, yet it was possible to propose the quadrant of the prostate tumour that likely seeded the metastatic diaspora. Despite a homogeneous cell type, our transcriptome analysis revealed signatures of both luminal and neuroendocrine cell types. Remarkably, the repertoire of expressed but apparently private gene fusions, including C15orf21:MYC, recapitulated this biology. We hypothesize that the amplification and over-expression of the stem cell gene MSI2 may have contributed to the stable hybrid cellular identity. This hybrid luminal-neuroendocrine tumour appears to represent a novel and highly aggressive case of prostate cancer with unique biological features and, conceivably, a propensity for rapid progression to castrate-resistance. Overall, this work highlights the importance of integrated analyses of genome, exome and transcriptome sequences for basic tumour biology, sequence-based molecular pathology and personalized oncology.
Collapse
Affiliation(s)
- Chunxiao Wu
- Vancouver Prostate Centre and Department of Urologic Sciences, University of British Columbia, Vancouver, BC, Canada
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
31
|
Roychowdhury S, Iyer MK, Robinson DR, Lonigro RJ, Wu YM, Cao X, Kalyana-Sundaram S, Sam L, Balbin OA, Quist MJ, Barrette T, Everett J, Siddiqui J, Kunju LP, Navone N, Araujo JC, Troncoso P, Logothetis CJ, Innis JW, Smith DC, Lao CD, Kim SY, Roberts JS, Gruber SB, Pienta KJ, Talpaz M, Chinnaiyan AM. Personalized oncology through integrative high-throughput sequencing: a pilot study. Sci Transl Med 2011; 3:111ra121. [PMID: 22133722 PMCID: PMC3476478 DOI: 10.1126/scitranslmed.3003161] [Citation(s) in RCA: 448] [Impact Index Per Article: 34.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Individual cancers harbor a set of genetic aberrations that can be informative for identifying rational therapies currently available or in clinical trials. We implemented a pilot study to explore the practical challenges of applying high-throughput sequencing in clinical oncology. We enrolled patients with advanced or refractory cancer who were eligible for clinical trials. For each patient, we performed whole-genome sequencing of the tumor, targeted whole-exome sequencing of tumor and normal DNA, and transcriptome sequencing (RNA-Seq) of the tumor to identify potentially informative mutations in a clinically relevant time frame of 3 to 4 weeks. With this approach, we detected several classes of cancer mutations including structural rearrangements, copy number alterations, point mutations, and gene expression alterations. A multidisciplinary Sequencing Tumor Board (STB) deliberated on the clinical interpretation of the sequencing results obtained. We tested our sequencing strategy on human prostate cancer xenografts. Next, we enrolled two patients into the clinical protocol and were able to review the results at our STB within 24 days of biopsy. The first patient had metastatic colorectal cancer in which we identified somatic point mutations in NRAS, TP53, AURKA, FAS, and MYH11, plus amplification and overexpression of cyclin-dependent kinase 8 (CDK8). The second patient had malignant melanoma, in which we identified a somatic point mutation in HRAS and a structural rearrangement affecting CDKN2C. The STB identified the CDK8 amplification and Ras mutation as providing a rationale for clinical trials with CDK inhibitors or MEK (mitogen-activated or extracellular signal-regulated protein kinase kinase) and PI3K (phosphatidylinositol 3-kinase) inhibitors, respectively. Integrative high-throughput sequencing of patients with advanced cancer generates a comprehensive, individual mutational landscape to facilitate biomarker-driven clinical trials in oncology.
Collapse
Affiliation(s)
- Sameek Roychowdhury
- Michigan Center for Translational Pathology, Ann Arbor, MI 48109, USA
- Division of Hematology and Oncology, Department of Internal Medicine, University of Michigan, Ann Arbor, MI 48109, USA
| | - Matthew K. Iyer
- Michigan Center for Translational Pathology, Ann Arbor, MI 48109, USA
- Center for Computational Medicine and Biology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Dan R. Robinson
- Michigan Center for Translational Pathology, Ann Arbor, MI 48109, USA
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Robert J. Lonigro
- Michigan Center for Translational Pathology, Ann Arbor, MI 48109, USA
- Center for Computational Medicine and Biology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Yi-Mi Wu
- Michigan Center for Translational Pathology, Ann Arbor, MI 48109, USA
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Xuhong Cao
- Michigan Center for Translational Pathology, Ann Arbor, MI 48109, USA
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA
- Howard Hughes Medical Institute, University of Michigan, Ann Arbor, MI 48109, USA
| | - Shanker Kalyana-Sundaram
- Michigan Center for Translational Pathology, Ann Arbor, MI 48109, USA
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA
- Department of Environmental Biotechnology, Bharathidasan University, Tiruchirappalli 620024, India
| | - Lee Sam
- Michigan Center for Translational Pathology, Ann Arbor, MI 48109, USA
- Center for Computational Medicine and Biology, University of Michigan, Ann Arbor, MI 48109, USA
| | - O. Alejandro Balbin
- Michigan Center for Translational Pathology, Ann Arbor, MI 48109, USA
- Center for Computational Medicine and Biology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Michael J. Quist
- Michigan Center for Translational Pathology, Ann Arbor, MI 48109, USA
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Terrence Barrette
- Michigan Center for Translational Pathology, Ann Arbor, MI 48109, USA
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Jessica Everett
- Division of Molecular Medicine and Genetics, Department of Internal Medicine, University of Michigan, Ann Arbor, MI 48109, USA
| | - Javed Siddiqui
- Michigan Center for Translational Pathology, Ann Arbor, MI 48109, USA
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Lakshmi P. Kunju
- Michigan Center for Translational Pathology, Ann Arbor, MI 48109, USA
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Nora Navone
- Division of Cancer Medicine, Department of Genitourinary Medical Oncology, M. D. Anderson Cancer Center, Houston, TX 77030, USA
| | - John C. Araujo
- Division of Cancer Medicine, Department of Genitourinary Medical Oncology, M. D. Anderson Cancer Center, Houston, TX 77030, USA
| | - Patricia Troncoso
- Division of Cancer Medicine, Department of Genitourinary Medical Oncology, M. D. Anderson Cancer Center, Houston, TX 77030, USA
| | - Christopher J. Logothetis
- Division of Cancer Medicine, Department of Genitourinary Medical Oncology, M. D. Anderson Cancer Center, Houston, TX 77030, USA
| | - Jeffrey W. Innis
- Department of Human Genetics and Pediatrics and Communicable Diseases, University of Michigan, Ann Arbor, MI 48109, USA
| | - David C. Smith
- Division of Hematology and Oncology, Department of Internal Medicine, University of Michigan, Ann Arbor, MI 48109, USA
- Comprehensive Cancer Center, University of Michigan, Ann Arbor, MI 48109, USA
| | - Christopher D. Lao
- Division of Hematology and Oncology, Department of Internal Medicine, University of Michigan, Ann Arbor, MI 48109, USA
- Comprehensive Cancer Center, University of Michigan, Ann Arbor, MI 48109, USA
| | - Scott Y. Kim
- Department of Psychiatry, University of Michigan, Ann Arbor, MI 48109, USA
| | - J. Scott Roberts
- Department of Psychiatry, University of Michigan, Ann Arbor, MI 48109, USA
- Department of Health Behavior and Health Education, School of Public Health, University of Michigan, Ann Arbor, MI 48109, USA
| | - Stephen B. Gruber
- Division of Hematology and Oncology, Department of Internal Medicine, University of Michigan, Ann Arbor, MI 48109, USA
- Comprehensive Cancer Center, University of Michigan, Ann Arbor, MI 48109, USA
| | - Kenneth J. Pienta
- Michigan Center for Translational Pathology, Ann Arbor, MI 48109, USA
- Division of Hematology and Oncology, Department of Internal Medicine, University of Michigan, Ann Arbor, MI 48109, USA
- Comprehensive Cancer Center, University of Michigan, Ann Arbor, MI 48109, USA
- Department of Urology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Moshe Talpaz
- Division of Hematology and Oncology, Department of Internal Medicine, University of Michigan, Ann Arbor, MI 48109, USA
- Comprehensive Cancer Center, University of Michigan, Ann Arbor, MI 48109, USA
| | - Arul M. Chinnaiyan
- Michigan Center for Translational Pathology, Ann Arbor, MI 48109, USA
- Center for Computational Medicine and Biology, University of Michigan, Ann Arbor, MI 48109, USA
- Department of Pathology, University of Michigan, Ann Arbor, MI 48109, USA
- Howard Hughes Medical Institute, University of Michigan, Ann Arbor, MI 48109, USA
- Department of Urology, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|