51
|
Linehan WM, Spellman PT, Ricketts CJ, Creighton CJ, Fei SS, Davis C, Wheeler DA, Murray BA, Schmidt L, Vocke CD, Peto M, Al Mamun AAM, Shinbrot E, Sethi A, Brooks S, Rathmell WK, Brooks AN, Hoadley KA, Robertson AG, Brooks D, Bowlby R, Sadeghi S, Shen H, Weisenberger DJ, Bootwalla M, Baylin SB, Laird PW, Cherniack AD, Saksena G, Haake S, Li J, Liang H, Lu Y, Mills GB, Akbani R, Leiserson MD, Raphael BJ, Anur P, Bottaro D, Albiges L, Barnabas N, Choueiri TK, Czerniak B, Godwin AK, Hakimi AA, Ho T, Hsieh J, Ittmann M, Kim WY, Krishnan B, Merino MJ, Mills Shaw KR, Reuter VE, Reznik E, Shelley CS, Shuch B, Signoretti S, Srinivasan R, Tamboli P, Thomas G, Tickoo S, Burnett K, Crain D, Gardner J, Lau K, Mallery D, Morris S, Paulauskis JD, Penny RJ, Shelton C, Shelton WT, Sherman M, Thompson E, Yena P, Avedon MT, Bowen J, Gastier-Foster JM, Gerken M, Leraas KM, Lichtenberg TM, Ramirez NC, Santos T, Wise L, Zmuda E, Demchok JA, Felau I, Hutter CM, Sheth M, Sofia HJ, Tarnuzzer R, Wang Z, Yang L, Zenklusen JC, Zhang J(J, Ayala B, Baboud J, Chudamani S, Liu J, Lolla L, Naresh R, Pihl T, Sun Q, Wan Y, Wu Y, Ally A, Balasundaram M, Balu S, Beroukhim R, Bodenheimer T, Buhay C, Butterfield YS, Carlsen R, Carter SL, Chao H, Chuah E, Clarke A, Covington KR, Dahdouli M, Dewal N, Dhalla N, Doddapaneni H, Drummond J, Gabriel SB, Gibbs RA, Guin R, Hale W, Hawes A, Hayes DN, Holt RA, Hoyle AP, Jefferys SR, Jones SJ, Jones CD, Kalra D, Kovar C, Lewis L, Li J, Ma Y, Marra MA, Mayo M, Meng S, Meyerson M, Mieczkowski PA, Moore RA, Morton D, Mose LE, Mungall AJ, Muzny D, Parker JS, Perou CM, Roach J, Schein JE, Schumacher SE, Shi Y, Simons JV, Sipahimalani P, Skelly T, Soloway MG, Sougnez C, Tam A, Tan D, Thiessen N, Veluvolu U, Wang M, Wilkerson MD, Wong T, Wu J, Xi L, Zhou J, Bedford J, Chen F, Fu Y, Gerstein M, Haussler D, Kasaian K, Lai P, Ling S, Radenbaugh A, Van Den Berg D, Weinstein JN, Zhu J, Albert M, Alexopoulou I, Andersen JJ, Auman JT, Bartlett J, Bastacky S, Bergsten J, Blute ML, Boice L, Bollag RJ, Boyd J, Castle E, Chen YB, Cheville JC, Curley E, Davies B, DeVolk A, Dhir R, Dike L, Eckman J, Engel J, Harr J, Hrebinko R, Huang M, Huelsenbeck-Dill L, Iacocca M, Jacobs B, Lobis M, Maranchie JK, McMeekin S, Myers J, Nelson J, Parfitt J, Parwani A, Petrelli N, Rabeno B, Roy S, Salner AL, Slaton J, Stanton M, Thompson RH, Thorne L, Tucker K, Weinberger PM, Winemiller C, Zach LA, Zuna R. Comprehensive Molecular Characterization of Papillary Renal-Cell Carcinoma. N Engl J Med 2016; 374:135-45. [PMID: 26536169 PMCID: PMC4775252 DOI: 10.1056/nejmoa1505917] [Citation(s) in RCA: 895] [Impact Index Per Article: 111.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
BACKGROUND Papillary renal-cell carcinoma, which accounts for 15 to 20% of renal-cell carcinomas, is a heterogeneous disease that consists of various types of renal cancer, including tumors with indolent, multifocal presentation and solitary tumors with an aggressive, highly lethal phenotype. Little is known about the genetic basis of sporadic papillary renal-cell carcinoma, and no effective forms of therapy for advanced disease exist. METHODS We performed comprehensive molecular characterization of 161 primary papillary renal-cell carcinomas, using whole-exome sequencing, copy-number analysis, messenger RNA and microRNA sequencing, DNA-methylation analysis, and proteomic analysis. RESULTS Type 1 and type 2 papillary renal-cell carcinomas were shown to be different types of renal cancer characterized by specific genetic alterations, with type 2 further classified into three individual subgroups on the basis of molecular differences associated with patient survival. Type 1 tumors were associated with MET alterations, whereas type 2 tumors were characterized by CDKN2A silencing, SETD2 mutations, TFE3 fusions, and increased expression of the NRF2-antioxidant response element (ARE) pathway. A CpG island methylator phenotype (CIMP) was observed in a distinct subgroup of type 2 papillary renal-cell carcinomas that was characterized by poor survival and mutation of the gene encoding fumarate hydratase (FH). CONCLUSIONS Type 1 and type 2 papillary renal-cell carcinomas were shown to be clinically and biologically distinct. Alterations in the MET pathway were associated with type 1, and activation of the NRF2-ARE pathway was associated with type 2; CDKN2A loss and CIMP in type 2 conveyed a poor prognosis. Furthermore, type 2 papillary renal-cell carcinoma consisted of at least three subtypes based on molecular and phenotypic features. (Funded by the National Institutes of Health.).
Collapse
|
52
|
Martin SD, Coukos G, Holt RA, Nelson BH. Targeting the undruggable: immunotherapy meets personalized oncology in the genomic era. Ann Oncol 2015; 26:2367-74. [PMID: 26371284 PMCID: PMC4658541 DOI: 10.1093/annonc/mdv382] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2015] [Revised: 08/14/2015] [Accepted: 08/17/2015] [Indexed: 12/22/2022] Open
Abstract
Owing to recent advances in genomic technologies, personalized oncology is poised to fundamentally alter cancer therapy. In this paradigm, the mutational and transcriptional profiles of tumors are assessed, and personalized treatments are designed based on the specific molecular abnormalities relevant to each patient's cancer. To date, such approaches have yielded impressive clinical responses in some patients. However, a major limitation of this strategy has also been revealed: the vast majority of tumor mutations are not targetable by current pharmacological approaches. Immunotherapy offers a promising alternative to exploit tumor mutations as targets for clinical intervention. Mutated proteins can give rise to novel antigens (called neoantigens) that are recognized with high specificity by patient T cells. Indeed, neoantigen-specific T cells have been shown to underlie clinical responses to many standard treatments and immunotherapeutic interventions. Moreover, studies in mouse models targeting neoantigens, and early results from clinical trials, have established proof of concept for personalized immunotherapies targeting next-generation sequencing identified neoantigens. Here, we review basic immunological principles related to T-cell recognition of neoantigens, and we examine recent studies that use genomic data to design personalized immunotherapies. We discuss the opportunities and challenges that lie ahead on the road to improving patient outcomes by incorporating immunotherapy into the paradigm of personalized oncology.
Collapse
|
53
|
Brown SD, Raeburn LA, Holt RA. Profiling tissue-resident T cell repertoires by RNA sequencing. Genome Med 2015; 7:125. [PMID: 26620832 PMCID: PMC4666197 DOI: 10.1186/s13073-015-0248-x] [Citation(s) in RCA: 67] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2015] [Accepted: 11/13/2015] [Indexed: 12/21/2022] Open
Abstract
Deep sequencing of recombined T cell receptor (TCR) genes and transcripts has provided a view of T cell repertoire diversity at an unprecedented resolution. Beyond profiling peripheral blood, analysis of tissue-resident T cells provides further insight into immune-related diseases. We describe the extraction of TCR sequence information directly from RNA-sequencing data from 6738 tumor and 604 control tissues, with a typical yield of 1 TCR per 10 million reads. This method circumvents the need for PCR amplification of the TCR template and provides TCR information in the context of global gene expression, allowing integrated analysis of extensive RNA-sequencing data resources.
Collapse
|
54
|
Abstract
The complexity of the immune system is now being interrogated using methodologies that generate extensive multi-dimensional data. Effective collection, integration and interpretation of these data remain difficult, but overcoming these important challenges will provide new insights into immune function and opportunities for the rational design of new immune interventions.
Collapse
|
55
|
Brat DJ, Verhaak RGW, Aldape KD, Yung WKA, Salama SR, Cooper LAD, Rheinbay E, Miller CR, Vitucci M, Morozova O, Robertson AG, Noushmehr H, Laird PW, Cherniack AD, Akbani R, Huse JT, Ciriello G, Poisson LM, Barnholtz-Sloan JS, Berger MS, Brennan C, Colen RR, Colman H, Flanders AE, Giannini C, Grifford M, Iavarone A, Jain R, Joseph I, Kim J, Kasaian K, Mikkelsen T, Murray BA, O'Neill BP, Pachter L, Parsons DW, Sougnez C, Sulman EP, Vandenberg SR, Van Meir EG, von Deimling A, Zhang H, Crain D, Lau K, Mallery D, Morris S, Paulauskis J, Penny R, Shelton T, Sherman M, Yena P, Black A, Bowen J, Dicostanzo K, Gastier-Foster J, Leraas KM, Lichtenberg TM, Pierson CR, Ramirez NC, Taylor C, Weaver S, Wise L, Zmuda E, Davidsen T, Demchok JA, Eley G, Ferguson ML, Hutter CM, Mills Shaw KR, Ozenberger BA, Sheth M, Sofia HJ, Tarnuzzer R, Wang Z, Yang L, Zenklusen JC, Ayala B, Baboud J, Chudamani S, Jensen MA, Liu J, Pihl T, Raman R, Wan Y, Wu Y, Ally A, Auman JT, Balasundaram M, Balu S, Baylin SB, Beroukhim R, Bootwalla MS, Bowlby R, Bristow CA, Brooks D, Butterfield Y, Carlsen R, Carter S, Chin L, Chu A, Chuah E, Cibulskis K, Clarke A, Coetzee SG, Dhalla N, Fennell T, Fisher S, Gabriel S, Getz G, Gibbs R, Guin R, Hadjipanayis A, Hayes DN, Hinoue T, Hoadley K, Holt RA, Hoyle AP, Jefferys SR, Jones S, Jones CD, Kucherlapati R, Lai PH, Lander E, Lee S, Lichtenstein L, Ma Y, Maglinte DT, Mahadeshwar HS, Marra MA, Mayo M, Meng S, Meyerson ML, Mieczkowski PA, Moore RA, Mose LE, Mungall AJ, Pantazi A, Parfenov M, Park PJ, Parker JS, Perou CM, Protopopov A, Ren X, Roach J, Sabedot TS, Schein J, Schumacher SE, Seidman JG, Seth S, Shen H, Simons JV, Sipahimalani P, Soloway MG, Song X, Sun H, Tabak B, Tam A, Tan D, Tang J, Thiessen N, Triche T, Van Den Berg DJ, Veluvolu U, Waring S, Weisenberger DJ, Wilkerson MD, Wong T, Wu J, Xi L, Xu AW, Yang L, Zack TI, Zhang J, Aksoy BA, Arachchi H, Benz C, Bernard B, Carlin D, Cho J, DiCara D, Frazer S, Fuller GN, Gao J, Gehlenborg N, Haussler D, Heiman DI, Iype L, Jacobsen A, Ju Z, Katzman S, Kim H, Knijnenburg T, Kreisberg RB, Lawrence MS, Lee W, Leinonen K, Lin P, Ling S, Liu W, Liu Y, Liu Y, Lu Y, Mills G, Ng S, Noble MS, Paull E, Rao A, Reynolds S, Saksena G, Sanborn Z, Sander C, Schultz N, Senbabaoglu Y, Shen R, Shmulevich I, Sinha R, Stuart J, Sumer SO, Sun Y, Tasman N, Taylor BS, Voet D, Weinhold N, Weinstein JN, Yang D, Yoshihara K, Zheng S, Zhang W, Zou L, Abel T, Sadeghi S, Cohen ML, Eschbacher J, Hattab EM, Raghunathan A, Schniederjan MJ, Aziz D, Barnett G, Barrett W, Bigner DD, Boice L, Brewer C, Calatozzolo C, Campos B, Carlotti CG, Chan TA, Cuppini L, Curley E, Cuzzubbo S, Devine K, DiMeco F, Duell R, Elder JB, Fehrenbach A, Finocchiaro G, Friedman W, Fulop J, Gardner J, Hermes B, Herold-Mende C, Jungk C, Kendler A, Lehman NL, Lipp E, Liu O, Mandt R, McGraw M, Mclendon R, McPherson C, Neder L, Nguyen P, Noss A, Nunziata R, Ostrom QT, Palmer C, Perin A, Pollo B, Potapov A, Potapova O, Rathmell WK, Rotin D, Scarpace L, Schilero C, Senecal K, Shimmel K, Shurkhay V, Sifri S, Singh R, Sloan AE, Smolenski K, Staugaitis SM, Steele R, Thorne L, Tirapelli DPC, Unterberg A, Vallurupalli M, Wang Y, Warnick R, Williams F, Wolinsky Y, Bell S, Rosenberg M, Stewart C, Huang F, Grimsby JL, Radenbaugh AJ, Zhang J. Comprehensive, Integrative Genomic Analysis of Diffuse Lower-Grade Gliomas. N Engl J Med 2015; 372:2481-98. [PMID: 26061751 PMCID: PMC4530011 DOI: 10.1056/nejmoa1402121] [Citation(s) in RCA: 2142] [Impact Index Per Article: 238.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
BACKGROUND Diffuse low-grade and intermediate-grade gliomas (which together make up the lower-grade gliomas, World Health Organization grades II and III) have highly variable clinical behavior that is not adequately predicted on the basis of histologic class. Some are indolent; others quickly progress to glioblastoma. The uncertainty is compounded by interobserver variability in histologic diagnosis. Mutations in IDH, TP53, and ATRX and codeletion of chromosome arms 1p and 19q (1p/19q codeletion) have been implicated as clinically relevant markers of lower-grade gliomas. METHODS We performed genomewide analyses of 293 lower-grade gliomas from adults, incorporating exome sequence, DNA copy number, DNA methylation, messenger RNA expression, microRNA expression, and targeted protein expression. These data were integrated and tested for correlation with clinical outcomes. RESULTS Unsupervised clustering of mutations and data from RNA, DNA-copy-number, and DNA-methylation platforms uncovered concordant classification of three robust, nonoverlapping, prognostically significant subtypes of lower-grade glioma that were captured more accurately by IDH, 1p/19q, and TP53 status than by histologic class. Patients who had lower-grade gliomas with an IDH mutation and 1p/19q codeletion had the most favorable clinical outcomes. Their gliomas harbored mutations in CIC, FUBP1, NOTCH1, and the TERT promoter. Nearly all lower-grade gliomas with IDH mutations and no 1p/19q codeletion had mutations in TP53 (94%) and ATRX inactivation (86%). The large majority of lower-grade gliomas without an IDH mutation had genomic aberrations and clinical behavior strikingly similar to those found in primary glioblastoma. CONCLUSIONS The integration of genomewide data from multiple platforms delineated three molecular classes of lower-grade gliomas that were more concordant with IDH, 1p/19q, and TP53 status than with histologic class. Lower-grade gliomas with an IDH mutation either had 1p/19q codeletion or carried a TP53 mutation. Most lower-grade gliomas without an IDH mutation were molecularly and clinically similar to glioblastoma. (Funded by the National Institutes of Health.).
Collapse
|
56
|
Gibb EA, Warren RL, Wilson GW, Brown SD, Robertson GA, Morin GB, Holt RA. Activation of an endogenous retrovirus-associated long non-coding RNA in human adenocarcinoma. Genome Med 2015; 7:22. [PMID: 25821520 PMCID: PMC4375928 DOI: 10.1186/s13073-015-0142-6] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2014] [Accepted: 02/12/2015] [Indexed: 11/15/2022] Open
Abstract
Background Long non-coding RNAs (lncRNAs) are emerging as molecules that significantly impact many cellular processes and have been associated with almost every human cancer. Compared to protein-coding genes, lncRNA genes are often associated with transposable elements, particularly with endogenous retroviral elements (ERVs). ERVs can have potentially deleterious effects on genome structure and function, so these elements are typically silenced in normal somatic tissues, albeit with varying efficiency. The aberrant regulation of ERVs associated with lncRNAs (ERV-lncRNAs), coupled with the diverse range of lncRNA functions, creates significant potential for ERV-lncRNAs to impact cancer biology. Methods We used RNA-seq analysis to identify and profile the expression of a novel lncRNA in six large cohorts, including over 7,500 samples from The Cancer Genome Atlas (TCGA). Results We identified the tumor-specific expression of a novel lncRNA that we have named Endogenous retroViral-associated ADenocarcinoma RNA or ‘EVADR’, by analyzing RNA-seq data derived from colorectal tumors and matched normal control tissues. Subsequent analysis of TCGA RNA-seq data revealed the striking association of EVADR with adenocarcinomas, which are tumors of glandular origin. Moderate to high levels of EVADR were detected in 25 to 53% of colon, rectal, lung, pancreas and stomach adenocarcinomas (mean = 30 to 144 FPKM), and EVADR expression correlated with decreased patient survival (Cox regression; hazard ratio = 1.47, 95% confidence interval = 1.06 to 2.04, P = 0.02). In tumor sites of non-glandular origin, EVADR expression was detectable at only very low levels and in less than 10% of patients. For EVADR, a MER48 ERV element provides an active promoter to drive its transcription. Genome-wide, MER48 insertions are associated with nine lncRNAs, but none of the MER48-associated lncRNAs other than EVADR were consistently expressed in adenocarcinomas, demonstrating the specific activation of EVADR. The sequence and structure of the EVADR locus is highly conserved among Old World monkeys and apes but not New World monkeys or prosimians, where the MER48 insertion is absent. Conservation of the EVADR locus suggests a functional role for this novel lncRNA in humans and our closest primate relatives. Conclusions Our results describe the specific activation of a highly conserved ERV-lncRNA in numerous cancers of glandular origin, a finding with diagnostic, prognostic and therapeutic implications. Electronic supplementary material The online version of this article (doi:10.1186/s13073-015-0142-6) contains supplementary material, which is available to authorized users.
Collapse
|
57
|
Watson CT, Steinberg KM, Graves TA, Warren RL, Malig M, Schein J, Wilson RK, Holt RA, Eichler EE, Breden F. Sequencing of the human IG light chain loci from a hydatidiform mole BAC library reveals locus-specific signatures of genetic diversity. Genes Immun 2015; 16:24-34. [PMID: 25338678 PMCID: PMC4304971 DOI: 10.1038/gene.2014.56] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2014] [Revised: 09/03/2014] [Accepted: 09/03/2014] [Indexed: 12/24/2022]
Abstract
Germline variation at immunoglobulin (IG) loci is critical for pathogen-mediated immunity, but establishing complete haplotype sequences in these regions has been problematic because of complex sequence architecture and diploid source DNA. We sequenced BAC clones from the effectively haploid human hydatidiform mole cell line, CHM1htert, across the light chain IG loci, kappa (IGK) and lambda (IGL), creating single haplotype representations of these regions. The IGL haplotype generated here is 1.25 Mb of contiguous sequence, including four novel IGLV alleles, one novel IGLC allele, and an 11.9-kb insertion. The CH17 IGK haplotype consists of two 644 kb proximal and 466 kb distal contigs separated by a large gap of unknown size; these assemblies added 49 kb of unique sequence extending into this gap. Our analysis also resulted in the characterization of seven novel IGKV alleles and a 16.7-kb region exhibiting signatures of interlocus sequence exchange between distal and proximal IGKV gene clusters. Genetic diversity in IGK/IGL was compared with that of the IG heavy chain (IGH) locus within the same haploid genome, revealing threefold (IGK) and sixfold (IGL) higher diversity in the IGH locus, potentially associated with increased levels of segmental duplication and the telomeric location of IGH.
Collapse
|
58
|
McKinnon ML, Rozmus J, Fung SY, Hirschfeld AF, Del Bel KL, Thomas L, Marr N, Martin SD, Marwaha AK, Priatel JJ, Tan R, Senger C, Tsang A, Prendiville J, Junker AK, Seear M, Schultz KR, Sly LM, Holt RA, Patel MS, Friedman JM, Turvey SE. Combined immunodeficiency associated with homozygous MALT1 mutations. J Allergy Clin Immunol 2014; 133:1458-62, 1462.e1-7. [DOI: 10.1016/j.jaci.2013.10.045] [Citation(s) in RCA: 79] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2013] [Revised: 09/20/2013] [Accepted: 10/22/2013] [Indexed: 10/25/2022]
|
59
|
Brown SD, Warren RL, Gibb EA, Martin SD, Spinelli JJ, Nelson BH, Holt RA. Neo-antigens predicted by tumor genome meta-analysis correlate with increased patient survival. Genome Res 2014; 24:743-50. [PMID: 24782321 PMCID: PMC4009604 DOI: 10.1101/gr.165985.113] [Citation(s) in RCA: 460] [Impact Index Per Article: 46.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Somatic missense mutations can initiate tumorogenesis and, conversely, anti-tumor cytotoxic T cell (CTL) responses. Tumor genome analysis has revealed extreme heterogeneity among tumor missense mutation profiles, but their relevance to tumor immunology and patient outcomes has awaited comprehensive evaluation. Here, for 515 patients from six tumor sites, we used RNA-seq data from The Cancer Genome Atlas to identify mutations that are predicted to be immunogenic in that they yielded mutational epitopes presented by the MHC proteins encoded by each patient’s autologous HLA-A alleles. Mutational epitopes were associated with increased patient survival. Moreover, the corresponding tumors had higher CTL content, inferred from CD8A gene expression, and elevated expression of the CTL exhaustion markers PDCD1 and CTLA4. Mutational epitopes were very scarce in tumors without evidence of CTL infiltration. These findings suggest that the abundance of predicted immunogenic mutations may be useful for identifying patients likely to benefit from checkpoint blockade and related immunotherapies.
Collapse
|
60
|
Sharma G, Holt RA. T-cell epitope discovery technologies. Hum Immunol 2014; 75:514-9. [PMID: 24755351 DOI: 10.1016/j.humimm.2014.03.003] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2013] [Revised: 03/18/2014] [Accepted: 03/27/2014] [Indexed: 01/21/2023]
Abstract
Despite tremendous potential utility in clinical medicine and research, the discovery and characterization of T-cell antigens has lagged behind most other areas of health research in joining the high-throughput '-omics' revolution. Partially responsible for this is the complex nature of the interactions between effector T cells and antigen-presenting cells. Further contributing to the challenge is the vastness of both the T-cell repertoire and the large number of potential T-cell epitopes. In this review, we trace the development of various discovery strategies, the technical platforms used to carry them out, and we assess the level of success achieved in the field today.
Collapse
|
61
|
de Leeuw CN, Dyka FM, Boye SL, Laprise S, Zhou M, Chou AY, Borretta L, McInerny SC, Banks KG, Portales-Casamar E, Swanson MI, D’Souza CA, Boye SE, Jones SJM, Holt RA, Goldowitz D, Hauswirth WW, Wasserman WW, Simpson EM. Targeted CNS Delivery Using Human MiniPromoters and Demonstrated Compatibility with Adeno-Associated Viral Vectors. Mol Ther Methods Clin Dev 2014; 1:5. [PMID: 24761428 PMCID: PMC3992516 DOI: 10.1038/mtm.2013.5] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2013] [Accepted: 11/05/2013] [Indexed: 01/21/2023]
Abstract
Critical for human gene therapy is the availability of small promoter tools to drive gene expression in a highly specific and reproducible manner. We tackled this challenge by developing human DNA MiniPromoters using computational biology and phylogenetic conservation. MiniPromoters were tested in mouse as single-copy knock-ins at the Hprt locus on the X Chromosome, and evaluated for lacZ reporter expression in CNS and non-CNS tissue. Eighteen novel MiniPromoters driving expression in mouse brain were identified, two MiniPromoters for driving pan-neuronal expression, and 17 MiniPromoters for the mouse eye. Key areas of therapeutic interest were represented in this set: the cerebral cortex, embryonic hypothalamus, spinal cord, bipolar and ganglion cells of the retina, and skeletal muscle. We also demonstrated that three retinal ganglion cell MiniPromoters exhibit similar cell-type specificity when delivered via adeno-associated virus (AAV) vectors intravitreally. We conclude that our methodology and characterization has resulted in desirable expression characteristics that are intrinsic to the MiniPromoter, not dictated by copy number effects or genomic location, and results in constructs predisposed to success in AAV. These MiniPromoters are immediately applicable for pre-clinical studies towards gene therapy in humans, and are publicly available to facilitate basic and clinical research, and human gene therapy.
Collapse
|
62
|
Woodsworth DJ, Castellarin M, Holt RA. Sequence analysis of T-cell repertoires in health and disease. Genome Med 2013; 5:98. [PMID: 24172704 PMCID: PMC3979016 DOI: 10.1186/gm502] [Citation(s) in RCA: 130] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
T-cell antigen receptor (TCR) variability enables the cellular immune system to discriminate between self and non-self. High-throughput TCR sequencing (TCR-seq) involves the use of next generation sequencing platforms to generate large numbers of short DNA sequences covering key regions of the TCR coding sequence, which enables quantification of T-cell diversity at unprecedented resolution. TCR-seq studies have provided new insights into the healthy human T-cell repertoire, such as revised estimates of repertoire size and the understanding that TCR specificities are shared among individuals more frequently than previously anticipated. In the context of disease, TCR-seq has been instrumental in characterizing the recovery of the immune repertoire after hematopoietic stem cell transplantation, and the method has been used to develop biomarkers and diagnostics for various infectious and neoplastic diseases. However, T-cell repertoire sequencing is still in its infancy. It is expected that maturation of the field will involve the introduction of improved, standardized tools for data handling, deposition and statistical analysis, as well as the emergence of new and equivalently large-scale technologies for T-cell functional analysis and antigen discovery. In this review, we introduce this nascent field and TCR-seq methodology, we discuss recent insights into healthy and diseased TCR repertoires, and we examine the applications and challenges for TCR-seq in the clinic.
Collapse
|
63
|
Schmouth JF, Castellarin M, Laprise S, Banks KG, Bonaguro RJ, McInerny SC, Borretta L, Amirabbasi M, Korecki AJ, Portales-Casamar E, Wilson G, Dreolini L, Jones SJM, Wasserman WW, Goldowitz D, Holt RA, Simpson EM. Non-coding-regulatory regions of human brain genes delineated by bacterial artificial chromosome knock-in mice. BMC Biol 2013; 11:106. [PMID: 24124870 PMCID: PMC4015596 DOI: 10.1186/1741-7007-11-106] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2013] [Accepted: 09/30/2013] [Indexed: 01/18/2023] Open
Abstract
BACKGROUND The next big challenge in human genetics is understanding the 98% of the genome that comprises non-coding DNA. Hidden in this DNA are sequences critical for gene regulation, and new experimental strategies are needed to understand the functional role of gene-regulation sequences in health and disease. In this study, we build upon our HuGX ('high-throughput human genes on the X chromosome') strategy to expand our understanding of human gene regulation in vivo. RESULTS In all, ten human genes known to express in therapeutically important brain regions were chosen for study. For eight of these genes, human bacterial artificial chromosome clones were identified, retrofitted with a reporter, knocked single-copy into the Hprt locus in mouse embryonic stem cells, and mouse strains derived. Five of these human genes expressed in mouse, and all expressed in the adult brain region for which they were chosen. This defined the boundaries of the genomic DNA sufficient for brain expression, and refined our knowledge regarding the complexity of gene regulation. We also characterized for the first time the expression of human MAOA and NR2F2, two genes for which the mouse homologs have been extensively studied in the central nervous system (CNS), and AMOTL1 and NOV, for which roles in CNS have been unclear. CONCLUSIONS We have demonstrated the use of the HuGX strategy to functionally delineate non-coding-regulatory regions of therapeutically important human brain genes. Our results also show that a careful investigation, using publicly available resources and bioinformatics, can lead to accurate predictions of gene expression.
Collapse
|
64
|
Ley TJ, Miller C, Ding L, Raphael BJ, Mungall AJ, Robertson AG, Hoadley K, Triche TJ, Laird PW, Baty JD, Fulton LL, Fulton R, Heath SE, Kalicki-Veizer J, Kandoth C, Klco JM, Koboldt DC, Kanchi KL, Kulkarni S, Lamprecht TL, Larson DE, Lin L, Lu C, McLellan MD, McMichael JF, Payton J, Schmidt H, Spencer DH, Tomasson MH, Wallis JW, Wartman LD, Watson MA, Welch J, Wendl MC, Ally A, Balasundaram M, Birol I, Butterfield Y, Chiu R, Chu A, Chuah E, Chun HJ, Corbett R, Dhalla N, Guin R, He A, Hirst C, Hirst M, Holt RA, Jones S, Karsan A, Lee D, Li HI, Marra MA, Mayo M, Moore RA, Mungall K, Parker J, Pleasance E, Plettner P, Schein J, Stoll D, Swanson L, Tam A, Thiessen N, Varhol R, Wye N, Zhao Y, Gabriel S, Getz G, Sougnez C, Zou L, Leiserson MDM, Vandin F, Wu HT, Applebaum F, Baylin SB, Akbani R, Broom BM, Chen K, Motter TC, Nguyen K, Weinstein JN, Zhang N, Ferguson ML, Adams C, Black A, Bowen J, Gastier-Foster J, Grossman T, Lichtenberg T, Wise L, Davidsen T, Demchok JA, Shaw KRM, Sheth M, Sofia HJ, Yang L, Downing JR, Eley G. Genomic and epigenomic landscapes of adult de novo acute myeloid leukemia. N Engl J Med 2013; 368:2059-74. [PMID: 23634996 PMCID: PMC3767041 DOI: 10.1056/nejmoa1301689] [Citation(s) in RCA: 3616] [Impact Index Per Article: 328.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
BACKGROUND Many mutations that contribute to the pathogenesis of acute myeloid leukemia (AML) are undefined. The relationships between patterns of mutations and epigenetic phenotypes are not yet clear. METHODS We analyzed the genomes of 200 clinically annotated adult cases of de novo AML, using either whole-genome sequencing (50 cases) or whole-exome sequencing (150 cases), along with RNA and microRNA sequencing and DNA-methylation analysis. RESULTS AML genomes have fewer mutations than most other adult cancers, with an average of only 13 mutations found in genes. Of these, an average of 5 are in genes that are recurrently mutated in AML. A total of 23 genes were significantly mutated, and another 237 were mutated in two or more samples. Nearly all samples had at least 1 nonsynonymous mutation in one of nine categories of genes that are almost certainly relevant for pathogenesis, including transcription-factor fusions (18% of cases), the gene encoding nucleophosmin (NPM1) (27%), tumor-suppressor genes (16%), DNA-methylation-related genes (44%), signaling genes (59%), chromatin-modifying genes (30%), myeloid transcription-factor genes (22%), cohesin-complex genes (13%), and spliceosome-complex genes (14%). Patterns of cooperation and mutual exclusivity suggested strong biologic relationships among several of the genes and categories. CONCLUSIONS We identified at least one potential driver mutation in nearly all AML samples and found that a complex interplay of genetic events contributes to AML pathogenesis in individual patients. The databases from this study are widely available to serve as a foundation for further investigations of AML pathogenesis, classification, and risk stratification. (Funded by the National Institutes of Health.).
Collapse
|
65
|
Warren RL, Freeman DJ, Pleasance S, Watson P, Moore RA, Cochrane K, Allen-Vercoe E, Holt RA. Co-occurrence of anaerobic bacteria in colorectal carcinomas. MICROBIOME 2013; 1:16. [PMID: 24450771 PMCID: PMC3971631 DOI: 10.1186/2049-2618-1-16] [Citation(s) in RCA: 227] [Impact Index Per Article: 20.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/12/2013] [Accepted: 04/17/2013] [Indexed: 05/08/2023]
Abstract
BACKGROUND Numerous cancers have been linked to microorganisms. Given that colorectal cancer is a leading cause of cancer deaths and the colon is continuously exposed to a high diversity of microbes, the relationship between gut mucosal microbiome and colorectal cancer needs to be explored. Metagenomic studies have shown an association between Fusobacterium species and colorectal carcinoma. Here, we have extended these studies with deeper sequencing of a much larger number (n = 130) of colorectal carcinoma and matched normal control tissues. We analyzed these data using co-occurrence networks in order to identify microbe-microbe and host-microbe associations specific to tumors. RESULTS We confirmed tumor over-representation of Fusobacterium species and observed significant co-occurrence within individual tumors of Fusobacterium, Leptotrichia and Campylobacter species. This polymicrobial signature was associated with over-expression of numerous host genes, including the gene encoding the pro-inflammatory chemokine Interleukin-8. The tumor-associated bacteria we have identified are all Gram-negative anaerobes, recognized previously as constituents of the oral microbiome, which are capable of causing infection. We isolated a novel strain of Campylobacter showae from a colorectal tumor specimen. This strain is substantially diverged from a previously sequenced oral Campylobacter showae isolate, carries potential virulence genes, and aggregates with a previously isolated tumor strain of Fusobacterium nucleatum. CONCLUSIONS A polymicrobial signature of Gram-negative anaerobic bacteria is associated with colorectal carcinoma tissue.
Collapse
|
66
|
Martin SD, Wick DA, Webb JR, Holt RA, Nelson BH. Abstract B19: Tumor-specific structural rearrangements as potential targets for anticancer vaccines. Cancer Res 2013. [DOI: 10.1158/1538-7445.tumimm2012-b19] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
A hallmark of cancer is the accumulation of mutations that allow cells to proliferate uncontrollably. Every tumor has a unique set of somatic mutations, which can be identified with next generation sequencing. In theory, CD8+ T cells can recognize mutated proteins present in human cancers if the mutations are presented as peptides bound to MHC class I molecules. We hypothesize that personalized, tumor-specific peptide vaccines will activate CD8+ T cells and induce tumor regression. To test this concept, we subjected four mouse mammary tumors to RNA sequencing using the Illumina sequencing platform. We identified 14 somatic point mutations and one fusion transcript unique to these tumors. In addition, we have optimized a vaccination strategy involving long peptides and the adjuvant poly (I:C) that results in massive proliferation of antigen-specific CD8+ T cells. When we used this optimized vaccination protocol to target tumor specific mutations, a strong T cell response was elicited towards the fusion protein, but only weak responses or no responses were elicited towards the point mutations. Currently, we are vaccinating mice with immunogenic mutated-peptides, and assessing whether the T cell response elicited by the vaccines causes regression or rejection of established tumors that harbor the same mutations. As the cost of sequencing the human genome continues to decrease, personalized vaccines that target tumor-specific mutations may become a treatment strategy that is clinically feasible.
Citation Format: Spencer David Martin, Darin A. Wick, John R. Webb, Robert A. Holt, Brad H. Nelson. Tumor-specific structural rearrangements as potential targets for anticancer vaccines. [abstract]. In: Proceedings of the AACR Special Conference on Tumor Immunology: Multidisciplinary Science Driving Basic and Clinical Advances; Dec 2-5, 2012; Miami, FL. Philadelphia (PA): AACR; Cancer Res 2013;73(1 Suppl):Abstract nr B19.
Collapse
|
67
|
Warren RL, Choe G, Freeman DJ, Castellarin M, Munro S, Moore R, Holt RA. Derivation of HLA types from shotgun sequence datasets. Genome Med 2012; 4:95. [PMID: 23228053 PMCID: PMC3580435 DOI: 10.1186/gm396] [Citation(s) in RCA: 134] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2012] [Revised: 10/12/2012] [Accepted: 12/10/2012] [Indexed: 12/19/2022] Open
Abstract
The human leukocyte antigen (HLA) is key to many aspects of human physiology and medicine. All current sequence-based HLA typing methodologies are targeted approaches requiring the amplification of specific HLA gene segments. Whole genome, exome and transcriptome shotgun sequencing can generate prodigious data but due to the complexity of HLA loci these data have not been immediately informative regarding HLA genotype. We describe HLAminer, a computational method for identifying HLA alleles directly from shotgun sequence datasets (http://www.bcgsc.ca/platform/bioinfo/software/hlaminer). This approach circumvents the additional time and cost of generating HLA-specific data and capitalizes on the increasing accessibility and affordability of massively parallel sequencing.
Collapse
|
68
|
Castellarin M, Milne K, Zeng T, Tse K, Mayo M, Zhao Y, Webb JR, Watson PH, Nelson BH, Holt RA. Clonal evolution of high-grade serous ovarian carcinoma from primary to recurrent disease. J Pathol 2012; 229:515-24. [PMID: 22996961 DOI: 10.1002/path.4105] [Citation(s) in RCA: 78] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2012] [Revised: 08/17/2012] [Accepted: 09/11/2012] [Indexed: 01/04/2023]
Abstract
High-grade serous carcinoma (HGSC) is the most common and fatal form of ovarian cancer. While most tumours are highly sensitive to cytoreductive surgery and platinum- and taxane-based chemotherapy, the majority of patients experience recurrence of treatment-resistant tumours. The clonal origin and mutational adaptations associated with recurrent disease are poorly understood. We performed whole exome sequencing on tumour cells harvested from ascites at three time points (primary, first recurrence, and second recurrence) for three HGSC patients receiving standard treatment. Somatic point mutations and small insertions and deletions were identified by comparison to constitutional DNA. The clonal structure and evolution of tumours were inferred from patterns of mutant allele frequencies. TP53 mutations were predominant in all patients at all time points, consistent with the known founder role of this gene. Tumours from all three patients also harboured mutations associated with cell cycle checkpoint function and Golgi vesicle trafficking. There was convergence of germline and somatic variants within the DNA repair, ECM, cell cycle control, and Golgi vesicle pathways. The vast majority of somatic variants found in recurrent tumours were present in primary tumours. Our findings highlight both known and novel pathways that are commonly mutated in HGSC. Moreover, they provide the first evidence at single nucleotide resolution that recurrent HGSC arises from multiple clones present in the primary tumour with negligible accumulation of new mutations during standard treatment.
Collapse
|
69
|
Castellarin M, Warren RL, Freeman JD, Dreolini L, Krzywinski M, Strauss J, Barnes R, Watson P, Allen-Vercoe E, Moore RA, Holt RA. Fusobacterium nucleatum infection is prevalent in human colorectal carcinoma. Genome Res 2011; 22:299-306. [PMID: 22009989 DOI: 10.1101/gr.126516.111] [Citation(s) in RCA: 1343] [Impact Index Per Article: 103.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
An estimated 15% or more of the cancer burden worldwide is attributable to known infectious agents. We screened colorectal carcinoma and matched normal tissue specimens using RNA-seq followed by host sequence subtraction and found marked over-representation of Fusobacterium nucleatum sequences in tumors relative to control specimens. F. nucleatum is an invasive anaerobe that has been linked previously to periodontitis and appendicitis, but not to cancer. Fusobacteria are rare constituents of the fecal microbiota, but have been cultured previously from biopsies of inflamed gut mucosa. We obtained a Fusobacterium isolate from a frozen tumor specimen; this showed highest sequence similarity to a known gut mucosa isolate and was confirmed to be invasive. We verified overabundance of Fusobacterium sequences in tumor versus matched normal control tissue by quantitative PCR analysis from a total of 99 subjects (p = 2.5 × 10(-6)), and we observed a positive association with lymph node metastasis.
Collapse
|
70
|
Abstract
As next-generation sequence (NGS) production continues to increase, analysis is becoming a significant bottleneck. However, in situations where information is required only for specific sequence variants, it is not necessary to assemble or align whole genome data sets in their entirety. Rather, NGS data sets can be mined for the presence of sequence variants of interest by localized assembly, which is a faster, easier, and more accurate approach. We present TASR, a streamlined assembler that interrogates very large NGS data sets for the presence of specific variants by only considering reads within the sequence space of input target sequences provided by the user. The NGS data set is searched for reads with an exact match to all possible short words within the target sequence, and these reads are then assembled stringently to generate a consensus of the target and flanking sequence. Typically, variants of a particular locus are provided as different target sequences, and the presence of the variant in the data set being interrogated is revealed by a successful assembly outcome. However, TASR can also be used to find unknown sequences that flank a given target. We demonstrate that TASR has utility in finding or confirming genomic mutations, polymorphisms, fusions and integration events. Targeted assembly is a powerful method for interrogating large data sets for the presence of sequence variants of interest. TASR is a fast, flexible and easy to use tool for targeted assembly.
Collapse
|
71
|
Waltham TN, Girvan HM, Butler CF, Rigby SR, Dunford AJ, Holt RA, Munro AW. Analysis of the oxidation of short chain alkynes by flavocytochrome P450 BM3. Metallomics 2011; 3:369-78. [PMID: 21431175 DOI: 10.1039/c1mt00004g] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Bacillus megaterium flavocytochrome P450 BM3 (BM3) is a high activity fatty acid hydroxylase, formed by the fusion of soluble cytochrome P450 and cytochrome P450 reductase modules. Short chain (C6, C8) alkynes were shown to be substrates for BM3, with productive outcomes (i.e. alkyne hydroxylation) dependent on position of the carbon-carbon triple bond in the molecule. Wild-type P450 BM3 catalyses ω-3 hydroxylation of both 1-hexyne and 1-octyne, but is suicidally inactivated in NADPH-dependent turnover with non-terminal alkynes. A F87G mutant of P450 BM3 also undergoes turnover-dependent heme destruction with the terminal alkynes, pointing to a key role for Phe87 in controlling regioselectivity of alkyne oxidation. The terminal alkynes access the BM3 heme active site led by the acetylene functional group, since hydroxylated products are not observed near the opposite end of the molecules. For both 1-hexyne and 1-octyne, the predominant enantiomeric product formed (up to ∼90%) is the (S)-(-)-1-alkyn-3-ol form. Wild-type P450 BM3 is shown to be an effective oxidase catalyst of terminal alkynes, with strict regioselectivity of oxidation and potential biotechnological applications. The absence of measurable octanoic or hexanoic acid products from oxidation of the relevant 1-alkynes is also consistent with previous studies suggesting that removal of the phenyl group in the F87G mutant does not lead to significant levels of ω-oxidation of alkyl chain substrates.
Collapse
|
72
|
Warren RL, Freeman JD, Zeng T, Choe G, Munro S, Moore R, Webb JR, Holt RA. Exhaustive T-cell repertoire sequencing of human peripheral blood samples reveals signatures of antigen selection and a directly measured repertoire size of at least 1 million clonotypes. Genome Res 2011; 21:790-7. [PMID: 21349924 DOI: 10.1101/gr.115428.110] [Citation(s) in RCA: 252] [Impact Index Per Article: 19.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Massively parallel sequencing is a useful approach for characterizing T-cell receptor diversity. However, immune receptors are extraordinarily difficult sequencing targets because any given receptor variant may be present in very low abundance and may differ legitimately by only a single nucleotide. We show that the sensitivity of sequence-based repertoire profiling is limited by both sequencing depth and sequencing accuracy. At two timepoints, 1 wk apart, we isolated bulk PBMC plus naïve (CD45RA+/CD45RO-) and memory (CD45RA-/CD45RO+) T-cell subsets from a healthy donor. From T-cell receptor beta chain (TCRB) mRNA we constructed and sequenced multiple libraries to obtain a total of 1.7 billion paired sequence reads. The sequencing error rate was determined empirically and used to inform a high stringency data filtering procedure. The error filtered data yielded 1,061,522 distinct TCRB nucleotide sequences from this subject which establishes a new, directly measured, lower limit on individual T-cell repertoire size and provides a useful reference set of sequences for repertoire analysis. TCRB nucleotide sequences obtained from two additional donors were compared to those from the first donor and revealed limited sharing (up to 1.1%) of nucleotide sequences among donors, but substantially higher sharing (up to 14.2%) of inferred amino acid sequences. For each donor, shared amino acid sequences were encoded by a much larger diversity of nucleotide sequences than were unshared amino acid sequences. We also observed a highly statistically significant association between numbers of shared sequences and shared HLA class I alleles.
Collapse
|
73
|
Hesse-Orce U, DiGuistini S, Keeling CI, Wang Y, Li M, Henderson H, Docking TR, Liao NY, Robertson G, Holt RA, Jones SJM, Bohlmann J, Breuil C. Gene discovery for the bark beetle-vectored fungal tree pathogen Grosmannia clavigera. BMC Genomics 2010; 11:536. [PMID: 20920358 PMCID: PMC3091685 DOI: 10.1186/1471-2164-11-536] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2010] [Accepted: 10/04/2010] [Indexed: 11/16/2022] Open
Abstract
Background Grosmannia clavigera is a bark beetle-vectored fungal pathogen of pines that causes wood discoloration and may kill trees by disrupting nutrient and water transport. Trees respond to attacks from beetles and associated fungi by releasing terpenoid and phenolic defense compounds. It is unclear which genes are important for G. clavigera's ability to overcome antifungal pine terpenoids and phenolics. Results We constructed seven cDNA libraries from eight G. clavigera isolates grown under various culture conditions, and Sanger sequenced the 5' and 3' ends of 25,000 cDNA clones, resulting in 44,288 high quality ESTs. The assembled dataset of unique transcripts (unigenes) consists of 6,265 contigs and 2,459 singletons that mapped to 6,467 locations on the G. clavigera reference genome, representing ~70% of the predicted G. clavigera genes. Although only 54% of the unigenes matched characterized proteins at the NCBI database, this dataset extensively covers major metabolic pathways, cellular processes, and genes necessary for response to environmental stimuli and genetic information processing. Furthermore, we identified genes expressed in spores prior to germination, and genes involved in response to treatment with lodgepole pine phloem extract (LPPE). Conclusions We provide a comprehensively annotated EST dataset for G. clavigera that represents a rich resource for gene characterization in this and other ophiostomatoid fungi. Genes expressed in response to LPPE treatment are indicative of fungal oxidative stress response. We identified two clusters of potentially functionally related genes responsive to LPPE treatment. Furthermore, we report a simple method for identifying contig misassemblies in de novo assembled EST collections caused by gene overlap on the genome.
Collapse
|
74
|
Jones SJM, Laskin J, Li YY, Griffith OL, An J, Bilenky M, Butterfield YS, Cezard T, Chuah E, Corbett R, Fejes AP, Griffith M, Yee J, Martin M, Mayo M, Melnyk N, Morin RD, Pugh TJ, Severson T, Shah SP, Sutcliffe M, Tam A, Terry J, Thiessen N, Thomson T, Varhol R, Zeng T, Zhao Y, Moore RA, Huntsman DG, Birol I, Hirst M, Holt RA, Marra MA. Evolution of an adenocarcinoma in response to selection by targeted kinase inhibitors. Genome Biol 2010; 11:R82. [PMID: 20696054 PMCID: PMC2945784 DOI: 10.1186/gb-2010-11-8-r82] [Citation(s) in RCA: 132] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2010] [Revised: 07/08/2010] [Accepted: 08/09/2010] [Indexed: 01/26/2023] Open
Abstract
BACKGROUND Adenocarcinomas of the tongue are rare and represent the minority (20 to 25%) of salivary gland tumors affecting the tongue. We investigated the utility of massively parallel sequencing to characterize an adenocarcinoma of the tongue, before and after treatment. RESULTS In the pre-treatment tumor we identified 7,629 genes within regions of copy number gain. There were 1,078 genes that exhibited increased expression relative to the blood and unrelated tumors and four genes contained somatic protein-coding mutations. Our analysis suggested the tumor cells were driven by the RET oncogene. Genes whose protein products are targeted by the RET inhibitors sunitinib and sorafenib correlated with being amplified and or highly expressed. Consistent with our observations, administration of sunitinib was associated with stable disease lasting 4 months, after which the lung lesions began to grow. Administration of sorafenib and sulindac provided disease stabilization for an additional 3 months after which the cancer progressed and new lesions appeared. A recurring metastasis possessed 7,288 genes within copy number amplicons, 385 genes exhibiting increased expression relative to other tumors and 9 new somatic protein coding mutations. The observed mutations and amplifications were consistent with therapeutic resistance arising through activation of the MAPK and AKT pathways. CONCLUSIONS We conclude that complete genomic characterization of a rare tumor has the potential to aid in clinical decision making and identifying therapeutic approaches where no established treatment protocols exist. These results also provide direct in vivo genomic evidence for mutational evolution within a tumor under drug selection and potential mechanisms of drug resistance accrual.
Collapse
|
75
|
Jones SJM, Laskin J, Li YY, Griffith OL, An J, Bilenky M, Butterfield YS, Cezard T, Chuah E, Corbett R, Fejes A, Griffith M, Yee J, Martin M, Mayo M, Melnyk N, Morin RD, Pugh TJ, Severson T, Shah SP, Sutcliffe M, Tam A, Terry J, Thiessen N, Thomson T, Varhol R, Zeng T, Zhao Y, Moore RA, Huntsman DG, Birol I, Hirst M, Holt RA, Marra MA. Genomic analysis of a rare human tumor. BMC Bioinformatics 2010. [PMCID: PMC3290057 DOI: 10.1186/1471-2105-11-s4-o3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/05/2022] Open
|