1
|
Sergi A, Beltrame L, Marchini S, Masseroli M. Integrated approach to generate artificial samples with low tumor fraction for somatic variant calling benchmarking. BMC Bioinformatics 2024; 25:180. [PMID: 38720249 PMCID: PMC11077792 DOI: 10.1186/s12859-024-05793-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Accepted: 04/19/2024] [Indexed: 05/12/2024] Open
Abstract
BACKGROUND High-throughput sequencing (HTS) has become the gold standard approach for variant analysis in cancer research. However, somatic variants may occur at low fractions due to contamination from normal cells or tumor heterogeneity; this poses a significant challenge for standard HTS analysis pipelines. The problem is exacerbated in scenarios with minimal tumor DNA, such as circulating tumor DNA in plasma. Assessing sensitivity and detection of HTS approaches in such cases is paramount, but time-consuming and expensive: specialized experimental protocols and a sufficient quantity of samples are required for processing and analysis. To overcome these limitations, we propose a new computational approach specifically designed for the generation of artificial datasets suitable for this task, simulating ultra-deep targeted sequencing data with low-fraction variants and demonstrating their effectiveness in benchmarking low-fraction variant calling. RESULTS Our approach enables the generation of artificial raw reads that mimic real data without relying on pre-existing data by using NEAT, a fine-grained read simulator that generates artificial datasets using models learned from multiple different datasets. Then, it incorporates low-fraction variants to simulate somatic mutations in samples with minimal tumor DNA content. To prove the suitability of the created artificial datasets for low-fraction variant calling benchmarking, we used them as ground truth to evaluate the performance of widely-used variant calling algorithms: they allowed us to define tuned parameter values of major variant callers, considerably improving their detection of very low-fraction variants. CONCLUSIONS Our findings highlight both the pivotal role of our approach in creating adequate artificial datasets with low tumor fraction, facilitating rapid prototyping and benchmarking of algorithms for such dataset type, as well as the important need of advancing low-fraction variant calling techniques.
Collapse
Affiliation(s)
- Aldo Sergi
- Dipartimento di Elettronica, Informazione e Bioingegneria, Politecnico di Milano, Via Ponzio 34/5, 20133, Milan, Italy.
- IRCCS Humanitas Research Hospital, Via Manzoni 56, 20089, Milan, Rozzano, Italy.
| | - Luca Beltrame
- IRCCS Humanitas Research Hospital, Via Manzoni 56, 20089, Milan, Rozzano, Italy
| | - Sergio Marchini
- IRCCS Humanitas Research Hospital, Via Manzoni 56, 20089, Milan, Rozzano, Italy
| | - Marco Masseroli
- Dipartimento di Elettronica, Informazione e Bioingegneria, Politecnico di Milano, Via Ponzio 34/5, 20133, Milan, Italy
| |
Collapse
|
2
|
Craven KE, Fischer CG, Jiang L, Pallavajjala A, Lin MT, Eshleman JR. Optimizing Insertion and Deletion Detection Using Next-Generation Sequencing in the Clinical Laboratory. J Mol Diagn 2022; 24:1217-1231. [PMID: 36162758 PMCID: PMC9808503 DOI: 10.1016/j.jmoldx.2022.08.006] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2022] [Revised: 07/18/2022] [Accepted: 08/31/2022] [Indexed: 01/13/2023] Open
Abstract
Detection of insertions and deletions (InDels) by short-read next-generation sequencing (NGS) technology can be challenging because of frequent misaligned reads. A systematic analysis of short InDels (1 to 30 bases) and fms-related receptor tyrosine kinase 3 (FLT3) internal tandem duplications (ITDs; 6 to 183 bases) from 46 clinical cases of solid or hematologic malignancy processed with a clinical NGS assay identified misaligned reads in every case, ranging from 3% to 100% of reads with the InDel showing mismapped bases. Mismaps also increased with InDel size. As a consequence, the clinical NGS bioinformatics pipeline undercalled the variant allele frequency by 1% to 84%, incorrectly called simultaneous single-base substitutions along with InDels, or did not report an FLT3 ITD that had been detected by capillary electrophoresis. To improve the ability of the pipeline to better detect and quantify InDels, we utilized a software program called Assembly-Based ReAligner (ABRA2) to more accurately remap reads. ABRA2 was able to correct 41% to 100% of the reads with mismapped bases and led to absolute increases in the variant allele frequency from 1% to 61% along with correction of all of the single-base substitutions except for two cases. ABRA2 could also detect multiple FLT3 ITD clones except for one 183-base ITD. Our analysis has found that ABRA2 performs well on short InDels as well as FLT3 ITDs that are <100 bases.
Collapse
Affiliation(s)
- Kelly E Craven
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, Maryland
| | - Catherine G Fischer
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, Maryland; Division of Cancer Prevention, National Cancer Institute, Rockville, Maryland
| | - LiQun Jiang
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, Maryland
| | - Aparna Pallavajjala
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, Maryland
| | - Ming-Tseh Lin
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, Maryland
| | - James R Eshleman
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, Maryland; Department of Oncology, Johns Hopkins University School of Medicine, Baltimore, Maryland; The Sol Goldman Pancreatic Cancer Research Center, Johns Hopkins University School of Medicine, Baltimore, Maryland.
| |
Collapse
|
3
|
Lipoprotein(a) beyond the kringle IV repeat polymorphism: The complexity of genetic variation in the LPA gene. Atherosclerosis 2022; 349:17-35. [PMID: 35606073 PMCID: PMC7613587 DOI: 10.1016/j.atherosclerosis.2022.04.003] [Citation(s) in RCA: 59] [Impact Index Per Article: 29.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/04/2021] [Revised: 02/23/2022] [Accepted: 04/01/2022] [Indexed: 12/24/2022]
Abstract
High lipoprotein(a) [Lp(a)] concentrations are one of the most important genetically determined risk factors for cardiovascular disease. Lp(a) concentrations are an enigmatic trait largely controlled by one single gene (LPA) that contains a complex interplay of several genetic elements with many surprising effects discussed in this review. A hypervariable coding copy number variation (the kringle IV type-2 repeat, KIV-2) generates >40 apolipoprotein(a) protein isoforms and determines the median Lp(a) concentrations. Carriers of small isoforms with up to 22 kringle IV domains have median Lp(a) concentrations up to 5 times higher than those with large isoforms (>22 kringle IV domains). The effect of the apo(a) isoforms are, however, modified by many functional single nucleotide polymorphisms (SNPs) distributed over the complete range of allele frequencies (<0.1% to >20%) with very pronounced effects on Lp(a) concentrations. A complex interaction is present between the apo (a) isoforms and LPA SNPs, with isoforms partially masking the effect of functional SNPs and, vice versa, SNPs lowering the Lp(a) concentrations of affected isoforms. This picture is further complicated by SNP-SNP interactions, a poorly understood role of other polymorphisms such as short tandem repeats and linkage structures that are poorly captured by common R2 values. A further layer of complexity derives from recent findings that several functional SNPs are located in the KIV-2 repeat and are thus not accessible to conventional sequencing and genotyping technologies. A critical impact of the ancestry on correlation structures and baseline Lp(a) values becomes increasingly evident. This review provides a comprehensive overview on the complex genetic architecture of the Lp(a) concentrations in plasma, a field that has made tremendous progress with the introduction of new technologies. Understanding the genetics of Lp(a) might be a key to many mysteries of Lp(a) and booster new ideas on the metabolism of Lp(a) and possible interventional targets.
Collapse
|
4
|
Histologic and Genotypic Characterization of Lung Cancer in the Inuit Population of the Eastern Canadian Arctic. Curr Oncol 2022; 29:3171-3186. [PMID: 35621648 PMCID: PMC9139845 DOI: 10.3390/curroncol29050258] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Revised: 04/19/2022] [Accepted: 04/20/2022] [Indexed: 11/18/2022] Open
Abstract
Inuit are the Indigenous Arctic peoples and residents of the Canadian territory of Nunavut who have the highest global rate of lung cancer. Given lung cancer’s mortality, histological and genomic characterization was undertaken to better understand the disease biology. We retrospectively studied all Inuit cases from Nunavut’s Qikiqtani (Baffin) region, referred to the Ottawa Hospital Cancer Center between 2001 and 2011. Demographics were compiled from medical records and tumor samples underwent pathologic/histologic confirmation. Tumors were analyzed by next generation sequencing (NGS) with a cancer hotspot mutation panel. Of 98 patients, the median age was 66 years and 61% were male. Tobacco use was reported in 87%, and 69% had a history of lung disease (tuberculosis or other). Histological types were: non-small cell lung carcinoma (NSCLC), 81%; small cell lung carcinoma, 16%. Squamous cell carcinoma (SCC) represented 65% of NSCLC. NGS on 55 samples demonstrated mutation rates similar to public lung cancer datasets. In SCC, the STK11 F354L mutation was observed at higher frequency than previously reported. This is the first study to characterize the histologic/genomic profiles of lung cancer in this population. A high incidence of SCC, and an elevated rate of STK11 mutations distinguishes this group from the North American population.
Collapse
|
5
|
Grillo E, Ravelli C, Corsini M, Zammataro L, Mitola S. Protein domain-based approaches for the identification and prioritization of therapeutically actionable cancer variants. Biochim Biophys Acta Rev Cancer 2021; 1876:188614. [PMID: 34403770 DOI: 10.1016/j.bbcan.2021.188614] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2021] [Revised: 08/11/2021] [Accepted: 08/11/2021] [Indexed: 01/04/2023]
Abstract
The tremendous number of cancer variants that can be detected by NGS analyses has required the development of computational approaches to prioritize mutations on the basis of their biological and clinical significance. Standard strategies take a gene-centric approach to the problem, allowing exclusively the identification of highly frequent variants. On the contrary, protein domain (PD)-based approaches allow to identify functionally relevant low frequency variants by searching for mutations that recur on analogous residues across homologous proteins (i.e. containing the same PD). Such approaches enable to transfer information about the effects and druggability from one known mutation to unknown ones. Here we describe how PD-based strategies work, and discuss how they could be exploited for mutation prioritization. The principle that mutations clustered on specific residues of PDs have the same functional consequences and are therapeutically actionable in a similar manner could help the choice of patient-specific targeted drugs, eventually improving the management of cancer patients.
Collapse
Affiliation(s)
- Elisabetta Grillo
- Department of Molecular and Translational Medicine, University of Brescia, Brescia, Italy.
| | - Cosetta Ravelli
- Department of Molecular and Translational Medicine, University of Brescia, Brescia, Italy
| | - Michela Corsini
- Department of Molecular and Translational Medicine, University of Brescia, Brescia, Italy
| | - Luca Zammataro
- Division of Artificial Intelligence Systems for Immunoinformatics, Kiromic BioPharma, Inc., Houston, USA
| | - Stefania Mitola
- Department of Molecular and Translational Medicine, University of Brescia, Brescia, Italy.
| |
Collapse
|
6
|
Saiki R, Momozawa Y, Nannya Y, Nakagawa MM, Ochi Y, Yoshizato T, Terao C, Kuroda Y, Shiraishi Y, Chiba K, Tanaka H, Niida A, Imoto S, Matsuda K, Morisaki T, Murakami Y, Kamatani Y, Matsuda S, Kubo M, Miyano S, Makishima H, Ogawa S. Combined landscape of single-nucleotide variants and copy number alterations in clonal hematopoiesis. Nat Med 2021; 27:1239-1249. [PMID: 34239136 DOI: 10.1038/s41591-021-01411-9] [Citation(s) in RCA: 61] [Impact Index Per Article: 20.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2020] [Accepted: 05/26/2021] [Indexed: 02/04/2023]
Abstract
Clonal hematopoiesis (CH) in apparently healthy individuals is implicated in the development of hematological malignancies (HM) and cardiovascular diseases. Previous studies of CH analyzed either single-nucleotide variants and indels (SNVs/indels) or copy number alterations (CNAs), but not both. Here, using a combination of targeted sequencing of 23 CH-related genes and array-based CNA detection of blood-derived DNA, we have delineated the landscape of CH-related SNVs/indels and CNAs in 11,234 individuals without HM from the BioBank Japan cohort, including 672 individuals with subsequent HM development, and studied the effects of these somatic alterations on mortality from HM and cardiovascular disease, as well as on hematological and cardiovascular phenotypes. The total number of both types of CH-related lesions and their clone size positively correlated with blood count abnormalities and mortality from HM. CH-related SNVs/indels and CNAs exhibited statistically significant co-occurrence in the same individuals. In particular, co-occurrence of SNVs/indels and CNAs affecting DNMT3A, TET2, JAK2 and TP53 resulted in biallelic alterations of these genes and was associated with higher HM mortality. Co-occurrence of SNVs/indels and CNAs also modulated risks for cardiovascular mortality. These findings highlight the importance of detecting both SNVs/indels and CNAs in the evaluation of CH.
Collapse
Affiliation(s)
- Ryunosuke Saiki
- Department of Pathology and Tumor Biology, Graduate School of Medicine, Kyoto University, Kyoto, Japan
| | - Yukihide Momozawa
- Laboratory for Genotyping Development, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Yasuhito Nannya
- Department of Pathology and Tumor Biology, Graduate School of Medicine, Kyoto University, Kyoto, Japan
| | - Masahiro M Nakagawa
- Department of Pathology and Tumor Biology, Graduate School of Medicine, Kyoto University, Kyoto, Japan.,Institute for the Advanced Study of Human Biology (WPI-ASHBi), Kyoto University, Kyoto, Japan
| | - Yotaro Ochi
- Department of Pathology and Tumor Biology, Graduate School of Medicine, Kyoto University, Kyoto, Japan
| | - Tetsuichi Yoshizato
- Department of Pathology and Tumor Biology, Graduate School of Medicine, Kyoto University, Kyoto, Japan
| | - Chikashi Terao
- Laboratory for Statistical and Translational Genetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Yutaka Kuroda
- Department of Orthopaedic Surgery, Graduate School of Medicine, Kyoto University, Kyoto, Japan
| | - Yuichi Shiraishi
- Division of Cellular Signaling, National Cancer Center Research Institute, Tokyo, Japan
| | - Kenichi Chiba
- Division of Cellular Signaling, National Cancer Center Research Institute, Tokyo, Japan
| | - Hiroko Tanaka
- Department of Integrated Data Science, M&D Data Science Center, Tokyo Medical and Dental University, Tokyo, Japan
| | - Atsushi Niida
- Laboratory of Molecular Medicine, Human Genome Center, Institute of Medical Science, The University of Tokyo, Tokyo, Japan
| | - Seiya Imoto
- Division of Health Medical Intelligence, Human Genome Center, Institute of Medical Science, The University of Tokyo, Tokyo, Japan
| | - Koichi Matsuda
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Tokyo, Japan
| | - Takayuki Morisaki
- Division of Molecular Pathology, Institute of Medical Science, The University of Tokyo, Tokyo, Japan
| | - Yoshinori Murakami
- Division of Molecular Pathology, Institute of Medical Science, The University of Tokyo, Tokyo, Japan
| | - Yoichiro Kamatani
- Laboratory for Statistical and Translational Genetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan.,Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Tokyo, Japan
| | - Shuichi Matsuda
- Department of Orthopaedic Surgery, Graduate School of Medicine, Kyoto University, Kyoto, Japan
| | - Michiaki Kubo
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Satoru Miyano
- Department of Integrated Data Science, M&D Data Science Center, Tokyo Medical and Dental University, Tokyo, Japan
| | - Hideki Makishima
- Department of Pathology and Tumor Biology, Graduate School of Medicine, Kyoto University, Kyoto, Japan
| | - Seishi Ogawa
- Department of Pathology and Tumor Biology, Graduate School of Medicine, Kyoto University, Kyoto, Japan. .,Institute for the Advanced Study of Human Biology (WPI-ASHBi), Kyoto University, Kyoto, Japan. .,Department of Medicine, Centre for Haematology and Regenerative Medicine, Karolinska Institute, Stockholm, Sweden.
| |
Collapse
|
7
|
Lee D, Lee JH, Bang D. Accurate Detection of Rare Mutant Alleles by Target Base-Specific Cleavage with the CRISPR/Cas9 System. ACS Synth Biol 2021; 10:1451-1464. [PMID: 34009946 DOI: 10.1021/acssynbio.1c00056] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
The detection of low-frequency somatic mutations enables early diagnosis of disease; however, base-substitution errors that arise during genomic library preparation and high-throughput sequencing can lead to false diagnostic information. To discriminate true genomic alterations from technical errors, we developed spCas9-assisted true variant labeling sequencing (CARVE-seq), which detects low-frequency mutant alleles with high accuracy. CARVE-seq utilizes single-base discrimination during spCas9 cleavage reactions to exclude technical errors. Ten single nucleotide variants that recurrently occur in tumors were assayed by CARVE-seq using 20 ng reference samples, and 100% positive predictive value and specificity was observed, which proved the highly accurate performance of CARVE-seq.
Collapse
Affiliation(s)
- Dongin Lee
- Department of Chemistry, Yonsei University, 50, Yonsei-ro, Seodaemun-gu, Seoul, 03722, Republic of Korea
| | - Ji Hyun Lee
- Department of Clinical Pharmacology and Therapeutics, College of Medicine, Kyung Hee University, 26, Kyungheedae-ro, Dongdaemun-gu, Seoul, 02447, Republic of Korea
- Department of Biomedical Science and Technology, Kyung Hee Medical Science Research Institute, Kyung Hee University, Seoul, 02447, Republic of Korea
| | - Duhee Bang
- Department of Chemistry, Yonsei University, 50, Yonsei-ro, Seodaemun-gu, Seoul, 03722, Republic of Korea
| |
Collapse
|
8
|
Shahid S, Shakeel M, Siddiqui S, Ahmed S, Sohail M, Khan IA, Abid A, Shamsi T. Novel Genetic Variations in Acute Myeloid Leukemia in Pakistani Population. Front Genet 2020; 11:560. [PMID: 32655615 PMCID: PMC7324646 DOI: 10.3389/fgene.2020.00560] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2019] [Accepted: 05/07/2020] [Indexed: 12/22/2022] Open
Abstract
Acute myeloid leukemia (AML) is a hematological malignancy characterized by clonal expansion of blast cells that exhibit great genetic heterogeneity. In this study, we describe the mutational landscape and its clinico-pathological significance in 26 myeloid neoplasm patients from a South Asian population (Pakistan) by using ultra-deep targeted next-generation DNA sequencing of 54 genes (∼5000×) and its subsequent bioinformatics analysis. The data analysis indicated novel non-silent somatic mutational events previously not reported in AML, including nine non-synonymous and one stop-gain mutations. Notably, two recurrent somatic non-synonymous mutations, i.e., STAG2 (causing p.L526F) and BCORL1 (p.A400V), were observed in three unrelated cases each. The BCOR was found to have three independent non-synonymous somatic mutations in three cases. Further, the SRSF2 with a protein truncating somatic mutation (p.Q88X) was observed for the first time in AML in this study. The prioritization of germline mutations with ClinVar, SIFT, Polyphen2, and Combined Annotation Dependent Depletion (CADD) highlighted 18 predicted deleterious/pathogenic mutations, including two recurrent deleterious mutations, i.e., a novel heterozygous non-synonymous SNV in GATA2 (p.T358P) and a frameshift insertion in NPM1 (p.L258fs), found in two unrelated cases each. The WT1 was observed with three independent potential detrimental germline mutations in three different cases. Collectively, non-silent somatic and/or germline mutations were observed in 23 (88.46%) of the cases (0.92 mutation per case). Furthermore, the pharmGKB database exploration showed a missense SNV rs1042522 in TP53, exhibiting decreased response to anti-cancer drugs, in 19 (73%) of the cases. This genomic profiling of AML provides deep insight into the disease pathophysiology. Identification of pharmacogenomics markers will help to adopt personalized approach for the management of AML patients in Pakistan.
Collapse
Affiliation(s)
- Saba Shahid
- Department of Genomics, National Institute of Blood Diseases and Bone Marrow Transplantation, Karachi, Pakistan
| | - Muhammad Shakeel
- Jamil-ur-Rahman Center for Genome Research, Dr. Panjwani Center for Molecular Medicine and Drug Research, International Center for Chemical and Biological Sciences, University of Karachi, Karachi, Pakistan
| | - Saima Siddiqui
- Department of Hematology, National Institute of Blood Diseases and Bone Marrow Transplantation Karachi, Karachi, Pakistan
| | - Shariq Ahmed
- Department of Genomics, National Institute of Blood Diseases and Bone Marrow Transplantation, Karachi, Pakistan
| | - Misha Sohail
- Department of Genomics, National Institute of Blood Diseases and Bone Marrow Transplantation, Karachi, Pakistan
| | - Ishtiaq Ahmad Khan
- Jamil-ur-Rahman Center for Genome Research, Dr. Panjwani Center for Molecular Medicine and Drug Research, International Center for Chemical and Biological Sciences, University of Karachi, Karachi, Pakistan
| | - Aiysha Abid
- Centre for Human Genetics and Molecular Medicine, Sindh Institute of Urology and Transplantation (SIUT), Karachi, Pakistan
| | - Tahir Shamsi
- Department of Hematology, National Institute of Blood Diseases and Bone Marrow Transplantation Karachi, Karachi, Pakistan
| |
Collapse
|
9
|
Jiang Y, Wang R, Urrutia E, Anastopoulos IN, Nathanson KL, Zhang NR. CODEX2: full-spectrum copy number variation detection by high-throughput DNA sequencing. Genome Biol 2018; 19:202. [PMID: 30477554 PMCID: PMC6260772 DOI: 10.1186/s13059-018-1578-y] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2017] [Accepted: 11/02/2018] [Indexed: 12/04/2022] Open
Abstract
High-throughput DNA sequencing enables detection of copy number variations (CNVs) on the genome-wide scale with finer resolution compared to array-based methods but suffers from biases and artifacts that lead to false discoveries and low sensitivity. We describe CODEX2, as a statistical framework for full-spectrum CNV profiling that is sensitive for variants with both common and rare population frequencies and that is applicable to study designs with and without negative control samples. We demonstrate and evaluate CODEX2 on whole-exome and targeted sequencing data, where biases are the most prominent. CODEX2 outperforms existing methods and, in particular, significantly improves sensitivity for common CNVs.
Collapse
Affiliation(s)
- Yuchao Jiang
- Department of Biostatistics, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, NC, 27599, USA.
- Department of Genetics, School of Medicine, University of North Carolina, Chapel Hill, NC, 27599, USA.
- Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, NC, 27599, USA.
| | - Rujin Wang
- Department of Biostatistics, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, NC, 27599, USA
| | - Eugene Urrutia
- Department of Biostatistics, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, NC, 27599, USA
| | - Ioannis N Anastopoulos
- Division of Translational Medicine and Human Genetics, Department of Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Katherine L Nathanson
- Division of Translational Medicine and Human Genetics, Department of Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
- Abramson Cancer Center, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Nancy R Zhang
- Department of Statistics, The Wharton School, University of Pennsylvania, Philadelphia, PA, 19104, USA.
| |
Collapse
|
10
|
Coassin S, Schönherr S, Weissensteiner H, Erhart G, Forer L, Losso JL, Lamina C, Haun M, Utermann G, Paulweber B, Specht G, Kronenberg F. A comprehensive map of single-base polymorphisms in the hypervariable LPA kringle IV type 2 copy number variation region. J Lipid Res 2018; 60:186-199. [PMID: 30413653 PMCID: PMC6314250 DOI: 10.1194/jlr.m090381] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2018] [Indexed: 12/21/2022] Open
Abstract
Lipoprotein (a) [Lp(a)] concentrations are among the strongest genetic risk factors for cardiovascular disease and present pronounced interethnic and interindividual differences. Approximately 90% of Lp(a) variance is controlled by the LPA gene, which contains a 5.6-kb-large copy number variation [kringle IV type 2 (KIV-2) repeat] that generates >40 protein isoforms. Variants within the KIV-2 region are not called in common sequencing projects, leaving up to 70% of the LPA coding region currently unaddressed. To completely assess the variability in LPA, we developed a sequencing strategy for this region and report here the first map of genetic variation in the KIV-2 region, a comprehensively evaluated ultradeep sequencing protocol, and an easy-to-use variant analysis pipeline. We sequenced 123 Central-European individuals and reanalyzed public data of 2,504 individuals from 26 populations. We found 14 different loss-of-function and splice-site mutations, as well as >100, partially even common, missense variants. Some coding variants were frequent in one population but absent in others. This provides novel candidates to explain the large ethnic and individual differences in Lp(a) concentrations. Importantly, our approach and pipeline are also applicable to other similar copy number variable regions, allowing access to regions that are not captured by common genome sequencing.
Collapse
Affiliation(s)
- Stefan Coassin
- Division of Genetic Epidemiology Department of Medical Genetics, Molecular and Clinical Pharmacology, Medical University of Innsbruck, Innsbruck, Austria
| | - Sebastian Schönherr
- Division of Genetic Epidemiology Department of Medical Genetics, Molecular and Clinical Pharmacology, Medical University of Innsbruck, Innsbruck, Austria
| | - Hansi Weissensteiner
- Division of Genetic Epidemiology Department of Medical Genetics, Molecular and Clinical Pharmacology, Medical University of Innsbruck, Innsbruck, Austria
| | - Gertraud Erhart
- Division of Genetic Epidemiology Department of Medical Genetics, Molecular and Clinical Pharmacology, Medical University of Innsbruck, Innsbruck, Austria
| | - Lukas Forer
- Division of Genetic Epidemiology Department of Medical Genetics, Molecular and Clinical Pharmacology, Medical University of Innsbruck, Innsbruck, Austria
| | - Jamie Lee Losso
- Division of Genetic Epidemiology Department of Medical Genetics, Molecular and Clinical Pharmacology, Medical University of Innsbruck, Innsbruck, Austria
| | - Claudia Lamina
- Division of Genetic Epidemiology Department of Medical Genetics, Molecular and Clinical Pharmacology, Medical University of Innsbruck, Innsbruck, Austria
| | - Margot Haun
- Division of Genetic Epidemiology Department of Medical Genetics, Molecular and Clinical Pharmacology, Medical University of Innsbruck, Innsbruck, Austria
| | - Gerd Utermann
- Division of Human Genetics, Department of Medical Genetics, Molecular and Clinical Pharmacology, Medical University of Innsbruck, Innsbruck, Austria
| | - Bernhard Paulweber
- First Department of Internal Medicine, Paracelsus Private Medical University, Salzburg, Austria
| | - Günther Specht
- Department of Database and Information Systems, Institute of Computer Science, University of Innsbruck, Innsbruck, Austria
| | - Florian Kronenberg
- Division of Genetic Epidemiology Department of Medical Genetics, Molecular and Clinical Pharmacology, Medical University of Innsbruck, Innsbruck, Austria
| |
Collapse
|
11
|
Urinary Exosomal and cell-free DNA Detects Somatic Mutation and Copy Number Alteration in Urothelial Carcinoma of Bladder. Sci Rep 2018; 8:14707. [PMID: 30279572 PMCID: PMC6168539 DOI: 10.1038/s41598-018-32900-6] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2018] [Accepted: 09/17/2018] [Indexed: 12/22/2022] Open
Abstract
Urothelial bladder carcinoma (UBC) is characterized by a large number of genetic alterations. DNA from urine is a promising source for liquid biopsy in urological malignancies. We aimed to assess the availability of cell-free DNA (cfDNA) and exosomal DNA (exoDNA) in urine as a source for liquid biopsy in UBC. We included 9 patients who underwent surgery for UBC and performed genomic profiling of tumor samples and matched urinary cfDNA and exoDNA. For mutation analysis, deep sequencing was performed for 9 gene targets and shallow whole genome sequencing (sWGS) was used for the detection of copy number variation (CNV). We analyzed whether genetic alteration in tumor samples was reflected in urinary cfDNA or exoDNA. To measure the similarity between copy number profiles of tumor tissue and urinary DNA, the Pearson’s correlation coefficient was calculated. We found 17 somatic mutations in 6 patients. Of the 17 somatic mutations, 14 and 12 were identified by analysis of cfDNA and exoDNA with AFs of 56.2% and 65.6%, respectively. In CNV analysis using sWGS, although the mean depth was 0.6X, we found amplification of MDM2, ERBB2, CCND1 and CCNE1, and deletion of CDKN2A, PTEN and RB1, all known to be frequently altered in UBC. CNV plots of cfDNA and exoDNA showed a similar pattern with those from the tumor samples. Pearson’s correlation coefficients of tumor vs. cfDNA (0.481) and tumor vs. exoDNA (0.412) were higher than that of tumor vs. normal (0.086). We successfully identified somatic mutations and CNV in UBC using urinary cfDNA and exoDNA. Urinary exoDNA could be another source for liquid biopsy. Also, CNV analysis using sWGS is an alternative strategy for liquid biopsy, providing data from the whole genome at a low cost.
Collapse
|
12
|
Lugowska I, Teterycz P, Mikula M, Kulecka M, Kluska A, Balabas A, Piatkowska M, Wagrodzki M, Pienkowski A, Rutkowski P, Ostrowski J. IDH1/2 Mutations Predict Shorter Survival in Chondrosarcoma. J Cancer 2018; 9:998-1005. [PMID: 29581779 PMCID: PMC5868167 DOI: 10.7150/jca.22915] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2017] [Accepted: 01/28/2018] [Indexed: 12/11/2022] Open
Abstract
Background. Recent studies have shown that isocitrate dehydrogenase 1/2 (IDH1/2)- activating mutations occur in a variety of cancers, including acute myeloid leukaemia, gliomas, and chondrosarcomas (CHS)s. The effect of IDH1/2 mutation on overall survival (OS) has not been reported in CHS. The aim of our study was to assess the prevalence of known cancer-related gene mutations in CHS, as well as their prognostic role in patient survival. Methods. DNA from FFPE samples of 80 patients (F:M- 1:1.3; mean age: 58 years; range 27-86) with histologically confirmed CHS (G1:29; G2:34; G3:17) was subjected to library preparation with the Ion AmpliSeq Cancer Hotspot Panel v2 and sequenced on the PGM Ion Torrent. Results. Among the clinical features only histological grade influenced OS. Deep sequencing identified 1784 single nucleotide variants. Of them, 426 were considered to be pathogenic or probably pathogenic. Activating IDH1/2 mutations were found in 27 patients (34%) including 17 R132 IDH1 (21%), 10 R172 IDH2 (13%) and 3 R140 IDH2 variants (4%). Three patients had concurrent IDH1 and IDH2 mutations. The R140 IDH2 mutant has not been reported to date in CHS patients. OS for CHS patients with IDH1/2 mutations was significantly lower than in patients without mutations (93% vs 64%; p<0.001). No other genetic feature of the Cancer Hotspot Panel had an impact on OS. Conclusions. In CHS, IDH1/2-mutation status and the histological aggressiveness of the CHS are important predictors for OS. The R140 IDH2 may also be a novel target for the treatment of CHS patients.
Collapse
Affiliation(s)
- Iwona Lugowska
- Department of Soft Tissue/Bone Sarcoma and Melanoma, Maria Sklodowska-Curie Memorial Cancer Center and Institute of Oncology; Roentgena 5, 02-781 Warsaw, Poland.,Early Phase Clinical Trials Unit, Maria Sklodowska-Curie Memorial Cancer Center and Institute of Oncology; Roentgena 5, 02-781 Warsaw, Poland
| | - Pawel Teterycz
- Department of Soft Tissue/Bone Sarcoma and Melanoma, Maria Sklodowska-Curie Memorial Cancer Center and Institute of Oncology; Roentgena 5, 02-781 Warsaw, Poland
| | - Michal Mikula
- Department of Genetics, Maria Sklodowska-Curie Memorial Cancer Center and Institute of Oncology; Roentgena 5, 02-781 Warsaw, Poland
| | - Maria Kulecka
- Department of Gastroenterology, Hepatology and Clinical Oncology, Medical Center for Postgraduate Education, Roentgena 5, 02-781 Warsaw, Poland
| | - Anna Kluska
- Department of Genetics, Maria Sklodowska-Curie Memorial Cancer Center and Institute of Oncology; Roentgena 5, 02-781 Warsaw, Poland
| | - Aneta Balabas
- Department of Genetics, Maria Sklodowska-Curie Memorial Cancer Center and Institute of Oncology; Roentgena 5, 02-781 Warsaw, Poland
| | - Magdalena Piatkowska
- Department of Genetics, Maria Sklodowska-Curie Memorial Cancer Center and Institute of Oncology; Roentgena 5, 02-781 Warsaw, Poland
| | - Michal Wagrodzki
- Department of Pathology, Maria Sklodowska-Curie Memorial Cancer Center and Institute of Oncology; Roentgena 502-781 Warsaw, Poland
| | - Andrzej Pienkowski
- Department of Soft Tissue/Bone Sarcoma and Melanoma, Maria Sklodowska-Curie Memorial Cancer Center and Institute of Oncology; Roentgena 5, 02-781 Warsaw, Poland
| | - Piotr Rutkowski
- Department of Soft Tissue/Bone Sarcoma and Melanoma, Maria Sklodowska-Curie Memorial Cancer Center and Institute of Oncology; Roentgena 5, 02-781 Warsaw, Poland
| | - Jerzy Ostrowski
- Department of Genetics, Maria Sklodowska-Curie Memorial Cancer Center and Institute of Oncology; Roentgena 5, 02-781 Warsaw, Poland.,Department of Gastroenterology, Hepatology and Clinical Oncology, Medical Center for Postgraduate Education, Roentgena 5, 02-781 Warsaw, Poland
| |
Collapse
|
13
|
Momozawa Y, Akiyama M, Kamatani Y, Arakawa S, Yasuda M, Yoshida S, Oshima Y, Mori R, Tanaka K, Mori K, Inoue S, Terasaki H, Yasuma T, Honda S, Miki A, Inoue M, Fujisawa K, Takahashi K, Yasukawa T, Yanagi Y, Kadonosono K, Sonoda KH, Ishibashi T, Takahashi A, Kubo M. Low-frequency coding variants in CETP and CFB are associated with susceptibility of exudative age-related macular degeneration in the Japanese population. Hum Mol Genet 2018; 25:5027-5034. [PMID: 28173125 DOI: 10.1093/hmg/ddw335] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2016] [Revised: 09/08/2016] [Accepted: 09/28/2016] [Indexed: 12/21/2022] Open
Abstract
Age-related macular degeneration (AMD) is a major cause of blindness in the elderly. Previous sequencing studies of AMD susceptibility genes have revealed the association of rare coding variants in CFH, CFI, C3 and C9 in European population; however, the impact of rare or low-frequency coding variants on AMD susceptibility in other populations is largely unknown. To identify the role of low-frequency coding variants on exudative AMD susceptibility in a Japanese population, we analysed the association of coding variants of 34 AMD candidate genes in the two-stage design by a multiplex PCR-based target sequencing method. We used a total of 2,886 (1st: 827, 2nd: 2,059) exudative AMD cases including typical AMD, polypoidal choroidal vasculopathy, and retinal angiomatous proliferation and 9,337 (1st: 3,247 2nd: 6,090) controls. Gene-based analysis found a significant association of low-frequency variants (minor allele frequency (MAF) < 0.05) in CETP, C2 and CFB. The association of CETP remained after conditioned with all known genome-wide association study (GWAS) associated variants. In addition, when we included only disruptive variants, enrichment of rare variants (MAF < 0.01) was also observed after conditioned with all GWAS associated variants (P = 1.03 × 10−6, odds ratio (OR) = 2.48). Haplotype and conditional analysis of the C2-CFB-SKIV2L locus showed a low-frequency variant (R74H) in CFB would be individually associated with AMD susceptibility independent of the GWAS associated SNP. These findings highlight the importance of target sequencing to reveal the impact of rare or low-frequency coding variants on disease susceptibility in different ethnic populations.
Collapse
Affiliation(s)
- Yukihide Momozawa
- Laboratory for Genotyping Development, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan
| | - Masato Akiyama
- Laboratory for Genotyping Development, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan.,Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan.,Department of Ophthalmology, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan
| | - Yoichiro Kamatani
- Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan
| | - Satoshi Arakawa
- Department of Ophthalmology, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan.,Japan Community Health care Organization, Kyushu Hospital, Fukuoka, Japan
| | - Miho Yasuda
- Department of Ophthalmology, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan
| | - Shigeo Yoshida
- Department of Ophthalmology, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan
| | - Yuji Oshima
- Department of Ophthalmology, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan
| | - Ryusaburo Mori
- Division of Ophthalmology, Department of Visual Sciences, Nihon University School of Medicine, Nihon University Hospital, Tokyo, Japan
| | - Koji Tanaka
- Division of Ophthalmology, Department of Visual Sciences, Nihon University School of Medicine, Nihon University Hospital, Tokyo, Japan
| | - Keisuke Mori
- Department of Ophthalmology, Saitama Medical University, Saitama, Japan.,Department of Ophthalmology, International University of Health and Welfare Hospital, Tochigi, Japan
| | - Satoshi Inoue
- Division of Gene Regulation and Signal Transduction, Research Center for Genomic Medicine, Saitama Medical University, Saitama, Japan
| | - Hiroko Terasaki
- Department of Ophthalmology, Nagoya University Graduate School of Medicine, Aichi, Japan
| | - Tetsuhiro Yasuma
- Department of Ophthalmology, Nagoya University Graduate School of Medicine, Aichi, Japan
| | - Shigeru Honda
- Department of Surgery, Division of Ophthalmology, Kobe University Graduate School of Medicine, Hyogo, Japan
| | - Akiko Miki
- Department of Surgery, Division of Ophthalmology, Kobe University Graduate School of Medicine, Hyogo, Japan
| | - Maiko Inoue
- Department of Surgery, Division of Ophthalmology, Kobe University Graduate School of Medicine, Hyogo, Japan
| | - Kimihiko Fujisawa
- Japan Community Health care Organization, Kyushu Hospital, Fukuoka, Japan
| | - Kanji Takahashi
- Department of Surgery, Division of Ophthalmology, Kobe University Graduate School of Medicine, Hyogo, Japan
| | - Tsutomu Yasukawa
- Department of Surgery, Division of Ophthalmology, Kobe University Graduate School of Medicine, Hyogo, Japan
| | - Yasuo Yanagi
- Department of Ophthalmology and Visual Science, Nagoya City University Graduate School of Medical Sciences, Nagoya, Japan
| | - Kazuaki Kadonosono
- Department of Pediatric Surgery, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan
| | - Koh-Hei Sonoda
- Department of Ophthalmology, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan
| | - Tatsuro Ishibashi
- Department of Ophthalmology, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Japan
| | - Atsushi Takahashi
- Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan
| | - Michiaki Kubo
- Laboratory for Genotyping Development, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan
| |
Collapse
|
14
|
Abstract
BRCA1 and BRCA2 genes are implicated in 20-25% of hereditary breast and ovarian cancers. New age sequencing platforms have revolutionized massively parallel sequencing in clinical practice by providing cost effective, rapid, and sensitive sequencing. This study critically evaluates the false positives in multiplex panels and suggests the need for careful analysis. We employed multiplex PCR based BRCA1 and BRCA2 community Panel with ion torrent PGM machine for evaluation of these mutations. Out of all 41samples analyzed for BRCA1 and BRCA2 five were found with 950_951 insA(Asn319fs) at Chr13:32906565 position and one sample with 1032_1033 insA(Asn346fs) at Chr13:32906647, both being frame-shift mutations in BRCA2 gene. 950_951 insA(Asn319fs) mutation is reported as pathogenic allele in NCBI dbSNP. On examination of IGV for all these samples, it was seen that both mutations had 'A' nucleotide insertion at 950, and 1032 position in exon 10 of BRCA2 gene. Sanger Sequencing did not confirm these insertions. Next-generation sequencing shows great promise by allowing rapid mutational analysis of multiple genes in human cancer but our results indicate the need for careful sequence analysis to avoid false positive results.
Collapse
|
15
|
Low frequency of Fabry disease in patients with common heart disease. Genet Med 2017; 20:754-759. [PMID: 29227985 DOI: 10.1038/gim.2017.175] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2017] [Accepted: 09/05/2017] [Indexed: 12/23/2022] Open
Abstract
PURPOSE To test the hypothesis that undiagnosed patients with Fabry disease exist among patients affected by common heart disease. METHODS Globotriaosylceramide in random whole urine using tandem mass spectroscopy, α-galactosidase A activity in dried blood spots, and next-generation sequencing of pooled or individual genomic DNA samples supplemented by Sanger sequencing. RESULTS We tested 2,256 consecutive patients: 852 women (median age 65 years (19-95)) and 1,404 men (median age 65 years (21-92)). The primary diagnoses were coronary artery disease (n = 994), arrhythmia (n = 607), cardiomyopathy (n = 138), and valvular disease (n = 568). Urinary globotriaosylceramide was elevated in 15% of patients and 15 males had low α-galactosidase A activity. GLA variants found included R118C (n = 2), D83N, and D313Y (n = 7); IVS6-22 C>T, IVS4-16 A>G, IVS2+990C>A, 5'UTR-10 C>T (n = 4), IVS1-581 C>T, IVS1-1238 G>A, 5'UTR-30 G>A, IVS2+590C>T, IVS0-12 G>A, IVS4+68A>G, IVS0-10 C>T, IVS2-81-77delCAGCC, IVS2-77delC. Although the pathogenicity of several of these missense mutations and complex intronic haplotypes has been controversial, none of the patients screened in this study were diagnosed definitively with Fabry disease. CONCLUSION This population of patients with common heart disease did not contain a substantial number of patients with undiagnosed Fabry disease. GLA gene sequencing is superior to urinary globotriaosylceramide or α-galactosidase A activity in the screening for Fabry disease.
Collapse
|
16
|
Hao Y, Xuei X, Li L, Nakshatri H, Edenberg HJ, Liu Y. RareVar: A Framework for Detecting Low-Frequency Single-Nucleotide Variants. J Comput Biol 2017; 24:637-646. [PMID: 28541743 DOI: 10.1089/cmb.2017.0057] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Accurate identification of low-frequency somatic point mutations in tumor samples has important clinical utilities. Although high-throughput sequencing technology enables capturing such variants while sequencing primary tumor samples, our ability for accurate detection is compromised when the variant frequency is close to the sequencer error rate. Most current experimental and bioinformatic strategies target mutations with ≥5% allele frequency, which limits our ability to understand the cancer etiology and tumor evolution. We present an experimental and computational modeling framework, RareVar, to reliably identify low-frequency single-nucleotide variants from high-throughput sequencing data under standard experimental protocols. RareVar protocol includes a benchmark design by pooling DNAs from already sequenced individuals at various concentrations to target variants at desired frequencies, 0.5%-3% in our case. By applying a generalized, linear model-based, position-specific error model, followed by machine-learning-based variant calibration, our approach outperforms existing methods. Our method can be applied on most capture and sequencing platforms without modifying the experimental protocol.
Collapse
Affiliation(s)
- Yangyang Hao
- 1 Department of Medical and Molecular Genetics, Indiana University School of Medicine , Indianapolis, Indiana.,2 Center for Computational Biology and Bioinformatics, Indiana University School of Medicine , Indianapolis, Indiana
| | - Xiaoling Xuei
- 3 Department of Biochemistry and Molecular Biology, Indiana University School of Medicine , Indianapolis, Indiana.,4 Center for Medical Genomics, Indiana University School of Medicine , Indianapolis, Indiana
| | - Lang Li
- 1 Department of Medical and Molecular Genetics, Indiana University School of Medicine , Indianapolis, Indiana.,2 Center for Computational Biology and Bioinformatics, Indiana University School of Medicine , Indianapolis, Indiana
| | - Harikrishna Nakshatri
- 5 Department of Surgery, Indiana University School of Medicine , Indianapolis, Indiana.,6 IU Simon Cancer Center, Indiana University School of Medicine , Indianapolis, Indiana
| | - Howard J Edenberg
- 1 Department of Medical and Molecular Genetics, Indiana University School of Medicine , Indianapolis, Indiana.,3 Department of Biochemistry and Molecular Biology, Indiana University School of Medicine , Indianapolis, Indiana.,4 Center for Medical Genomics, Indiana University School of Medicine , Indianapolis, Indiana
| | - Yunlong Liu
- 1 Department of Medical and Molecular Genetics, Indiana University School of Medicine , Indianapolis, Indiana.,2 Center for Computational Biology and Bioinformatics, Indiana University School of Medicine , Indianapolis, Indiana.,4 Center for Medical Genomics, Indiana University School of Medicine , Indianapolis, Indiana.,6 IU Simon Cancer Center, Indiana University School of Medicine , Indianapolis, Indiana
| |
Collapse
|
17
|
Wang K, Lai S, Yang X, Zhu T, Lu X, Wu CI, Ruan J. Ultrasensitive and high-efficiency screen of de novo low-frequency mutations by o2n-seq. Nat Commun 2017; 8:15335. [PMID: 28530222 PMCID: PMC5458117 DOI: 10.1038/ncomms15335] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2016] [Accepted: 03/21/2017] [Indexed: 12/15/2022] Open
Abstract
Detection of de novo, low-frequency mutations is essential for characterizing cancer genomes and heterogeneous cell populations. However, the screening capacity of current ultrasensitive NGS methods is inadequate owing to either low-efficiency read utilization or severe amplification bias. Here, we present o2n-seq, an ultrasensitive and high-efficiency NGS library preparation method for discovering de novo, low-frequency mutations. O2n-seq reduces the error rate of NGS to 10-5-10-8. The efficiency of its data usage is about 10-30 times higher than that of barcode-based strategies. For detecting mutations with allele frequency (AF) 1% in 4.6 Mb-sized genome, the sensitivity and specificity of o2n-seq reach to 99% and 98.64%, respectively. For mutations with AF around 0.07% in phix174, o2n-seq detects all the mutations with 100% specificity. Moreover, we successfully apply o2n-seq to screen de novo, low-frequency mutations in human tumours. O2n-seq will aid to characterize the landscape of somatic mutations in research and clinical settings.
Collapse
Affiliation(s)
- Kaile Wang
- Agricultural Genomics Institute, Chinese Academy of Agricultural Sciences, Pengfei Road No. 7, Dapeng New District, Shenzhen, Guangdong 518120, China
- Key Laboratory of Genomics and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Chaoyang, Beijing 100101, China
- University of Chinese Academy of Sciences, Shijingshan, Beijing 100049, China
| | - Shujuan Lai
- Key Laboratory of Genomics and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Chaoyang, Beijing 100101, China
| | - Xiaoxu Yang
- Center for Bioinformatics, State Key Laboratory of Protein and Plant Gene Research, School of Life Sciences, Peking University, Haidian, Beijing 100871, China
| | - Tianqi Zhu
- Institute of Applied Mathematics, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Haidian, Beijing 100190, China
- Key Laboratory of Random Complex Structures and Data Science, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China
| | - Xuemei Lu
- Key Laboratory of Genomics and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Chaoyang, Beijing 100101, China
| | - Chung-I Wu
- Key Laboratory of Genomics and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Chaoyang, Beijing 100101, China
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou, Guangdong 510275, China
- Department of Ecology and Evolution, University of Chicago, Chicago, Illinois 60637, USA
| | - Jue Ruan
- Agricultural Genomics Institute, Chinese Academy of Agricultural Sciences, Pengfei Road No. 7, Dapeng New District, Shenzhen, Guangdong 518120, China
| |
Collapse
|
18
|
MAGERI: Computational pipeline for molecular-barcoded targeted resequencing. PLoS Comput Biol 2017; 13:e1005480. [PMID: 28475621 PMCID: PMC5419444 DOI: 10.1371/journal.pcbi.1005480] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2016] [Accepted: 03/24/2017] [Indexed: 12/16/2022] Open
Abstract
Unique molecular identifiers (UMIs) show outstanding performance in targeted high-throughput resequencing, being the most promising approach for the accurate identification of rare variants in complex DNA samples. This approach has application in multiple areas, including cancer diagnostics, thus demanding dedicated software and algorithms. Here we introduce MAGERI, a computational pipeline that efficiently handles all caveats of UMI-based analysis to obtain high-fidelity mutation profiles and call ultra-rare variants. Using an extensive set of benchmark datasets including gold-standard biological samples with known variant frequencies, cell-free DNA from tumor patient blood samples and publicly available UMI-encoded datasets we demonstrate that our method is both robust and efficient in calling rare variants. The versatility of our software is supported by accurate results obtained for both tumor DNA and viral RNA samples in datasets prepared using three different UMI-based protocols.
Collapse
|
19
|
Chandrani P, Prabhash K, Prasad R, Sethunath V, Ranjan M, Iyer P, Aich J, Dhamne H, Iyer DN, Upadhyay P, Mohanty B, Chandna P, Kumar R, Joshi A, Noronha V, Patil V, Ramaswamy A, Karpe A, Thorat R, Chaudhari P, Ingle A, Choughule A, Dutt A. Drug-sensitive FGFR3 mutations in lung adenocarcinoma. Ann Oncol 2017; 28:597-603. [PMID: 27998968 PMCID: PMC5391708 DOI: 10.1093/annonc/mdw636] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
BACKGROUND Lung cancer is the leading cause of cancer-related deaths across the world. In this study, we present therapeutically relevant genetic alterations in lung adenocarcinoma of Indian origin. MATERIALS AND METHODS Forty-five primary lung adenocarcinoma tumors were sequenced for 676 amplicons using RainDance cancer panel at an average coverage of 1500 × (reads per million mapped reads). To validate the findings, 49 mutations across 23 genes were genotyped in an additional set of 363 primary lung adenocarcinoma tumors using mass spectrometry. NIH/3T3 cells over expressing mutant and wild-type FGFR3 constructs were characterized for anchorage independent growth, constitutive activation, tumor formation and sensitivity to FGFR inhibitors using in vitro and xenograft mouse models. RESULTS We present the first spectrum of actionable alterations in lung adenocarcinoma tumors of Indian origin, and shows that mutations of FGFR3 are present in 20 of 363 (5.5%) patients. These FGFR3 mutations are constitutively active and oncogenic when ectopically expressed in NIH/3T3 cells and using a xenograft model in NOD/SCID mice. Inhibition of FGFR3 kinase activity inhibits transformation of NIH/3T3 overexpressing FGFR3 constructs and growth of tumors driven by FGFR3 in the xenograft models. The reduction in tumor size in the mouse is paralleled by a reduction in the amounts of phospho-ERK, validating the in vitro findings. Interestingly, the FGFR3 mutations are significantly higher in a proportion of younger patients and show a trend toward better overall survival, compared with patients lacking actionable alterations or those harboring KRAS mutations. CONCLUSION We present the first actionable mutation spectrum in Indian lung cancer genome. These findings implicate FGFR3 as a novel therapeutic in lung adenocarcinoma.
Collapse
Affiliation(s)
- P. Chandrani
- Integrated Genomics Laboratory, Advanced Centre for Treatment, Research and Education in Cancer, Tata Memorial Centre, Navi Mumbai
- Homi Bhabha National Institute, Training School Complex, Anushakti Nagar, Mumbai
| | - K. Prabhash
- Homi Bhabha National Institute, Training School Complex, Anushakti Nagar, Mumbai
- Department of Medical Oncology, Tata Memorial Hospital
| | - R. Prasad
- Integrated Genomics Laboratory, Advanced Centre for Treatment, Research and Education in Cancer, Tata Memorial Centre, Navi Mumbai
| | - V. Sethunath
- Integrated Genomics Laboratory, Advanced Centre for Treatment, Research and Education in Cancer, Tata Memorial Centre, Navi Mumbai
| | - M. Ranjan
- Integrated Genomics Laboratory, Advanced Centre for Treatment, Research and Education in Cancer, Tata Memorial Centre, Navi Mumbai
| | - P. Iyer
- Integrated Genomics Laboratory, Advanced Centre for Treatment, Research and Education in Cancer, Tata Memorial Centre, Navi Mumbai
- Homi Bhabha National Institute, Training School Complex, Anushakti Nagar, Mumbai
| | - J. Aich
- Integrated Genomics Laboratory, Advanced Centre for Treatment, Research and Education in Cancer, Tata Memorial Centre, Navi Mumbai
| | - H. Dhamne
- Integrated Genomics Laboratory, Advanced Centre for Treatment, Research and Education in Cancer, Tata Memorial Centre, Navi Mumbai
| | - D. N. Iyer
- Integrated Genomics Laboratory, Advanced Centre for Treatment, Research and Education in Cancer, Tata Memorial Centre, Navi Mumbai
| | - P. Upadhyay
- Integrated Genomics Laboratory, Advanced Centre for Treatment, Research and Education in Cancer, Tata Memorial Centre, Navi Mumbai
- Homi Bhabha National Institute, Training School Complex, Anushakti Nagar, Mumbai
| | - B. Mohanty
- Small Animal Imaging Facility, Advanced Centre for Treatment, Research and Education in Cancer, Tata Memorial Centre, Navi Mumbai
| | - P. Chandna
- AceProbe Technologies Pvt. Ltd, New Delhi, India
| | - R. Kumar
- Department of Pathology, Tata Memorial Hospital
| | - A. Joshi
- Department of Medical Oncology, Tata Memorial Hospital
| | - V. Noronha
- Department of Medical Oncology, Tata Memorial Hospital
| | - V. Patil
- Department of Medical Oncology, Tata Memorial Hospital
| | - A. Ramaswamy
- Department of Medical Oncology, Tata Memorial Hospital
| | - A. Karpe
- Department of Medical Oncology, Tata Memorial Hospital
| | - R. Thorat
- Department of Pathology, Tata Memorial Hospital
| | - P. Chaudhari
- Small Animal Imaging Facility, Advanced Centre for Treatment, Research and Education in Cancer, Tata Memorial Centre, Navi Mumbai
| | - A. Ingle
- Laboratory Animal Facility, Advanced Centre for Treatment, Research and Education in Cancer, Tata Memorial Centre, Navi Mumbai
| | - A. Choughule
- Department of Medical Oncology, Tata Memorial Hospital
| | - A. Dutt
- Integrated Genomics Laboratory, Advanced Centre for Treatment, Research and Education in Cancer, Tata Memorial Centre, Navi Mumbai
- Homi Bhabha National Institute, Training School Complex, Anushakti Nagar, Mumbai
- Correspondence to: Dr Amit Dutt, Wellcome Trust/DBT India Alliance Intermediate Fellow, Tata Memorial Centre, ACTREC, Navi Mumbai 410 210, India. Tel: +91-22-27405056; E-mail:
| |
Collapse
|
20
|
Ma S, Murphy TW, Lu C. Microfluidics for genome-wide studies involving next generation sequencing. BIOMICROFLUIDICS 2017; 11:021501. [PMID: 28396707 PMCID: PMC5346105 DOI: 10.1063/1.4978426] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/06/2017] [Accepted: 02/16/2017] [Indexed: 05/11/2023]
Abstract
Next-generation sequencing (NGS) has revolutionized how molecular biology studies are conducted. Its decreasing cost and increasing throughput permit profiling of genomic, transcriptomic, and epigenomic features for a wide range of applications. Microfluidics has been proven to be highly complementary to NGS technology with its unique capabilities for handling small volumes of samples and providing platforms for automation, integration, and multiplexing. In this article, we review recent progress on applying microfluidics to facilitate genome-wide studies. We emphasize on several technical aspects of NGS and how they benefit from coupling with microfluidic technology. We also summarize recent efforts on developing microfluidic technology for genomic, transcriptomic, and epigenomic studies, with emphasis on single cell analysis. We envision rapid growth in these directions, driven by the needs for testing scarce primary cell samples from patients in the context of precision medicine.
Collapse
Affiliation(s)
- Sai Ma
- Department of Biomedical Engineering and Mechanics, Virginia Tech , Blacksburg, Virginia 24061, USA
| | - Travis W Murphy
- Department of Chemical Engineering, Virginia Tech , Blacksburg, Virginia 24061, USA
| | - Chang Lu
- Department of Chemical Engineering, Virginia Tech , Blacksburg, Virginia 24061, USA
| |
Collapse
|
21
|
Applications of molecular testing in surgical pathology of the head and neck. Mod Pathol 2017; 30:S104-S111. [PMID: 28060367 DOI: 10.1038/modpathol.2016.192] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2016] [Revised: 10/12/2016] [Accepted: 10/12/2016] [Indexed: 12/15/2022]
Abstract
Molecular testing in routine surgical pathology is becoming an important component of the workup of many different types of tumors. In fact, in some organ systems, guidelines now suggest that the standard of care is to obtain specific molecular panels for tumor classification and/or therapeutic planning. In the head and neck, clinically applicable molecular tests are not as abundant as in other organ systems. Most current head and neck biomarkers are utilized for diagnosis rather than as companion diagnostic tests to predict therapeutic response. As the number of potential molecular biomarker assays increases and cost pressures escalate, the pathologist must be able to navigate the molecular testing pathways. This review explores scenarios in which molecular testing might be beneficial and cost-effective in head and neck pathology.
Collapse
|
22
|
Perkins G, Lu H, Garlan F, Taly V. Droplet-Based Digital PCR: Application in Cancer Research. Adv Clin Chem 2016; 79:43-91. [PMID: 28212714 DOI: 10.1016/bs.acc.2016.10.001] [Citation(s) in RCA: 74] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
The efficient characterization of genetic and epigenetic alterations in oncology, virology, or prenatal diagnostics requires highly sensitive and specific high-throughput approaches. Nevertheless, with the use of conventional methods, sensitivity and specificity were largely limited. By partitioning individual target molecules within distinct compartments, digital PCR (dPCR) could overcome these limitations and detect very rare sequences with unprecedented precision and sensitivity. In dPCR, the sample is diluted such that each individual partition will contain no more than one target sequence. Following the assay reaction, the dPCR process provides an absolute value and analyzable quantitative data. The recent coupling of dPCR with microfluidic systems in commercial platforms should lead to an essential tool for the management of patients with cancer, especially adapted to the analysis of precious samples. Applications in cancer research range from the analysis of tumor heterogeneity to that of a range of body fluids. Droplet-based dPCR is indeed particularly appropriate for the emerging field of liquid biopsy analysis. In this review, following an overview of the development in dPCR technology and different strategies based on the use of microcompartments, we will focus particularly on the applications and latest development of microfluidic droplet-based dPCR in oncology.
Collapse
Affiliation(s)
- G Perkins
- Université Sorbonne Paris Cité, INSERM UMR-S1147, CNRS SNC 5014, Centre Universitaire des Saints-Pères, Equipe labélisée LIGUE Contre le Cancer, Paris, France; European Georges Pompidou Hospital, AP-HP - Paris Descartes University, Paris, France
| | - H Lu
- Université Sorbonne Paris Cité, INSERM UMR-S1147, CNRS SNC 5014, Centre Universitaire des Saints-Pères, Equipe labélisée LIGUE Contre le Cancer, Paris, France
| | - F Garlan
- Université Sorbonne Paris Cité, INSERM UMR-S1147, CNRS SNC 5014, Centre Universitaire des Saints-Pères, Equipe labélisée LIGUE Contre le Cancer, Paris, France
| | - V Taly
- Université Sorbonne Paris Cité, INSERM UMR-S1147, CNRS SNC 5014, Centre Universitaire des Saints-Pères, Equipe labélisée LIGUE Contre le Cancer, Paris, France.
| |
Collapse
|
23
|
Artyomenko A, Wu NC, Mangul S, Eskin E, Sun R, Zelikovsky A. Long Single-Molecule Reads Can Resolve the Complexity of the Influenza Virus Composed of Rare, Closely Related Mutant Variants. J Comput Biol 2016; 24:558-570. [PMID: 27901586 DOI: 10.1089/cmb.2016.0146] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
As a result of a high rate of mutations and recombination events, an RNA-virus exists as a heterogeneous "swarm" of mutant variants. The long read length offered by single-molecule sequencing technologies allows each mutant variant to be sequenced in a single pass. However, high error rate limits the ability to reconstruct heterogeneous viral population composed of rare, related mutant variants. In this article, we present two single-nucleotide variants (2SNV), a method able to tolerate the high error rate of the single-molecule protocol and reconstruct mutant variants. 2SNV uses linkage between single-nucleotide variations to efficiently distinguish them from read errors. To benchmark the sensitivity of 2SNV, we performed a single-molecule sequencing experiment on a sample containing a titrated level of known viral mutant variants. Our method is able to accurately reconstruct clone with frequency of 0.2% and distinguish clones that differed in only two nucleotides distantly located on the genome. 2SNV outperforms existing methods for full-length viral mutant reconstruction.
Collapse
Affiliation(s)
| | - Nicholas C Wu
- 2 Department of Integrative Structural and Computational Biology, The Scripps Research Institute , La Jolla, California
| | - Serghei Mangul
- 3 Department of Computer Science, University of California , Los Angeles, Los Angeles, California.,4 Institute for Quantitative and Computational Biosciences, University of California Los Angeles , Los Angeles, California
| | - Eleazar Eskin
- 3 Department of Computer Science, University of California , Los Angeles, Los Angeles, California
| | - Ren Sun
- 5 Molecular and Medical Pharmacology, University of California , Los Angeles, Los Angeles, California
| | - Alex Zelikovsky
- 1 Department of Computer Science, Georgia State University , Atlanta, Georgia
| |
Collapse
|
24
|
Hao Y, Zhang P, Xuei X, Nakshatri H, Edenberg HJ, Li L, Liu Y. Statistical modeling for sensitive detection of low-frequency single nucleotide variants. BMC Genomics 2016; 17 Suppl 7:514. [PMID: 27556804 PMCID: PMC5001245 DOI: 10.1186/s12864-016-2905-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Background Sensitive detection of low-frequency single nucleotide variants carries great significance in many applications. In cancer genetics research, tumor biopsies are a mixture of normal and tumor cells from various subpopulations due to tumor heterogeneity. Thus the frequencies of somatic variants from a subpopulation tend to be low. Liquid biopsies, which monitor circulating tumor DNA in blood to detect metastatic potential, also face the challenge of detecting low-frequency variants due to the small percentage of the circulating tumor DNA in blood. Moreover, in population genetics research, although pooled sequencing of a large number of individuals is cost-effective, pooling dilutes the signals of variants from any individual. Detection of low frequency variants is difficult and can be cofounded by sequencing artifacts. Existing methods are limited in sensitivity and mainly focus on frequencies around 2 % to 5 %; most fail to consider differential sequencing artifacts. Results We aimed to push down the frequency detection limit close to the position specific sequencing error rates by modeling the observed erroneous read counts with respect to genomic sequence contexts. 4 distributions suitable for count data modeling (using generalized linear models) were extensively characterized in terms of their goodness-of-fit as well as the performances on real sequencing data benchmarks, which were specifically designed for testing detection of low-frequency variants; two sequencing technologies with significantly different chemistry mechanisms were used to explore systematic errors. We found the zero-inflated negative binomial distribution generalized linear mode is superior to the other models tested, and the advantage is most evident at 0.5 % to 1 % range. This method is also generalizable to different sequencing technologies. Under standard sequencing protocols and depth given in the testing benchmarks, 95.3 % recall and 79.9 % precision for Ion Proton data, 95.6 % recall and 97.0 % precision for Illumina MiSeq data were achieved for SNVs with frequency > = 1 %, while the detection limit is around 0.5 %. Conclusions Our method enables sensitive detection of low-frequency single nucleotide variants across different sequencing platforms and will facilitate research and clinical applications such as pooled sequencing, cancer early detection, prognostic assessment, metastatic monitoring, and relapses or acquired resistance identification. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-2905-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Yangyang Hao
- Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, 46202, USA.,Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, IN, 46202, USA
| | - Pengyue Zhang
- Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, IN, 46202, USA.,Department of Biostatistics, Indiana University School of Medicine, Indianapolis, IN, 46202, USA
| | - Xiaoling Xuei
- Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, IN, 46202, USA.,Center for Medical Genomics, Indiana University School of Medicine, Indianapolis, IN, 46202, USA
| | - Harikrishna Nakshatri
- Department of Surgery, Indiana University School of Medicine, Indianapolis, IN, 46202, USA.,IU Simon Cancer Center, Indiana University School of Medicine, Indianapolis, IN, 46202, USA
| | - Howard J Edenberg
- Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, 46202, USA.,Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, IN, 46202, USA.,Center for Medical Genomics, Indiana University School of Medicine, Indianapolis, IN, 46202, USA
| | - Lang Li
- Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, 46202, USA.,Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, IN, 46202, USA
| | - Yunlong Liu
- Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, 46202, USA. .,Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, IN, 46202, USA. .,Center for Medical Genomics, Indiana University School of Medicine, Indianapolis, IN, 46202, USA. .,IU Simon Cancer Center, Indiana University School of Medicine, Indianapolis, IN, 46202, USA.
| |
Collapse
|
25
|
Wang K, Ma X, Zhang X, Wu D, Sun C, Sun Y, Lu X, Wu CI, Guo C, Ruan J. Using ultra-sensitive next generation sequencing to dissect DNA damage-induced mutagenesis. Sci Rep 2016; 6:25310. [PMID: 27122023 PMCID: PMC4848531 DOI: 10.1038/srep25310] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2015] [Accepted: 04/13/2016] [Indexed: 12/17/2022] Open
Abstract
Next generation sequencing (NGS) technologies have dramatically improved studies in biology and biomedical science. However, no optimal NGS approach is available to conveniently analyze low frequency mutations caused by DNA damage treatments. Here, by developing an exquisite ultra-sensitive NGS (USNGS) platform “EasyMF” and incorporating it with a widely used supF shuttle vector-based mutagenesis system, we can conveniently dissect roles of lesion bypass polymerases in damage-induced mutagenesis. In this improved mutagenesis analysis pipeline, the initial steps are the same as in the supF mutation assay, involving damaging the pSP189 plasmid followed by its transfection into human 293T cells to allow replication to occur. Then “EasyMF” is employed to replace downstream MBM7070 bacterial transformation and other steps for analyzing damage-induced mutation frequencies and spectra. This pipeline was validated by using UV damaged plasmid after its replication in lesion bypass polymerase-deficient 293T cells. The increased throughput and reduced cost of this system will allow us to conveniently screen regulators of translesion DNA synthesis pathway and monitor environmental genotoxic substances, which can ultimately provide insight into the mechanisms of genome stability and mutagenesis.
Collapse
Affiliation(s)
- Kaile Wang
- Key Laboratory of Genomics and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Xiaolu Ma
- Key Laboratory of Genomics and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Xue Zhang
- Key Laboratory of Genomics and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Dafei Wu
- Key Laboratory of Genomics and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Chenyi Sun
- Key Laboratory of Genomics and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Yazhou Sun
- Key Laboratory of Genomics and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Xuemei Lu
- Key Laboratory of Genomics and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Chung-I Wu
- Key Laboratory of Genomics and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou, China.,Department of Ecology and Evolution, University of Chicago, USA
| | - Caixia Guo
- Key Laboratory of Genomics and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Jue Ruan
- Key Laboratory of Genomics and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| |
Collapse
|
26
|
Wang K, Ma Q, Jiang L, Lai S, Lu X, Hou Y, Wu CI, Ruan J. Ultra-precise detection of mutations by droplet-based amplification of circularized DNA. BMC Genomics 2016; 17:214. [PMID: 26960407 PMCID: PMC4784281 DOI: 10.1186/s12864-016-2480-1] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2015] [Accepted: 02/16/2016] [Indexed: 01/16/2023] Open
Abstract
Background NGS (next generation sequencing) has been widely used in studies of biological processes, ranging from microbial evolution to cancer genomics. However, the error rate of NGS (0.1 % ~ 1 %) is still remaining a great challenge for comprehensively investigating the low frequency variations, and the current solution methods have suffered severe amplification bias or low efficiency. Results We creatively developed Droplet-CirSeq for relatively efficient, low-bias and ultra-sensitive identification of variations by combining millions of picoliter uniform-sized droplets with Cir-seq. Droplet-CirSeq is entitled with an incredibly low error rate of 3 ~ 5 X 10-6. To systematically evaluate the performances of amplification uniformity and capability of mutation identification for Droplet-CirSeq, we took the mixtures of two E. coli strains as specific instances to simulate the circumstances of mutations with different frequencies. Compared with Cir-seq, the coefficient of variance of read depth for Droplet-CirSeq was 10 times less (p = 2.6 X 10-3), and the identified allele frequency presented more concentrated to the authentic frequency of mixtures (p = 4.8 X 10-3), illustrating a significant improvement of amplification bias and accuracy in allele frequency determination. Additionally, Droplet-CirSeq detected 2.5 times genuine SNPs (p < 0.001), achieved a 2.8 times lower false positive rate (p < 0.05) and a 1.5 times lower false negative rate (p < 0.001), in the case of a 3 pg DNA input. Intriguingly, the false positive sites predominantly represented in two types of base substitutions (G- > A, C- > T). Our findings indicated that 30 pg DNA input accommodated in 5 ~ 10 million droplets resulted in maximal detection of authentic mutations compared to 3 pg (p = 1.2 X 10-8) and 300 pg input (p = 2.2 X 10-3). Conclusions We developed a method namely Droplet-CirSeq to significantly improve the amplification bias, which presents obvious superiority over the currently prevalent methods in exploitation of ultra-low frequency mutations. Droplet-CirSeq would be promisingly used in the identification of low frequency mutations initiated from extremely low input DNA, such as DNA of uncultured microorganisms, captured DNA of target region, circulation DNA of plasma et al, and its creative conception of rolling circle amplification in droplets would also be used in other low input DNA amplification fields. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-2480-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Kaile Wang
- Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Qin Ma
- Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Lan Jiang
- Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
| | - Shujuan Lai
- Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
| | - Xuemei Lu
- Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
| | - Yali Hou
- Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China.
| | - Chung-I Wu
- Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China. .,State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou, China. .,Department of Ecology and Evolution, University of Chicago, Illinois, USA.
| | - Jue Ruan
- Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China. .,Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China.
| |
Collapse
|
27
|
Long Single-Molecule Reads Can Resolve the Complexity of the Influenza Virus Composed of Rare, Closely Related Mutant Variants. LECTURE NOTES IN COMPUTER SCIENCE 2016. [DOI: 10.1007/978-3-319-31957-5_12] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
|
28
|
Saliou A, Bidard FC, Lantz O, Stern MH, Vincent-Salomon A, Proudhon C, Pierga JY. Circulating tumor DNA for triple-negative breast cancer diagnosis and treatment decisions. Expert Rev Mol Diagn 2015; 16:39-50. [DOI: 10.1586/14737159.2016.1121100] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
|
29
|
Suyama Y, Matsuki Y. MIG-seq: an effective PCR-based method for genome-wide single-nucleotide polymorphism genotyping using the next-generation sequencing platform. Sci Rep 2015; 5:16963. [PMID: 26593239 PMCID: PMC4655332 DOI: 10.1038/srep16963] [Citation(s) in RCA: 152] [Impact Index Per Article: 16.9] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2015] [Accepted: 10/22/2015] [Indexed: 01/18/2023] Open
Abstract
Restriction-enzyme (RE)-based next-generation sequencing methods have revolutionized marker-assisted genetic studies; however, the use of REs has limited their widespread adoption, especially in field samples with low-quality DNA and/or small quantities of DNA. Here, we developed a PCR-based procedure to construct reduced representation libraries without RE digestion steps, representing de novo single-nucleotide polymorphism discovery, and its genotyping using next-generation sequencing. Using multiplexed inter-simple sequence repeat (ISSR) primers, thousands of genome-wide regions were amplified effectively from a wide variety of genomes, without prior genetic information. We demonstrated: 1) Mendelian gametic segregation of the discovered variants; 2) reproducibility of genotyping by checking its applicability for individual identification; and 3) applicability in a wide variety of species by checking standard population genetic analysis. This approach, called multiplexed ISSR genotyping by sequencing, should be applicable to many marker-assisted genetic studies with a wide range of DNA qualities and quantities.
Collapse
Affiliation(s)
- Yoshihisa Suyama
- Tohoku University, Kawatabi Field Science Center, Graduate School of Agricultural Science, 232-3 Yomogida, Naruko-onsen, Osaki, Miyagi 989-6711, Japan
| | - Yu Matsuki
- Tohoku University, Kawatabi Field Science Center, Graduate School of Agricultural Science, 232-3 Yomogida, Naruko-onsen, Osaki, Miyagi 989-6711, Japan
| |
Collapse
|
30
|
MSIplus for Integrated Colorectal Cancer Molecular Testing by Next-Generation Sequencing. J Mol Diagn 2015; 17:705-14. [PMID: 26322950 DOI: 10.1016/j.jmoldx.2015.05.008] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2015] [Revised: 04/27/2015] [Accepted: 05/26/2015] [Indexed: 12/30/2022] Open
Abstract
Molecular analysis of colon cancers currently requires multiphasic testing that uses various assays with different performance characteristics, adding cost and time to patient care. We have developed a single, next-generation sequencing assay to simultaneously evaluate colorectal cancers for mutations in relevant cancer genes (KRAS, NRAS, and BRAF) and for tumor microsatellite instability (MSI). In a sample set of 61 cases, the assay demonstrated overall sensitivity of 100% and specificity of 100% for identifying cancer-associated mutations, with a practical limit of detection at 2% mutant allele fraction. MSIplus was 97% sensitive (34 of 35 MSI-positive cases) and 100% specific (42 of 42 MSI-negative cases) for ascertaining MSI phenotype in a cohort of 78 tumor specimens. These performance characteristics were slightly better than for conventional multiplex PCR MSI testing (97% sensitivity and 95% specificity), which is based on comparison of microsatellite loci amplified from tumor and matched normal material, applied to the same specimen cohort. Because the assay uses an amplicon sequencing approach, it is rapid and appropriate for specimens with limited available material or fragmented DNA. This integrated testing strategy offers several advantages over existing methods, including a lack of need for matched normal material, sensitive and unbiased detection of variants in target genes, and an automated analysis pipeline enabling principled and reproducible identification of cancer-associated mutations and MSI status simultaneously.
Collapse
|
31
|
Vick B, Rothenberg M, Sandhöfer N, Carlet M, Finkenzeller C, Krupka C, Grunert M, Trumpp A, Corbacioglu S, Ebinger M, André MC, Hiddemann W, Schneider S, Subklewe M, Metzeler KH, Spiekermann K, Jeremias I. An advanced preclinical mouse model for acute myeloid leukemia using patients' cells of various genetic subgroups and in vivo bioluminescence imaging. PLoS One 2015; 10:e0120925. [PMID: 25793878 PMCID: PMC4368518 DOI: 10.1371/journal.pone.0120925] [Citation(s) in RCA: 70] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2014] [Accepted: 01/27/2015] [Indexed: 12/21/2022] Open
Abstract
Acute myeloid leukemia (AML) is a clinically and molecularly heterogeneous disease with poor outcome. Adequate model systems are required for preclinical studies to improve understanding of AML biology and to develop novel, rational treatment approaches. Xenografts in immunodeficient mice allow performing functional studies on patient-derived AML cells. We have established an improved model system that integrates serial retransplantation of patient-derived xenograft (PDX) cells in mice, genetic manipulation by lentiviral transduction, and essential quality controls by immunophenotyping and targeted resequencing of driver genes. 17/29 samples showed primary engraftment, 10/17 samples could be retransplanted and some of them allowed virtually indefinite serial transplantation. 5/6 samples were successfully transduced using lentiviruses. Neither serial transplantation nor genetic engineering markedly altered sample characteristics analyzed. Transgene expression was stable in PDX AML cells. Example given, recombinant luciferase enabled bioluminescence in vivo imaging and highly sensitive and reliable disease monitoring; imaging visualized minimal disease at 1 PDX cell in 10000 mouse bone marrow cells and facilitated quantifying leukemia initiating cells. We conclude that serial expansion, genetic engineering and imaging represent valuable tools to improve the individualized xenograft mouse model of AML. Prospectively, these advancements enable repetitive, clinically relevant studies on AML biology and preclinical treatment trials on genetically defined and heterogeneous subgroups.
Collapse
Affiliation(s)
- Binje Vick
- Group Apoptosis, Department of Gene Vectors, Helmholtz Zentrum München, German Research Center for Environmental Health, Munich, Germany
- German Cancer Consortium (DKTK), Heidelberg, Germany
- German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Maja Rothenberg
- Department of Internal Medicine III, University Hospital Grosshadern, Ludwig-Maximilians-Universität (LMU), Munich, Germany
| | - Nadine Sandhöfer
- German Cancer Consortium (DKTK), Heidelberg, Germany
- German Cancer Research Center (DKFZ), Heidelberg, Germany
- Department of Internal Medicine III, University Hospital Grosshadern, Ludwig-Maximilians-Universität (LMU), Munich, Germany
- Clinical Cooperation Group Leukemia, Helmholtz Zentrum München, German Research Center for Environmental Health, Munich, Germany
| | - Michela Carlet
- Group Apoptosis, Department of Gene Vectors, Helmholtz Zentrum München, German Research Center for Environmental Health, Munich, Germany
| | - Cornelia Finkenzeller
- Group Apoptosis, Department of Gene Vectors, Helmholtz Zentrum München, German Research Center for Environmental Health, Munich, Germany
| | - Christina Krupka
- Department of Internal Medicine III, University Hospital Grosshadern, Ludwig-Maximilians-Universität (LMU), Munich, Germany
- Clinical Cooperation Group Immunotherapy, Helmholtz Zentrum München, German Research Center for Environmental Health, Munich, Germany
| | - Michaela Grunert
- Group Apoptosis, Department of Gene Vectors, Helmholtz Zentrum München, German Research Center for Environmental Health, Munich, Germany
| | - Andreas Trumpp
- German Cancer Consortium (DKTK), Heidelberg, Germany
- German Cancer Research Center (DKFZ), Heidelberg, Germany
- Division of Stem Cells and Cancer, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Heidelberg Institute for Stem Cell Technology and Experimental Medicine (HI-STEM GmbH), Heidelberg, Germany
| | - Selim Corbacioglu
- Department of Pediatrics, University of Regensburg, Regensburg, Germany
| | - Martin Ebinger
- German Cancer Consortium (DKTK), Heidelberg, Germany
- German Cancer Research Center (DKFZ), Heidelberg, Germany
- Department of Pediatric Hematology/Oncology, University Children’s Hospital, Eberhard Karls Universität, Tuebingen, Germany
| | - Maya C. André
- Department of Pediatric Hematology/Oncology, University Children’s Hospital, Eberhard Karls Universität, Tuebingen, Germany
- Department of Pediatric Intensive Care Medicine, University Children's Hospital (UKBB), Basel, Switzerland
| | - Wolfgang Hiddemann
- German Cancer Consortium (DKTK), Heidelberg, Germany
- German Cancer Research Center (DKFZ), Heidelberg, Germany
- Department of Internal Medicine III, University Hospital Grosshadern, Ludwig-Maximilians-Universität (LMU), Munich, Germany
- Clinical Cooperation Group Leukemia, Helmholtz Zentrum München, German Research Center for Environmental Health, Munich, Germany
| | - Stephanie Schneider
- Department of Internal Medicine III, University Hospital Grosshadern, Ludwig-Maximilians-Universität (LMU), Munich, Germany
| | - Marion Subklewe
- German Cancer Consortium (DKTK), Heidelberg, Germany
- German Cancer Research Center (DKFZ), Heidelberg, Germany
- Department of Internal Medicine III, University Hospital Grosshadern, Ludwig-Maximilians-Universität (LMU), Munich, Germany
- Clinical Cooperation Group Immunotherapy, Helmholtz Zentrum München, German Research Center for Environmental Health, Munich, Germany
| | - Klaus H. Metzeler
- Department of Internal Medicine III, University Hospital Grosshadern, Ludwig-Maximilians-Universität (LMU), Munich, Germany
- Clinical Cooperation Group Leukemia, Helmholtz Zentrum München, German Research Center for Environmental Health, Munich, Germany
| | - Karsten Spiekermann
- German Cancer Consortium (DKTK), Heidelberg, Germany
- German Cancer Research Center (DKFZ), Heidelberg, Germany
- Department of Internal Medicine III, University Hospital Grosshadern, Ludwig-Maximilians-Universität (LMU), Munich, Germany
- Clinical Cooperation Group Leukemia, Helmholtz Zentrum München, German Research Center for Environmental Health, Munich, Germany
| | - Irmela Jeremias
- Group Apoptosis, Department of Gene Vectors, Helmholtz Zentrum München, German Research Center for Environmental Health, Munich, Germany
- German Cancer Consortium (DKTK), Heidelberg, Germany
- German Cancer Research Center (DKFZ), Heidelberg, Germany
- Department of Oncology, Dr von Haunersches Kinderspital, Ludwig Maximilians-Universität (LMU), Munich, Germany
- * E-mail:
| |
Collapse
|
32
|
Hsu AL, Kondrashova O, Lunke S, Love CJ, Meldrum C, Marquis-Nicholson R, Corboy G, Pham K, Wakefield M, Waring PM, Taylor GR. AmpliVar: mutation detection in high-throughput sequence from amplicon-based libraries. Hum Mutat 2015; 36:411-8. [PMID: 25664426 DOI: 10.1002/humu.22763] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2014] [Accepted: 01/21/2015] [Indexed: 12/30/2022]
Abstract
Conventional means of identifying variants in high-throughput sequencing align each read against a reference sequence, and then call variants at each position. Here, we demonstrate an orthogonal means of identifying sequence variation by grouping the reads as amplicons prior to any alignment. We used AmpliVar to make key-value hashes of sequence reads and group reads as individual amplicons using a table of flanking sequences. Low-abundance reads were removed according to a selectable threshold, and reads above this threshold were aligned as groups, rather than as individual reads, permitting the use of sensitive alignment tools. We show that this approach is more sensitive, more specific, and more computationally efficient than comparable methods for the analysis of amplicon-based high-throughput sequencing data. The method can be extended to enable alignment-free confirmation of variants seen in hybridization capture target-enrichment data.
Collapse
Affiliation(s)
- Arthur L Hsu
- Department of Pathology, The University of Melbourne, Parkville, Victoria, Australia
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
33
|
Gene expression-based prognostic and predictive tools in breast cancer. Breast Cancer 2015; 22:245-52. [PMID: 25874688 DOI: 10.1007/s12282-015-0594-y] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2014] [Accepted: 01/27/2015] [Indexed: 12/21/2022]
Abstract
Genomic assays measuring the expression of multiple genes have made their way into clinical practice and their utilization is now recommended by major international guidelines. A basic property of these tests is their capability to sub-divide patients into high- and low-risk cohorts thereby providing prognostic, and in certain settings, predictive decision support. Here, we summarize commercially available assays for breast cancer including RT-PCR and gene chip-based tests. Given the relative uncertainty in cancer treatment, multigene tests have the potential for a significant cost reduction as they can pinpoint those patients for whom chemotherapy proves to be unnecessary. However, concordance of risk assessment for an individual patient is still far from optimal. Additionally, emerging multigene approaches focus on predicting therapy response, which is a black spot of current tests. Promising techniques include the homologous recombination deficiency score, utilization of massive parallel sequencing to identify driver genes, employment of internet-based meta-analysis tools and investigation of miRNA expression signatures. Combination of multiple simultaneous analyses at diagnosis, including classical histopathological diagnostics, monogenic markers, genomic signatures and clinical parameters will most likely bring maximal benefit for patients. As the main driving force behind such genomic tests is the power to achieve cost reduction due to avoiding unnecessary systemic treatment, the future is most likely to hold a further proliferation of such assays.
Collapse
|
34
|
Do H, Dobrovic A. Sequence Artifacts in DNA from Formalin-Fixed Tissues: Causes and Strategies for Minimization. Clin Chem 2015; 61:64-71. [DOI: 10.1373/clinchem.2014.223040] [Citation(s) in RCA: 331] [Impact Index Per Article: 36.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Abstract
BACKGROUND
Precision medicine is dependent on identifying actionable mutations in tumors. Accurate detection of mutations is often problematic in formalin-fixed paraffin-embedded (FFPE) tissues. DNA extracted from formalin-fixed tissues is fragmented and also contains DNA lesions that are the sources of sequence artifacts. Sequence artifacts can be difficult to distinguish from true mutations, especially in the context of tumor heterogeneity, and are an increasing interpretive problem in this era of massively parallel sequencing. Understanding of the sources of sequence artifacts in FFPE tissues and implementation of preventative strategies are critical to improve the accurate detection of actionable mutations.
CONTENT
This mini-review focuses on DNA template lesions in FFPE tissues as the source of sequence artifacts in molecular analysis. In particular, fragmentation, base modification (including uracil and thymine deriving from cytosine deamination), and abasic sites are discussed as indirect or direct sources of sequence artifacts. We discuss strategies that can be implemented to minimize sequence artifacts and to distinguish true mutations from sequence artifacts. These strategies are applicable for the detection of actionable mutations in both single amplicon and massively parallel amplicon sequencing approaches.
SUMMARY
Because FFPE tissues are usually the only available material for DNA analysis, it is important to maximize the accurate informational content from FFPE DNA. Careful consideration of each step in the work flow is needed to minimize sequence artifacts. In addition, validation of actionable mutations either by appropriate experimental design or by orthogonal methods should be considered.
Collapse
Affiliation(s)
- Hongdo Do
- Translational Genomics and Epigenomics Laboratory, Olivia Newton-John Cancer Research Institute, Heidelberg, Victoria, Australia
- Department of Pathology, University of Melbourne, Parkville, Victoria, Australia
- School of Cancer Medicine, La Trobe University, Bundoora, Victoria, Australia
| | - Alexander Dobrovic
- Translational Genomics and Epigenomics Laboratory, Olivia Newton-John Cancer Research Institute, Heidelberg, Victoria, Australia
- Department of Pathology, University of Melbourne, Parkville, Victoria, Australia
- School of Cancer Medicine, La Trobe University, Bundoora, Victoria, Australia
| |
Collapse
|
35
|
Choudhary A, Mambo E, Sanford T, Boedigheimer M, Twomey B, Califano J, Hadd A, Oliner KS, Beaudenon S, Latham GJ, Adai AT. Evaluation of an integrated clinical workflow for targeted next-generation sequencing of low-quality tumor DNA using a 51-gene enrichment panel. BMC Med Genomics 2014; 7:62. [PMID: 25395014 PMCID: PMC4241214 DOI: 10.1186/s12920-014-0062-0] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2014] [Accepted: 10/22/2014] [Indexed: 12/21/2022] Open
Abstract
Background Improvements in both performance and cost for next-generation sequencing (NGS) have spurred its rapid adoption for clinical applications. We designed and optimized a pan-cancer target-enrichment panel for 51 well-established oncogenes and tumor suppressors, in conjunction with a bioinformatic pipeline informed by in-process controls and pre- and post-analytical quality control measures. Methods The evaluation of this workflow consisted of sequencing mixtures of intact DNA to establish analytical sensitivity and precision, utilization of heuristics to identify systematic artifacts, titration studies of intact and FFPE samples for input optimization, and incorporation of orthogonal sequencing strategies to increase both positive predictive value and variant detection. We also used 128 FFPE samples to assess clinical accuracy and incorporated the previously described quantitative functional index (QFI) for sample qualification as part of detailing complete system performance. Results We observed a concordance correlation coefficient of 0.99 between the observed versus expected percent variant at 250 ng input across 4 independent sequencing runs. A subset of the systematic variants were confirmed to be barely detectable on an independent sequencing platform (Wilcox signed-rank test p-value <10−16), and the incorporation of orthogonal sequencing strategies increased the harmonic mean of sensitivity and positive predictive value of mutation detection by 41%. In one cohort of FFPE tumor samples, coverage and inter-platform concordance were positively correlated with the QFI, emphasizing the need for pre-analytical sample quality control to reduce the risk of false positives and negatives. In a separate cohort of FFPE samples, the 51-gene panel achieved 78% sensitivity (95% CI = 56.3, 92.5) with 100% PPV (95% CI = 81.5, 100.0) based on known mutations at 7.9% median abundance. By sequencing specimens using an orthogonal NGS technology, sensitivity was improved to 87.0% (95% CI = 66.4,97.2) while maintaining PPV. Conclusions The results highlight the value of process integration in a comprehensive targeted NGS system, enabling both discovery and diagnostic applications, particularly when sequencing low-quality cancer specimens. Electronic supplementary material The online version of this article (doi:10.1186/s12920-014-0062-0) contains supplementary material, which is available to authorized users.
Collapse
|
36
|
Detection of minimal residual disease in NPM1-mutated acute myeloid leukemia by next-generation sequencing. Mod Pathol 2014; 27:1438-46. [PMID: 24743218 PMCID: PMC4201902 DOI: 10.1038/modpathol.2014.57] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2014] [Revised: 02/15/2014] [Accepted: 02/18/2014] [Indexed: 01/08/2023]
Abstract
Detection of minimal residual disease predicts adverse outcome in patients with acute myeloid leukemia. Currently, minimal residual disease may be detected by RQ-PCR or flow cytometry, both of which have practical and diagnostic limitations. Here, we describe a next-generation sequencing assay for minimal residual disease detection in NPM1-mutated acute myeloid leukemia, which encompasses ∼60% of patients with normal karyotype acute myeloid leukemia. Exon 12 of NPM1 was PCR amplified using sequencing adaptor-linked primers and deep sequenced to enable detection of low-prevalence, acute myeloid leukemia-specific activating mutations. We benchmarked our results against flow cytometry, the standard of care for acute myeloid leukemia minimal residual disease diagnosis at our institution. The performance of both approaches was evaluated using defined dilutions of an NPM1 mutation-positive cell line and longitudinal clinical samples from acute myeloid leukemia patients. Using defined control material, we found this assay sensitive to approximately 0.001% mutant cells, outperforming flow cytometry by an order of magnitude. Next-generation sequencing was precise and semiquantitative over four orders of magnitude. In 22 longitudinal samples from six acute myeloid leukemia patients, next-generation sequencing detected minimal residual disease in all samples deemed negative by flow cytometry. Further, in one-third of patients, sequencing detected alternate NPM1 mutations in addition to the patient's index mutation, consistent with tumor heterogeneity. Next-generation sequencing provides information without prior knowledge of NPM1 mutation subtype or validation of allele-specific probes as required for RQ-PCR assays, and without generation and interpretation of complex multidimensional flow cytometry data. This approach may complement current technologies to enhance patient-specific clinical decision-making.
Collapse
|
37
|
Watanabe M, Kusano J, Ohtaki S, Ishikura T, Katayama J, Koguchi A, Paumen M, Hayashi Y. Simultaneous genomic identification and profiling of a single cell using semiconductor-based next generation sequencing. Appl Transl Genom 2014; 3:70-7. [PMID: 27294018 PMCID: PMC4887956 DOI: 10.1016/j.atg.2014.05.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2014] [Revised: 05/19/2014] [Accepted: 05/29/2014] [Indexed: 12/02/2022]
Abstract
Combining single-cell methods and next-generation sequencing should provide a powerful means to understand single-cell biology and obviate the effects of sample heterogeneity. Here we report a single-cell identification method and seamless cancer gene profiling using semiconductor-based massively parallel sequencing. A549 cells (adenocarcinomic human alveolar basal epithelial cell line) were used as a model. Single-cell capture was performed using laser capture microdissection (LCM) with an Arcturus® XT system, and a captured single cell and a bulk population of A549 cells (≈ 106 cells) were subjected to whole genome amplification (WGA). For cell identification, a multiplex PCR method (AmpliSeq™ SNP HID panel) was used to enrich 136 highly discriminatory SNPs with a genotype concordance probability of 1031–35. For cancer gene profiling, we used mutation profiling that was performed in parallel using a hotspot panel for 50 cancer-related genes. Sequencing was performed using a semiconductor-based bench top sequencer. The distribution of sequence reads for both HID and Cancer panel amplicons was consistent across these samples. For the bulk population of cells, the percentages of sequence covered at coverage of more than 100 × were 99.04% for the HID panel and 98.83% for the Cancer panel, while for the single cell percentages of sequence covered at coverage of more than 100 × were 55.93% for the HID panel and 65.96% for the Cancer panel. Partial amplification failure or randomly distributed non-amplified regions across samples from single cells during the WGA procedures or random allele drop out probably caused these differences. However, comparative analyses showed that this method successfully discriminated a single A549 cancer cell from a bulk population of A549 cells. Thus, our approach provides a powerful means to overcome tumor sample heterogeneity when searching for somatic mutations.
Collapse
Affiliation(s)
- Manabu Watanabe
- Life Technologies Japan Ltd., a part of Thermo Fisher Scientific. 4-2-8 Shibaura, Minato-ku Tokyo 108-0023, Japan
| | - Junko Kusano
- Life Technologies Japan Ltd., a part of Thermo Fisher Scientific. 4-2-8 Shibaura, Minato-ku Tokyo 108-0023, Japan
| | - Shinsaku Ohtaki
- Life Technologies Japan Ltd., a part of Thermo Fisher Scientific. 4-2-8 Shibaura, Minato-ku Tokyo 108-0023, Japan
| | - Takashi Ishikura
- Life Technologies Japan Ltd., a part of Thermo Fisher Scientific. 4-2-8 Shibaura, Minato-ku Tokyo 108-0023, Japan
| | - Jin Katayama
- Life Technologies Japan Ltd., a part of Thermo Fisher Scientific. 4-2-8 Shibaura, Minato-ku Tokyo 108-0023, Japan
| | - Akira Koguchi
- Life Technologies Japan Ltd., a part of Thermo Fisher Scientific. 4-2-8 Shibaura, Minato-ku Tokyo 108-0023, Japan
| | - Michael Paumen
- Life Technologies Japan Ltd., a part of Thermo Fisher Scientific. 4-2-8 Shibaura, Minato-ku Tokyo 108-0023, Japan
| | - Yoshiharu Hayashi
- Life Technologies Japan Ltd., a part of Thermo Fisher Scientific. 4-2-8 Shibaura, Minato-ku Tokyo 108-0023, Japan
| |
Collapse
|
38
|
Zec H, Shin DJ, Wang TH. Novel droplet platforms for the detection of disease biomarkers. Expert Rev Mol Diagn 2014; 14:787-801. [PMID: 25109704 DOI: 10.1586/14737159.2014.945437] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Personalized medicine - healthcare based on individual genetic variation - has the potential to transform the way healthcare is delivered to patients. The promise of personalized medicine has been predicated on the predictive and diagnostic power of genomic and proteomic biomarkers. Biomarker screening may help improve health outcomes, for example, by identifying individuals' susceptibility to diseases and predicting how patients will respond to drugs. Microfluidic droplet technology offers an exciting opportunity to revolutionize the accessibility of personalized medicine. A framework for the role of droplet microfluidics in biomarker detection can be based on two main themes. Emulsion-based microdroplet platforms can provide new ways to measure and detect biomolecules. In addition, microdroplet platforms facilitate high-throughput screening of biomarkers. Meanwhile, surface-based droplet platforms provide an opportunity to develop miniaturized diagnostic systems. These platforms may function as portable benchtop environments that dramatically shorten the transition of a benchtop assay into a point-of-care format.
Collapse
Affiliation(s)
- Helena Zec
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore MD 21218, USA
| | | | | |
Collapse
|
39
|
Yu Z, Cao K, Tischler T, Stolle CA, Santani AB. Mung bean nuclease treatment increases capture specificity of microdroplet-PCR based targeted DNA enrichment. PLoS One 2014; 9:e103491. [PMID: 25058678 PMCID: PMC4110027 DOI: 10.1371/journal.pone.0103491] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2014] [Accepted: 07/01/2014] [Indexed: 12/04/2022] Open
Abstract
Targeted DNA enrichment coupled with next generation sequencing has been increasingly used for interrogation of select sub-genomic regions at high depth of coverage in a cost effective manner. Specificity measured by on-target efficiency is a key performance metric for target enrichment. Non-specific capture leads to off-target reads, resulting in waste of sequencing throughput on irrelevant regions. Microdroplet-PCR allows simultaneous amplification of up to thousands of regions in the genome and is among the most commonly used strategies for target enrichment. Here we show that carryover of single-stranded template genomic DNA from microdroplet-PCR constitutes a major contributing factor for off-target reads in the resultant libraries. Moreover, treatment of microdroplet-PCR enrichment products with a nuclease specific to single-stranded DNA alleviates off-target load and improves enrichment specificity. We propose that nuclease treatment of enrichment products should be incorporated in the workflow of targeted sequencing using microdroplet-PCR for target capture. These findings may have a broad impact on other PCR based applications for which removal of template DNA is beneficial.
Collapse
Affiliation(s)
- Zhenming Yu
- Division of Genomic Diagnostics and Department of Pathology and Laboratory Medicine, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, United States of America
- * E-mail: (ZY) (ZY); (ABS) (AS)
| | - Kajia Cao
- Division of Genomic Diagnostics and Department of Pathology and Laboratory Medicine, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, United States of America
| | - Tanya Tischler
- Division of Genomic Diagnostics and Department of Pathology and Laboratory Medicine, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, United States of America
| | - Catherine A. Stolle
- Division of Genomic Diagnostics and Department of Pathology and Laboratory Medicine, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, United States of America
| | - Avni B. Santani
- Division of Genomic Diagnostics and Department of Pathology and Laboratory Medicine, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, United States of America
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
- * E-mail: (ZY) (ZY); (ABS) (AS)
| |
Collapse
|
40
|
McCall CM, Mosier S, Thiess M, Debeljak M, Pallavajjala A, Beierl K, Deak KL, Datto MB, Gocke CD, Lin MT, Eshleman JR. False positives in multiplex PCR-based next-generation sequencing have unique signatures. J Mol Diagn 2014; 16:541-549. [PMID: 25017478 DOI: 10.1016/j.jmoldx.2014.06.001] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2013] [Revised: 03/02/2014] [Accepted: 06/16/2014] [Indexed: 01/11/2023] Open
Abstract
Next-generation sequencing shows great promise by allowing rapid mutational analysis of multiple genes in human cancers. Recently, we implemented the multiplex PCR-based Ion AmpliSeq Cancer Hotspot Panel (>200 amplicons in 50 genes) to evaluate EGFR, KRAS, and BRAF in lung and colorectal adenocarcinomas. In 10% of samples, automated analysis identified a novel G873R substitution mutation in EGFR. By examining reads individually, we found this mutation in >5% of reads in 50 of 291 samples and also found similar events in 18 additional amplicons. These apparent mutations are present only in short reads and within 10 bases of either end of the read. We therefore hypothesized that these were from panel primers promiscuously binding to nearly complementary sequences of nontargeted amplicons. Sequences around the mutations matched primer binding sites in the panel in 18 of 19 cases, thus likely corresponding to panel primers. Furthermore, because most primers did not show this effect, we demonstrated that next-generation sequencing may be used to better design multiplex PCR primers through iterative elimination of offending primers to minimize mispriming. Our results indicate the need for careful sequence analysis to avoid false-positive mutations that can arise in multiplex PCR panels. The AmpliSeq Cancer panel is a valuable tool for clinical diagnostics, provided awareness of potential artifacts.
Collapse
Affiliation(s)
- Chad M McCall
- Department of Pathology, The Sol Goldman Pancreatic Cancer Research Center, Johns Hopkins University School of Medicine, Baltimore, Maryland
| | - Stacy Mosier
- Department of Pathology, The Sol Goldman Pancreatic Cancer Research Center, Johns Hopkins University School of Medicine, Baltimore, Maryland
| | - Michele Thiess
- Department of Pathology, The Sol Goldman Pancreatic Cancer Research Center, Johns Hopkins University School of Medicine, Baltimore, Maryland
| | - Marija Debeljak
- Department of Pathology, The Sol Goldman Pancreatic Cancer Research Center, Johns Hopkins University School of Medicine, Baltimore, Maryland
| | - Aparna Pallavajjala
- Department of Pathology, The Sol Goldman Pancreatic Cancer Research Center, Johns Hopkins University School of Medicine, Baltimore, Maryland; Department of Oncology, The Sol Goldman Pancreatic Cancer Research Center, Johns Hopkins University School of Medicine, Baltimore, Maryland
| | - Katie Beierl
- Department of Pathology, The Sol Goldman Pancreatic Cancer Research Center, Johns Hopkins University School of Medicine, Baltimore, Maryland
| | - Kristen L Deak
- Department of Pathology, Duke University School of Medicine, Durham, North Carolina
| | - Michael B Datto
- Department of Pathology, Duke University School of Medicine, Durham, North Carolina
| | - Christopher D Gocke
- Department of Pathology, The Sol Goldman Pancreatic Cancer Research Center, Johns Hopkins University School of Medicine, Baltimore, Maryland
| | - Ming-Tseh Lin
- Department of Pathology, The Sol Goldman Pancreatic Cancer Research Center, Johns Hopkins University School of Medicine, Baltimore, Maryland
| | - James R Eshleman
- Department of Pathology, The Sol Goldman Pancreatic Cancer Research Center, Johns Hopkins University School of Medicine, Baltimore, Maryland; Department of Oncology, The Sol Goldman Pancreatic Cancer Research Center, Johns Hopkins University School of Medicine, Baltimore, Maryland.
| |
Collapse
|
41
|
Salipante SJ, Scroggins SM, Hampel HL, Turner EH, Pritchard CC. Microsatellite instability detection by next generation sequencing. Clin Chem 2014; 60:1192-9. [PMID: 24987110 DOI: 10.1373/clinchem.2014.223677] [Citation(s) in RCA: 287] [Impact Index Per Article: 28.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
BACKGROUND Microsatellite instability (MSI) is a useful phenotype in cancer diagnosis and prognosis. Nevertheless, methods to detect MSI status from next generation DNA sequencing (NGS) data are underdeveloped. METHODS We developed an approach to detect the MSI phenotype using NGS (mSINGS). The method was used to evaluate mononucleotide microsatellite loci that were incidentally sequenced after targeted gene enrichment and could be applied to gene or exome capture panels designed for other purposes. For each microsatellite locus, the number of differently sized repeats in experimental samples were quantified and compared to a population of normal controls. Loci were considered unstable if the experimental number of repeats was statistically greater than in the control population. MSI status was determined by the fraction of unstable microsatellite loci. RESULTS We examined data from 324 samples generated using targeted gene capture assays of 3 different sizes, ranging from a 0.85-Mb to a 44-Mb exome design and incorporating from 15 to 2957 microsatellite markers. When we compared mSING results to MSI-PCR as a gold standard for 108 cases, we found the approach to be both diagnostically sensitive (range of 96.4% to 100% across 3 panels) and specific (range of 97.2% to 100%) for determining MSI status. The fraction of unstable microsatellite markers calculated from sequencing data correlated with the number of unstable loci detected by conventional MSI-PCR testing. CONCLUSIONS NGS data can enable highly accurate detection of MSI, even from limited capture designs. This novel approach offers several advantages over existing PCR-based methods.
Collapse
Affiliation(s)
| | | | - Heather L Hampel
- Department of Internal Medicine, Division of Human Genetics, The Ohio State University, Columbus, OH
| | - Emily H Turner
- Department of Laboratory Medicine, University of Washington, Seattle WA
| | - Colin C Pritchard
- Department of Laboratory Medicine, University of Washington, Seattle WA;
| |
Collapse
|
42
|
Hanna DMT, Fellowes A, Vedururu R, Mechinaud F, Hansford JR. A unique case of refractory primary mediastinal B-cell lymphoma with JAK3 mutation and the role for targeted therapy. Haematologica 2014; 99:e156-8. [PMID: 24837469 DOI: 10.3324/haematol.2014.108142] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Affiliation(s)
- Diane M T Hanna
- Children's Cancer Centre, Royal Children's Hospital, Victoria, Australia
| | - Andrew Fellowes
- Molecular Pathology Laboratory, Peter MacCallum Cancer Centre, Victoria, Australia
| | - Ravikiran Vedururu
- Molecular Pathology Laboratory, Peter MacCallum Cancer Centre, Victoria, Australia
| | | | - Jordan R Hansford
- Children's Cancer Centre, Royal Children's Hospital, Victoria, Australia
| |
Collapse
|
43
|
Roulland S, Kelly RS, Morgado E, Sungalee S, Solal-Celigny P, Colombat P, Jouve N, Palli D, Pala V, Tumino R, Panico S, Sacerdote C, Quirós JR, Gonzáles CA, Sánchez MJ, Dorronsoro M, Navarro C, Barricarte A, Tjønneland A, Olsen A, Overvad K, Canzian F, Kaaks R, Boeing H, Drogan D, Nieters A, Clavel-Chapelon F, Trichopoulou A, Trichopoulos D, Lagiou P, Bueno-de-Mesquita HB, Peeters PHM, Vermeulen R, Hallmans G, Melin B, Borgquist S, Carlson J, Lund E, Weiderpass E, Khaw KT, Wareham N, Key TJ, Travis RC, Ferrari P, Romieu I, Riboli E, Salles G, Vineis P, Nadel B. t(14;18) Translocation: A predictive blood biomarker for follicular lymphoma. J Clin Oncol 2014; 32:1347-55. [PMID: 24687831 DOI: 10.1200/jco.2013.52.8190] [Citation(s) in RCA: 94] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
PURPOSE The (14;18) translocation constitutes both a genetic hallmark and critical early event in the natural history of follicular lymphoma (FL). However, t(14;18) is also detectable in the blood of otherwise healthy persons, and its relationship with progression to disease remains unclear. Here we sought to determine whether t(14;18)-positive cells in healthy individuals represent tumor precursors and whether their detection could be used as an early predictor for FL. PARTICIPANTS AND METHODS Among 520,000 healthy participants enrolled onto the EPIC (European Prospective Investigation Into Cancer and Nutrition) cohort, we identified 100 who developed FL 2 to 161 months after enrollment. Prediagnostic blood from these and 218 controls were screened for t(14;18) using sensitive polymerase chain reaction-based assays. Results were subsequently validated in an independent cohort (65 case participants; 128 controls). Clonal relationships between t(14;18) cells and FL were also assessed by molecular backtracking of paired prediagnostic blood and tumor samples. RESULTS Clonal analysis of t(14;18) junctions in paired prediagnostic blood versus tumor samples demonstrated that progression to FL occurred from t(14;18)-positive committed precursors. Furthermore, healthy participants at enrollment who developed FL up to 15 years later showed a markedly higher t(14;18) prevalence and frequency than controls (P < .001). Altogether, we estimated a 23-fold higher risk of subsequent FL in blood samples associated with a frequency > 10(-4) (odds ratio, 23.17; 95% CI, 9.98 to 67.31; P < .001). Remarkably, risk estimates remained high and significant up to 15 years before diagnosis. CONCLUSION High t(14;18) frequency in blood from healthy individuals defines the first predictive biomarker for FL, effective years before diagnosis.
Collapse
MESH Headings
- Adult
- Aged
- Biomarkers, Tumor/blood
- Biomarkers, Tumor/genetics
- Case-Control Studies
- Chromosomes, Human, Pair 14
- Chromosomes, Human, Pair 18
- Cohort Studies
- Europe/epidemiology
- Female
- Humans
- Lymphoma, Follicular/blood
- Lymphoma, Follicular/epidemiology
- Lymphoma, Follicular/genetics
- Male
- Middle Aged
- Molecular Epidemiology
- Polymerase Chain Reaction/methods
- Prevalence
- Translocation, Genetic
Collapse
Affiliation(s)
- Sandrine Roulland
- Sandrine Roulland, Ester Morgado, Stéphanie Sungalee, Nathalie Jouve, and Bertrand Nadel, Aix-Marseille Université, Institut National de la Santé et de la Recherche Médicale (INSERM) U1104, and Centre National de la Recherche Scientifique (CNRS) Unités Mixtes de Recherche (UMR) 7280, Marseille; Philippe Solal-Celigny, Jean Bernard Center, Le Mans; Philippe Colombat, Bretonneau University Hospital, Tours; Françoise Clavel-Chapelon, INSERM U1018 Centre de Recherche en Epidémiologie et Santé des Populations, Villejuif; Pietro Ferrari and Isabelle Romieu, International Agency for Research on Cancer, Lyon; Gilles Salles, Hospices Civils de Lyon, Université de Lyon, UMR CNRS 5239, Pierre Bénite, France; Rachel S. Kelly, Petra H.M. Peeters, Roel Vermeulen, Elio Riboli, and Paolo Vineis, School of Public Health, Imperial College London, London; Kay-Tee Khaw, University of Cambridge; Nick Wareham, Institute of Metabolic Science, Cambridge; Timothy J. Key and Ruth C. Travis, University of Oxford, Oxford, United Kingdom; Domenico Palli, Istituto per lo Studio e la Prevenzione Oncologica, Florence; Valeria Pala, Fondazione Istituto di Ricovero e Cura a Carattere Scientifico, Istituto Nazionale Tumori, Milan; Rosario Tumino, "Civile-M.P. Arezzo" Hospital, Ragusa; Salvatore Panico, Federico II University, Naples; Carlotta Sacerdote, Centro di Riferimento per l'Epidemiologia e la Prevenzione Oncologica-Piemonte, Torino, Italy; José R. Quirós, Public Health and Health Planning Directorate, Asturias; Carlos A. Gonzáles, Catalan Institute of Oncology, Barcelona; Maria-José Sánchez, Andalusian School of Public Health and Biomedical Research Centre Network for Epidemiology and Public Health (CIBERESP), Granada; Miren Dorronsoro, Basque Regional Health Department and CIBERESP Biodonostia, San Sebastian; Carmen Navarro, Murcia Regional Health Council, Universidad de Murcia, and CIBERESP, Murcia; Aurelio Barricarte, Navarre Public Health Institute and CIBERESP, Pamplona, Sp
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
44
|
Roth A, Khattra J, Yap D, Wan A, Laks E, Biele J, Ha G, Aparicio S, Bouchard-Côté A, Shah SP. PyClone: statistical inference of clonal population structure in cancer. Nat Methods 2014; 11:396-8. [PMID: 24633410 PMCID: PMC4864026 DOI: 10.1038/nmeth.2883] [Citation(s) in RCA: 680] [Impact Index Per Article: 68.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2013] [Accepted: 01/31/2014] [Indexed: 12/25/2022]
Abstract
We introduce PyClone, a statistical model for inference of clonal population structures in cancers. PyClone is a Bayesian clustering method for grouping sets of deeply sequenced somatic mutations into putative clonal clusters while estimating their cellular prevalences and accounting for allelic imbalances introduced by segmental copy-number changes and normal-cell contamination. Single-cell sequencing validation demonstrates PyClone's accuracy.
Collapse
Affiliation(s)
- Andrew Roth
- Bioinformatics Graduate Program, University Of British Columbia, Vancouver, Canada
- Molecular Oncology, British Columbia Cancer Research Centre, Vancouver, BC V5Z 1L3, Canada
| | - Jaswinder Khattra
- Molecular Oncology, British Columbia Cancer Research Centre, Vancouver, BC V5Z 1L3, Canada
| | - Damian Yap
- Molecular Oncology, British Columbia Cancer Research Centre, Vancouver, BC V5Z 1L3, Canada
| | - Adrian Wan
- Molecular Oncology, British Columbia Cancer Research Centre, Vancouver, BC V5Z 1L3, Canada
| | - Emma Laks
- Molecular Oncology, British Columbia Cancer Research Centre, Vancouver, BC V5Z 1L3, Canada
| | - Justina Biele
- Molecular Oncology, British Columbia Cancer Research Centre, Vancouver, BC V5Z 1L3, Canada
| | - Gavin Ha
- Bioinformatics Graduate Program, University Of British Columbia, Vancouver, Canada
- Molecular Oncology, British Columbia Cancer Research Centre, Vancouver, BC V5Z 1L3, Canada
| | - Samuel Aparicio
- Molecular Oncology, British Columbia Cancer Research Centre, Vancouver, BC V5Z 1L3, Canada
- Department of Pathology and Laboratory Medicine, University of British Columbia, Vancouver, BC, V6T 2B5, Canada
| | | | - Sohrab P. Shah
- Molecular Oncology, British Columbia Cancer Research Centre, Vancouver, BC V5Z 1L3, Canada
- Department of Pathology and Laboratory Medicine, University of British Columbia, Vancouver, BC, V6T 2B5, Canada
| |
Collapse
|
45
|
Ankala A, Hegde M. Genomic Technologies and the New Era of Genomic Medicine. J Mol Diagn 2014; 16:7-10. [DOI: 10.1016/j.jmoldx.2013.11.001] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2013] [Revised: 11/08/2013] [Accepted: 11/12/2013] [Indexed: 10/26/2022] Open
|
46
|
Leung F, Diamandis EP, Kulasingam V. Ovarian Cancer Biomarkers. Adv Clin Chem 2014. [DOI: 10.1016/b978-0-12-801401-1.00002-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
47
|
Targeted-capture massively-parallel sequencing enables robust detection of clinically informative mutations from formalin-fixed tumours. Sci Rep 2013; 3:3494. [PMID: 24336498 PMCID: PMC3861801 DOI: 10.1038/srep03494] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2013] [Accepted: 11/27/2013] [Indexed: 01/09/2023] Open
Abstract
Massively parallel sequencing offers the ability to interrogate a tumour biopsy for multiple mutational changes. For clinical samples, methodologies must enable maximal extraction of available sequence information from formalin-fixed and paraffin-embedded (FFPE) material. We assessed the use of targeted capture for mutation detection in FFPE DNA. The capture probes targeted the coding region of all known kinase genes and selected oncogenes and tumour suppressor genes. Seven melanoma cell lines and matching FFPE xenograft DNAs were sequenced. An informatics pipeline was developed to identify variants and contaminating mouse reads. Concordance of 100% was observed between unfixed and formalin-fixed for reported COSMIC variants including BRAF V600E. mutations in genes not conventionally screened including ERBB4, ATM, STK11 and CDKN2A were readily detected. All regions were adequately covered with independent reads regardless of GC content. This study indicates that hybridisation capture is a robust approach for massively parallel sequencing of FFPE samples.
Collapse
|
48
|
Harismendy O, Schwab RB, Alakus H, Yost SE, Matsui H, Hasteh F, Wallace AM, Park HL, Madlensky L, Parker B, Carpenter PM, Jepsen K, Anton-Culver H, Frazer KA. Evaluation of ultra-deep targeted sequencing for personalized breast cancer care. Breast Cancer Res 2013; 15:R115. [PMID: 24326041 PMCID: PMC3978701 DOI: 10.1186/bcr3584] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2013] [Accepted: 12/06/2013] [Indexed: 01/24/2023] Open
Abstract
INTRODUCTION The increasing number of targeted therapies, together with a deeper understanding of cancer genetics and drug response, have prompted major healthcare centers to implement personalized treatment approaches relying on high-throughput tumor DNA sequencing. However, the optimal way to implement this transformative methodology is not yet clear. Current assays may miss important clinical information such as the mutation allelic fraction, the presence of sub-clones or chromosomal rearrangements, or the distinction between inherited variants and somatic mutations. Here, we present the evaluation of ultra-deep targeted sequencing (UDT-Seq) to generate and interpret the molecular profile of 38 breast cancer patients from two academic medical centers. METHODS We sequenced 47 genes in matched germline and tumor DNA samples from 38 breast cancer patients. The selected genes, or the pathways they belong to, can be targeted by drugs or are important in familial cancer risk or drug metabolism. RESULTS Relying on the added value of sequencing matched tumor and germline DNA and using a dedicated analysis, UDT-Seq has a high sensitivity to identify mutations in tumors with low malignant cell content. Applying UDT-Seq to matched tumor and germline specimens from the 38 patients resulted in a proposal for at least one targeted therapy for 22 patients, the identification of tumor sub-clones in 3 patients, the suggestion of potential adverse drug effects in 3 patients and a recommendation for genetic counseling for 2 patients. CONCLUSION Overall our study highlights the additional benefits of a sequencing strategy, which includes germline DNA and is optimized for heterogeneous tumor tissues.
Collapse
Affiliation(s)
- Olivier Harismendy
- Division of Genome Information Sciences, Department of Pediatrics and Rady Children’s Hospital, University of California San Diego, 9500 Gilman Drive, La Jolla CA 92093, USA
- Moores UCSD Cancer Center, School of Medicine, University of California San Diego, 3855 Health Science Drive, La Jolla CA 92093, USA
- Clinical and Translational Science Institute, University of California San Diego, 9500 Gilman Drive, La Jolla CA 92093, USA
| | - Richard B Schwab
- Moores UCSD Cancer Center, School of Medicine, University of California San Diego, 3855 Health Science Drive, La Jolla CA 92093, USA
- Department of Medicine, School of Medicine, University of California San Diego, 9500 Gilman Drive, La Jolla CA 92093, USA
| | - Hakan Alakus
- Division of Genome Information Sciences, Department of Pediatrics and Rady Children’s Hospital, University of California San Diego, 9500 Gilman Drive, La Jolla CA 92093, USA
- Department of Pathology, School of Medicine, University of California San Diego, 9500 Gilman Drive, La Jolla CA 92093, USA
| | - Shawn E Yost
- Division of Genome Information Sciences, Department of Pediatrics and Rady Children’s Hospital, University of California San Diego, 9500 Gilman Drive, La Jolla CA 92093, USA
- Department of Surgery, School of Medicine, University of California San Diego, 9500 Gilman Drive, La Jolla CA 92093, USA
| | - Hiroko Matsui
- Division of Genome Information Sciences, Department of Pediatrics and Rady Children’s Hospital, University of California San Diego, 9500 Gilman Drive, La Jolla CA 92093, USA
| | - Farnaz Hasteh
- Bioinformatics Graduate Program, University of California San Diego, 9500 Gilman Drive, La Jolla CA 92093, USA
| | - Anne M Wallace
- Moores UCSD Cancer Center, School of Medicine, University of California San Diego, 3855 Health Science Drive, La Jolla CA 92093, USA
- Department of Family and Preventive Medicine, School of Medicine, University of California San Diego, La Jolla CA, USA
| | - Hannah L Park
- Institute for Genomic Medicine, University of California San Diego, 9500 Gilman Drive, La Jolla CA 92093, USA
| | - Lisa Madlensky
- Moores UCSD Cancer Center, School of Medicine, University of California San Diego, 3855 Health Science Drive, La Jolla CA 92093, USA
- Department of Epidemiology, School of Medicine, University of California Irvine, 252 Irvine Hall, Irvine CA 92697, USA
| | - Barbara Parker
- Department of Medicine, School of Medicine, University of California San Diego, 9500 Gilman Drive, La Jolla CA 92093, USA
| | - Philip M Carpenter
- Department of Pathology and Laboratory Medicine, School of Medicine, University of California Irvine, 252 Irvine Hall, Irvine CA 92697, USA
| | - Kristen Jepsen
- Division of Genome Information Sciences, Department of Pediatrics and Rady Children’s Hospital, University of California San Diego, 9500 Gilman Drive, La Jolla CA 92093, USA
| | - Hoda Anton-Culver
- Institute for Genomic Medicine, University of California San Diego, 9500 Gilman Drive, La Jolla CA 92093, USA
| | - Kelly A Frazer
- Division of Genome Information Sciences, Department of Pediatrics and Rady Children’s Hospital, University of California San Diego, 9500 Gilman Drive, La Jolla CA 92093, USA
- Moores UCSD Cancer Center, School of Medicine, University of California San Diego, 3855 Health Science Drive, La Jolla CA 92093, USA
- Clinical and Translational Science Institute, University of California San Diego, 9500 Gilman Drive, La Jolla CA 92093, USA
- Department of General, Visceral and Cancer Surgery, University of Cologne, Frangenheimstraße 4, 50931, Köln Germany
| |
Collapse
|
49
|
Patel A, Schwab R, Liu YT, Bafna V. Amplification and thrifty single-molecule sequencing of recurrent somatic structural variations. Genome Res 2013; 24:318-28. [PMID: 24307551 PMCID: PMC3912422 DOI: 10.1101/gr.161497.113] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Deletion of tumor-suppressor genes as well as other genomic rearrangements pervade cancer genomes across numerous types of solid tumor and hematologic malignancies. However, even for a specific rearrangement, the breakpoints may vary between individuals, such as the recurrent CDKN2A deletion. Characterizing the exact breakpoints for structural variants (SVs) is useful for designating patient-specific tumor biomarkers. We propose AmBre (Amplification of Breakpoints), a method to target SV breakpoints occurring in samples composed of heterogeneous tumor and germline DNA. Additionally, AmBre validates SVs called by whole-exome/genome sequencing and hybridization arrays. AmBre involves a PCR-based approach to amplify the DNA segment containing an SV's breakpoint and then confirms breakpoints using sequencing by Pacific Biosciences RS. To amplify breakpoints with PCR, primers tiling specified target regions are carefully selected with a simulated annealing algorithm to minimize off-target amplification and maximize efficiency at capturing all possible breakpoints within the target regions. To confirm correct amplification and obtain breakpoints, PCR amplicons are combined without barcoding and simultaneously long-read sequenced using a single SMRT cell. Our algorithm efficiently separates reads based on breakpoints. Each read group supporting the same breakpoint corresponds with an amplicon and a consensus amplicon sequence is called. AmBre was used to discover CDKN2A deletion breakpoints in cancer cell lines: A549, CEM, Detroit562, MOLT4, MCF7, and T98G. Also, we successfully assayed RUNX1–RUNX1T1 reciprocal translocations by finding both breakpoints in the Kasumi-1 cell line. AmBre successfully targets SVs where DNA harboring the breakpoints are present in 1:1000 mixtures.
Collapse
Affiliation(s)
- Anand Patel
- Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, California 92093, USA
| | | | | | | |
Collapse
|
50
|
Spencer DH, Tyagi M, Vallania F, Bredemeyer AJ, Pfeifer JD, Mitra RD, Duncavage EJ. Performance of common analysis methods for detecting low-frequency single nucleotide variants in targeted next-generation sequence data. J Mol Diagn 2013; 16:75-88. [PMID: 24211364 DOI: 10.1016/j.jmoldx.2013.09.003] [Citation(s) in RCA: 90] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2013] [Revised: 08/16/2013] [Accepted: 09/04/2013] [Indexed: 12/31/2022] Open
Abstract
Next-generation sequencing (NGS) is becoming a common approach for clinical testing of oncology specimens for mutations in cancer genes. Unlike inherited variants, cancer mutations may occur at low frequencies because of contamination from normal cells or tumor heterogeneity and can therefore be challenging to detect using common NGS analysis tools, which are often designed for constitutional genomic studies. We generated high-coverage (>1000×) NGS data from synthetic DNA mixtures with variant allele fractions (VAFs) of 25% to 2.5% to assess the performance of four variant callers, SAMtools, Genome Analysis Toolkit, VarScan2, and SPLINTER, in detecting low-frequency variants. SAMtools had the lowest sensitivity and detected only 49% of variants with VAFs of approximately 25%; whereas the Genome Analysis Toolkit, VarScan2, and SPLINTER detected at least 94% of variants with VAFs of approximately 10%. VarScan2 and SPLINTER achieved sensitivities of 97% and 89%, respectively, for variants with observed VAFs of 1% to 8%, with >98% sensitivity and >99% positive predictive value in coding regions. Coverage analysis demonstrated that >500× coverage was required for optimal performance. The specificity of SPLINTER improved with higher coverage, whereas VarScan2 yielded more false positive results at high coverage levels, although this effect was abrogated by removing low-quality reads before variant identification. Finally, we demonstrate the utility of high-sensitivity variant callers with data from 15 clinical lung cancers.
Collapse
Affiliation(s)
- David H Spencer
- Department of Pathology and Immunology, Washington University, St. Louis, Missouri
| | - Manoj Tyagi
- Department of Genetics, Washington University, St. Louis, Missouri
| | - Francesco Vallania
- Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | | | - John D Pfeifer
- Department of Pathology and Immunology, Washington University, St. Louis, Missouri
| | - Rob D Mitra
- Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - Eric J Duncavage
- Department of Pathology and Immunology, Washington University, St. Louis, Missouri.
| |
Collapse
|