1
|
Maruzani R, Brierley L, Jorgensen A, Fowler A. Benchmarking UMI-aware and standard variant callers for low frequency ctDNA variant detection. BMC Genomics 2024; 25:827. [PMID: 39227777 PMCID: PMC11370058 DOI: 10.1186/s12864-024-10737-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Accepted: 08/22/2024] [Indexed: 09/05/2024] Open
Abstract
BACKGROUND Circulating tumour DNA (ctDNA) is a subset of cell free DNA (cfDNA) released by tumour cells into the bloodstream. Circulating tumour DNA has shown great potential as a biomarker to inform treatment in cancer patients. Collecting ctDNA is minimally invasive and reflects the entire genetic makeup of a patient's cancer. ctDNA variants in NGS data can be difficult to distinguish from sequencing and PCR artefacts due to low abundance, particularly in the early stages of cancer. Unique Molecular Identifiers (UMIs) are short sequences ligated to the sequencing library before amplification. These sequences are useful for filtering out low frequency artefacts. The utility of ctDNA as a cancer biomarker depends on accurate detection of cancer variants. RESULTS In this study, we benchmarked six variant calling tools, including two UMI-aware callers for their ability to call ctDNA variants. The standard variant callers tested included Mutect2, bcftools, LoFreq and FreeBayes. The UMI-aware variant callers benchmarked were UMI-VarCal and UMIErrorCorrect. We used both datasets with known variants spiked in at low frequencies, and datasets containing ctDNA, and generated synthetic UMI sequences for these datasets. Variant callers displayed different preferences for sensitivity and specificity. Mutect2 showed high sensitivity, while returning more privately called variants than any other caller in data without synthetic UMIs - an indicator of false positive variant discovery. In data encoded with synthetic UMIs, UMI-VarCal detected fewer putative false positive variants than all other callers in synthetic datasets. Mutect2 showed a balance between high sensitivity and specificity in data encoded with synthetic UMIs. CONCLUSIONS Our results indicate UMI-aware variant callers have potential to improve sensitivity and specificity in calling low frequency ctDNA variants over standard variant calling tools. There is a growing need for further development of UMI-aware variant calling tools if effective early detection methods for cancer using ctDNA samples are to be realised.
Collapse
Affiliation(s)
- Rugare Maruzani
- Department of Health Data Science, Institute of Population Health, University of Liverpool, Waterhouse Building, Block F, Brownlow Street, Liverpool, L69 3GF, UK.
| | - Liam Brierley
- Department of Health Data Science, Institute of Population Health, University of Liverpool, Waterhouse Building, Block F, Brownlow Street, Liverpool, L69 3GF, UK
- MRC-University of Glasgow Centre for Virus Research, University of Glasgow, Garscube Campus, 464 Bearsden Road, Glasgow, G61 1QH, UK
| | - Andrea Jorgensen
- Department of Health Data Science, Institute of Population Health, University of Liverpool, Waterhouse Building, Block F, Brownlow Street, Liverpool, L69 3GF, UK
| | - Anna Fowler
- Department of Health Data Science, Institute of Population Health, University of Liverpool, Waterhouse Building, Block F, Brownlow Street, Liverpool, L69 3GF, UK
| |
Collapse
|
2
|
Furtado LV, Bifulco C, Dolderer D, Hsiao SJ, Kipp BR, Lindeman NI, Ritterhouse LL, Temple-Smolkin RL, Zehir A, Nowak JA. Recommendations for Tumor Mutational Burden Assay Validation and Reporting: A Joint Consensus Recommendation of the Association for Molecular Pathology, College of American Pathologists, and Society for Immunotherapy of Cancer. J Mol Diagn 2024; 26:653-668. [PMID: 38851389 DOI: 10.1016/j.jmoldx.2024.05.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Revised: 04/05/2024] [Accepted: 05/07/2024] [Indexed: 06/10/2024] Open
Abstract
Tumor mutational burden (TMB) has been recognized as a predictive biomarker for immunotherapy response in several tumor types. Several laboratories offer TMB testing, but there is significant variation in how TMB is calculated, reported, and interpreted among laboratories. TMB standardization efforts are underway, but no published guidance for TMB validation and reporting is currently available. Recognizing the current challenges of clinical TMB testing, the Association for Molecular Pathology convened a multidisciplinary collaborative working group with representation from the American Society of Clinical Oncology, the College of American Pathologists, and the Society for the Immunotherapy of Cancer to review the laboratory practices surrounding TMB and develop recommendations for the analytical validation and reporting of TMB testing based on survey data, literature review, and expert consensus. These recommendations encompass pre-analytical, analytical, and postanalytical factors of TMB analysis, and they emphasize the relevance of comprehensive methodological descriptions to allow comparability between assays.
Collapse
Affiliation(s)
- Larissa V Furtado
- The Tumor Mutational Burden Working Group of the Clinical Practice Committee, Association for Molecular Pathology, Rockville, Maryland; Department of Pathology, St. Jude Children's Research Hospital, Memphis, Tennessee.
| | - Carlo Bifulco
- The Tumor Mutational Burden Working Group of the Clinical Practice Committee, Association for Molecular Pathology, Rockville, Maryland; Department of Pathology, Providence Portland Medical Center, Portland, Oregon
| | - Daniel Dolderer
- The Tumor Mutational Burden Working Group of the Clinical Practice Committee, Association for Molecular Pathology, Rockville, Maryland; Department of Pathology, Jupiter Medical Center, Jupiter, Florida
| | - Susan J Hsiao
- The Tumor Mutational Burden Working Group of the Clinical Practice Committee, Association for Molecular Pathology, Rockville, Maryland; Department of Pathology and Cell Biology, Columbia University Medical Center, New York, New York
| | - Benjamin R Kipp
- The Tumor Mutational Burden Working Group of the Clinical Practice Committee, Association for Molecular Pathology, Rockville, Maryland; Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, Minnesota
| | - Neal I Lindeman
- The Tumor Mutational Burden Working Group of the Clinical Practice Committee, Association for Molecular Pathology, Rockville, Maryland; Department of Pathology, Weill Cornell Medicine, New York, New York
| | - Lauren L Ritterhouse
- The Tumor Mutational Burden Working Group of the Clinical Practice Committee, Association for Molecular Pathology, Rockville, Maryland; Department of Pathology, Massachusetts General Hospital, Boston, Massachusetts
| | | | - Ahmet Zehir
- The Tumor Mutational Burden Working Group of the Clinical Practice Committee, Association for Molecular Pathology, Rockville, Maryland; Memorial Sloan Kettering Cancer Center, New York, New York
| | - Jonathan A Nowak
- The Tumor Mutational Burden Working Group of the Clinical Practice Committee, Association for Molecular Pathology, Rockville, Maryland; Department of Pathology, Brigham and Women's Hospital, Boston, Massachusetts
| |
Collapse
|
3
|
Garcia-Pedemonte D, Carcereny A, Gregori J, Quer J, Garcia-Cehic D, Guerrero L, Ceretó-Massagué A, Abid I, Bosch A, Costafreda MI, Pintó RM, Guix S. Comparison of Nanopore and Synthesis-Based Next-Generation Sequencing Platforms for SARS-CoV-2 Variant Monitoring in Wastewater. Int J Mol Sci 2023; 24:17184. [PMID: 38139015 PMCID: PMC10743471 DOI: 10.3390/ijms242417184] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2023] [Revised: 11/30/2023] [Accepted: 12/04/2023] [Indexed: 12/24/2023] Open
Abstract
Shortly after the beginning of the SARS-CoV-2 pandemic, many countries implemented sewage sentinel systems to monitor the circulation of the virus in the population. A fundamental part of these surveillance programs is the variant tracking through sequencing approaches to monitor and identify new variants or mutations that may be of importance. Two of the main sequencing platforms are Illumina and Oxford Nanopore Technologies. Here, we compare the performance of MiSeq (Illumina) and MinION (Oxford Nanopore Technologies), as well as two different data processing pipelines, to determine the effect they may have on the results. MiSeq showed higher sequencing coverage, lower error rate, and better capacity to detect and accurately estimate variant abundances than MinION R9.4.1 flow cell data. The use of different variant callers (LoFreq and iVar) and approaches to calculate the variant proportions had a remarkable impact on the results generated from wastewater samples. Freyja, coupled with iVar, may be more sensitive and accurate than LoFreq, especially with MinION data, but it comes at the cost of having a higher error rate. The analysis of MinION R10.4.1 flow cell data using Freyja combined with iVar narrows the gap with MiSeq performance in terms of read quality, accuracy, sensitivity, and number of detected mutations. Although MiSeq should still be considered as the standard method for SARS-CoV-2 variant tracking, MinION's versatility and rapid turnaround time may represent a clear advantage during the ongoing pandemic.
Collapse
Affiliation(s)
- David Garcia-Pedemonte
- Enteric Virus Laboratory, Section of Microbiology, Virology and Biotechnology, Department of Genetics, Microbiology and Statistics, School of Biology, University of Barcelona, 08028 Barcelona, Spain; (D.G.-P.); (A.C.); (I.A.); (A.B.); (M.I.C.)
- Enteric Virus Laboratory, Institute of Nutrition and Food Safety (INSA), University of Barcelona, 08921 Santa Coloma de Gramenet, Spain
| | - Albert Carcereny
- Enteric Virus Laboratory, Section of Microbiology, Virology and Biotechnology, Department of Genetics, Microbiology and Statistics, School of Biology, University of Barcelona, 08028 Barcelona, Spain; (D.G.-P.); (A.C.); (I.A.); (A.B.); (M.I.C.)
- Enteric Virus Laboratory, Institute of Nutrition and Food Safety (INSA), University of Barcelona, 08921 Santa Coloma de Gramenet, Spain
| | - Josep Gregori
- Liver Unit, Liver Diseases—Viral Hepatitis, Vall d’Hebron Institut de Recerca (VHIR), Vall d’Hebron Hospital Campus, 08035 Barcelona, Spain; (J.G.); (J.Q.); (D.G.-C.)
- Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd), Instituto de Salud Carlos III, 28029 Madrid, Spain
| | - Josep Quer
- Liver Unit, Liver Diseases—Viral Hepatitis, Vall d’Hebron Institut de Recerca (VHIR), Vall d’Hebron Hospital Campus, 08035 Barcelona, Spain; (J.G.); (J.Q.); (D.G.-C.)
- Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd), Instituto de Salud Carlos III, 28029 Madrid, Spain
| | - Damir Garcia-Cehic
- Liver Unit, Liver Diseases—Viral Hepatitis, Vall d’Hebron Institut de Recerca (VHIR), Vall d’Hebron Hospital Campus, 08035 Barcelona, Spain; (J.G.); (J.Q.); (D.G.-C.)
- Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd), Instituto de Salud Carlos III, 28029 Madrid, Spain
| | - Laura Guerrero
- Catalan Institute for Water Research (ICRA), 17003 Girona, Spain;
| | - Adrià Ceretó-Massagué
- Centre for Omic Sciences (COS), Joint Unit Universitat Rovira i Virgili-EURECAT, Unique Scientific and Technical Infrastructures (ICTS), 43204 Reus, Spain;
| | - Islem Abid
- Enteric Virus Laboratory, Section of Microbiology, Virology and Biotechnology, Department of Genetics, Microbiology and Statistics, School of Biology, University of Barcelona, 08028 Barcelona, Spain; (D.G.-P.); (A.C.); (I.A.); (A.B.); (M.I.C.)
- Center of Excellence in Biotechnology Research, College of Applied Science, King Saud University, Riyadh 11495, Saudi Arabia
| | - Albert Bosch
- Enteric Virus Laboratory, Section of Microbiology, Virology and Biotechnology, Department of Genetics, Microbiology and Statistics, School of Biology, University of Barcelona, 08028 Barcelona, Spain; (D.G.-P.); (A.C.); (I.A.); (A.B.); (M.I.C.)
- Enteric Virus Laboratory, Institute of Nutrition and Food Safety (INSA), University of Barcelona, 08921 Santa Coloma de Gramenet, Spain
| | - Maria Isabel Costafreda
- Enteric Virus Laboratory, Section of Microbiology, Virology and Biotechnology, Department of Genetics, Microbiology and Statistics, School of Biology, University of Barcelona, 08028 Barcelona, Spain; (D.G.-P.); (A.C.); (I.A.); (A.B.); (M.I.C.)
- Enteric Virus Laboratory, Institute of Nutrition and Food Safety (INSA), University of Barcelona, 08921 Santa Coloma de Gramenet, Spain
| | - Rosa M. Pintó
- Enteric Virus Laboratory, Section of Microbiology, Virology and Biotechnology, Department of Genetics, Microbiology and Statistics, School of Biology, University of Barcelona, 08028 Barcelona, Spain; (D.G.-P.); (A.C.); (I.A.); (A.B.); (M.I.C.)
- Enteric Virus Laboratory, Institute of Nutrition and Food Safety (INSA), University of Barcelona, 08921 Santa Coloma de Gramenet, Spain
| | - Susana Guix
- Enteric Virus Laboratory, Section of Microbiology, Virology and Biotechnology, Department of Genetics, Microbiology and Statistics, School of Biology, University of Barcelona, 08028 Barcelona, Spain; (D.G.-P.); (A.C.); (I.A.); (A.B.); (M.I.C.)
- Enteric Virus Laboratory, Institute of Nutrition and Food Safety (INSA), University of Barcelona, 08921 Santa Coloma de Gramenet, Spain
| |
Collapse
|
4
|
Jiang X, Zhang B, Lan F, Zhong C, Jin J, Li X, Zhou Q, Li J, Yang N, Wen C, Sun C. Host genetics and gut microbiota jointly regulate blood biochemical indicators in chickens. Appl Microbiol Biotechnol 2023; 107:7601-7620. [PMID: 37792060 PMCID: PMC10656342 DOI: 10.1007/s00253-023-12814-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Revised: 09/14/2023] [Accepted: 09/22/2023] [Indexed: 10/05/2023]
Abstract
Blood biochemical indicators play a crucial role in assessing an individual's overall health status and metabolic function. In this study, we measured five blood biochemical indicators, including total cholesterol (CHOL), low-density lipoprotein cholesterol (LDL-CH), triglycerides (TG), high-density lipoprotein cholesterol (HDL-CH), and blood glucose (BG), as well as 19 growth traits of 206 male chickens. By integrating host whole-genome information and 16S rRNA sequencing of the duodenum, jejunum, ileum, cecum, and feces microbiota, we assessed the contributions of host genetics and gut microbiota to blood biochemical indicators and their interrelationships. Our results demonstrated significant negative phenotypic and genetic correlations (r = - 0.20 ~ - 0.67) between CHOL and LDL-CH with growth traits such as body weight, abdominal fat content, muscle content, and shin circumference. The results of heritability and microbiability indicated that blood biochemical indicators were jointly regulated by host genetics and gut microbiota. Notably, the heritability of HDL-CH was estimated to be 0.24, while the jejunal microbiability for BG and TG reached 0.45 and 0.23. Furthermore, by conducting genome-wide association study (GWAS) with the single-nucleotide polymorphism (SNPs), insertion/deletion (indels), and structural variation (SV), we identified RAP2C, member of the RAS oncogene family (RAP2C), dedicator of cytokinesis 11 (DOCK11), neurotensin (NTS) and BOP1 ribosomal biogenesis factor (BOP1) as regulators of HDL-CH, and glycerophosphodiester phosphodiesterase domain containing 5 (GDPD5), dihydrodiol dehydrogenase (DHDH), and potassium voltage-gated channel interacting protein 1 (KCNIP1) as candidate genes of BG. Moreover, our findings suggest that cecal RF39 and Clostridia_UCG_014 may be linked to the regulation of CHOL, and jejunal Streptococcaceae may be involved in the regulation of TG. Additionally, microbial GWAS results indicated that the presence of gut microbiota was under host genetic regulation. Our findings provide valuable insights into the complex interaction between host genetics and microbiota in shaping the blood biochemical profile of chickens. KEY POINTS: • Multiple candidate genes were identified for the regulation of CHOL, HDL-CH, and BG. • RF39, Clostridia_UCG_014, and Streptococcaceae were implicated in CHOL and TG modulation. • The composition of gut microbiota is influenced by host genetics.
Collapse
Affiliation(s)
- Xinwei Jiang
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Boxuan Zhang
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Fangren Lan
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Conghao Zhong
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Jiaming Jin
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Xiaochang Li
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Qianqian Zhou
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Junying Li
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Ning Yang
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Chaoliang Wen
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China.
| | - Congjiao Sun
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China.
| |
Collapse
|
5
|
Messaritakis I, Psaroudaki E, Vogiatzoglou K, Sfakianaki M, Topalis P, Iliopoulos I, Mavroudis D, Tsiaoussis J, Gouvas N, Tzardi M, Souglakos J. Unraveling the Role of Molecular Profiling in Predicting Treatment Response in Stage III Colorectal Cancer Patients: Insights from the IDEA International Study. Cancers (Basel) 2023; 15:4819. [PMID: 37835512 PMCID: PMC10571744 DOI: 10.3390/cancers15194819] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Revised: 09/06/2023] [Accepted: 09/28/2023] [Indexed: 10/15/2023] Open
Abstract
BACKGROUND This study aimed to investigate the molecular profiles of 237 stage III CRC patients from the international IDEA study. It also sought to correlate these profiles with Toll-like and vitamin D receptor polymorphisms, clinicopathological and epidemiological characteristics, and patient outcomes. METHODS Whole Exome Sequencing and PCR-RFLP on surgical specimens and blood samples, respectively, were performed to identify molecular profiling and the presence of Toll-like and vitamin D polymorphisms. Bioinformatic analysis revealed mutational status. RESULTS Among the enrolled patients, 63.7% were male, 66.7% had left-sided tumors, and 55.7% received CAPOX as adjuvant chemotherapy. Whole exome sequencing identified 59 mutated genes in 11 different signaling pathways from the Kyoto Encyclopedia of Genes and Genomes (KEGG) CRC panel. On average, patients had 8 mutated genes (range, 2-21 genes). Mutations in ARAF and MAPK10 emerged as independent prognostic factors for reduced DFS (p = 0.027 and p < 0.001, respectively), while RAC3 and RHOA genes emerged as independent prognostic factors for reduced OS (p = 0.029 and p = 0.006, respectively). Right-sided tumors were also identified as independent prognostic factors for reduced DFS (p = 0.019) and OS (p = 0.043). Additionally, patients with tumors in the transverse colon had mutations in genes related to apoptosis, PIK3-Akt, Wnt, and MAPK signaling pathways. CONCLUSIONS Molecular characterization of tumor cells can enhance our understanding of the disease course. Mutations may serve as promising prognostic biomarkers, offering improved treatment options. Confirming these findings will require larger patient cohorts and international collaborations to establish correlations between molecular profiling, clinicopathological and epidemiological characteristics and clinical outcomes.
Collapse
Affiliation(s)
- Ippokratis Messaritakis
- Laboratory of Translational Oncology, Medical School, University of Crete, 70013 Heraklion, Greece; (E.P.); (K.V.); (M.S.); (D.M.); (J.S.)
| | - Eleni Psaroudaki
- Laboratory of Translational Oncology, Medical School, University of Crete, 70013 Heraklion, Greece; (E.P.); (K.V.); (M.S.); (D.M.); (J.S.)
| | - Konstantinos Vogiatzoglou
- Laboratory of Translational Oncology, Medical School, University of Crete, 70013 Heraklion, Greece; (E.P.); (K.V.); (M.S.); (D.M.); (J.S.)
| | - Maria Sfakianaki
- Laboratory of Translational Oncology, Medical School, University of Crete, 70013 Heraklion, Greece; (E.P.); (K.V.); (M.S.); (D.M.); (J.S.)
| | - Pantelis Topalis
- Institute of Molecular Biology and Biotechnology, Foundation for Research and Technology-Hellas, 70013 Heraklion, Greece;
| | - Ioannis Iliopoulos
- Laboratory of Computational Biology, Division of Basic Sciences, School of Medicine, University of Crete, 71003 Heraklion, Greece;
| | - Dimitrios Mavroudis
- Laboratory of Translational Oncology, Medical School, University of Crete, 70013 Heraklion, Greece; (E.P.); (K.V.); (M.S.); (D.M.); (J.S.)
- Department of Medical Oncology, University General Hospital of Heraklion, 71100 Heraklion, Greece
| | - John Tsiaoussis
- Department of Anatomy, School of Medicine, University of Crete, 70013 Heraklion, Greece;
| | - Nikolaos Gouvas
- Medical School, University of Cyprus, 99010 Nicosia, Cyprus;
| | - Maria Tzardi
- Laboratory of Pathology, Medical School, University of Crete, 70013 Heraklion, Greece;
| | - John Souglakos
- Laboratory of Translational Oncology, Medical School, University of Crete, 70013 Heraklion, Greece; (E.P.); (K.V.); (M.S.); (D.M.); (J.S.)
- Department of Medical Oncology, University General Hospital of Heraklion, 71100 Heraklion, Greece
| |
Collapse
|
6
|
Glotov OS, Chernov AN, Glotov AS. Human Exome Sequencing and Prospects for Predictive Medicine: Analysis of International Data and Own Experience. J Pers Med 2023; 13:1236. [PMID: 37623486 PMCID: PMC10455459 DOI: 10.3390/jpm13081236] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Revised: 07/25/2023] [Accepted: 07/26/2023] [Indexed: 08/26/2023] Open
Abstract
Today, whole-exome sequencing (WES) is used to conduct the massive screening of structural and regulatory genes in order to identify the allele frequencies of disease-associated polymorphisms in various populations and thus detect pathogenic genetic changes (mutations or polymorphisms) conducive to malfunctional protein sequences. With its extensive capabilities, exome sequencing today allows both the diagnosis of monogenic diseases (MDs) and the examination of seemingly healthy populations to reveal a wide range of potential risks prior to disease manifestation (in the future, exome sequencing may outpace costly and less informative genome sequencing to become the first-line examination technique). This review establishes the human genetic passport as a new WES-based clinical concept for the identification of new candidate genes, gene variants, and molecular mechanisms in the diagnosis, prediction, and treatment of monogenic, oligogenic, and multifactorial diseases. Various diseases are addressed to demonstrate the extensive potential of WES and consider its advantages as well as disadvantages. Thus, WES can become a general test with a broad spectrum pf applications, including opportunistic screening.
Collapse
Affiliation(s)
- Oleg S. Glotov
- Department of Genomic Medicine, D. O. Ott Research Institute of Obstetrics, Gynecology and Reproductology, 199034 St. Petersburg, Russia;
- Department of Experimental Medical Virology, Molecular Genetics and Biobanking of Pediatric Research and Clinical Center for Infectious Diseases, 197022 St. Petersburg, Russia
| | - Alexander N. Chernov
- Department of Genomic Medicine, D. O. Ott Research Institute of Obstetrics, Gynecology and Reproductology, 199034 St. Petersburg, Russia;
- Department of General Pathology and Pathological Physiology, Institute of Experimental Medicine, 197376 St. Petersburg, Russia
| | - Andrey S. Glotov
- Department of Genomic Medicine, D. O. Ott Research Institute of Obstetrics, Gynecology and Reproductology, 199034 St. Petersburg, Russia;
| |
Collapse
|
7
|
Athanasopoulou K, Daneva GN, Boti MA, Dimitroulis G, Adamopoulos PG, Scorilas A. The Transition from Cancer "omics" to "epi-omics" through Next- and Third-Generation Sequencing. LIFE (BASEL, SWITZERLAND) 2022; 12:life12122010. [PMID: 36556377 PMCID: PMC9785810 DOI: 10.3390/life12122010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Revised: 11/25/2022] [Accepted: 11/30/2022] [Indexed: 12/05/2022]
Abstract
Deciphering cancer etiopathogenesis has proven to be an especially challenging task since the mechanisms that drive tumor development and progression are far from simple. An astonishing amount of research has revealed a wide spectrum of defects, including genomic abnormalities, epigenomic alterations, disturbance of gene transcription, as well as post-translational protein modifications, which cooperatively promote carcinogenesis. These findings suggest that the adoption of a multidimensional approach can provide a much more precise and comprehensive picture of the tumor landscape, hence serving as a powerful tool in cancer research and precision oncology. The introduction of next- and third-generation sequencing technologies paved the way for the decoding of genetic information and the elucidation of cancer-related cellular compounds and mechanisms. In the present review, we discuss the current and emerging applications of both generations of sequencing technologies, also referred to as massive parallel sequencing (MPS), in the fields of cancer genomics, transcriptomics and proteomics, as well as in the progressing realms of epi-omics. Finally, we provide a brief insight into the expanding scope of sequencing applications in personalized cancer medicine and pharmacogenomics.
Collapse
|
8
|
Angaroni F, Guidi A, Ascolani G, d'Onofrio A, Antoniotti M, Graudenzi A. J-SPACE: a Julia package for the simulation of spatial models of cancer evolution and of sequencing experiments. BMC Bioinformatics 2022; 23:269. [PMID: 35804300 PMCID: PMC9270769 DOI: 10.1186/s12859-022-04779-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2022] [Accepted: 06/09/2022] [Indexed: 11/15/2022] Open
Abstract
Background The combined effects of biological variability and measurement-related errors on cancer sequencing data remain largely unexplored. However, the spatio-temporal simulation of multi-cellular systems provides a powerful instrument to address this issue. In particular, efficient algorithmic frameworks are needed to overcome the harsh trade-off between scalability and expressivity, so to allow one to simulate both realistic cancer evolution scenarios and the related sequencing experiments, which can then be used to benchmark downstream bioinformatics methods. Result We introduce a Julia package for SPAtial Cancer Evolution (J-SPACE), which allows one to model and simulate a broad set of experimental scenarios, phenomenological rules and sequencing settings.Specifically, J-SPACE simulates the spatial dynamics of cells as a continuous-time multi-type birth-death stochastic process on a arbitrary graph, employing different rules of interaction and an optimised Gillespie algorithm. The evolutionary dynamics of genomic alterations (single-nucleotide variants and indels) is simulated either under the Infinite Sites Assumption or several different substitution models, including one based on mutational signatures. After mimicking the spatial sampling of tumour cells, J-SPACE returns the related phylogenetic model, and allows one to generate synthetic reads from several Next-Generation Sequencing (NGS) platforms, via the ART read simulator. The results are finally returned in standard FASTA, FASTQ, SAM, ALN and Newick file formats. Conclusion J-SPACE is designed to efficiently simulate the heterogeneous behaviour of a large number of cancer cells and produces a rich set of outputs. Our framework is useful to investigate the emergent spatial dynamics of cancer subpopulations, as well as to assess the impact of incomplete sampling and of experiment-specific errors. Importantly, the output of J-SPACE is designed to allow the performance assessment of downstream bioinformatics pipelines processing NGS data. J-SPACE is freely available at: https://github.com/BIMIB-DISCo/J-Space.jl.
Collapse
Affiliation(s)
- Fabrizio Angaroni
- Dept. of Informatics, Systems and Communication, Univ. of Milan-Bicocca, Milan, Italy.
| | - Alessandro Guidi
- Dept. of Informatics, Systems and Communication, Univ. of Milan-Bicocca, Milan, Italy
| | - Gianluca Ascolani
- Dept. of Informatics, Systems and Communication, Univ. of Milan-Bicocca, Milan, Italy
| | - Alberto d'Onofrio
- Department of Mathematics and Geosciences, Univ. of Trieste, Trieste, Italy
| | - Marco Antoniotti
- Dept. of Informatics, Systems and Communication, Univ. of Milan-Bicocca, Milan, Italy.,Bicocca Bioinformatics, Biostatistics and Bioimaging Centre (B4), Milan, Italy
| | - Alex Graudenzi
- Dept. of Informatics, Systems and Communication, Univ. of Milan-Bicocca, Milan, Italy.,Bicocca Bioinformatics, Biostatistics and Bioimaging Centre (B4), Milan, Italy.,Inst. of Molecular Bioimaging and Physiology, National Research Council (IBFM-CNR), Segrate, Italy
| |
Collapse
|
9
|
Alekseenko A, Wang J, Barrett D, Pelechano V. OPUSeq simplifies detection of low-frequency DNA variants and uncovers fragmentase-associated artifacts. NAR Genom Bioinform 2022; 4:lqac048. [PMID: 35769342 PMCID: PMC9235115 DOI: 10.1093/nargab/lqac048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2022] [Accepted: 06/15/2022] [Indexed: 11/13/2022] Open
Abstract
Detection of low-frequency DNA variants (below 1%) is becoming increasingly important in biomedical research and clinical practice, but is challenging to do with standard sequencing approaches due to high error rates. The use of double-stranded unique molecular identifiers (dsUMIs) allows correction of errors by comparing reads arising from the same original DNA duplex. However, the implementation of such approaches is still challenging. Here, we present a novel method, one-pot dsUMI sequencing (OPUSeq), which allows incorporation of dsUMIs in the same reaction as the library PCR. This obviates the need for adapter pre-synthesis or additional enzymatic steps. OPUSeq can be incorporated into standard DNA library preparation approaches and coupled with hybridization target capture. We demonstrate successful error correction and detection of variants down to allele frequency of 0.01%. Using OPUSeq, we also show that the use of enzymatic fragmentation can lead to the appearance of spurious double-stranded variants, interfering with detection of variant fractions below 0.1%.
Collapse
Affiliation(s)
- Alisa Alekseenko
- SciLifeLab, Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet, Tomtebodavägen 23A, 17165, Solna, Sweden
| | - Jingwen Wang
- SciLifeLab, Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet, Tomtebodavägen 23A, 17165, Solna, Sweden
| | - Donal Barrett
- SciLifeLab, Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet, Tomtebodavägen 23A, 17165, Solna, Sweden
| | - Vicent Pelechano
- SciLifeLab, Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet, Tomtebodavägen 23A, 17165, Solna, Sweden
| |
Collapse
|
10
|
Borden ES, Buetow KH, Wilson MA, Hastings KT. Cancer Neoantigens: Challenges and Future Directions for Prediction, Prioritization, and Validation. Front Oncol 2022; 12:836821. [PMID: 35311072 PMCID: PMC8929516 DOI: 10.3389/fonc.2022.836821] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Accepted: 02/07/2022] [Indexed: 12/16/2022] Open
Abstract
Prioritization of immunogenic neoantigens is key to enhancing cancer immunotherapy through the development of personalized vaccines, adoptive T cell therapy, and the prediction of response to immune checkpoint inhibition. Neoantigens are tumor-specific proteins that allow the immune system to recognize and destroy a tumor. Cancer immunotherapies, such as personalized cancer vaccines, adoptive T cell therapy, and immune checkpoint inhibition, rely on an understanding of the patient-specific neoantigen profile in order to guide personalized therapeutic strategies. Genomic approaches to predicting and prioritizing immunogenic neoantigens are rapidly expanding, raising new opportunities to advance these tools and enhance their clinical relevance. Predicting neoantigens requires acquisition of high-quality samples and sequencing data, followed by variant calling and variant annotation. Subsequently, prioritizing which of these neoantigens may elicit a tumor-specific immune response requires application and integration of tools to predict the expression, processing, binding, and recognition potentials of the neoantigen. Finally, improvement of the computational tools is held in constant tension with the availability of datasets with validated immunogenic neoantigens. The goal of this review article is to summarize the current knowledge and limitations in neoantigen prediction, prioritization, and validation and propose future directions that will improve personalized cancer treatment.
Collapse
Affiliation(s)
- Elizabeth S Borden
- Department of Basic Medical Sciences, College of Medicine-Phoenix, University of Arizona, Phoenix, AZ, United States.,Department of Research and Internal Medicine (Dermatology), Phoenix Veterans Affairs Health Care System, Phoenix, AZ, United States
| | - Kenneth H Buetow
- School of Life Sciences, Arizona State University, Tempe, AZ, United States.,Center for Evolution and Medicine, Arizona State University, Tempe, AZ, United States
| | - Melissa A Wilson
- School of Life Sciences, Arizona State University, Tempe, AZ, United States.,Center for Evolution and Medicine, Arizona State University, Tempe, AZ, United States
| | - Karen Taraszka Hastings
- Department of Basic Medical Sciences, College of Medicine-Phoenix, University of Arizona, Phoenix, AZ, United States.,Department of Research and Internal Medicine (Dermatology), Phoenix Veterans Affairs Health Care System, Phoenix, AZ, United States
| |
Collapse
|
11
|
Fang LT, Zhu B, Zhao Y, Chen W, Yang Z, Kerrigan L, Langenbach K, de Mars M, Lu C, Idler K, Jacob H, Zheng Y, Ren L, Yu Y, Jaeger E, Schroth GP, Abaan OD, Talsania K, Lack J, Shen TW, Chen Z, Stanbouly S, Tran B, Shetty J, Kriga Y, Meerzaman D, Nguyen C, Petitjean V, Sultan M, Cam M, Mehta M, Hung T, Peters E, Kalamegham R, Sahraeian SME, Mohiyuddin M, Guo Y, Yao L, Song L, Lam HYK, Drabek J, Vojta P, Maestro R, Gasparotto D, Kõks S, Reimann E, Scherer A, Nordlund J, Liljedahl U, Jensen RV, Pirooznia M, Li Z, Xiao C, Sherry ST, Kusko R, Moos M, Donaldson E, Tezak Z, Ning B, Tong W, Li J, Duerken-Hughes P, Catalanotti C, Maheshwari S, Shuga J, Liang WS, Keats J, Adkins J, Tassone E, Zismann V, McDaniel T, Trent J, Foox J, Butler D, Mason CE, Hong H, Shi L, Wang C, Xiao W. Establishing community reference samples, data and call sets for benchmarking cancer mutation detection using whole-genome sequencing. Nat Biotechnol 2021; 39:1151-1160. [PMID: 34504347 PMCID: PMC8532138 DOI: 10.1038/s41587-021-00993-6] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2019] [Accepted: 06/18/2021] [Indexed: 02/08/2023]
Abstract
The lack of samples for generating standardized DNA datasets for setting up a sequencing pipeline or benchmarking the performance of different algorithms limits the implementation and uptake of cancer genomics. Here, we describe reference call sets obtained from paired tumor-normal genomic DNA (gDNA) samples derived from a breast cancer cell line-which is highly heterogeneous, with an aneuploid genome, and enriched in somatic alterations-and a matched lymphoblastoid cell line. We partially validated both somatic mutations and germline variants in these call sets via whole-exome sequencing (WES) with different sequencing platforms and targeted sequencing with >2,000-fold coverage, spanning 82% of genomic regions with high confidence. Although the gDNA reference samples are not representative of primary cancer cells from a clinical sample, when setting up a sequencing pipeline, they not only minimize potential biases from technologies, assays and informatics but also provide a unique resource for benchmarking 'tumor-only' or 'matched tumor-normal' analyses.
Collapse
Affiliation(s)
- Li Tai Fang
- Bioinformatics Research & Early Development, Roche Sequencing Solutions Inc., Belmont, CA, USA
| | - Bin Zhu
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Yongmei Zhao
- Advanced Biomedical and Computational Sciences, Biomedical Informatics and Data Science Directorate, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Wanqiu Chen
- Center for Genomics, Loma Linda University School of Medicine, Loma Linda, CA, USA
| | - Zhaowei Yang
- Center for Genomics, Loma Linda University School of Medicine, Loma Linda, CA, USA
- Department of Allergy and Clinical Immunology, State Key Laboratory of Respiratory Disease, National Clinical Research Center for Respiratory Disease, Guangzhou Institute of Respiratory Health, First Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
| | - Liz Kerrigan
- ATCC (American Type Culture Collection), Manassas, VA, USA
| | | | | | - Charles Lu
- Computational Genomics, Genomics Research Center (GRC), AbbVie, North Chicago, IL, USA
| | - Kenneth Idler
- Computational Genomics, Genomics Research Center (GRC), AbbVie, North Chicago, IL, USA
| | - Howard Jacob
- Computational Genomics, Genomics Research Center (GRC), AbbVie, North Chicago, IL, USA
| | - Yuanting Zheng
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, School of Life Sciences and Shanghai Cancer Center, Fudan University, Shanghai, China
| | - Luyao Ren
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, School of Life Sciences and Shanghai Cancer Center, Fudan University, Shanghai, China
| | - Ying Yu
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, School of Life Sciences and Shanghai Cancer Center, Fudan University, Shanghai, China
| | | | | | | | - Keyur Talsania
- Advanced Biomedical and Computational Sciences, Biomedical Informatics and Data Science Directorate, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Justin Lack
- Advanced Biomedical and Computational Sciences, Biomedical Informatics and Data Science Directorate, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Tsai-Wei Shen
- Advanced Biomedical and Computational Sciences, Biomedical Informatics and Data Science Directorate, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Zhong Chen
- Center for Genomics, Loma Linda University School of Medicine, Loma Linda, CA, USA
| | - Seta Stanbouly
- Center for Genomics, Loma Linda University School of Medicine, Loma Linda, CA, USA
| | - Bao Tran
- Sequencing Facility, Cancer Research Technology Program, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Jyoti Shetty
- Sequencing Facility, Cancer Research Technology Program, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Yuliya Kriga
- Sequencing Facility, Cancer Research Technology Program, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Daoud Meerzaman
- Computational Genomics and Bioinformatics Branch, Center for Biomedical Informatics and Information Technology (CBIIT), National Cancer Institute, Rockville, MD, USA
| | - Cu Nguyen
- Computational Genomics and Bioinformatics Branch, Center for Biomedical Informatics and Information Technology (CBIIT), National Cancer Institute, Rockville, MD, USA
| | - Virginie Petitjean
- Biomarker Development, Novartis Institutes for Biomedical Research, Basel, Switzerland
| | - Marc Sultan
- Biomarker Development, Novartis Institutes for Biomedical Research, Basel, Switzerland
| | - Margaret Cam
- CCR Collaborative Bioinformatics Resource (CCBR), Office of Science and Technology Resources, Center for Cancer Research, Bethesda, MD, USA
| | - Monika Mehta
- Sequencing Facility, Cancer Research Technology Program, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Tiffany Hung
- Genentech, a member of the Roche group, South San Francisco, CA, USA
| | - Eric Peters
- Genentech, a member of the Roche group, South San Francisco, CA, USA
| | - Rasika Kalamegham
- Genentech, a member of the Roche group, South San Francisco, CA, USA
| | | | - Marghoob Mohiyuddin
- Bioinformatics Research & Early Development, Roche Sequencing Solutions Inc., Belmont, CA, USA
| | - Yunfei Guo
- Bioinformatics Research & Early Development, Roche Sequencing Solutions Inc., Belmont, CA, USA
| | - Lijing Yao
- Bioinformatics Research & Early Development, Roche Sequencing Solutions Inc., Belmont, CA, USA
| | - Lei Song
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Hugo Y K Lam
- Bioinformatics Research & Early Development, Roche Sequencing Solutions Inc., Belmont, CA, USA
| | - Jiri Drabek
- IMTM, Faculty of Medicine and Dentistry, Palacky University, Olomouc, Czech Republic
- European Infrastructure for Translational Medicine, Amsterdam, the Netherlands
| | - Petr Vojta
- IMTM, Faculty of Medicine and Dentistry, Palacky University, Olomouc, Czech Republic
- European Infrastructure for Translational Medicine, Amsterdam, the Netherlands
| | - Roberta Maestro
- European Infrastructure for Translational Medicine, Amsterdam, the Netherlands
- Centro di Riferimento Oncologico di Aviano (CRO) IRCCS, National Cancer Institute, Unit of Oncogenetics and Functional Oncogenomics, Aviano, Italy
| | - Daniela Gasparotto
- European Infrastructure for Translational Medicine, Amsterdam, the Netherlands
- Centro di Riferimento Oncologico di Aviano (CRO) IRCCS, National Cancer Institute, Unit of Oncogenetics and Functional Oncogenomics, Aviano, Italy
| | - Sulev Kõks
- European Infrastructure for Translational Medicine, Amsterdam, the Netherlands
- Perron Institute for Neurological and Translational Science, Nedlands, Western Australia, Australia
- Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Ene Reimann
- European Infrastructure for Translational Medicine, Amsterdam, the Netherlands
- Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Andreas Scherer
- European Infrastructure for Translational Medicine, Amsterdam, the Netherlands
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland
| | - Jessica Nordlund
- European Infrastructure for Translational Medicine, Amsterdam, the Netherlands
- Department of Medical Sciences, Molecular Medicine and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Ulrika Liljedahl
- European Infrastructure for Translational Medicine, Amsterdam, the Netherlands
- Department of Medical Sciences, Molecular Medicine and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Roderick V Jensen
- Department of Biological Sciences, Virginia Tech, Blacksburg, VA, USA
| | - Mehdi Pirooznia
- Bioinformatics and Computational Biology Core, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, MD, USA
| | - Zhipan Li
- Sentieon Inc., Mountain View, CA, USA
| | - Chunlin Xiao
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Stephen T Sherry
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | | | - Malcolm Moos
- Center for Biologics Evaluation and Research, FDA, Silver Spring, MD, USA
| | - Eric Donaldson
- Center for Drug Evaluation and Research, FDA, Silver Spring, MD, USA
| | - Zivana Tezak
- Center for Devices and Radiological Health, FDA, Silver Spring, MD, USA
| | - Baitang Ning
- National Center for Toxicological Research, FDA, Jefferson, AR, USA
| | - Weida Tong
- National Center for Toxicological Research, FDA, Jefferson, AR, USA
| | - Jing Li
- Department of Allergy and Clinical Immunology, State Key Laboratory of Respiratory Disease, National Clinical Research Center for Respiratory Disease, Guangzhou Institute of Respiratory Health, First Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
| | | | | | | | | | - Winnie S Liang
- Translational Genomics Research Institute, Phoenix, AZ, USA
| | - Jonathan Keats
- Translational Genomics Research Institute, Phoenix, AZ, USA
| | | | - Erica Tassone
- Translational Genomics Research Institute, Phoenix, AZ, USA
| | | | | | - Jeffrey Trent
- Translational Genomics Research Institute, Phoenix, AZ, USA
| | - Jonathan Foox
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA
| | - Daniel Butler
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA
| | - Christopher E Mason
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA
| | - Huixiao Hong
- National Center for Toxicological Research, FDA, Jefferson, AR, USA.
| | - Leming Shi
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, School of Life Sciences and Shanghai Cancer Center, Fudan University, Shanghai, China.
| | - Charles Wang
- Center for Genomics, Loma Linda University School of Medicine, Loma Linda, CA, USA.
- Department of Basic Science, Loma Linda University School of Medicine, Loma Linda, CA, USA.
| | - Wenming Xiao
- Center for Devices and Radiological Health, FDA, Silver Spring, MD, USA.
| |
Collapse
|
12
|
Assessment of the gene mosaicism burden in blood and its implications for immune disorders. Sci Rep 2021; 11:12940. [PMID: 34155260 PMCID: PMC8217568 DOI: 10.1038/s41598-021-92381-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2020] [Accepted: 06/09/2021] [Indexed: 12/23/2022] Open
Abstract
There are increasing evidences showing the contribution of somatic genetic variants to non-cancer diseases. However, their detection using massive parallel sequencing methods still has important limitations. In addition, the relative importance and dynamics of somatic variation in healthy tissues are not fully understood. We performed high-depth whole-exome sequencing in 16 samples from patients with a previously determined pathogenic somatic variant for a primary immunodeficiency and tested different variant callers detection ability. Subsequently, we explored the load of somatic variants in the whole blood of these individuals and validated it by amplicon-based deep sequencing. Variant callers allowing low frequency read thresholds were able to detect most of the variants, even at very low frequencies in the tissue. The genetic load of somatic coding variants detectable in whole blood is low, ranging from 1 to 2 variants in our dataset, except for one case with 17 variants compatible with clonal haematopoiesis under genetic drift. Because of the ability we demonstrated to detect this type of genetic variation, and its relevant role in disorders such as primary immunodeficiencies, we suggest considering this model of gene mosaicism in future genetic studies and considering revisiting previous massive parallel sequencing data in patients with negative results.
Collapse
|
13
|
Mosquera Orgueira A, Ferreiro Ferro R, Díaz Arias JÁ, Aliste Santos C, Antelo Rodríguez B, Bao Pérez L, Alonso Vence N, Bendaña López Á, Abuin Blanco A, Melero Valentín P, Peleteiro Raindo A, Cid López M, Pérez Encinas MM, González Pérez MS, Fraga Rodríguez MF, Bello López JL. Detection of new drivers of frequent B-cell lymphoid neoplasms using an integrated analysis of whole genomes. PLoS One 2021; 16:e0248886. [PMID: 33945543 PMCID: PMC8096002 DOI: 10.1371/journal.pone.0248886] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2020] [Accepted: 01/19/2021] [Indexed: 12/21/2022] Open
Abstract
B-cell lymphoproliferative disorders exhibit a diverse spectrum of diagnostic entities with heterogeneous behaviour. Multiple efforts have focused on the determination of the genomic drivers of B-cell lymphoma subtypes. In the meantime, the aggregation of diverse tumors in pan-cancer genomic studies has become a useful tool to detect new driver genes, while enabling the comparison of mutational patterns across tumors. Here we present an integrated analysis of 354 B-cell lymphoid disorders. 112 recurrently mutated genes were discovered, of which KMT2D, CREBBP, IGLL5 and BCL2 were the most frequent, and 31 genes were putative new drivers. Mutations in CREBBP, TNFRSF14 and KMT2D predominated in follicular lymphoma, whereas those in BTG2, HTA-A and PIM1 were more frequent in diffuse large B-cell lymphoma. Additionally, we discovered 31 significantly mutated protein networks, reinforcing the role of genes such as CREBBP, EEF1A1, STAT6, GNA13 and TP53, but also pointing towards a myriad of infrequent players in lymphomagenesis. Finally, we report aberrant expression of oncogenes and tumor suppressors associated with novel noncoding mutations (DTX1 and S1PR2), and new recurrent copy number aberrations affecting immune check-point regulators (CD83, PVR) and B-cell specific genes (TNFRSF13C). Our analysis expands the number of mutational drivers of B-cell lymphoid neoplasms, and identifies several differential somatic events between disease subtypes.
Collapse
Affiliation(s)
- Adrián Mosquera Orgueira
- Health Research Institute of Santiago de Compostela (IDIS), Santiago de Compostela, Galicia, Spain
- Department of Hematology, Complexo Hospitalario Universitario de Santiago de Compostela (CHUS), SERGAS, Santiago de Compostela, Galicia, Spain
- University of Santiago de Compostela, Santiago de Compostela, Galicia, Spain
| | - Roi Ferreiro Ferro
- Department of Hematology, Complexo Hospitalario Universitario de Santiago de Compostela (CHUS), SERGAS, Santiago de Compostela, Galicia, Spain
| | - José Ángel Díaz Arias
- Department of Hematology, Complexo Hospitalario Universitario de Santiago de Compostela (CHUS), SERGAS, Santiago de Compostela, Galicia, Spain
| | - Carlos Aliste Santos
- Health Research Institute of Santiago de Compostela (IDIS), Santiago de Compostela, Galicia, Spain
- Department of Pathology, Complexo Hospitalario Universitario de Santiago de Compostela (CHUS), SERGAS, Santiago de Compostela, Galicia, Spain
| | - Beatriz Antelo Rodríguez
- Health Research Institute of Santiago de Compostela (IDIS), Santiago de Compostela, Galicia, Spain
- Department of Pathology, Complexo Hospitalario Universitario de Santiago de Compostela (CHUS), SERGAS, Santiago de Compostela, Galicia, Spain
| | - Laura Bao Pérez
- Department of Hematology, Complexo Hospitalario Universitario de Santiago de Compostela (CHUS), SERGAS, Santiago de Compostela, Galicia, Spain
| | - Natalia Alonso Vence
- Health Research Institute of Santiago de Compostela (IDIS), Santiago de Compostela, Galicia, Spain
- Department of Hematology, Complexo Hospitalario Universitario de Santiago de Compostela (CHUS), SERGAS, Santiago de Compostela, Galicia, Spain
| | - Ággeles Bendaña López
- Health Research Institute of Santiago de Compostela (IDIS), Santiago de Compostela, Galicia, Spain
- Department of Hematology, Complexo Hospitalario Universitario de Santiago de Compostela (CHUS), SERGAS, Santiago de Compostela, Galicia, Spain
- University of Santiago de Compostela, Santiago de Compostela, Galicia, Spain
| | - Aitor Abuin Blanco
- Department of Hematology, Complexo Hospitalario Universitario de Santiago de Compostela (CHUS), SERGAS, Santiago de Compostela, Galicia, Spain
| | - Paula Melero Valentín
- Department of Hematology, Complexo Hospitalario Universitario de Santiago de Compostela (CHUS), SERGAS, Santiago de Compostela, Galicia, Spain
| | - And´res Peleteiro Raindo
- Health Research Institute of Santiago de Compostela (IDIS), Santiago de Compostela, Galicia, Spain
- Department of Hematology, Complexo Hospitalario Universitario de Santiago de Compostela (CHUS), SERGAS, Santiago de Compostela, Galicia, Spain
- University of Santiago de Compostela, Santiago de Compostela, Galicia, Spain
| | - Miguel Cid López
- Health Research Institute of Santiago de Compostela (IDIS), Santiago de Compostela, Galicia, Spain
- Department of Hematology, Complexo Hospitalario Universitario de Santiago de Compostela (CHUS), SERGAS, Santiago de Compostela, Galicia, Spain
- University of Santiago de Compostela, Santiago de Compostela, Galicia, Spain
| | - Manuel Mateo Pérez Encinas
- Health Research Institute of Santiago de Compostela (IDIS), Santiago de Compostela, Galicia, Spain
- Department of Hematology, Complexo Hospitalario Universitario de Santiago de Compostela (CHUS), SERGAS, Santiago de Compostela, Galicia, Spain
- University of Santiago de Compostela, Santiago de Compostela, Galicia, Spain
| | - Marta Sonia González Pérez
- Health Research Institute of Santiago de Compostela (IDIS), Santiago de Compostela, Galicia, Spain
- Department of Hematology, Complexo Hospitalario Universitario de Santiago de Compostela (CHUS), SERGAS, Santiago de Compostela, Galicia, Spain
| | - Máximo Francisco Fraga Rodríguez
- Health Research Institute of Santiago de Compostela (IDIS), Santiago de Compostela, Galicia, Spain
- University of Santiago de Compostela, Santiago de Compostela, Galicia, Spain
- Department of Pathology, Complexo Hospitalario Universitario de Santiago de Compostela (CHUS), SERGAS, Santiago de Compostela, Galicia, Spain
| | - José Luis Bello López
- Health Research Institute of Santiago de Compostela (IDIS), Santiago de Compostela, Galicia, Spain
- Department of Hematology, Complexo Hospitalario Universitario de Santiago de Compostela (CHUS), SERGAS, Santiago de Compostela, Galicia, Spain
- University of Santiago de Compostela, Santiago de Compostela, Galicia, Spain
| |
Collapse
|
14
|
Spatial Distribution of Private Gene Mutations in Clear Cell Renal Cell Carcinoma. Cancers (Basel) 2021; 13:cancers13092163. [PMID: 33946379 PMCID: PMC8124666 DOI: 10.3390/cancers13092163] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2021] [Revised: 04/02/2021] [Accepted: 04/27/2021] [Indexed: 12/15/2022] Open
Abstract
Simple Summary Tumours consist of multiple groups of similar cells resulting from differing evolutionary trajectories, i.e., subclones. These subclones are prevalent in clear cell renal cell carcinoma (ccRCC). The aim of this study is to determine how similar or dissimilar the subclones in 89 ccRCC tumours are from one another regarding their gene mutations and expression profiles, i.e., the extent of intra-tumour heterogeneity. The implications of these alterations with respect to signalling pathways is also assessed. Deep sequencing allows for the identification of mutations with low-allele frequencies, providing a more comprehensive view of the heterogeneity present in the tumours. With an average of 62% of mutations having been identified in only one of the two biopsies, some of which in turn are found to impact gene expression, the complex makeup of ccRCC tumours is evident, and this can drastically influence treatment outcome. Abstract Intra-tumour heterogeneity is the molecular hallmark of renal cancer, and the molecular tumour composition determines the treatment outcome of renal cancer patients. In renal cancer tumourigenesis, in general, different tumour clones evolve over time. We analysed intra-tumour heterogeneity and subclonal mutation patterns in 178 tumour samples obtained from 89 clear cell renal cell carcinoma patients. In an initial discovery phase, whole-exome and transcriptome sequencing data from paired tumour biopsies from 16 ccRCC patients were used to design a gene panel for follow-up analysis. In this second phase, 826 selected genes were targeted at deep coverage in an extended cohort of 89 patients for a detailed analysis of tumour heterogeneity. On average, we found 22 mutations per patient. Pairwise comparison of the two biopsies from the same tumour revealed that on average, 62% of the mutations in a patient were detected in one of the two samples. In addition to commonly mutated genes (VHL, PBRM1, SETD2 and BAP1), frequent subclonal mutations with low variant allele frequency (<10%) were observed in TP53 and in mucin coding genes MUC6, MUC16, and MUC3A. Of the 89 ccRCC tumours, 87 (~98%) harboured private mutations, occurring in only one of the paired tumour samples. Clonally exclusive pathway pairs were identified using the WES data set from 16 ccRCC patients. Our findings imply that shared and private mutations significantly contribute to the complexity of differential gene expression and pathway interaction and might explain the clonal evolution of different molecular renal cancer subgroups. Multi-regional sequencing is central for the identification of subclones within ccRCC.
Collapse
|
15
|
Kısakol B, Sarıhan Ş, Ergün MA, Baysan M. Detailed evaluation of cancer sequencing pipelines in different microenvironments and heterogeneity levels. ACTA ACUST UNITED AC 2021; 45:114-126. [PMID: 33907494 PMCID: PMC8068765 DOI: 10.3906/biy-2008-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2020] [Accepted: 02/03/2021] [Indexed: 11/25/2022]
Abstract
The importance of next generation sequencing (NGS) rises in cancer research as accessing this key technology becomes easier for researchers. The sequence data created by NGS technologies must be processed by various bioinformatics algorithms within a pipeline in order to convert raw data to meaningful information. Mapping and variant calling are the two main steps of these analysis pipelines, and many algorithms are available for these steps. Therefore, detailed benchmarking of these algorithms in different scenarios is crucial for the efficient utilization of sequencing technologies. In this study, we compared the performance of twelve pipelines (three mapping and four variant discovery algorithms) with recommended settings to capture single nucleotide variants. We observed significant discrepancy in variant calls among tested pipelines for different heterogeneity levels in real and simulated samples with overall high specificity and low sensitivity. Additional to the individual evaluation of pipelines, we also constructed and tested the performance of pipeline combinations. In these analyses, we observed that certain pipelines complement each other much better than others and display superior performance than individual pipelines. This suggests that adhering to a single pipeline is not optimal for cancer sequencing analysis and sample heterogeneity should be considered in algorithm optimization.
Collapse
Affiliation(s)
- Batuhan Kısakol
- Department of Physiology and Medical Physics, Centre for Systems Medicine, Royal College of Surgeons in Ireland, Dublin Ireland
| | - Şahin Sarıhan
- Computer Engineering Department, Faculty of Engineering, Marmara University, İstanbul, Turkey Turkey
| | - Mehmet Arif Ergün
- Computer Engineering Department, Faculty of Computer and Informatics Engineering, İstanbul Technical University,İstanbul Turkey
| | - Mehmet Baysan
- Computer Engineering Department, Faculty of Computer and Informatics Engineering, İstanbul Technical University,İstanbul Turkey
| |
Collapse
|
16
|
Brueffer C, Gladchuk S, Winter C, Vallon‐Christersson J, Hegardt C, Häkkinen J, George AM, Chen Y, Ehinger A, Larsson C, Loman N, Malmberg M, Rydén L, Borg Å, Saal LH. The mutational landscape of the SCAN-B real-world primary breast cancer transcriptome. EMBO Mol Med 2020; 12:e12118. [PMID: 32926574 PMCID: PMC7539222 DOI: 10.15252/emmm.202012118] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2020] [Revised: 08/08/2020] [Accepted: 08/13/2020] [Indexed: 12/12/2022] Open
Abstract
Breast cancer is a disease of genomic alterations, of which the panorama of somatic mutations and how these relate to subtypes and therapy response is incompletely understood. Within SCAN-B (ClinicalTrials.gov: NCT02306096), a prospective study elucidating the transcriptomic profiles for thousands of breast cancers, we developed a RNA-seq pipeline for detection of SNVs/indels and profiled a real-world cohort of 3,217 breast tumors. We describe the mutational landscape of primary breast cancer viewed through the transcriptome of a large population-based cohort and relate it to patient survival. We demonstrate that RNA-seq can be used to call mutations in genes such as PIK3CA, TP53, and ERBB2, as well as the status of molecular pathways and mutational burden, and identify potentially druggable mutations in 86.8% of tumors. To make this rich dataset available for the research community, we developed an open source web application, the SCAN-B MutationExplorer (http://oncogenomics.bmc.lu.se/MutationExplorer). These results add another dimension to the use of RNA-seq as a clinical tool, where both gene expression- and mutation-based biomarkers can be interrogated in real-time within 1 week of tumor sampling.
Collapse
Affiliation(s)
- Christian Brueffer
- Division of OncologyDepartment of Clinical SciencesLund UniversityLundSweden
- Lund University Cancer CenterLundSweden
| | - Sergii Gladchuk
- Division of OncologyDepartment of Clinical SciencesLund UniversityLundSweden
- Lund University Cancer CenterLundSweden
| | - Christof Winter
- Division of OncologyDepartment of Clinical SciencesLund UniversityLundSweden
- Lund University Cancer CenterLundSweden
- Present address:
Institut für Klinische Chemie und PathobiochemieKlinikum rechts der IsarTechnische Universität MünchenMünchenGermany
| | - Johan Vallon‐Christersson
- Division of OncologyDepartment of Clinical SciencesLund UniversityLundSweden
- Lund University Cancer CenterLundSweden
- CREATE Health Strategic Center for Translational Cancer ResearchLund UniversityLundSweden
| | - Cecilia Hegardt
- Division of OncologyDepartment of Clinical SciencesLund UniversityLundSweden
- Lund University Cancer CenterLundSweden
- CREATE Health Strategic Center for Translational Cancer ResearchLund UniversityLundSweden
| | - Jari Häkkinen
- Division of OncologyDepartment of Clinical SciencesLund UniversityLundSweden
- Lund University Cancer CenterLundSweden
| | - Anthony M George
- Division of OncologyDepartment of Clinical SciencesLund UniversityLundSweden
- Lund University Cancer CenterLundSweden
| | - Yilun Chen
- Division of OncologyDepartment of Clinical SciencesLund UniversityLundSweden
- Lund University Cancer CenterLundSweden
| | - Anna Ehinger
- Division of OncologyDepartment of Clinical SciencesLund UniversityLundSweden
- Lund University Cancer CenterLundSweden
- Department of PathologySkåne University HospitalLundSweden
| | - Christer Larsson
- Lund University Cancer CenterLundSweden
- Division of Molecular PathologyDepartment of Laboratory MedicineLund UniversityLundSweden
| | - Niklas Loman
- Division of OncologyDepartment of Clinical SciencesLund UniversityLundSweden
- Lund University Cancer CenterLundSweden
- Department of OncologySkåne University HospitalLundSweden
| | | | - Lisa Rydén
- Division of OncologyDepartment of Clinical SciencesLund UniversityLundSweden
- Lund University Cancer CenterLundSweden
- Department of SurgerySkåne University HospitalLundSweden
| | - Åke Borg
- Division of OncologyDepartment of Clinical SciencesLund UniversityLundSweden
- Lund University Cancer CenterLundSweden
- CREATE Health Strategic Center for Translational Cancer ResearchLund UniversityLundSweden
| | - Lao H Saal
- Division of OncologyDepartment of Clinical SciencesLund UniversityLundSweden
- Lund University Cancer CenterLundSweden
- CREATE Health Strategic Center for Translational Cancer ResearchLund UniversityLundSweden
| |
Collapse
|
17
|
Pei S, Liu T, Ren X, Li W, Chen C, Xie Z. Benchmarking variant callers in next-generation and third-generation sequencing analysis. Brief Bioinform 2020; 22:5875142. [PMID: 32698196 DOI: 10.1093/bib/bbaa148] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2020] [Revised: 06/11/2020] [Accepted: 06/12/2020] [Indexed: 12/15/2022] Open
Abstract
DNA variants represent an important source of genetic variations among individuals. Next- generation sequencing (NGS) is the most popular technology for genome-wide variant calling. Third-generation sequencing (TGS) has also recently been used in genetic studies. Although many variant callers are available, no single caller can call both types of variants on NGS or TGS data with high sensitivity and specificity. In this study, we systematically evaluated 11 variant callers on 12 NGS and TGS datasets. For germline variant calling, we tested DNAseq and DNAscope modes from Sentieon, HaplotypeCaller mode from GATK and WGS mode from DeepVariant. All the four callers had comparable performance on NGS data and 30× coverage of WGS data was recommended. For germline variant calling on TGS data, we tested DNAseq mode from Sentieon, HaplotypeCaller mode from GATK and PACBIO mode from DeepVariant. All the three callers had similar performance in SNP calling, while DeepVariant outperformed the others in InDel calling. TGS detected more variants than NGS, particularly in complex and repetitive regions. For somatic variant calling on NGS, we tested TNscope and TNseq modes from Sentieon, MuTect2 mode from GATK, NeuSomatic, VarScan2, and Strelka2. TNscope and Mutect2 outperformed the other callers. A higher proportion of tumor sample purity (from 10 to 20%) significantly increased the recall value of calling. Finally, computational costs of the callers were compared and Sentieon required the least computational cost. These results suggest that careful selection of a tool and parameters is needed for accurate SNP or InDel calling under different scenarios.
Collapse
Affiliation(s)
- Surui Pei
- Zhongshan Ophthalmic Center at Sun Yat-sen University and Annoroad Gene Technology (Beijing) Co., Ltd
| | - Tao Liu
- Annoroad Gene Technology (Beijing) Co., Ltd
| | - Xue Ren
- Annoroad Gene Technology (Beijing) Co., Ltd
| | - Weizhong Li
- Zhongshan School of Medicine at Sun Yat-sen University
| | | | - Zhi Xie
- Zhongshan Ophthalmic Center at Sun Yat-sen University
| |
Collapse
|
18
|
Tello D, Gil J, Loaiza CD, Riascos JJ, Cardozo N, Duitama J. NGSEP3: accurate variant calling across species and sequencing protocols. Bioinformatics 2020; 35:4716-4723. [PMID: 31099384 PMCID: PMC6853766 DOI: 10.1093/bioinformatics/btz275] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2018] [Revised: 03/16/2019] [Accepted: 04/17/2019] [Indexed: 01/09/2023] Open
Abstract
MOTIVATION Accurate detection, genotyping and downstream analysis of genomic variants from high-throughput sequencing data are fundamental features in modern production pipelines for genetic-based diagnosis in medicine or genomic selection in plant and animal breeding. Our research group maintains the Next-Generation Sequencing Experience Platform (NGSEP) as a precise, efficient and easy-to-use software solution for these features. RESULTS Understanding that incorrect alignments around short tandem repeats are an important source of genotyping errors, we implemented in NGSEP new algorithms for realignment and haplotype clustering of reads spanning indels and short tandem repeats. We performed extensive benchmark experiments comparing NGSEP to state-of-the-art software using real data from three sequencing protocols and four species with different distributions of repetitive elements. NGSEP consistently shows comparative accuracy and better efficiency compared to the existing solutions. We expect that this work will contribute to the continuous improvement of quality in variant calling needed for modern applications in medicine and agriculture. AVAILABILITY AND IMPLEMENTATION NGSEP is available as open source software at http://ngsep.sf.net. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Daniel Tello
- Systems and Computing Engineering Department, Universidad de los Andes, Bogotá 111711, Colombia
| | - Juanita Gil
- Systems and Computing Engineering Department, Universidad de los Andes, Bogotá 111711, Colombia
| | - Cristian D Loaiza
- Biotechnology lab, Centro de Investigación de la caña de azúcar de Colombia, CENICAÑA, Cali 760046, Colombia
- Present address: Department of Plants, Soils, and Climate, Utah State University, Logan, UT, USA
| | - John J Riascos
- Biotechnology lab, Centro de Investigación de la caña de azúcar de Colombia, CENICAÑA, Cali 760046, Colombia
| | - Nicolás Cardozo
- Systems and Computing Engineering Department, Universidad de los Andes, Bogotá 111711, Colombia
| | - Jorge Duitama
- Systems and Computing Engineering Department, Universidad de los Andes, Bogotá 111711, Colombia
- Agrobiodiversity Research Area, International Center for Tropical Agriculture, Cali 763537, Colombia
- To whom correspondence should be addressed. E-mail:
| |
Collapse
|
19
|
Ho AS, Ochoa A, Jayakumaran G, Zehir A, Valero Mayor C, Tepe J, Makarov V, Dalin MG, He J, Bailey M, Montesion M, Ross JS, Miller VA, Chan L, Ganly I, Dogan S, Katabi N, Tsipouras P, Ha P, Agrawal N, Solit DB, Futreal PA, El Naggar AK, Reis-Filho JS, Weigelt B, Ho AL, Schultz N, Chan TA, Morris LG. Genetic hallmarks of recurrent/metastatic adenoid cystic carcinoma. J Clin Invest 2020; 129:4276-4289. [PMID: 31483290 DOI: 10.1172/jci128227] [Citation(s) in RCA: 129] [Impact Index Per Article: 32.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2019] [Accepted: 07/09/2019] [Indexed: 12/29/2022] Open
Abstract
BACKGROUNDAdenoid cystic carcinoma (ACC) is a rare malignancy arising in salivary glands and other sites, characterized by high rates of relapse and distant spread. Recurrent/metastatic (R/M) ACCs are generally incurable, due to a lack of active systemic therapies. To improve outcomes, deeper understanding of genetic alterations and vulnerabilities in R/M tumors is needed.METHODSAn integrated genomic analysis of 1,045 ACCs (177 primary, 868 R/M) was performed to identify alterations associated with advanced and metastatic tumors. Intratumoral genetic heterogeneity, germline mutations, and therapeutic actionability were assessed.RESULTSCompared with primary tumors, R/M tumors were enriched for alterations in key Notch (NOTCH1, 26.3% vs. 8.5%; NOTCH2, 4.6% vs. 2.3%; NOTCH3, 5.7% vs. 2.3%; NOTCH4, 3.6% vs. 0.6%) and chromatin-remodeling (KDM6A, 15.2% vs. 3.4%; KMT2C/MLL3, 14.3% vs. 4.0%; ARID1B, 14.1% vs. 4.0%) genes. TERT promoter mutations (13.1% of R/M cases) were mutually exclusive with both NOTCH1 mutations (q = 3.3 × 10-4) and MYB/MYBL1 fusions (q = 5.6 × 10-3), suggesting discrete, alternative mechanisms of tumorigenesis. This network of alterations defined 4 distinct ACC subgroups: MYB+NOTCH1+, MYB+/other, MYBWTNOTCH1+, and MYBWTTERT+. Despite low mutational load, we identified numerous samples with marked intratumoral genetic heterogeneity, including branching evolution across multiregion sequencing.CONCLUSIONThese observations collectively redefine the molecular underpinnings of ACC progression and identify further targets for precision therapies.FUNDINGAdenoid Cystic Carcinoma Research Foundation, Pershing Square Sohn Cancer Research grant, the PaineWebber Chair, Stand Up 2 Cancer, NIH R01 CA205426, the STARR Cancer Consortium, NCI R35 CA232097, the Frederick Adler Chair, Cycle for Survival, the Jayme Flowers Fund, The Sebastian Nativo Fund, NIH K08 DE024774 and R01 DE027738, and MSKCC through NIH/NCI Cancer Center Support Grant (P30 CA008748).
Collapse
Affiliation(s)
- Allen S Ho
- Department of Surgery and.,Samuel Oschin Comprehensive Cancer Institute, Cedars-Sinai Medical Center, Los Angeles, California, USA
| | - Angelica Ochoa
- Marie-Josée and Henry R. Kravis Center for Molecular Oncology
| | | | | | | | - Justin Tepe
- Head and Neck Service, Department of Surgery, and
| | - Vladimir Makarov
- Human Oncology and Pathogenesis Program, Memorial Sloan Kettering Cancer Center (MSKCC), New York, New York, USA
| | - Martin G Dalin
- Human Oncology and Pathogenesis Program, Memorial Sloan Kettering Cancer Center (MSKCC), New York, New York, USA
| | - Jie He
- Foundation Medicine, Cambridge, Massachusetts, USA
| | - Mark Bailey
- Foundation Medicine, Cambridge, Massachusetts, USA
| | | | | | | | - Lindsay Chan
- Foundation Medicine, Cambridge, Massachusetts, USA
| | - Ian Ganly
- Head and Neck Service, Department of Surgery, and
| | | | | | - Petros Tsipouras
- Department of Genetics, Yale University School of Medicine, New Haven, Connecticut, USA
| | - Patrick Ha
- Department of Otolaryngology-Head and Neck Surgery, UCSF, San Francisco, California, USA
| | - Nishant Agrawal
- Department of Surgery, University of Chicago, Chicago, Illinois, USA
| | - David B Solit
- Marie-Josée and Henry R. Kravis Center for Molecular Oncology.,Head and Neck Service, Department of Surgery, and.,Department of Medicine
| | | | - Adel K El Naggar
- Department of Pathology, University of Texas MD Anderson Cancer Center (MDACC), Houston, Texas, USA
| | | | - Britta Weigelt
- Experimental Pathology Service, MSKCC, New York, New York, USA
| | | | - Nikolaus Schultz
- Marie-Josée and Henry R. Kravis Center for Molecular Oncology.,Human Oncology and Pathogenesis Program, Memorial Sloan Kettering Cancer Center (MSKCC), New York, New York, USA
| | - Timothy A Chan
- Human Oncology and Pathogenesis Program, Memorial Sloan Kettering Cancer Center (MSKCC), New York, New York, USA.,Department of Radiation Oncology, and.,Immunogenomics and Precision Oncology Platform, MSKCC, New York, New York, USA
| | - Luc Gt Morris
- Head and Neck Service, Department of Surgery, and.,Human Oncology and Pathogenesis Program, Memorial Sloan Kettering Cancer Center (MSKCC), New York, New York, USA.,Immunogenomics and Precision Oncology Platform, MSKCC, New York, New York, USA
| |
Collapse
|
20
|
Meerzaman D, Dunn BK. Value of Collaboration among Multi-Domain Experts in Analysis of High-Throughput Genomics Data. Cancer Res 2019; 79:5140-5145. [PMID: 31337654 DOI: 10.1158/0008-5472.can-19-0769] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2019] [Revised: 05/30/2019] [Accepted: 07/11/2019] [Indexed: 12/18/2022]
Abstract
The recent explosion and ease of access to large-scale genomics data is intriguing. However, serious obstacles exist to the optimal management of the entire spectrum from data production in the laboratory through bioinformatic analysis to statistical evaluation and ultimately clinical interpretation. Beyond the multitude of technical issues, what stands out the most is the absence of adequate communication among the specialists in these domains. Successful interdisciplinary collaborations along the genomics pipeline extending from laboratory experiments to bioinformatic analyses to clinical application are notable in large scale, well managed projects such as The Cancer Genome Atlas. However, in certain settings in which the various experts perform their specialized research activities in isolation, the siloed approach to their research contributes to the generation of questionable genomic interpretations. Such situations are particularly concerning when the ultimate endpoint involves genetic/genomic interpretations that are intended for clinical applications. In spite of the fact that clinicians express interest in gaining a better understanding of clinical genomic applications, the lack of communication from upstream experts leaves them with a serious level of discomfort in applying such genomic knowledge to patient care. This discomfort is especially evident among healthcare providers who are not trained as geneticists, in particular primary care physicians. We offer some initiatives that have potential to address this problem, with emphasis on improved and ongoing communication among all the experts in these fields, constituting a comprehensive genomic "pipeline" from laboratory to patient.
Collapse
Affiliation(s)
- Daoud Meerzaman
- NCI, Center for Biomedical Informatics and Information Technology, Bethesda, Maryland.
| | | |
Collapse
|
21
|
Identification of new putative driver mutations and predictors of disease evolution in chronic lymphocytic leukemia. Blood Cancer J 2019; 9:78. [PMID: 31570692 PMCID: PMC6769000 DOI: 10.1038/s41408-019-0243-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2019] [Revised: 09/04/2019] [Accepted: 09/17/2019] [Indexed: 12/21/2022] Open
|
22
|
Ho AS, Ochoa A, Jayakumaran G, Zehir A, Valero Mayor C, Tepe J, Makarov V, Dalin MG, He J, Bailey M, Montesion M, Ross JS, Miller VA, Chan L, Ganly I, Dogan S, Katabi N, Tsipouras P, Ha P, Agrawal N, Solit DB, Futreal PA, El Naggar AK, Reis-Filho JS, Weigelt B, Ho AL, Schultz N, Chan TA, Morris LG. Genetic hallmarks of recurrent/metastatic adenoid cystic carcinoma. J Clin Invest 2019. [DOI: 10.1172/jci128227 pmid:314832902019-10-01]] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
|
23
|
Woo XY, Srivastava A, Graber JH, Yadav V, Sarsani VK, Simons A, Beane G, Grubb S, Ananda G, Liu R, Stafford G, Chuang JH, Airhart SD, Karuturi RKM, George J, Bult CJ. Genomic data analysis workflows for tumors from patient-derived xenografts (PDXs): challenges and guidelines. BMC Med Genomics 2019; 12:92. [PMID: 31262303 PMCID: PMC6604205 DOI: 10.1186/s12920-019-0551-2] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2019] [Accepted: 06/17/2019] [Indexed: 12/26/2022] Open
Abstract
BACKGROUND Patient-derived xenograft (PDX) models are in vivo models of human cancer that have been used for translational cancer research and therapy selection for individual patients. The Jackson Laboratory (JAX) PDX resource comprises 455 models originating from 34 different primary sites (as of 05/08/2019). The models undergo rigorous quality control and are genomically characterized to identify somatic mutations, copy number alterations, and transcriptional profiles. Bioinformatics workflows for analyzing genomic data obtained from human tumors engrafted in a mouse host (i.e., Patient-Derived Xenografts; PDXs) must address challenges such as discriminating between mouse and human sequence reads and accurately identifying somatic mutations and copy number alterations when paired non-tumor DNA from the patient is not available for comparison. RESULTS We report here data analysis workflows and guidelines that address these challenges and achieve reliable identification of somatic mutations, copy number alterations, and transcriptomic profiles of tumors from PDX models that lack genomic data from paired non-tumor tissue for comparison. Our workflows incorporate commonly used software and public databases but are tailored to address the specific challenges of PDX genomics data analysis through parameter tuning and customized data filters and result in improved accuracy for the detection of somatic alterations in PDX models. We also report a gene expression-based classifier that can identify EBV-transformed tumors. We validated our analytical approaches using data simulations and demonstrated the overall concordance of the genomic properties of xenograft tumors with data from primary human tumors in The Cancer Genome Atlas (TCGA). CONCLUSIONS The analysis workflows that we have developed to accurately predict somatic profiles of tumors from PDX models that lack normal tissue for comparison enable the identification of the key oncogenic genomic and expression signatures to support model selection and/or biomarker development in therapeutic studies. A reference implementation of our analysis recommendations is available at https://github.com/TheJacksonLaboratory/PDX-Analysis-Workflows .
Collapse
Affiliation(s)
- Xing Yi Woo
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, 06030, USA
| | - Anuj Srivastava
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, 06030, USA
| | - Joel H Graber
- MDI Biological Laboratory, Bar Harbor, ME, 04609, USA
| | - Vinod Yadav
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, 06030, USA
- Present Address: Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
| | - Vishal Kumar Sarsani
- The Jackson Laboratory for Mammalian Genetics, Bar Harbor, ME, 04609, USA
- Present Address: University of Massachusetts, Amherst, MA, 01003, USA
| | - Al Simons
- The Jackson Laboratory for Mammalian Genetics, Bar Harbor, ME, 04609, USA
| | - Glen Beane
- The Jackson Laboratory for Mammalian Genetics, Bar Harbor, ME, 04609, USA
| | - Stephen Grubb
- The Jackson Laboratory for Mammalian Genetics, Bar Harbor, ME, 04609, USA
| | - Guruprasad Ananda
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, 06030, USA
| | - Rangjiao Liu
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, 06030, USA
- Present Address: Novogene Corporation, Rockville, MD, 20850, USA
| | - Grace Stafford
- The Jackson Laboratory for Mammalian Genetics, Bar Harbor, ME, 04609, USA
| | - Jeffrey H Chuang
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, 06030, USA
| | - Susan D Airhart
- The Jackson Laboratory for Mammalian Genetics, Bar Harbor, ME, 04609, USA
| | | | - Joshy George
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, 06030, USA.
| | - Carol J Bult
- The Jackson Laboratory for Mammalian Genetics, Bar Harbor, ME, 04609, USA.
| |
Collapse
|
24
|
Singer J, Irmisch A, Ruscheweyh HJ, Singer F, Toussaint NC, Levesque MP, Stekhoven DJ, Beerenwinkel N. Bioinformatics for precision oncology. Brief Bioinform 2019; 20:778-788. [PMID: 29272324 PMCID: PMC6585151 DOI: 10.1093/bib/bbx143] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2017] [Revised: 09/29/2017] [Indexed: 12/13/2022] Open
Abstract
Molecular profiling of tumor biopsies plays an increasingly important role not only in cancer research, but also in the clinical management of cancer patients. Multi-omics approaches hold the promise of improving diagnostics, prognostics and personalized treatment. To deliver on this promise of precision oncology, appropriate bioinformatics methods for managing, integrating and analyzing large and complex data are necessary. Here, we discuss the specific requirements of bioinformatics methods and software that arise in the setting of clinical oncology, owing to a stricter regulatory environment and the need for rapid, highly reproducible and robust procedures. We describe the workflow of a molecular tumor board and the specific bioinformatics support that it requires, from the primary analysis of raw molecular profiling data to the automatic generation of a clinical report and its delivery to decision-making clinical oncologists. Such workflows have to various degrees been implemented in many clinical trials, as well as in molecular tumor boards at specialized cancer centers and university hospitals worldwide. We review these and more recent efforts to include other high-dimensional multi-omics patient profiles into the tumor board, as well as the state of clinical decision support software to translate molecular findings into treatment recommendations.
Collapse
Affiliation(s)
- Jochen Singer
- Department of Biosystems Science and Engineering of ETH Zurich in Basel, Switzerland
| | - Anja Irmisch
- Department of Dermatology at the University of Zurich Hospital in Zurich, Switzerland
| | | | | | | | | | | | - Niko Beerenwinkel
- Department of Biosystems Science and Engineering of ETH Zurich in Basel, Switzerland
| |
Collapse
|
25
|
Suwinski P, Ong C, Ling MHT, Poh YM, Khan AM, Ong HS. Advancing Personalized Medicine Through the Application of Whole Exome Sequencing and Big Data Analytics. Front Genet 2019; 10:49. [PMID: 30809243 PMCID: PMC6379253 DOI: 10.3389/fgene.2019.00049] [Citation(s) in RCA: 103] [Impact Index Per Article: 20.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2018] [Accepted: 01/21/2019] [Indexed: 12/11/2022] Open
Abstract
There is a growing attention toward personalized medicine. This is led by a fundamental shift from the ‘one size fits all’ paradigm for treatment of patients with conditions or predisposition to diseases, to one that embraces novel approaches, such as tailored target therapies, to achieve the best possible outcomes. Driven by these, several national and international genome projects have been initiated to reap the benefits of personalized medicine. Exome and targeted sequencing provide a balance between cost and benefit, in contrast to whole genome sequencing (WGS). Whole exome sequencing (WES) targets approximately 3% of the whole genome, which is the basis for protein-coding genes. Nonetheless, it has the characteristics of big data in large deployment. Herein, the application of WES and its relevance in advancing personalized medicine is reviewed. WES is mapped to Big Data “10 Vs” and the resulting challenges discussed. Application of existing biological databases and bioinformatics tools to address the bottleneck in data processing and analysis are presented, including the need for new generation big data analytics for the multi-omics challenges of personalized medicine. This includes the incorporation of artificial intelligence (AI) in the clinical utility landscape of genomic information, and future consideration to create a new frontier toward advancing the field of personalized medicine.
Collapse
Affiliation(s)
- Pawel Suwinski
- Malaysian Genomics Resource Centre Berhad, Kuala Lumpur, Malaysia
| | - ChuangKee Ong
- Centre for Bioinformatics, School of Data Sciences, Perdana University, Serdang, Malaysia.,Centre of Genomics Research, Precision Medicine and Genomics, AstraZeneca UK Limited, London, United Kingdom
| | - Maurice H T Ling
- Centre for Bioinformatics, School of Data Sciences, Perdana University, Serdang, Malaysia
| | - Yang Ming Poh
- Centre for Bioinformatics, School of Data Sciences, Perdana University, Serdang, Malaysia
| | - Asif M Khan
- Centre for Bioinformatics, School of Data Sciences, Perdana University, Serdang, Malaysia.,Graduate School of Medicine, Perdana University, Serdang, Malaysia
| | - Hui San Ong
- Centre for Bioinformatics, School of Data Sciences, Perdana University, Serdang, Malaysia
| |
Collapse
|
26
|
Abstract
Somatic structural variants undoubtedly play important roles in driving tumourigenesis. This is evident despite the substantial technical challenges that remain in accurately detecting structural variants and their breakpoints in tumours and in spite of our incomplete understanding of the impact of structural variants on cellular function. Developments in these areas of research contribute to the ongoing discovery of structural variation with a clear impact on the evolution of the tumour and on the clinical importance to the patient. Recent large whole genome sequencing studies have reinforced our impression of each tumour as a unique combination of mutations but paradoxically have also discovered similar genome-wide patterns of single-nucleotide and structural variation between tumours. Statistical methods have been developed to deconvolute mutation patterns, or signatures, that recur across samples, providing information about the mutagens and repair processes that may be active in a given tumour. These signatures can guide treatment by, for example, highlighting vulnerabilities in a particular tumour to a particular chemotherapy. Thus, although the complete reconstruction of the full evolutionary trajectory of a tumour genome remains currently out of reach, valuable data are already emerging to improve the treatment of cancer.
Collapse
Affiliation(s)
- Ailith Ewing
- MRC Human Genetics Unit, MRC Institute of Genetics and Molecular Medicine, The University of Edinburgh, Western General Hospital, Crewe Road, Edinburgh, EH42XU, UK
| | - Colin Semple
- MRC Human Genetics Unit, MRC Institute of Genetics and Molecular Medicine, The University of Edinburgh, Western General Hospital, Crewe Road, Edinburgh, EH42XU, UK
| |
Collapse
|
27
|
Bian X, Zhu B, Wang M, Hu Y, Chen Q, Nguyen C, Hicks B, Meerzaman D. Comparing the performance of selected variant callers using synthetic data and genome segmentation. BMC Bioinformatics 2018; 19:429. [PMID: 30453880 PMCID: PMC6245711 DOI: 10.1186/s12859-018-2440-7] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2018] [Accepted: 10/19/2018] [Indexed: 11/24/2022] Open
Abstract
BACKGROUND High-throughput sequencing has rapidly become an essential part of precision cancer medicine. But validating results obtained from analyzing and interpreting genomic data remains a rate-limiting factor. The gold standard, of course, remains manual validation by expert panels, which is not without its weaknesses, namely high costs in both funding and time as well as the necessarily selective nature of manual validation. But it may be possible to develop more economical, complementary means of validation. In this study we employed four synthetic data sets (variants with known mutations spiked into specific genomic locations) of increasing complexity to assess the sensitivity, specificity, and balanced accuracy of five open-source variant callers: FreeBayes v1.0, VarDict v11.5.1, MuTect v1.1.7, MuTect2, and MuSE v1.0rc. FreeBayes, VarDict, and MuTect were run in bcbio-next gen, and the results were integrated into a single Ensemble call set. The known mutations provided a level of "ground truth" against which we evaluated variant-caller performance. We further facilitated the comparison and evaluation by segmenting the whole genome into 10,000,000 base-pair fragments which yielded 316 segments. RESULTS Differences among the numbers of true positives were small among the callers, but the numbers of false positives varied much more when the tools were used to analyze sets one through three. Both FreeBayes and VarDict produced strikingly more false positives than did the others, although VarDict, somewhat paradoxically also produced the highest number of true positives. The Ensemble approach yielded results characterized by higher specificity and balanced accuracy and fewer false positives than did any of the five tools used alone. Sensitivity and specificity, however, declined for all five callers as the complexity of the data sets increased, but we did not uncover anything more than limited, weak correlations between caller performance and certain DNA structural features: gene density and guanine-cytosine content. Altogether, MuTect2 performed the best among the callers tested, followed by MuSE and MuTect. CONCLUSIONS Spiking data sets with specific mutations -single-nucleotide variations (SNVs), single-nucleotide polymorphisms (SNPs), or structural variations (SVs) in this study-at known locations in the genome provides an effective and economical way to compare data analyzed by variant callers with ground truth. The method constitutes a viable alternative to the prolonged, expensive, and noncomprehensive assessment by expert panels. It should be further developed and refined, as should other comparatively "lightweight" methods of assessing accuracy. Given that the scientific community has not yet established gold standards for validating NGS-related technologies such as variant callers, developing multiple alternative means for verifying variant-caller accuracy will eventually lead to the establishment of higher-quality standards than could be achieved by prematurely limiting the range of innovative methods explored by members of the community.
Collapse
Affiliation(s)
- Xiaopeng Bian
- Center for Biomedical Informatics and Information Technology, National Cancer Institute, Rockville, MD, 20850, USA.
| | - Bin Zhu
- Cancer Genomics Research Laboratory(CGR), Division of Cancer Epidemiology and Genetics, Frederick National Laboratory for Cancer Research sponsored by the National Cancer Institute, 8717 Grovemont Circle, Gaithersburg, MD, 20877, USA
| | - Mingyi Wang
- Cancer Genomics Research Laboratory(CGR), Division of Cancer Epidemiology and Genetics, Frederick National Laboratory for Cancer Research sponsored by the National Cancer Institute, 8717 Grovemont Circle, Gaithersburg, MD, 20877, USA
| | - Ying Hu
- Center for Biomedical Informatics and Information Technology, National Cancer Institute, Rockville, MD, 20850, USA
| | - Qingrong Chen
- Center for Biomedical Informatics and Information Technology, National Cancer Institute, Rockville, MD, 20850, USA
| | - Cu Nguyen
- Center for Biomedical Informatics and Information Technology, National Cancer Institute, Rockville, MD, 20850, USA
| | - Belynda Hicks
- Cancer Genomics Research Laboratory(CGR), Division of Cancer Epidemiology and Genetics, Frederick National Laboratory for Cancer Research sponsored by the National Cancer Institute, 8717 Grovemont Circle, Gaithersburg, MD, 20877, USA
| | - Daoud Meerzaman
- Center for Biomedical Informatics and Information Technology, National Cancer Institute, Rockville, MD, 20850, USA.
| |
Collapse
|
28
|
Singer F, Irmisch A, Toussaint NC, Grob L, Singer J, Thurnherr T, Beerenwinkel N, Levesque MP, Dummer R, Quagliata L, Rothschild SI, Wicki A, Beisel C, Stekhoven DJ. SwissMTB: establishing comprehensive molecular cancer diagnostics in Swiss clinics. BMC Med Inform Decis Mak 2018; 18:89. [PMID: 30373609 PMCID: PMC6206832 DOI: 10.1186/s12911-018-0680-0] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2018] [Accepted: 10/18/2018] [Indexed: 12/18/2022] Open
Abstract
Background Molecular precision oncology is an emerging practice to improve cancer therapy by decreasing the risk of choosing treatments that lack efficacy or cause adverse events. However, the challenges of integrating molecular profiling into routine clinical care are manifold. From a computational perspective these include the importance of a short analysis turnaround time, the interpretation of complex drug-gene and gene-gene interactions, and the necessity of standardized high-quality workflows. In addition, difficulties faced when integrating molecular diagnostics into clinical practice are ethical concerns, legal requirements, and limited availability of treatment options beyond standard of care as well as the overall lack of awareness of their existence. Methods To the best of our knowledge, we are the first group in Switzerland that established a workflow for personalized diagnostics based on comprehensive high-throughput sequencing of tumors at the clinic. Our workflow, named SwissMTB (Swiss Molecular Tumor Board), links genetic tumor alterations and gene expression to therapeutic options and clinical trial opportunities. The resulting treatment recommendations are summarized in a clinical report and discussed in a molecular tumor board at the clinic to support therapy decisions. Results Here we present results from an observational pilot study including 22 late-stage cancer patients. In this study we were able to identify actionable variants and corresponding therapies for 19 patients. Half of the patients were analyzed retrospectively. In two patients we identified resistance-associated variants explaining lack of therapy response. For five out of eleven patients analyzed before treatment the SwissMTB diagnostic influenced treatment decision. Conclusions SwissMTB enables the analysis and clinical interpretation of large numbers of potentially actionable molecular targets. Thus, our workflow paves the way towards a more frequent use of comprehensive molecular diagnostics in Swiss hospitals. Electronic supplementary material The online version of this article (10.1186/s12911-018-0680-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Franziska Singer
- NEXUS Personalized Health Technologies, ETH Zurich, Otto-Stern-Weg 7, 8093, Zurich, Switzerland.,SIB Swiss Institute of Bioinformatics, 4058, Basel, Switzerland
| | - Anja Irmisch
- Department of Dermatology, University Hospital Zurich, 8091, Zurich, Switzerland
| | - Nora C Toussaint
- NEXUS Personalized Health Technologies, ETH Zurich, Otto-Stern-Weg 7, 8093, Zurich, Switzerland.,SIB Swiss Institute of Bioinformatics, 4058, Basel, Switzerland
| | - Linda Grob
- NEXUS Personalized Health Technologies, ETH Zurich, Otto-Stern-Weg 7, 8093, Zurich, Switzerland.,SIB Swiss Institute of Bioinformatics, 4058, Basel, Switzerland
| | - Jochen Singer
- SIB Swiss Institute of Bioinformatics, 4058, Basel, Switzerland.,Department of Biosystems Science and Engineering, ETH Zurich, 4058, Basel, Switzerland
| | - Thomas Thurnherr
- SIB Swiss Institute of Bioinformatics, 4058, Basel, Switzerland.,Department of Biosystems Science and Engineering, ETH Zurich, 4058, Basel, Switzerland
| | - Niko Beerenwinkel
- SIB Swiss Institute of Bioinformatics, 4058, Basel, Switzerland.,Department of Biosystems Science and Engineering, ETH Zurich, 4058, Basel, Switzerland
| | - Mitchell P Levesque
- Department of Dermatology, University Hospital Zurich, 8091, Zurich, Switzerland
| | - Reinhard Dummer
- Department of Dermatology, University Hospital Zurich, 8091, Zurich, Switzerland
| | - Luca Quagliata
- Department of Pathology, University Hospital Basel, Schönbeinstrasse 40, 4056, Basel, Switzerland
| | - Sacha I Rothschild
- Division of Oncology, Department of Biomedicine, University Hospital Basel, Petersgraben 4, 4031, Basel, Switzerland
| | - Andreas Wicki
- Division of Oncology, Department of Biomedicine, University Hospital Basel, Petersgraben 4, 4031, Basel, Switzerland
| | - Christian Beisel
- Department of Biosystems Science and Engineering, ETH Zurich, 4058, Basel, Switzerland
| | - Daniel J Stekhoven
- NEXUS Personalized Health Technologies, ETH Zurich, Otto-Stern-Weg 7, 8093, Zurich, Switzerland. .,SIB Swiss Institute of Bioinformatics, 4058, Basel, Switzerland.
| |
Collapse
|
29
|
Cooper CI, Yao D, Sendorek DH, Yamaguchi TN, P'ng C, Houlahan KE, Caloian C, Fraser M, Ellrott K, Margolin AA, Bristow RG, Stuart JM, Boutros PC. Valection: design optimization for validation and verification studies. BMC Bioinformatics 2018; 19:339. [PMID: 30253747 PMCID: PMC6157051 DOI: 10.1186/s12859-018-2391-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2018] [Accepted: 09/19/2018] [Indexed: 01/09/2023] Open
Abstract
Background Platform-specific error profiles necessitate confirmatory studies where predictions made on data generated using one technology are additionally verified by processing the same samples on an orthogonal technology. However, verifying all predictions can be costly and redundant, and testing a subset of findings is often used to estimate the true error profile. Results To determine how to create subsets of predictions for validation that maximize accuracy of global error profile inference, we developed Valection, a software program that implements multiple strategies for the selection of verification candidates. We evaluated these selection strategies on one simulated and two experimental datasets. Conclusions Valection is implemented in multiple programming languages, available at: http://labs.oicr.on.ca/boutros-lab/software/valection Electronic supplementary material The online version of this article (10.1186/s12859-018-2391-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Christopher I Cooper
- Ontario Institute for Cancer Research, 661 University Avenue, Suite 510, Toronto, Ontario, M5G 0A3, Canada
| | - Delia Yao
- Ontario Institute for Cancer Research, 661 University Avenue, Suite 510, Toronto, Ontario, M5G 0A3, Canada
| | - Dorota H Sendorek
- Ontario Institute for Cancer Research, 661 University Avenue, Suite 510, Toronto, Ontario, M5G 0A3, Canada
| | - Takafumi N Yamaguchi
- Ontario Institute for Cancer Research, 661 University Avenue, Suite 510, Toronto, Ontario, M5G 0A3, Canada
| | - Christine P'ng
- Ontario Institute for Cancer Research, 661 University Avenue, Suite 510, Toronto, Ontario, M5G 0A3, Canada
| | - Kathleen E Houlahan
- Ontario Institute for Cancer Research, 661 University Avenue, Suite 510, Toronto, Ontario, M5G 0A3, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, Canada
| | - Cristian Caloian
- Ontario Institute for Cancer Research, 661 University Avenue, Suite 510, Toronto, Ontario, M5G 0A3, Canada
| | - Michael Fraser
- Princess Margaret Cancer Centre, University Health Network, Toronto, Canada
| | | | - Kyle Ellrott
- Computational Biology Program, Oregon Health & Science University, Portland, OR, USA.,Department of Biomedical Engineering, Oregon Health & Science University, Portland, OR, USA.,Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Adam A Margolin
- Computational Biology Program, Oregon Health & Science University, Portland, OR, USA.,Department of Biomedical Engineering, Oregon Health & Science University, Portland, OR, USA.,Sage Bionetworks, Seattle, WA, USA
| | - Robert G Bristow
- Department of Medical Biophysics, University of Toronto, Toronto, Canada.,Princess Margaret Cancer Centre, University Health Network, Toronto, Canada
| | - Joshua M Stuart
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Paul C Boutros
- Ontario Institute for Cancer Research, 661 University Avenue, Suite 510, Toronto, Ontario, M5G 0A3, Canada. .,Department of Medical Biophysics, University of Toronto, Toronto, Canada. .,Department of Pharmacology & Toxicology, University of Toronto, Toronto, Canada. .,Departments of Human Genetics & Urology, University of California, Los Angeles, USA. .,Jonsson Comprehensive Cancer Centre, University of California, Los Angeles, USA. .,Institute for Precision Health, University of California, Los Angeles, USA.
| |
Collapse
|
30
|
Singer J, Ruscheweyh HJ, Hofmann AL, Thurnherr T, Singer F, Toussaint NC, Ng CKY, Piscuoglio S, Beisel C, Christofori G, Dummer R, Hall MN, Krek W, Levesque MP, Manz MG, Moch H, Papassotiropoulos A, Stekhoven DJ, Wild P, Wüst T, Rinn B, Beerenwinkel N. NGS-pipe: a flexible, easily extendable and highly configurable framework for NGS analysis. Bioinformatics 2018; 34:107-108. [PMID: 28968639 PMCID: PMC5870795 DOI: 10.1093/bioinformatics/btx540] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2017] [Accepted: 08/26/2017] [Indexed: 01/07/2023] Open
Abstract
Motivation Next-generation sequencing is now an established method in genomics, and massive amounts of sequencing data are being generated on a regular basis. Analysis of the sequencing data is typically performed by lab-specific in-house solutions, but the agreement of results from different facilities is often small. General standards for quality control, reproducibility and documentation are missing. Results We developed NGS-pipe, a flexible, transparent and easy-to-use framework for the design of pipelines to analyze whole-exome, whole-genome and transcriptome sequencing data. NGS-pipe facilitates the harmonization of genomic data analysis by supporting quality control, documentation, reproducibility, parallelization and easy adaptation to other NGS experiments. Availability and implementation https://github.com/cbg-ethz/NGS-pipe
Collapse
Affiliation(s)
- Jochen Singer
- Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Hans-Joachim Ruscheweyh
- Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Basel, Switzerland.,Scientific IT Services, ETH Zurich, Basel, Switzerland
| | - Ariane L Hofmann
- Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Thomas Thurnherr
- Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Franziska Singer
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland.,NEXUS Personalized Health Technologies, Zurich, Switzerland
| | - Nora C Toussaint
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland.,NEXUS Personalized Health Technologies, Zurich, Switzerland
| | - Charlotte K Y Ng
- Department of Biomedicine, University of Basel, Basel, Switzerland.,Institute of Pathology.,Division of Gastroenterology and Hepatology, University Hospital Basel, Basel, Switzerland
| | | | - Christian Beisel
- Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland
| | | | - Reinhard Dummer
- Department of Dermatology, University Hospital Zurich, Zurich, Switzerland
| | | | - Wilhelm Krek
- Institute for Molecular Health Sciences, ETH Zurich, Zurich, Switzerland
| | | | - Markus G Manz
- Division of Hematology, University Hospital Zurich, Zurich, Switzerland
| | - Holger Moch
- Institute of Pathology and Molecular Pathology, University Hospital Zurich, Zurich, Switzerland
| | - Andreas Papassotiropoulos
- Division of Molecular Neuroscience, Department of Psychology.,Transfaculty Research Platform Molecular and Cognitive Neurosciences.,Psychiatric University Clinics University of Basel, Basel, Switzerland.,Department Biozentrum, Life Sciences Training Facility, University of Basel, Basel, Switzerland
| | - Daniel J Stekhoven
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland.,NEXUS Personalized Health Technologies, Zurich, Switzerland
| | - Peter Wild
- Institute of Pathology and Molecular Pathology, University Hospital Zurich, Zurich, Switzerland
| | - Thomas Wüst
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland.,Scientific IT Services, ETH Zurich, Basel, Switzerland
| | - Bernd Rinn
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland.,Scientific IT Services, ETH Zurich, Basel, Switzerland
| | - Niko Beerenwinkel
- Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| |
Collapse
|
31
|
Said Mohammed K, Kibinge N, Prins P, Agoti CN, Cotten M, Nokes D, Brand S, Githinji G. Evaluating the performance of tools used to call minority variants from whole genome short-read data. Wellcome Open Res 2018; 3:21. [PMID: 30483597 PMCID: PMC6234735 DOI: 10.12688/wellcomeopenres.13538.2] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/03/2018] [Indexed: 01/06/2023] Open
Abstract
Background: High-throughput whole genome sequencing facilitates investigation of minority virus sub-populations from virus positive samples. Minority variants are useful in understanding within and between host diversity, population dynamics and can potentially assist in elucidating person-person transmission pathways. Several minority variant callers have been developed to describe low frequency sub-populations from whole genome sequence data. These callers differ based on bioinformatics and statistical methods used to discriminate sequencing errors from low-frequency variants. Methods: We evaluated the diagnostic performance and concordance between published minority variant callers used in identifying minority variants from whole-genome sequence data from virus samples. We used the ART-Illumina read simulation tool to generate three artificial short-read datasets of varying coverage and error profiles from an RSV reference genome. The datasets were spiked with nucleotide variants at predetermined positions and frequencies. Variants were called using FreeBayes, LoFreq, Vardict, and VarScan2. The variant callers' agreement in identifying known variants was quantified using two measures; concordance accuracy and the inter-caller concordance. Results: The variant callers reported differences in identifying minority variants from the datasets. Concordance accuracy and inter-caller concordance were positively correlated with sample coverage. FreeBayes identified the majority of variants although it was characterised by variable sensitivity and precision in addition to a high false positive rate relative to the other minority variant callers and which varied with sample coverage. LoFreq was the most conservative caller. Conclusions: We conducted a performance and concordance evaluation of four minority variant calling tools used to identify and quantify low frequency variants. Inconsistency in the quality of sequenced samples impacts on sensitivity and accuracy of minority variant callers. Our study suggests that combining at least three tools when identifying minority variants is useful in filtering errors when calling low frequency variants.
Collapse
Affiliation(s)
- Khadija Said Mohammed
- Pwani University, Kilifi, Kenya
- KEMRI-Wellcome Trust Research Programme, KEMRI Centre for Geographic Medicine Research – Coast, Kilifi, Kenya
| | - Nelson Kibinge
- KEMRI-Wellcome Trust Research Programme, KEMRI Centre for Geographic Medicine Research – Coast, Kilifi, Kenya
| | - Pjotr Prins
- KEMRI-Wellcome Trust Research Programme, KEMRI Centre for Geographic Medicine Research – Coast, Kilifi, Kenya
- University Medical Center Utrecht, Utrecht, The Netherlands
| | - Charles N. Agoti
- Pwani University, Kilifi, Kenya
- KEMRI-Wellcome Trust Research Programme, KEMRI Centre for Geographic Medicine Research – Coast, Kilifi, Kenya
| | - Matthew Cotten
- Virosciences Department, Erasmus Medical Centre, Rotterdam, The Netherlands
| | - D.J. Nokes
- KEMRI-Wellcome Trust Research Programme, KEMRI Centre for Geographic Medicine Research – Coast, Kilifi, Kenya
- School of Life Sciences and Zeeman Institute (SBIDER), University of Warwick, Coventry, UK
| | - Samuel Brand
- School of Life Sciences and Zeeman Institute (SBIDER), University of Warwick, Coventry, UK
| | - George Githinji
- KEMRI-Wellcome Trust Research Programme, KEMRI Centre for Geographic Medicine Research – Coast, Kilifi, Kenya
| |
Collapse
|
32
|
Said Mohammed K, Kibinge N, Prins P, Agoti CN, Cotten M, Nokes D, Brand S, Githinji G. Evaluating the performance of tools used to call minority variants from whole genome short-read data. Wellcome Open Res 2018; 3:21. [PMID: 30483597 PMCID: PMC6234735 DOI: 10.12688/wellcomeopenres.13538.1] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/06/2018] [Indexed: 01/11/2023] Open
Abstract
Background: High-throughput whole genome sequencing facilitates investigation of minority sub-populations from virus positive samples. Minority variants are useful in understanding within and between host diversity, population dynamics and can potentially help to elucidate person-person transmission chains. Several minority variant callers have been developed to describe the minority variants sub-populations from whole genome sequence data. However, they differ on bioinformatics and statistical approaches used to discriminate sequencing errors from low-frequency variants. Methods: We evaluated the diagnostic performance and concordance between published minority variant callers used in identifying minority variants from whole-genome sequence data. The ART-Illumina read simulation tool was used to generate three artificial short-read datasets of varying coverage and error profiles from an RSV reference genome. The datasets were spiked with nucleotide variants at predetermined positions and frequencies. Variants were called using FreeBayes, LoFreq, Vardict, and VarScan2. The variant callers' agreement in identifying known variants was quantified using two measures; concordance accuracy and the inter-caller concordance. Results: The variant callers reported differences in identifying minority variants from the datasets. Concordance accuracy and inter-caller concordance were positively correlated with sample coverage. FreeBayes identified majority of the variants although it was characterised by variable sensitivity and precision in addition to a high false positive rate relative to the other minority variant callers and which varied with sample coverage. LoFreq was the most conservative caller. Conclusions: We conducted a performance and concordance evaluation of four minority variant calling tools used to identify and quantify low frequency variants. Inconsistency in the quality of sequenced samples impact on sensitivity and accuracy of minority variant callers. Our study suggests that combining at least three tools when identifying minority variants is useful in filtering errors when calling low frequency variants.
Collapse
Affiliation(s)
- Khadija Said Mohammed
- Pwani University, Kilifi, Kenya
- KEMRI-Wellcome Trust Research Programme, KEMRI Centre for Geographic Medicine Research – Coast, Kilifi, Kenya
| | - Nelson Kibinge
- KEMRI-Wellcome Trust Research Programme, KEMRI Centre for Geographic Medicine Research – Coast, Kilifi, Kenya
| | - Pjotr Prins
- KEMRI-Wellcome Trust Research Programme, KEMRI Centre for Geographic Medicine Research – Coast, Kilifi, Kenya
- University Medical Center Utrecht, Utrecht, The Netherlands
| | - Charles N. Agoti
- Pwani University, Kilifi, Kenya
- KEMRI-Wellcome Trust Research Programme, KEMRI Centre for Geographic Medicine Research – Coast, Kilifi, Kenya
| | - Matthew Cotten
- Virosciences Department, Erasmus Medical Centre, Rotterdam, The Netherlands
| | - D.J. Nokes
- KEMRI-Wellcome Trust Research Programme, KEMRI Centre for Geographic Medicine Research – Coast, Kilifi, Kenya
- School of Life Sciences and Zeeman Institute (SBIDER), University of Warwick, Coventry, UK
| | - Samuel Brand
- School of Life Sciences and Zeeman Institute (SBIDER), University of Warwick, Coventry, UK
| | - George Githinji
- KEMRI-Wellcome Trust Research Programme, KEMRI Centre for Geographic Medicine Research – Coast, Kilifi, Kenya
| |
Collapse
|
33
|
González S, Volkova N, Beer P, Gerstung M. Immuno-oncology from the perspective of somatic evolution. Semin Cancer Biol 2017; 52:75-85. [PMID: 29223477 DOI: 10.1016/j.semcancer.2017.12.001] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2017] [Revised: 11/29/2017] [Accepted: 12/05/2017] [Indexed: 12/30/2022]
Abstract
The past years have witnessed significant success for cancer immunotherapies that activate a patient's immune system against their cancer cells. At the same time our understanding of the genetic changes driving tumor evolution have progressed dramatically. The study of cancer genomes has shown that tumors are best understood as cell populations governed by the rules of evolution, leading to the emergence and spread of cell lineages with pathogenic mutations. Moreover, somatic evolution can explain the acquisition of mutations conferring drug resistance in the ever-lasting battle for reaching even fitter cell states. Here, we review the current state of the art of somatic cancer evolution and mechanisms of immune control and escape. We also revisit the principles of immunotherapy from the perspective of somatic evolution and discuss the basic rules of resistance to immunotherapies as dictated by evolution.
Collapse
Affiliation(s)
- Santiago González
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK
| | - Nadezda Volkova
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK
| | - Philip Beer
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SA, UK.
| | - Moritz Gerstung
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK.
| |
Collapse
|
34
|
Worthey EA. Analysis and Annotation of Whole-Genome or Whole-Exome Sequencing Derived Variants for Clinical Diagnosis. ACTA ACUST UNITED AC 2017; 95:9.24.1-9.24.28. [PMID: 29044471 DOI: 10.1002/cphg.49] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Over the last 10 years, next-generation sequencing (NGS) has transformed genomic research through substantial advances in technology and reduction in the cost of sequencing, and also in the systems required for analysis of these large volumes of data. This technology is now being used as a standard molecular diagnostic test in some clinical settings. The advances in sequencing have come so rapidly that the major bottleneck in identification of causal variants is no longer the sequencing or analysis (given access to appropriate tools), but rather clinical interpretation. Interpretation of genetic findings in a complex and ever changing clinical setting is scarcely a new challenge, but the task is increasingly complex in clinical genome-wide sequencing given the dramatic increase in dataset size and complexity. This increase requires application of appropriate interpretation tools, as well as development and application of appropriate methodologies and standard procedures. This unit provides an overview of these items. Specific challenges related to implementation of genome-wide sequencing in a clinical setting are discussed. © 2017 by John Wiley & Sons, Inc.
Collapse
|
35
|
Bohnert R, Vivas S, Jansen G. Comprehensive benchmarking of SNV callers for highly admixed tumor data. PLoS One 2017; 12:e0186175. [PMID: 29020110 PMCID: PMC5636151 DOI: 10.1371/journal.pone.0186175] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2017] [Accepted: 09/26/2017] [Indexed: 12/30/2022] Open
Abstract
Precision medicine attempts to individualize cancer therapy by matching tumor-specific genetic changes with effective targeted therapies. A crucial first step in this process is the reliable identification of cancer-relevant variants, which is considerably complicated by the impurity and heterogeneity of clinical tumor samples. We compared the impact of admixture of non-cancerous cells and low somatic allele frequencies on the sensitivity and precision of 19 state-of-the-art SNV callers. We studied both whole exome and targeted gene panel data and up to 13 distinct parameter configurations for each tool. We found vast differences among callers. Based on our comprehensive analyses we recommend joint tumor-normal calling with MuTect, EBCall or Strelka for whole exome somatic variant calling, and HaplotypeCaller or FreeBayes for whole exome germline calling. For targeted gene panel data on a single tumor sample, LoFreqStar performed best. We further found that tumor impurity and admixture had a negative impact on precision, and in particular, sensitivity in whole exome experiments. At admixture levels of 60% to 90% sometimes seen in pathological biopsies, sensitivity dropped significantly, even when variants were originally present in the tumor at 100% allele frequency. Sensitivity to low-frequency SNVs improved with targeted panel data, but whole exome data allowed more efficient identification of germline variants. Effective somatic variant calling requires high-quality pathological samples with minimal admixture, a consciously selected sequencing strategy, and the appropriate variant calling tool with settings optimized for the chosen type of data.
Collapse
|