1
|
Mochmann LH, Treue D, Bockmayr M, Silva P, Zasada C, Mastrobuoni G, Bayram S, Forbes M, Jurmeister P, Liebig S, Blau O, Schleich K, Splettstoesser B, Nordmann TM, von der Heide EK, Isaakidis K, Schulze V, Busch C, Siddiq H, Schlee C, Hester S, Fransecky L, Neumann M, Kempa S, Klauschen F, Baldus CD. Proteomic profiling reveals ACSS2 facilitating metabolic support in acute myeloid leukemia. Cancer Gene Ther 2024:10.1038/s41417-024-00785-5. [PMID: 38851813 DOI: 10.1038/s41417-024-00785-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Revised: 04/19/2024] [Accepted: 05/16/2024] [Indexed: 06/10/2024]
Abstract
Acute myeloid leukemia (AML) is a heterogeneous disease characterized by genomic aberrations in oncogenes, cytogenetic abnormalities, and an aberrant epigenetic landscape. Nearly 50% of AML cases will relapse with current treatment. A major source of therapy resistance is the interaction of mesenchymal stroma with leukemic cells resulting in therapeutic protection. We aimed to determine pro-survival/anti-apoptotic protein networks involved in the stroma protection of leukemic cells. Proteomic profiling of cultured primary AML (n = 14) with Hs5 stroma cell line uncovered an up-regulation of energy-favorable metabolic proteins. Next, we modulated stroma-induced drug resistance with an epigenetic drug library, resulting in reduced apoptosis with histone deacetylase inhibitor (HDACi) treatment versus other epigenetic modifying compounds. Quantitative phosphoproteomic probing of this effect further revealed a metabolic-enriched phosphoproteome including significant up-regulation of acetyl-coenzyme A synthetase (ACSS2, S30) in leukemia-stroma HDACi treated cocultures compared with untreated monocultures. Validating these findings, we show ACSS2 substrate, acetate, promotes leukemic proliferation, ACSS2 knockout in leukemia cells inhibits leukemic proliferation and ACSS2 knockout in the stroma impairs leukemic metabolic fitness. Finally, we identify ACSS1/ACSS2-high expression AML subtype correlating with poor overall survival. Collectively, this study uncovers the leukemia-stroma phosphoproteome emphasizing a role for ACSS2 in mediating AML growth and drug resistance.
Collapse
Affiliation(s)
- Liliana H Mochmann
- Institute of Pathology, Ludwig-Maximilians-Universität München, Munich, Germany
- Department of Hematology and Oncology, Charité - Universitätsmedizin Berlin, a Corporate Member of Freie Universität Berlin, Humboldt-Universität, and Berlin Institute of Health, Campus Benjamin Franklin, Berlin, Germany
| | - Denise Treue
- Institute of Pathology Berlin, Charité - Universitätsmedizin Berlin, a corporate member of Freie Universität Berlin, Humboldt-Universität, and Berlin Institute of Health, Berlin, Germany
| | - Michael Bockmayr
- Institute of Pathology Berlin, Charité - Universitätsmedizin Berlin, a corporate member of Freie Universität Berlin, Humboldt-Universität, and Berlin Institute of Health, Berlin, Germany
- Department of Pediatric Hematology and Oncology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Patricia Silva
- Department of Hematology and Oncology, Charité - Universitätsmedizin Berlin, a Corporate Member of Freie Universität Berlin, Humboldt-Universität, and Berlin Institute of Health, Campus Benjamin Franklin, Berlin, Germany
| | - Christin Zasada
- Berlin Institute for Medical Systems Biology (BIMSB) at Max Delbruck Center for Molecular Medicine, Berlin, Germany
| | - Guido Mastrobuoni
- Berlin Institute for Medical Systems Biology (BIMSB) at Max Delbruck Center for Molecular Medicine, Berlin, Germany
| | - Safak Bayram
- Berlin Institute for Medical Systems Biology (BIMSB) at Max Delbruck Center for Molecular Medicine, Berlin, Germany
| | - Martin Forbes
- Berlin Institute for Medical Systems Biology (BIMSB) at Max Delbruck Center for Molecular Medicine, Berlin, Germany
| | - Philipp Jurmeister
- Institute of Pathology, Ludwig-Maximilians-Universität München, Munich, Germany
- German Cancer Consortium (DKTK), Partner Site Munich, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Sven Liebig
- Department of Hematology and Oncology, Charité - Universitätsmedizin Berlin, a Corporate Member of Freie Universität Berlin, Humboldt-Universität, and Berlin Institute of Health, Campus Benjamin Franklin, Berlin, Germany
| | - Olga Blau
- Department of Hematology and Oncology, Charité - Universitätsmedizin Berlin, a Corporate Member of Freie Universität Berlin, Humboldt-Universität, and Berlin Institute of Health, Campus Benjamin Franklin, Berlin, Germany
| | - Konstanze Schleich
- Institute of Pathology, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Bianca Splettstoesser
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Bavaria, Germany
| | - Thierry M Nordmann
- Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Bavaria, Germany
| | - Eva K von der Heide
- Department of Hematology and Oncology, Charité - Universitätsmedizin Berlin, a Corporate Member of Freie Universität Berlin, Humboldt-Universität, and Berlin Institute of Health, Campus Benjamin Franklin, Berlin, Germany
| | - Konstandina Isaakidis
- Department of Hematology and Oncology, Charité - Universitätsmedizin Berlin, a Corporate Member of Freie Universität Berlin, Humboldt-Universität, and Berlin Institute of Health, Campus Benjamin Franklin, Berlin, Germany
| | - Veronika Schulze
- Department of Hematology and Oncology, Charité - Universitätsmedizin Berlin, a Corporate Member of Freie Universität Berlin, Humboldt-Universität, and Berlin Institute of Health, Campus Benjamin Franklin, Berlin, Germany
| | - Caroline Busch
- Department of Hematology and Oncology, Charité - Universitätsmedizin Berlin, a Corporate Member of Freie Universität Berlin, Humboldt-Universität, and Berlin Institute of Health, Campus Benjamin Franklin, Berlin, Germany
| | - Hafsa Siddiq
- Department of Hematology and Oncology, Charité - Universitätsmedizin Berlin, a Corporate Member of Freie Universität Berlin, Humboldt-Universität, and Berlin Institute of Health, Campus Benjamin Franklin, Berlin, Germany
| | - Cornelia Schlee
- Department of Hematology and Oncology, Charité - Universitätsmedizin Berlin, a Corporate Member of Freie Universität Berlin, Humboldt-Universität, and Berlin Institute of Health, Campus Benjamin Franklin, Berlin, Germany
| | - Svenja Hester
- Department of Biochemistry, Oxford University, Oxford, UK
| | - Lars Fransecky
- Department of Hematology and Oncology, UKSH, Campus Kiel, Kiel, Germany
| | - Martin Neumann
- Department of Hematology and Oncology, UKSH, Campus Kiel, Kiel, Germany
| | - Stefan Kempa
- Berlin Institute for Medical Systems Biology (BIMSB) at Max Delbruck Center for Molecular Medicine, Berlin, Germany.
| | - Frederick Klauschen
- Institute of Pathology, Ludwig-Maximilians-Universität München, Munich, Germany.
- Institute of Pathology Berlin, Charité - Universitätsmedizin Berlin, a corporate member of Freie Universität Berlin, Humboldt-Universität, and Berlin Institute of Health, Berlin, Germany.
- German Cancer Consortium (DKTK), Partner Site Munich, German Cancer Research Center (DKFZ), Heidelberg, Germany.
| | - Claudia D Baldus
- Department of Hematology and Oncology, UKSH, Campus Kiel, Kiel, Germany.
| |
Collapse
|
2
|
Liu X, Li X, Wang L, Yu K, Wu D, Tao P, Li Y. Pan‑cancer analysis identified ARHGAP23 as a potential biomarker for pancreatic adenocarcinoma. Mol Clin Oncol 2023; 19:100. [PMID: 38022849 PMCID: PMC10666083 DOI: 10.3892/mco.2023.2696] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Accepted: 10/16/2023] [Indexed: 12/01/2023] Open
Abstract
Rho GTPASE-activating protein 23 (ARHGAP23) is known to activate RHO-GTPase and has an important role in the infiltration and metastasis of tumors. Although previous studies suggested its involvement in certain human cancers, its role in pan-cancer remains unclear. In the present study, the expression, prognosis and potential functions of ARHGAP23 in pan-cancer were evaluated through various public databases such as Human Protein Atlas, Tumor IMmune Estimation Resource, Gene Set Co-Expression Analysis, Gene Expression Profiling Interactive Analysis, cBio Cancer Genomics Portal, Tumor-Immune System Interactions Database (TISIDB) and others. Through these data combined with a variety of biological information analysis methods, the potential role of ARHGAP23 as a carcinogenic gene was explored in the present study. The present analysis revealed that ARHGAP23 expressed abnormalities in >10 tumors, which was associated with differences in prognosis. Furthermore, the findings of the present study indicated that ARHGAP23 is associated with DNA methylation and multiple immune cell infiltrations in these tumors. ARHGAP23 expression was related to clinical prognosis, DNA methylation and immune infiltration. These findings support the potential of ARHGAP23 as a prognostic biomarker and a molecular target for cancer treatment.
Collapse
Affiliation(s)
- Xiaolong Liu
- The First School of Clinical Medical, Lanzhou University, Lanzhou, Gansu 730000, P.R. China
| | - Xin Li
- The General Surgery Department, Lanzhou University Second Hospital, Lanzhou, Gansu 730000, P.R. China
| | - Ling Wang
- Department of Pathology, Lanzhou, Gansu 730000, P.R. China
| | - Kaihua Yu
- The First School of Clinical Medical, Lanzhou University, Lanzhou, Gansu 730000, P.R. China
| | - Dean Wu
- The First School of Clinical Medical, Lanzhou University, Lanzhou, Gansu 730000, P.R. China
| | - Pengxian Tao
- Cadre Ward of General Surgery Department, Gansu Provincial Hospital, Lanzhou, Gansu 730000, P.R. China
| | - Yulan Li
- The First School of Clinical Medical, Lanzhou University, Lanzhou, Gansu 730000, P.R. China
| |
Collapse
|
3
|
Angiotensinogen, a promising gene signature for rectum and stomach adenocarcinoma patients. Am J Transl Res 2022; 14:8879-8892. [PMID: 36628228 PMCID: PMC9827296] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2022] [Accepted: 10/24/2022] [Indexed: 01/12/2023]
Abstract
OBJECTIVES Angiotensinogen (AGT), as a component of the renin-angiotensin system (RAS), is one of the major risk factors for cancer development. To date, there has not been a systematic pan-cancer analysis of AGT. METHODS This pan-cancer study comprehensively investigated AGT in 24 different cancers based on the UALCAN, KM plotter, GENT2, HPA, MEXPRESS, cBioportal, STRING, TIMER, and CTD databases. RESULTS The results showed that AGT was highly expressed in most tumors, and AGT overexpression may be related to the worst survival of Rectum adenocarcinoma (READ) and Stomach Adenocarcinoma (STAD) patients only. Furthermore, pathway analysis indicated that AGT-associated genes are involved in six critical pathways. Moreover, the higher expression of AGT was found to be detrimental to the promoter methylation level (P<0.05), immune cells infiltration (P<0.05), and genetic alterations. We have also predicted various chemotherapeutic drugs contributing to the expression regulation of AGT. CONCLUSION Our results together support that AGT is a possible biomarker for READ and STAD.
Collapse
|
4
|
Song Y, Kelava L, Zhang L, Kiss I. Microarray data analysis to identify miRNA biomarkers and construct the lncRNA-miRNA-mRNA network in lung adenocarcinoma. Medicine (Baltimore) 2022; 101:e30393. [PMID: 36086747 PMCID: PMC10980501 DOI: 10.1097/md.0000000000030393] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/06/2021] [Accepted: 07/25/2022] [Indexed: 12/09/2022] Open
Abstract
MicroRNAs (miRNAs), regulatory noncoding RNAs, are involved in gene regulation and may play a role in cancer development. The aim of this study was to identify miRNAs involved in lung adenocarcinoma (LUAD) using bioinformatics analysis. MiRNA (GSE135918), mRNA (GSE136043) and lncRNA (GSE130779) microarray datasets were downloaded from the Gene Expression Omnibus (GEO) database to identify differentially expressed miRNAs (DEMis), mRNAs (DEMs), and lncRNA (DELs) in LUAD. We used DEMs for functional enrichment analysis. MiRNA expression quantification from The Cancer Genome Atlas (TCGA) was used to validate DEMis. LncBase Predicted v.2, Targetscan, and MiRBase were used to predict lncRNAs and mRNAs. The LUAD data in TCGA were used for overall survival (OS) analysis. We screened the downregulation of 8 DEMis and upregulation of 6 DEMis, and found that 70 signal pathways changed. We chose 3 relevant signaling pathways in lung cancer development, WNT, PI3K-Akt, and Notch, and scanned for mRNAs involved in them that are potential targets of these miRNAs. Then a lncRNA-miRNA-mRNA network was constructed. We also found 7 miRNAs that were associated with poor OS in LUAD. Low expression level of hsa-miR-30a was highly associated with poor OS in LUAD (P < .001) and the target genes of hsa-miR-30a-3p were abundant in the Wnt and AKT signaling pathways. In addition, our results reported for the first time that hsa-miR-3944 and hsa-miR-3652 were highly expressed in LUAD. And the high expression level of hsa-miR-3944 was associated with poor OS (P < .05). Hsa-miR-30a-3p may suppress the occurrence and progression of lung cancer through Wnt and AKT signaling pathways and become a good biomarker in LUAD. Hsa-miR-3944 and hsa-miR-3652 may serve as new biomarkers in LUAD.
Collapse
Affiliation(s)
- Yongan Song
- Department of Public Health Medicine, University of Pécs Medical School, Szigeti str 12, Pécs 7624, Hungary
| | - Leonardo Kelava
- Department of Thermophysiology, Institute for Translational Medicine, Medical School, University of Pécs, Szigeti str 12, Pécs 7624, Hungary
| | - Lu Zhang
- Department of Health Science, Doctoral School of Health Science, University of Pécs, Vasvári Pál utca 4, Pécs 7622, Hungary
| | - István Kiss
- Department of Public Health Medicine, University of Pécs Medical School, Szigeti str 12, Pécs 7624, Hungary
| |
Collapse
|
5
|
Casey F, Negi S, Zhu J, Sun YH, Zavodszky M, Cheng D, Lin D, John S, Penny MA, Sexton D, Zhang B. OmicsView: omics data analysis through interactive visual analytics. Comput Struct Biotechnol J 2022; 20:1277-1285. [PMID: 35356547 PMCID: PMC8924308 DOI: 10.1016/j.csbj.2022.02.022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2021] [Revised: 02/19/2022] [Accepted: 02/23/2022] [Indexed: 11/30/2022] Open
Abstract
With advances in NGS technologies, transcriptional profiling of human tissue across many diseases is becoming more routine, leading to the generation of petabytes of data deposited in public repositories. There is a need for bench scientists with little computational expertise to be able to access and mine this data to understand disease pathology, identify robust biomarkers of disease and the effect of interventions (in vivo or in vitro). To this end we release an open source analytics and visualization platform for expression data called OmicsView, http://omicsview.org. This platform comes preloaded with 1000 s of samples across many disease areas and normal tissue, including the GTEx database, all processed with a harmonized pipeline. We demonstrate the power and ease-of-use of the platform by means of a Crohn’s disease data mining exercise where we can quickly uncover disease pathology and identify strong biomarkers of disease and response to treatment.
Collapse
|
6
|
Ma C, Wu M, Ma S. Analysis of cancer omics data: a selective review of statistical techniques. Brief Bioinform 2022; 23:6510158. [PMID: 35039832 DOI: 10.1093/bib/bbab585] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2021] [Revised: 12/19/2021] [Accepted: 12/20/2021] [Indexed: 11/13/2022] Open
Abstract
Cancer is an omics disease. The development in high-throughput profiling has fundamentally changed cancer research and clinical practice. Compared with clinical, demographic and environmental data, the analysis of omics data-which has higher dimensionality, weaker signals and more complex distributional properties-is much more challenging. Developments in the literature are often 'scattered', with individual studies focused on one or a few closely related methods. The goal of this review is to assist cancer researchers with limited statistical expertise in establishing the 'overall framework' of cancer omics data analysis. To facilitate understanding, we mainly focus on intuition, concepts and key steps, and refer readers to the original publications for mathematical details. This review broadly covers unsupervised and supervised analysis, as well as individual-gene-based, gene-set-based and gene-network-based analysis. We also briefly discuss 'special topics' including interaction analysis, multi-datasets analysis and multi-omics analysis.
Collapse
Affiliation(s)
- Chenjin Ma
- College of Statistics and Data Science, Faculty of Science, Beijing University of Technology, Beijing, China
| | - Mengyun Wu
- School of Statistics and Management, Shanghai University of Finance and Economics, Shanghai, China
| | - Shuangge Ma
- Department of Biostatistics, Yale School of Public Health, New Haven, CT, USA
| |
Collapse
|
7
|
Ceol A, Montanari P, Bartolini I, Ceri S, Ciaccia P, Patella M, Masseroli M. Search and comparison of (epi)genomic feature patterns in multiple genome browser tracks. BMC Bioinformatics 2020; 21:464. [PMID: 33076821 PMCID: PMC7574191 DOI: 10.1186/s12859-020-03781-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2020] [Accepted: 09/24/2020] [Indexed: 01/22/2023] Open
Abstract
Background Genome browsers are widely used for locating interesting genomic regions, but their interactive use is obviously limited to inspecting short genomic portions. An ideal interaction is to provide patterns of regions on the browser, and then extract other genomic regions over the whole genome where such patterns occur, ranked by similarity. Results We developed SimSearch, an optimized pattern-search method and an open source plugin for the Integrated Genome Browser (IGB), to find genomic region sets that are similar to a given region pattern. It provides efficient visual genome-wide analytics computation in large datasets; the plugin supports intuitive user interactions for selecting an interesting pattern on IGB tracks and visualizing the computed occurrences of similar patterns along the entire genome. SimSearch also includes functions for the annotation and enrichment of results, and is enhanced with a Quickload repository including numerous epigenomic feature datasets from ENCODE and Roadmap Epigenomics. The paper also includes some use cases to show multiple genome-wide analyses of biological interest, which can be easily performed by taking advantage of the presented approach. Conclusions The novel SimSearch method provides innovative support for effective genome-wide pattern search and visualization; its relevance and practical usefulness is demonstrated through a number of significant use cases of biological interest. The SimSearch IGB plugin, documentation, and code are freely available at https://deib-geco.github.io/simsearch-app/ and https://github.com/DEIB-GECO/simsearch-app/.
Collapse
Affiliation(s)
- Arnaud Ceol
- Center for Genomic Science of IIT@SEMM, Fondazione Istituto Italiano di Tecnologia (IIT), 20139, Milan, Italy.,IEO, European Institute of Oncology IRCCS, 20141, Milan, Italy
| | | | | | - Stefano Ceri
- Dipartimento di Elettronica, Informazione e Bioingegneria, Politecnico di Milano, 20133, Milan, Italy
| | | | | | - Marco Masseroli
- Dipartimento di Elettronica, Informazione e Bioingegneria, Politecnico di Milano, 20133, Milan, Italy.
| |
Collapse
|
8
|
Völkel G, Laban S, Fürstberger A, Kühlwein SD, Ikonomi N, Hoffmann TK, Brunner C, Neuberg DS, Gaidzik V, Döhner H, Kraus JM, Kestler HA. Analysis, identification and visualization of subgroups in genomics. Brief Bioinform 2020; 22:5909009. [PMID: 32954413 PMCID: PMC8138884 DOI: 10.1093/bib/bbaa217] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2020] [Revised: 08/14/2020] [Accepted: 08/17/2020] [Indexed: 12/22/2022] Open
Abstract
Motivation Cancer is a complex and heterogeneous disease involving multiple somatic mutations that accumulate during its progression. In the past years, the wide availability of genomic data from patients’ samples opened new perspectives in the analysis of gene mutations and alterations. Hence, visualizing and further identifying genes mutated in massive sets of patients are nowadays a critical task that sheds light on more personalized intervention approaches. Results Here, we extensively review existing tools for visualization and analysis of alteration data. We compare different approaches to study mutual exclusivity and sample coverage in large-scale omics data. We complement our review with the standalone software AVAtar (‘analysis and visualization of alteration data’) that integrates diverse aspects known from different tools into a comprehensive platform. AVAtar supplements customizable alteration plots by a multi-objective evolutionary algorithm for subset identification and provides an innovative and user-friendly interface for the evaluation of concurrent solutions. A use case from personalized medicine demonstrates its unique features showing an application on vaccination target selection. Availability AVAtar is available at: https://github.com/sysbio-bioinf/avatar Contact hans.kestler@uni-ulm.de, phone: +49 (0) 731 500 24 500, fax: +49 (0) 731 500 24 502
Collapse
Affiliation(s)
| | | | | | | | | | - Thomas K Hoffmann
- Department of Otorhinolaryngology, Head and Neck Surgery, Ulm University Medical Center, Germany
| | - Cornelia Brunner
- Department of Otorhinolaryngology, Head and Neck Surgery, Ulm University Medical Center, Germany
| | - Donna S Neuberg
- Department of Biostatistics, Dana-Farber Cancer Institute, Boston, Massachusetts, USA
| | - Verena Gaidzik
- Department of Internal Medicine III, Ulm University Medical Center, Germany
| | - Hartmut Döhner
- Department of Internal Medicine III, Ulm University Medical Center, Germany
| | | | | |
Collapse
|
9
|
Abstract
MOTIVATION Cancer is caused by the accumulation of somatic mutations that lead to the formation of distinct populations of cells, called clones. The resulting clonal architecture is the main cause of relapse and resistance to treatment. With decreasing costs in DNA sequencing technology, rich cancer genomics datasets with many spatial sequencing samples are becoming increasingly available, enabling the inference of high-resolution tumor clones and prevalences across different spatial coordinates. While temporal and phylogenetic aspects of tumor evolution, such as clonal evolution over time and clonal response to treatment, are commonly visualized in various clonal evolution diagrams, visual analytics methods that reveal the spatial clonal architecture are missing. RESULTS This article introduces ClonArch, a web-based tool to interactively visualize the phylogenetic tree and spatial distribution of clones in a single tumor mass. ClonArch uses the marching squares algorithm to draw closed boundaries representing the presence of clones in a real or simulated tumor. ClonArch enables researchers to examine the spatial clonal architecture of a subset of relevant mutations at different prevalence thresholds and across multiple phylogenetic trees. In addition to simulated tumors with varying number of biopsies, we demonstrate the use of ClonArch on a hepatocellular carcinoma tumor with ∼280 sequencing biopsies. ClonArch provides an automated way to interactively examine the spatial clonal architecture of a tumor, facilitating clinical and biological interpretations of the spatial aspects of intra-tumor heterogeneity. AVAILABILITY AND IMPLEMENTATION https://github.com/elkebir-group/ClonArch.
Collapse
Affiliation(s)
- Jiaqi Wu
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Mohammed El-Kebir
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| |
Collapse
|
10
|
Identification and Validation of Immune-Related Gene Prognostic Signature for Hepatocellular Carcinoma. J Immunol Res 2020; 2020:5494858. [PMID: 32211443 PMCID: PMC7081044 DOI: 10.1155/2020/5494858] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2019] [Revised: 01/22/2020] [Accepted: 02/05/2020] [Indexed: 02/06/2023] Open
Abstract
Immune-related genes (IRGs) have been identified as critical drivers of the initiation and progression of hepatocellular carcinoma (HCC). This study is aimed at constructing an IRG signature for HCC and validating its prognostic value in clinical application. The prognostic signature was developed by integrating multiple IRG expression data sets from TCGA and GEO databases. The IRGs were then combined with clinical features to validate the robustness of the prognostic signature through bioinformatics tools. A total of 1039 IRGs were identified in the 657 HCC samples. Subsequently, the IRGs were subjected to univariate Cox regression and LASSO Cox regression analyses in the training set to construct an IRG signature comprising nine immune-related gene pairs (IRGPs). Functional analyses revealed that the nine IRGPs were associated with tumor immune mechanisms, including cell proliferation, cell-mediated immunity, and tumorigenesis signal pathway. Concerning the overall survival rate, the IRGPs distinctly grouped the HCC samples into the high- and low-risk groups. Also, we found that the risk score based on nine IRGPs was related to clinical and pathologic factors and remained a valid independent prognostic signature after adjusting for tumor TNM, grade, and grade in multivariate Cox regression analyses. The prognostic value of the nine IRGPs was further validated by forest and nomogram plots, which revealed that it was superior to the tumor TNM, grade, and stage. Our findings suggest that the nine-IRGP signature can be effective in determining the disease outcomes of HCC patients.
Collapse
|
11
|
Pearce TM, Nikiforova MN, Roy S. Interactive Browser-Based Genomics Data Visualization Tools for Translational and Clinical Laboratory Applications. J Mol Diagn 2019; 21:985-993. [PMID: 31382034 DOI: 10.1016/j.jmoldx.2019.06.005] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2018] [Revised: 04/21/2019] [Accepted: 06/12/2019] [Indexed: 11/28/2022] Open
Abstract
Visualization-driven data exploration is a highly effective modality for interpreting and discovering insights from high-throughput genomics data sets; however, it is vastly underutilized in routine workflows in clinical and translation settings. We have developed three open-source, browser-based, interactive genomics data visualization widgets that can be used as intuitive stand-alone applications or integrated with existing web-based laboratory information solutions. The widgets were developed in JavaScript using the D3.js library. These widgets run in any modern web browser across desktop and mobile devices for easy accessibility but are designed for client-side data processing to address data privacy concerns. jsProteinMapper plots the location of a variant of interest relative to the protein domains and multiple variant databases, assisting with clinical interpretation of sequence variants. jsComut generates a highly interactive and customizable comutation plot for visual exploration of genomic data sets with clinicopathologic annotations to reveal unique molecular profiles and clinical correlates. jsCodonWheel is an interactive version of the ubiquitous circular codon-to-amino acid translation table, which lets users quickly map nucleotide changes onto resulting amino acid differences. These open-source visualization tools may improve some of the key laboratory workflows that involve the review of large-scale genomics data sets in a high-volume setting. The intuitive and responsive user interface, highly customizable visualizations, and easy integration with existing web-based laboratory software are significant highlights of these tools.
Collapse
Affiliation(s)
- Thomas M Pearce
- Department of Pathology, University of Pittsburgh Medical Center, Pittsburgh, Pennsylvania
| | - Marina N Nikiforova
- Department of Pathology, University of Pittsburgh Medical Center, Pittsburgh, Pennsylvania
| | - Somak Roy
- Department of Pathology, University of Pittsburgh Medical Center, Pittsburgh, Pennsylvania.
| |
Collapse
|
12
|
Lysophosphatidic Acid Receptor 6 (LPAR6) Expression and Prospective Signaling Pathway Analysis in Breast Cancer. Mol Diagn Ther 2019; 23:127-138. [PMID: 30694446 DOI: 10.1007/s40291-019-00384-3] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
BACKGROUND AND OBJECTIVE Lysophosphatidic acid (LPA) has widely been reported to participate in the numerous biological behaviors of tumors through its receptors. LPA receptor 6 (LPAR6) is a newly identified G protein-coupled receptor of LPA, and few studies have explored the role of LPAR6 in cancer. In breast cancer (BC), LPAR6 has not, as yet, been studied. This study aimed to evaluate LPAR6 expression in BC patients and to explore its possible role in BC. METHODS A total of 98 pairs of clinical BC and para-cancer tissues were collected, and LPAR6 expression was evaluated by quantitative real-time polymerase chain reaction (qRT-PCR). Kaplan-Meier plots were employed for survival analysis. Human BC cell lines were cultured to study decitabine (5-aza-2'-deoxycytidine [5-Aza]) intervention. Bioinformatic analyses were carried out to support the study conclusions and predictions. RESULTS LPAR6 expression was significantly reduced in BC tissues (p < 0.001). In the analysis of clinical parameters, LPAR6 expression was related to BC molecular classification (p < 0.05). Furthermore, patients with higher LPAR6 expression had better prognoses (p < 0.001). The CpG islands of LPAR6 were hypermethylated in BC tissues relative to those in para-cancer tissues (p < 0.01). 5-Aza significantly upregulated LPAR6 expression in BC cell lines. Additionally, LPAR6 knockdown significantly promoted cell migration and proliferation in the ZR-75-1 cell line (p < 0.001). Finally, through Gene Set Enrichment Analysis (GSEA), LPAR6 was found to be negatively correlated with cancer-promoting factors and positively correlated with tumor-suppressing factors. CONCLUSION LPAR6 was downregulated in BC, and low LPAR6 expression was related to poor prognosis. The anti-tumor drug 5-Aza significantly upregulated LPAR6 expression in vitro, and LPAR6 might act as a tumor suppressor in BC.
Collapse
|
13
|
Nusrat S, Harbig T, Gehlenborg N. Tasks, Techniques, and Tools for Genomic Data Visualization. COMPUTER GRAPHICS FORUM : JOURNAL OF THE EUROPEAN ASSOCIATION FOR COMPUTER GRAPHICS 2019; 38:781-805. [PMID: 31768085 PMCID: PMC6876635 DOI: 10.1111/cgf.13727] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
Genomic data visualization is essential for interpretation and hypothesis generation as well as a valuable aid in communicating discoveries. Visual tools bridge the gap between algorithmic approaches and the cognitive skills of investigators. Addressing this need has become crucial in genomics, as biomedical research is increasingly data-driven and many studies lack well-defined hypotheses. A key challenge in data-driven research is to discover unexpected patterns and to formulate hypotheses in an unbiased manner in vast amounts of genomic and other associated data. Over the past two decades, this has driven the development of numerous data visualization techniques and tools for visualizing genomic data. Based on a comprehensive literature survey, we propose taxonomies for data, visualization, and tasks involved in genomic data visualization. Furthermore, we provide a comprehensive review of published genomic visualization tools in the context of the proposed taxonomies.
Collapse
Affiliation(s)
- S Nusrat
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - T Harbig
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - N Gehlenborg
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| |
Collapse
|
14
|
Abstract
Since the discovery that DNA alterations initiate tumorigenesis, scientists and clinicians have been exploring ways to counter these changes with targeted therapeutics. The sequencing of tumor DNA was initially limited to highly actionable hot spots-areas of the genome that are frequently altered and have an approved matched therapy in a specific tumor type. Large-scale genome sequencing programs quickly developed technological improvements that enabled the deployment of whole-exome and whole-genome sequencing technologies at scale for pristine sample materials in research environments. However, the turning point for precision medicine in oncology was the innovations in clinical laboratories that improved turnaround time, depth of coverage, and the ability to reliably sequence archived, clinically available samples. Today, tumor genome sequencing no longer suffers from significant technical or financial hurdles, and the next opportunity for improvement lies in the optimal utilization of the technologies and data for many different tumor types.
Collapse
Affiliation(s)
- Kenna R Mills Shaw
- Khalifa Bin Zayed Institute for Personalized Cancer Therapy and Sheikh Ahmed Center for Pancreatic Cancer Research, University of Texas MD Anderson Cancer Center, Houston, Texas 77030, USA;
| | - Anirban Maitra
- Khalifa Bin Zayed Institute for Personalized Cancer Therapy and Sheikh Ahmed Center for Pancreatic Cancer Research, University of Texas MD Anderson Cancer Center, Houston, Texas 77030, USA;
| |
Collapse
|
15
|
Qu Z, Lau CW, Nguyen QV, Zhou Y, Catchpoole DR. Visual Analytics of Genomic and Cancer Data: A Systematic Review. Cancer Inform 2019; 18:1176935119835546. [PMID: 30890859 PMCID: PMC6416684 DOI: 10.1177/1176935119835546] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2019] [Accepted: 01/29/2019] [Indexed: 12/12/2022] Open
Abstract
Visual analytics and visualisation can leverage the human perceptual system to
interpret and uncover hidden patterns in big data. The advent of next-generation
sequencing technologies has allowed the rapid production of massive amounts of
genomic data and created a corresponding need for new tools and methods for
visualising and interpreting these data. Visualising genomic data requires not
only simply plotting of data but should also offer a decision or a choice about
what the message should be conveyed in the particular plot; which methodologies
should be used to represent the results must provide an easy, clear, and
accurate way to the clinicians, experts, or researchers to interact with the
data. Genomic data visual analytics is rapidly evolving in parallel with
advances in high-throughput technologies such as artificial intelligence (AI)
and virtual reality (VR). Personalised medicine requires new genomic
visualisation tools, which can efficiently extract knowledge from the genomic
data and speed up expert decisions about the best treatment of individual
patient’s needs. However, meaningful visual analytics of such large genomic data
remains a serious challenge. This article provides a comprehensive systematic
review and discussion on the tools, methods, and trends for visual analytics of
cancer-related genomic data. We reviewed methods for genomic data visualisation
including traditional approaches such as scatter plots, heatmaps, coordinates,
and networks, as well as emerging technologies using AI and VR. We also
demonstrate the development of genomic data visualisation tools over time and
analyse the evolution of visualising genomic data.
Collapse
Affiliation(s)
- Zhonglin Qu
- School of Computing, Engineering and Mathematics, Western Sydney University, Penrith, NSW, Australia
| | - Chng Wei Lau
- School of Computing, Engineering and Mathematics, Western Sydney University, Penrith, NSW, Australia
| | - Quang Vinh Nguyen
- School of Computing, Engineering and Mathematics, Western Sydney University, Penrith, NSW, Australia.,The MARCS Institute, Western Sydney University, Penrith, NSW, Australia
| | - Yi Zhou
- School of Computing, Engineering and Mathematics, Western Sydney University, Penrith, NSW, Australia
| | - Daniel R Catchpoole
- The Tumour Bank, Children's Cancer Research Unit, Kids Research, The Children's Hospital at Westmead, Westmead, NSW, Australia.,Discipline of Paediatrics and Child Health, Faculty of Medicine, The University of Sydney, Sydney, NSW, Australia.,Faculty of Information Technology, The University of Technology Sydney, Ultimo, NSW, Australia
| |
Collapse
|
16
|
Zhang Q, Peters T, Fenster A. Layer-based visualization and biomedical information exploration of multi-channel large histological data. Comput Med Imaging Graph 2019; 72:34-46. [PMID: 30772074 DOI: 10.1016/j.compmedimag.2019.01.004] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2018] [Revised: 11/21/2018] [Accepted: 01/16/2019] [Indexed: 10/27/2022]
Abstract
BACKGROUND AND OBJECTIVE Modern microscopes can acquire multi-channel large histological data from tissues of human beings or animals, which contain rich biomedical information for disease diagnosis and biological feature analysis. However, due to the large size, fuzzy tissue structure, and complicated multiple elements integrated in the image color space, it is still a challenge for current software systems to effectively calculate histological data, show the inner tissue structures and unveil hidden biomedical information. Therefore, we developed new algorithms and a software platform to address this issue. METHODS This paper presents a multi-channel biomedical data computing and visualization system that can efficiently process large 3D histological images acquired from high-resolution microscopes. A novelty of our system is that it can dynamically display a volume of interest and extract tissue information using a layer-based data navigation scheme. During the data exploring process, the actual resolution of the loaded data can be dynamically determined and updated, and data rendering is synchronized in four display windows at each data layer, where 2D textures are extracted from the imaging volume and mapped onto the displayed clipping planes in 3D space. RESULTS To test the efficiency and scalability of this system, we performed extensive evaluations using several different hardware systems and large histological color datasets acquired from a CryoViz 3D digital system. The experimental results demonstrated that our system can deliver interactive data navigation speed and display detailed imaging information in real time, which is beyond the capability of commonly available biomedical data exploration software platforms. CONCLUSION Taking advantage of both CPU (central processing unit) main memory and GPU (graphics processing unit) graphics memory, the presented software platform can efficiently compute, process and visualize very large biomedical data and enhance data information. The performance of this system can satisfactorily address the challenges of navigating and interrogating volumetric multi-spectral large histological image at multiple resolution levels.
Collapse
Affiliation(s)
- Qi Zhang
- School of Information Technology, Illinois State University, 100 North University Street, Normal, IL 61761, United States; Department of Medical Biophysics, Western University, London, Ontario, Canada N6A 5C1.
| | - Terry Peters
- Robarts Research Institute, Western University, 1151 Richmond St. N., London, Ontario, Canada N6A 5B7; Department of Medical Biophysics, Western University, London, Ontario, Canada N6A 5C1.
| | - Aaron Fenster
- Robarts Research Institute, Western University, 1151 Richmond St. N., London, Ontario, Canada N6A 5B7; Department of Medical Biophysics, Western University, London, Ontario, Canada N6A 5C1.
| |
Collapse
|
17
|
|
18
|
O'Donoghue SI, Baldi BF, Clark SJ, Darling AE, Hogan JM, Kaur S, Maier-Hein L, McCarthy DJ, Moore WJ, Stenau E, Swedlow JR, Vuong J, Procter JB. Visualization of Biomedical Data. Annu Rev Biomed Data Sci 2018. [DOI: 10.1146/annurev-biodatasci-080917-013424] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The rapid increase in volume and complexity of biomedical data requires changes in research, communication, and clinical practices. This includes learning how to effectively integrate automated analysis with high–data density visualizations that clearly express complex phenomena. In this review, we summarize key principles and resources from data visualization research that help address this difficult challenge. We then survey how visualization is being used in a selection of emerging biomedical research areas, including three-dimensional genomics, single-cell RNA sequencing (RNA-seq), the protein structure universe, phosphoproteomics, augmented reality–assisted surgery, and metagenomics. While specific research areas need highly tailored visualizations, there are common challenges that can be addressed with general methods and strategies. Also common, however, are poor visualization practices. We outline ongoing initiatives aimed at improving visualization practices in biomedical research via better tools, peer-to-peer learning, and interdisciplinary collaboration with computer scientists, science communicators, and graphic designers. These changes are revolutionizing how we see and think about our data.
Collapse
Affiliation(s)
- Seán I. O'Donoghue
- Data61, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Eveleigh NSW 2015, Australia
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, Sydney NSW 2010, Australia
- School of Biotechnology and Biomolecular Sciences, University of New South Wales (UNSW), Kensington NSW 2033, Australia
| | - Benedetta Frida Baldi
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, Sydney NSW 2010, Australia
| | - Susan J. Clark
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, Sydney NSW 2010, Australia
| | - Aaron E. Darling
- The ithree Institute, University of Technology Sydney, Ultimo NSW 2007, Australia
| | - James M. Hogan
- School of Electrical Engineering and Computer Science, Queensland University of Technology, Brisbane QLD, 4000, Australia
| | - Sandeep Kaur
- School of Computer Science and Engineering, University of New South Wales (UNSW), Kensington NSW 2033, Australia
| | - Lena Maier-Hein
- Division of Computer Assisted Medical Interventions (CAMI), German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany
| | - Davis J. McCarthy
- European Bioinformatics Institute (EBI), European Molecular Biology Laboratory (EMBL), Wellcome Genome Campus, Hinxton CB10 1SD, United Kingdom
- St. Vincent's Institute of Medical Research, Fitzroy VIC 3065, Australia
| | - William J. Moore
- School of Life Sciences, University of Dundee, Dundee DD1 5EH, United Kingdom
| | - Esther Stenau
- Division of Computer Assisted Medical Interventions (CAMI), German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany
| | - Jason R. Swedlow
- School of Life Sciences, University of Dundee, Dundee DD1 5EH, United Kingdom
| | - Jenny Vuong
- Data61, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Eveleigh NSW 2015, Australia
| | - James B. Procter
- School of Life Sciences, University of Dundee, Dundee DD1 5EH, United Kingdom
| |
Collapse
|
19
|
Kotelnikova EA, Pyatnitskiy M, Paleeva A, Kremenetskaya O, Vinogradov D. Practical aspects of NGS-based pathways analysis for personalized cancer science and medicine. Oncotarget 2018; 7:52493-52516. [PMID: 27191992 PMCID: PMC5239569 DOI: 10.18632/oncotarget.9370] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2015] [Accepted: 04/18/2016] [Indexed: 12/17/2022] Open
Abstract
Nowadays, the personalized approach to health care and cancer care in particular is becoming more and more popular and is taking an important place in the translational medicine paradigm. In some cases, detection of the patient-specific individual mutations that point to a targeted therapy has already become a routine practice for clinical oncologists. Wider panels of genetic markers are also on the market which cover a greater number of possible oncogenes including those with lower reliability of resulting medical conclusions. In light of the large availability of high-throughput technologies, it is very tempting to use complete patient-specific New Generation Sequencing (NGS) or other "omics" data for cancer treatment guidance. However, there are still no gold standard methods and protocols to evaluate them. Here we will discuss the clinical utility of each of the data types and describe a systems biology approach adapted for single patient measurements. We will try to summarize the current state of the field focusing on the clinically relevant case-studies and practical aspects of data processing.
Collapse
Affiliation(s)
- Ekaterina A Kotelnikova
- Personal Biomedicine, Moscow, Russia.,A. A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia.,Institute Biomedical Research August Pi Sunyer (IDIBAPS), Hospital Clinic of Barcelona, Barcelona, Spain
| | - Mikhail Pyatnitskiy
- Personal Biomedicine, Moscow, Russia.,Orekhovich Institute of Biomedical Chemistry, Moscow, Russia.,Pirogov Russian National Research Medical University, Moscow, Russia
| | | | - Olga Kremenetskaya
- Personal Biomedicine, Moscow, Russia.,Center for Theoretical Problems of Physicochemical Pharmacology, Russian Academy of Sciences, Moscow, Russia
| | - Dmitriy Vinogradov
- Personal Biomedicine, Moscow, Russia.,A. A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia.,Lomonosov Moscow State University, Moscow, Russia
| |
Collapse
|
20
|
Abstract
NUMB, and its close homologue NUMBL, behave as tumor suppressor genes by regulating the Notch pathway. The downregulation of these genes in tumors is common, allowing aberrant Notch pathway activation and tumor progression. However, some known differences between NUMB and NUMBL have raised unanswered questions regarding the redundancy and/or combined regulation of the Notch pathway by these genes during the tumorigenic process. We have found that NUMB and NUMBL exhibit mutual exclusivity in human tumors, suggesting that the associated tumor suppressor role is regulated by only one of the two proteins in a specific cell, avoiding duplicate signaling and simplifying the regulatory network. We have also found differences in gene expression due to NUMB or NUMBL downregulation. These differences in gene regulation extend to pathways, such as WNT or Hedgehog. In addition to these differences, the downregulation of either gene triggers a cancer stem cell-like related phenotype. These results show the importance of both genes as an intersection with different effects over cancer stem cell signaling pathways.
Collapse
|
21
|
Mallona I, Sierco A, Peinado MA. The Pancancer DNA Methylation Trackhub: A Window to The Cancer Genome Atlas Epigenomics Data. Methods Mol Biol 2018; 1766:123-135. [PMID: 29605850 DOI: 10.1007/978-1-4939-7768-0_7] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
The Cancer Genome Atlas (TCGA) epigenome data includes the DNA methylation status of tumor and normal tissues of large cohorts for dozens of cancer types. Due to the moderately large data sizes, retrieving and analyzing them requires basic programming skills. Simple data browsing (e.g., candidate gene search) is hampered by the scarcity of easy-to-use data browsers addressed to the broad community of biomedical researchers. We propose a new visualization method depicting the overall DNA methylation status at each TCGA cohort while emphasizing its heterogeneity, thus facilitating the evaluation of the cohort variability and the normal versus tumor differences. Implemented as a trackhub integrated to the University of California Santa Cruz (UCSC) genome browser, it can be easily added to any genome-wide annotation layer.To exemplify the trackhub usage we evaluate local DNA methylation boundaries, the aberrant DNA methylation of a CpG island located at the estrogen receptor 1 (ESR1) in breast and colon cancer, and the hypermethylation of the Homeobox HOXA gene cluster and the EN1 gene in multiple cancer types. The DNA methylation pancancer trackhub is freely available at http://maplab.cat/tcga_450k_trackhub .
Collapse
Affiliation(s)
- Izaskun Mallona
- Predictive and Personalized Medicine of Cancer Program, Health Research Institute Germans Trias i Pujol (IGTP), Can Ruti Campus, Badalona, Spain.
| | - Alberto Sierco
- Predictive and Personalized Medicine of Cancer Program, Health Research Institute Germans Trias i Pujol (IGTP), Can Ruti Campus, Badalona, Spain
| | | |
Collapse
|
22
|
Dunn W, Burgun A, Krebs MO, Rance B. Exploring and visualizing multidimensional data in translational research platforms. Brief Bioinform 2017; 18:1044-1056. [PMID: 27585944 PMCID: PMC5862238 DOI: 10.1093/bib/bbw080] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2016] [Revised: 07/30/2016] [Accepted: 08/03/2016] [Indexed: 01/20/2023] Open
Abstract
The unprecedented advances in technology and scientific research over the past few years have provided the scientific community with new and more complex forms of data. Large data sets collected from single groups or cross-institution consortiums containing hundreds of omic and clinical variables corresponding to thousands of patients are becoming increasingly commonplace in the research setting. Before any core analyses are performed, visualization often plays a key role in the initial phases of research, especially for projects where no initial hypotheses are dominant. Proper visualization of data at a high level facilitates researcher's abilities to find trends, identify outliers and perform quality checks. In addition, research has uncovered the important role of visualization in data analysis and its implied benefits facilitating our understanding of disease and ultimately improving patient care. In this work, we present a review of the current landscape of existing tools designed to facilitate the visualization of multidimensional data in translational research platforms. Specifically, we reviewed the biomedical literature for translational platforms allowing the visualization and exploration of clinical and omics data, and identified 11 platforms: cBioPortal, interactive genomics patient stratification explorer, Igloo-Plot, The Georgetown Database of Cancer Plus, tranSMART, an unnamed data-cube-based model supporting heterogeneous data, Papilio, Caleydo Domino, Qlucore Omics, Oracle Health Sciences Translational Research Center and OmicsOffice® powered by TIBCO Spotfire. In a health sector continuously witnessing an increase in data from multifarious sources, visualization tools used to better grasp these data will grow in their importance, and we believe our work will be useful in guiding investigators in similar situations.
Collapse
Affiliation(s)
- William Dunn
- Inserm University Paris Descartes UMR_S894 Centre de Psychiatrie et Neurosciences Laboratoire de Physiopathologie des maladies Psychiatriques, Paris, France
| | - Anita Burgun
- University Hospital Georges Pompidou (HEGP); AP-HP, Paris, France; INSERM; UMRS1138, Paris Descartes University, Paris, France
| | - Marie-Odile Krebs
- Inserm University Paris Descartes UMR_S894 Centre de Psychiatrie et Neurosciences Laboratoire de Physiopathologie des maladies Psychiatriques, Paris, France
- Université Paris Descartes, Faculté de Médecine Paris Descartes, Service Hospitalo Universitaire, Centre Hospitalier Sainte-Anne, CNRS GDR 3557 – Institut de Psychiatrie, Paris, France
| | - Bastien Rance
- University Hospital Georges Pompidou (HEGP); AP-HP, Paris, France; INSERM; UMRS1138, Paris Descartes University, Paris, France
| |
Collapse
|
23
|
VGLL4 Selectively Represses YAP-Dependent Gene Induction and Tumorigenic Phenotypes in Breast Cancer. Sci Rep 2017; 7:6190. [PMID: 28733631 PMCID: PMC5522454 DOI: 10.1038/s41598-017-06227-7] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2017] [Accepted: 06/09/2017] [Indexed: 01/06/2023] Open
Abstract
Members of the mammalian Vestigial-like (VGLL) family of transcriptional cofactors activate genes in response to a wide variety of environmental cues. Recently, VGLL proteins have been proposed to regulate key signaling networks involved in cancer development and progression. However, the biological and clinical significance of VGLL dysregulation in human breast cancer pathogenesis remains unknown. Here, we report that diminished VGLL4 expression, but not VGLL1-3, correlated with both shorter relapse-free survival and shorter disease-specific survival of cancer patients with different molecular subtypes of breast cancer. Additionally, we further demonstrate that overexpression of VGLL4 reduces breast cancer cell proliferation, migration, intravasation/extravasation potential, favors cell death, and suppresses tumor growth in vivo. Mechanistically, VGLL4 negatively regulates the TEAD1-YAP1 transcriptional complex and exerts its growth inhibitory control through its evolutionary conserved TDU2 domain at its C-terminus. The results suggest that VGLL4 is a candidate tumor suppressor gene which acts by selectively antagonizing YAP-dependent tumor growth. VGLL4 may be a promising therapeutic target in breast cancer.
Collapse
|
24
|
Glueck M, Gvozdik A, Chevalier F, Khan A, Brudno M, Wigdor D. PhenoStacks: Cross-Sectional Cohort Phenotype Comparison Visualizations. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS 2017; 23:191-200. [PMID: 27514055 DOI: 10.1109/tvcg.2016.2598469] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Cross-sectional phenotype studies are used by genetics researchers to better understand how phenotypes vary across patients with genetic diseases, both within and between cohorts. Analyses within cohorts identify patterns between phenotypes and patients (e.g., co-occurrence) and isolate special cases (e.g., potential outliers). Comparing the variation of phenotypes between two cohorts can help distinguish how different factors affect disease manifestation (e.g., causal genes, age of onset, etc.). PhenoStacks is a novel visual analytics tool that supports the exploration of phenotype variation within and between cross-sectional patient cohorts. By leveraging the semantic hierarchy of the Human Phenotype Ontology, phenotypes are presented in context, can be grouped and clustered, and are summarized via overviews to support the exploration of phenotype distributions. The design of PhenoStacks was motivated by formative interviews with genetics researchers: we distil high-level tasks, present an algorithm for simplifying ontology topologies for visualization, and report the results of a deployment evaluation with four expert genetics researchers. The results suggest that PhenoStacks can help identify phenotype patterns, investigate data quality issues, and inform data collection design.
Collapse
|
25
|
Dasgupta A, Lee JY, Wilson R, Lafrance RA, Cramer N, Cook K, Payne S. Familiarity Vs Trust: A Comparative Study of Domain Scientists' Trust in Visual Analytics and Conventional Analysis Methods. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS 2017; 23:271-280. [PMID: 27608465 DOI: 10.1109/tvcg.2016.2598544] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Combining interactive visualization with automated analytical methods like statistics and data mining facilitates data-driven discovery. These visual analytic methods are beginning to be instantiated within mixed-initiative systems, where humans and machines collaboratively influence evidence-gathering and decision-making. But an open research question is that, when domain experts analyze their data, can they completely trust the outputs and operations on the machine-side? Visualization potentially leads to a transparent analysis process, but do domain experts always trust what they see? To address these questions, we present results from the design and evaluation of a mixed-initiative, visual analytics system for biologists, focusing on analyzing the relationships between familiarity of an analysis medium and domain experts' trust. We propose a trust-augmented design of the visual analytics system, that explicitly takes into account domain-specific tasks, conventions, and preferences. For evaluating the system, we present the results of a controlled user study with 34 biologists where we compare the variation of the level of trust across conventional and visual analytic mediums and explore the influence of familiarity and task complexity on trust. We find that despite being unfamiliar with a visual analytic medium, scientists seem to have an average level of trust that is comparable with the same in conventional analysis medium. In fact, for complex sense-making tasks, we find that the visual analytic system is able to inspire greater trust than other mediums. We summarize the implications of our findings with directions for future research on trustworthiness of visual analytic systems.
Collapse
|
26
|
Bowman RL, Wang Q, Carro A, Verhaak RGW, Squatrito M. GlioVis data portal for visualization and analysis of brain tumor expression datasets. Neuro Oncol 2017; 19:139-141. [PMID: 28031383 PMCID: PMC5193031 DOI: 10.1093/neuonc/now247] [Citation(s) in RCA: 514] [Impact Index Per Article: 73.4] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Affiliation(s)
- Robert L Bowman
- Cancer Biology and Genetics Program, Memorial Sloan-Kettering Cancer Center, New York, New York, USA (R.L.B.); Department of Genomic Medicine, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA (Q.W., R.G.V.); Department of Radiation Oncology, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA (Q.W.); Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA (Q.W., R.G.V.); Bioinformatics unit, Structural Biology and Biocomputing Programme, Centro Nacional de Investigaciones Oncológicas, CNIO, Madrid, Spain (A.C.); Department of Computational Biology, The Jackson Laboratory, Farmington, Connecticut, USA (R.G.V.); Seve Ballesteros Foundation Brain Tumor Group, Cancer Cell Biology Programme, Centro Nacional de Investigaciones Oncológicas, CNIO, Madrid, Spain (M.S.)
| | - Qianghu Wang
- Cancer Biology and Genetics Program, Memorial Sloan-Kettering Cancer Center, New York, New York, USA (R.L.B.); Department of Genomic Medicine, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA (Q.W., R.G.V.); Department of Radiation Oncology, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA (Q.W.); Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA (Q.W., R.G.V.); Bioinformatics unit, Structural Biology and Biocomputing Programme, Centro Nacional de Investigaciones Oncológicas, CNIO, Madrid, Spain (A.C.); Department of Computational Biology, The Jackson Laboratory, Farmington, Connecticut, USA (R.G.V.); Seve Ballesteros Foundation Brain Tumor Group, Cancer Cell Biology Programme, Centro Nacional de Investigaciones Oncológicas, CNIO, Madrid, Spain (M.S.)
| | - Angel Carro
- Cancer Biology and Genetics Program, Memorial Sloan-Kettering Cancer Center, New York, New York, USA (R.L.B.); Department of Genomic Medicine, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA (Q.W., R.G.V.); Department of Radiation Oncology, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA (Q.W.); Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA (Q.W., R.G.V.); Bioinformatics unit, Structural Biology and Biocomputing Programme, Centro Nacional de Investigaciones Oncológicas, CNIO, Madrid, Spain (A.C.); Department of Computational Biology, The Jackson Laboratory, Farmington, Connecticut, USA (R.G.V.); Seve Ballesteros Foundation Brain Tumor Group, Cancer Cell Biology Programme, Centro Nacional de Investigaciones Oncológicas, CNIO, Madrid, Spain (M.S.)
| | - Roel G W Verhaak
- Cancer Biology and Genetics Program, Memorial Sloan-Kettering Cancer Center, New York, New York, USA (R.L.B.); Department of Genomic Medicine, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA (Q.W., R.G.V.); Department of Radiation Oncology, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA (Q.W.); Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA (Q.W., R.G.V.); Bioinformatics unit, Structural Biology and Biocomputing Programme, Centro Nacional de Investigaciones Oncológicas, CNIO, Madrid, Spain (A.C.); Department of Computational Biology, The Jackson Laboratory, Farmington, Connecticut, USA (R.G.V.); Seve Ballesteros Foundation Brain Tumor Group, Cancer Cell Biology Programme, Centro Nacional de Investigaciones Oncológicas, CNIO, Madrid, Spain (M.S.)
| | - Massimo Squatrito
- Cancer Biology and Genetics Program, Memorial Sloan-Kettering Cancer Center, New York, New York, USA (R.L.B.); Department of Genomic Medicine, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA (Q.W., R.G.V.); Department of Radiation Oncology, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA (Q.W.); Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas, USA (Q.W., R.G.V.); Bioinformatics unit, Structural Biology and Biocomputing Programme, Centro Nacional de Investigaciones Oncológicas, CNIO, Madrid, Spain (A.C.); Department of Computational Biology, The Jackson Laboratory, Farmington, Connecticut, USA (R.G.V.); Seve Ballesteros Foundation Brain Tumor Group, Cancer Cell Biology Programme, Centro Nacional de Investigaciones Oncológicas, CNIO, Madrid, Spain (M.S.)
| |
Collapse
|
27
|
Belizário JE, Sangiuliano BA, Perez-Sosa M, Neyra JM, Moreira DF. Using Pharmacogenomic Databases for Discovering Patient-Target Genes and Small Molecule Candidates to Cancer Therapy. Front Pharmacol 2016; 7:312. [PMID: 27746730 PMCID: PMC5040751 DOI: 10.3389/fphar.2016.00312] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2016] [Accepted: 08/31/2016] [Indexed: 01/10/2023] Open
Abstract
With multiple omics strategies being applied to several cancer genomics projects, researchers have the opportunity to develop a rational planning of targeted cancer therapy. The investigation of such numerous and diverse pharmacogenomic datasets is a complex task. It requires biological knowledge and skills on a set of tools to accurately predict signaling network and clinical outcomes. Herein, we describe Web-based in silico approaches user friendly for exploring integrative studies on cancer biology and pharmacogenomics. We briefly explain how to submit a query to cancer genome databases to predict which genes are significantly altered across several types of cancers using CBioPortal. Moreover, we describe how to identify clinically available drugs and potential small molecules for gene targeting using CellMiner. We also show how to generate a gene signature and compare gene expression profiles to investigate the complex biology behind drug response using Connectivity Map. Furthermore, we discuss on-going challenges, limitations and new directions to integrate molecular, biological and epidemiological information from oncogenomics platforms to create hypothesis-driven projects. Finally, we discuss the use of Patient-Derived Xenografts models (PDXs) for drug profiling in vivo assay. These platforms and approaches are a rational way to predict patient-targeted therapy response and to develop clinically relevant small molecules drugs.
Collapse
Affiliation(s)
- José E Belizário
- Department of Pharmacology, Institute of Biomedical Sciences, University of São Paulo São Paulo, Brazil
| | - Beatriz A Sangiuliano
- Department of Pharmacology, Institute of Biomedical Sciences, University of São Paulo São Paulo, Brazil
| | - Marcela Perez-Sosa
- Department of Pharmacology, Institute of Biomedical Sciences, University of São Paulo São Paulo, Brazil
| | - Jennifer M Neyra
- Department of Pharmacology, Institute of Biomedical Sciences, University of São Paulo São Paulo, Brazil
| | - Dayson F Moreira
- Department of Pharmacology, Institute of Biomedical Sciences, University of São Paulo São Paulo, Brazil
| |
Collapse
|
28
|
Chia PL, Gedye C, Boutros PC, Wheatley-Price P, John T. Current and Evolving Methods to Visualize Biological Data in Cancer Research. J Natl Cancer Inst 2016; 108:djw031. [PMID: 27245079 PMCID: PMC5017943 DOI: 10.1093/jnci/djw031] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2015] [Revised: 12/05/2015] [Accepted: 02/08/2016] [Indexed: 12/13/2022] Open
Abstract
Although the measurements of clinical outcomes for cancer treatments have become diverse and complex, there remains a need for clear, easily interpreted representations of patients' experiences. With oncology trials increasingly reporting non-time-to-event outcomes, data visualization has evolved to incorporate parameters such as responses to therapy, duration and degree of response, and novel representations of underlying tumor biology. We review both commonly used and newly developed methods to display outcomes in oncology, with a focus on those that have evolved to represent complex datasets.
Collapse
Affiliation(s)
- Puey Ling Chia
- Department of Medical Oncology and Olivia-Newton John Cancer Research Institute, Austin Health, Melbourne, Australia (PLC, TJ); School of Biomedical Sciences and Pharmacy, Hunter Medical Research Institute, University of Newcastle, New Lambton Heights, Australia (CG); Informatics & Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Canada (PCB); Department of Medical Biophysics and Department of Pharmacology & Toxicology, University of Toronto, Toronto, Canada (PCB); Ottawa Hospital Research Institute and Department of Medicine, University of Ottawa, Ottawa, Ontario, Canada (PWP)
| | - Craig Gedye
- Department of Medical Oncology and Olivia-Newton John Cancer Research Institute, Austin Health, Melbourne, Australia (PLC, TJ); School of Biomedical Sciences and Pharmacy, Hunter Medical Research Institute, University of Newcastle, New Lambton Heights, Australia (CG); Informatics & Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Canada (PCB); Department of Medical Biophysics and Department of Pharmacology & Toxicology, University of Toronto, Toronto, Canada (PCB); Ottawa Hospital Research Institute and Department of Medicine, University of Ottawa, Ottawa, Ontario, Canada (PWP)
| | - Paul C Boutros
- Department of Medical Oncology and Olivia-Newton John Cancer Research Institute, Austin Health, Melbourne, Australia (PLC, TJ); School of Biomedical Sciences and Pharmacy, Hunter Medical Research Institute, University of Newcastle, New Lambton Heights, Australia (CG); Informatics & Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Canada (PCB); Department of Medical Biophysics and Department of Pharmacology & Toxicology, University of Toronto, Toronto, Canada (PCB); Ottawa Hospital Research Institute and Department of Medicine, University of Ottawa, Ottawa, Ontario, Canada (PWP)
| | - Paul Wheatley-Price
- Department of Medical Oncology and Olivia-Newton John Cancer Research Institute, Austin Health, Melbourne, Australia (PLC, TJ); School of Biomedical Sciences and Pharmacy, Hunter Medical Research Institute, University of Newcastle, New Lambton Heights, Australia (CG); Informatics & Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Canada (PCB); Department of Medical Biophysics and Department of Pharmacology & Toxicology, University of Toronto, Toronto, Canada (PCB); Ottawa Hospital Research Institute and Department of Medicine, University of Ottawa, Ottawa, Ontario, Canada (PWP)
| | - Thomas John
- Department of Medical Oncology and Olivia-Newton John Cancer Research Institute, Austin Health, Melbourne, Australia (PLC, TJ); School of Biomedical Sciences and Pharmacy, Hunter Medical Research Institute, University of Newcastle, New Lambton Heights, Australia (CG); Informatics & Biocomputing Program, Ontario Institute for Cancer Research, Toronto, Canada (PCB); Department of Medical Biophysics and Department of Pharmacology & Toxicology, University of Toronto, Toronto, Canada (PCB); Ottawa Hospital Research Institute and Department of Medicine, University of Ottawa, Ottawa, Ontario, Canada (PWP)
| |
Collapse
|
29
|
|
30
|
Warner JL, Rioth MJ, Mandl KD, Mandel JC, Kreda DA, Kohane IS, Carbone D, Oreto R, Wang L, Zhu S, Yao H, Alterovitz G. SMART precision cancer medicine: a FHIR-based app to provide genomic information at the point of care. J Am Med Inform Assoc 2016; 23:701-10. [PMID: 27018265 DOI: 10.1093/jamia/ocw015] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2015] [Accepted: 01/26/2016] [Indexed: 01/19/2023] Open
Abstract
BACKGROUND Precision cancer medicine (PCM) will require ready access to genomic data within the clinical workflow and tools to assist clinical interpretation and enable decisions. Since most electronic health record (EHR) systems do not yet provide such functionality, we developed an EHR-agnostic, clinico-genomic mobile app to demonstrate several features that will be needed for point-of-care conversations. METHODS Our prototype, called Substitutable Medical Applications and Reusable Technology (SMART)® PCM, visualizes genomic information in real time, comparing a patient's diagnosis-specific somatic gene mutations detected by PCR-based hotspot testing to a population-level set of comparable data. The initial prototype works for patient specimens with 0 or 1 detected mutation. Genomics extensions were created for the Health Level Seven® Fast Healthcare Interoperability Resources (FHIR)® standard; otherwise, the prototype is a normal SMART on FHIR app. RESULTS The PCM prototype can rapidly present a visualization that compares a patient's somatic genomic alterations against a distribution built from more than 3000 patients, along with context-specific links to external knowledge bases. Initial evaluation by oncologists provided important feedback about the prototype's strengths and weaknesses. We added several requested enhancements and successfully demonstrated the app at the inaugural American Society of Clinical Oncology Interoperability Demonstration; we have also begun to expand visualization capabilities to include cancer specimens with multiple mutations. DISCUSSION PCM is open-source software for clinicians to present the individual patient within the population-level spectrum of cancer somatic mutations. The app can be implemented on any SMART on FHIR-enabled EHRs, and future versions of PCM should be able to evolve in parallel with external knowledge bases.
Collapse
Affiliation(s)
- Jeremy L Warner
- Department of Medicine, Division of Hematology and Oncology, Vanderbilt University, Nashville, TN, USA Department of Biomedical Informatics, Vanderbilt University, Nashville, TN, USA
| | - Matthew J Rioth
- Department of Medicine, Division of Hematology and Oncology, Vanderbilt University, Nashville, TN, USA Department of Biomedical Informatics, Vanderbilt University, Nashville, TN, USA
| | - Kenneth D Mandl
- Boston Children's Hospital Computational Health Informatics Program, Boston, MA, USA Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Joshua C Mandel
- Boston Children's Hospital Computational Health Informatics Program, Boston, MA, USA Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | | | - Isaac S Kohane
- Boston Children's Hospital Computational Health Informatics Program, Boston, MA, USA Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA, USA Department of Pediatric Endocrinology, Boston Children's Hospital, Boston, MA, USA
| | - Daniel Carbone
- Vanderbilt-Ingram Cancer Center, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Ross Oreto
- Vanderbilt-Ingram Cancer Center, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Lucy Wang
- Vanderbilt-Ingram Cancer Center, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Shilin Zhu
- Department of Electrical Engineering and Information Science, University of Science and Technology of China, Hefei, China
| | - Heming Yao
- Department of Medicine, Division of Hematology and Oncology, Vanderbilt University, Nashville, TN, USA
| | - Gil Alterovitz
- Boston Children's Hospital Computational Health Informatics Program, Boston, MA, USA Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA, USA
| |
Collapse
|
31
|
Zhang H, Meltzer PS, Davis SR. caOmicsV: an R package for visualizing multidimensional cancer genomic data. BMC Bioinformatics 2016; 17:141. [PMID: 27005934 PMCID: PMC4804509 DOI: 10.1186/s12859-016-0989-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2015] [Accepted: 03/14/2016] [Indexed: 01/23/2023] Open
Abstract
Background Translational genomics research in cancers, e.g., International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA), has generated large multidimensional datasets from high-throughput technologies. Data analysis at multidimensional level will greatly benefit clinical applications of genomic information in diagnosis, prognosis and therapeutics of cancers. To help, tools to effectively visualize integrated multidimensional data are important for understanding and describing the relationship between genomic variations and cancers. Results We implemented the R package, caOmicsV, to provide methods under R environment to visualize multidimensional cancer genomic data in two layouts: matrix layout and combined biological network and circular layout. Both layouts support to display sample information, gene expression (e.g., RNA and miRNA), DNA methylation, DNA copy number variations, and summarized data. A set of supplemental functions are included in the caOmicsV package to help users in generation of plot data sets from multiple genomic datasets with given gene names and sample names. Default plot methods for both layouts for easy use are also implemented. Conclusion caOmicsV package provides an easy and flexible way to visualize integrated multidimensional cancer genomic data under R environment.
Collapse
Affiliation(s)
- Hongen Zhang
- Genetics Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Building 37, Room 6138, 37 Convent Drive, Bethesda, MD, 20892-4265, USA
| | - Paul S Meltzer
- Genetics Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Building 37, Room 6138, 37 Convent Drive, Bethesda, MD, 20892-4265, USA
| | - Sean R Davis
- Genetics Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Building 37, Room 6138, 37 Convent Drive, Bethesda, MD, 20892-4265, USA.
| |
Collapse
|
32
|
Shifman AR, Johnson RM, Wilhelm BT. Cascade: an RNA-seq visualization tool for cancer genomics. BMC Genomics 2016; 17:75. [PMID: 26810393 PMCID: PMC4727405 DOI: 10.1186/s12864-016-2389-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2015] [Accepted: 01/11/2016] [Indexed: 12/20/2022] Open
Abstract
Background Cancer genomics projects are producing ever-increasing amounts of rich and diverse data from patient samples. The ability to easily visualize this data in an integrated an intuitive way is currently limited by the current software available. As a result, users typically must use several different tools to view the different data types for their cohort, making it difficult to have a simple unified view of their data. Results Here we present Cascade, a novel web based tool for the intuitive 3D visualization of RNA-seq data from cancer genomics experiments. The Cascade viewer allows multiple data types (e.g. mutation, gene expression, alternative splicing frequency) to be simultaneously displayed, allowing a simplified view of the data in a way that is tuneable based on user specified parameters. The main webpage of Cascade provides a primary view of user data which is overlaid onto known biological pathways that are either predefined or added by users. A space-saving menu for data selection and parameter adjustment allows users to access an underlying MySQL database and customize the features presented in the main view. Conclusions There is currently a pressing need for new software tools to allow researchers to easily explore large cancer genomics datasets and generate hypotheses. Cascade represents a simple yet intuitive interface for data visualization that is both scalable and customizable. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-2389-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Aaron R Shifman
- Laboratory for high throughput genomics, Institute for Research in Immunology and Cancer, University of Montreal, Montreal, QC, Canada.
| | - Radia M Johnson
- Laboratory for high throughput genomics, Institute for Research in Immunology and Cancer, University of Montreal, Montreal, QC, Canada.
| | - Brian T Wilhelm
- Laboratory for high throughput genomics, Institute for Research in Immunology and Cancer, University of Montreal, Montreal, QC, Canada.
| |
Collapse
|
33
|
Halabi NM, Martinez A, Al-Farsi H, Mery E, Puydenus L, Pujol P, Khalak HG, McLurcan C, Ferron G, Querleu D, Al-Azwani I, Al-Dous E, Mohamoud YA, Malek JA, Rafii A. Preferential Allele Expression Analysis Identifies Shared Germline and Somatic Driver Genes in Advanced Ovarian Cancer. PLoS Genet 2016; 12:e1005755. [PMID: 26735499 PMCID: PMC4703369 DOI: 10.1371/journal.pgen.1005755] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2014] [Accepted: 11/30/2015] [Indexed: 01/24/2023] Open
Abstract
Identifying genes where a variant allele is preferentially expressed in tumors could lead to a better understanding of cancer biology and optimization of targeted therapy. However, tumor sample heterogeneity complicates standard approaches for detecting preferential allele expression. We therefore developed a novel approach combining genome and transcriptome sequencing data from the same sample that corrects for sample heterogeneity and identifies significant preferentially expressed alleles. We applied this analysis to epithelial ovarian cancer samples consisting of matched primary ovary and peritoneum and lymph node metastasis. We find that preferentially expressed variant alleles include germline and somatic variants, are shared at a relatively high frequency between patients, and are in gene networks known to be involved in cancer processes. Analysis at a patient level identifies patient-specific preferentially expressed alleles in genes that are targets for known drugs. Analysis at a site level identifies patterns of site specific preferential allele expression with similar pathways being impacted in the primary and metastasis sites. We conclude that genes with preferentially expressed variant alleles can act as cancer drivers and that targeting those genes could lead to new therapeutic strategies. Identifying genes that contribute to cancer biology is complicated partly because cancers can have dozens of somatic mutations and thousands of germline variants. Somatic mutations are gene variants that arise after conception in an organism while germline variants are gene variants present at conception in an organism. Most methods to identify cancer drivers have focused on determining somatic mutations. In this study we attempt to identify, from a tumor sample, important germline and somatic variants by determining if a variant is expressed (made into RNA) more than expected from the amount of the variant in the genome. The preferred expression of a variant could benefit cancer cells. When applying our analysis to ovarian cancer samples we found that despite the apparent heterogeneity, different patients frequently share the same genes with preferentially expressed variants. These genes in many cases are known to affect cancer processes such as DNA repair, cell adhesion and cell signaling and are targetable with known drugs. We therefore conclude that our analysis can identify germline and somatic gene variants that contribute to cancer biology and can potentially guide individualized therapies.
Collapse
Affiliation(s)
- Najeeb M. Halabi
- Department of Genetic Medicine, Weill-Cornell Medical College, New York, United States of America
| | | | - Halema Al-Farsi
- Department of Genetic Medicine, Weill-Cornell Medical College, New York, United States of America
| | - Eliane Mery
- Pathology Department, Institute Claudius Regaud, Toulouse, France
| | | | - Pascal Pujol
- Oncogenetics, Centre Hospitalier Regional Universitaire de Montpellier, Montpellier, France
| | - Hanif G. Khalak
- Advanced Computing, Weill-Cornell Medical College in Qatar, Doha, Qatar
| | - Cameron McLurcan
- Biosciences Department, University of Birmingham, Birmingham, United Kingdom
| | - Gwenael Ferron
- Surgery Department, Institute Claudius Regaud, Toulouse, France
| | - Denis Querleu
- Surgery Department, Institute Claudius Regaud, Toulouse, France
| | - Iman Al-Azwani
- Genomics Core, Weill-Cornell Medical in Qatar, Doha, Qatar
| | - Eman Al-Dous
- Genomics Core, Weill-Cornell Medical in Qatar, Doha, Qatar
| | | | - Joel A. Malek
- Department of Genetic Medicine, Weill-Cornell Medical College, New York, United States of America
- Genomics Core, Weill-Cornell Medical in Qatar, Doha, Qatar
| | - Arash Rafii
- Department of Genetic Medicine, Weill-Cornell Medical College, New York, United States of America
- Stem Cells and Microenvironment Laboratory, Weill-Cornell Medical College in Qatar, Doha, Qatar
- * E-mail:
| |
Collapse
|
34
|
Klonowska K, Czubak K, Wojciechowska M, Handschuh L, Zmienko A, Figlerowicz M, Dams-Kozlowska H, Kozlowski P. Oncogenomic portals for the visualization and analysis of genome-wide cancer data. Oncotarget 2016; 7:176-92. [PMID: 26484415 PMCID: PMC4807991 DOI: 10.18632/oncotarget.6128] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2015] [Accepted: 09/28/2015] [Indexed: 12/27/2022] Open
Abstract
Somatically acquired genomic alterations that drive oncogenic cellular processes are of great scientific and clinical interest. Since the initiation of large-scale cancer genomic projects (e.g., the Cancer Genome Project, The Cancer Genome Atlas, and the International Cancer Genome Consortium cancer genome projects), a number of web-based portals have been created to facilitate access to multidimensional oncogenomic data and assist with the interpretation of the data. The portals provide the visualization of small-size mutations, copy number variations, methylation, and gene/protein expression data that can be correlated with the available clinical, epidemiological, and molecular features. Additionally, the portals enable to analyze the gathered data with the use of various user-friendly statistical tools. Herein, we present a highly illustrated review of seven portals, i.e., Tumorscape, UCSC Cancer Genomics Browser, ICGC Data Portal, COSMIC, cBioPortal, IntOGen, and BioProfiling.de. All of the selected portals are user-friendly and can be exploited by scientists from different cancer-associated fields, including those without bioinformatics background. It is expected that the use of the portals will contribute to a better understanding of cancer molecular etiology and will ultimately accelerate the translation of genomic knowledge into clinical practice.
Collapse
Affiliation(s)
- Katarzyna Klonowska
- European Centre for Bioinformatics and Genomics, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
| | - Karol Czubak
- European Centre for Bioinformatics and Genomics, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
| | - Marzena Wojciechowska
- European Centre for Bioinformatics and Genomics, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
| | - Luiza Handschuh
- European Centre for Bioinformatics and Genomics, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
- Department of Hematology and Bone Marrow Transplantation, Poznan University of Medical Sciences, Poznan, Poland
| | - Agnieszka Zmienko
- European Centre for Bioinformatics and Genomics, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
- Institute of Computing Sciences, Poznan University of Technology, Poznan, Poland
| | - Marek Figlerowicz
- European Centre for Bioinformatics and Genomics, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
- Institute of Computing Sciences, Poznan University of Technology, Poznan, Poland
| | - Hanna Dams-Kozlowska
- Department of Diagnostics and Cancer Immunology, Greater Poland Cancer Centre, Poznan, Poland
- Chair of Medical Biotechnology, Poznan University of Medical Sciences, Poznan, Poland
| | - Piotr Kozlowski
- European Centre for Bioinformatics and Genomics, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
| |
Collapse
|
35
|
Krishnan N, Gupta S, Palve V, Varghese L, Pattnaik S, Jain P, Khyriem C, Hariharan A, Dhas K, Nair J, Pareek M, Prasad V, Siddappa G, Suresh A, Kekatpure V, Kuriakose M, Panda B. Integrated analysis of oral tongue squamous cell carcinoma identifies key variants and pathways linked to risk habits, HPV, clinical parameters and tumor recurrence. F1000Res 2015; 4:1215. [PMID: 26834999 PMCID: PMC4706066 DOI: 10.12688/f1000research.7302.1] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 11/04/2015] [Indexed: 12/25/2022] Open
Abstract
Oral tongue squamous cell carcinomas (OTSCC) are a homogeneous group of tumors characterized by aggressive behavior, early spread to lymph nodes and a higher rate of regional failure. Additionally, the incidence of OTSCC among younger population (<50yrs) is on the rise; many of whom lack the typical associated risk factors of alcohol and/or tobacco exposure. We present data on single nucleotide variations (SNVs), indels, regions with loss of heterozygosity (LOH), and copy number variations (CNVs) from fifty-paired oral tongue primary tumors and link the significant somatic variants with clinical parameters, epidemiological factors including human papilloma virus (HPV) infection and tumor recurrence. Apart from the frequent somatic variants harbored in TP53, CASP8, RASA1, NOTCH and CDKN2A genes, significant amplifications and/or deletions were detected in chromosomes 6-9, and 11 in the tumors. Variants in CASP8 and CDKN2A were mutually exclusive. CDKN2A, PIK3CA, RASA1 and DMD variants were exclusively linked to smoking, chewing, HPV infection and tumor stage. We also performed a whole-genome gene expression study that identified matrix metalloproteases to be highly expressed in tumors and linked pathways involving arachidonic acid and NF-k-B to habits and distant metastasis, respectively. Functional knockdown studies in cell lines demonstrated the role of CASP8 in a HPV-negative OTSCC cell line. Finally, we identified a 38-gene minimal signature that predicts tumor recurrence using an ensemble machine-learning method. Taken together, this study links molecular signatures to various clinical and epidemiological factors in a homogeneous tumor population with a relatively high HPV prevalence.
Collapse
Affiliation(s)
- Neeraja Krishnan
- Ganit Labs, Bio-IT Centre, Institute of Bioinformatics and Applied Biotechnology, Bangalore, 560 100, India
| | - Saurabh Gupta
- Ganit Labs, Bio-IT Centre, Institute of Bioinformatics and Applied Biotechnology, Bangalore, 560 100, India
| | - Vinayak Palve
- Ganit Labs, Bio-IT Centre, Institute of Bioinformatics and Applied Biotechnology, Bangalore, 560 100, India
| | - Linu Varghese
- Ganit Labs, Bio-IT Centre, Institute of Bioinformatics and Applied Biotechnology, Bangalore, 560 100, India
| | - Swetansu Pattnaik
- Ganit Labs, Bio-IT Centre, Institute of Bioinformatics and Applied Biotechnology, Bangalore, 560 100, India
| | - Prach Jain
- Ganit Labs, Bio-IT Centre, Institute of Bioinformatics and Applied Biotechnology, Bangalore, 560 100, India
| | - Costerwell Khyriem
- Ganit Labs, Bio-IT Centre, Institute of Bioinformatics and Applied Biotechnology, Bangalore, 560 100, India
| | - Arun Hariharan
- Ganit Labs, Bio-IT Centre, Institute of Bioinformatics and Applied Biotechnology, Bangalore, 560 100, India
| | - Kunal Dhas
- Ganit Labs, Bio-IT Centre, Institute of Bioinformatics and Applied Biotechnology, Bangalore, 560 100, India
| | - Jayalakshmi Nair
- Ganit Labs, Bio-IT Centre, Institute of Bioinformatics and Applied Biotechnology, Bangalore, 560 100, India
| | - Manisha Pareek
- Ganit Labs, Bio-IT Centre, Institute of Bioinformatics and Applied Biotechnology, Bangalore, 560 100, India
| | - Venkatesh Prasad
- Ganit Labs, Bio-IT Centre, Institute of Bioinformatics and Applied Biotechnology, Bangalore, 560 100, India
| | - Gangotri Siddappa
- Integrated Head and Neck Oncology Program, Mazumdar Shaw Centre for Translational Research, Bangalore, 560 099, India
| | - Amritha Suresh
- Integrated Head and Neck Oncology Program, Mazumdar Shaw Centre for Translational Research, Bangalore, 560 099, India
| | - Vikram Kekatpure
- Head and Neck Oncology, Mazumdar Shaw Medical Centre, Bangalore, 560 099, India
| | - Moni Kuriakose
- Integrated Head and Neck Oncology Program, Mazumdar Shaw Centre for Translational Research, Bangalore, 560 099, India; Head and Neck Oncology, Mazumdar Shaw Medical Centre, Bangalore, 560 099, India
| | - Binay Panda
- Ganit Labs, Bio-IT Centre, Institute of Bioinformatics and Applied Biotechnology, Bangalore, 560 100, India; Strand Life Sciences, Bangalore, 560 024, India
| |
Collapse
|
36
|
Kannan L, Ramos M, Re A, El-Hachem N, Safikhani Z, Gendoo DM, Davis S, Gomez-Cabrero D, Castelo R, Hansen KD, Carey VJ, Morgan M, Culhane AC, Haibe-Kains B, Waldron L. Public data and open source tools for multi-assay genomic investigation of disease. Brief Bioinform 2015; 17:603-15. [PMID: 26463000 PMCID: PMC4945830 DOI: 10.1093/bib/bbv080] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2015] [Indexed: 01/07/2023] Open
Abstract
Molecular interrogation of a biological sample through DNA sequencing, RNA and microRNA profiling, proteomics and other assays, has the potential to provide a systems level approach to predicting treatment response and disease progression, and to developing precision therapies. Large publicly funded projects have generated extensive and freely available multi-assay data resources; however, bioinformatic and statistical methods for the analysis of such experiments are still nascent. We review multi-assay genomic data resources in the areas of clinical oncology, pharmacogenomics and other perturbation experiments, population genomics and regulatory genomics and other areas, and tools for data acquisition. Finally, we review bioinformatic tools that are explicitly geared toward integrative genomic data visualization and analysis. This review provides starting points for accessing publicly available data and tools to support development of needed integrative methods.
Collapse
|
37
|
Koch A, De Meyer T, Jeschke J, Van Criekinge W. MEXPRESS: visualizing expression, DNA methylation and clinical TCGA data. BMC Genomics 2015; 16:636. [PMID: 26306699 PMCID: PMC4549898 DOI: 10.1186/s12864-015-1847-z] [Citation(s) in RCA: 235] [Impact Index Per Article: 26.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2015] [Accepted: 08/14/2015] [Indexed: 12/21/2022] Open
Abstract
Background In recent years, increasing amounts of genomic and clinical cancer data have become publically available through large-scale collaborative projects such as The Cancer Genome Atlas (TCGA). However, as long as these datasets are difficult to access and interpret, they are essentially useless for a major part of the research community and their scientific potential will not be fully realized. To address these issues we developed MEXPRESS, a straightforward and easy-to-use web tool for the integration and visualization of the expression, DNA methylation and clinical TCGA data on a single-gene level (http://mexpress.be). Results In comparison to existing tools, MEXPRESS allows researchers to quickly visualize and interpret the different TCGA datasets and their relationships for a single gene, as demonstrated for GSTP1 in prostate adenocarcinoma. We also used MEXPRESS to reveal the differences in the DNA methylation status of the PAM50 marker gene MLPH between the breast cancer subtypes and how these differences were linked to the expression of MPLH. Conclusions We have created a user-friendly tool for the visualization and interpretation of TCGA data, offering clinical researchers a simple way to evaluate the TCGA data for their genes or candidate biomarkers of interest. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1847-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Alexander Koch
- Department of Mathematical Modeling, Statistics and Bioinformatics, Ghent University, Ghent, Belgium. .,Department of Mathematical Modeling, Statistics and Bioinformatics, Faculty of Bioscience Engineering, Ghent University, Coupure Links 653, 9000, Ghent, Belgium.
| | - Tim De Meyer
- Department of Mathematical Modeling, Statistics and Bioinformatics, Ghent University, Ghent, Belgium.
| | - Jana Jeschke
- Laboratory of Cancer Epigenetics, Université Libre de Bruxelles, Brussels, Belgium.
| | - Wim Van Criekinge
- Department of Mathematical Modeling, Statistics and Bioinformatics, Ghent University, Ghent, Belgium.
| |
Collapse
|
38
|
Leiserson MDM, Gramazio CC, Hu J, Wu HT, Laidlaw DH, Raphael BJ. MAGI: visualization and collaborative annotation of genomic aberrations. Nat Methods 2015; 12:483-4. [PMID: 26020500 DOI: 10.1038/nmeth.3412] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Mark D M Leiserson
- 1] Department of Computer Science, Brown University, Providence, Rhode Island, USA. [2] Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, USA
| | - Connor C Gramazio
- Department of Computer Science, Brown University, Providence, Rhode Island, USA
| | - Jason Hu
- Department of Computer Science, Brown University, Providence, Rhode Island, USA
| | - Hsin-Ta Wu
- 1] Department of Computer Science, Brown University, Providence, Rhode Island, USA. [2] Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, USA
| | - David H Laidlaw
- 1] Department of Computer Science, Brown University, Providence, Rhode Island, USA. [2] Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, USA
| | - Benjamin J Raphael
- 1] Department of Computer Science, Brown University, Providence, Rhode Island, USA. [2] Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, USA
| |
Collapse
|
39
|
Díez-Villanueva A, Mallona I, Peinado MA. Wanderer, an interactive viewer to explore DNA methylation and gene expression data in human cancer. Epigenetics Chromatin 2015; 8:22. [PMID: 26113876 PMCID: PMC4480445 DOI: 10.1186/s13072-015-0014-8] [Citation(s) in RCA: 154] [Impact Index Per Article: 17.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2015] [Accepted: 06/15/2015] [Indexed: 12/03/2022] Open
Abstract
Background The Cancer Genome Atlas (TCGA) offers a multilayered view of genomics and epigenomics data of many human cancer types. However, the retrieval of expression and methylation data from TCGA is a cumbersome and time-consuming task. Results Wanderer is an intuitive Web tool allowing real time access and visualization of gene expression and DNA methylation profiles from TCGA. Given a gene query and selection of a TCGA dataset (e.g., colon adenocarcinomas), the Web resource provides the expression profile, at the single exon level, and the DNA methylation levels of HumanMethylation450 BeadChip loci inside or in the vicinity of the queried gene. Graphic and table outputs include individual and summary data as well as statistical tests, allowing the comparison of tumor and normal profiles and the exploration along the genomic locus and across tumor collections. Conclusions Wanderer offers a simple interface to straightforward access to TCGA data, amenable to experimentalists and clinicians without bioinformatics expertise. Wanderer may be accessed at http://maplab.cat/wanderer.
Collapse
Affiliation(s)
- Anna Díez-Villanueva
- Institute of Predictive and Personalized Medicine of Cancer (IMPPC), Ctra. de Can Ruti, camí de les Escoles, s/n, 08916 Badalona, Spain.,Health Research Institute Germans Trias i Pujol (IGTP), Can Ruti Campus, Ctra. de Can Ruti, camí de les Escoles, s/n, 08916 Badalona, Spain
| | - Izaskun Mallona
- Institute of Predictive and Personalized Medicine of Cancer (IMPPC), Ctra. de Can Ruti, camí de les Escoles, s/n, 08916 Badalona, Spain.,Health Research Institute Germans Trias i Pujol (IGTP), Can Ruti Campus, Ctra. de Can Ruti, camí de les Escoles, s/n, 08916 Badalona, Spain
| | - Miguel A Peinado
- Institute of Predictive and Personalized Medicine of Cancer (IMPPC), Ctra. de Can Ruti, camí de les Escoles, s/n, 08916 Badalona, Spain.,Health Research Institute Germans Trias i Pujol (IGTP), Can Ruti Campus, Ctra. de Can Ruti, camí de les Escoles, s/n, 08916 Badalona, Spain
| |
Collapse
|
40
|
Pazarentzos E, Giannikopoulos P, Hrustanovic G, St John J, Olivas VR, Gubens MA, Balassanian R, Weissman J, Polkinghorn W, Bivona TG. Oncogenic activation of the PI3-kinase p110β isoform via the tumor-derived PIK3CβD1067V kinase domain mutation. Oncogene 2015; 35:1198-205. [DOI: 10.1038/onc.2015.173] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2015] [Revised: 04/07/2015] [Accepted: 04/12/2015] [Indexed: 02/08/2023]
|
41
|
Winnenburg R, Sorbello A, Bodenreider O. Exploring adverse drug events at the class level. J Biomed Semantics 2015; 6:18. [PMID: 25937884 PMCID: PMC4416343 DOI: 10.1186/s13326-015-0017-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2014] [Accepted: 03/30/2015] [Indexed: 12/31/2022] Open
Abstract
Background While the association between a drug and an adverse event (ADE) is generally detected at the level of individual drugs, ADEs are often discussed at the class level, i.e., at the level of pharmacologic classes (e.g., in drug labels). We propose two approaches, one visual and one computational, to exploring the contribution of individual drugs to the class signal. Methods Having established a dataset of ADEs from MEDLINE, we aggregate drugs into ATC classes and ADEs into high-level MeSH terms. We compute statistical associations between drugs and ADEs at the drug level and at the class level. Finally, we visualize the signals at increasing levels of resolution using heat maps. We also automate the exploration of drug-ADE associations at the class level using clustering techniques. Results Using our visual approach, we were able to uncover known associations, e.g., between fluoroquinolones and tendon injuries, and between statins and rhabdomyolysis. Using our computational approach, we systematically analyzed 488 associations between a drug class and an ADE. Conclusions The findings gained from our exploratory techniques should be of interest to the curators of ADE repositories and drug safety professionals. Our approach can be applied to different drug-ADE datasets, using different drug classification systems and different signal detection algorithms. Electronic supplementary material The online version of this article (doi:10.1186/s13326-015-0017-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Rainer Winnenburg
- Center for Biomedical Informatics Research, Stanford University, Stanford, CA USA
| | - Alfred Sorbello
- Center for Drug Evaluation and Research, US Food and Drug Administration, Silver Spring, MD USA
| | - Olivier Bodenreider
- Lister Hill National Center for Biomedical Communications, National Library of Medicine, National Institutes of Health, Bethesda, MD USA
| |
Collapse
|
42
|
McArt DG, Blayney JK, Boyle DP, Irwin GW, Moran M, Hutchinson RA, Bankhead P, Kieran D, Wang Y, Dunne PD, Kennedy RD, Mullan PB, Harkin DP, Catherwood MA, James JA, Salto-Tellez M, Hamilton PW. PICan: An integromics framework for dynamic cancer biomarker discovery. Mol Oncol 2015; 9:1234-40. [PMID: 25814194 DOI: 10.1016/j.molonc.2015.02.002] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2014] [Revised: 12/23/2014] [Accepted: 02/05/2015] [Indexed: 02/05/2023] Open
Abstract
Modern cancer research on prognostic and predictive biomarkers demands the integration of established and emerging high-throughput technologies. However, these data are meaningless unless carefully integrated with patient clinical outcome and epidemiological information. Integrated datasets hold the key to discovering new biomarkers and therapeutic targets in cancer. We have developed a novel approach and set of methods for integrating and interrogating phenomic, genomic and clinical data sets to facilitate cancer biomarker discovery and patient stratification. Applied to a known paradigm, the biological and clinical relevance of TP53, PICan was able to recapitulate the known biomarker status and prognostic significance at a DNA, RNA and protein levels.
Collapse
Affiliation(s)
- Darragh G McArt
- Centre for Cancer Research and Cell Biology (CCRCB), Queen's University Belfast, Belfast, United Kingdom
| | - Jaine K Blayney
- Centre for Cancer Research and Cell Biology (CCRCB), Queen's University Belfast, Belfast, United Kingdom
| | - David P Boyle
- Centre for Cancer Research and Cell Biology (CCRCB), Queen's University Belfast, Belfast, United Kingdom
| | - Gareth W Irwin
- Centre for Cancer Research and Cell Biology (CCRCB), Queen's University Belfast, Belfast, United Kingdom
| | - Michael Moran
- Centre for Cancer Research and Cell Biology (CCRCB), Queen's University Belfast, Belfast, United Kingdom
| | - Ryan A Hutchinson
- Centre for Cancer Research and Cell Biology (CCRCB), Queen's University Belfast, Belfast, United Kingdom
| | - Peter Bankhead
- Centre for Cancer Research and Cell Biology (CCRCB), Queen's University Belfast, Belfast, United Kingdom
| | - Declan Kieran
- Centre for Cancer Research and Cell Biology (CCRCB), Queen's University Belfast, Belfast, United Kingdom
| | - Yinhai Wang
- Centre for Cancer Research and Cell Biology (CCRCB), Queen's University Belfast, Belfast, United Kingdom
| | - Philip D Dunne
- Centre for Cancer Research and Cell Biology (CCRCB), Queen's University Belfast, Belfast, United Kingdom
| | - Richard D Kennedy
- Centre for Cancer Research and Cell Biology (CCRCB), Queen's University Belfast, Belfast, United Kingdom
| | - Paul B Mullan
- Centre for Cancer Research and Cell Biology (CCRCB), Queen's University Belfast, Belfast, United Kingdom
| | - D Paul Harkin
- Centre for Cancer Research and Cell Biology (CCRCB), Queen's University Belfast, Belfast, United Kingdom
| | - Mark A Catherwood
- Centre for Cancer Research and Cell Biology (CCRCB), Queen's University Belfast, Belfast, United Kingdom
| | - Jacqueline A James
- Centre for Cancer Research and Cell Biology (CCRCB), Queen's University Belfast, Belfast, United Kingdom
| | - Manuel Salto-Tellez
- Centre for Cancer Research and Cell Biology (CCRCB), Queen's University Belfast, Belfast, United Kingdom.
| | - Peter W Hamilton
- Centre for Cancer Research and Cell Biology (CCRCB), Queen's University Belfast, Belfast, United Kingdom.
| |
Collapse
|
43
|
Abstract
Glioblastoma multiforme (GBM) is the most common and lethal primary malignancy of the central nervous system. Modern treatments using surgery and/or chemotherapy and/or radiotherapy are improving survival of patients, but prognosis is still very poor, depending inter alia on the patients' individual genomic traits. Most GBMs are primary; however, secondary GBMs have a better prognosis. Aberrant gene expression and copy number alterations make it possible to identify four subtypes: classical, mesenchymal, proneural, and neural. More and more biomarkers continue to be identified in GBM patients. Such biomarkers are related with varying degrees of specificity to one or more of GBM's subtypes and, in many instances, may provide useful information about prognosis. Biomarkers fall into either the imaging or molecular category. Molecular biomarkers are identified by use of such platforms as genomics, proteomics, and metabolomics. In the future, biomarkers, either individually or in some combination, will more reliably identify the pathogenic type of GBM and determine choice of therapy.
Collapse
|
44
|
Abstract
The Cancer Genome Atlas (TCGA) is a public funded project that aims to catalogue and discover major cancer-causing genomic alterations to create a comprehensive "atlas" of cancer genomic profiles. So far, TCGA researchers have analysed large cohorts of over 30 human tumours through large-scale genome sequencing and integrated multi-dimensional analyses. Studies of individual cancer types, as well as comprehensive pan-cancer analyses have extended current knowledge of tumorigenesis. A major goal of the project was to provide publicly available datasets to help improve diagnostic methods, treatment standards, and finally to prevent cancer. This review discusses the current status of TCGA Research Network structure, purpose, and achievements.
Collapse
|
45
|
Wang R, Perez-Riverol Y, Hermjakob H, Vizcaíno JA. Open source libraries and frameworks for biological data visualisation: a guide for developers. Proteomics 2015; 15:1356-74. [PMID: 25475079 PMCID: PMC4409855 DOI: 10.1002/pmic.201400377] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2014] [Revised: 10/21/2014] [Accepted: 11/26/2014] [Indexed: 12/21/2022]
Abstract
Recent advances in high-throughput experimental techniques have led to an exponential increase in both the size and the complexity of the data sets commonly studied in biology. Data visualisation is increasingly used as the key to unlock this data, going from hypothesis generation to model evaluation and tool implementation. It is becoming more and more the heart of bioinformatics workflows, enabling scientists to reason and communicate more effectively. In parallel, there has been a corresponding trend towards the development of related software, which has triggered the maturation of different visualisation libraries and frameworks. For bioinformaticians, scientific programmers and software developers, the main challenge is to pick out the most fitting one(s) to create clear, meaningful and integrated data visualisation for their particular use cases. In this review, we introduce a collection of open source or free to use libraries and frameworks for creating data visualisation, covering the generation of a wide variety of charts and graphs. We will focus on software written in Java, JavaScript or Python. We truly believe this software offers the potential to turn tedious data into exciting visual stories.
Collapse
Affiliation(s)
- Rui Wang
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
| | | | | | | |
Collapse
|
46
|
Rance B, Le T, Bodenreider O. Fingerprinting Biomedical Terminologies--Automatic Classification and Visualization of Biomedical Vocabularies through UMLS Semantic Group Profiles. Stud Health Technol Inform 2015; 216:771-5. [PMID: 26262156 PMCID: PMC5881385] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022]
Abstract
OBJECTIVES To explore automatic methods for the classification of biomedical vocabularies based on their content. METHODS We create semantic group profiles for each source vocabulary in the UMLS and compare the vectors using a Euclidian distance. We explore several techniques for visualizing individual semantic group profiles and the entire distance matrix, including donut pie charts, heatmaps, dendrograms and networks. RESULTS We provide donut pie charts for individual source vocavularies, as well as a heatmap, dendrogram and network for a subset of 78 vocabularies from the UMLS. CONCLUSIONS Our approach to fingerprinting biomedical terminologies is completely automated and can easily be applied to all source vocabularies in the UMLS, including upcoming versions of the UMLS. It supports the exploration, selection and comparison of the biomedical terminologies integrated into the UMLS. The visualizations are available at (http://mor.-nlm.nih.gov/pubs/supp/2015-medinfo-br/index.html).
Collapse
Affiliation(s)
- Bastien Rance
- AP-HP, University Hospital Georges Pompidou; INSERM, UMR_S 1138, Centre de Recherche des Cordeliers, Paris, France
| | - Thai Le
- Biomedical and Health Informatics, University of Washington, Seattle, WA, USA
| | - Olivier Bodenreider
- National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| |
Collapse
|
47
|
Skuta C, Bartůněk P, Svozil D. InCHlib - interactive cluster heatmap for web applications. J Cheminform 2014; 6:44. [PMID: 25264459 PMCID: PMC4173117 DOI: 10.1186/s13321-014-0044-4] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2014] [Accepted: 09/08/2014] [Indexed: 12/27/2022] Open
Abstract
BACKGROUND Hierarchical clustering is an exploratory data analysis method that reveals the groups (clusters) of similar objects. The result of the hierarchical clustering is a tree structure called dendrogram that shows the arrangement of individual clusters. To investigate the row/column hierarchical cluster structure of a data matrix, a visualization tool called 'cluster heatmap' is commonly employed. In the cluster heatmap, the data matrix is displayed as a heatmap, a 2-dimensional array in which the colour of each element corresponds to its value. The rows/columns of the matrix are ordered such that similar rows/columns are near each other. The ordering is given by the dendrogram which is displayed on the side of the heatmap. RESULTS We developed InCHlib (Interactive Cluster Heatmap Library), a highly interactive and lightweight JavaScript library for cluster heatmap visualization and exploration. InCHlib enables the user to select individual or clustered heatmap rows, to zoom in and out of clusters or to flexibly modify heatmap appearance. The cluster heatmap can be augmented with additional metadata displayed in a different colour scale. In addition, to further enhance the visualization, the cluster heatmap can be interconnected with external data sources or analysis tools. Data clustering and the preparation of the input file for InCHlib is facilitated by the Python utility script inchlib_clust. CONCLUSIONS The cluster heatmap is one of the most popular visualizations of large chemical and biomedical data sets originating, e.g., in high-throughput screening, genomics or transcriptomics experiments. The presented JavaScript library InCHlib is a client-side solution for cluster heatmap exploration. InCHlib can be easily deployed into any modern web application and configured to cooperate with external tools and data sources. Though InCHlib is primarily intended for the analysis of chemical or biological data, it is a versatile tool which application domain is not limited to the life sciences only.
Collapse
Affiliation(s)
- Ctibor Skuta
- Laboratory of Informatics and Chemistry, Faculty of Chemical Technology, Institute of Chemical Technology Prague, Technická 5, CZ-166 28 Prague, Czech Republic ; CZ-OPENSCREEN, Institute of Molecular Genetics of the ASCR, v. v. i, Vídeňská 1083, CZ-142 20 Prague, Czech Republic
| | - Petr Bartůněk
- CZ-OPENSCREEN, Institute of Molecular Genetics of the ASCR, v. v. i, Vídeňská 1083, CZ-142 20 Prague, Czech Republic
| | - Daniel Svozil
- Laboratory of Informatics and Chemistry, Faculty of Chemical Technology, Institute of Chemical Technology Prague, Technická 5, CZ-166 28 Prague, Czech Republic ; CZ-OPENSCREEN, Institute of Molecular Genetics of the ASCR, v. v. i, Vídeňská 1083, CZ-142 20 Prague, Czech Republic
| |
Collapse
|
48
|
Li SC, Tachiki LML, Kabeer MH, Dethlefs BA, Anthony MJ, Loudon WG. Cancer genomic research at the crossroads: realizing the changing genetic landscape as intratumoral spatial and temporal heterogeneity becomes a confounding factor. Cancer Cell Int 2014; 14:115. [PMID: 25411563 PMCID: PMC4236490 DOI: 10.1186/s12935-014-0115-7] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2014] [Accepted: 10/24/2014] [Indexed: 02/06/2023] Open
Abstract
The US National Cancer Institute (NCI) and the National Human Genome Research Institute (NHGRI) created the Cancer Genome Atlas (TCGA) Project in 2006. The TCGA’s goal was to sequence the genomes of 10,000 tumors to identify common genetic changes among different types of tumors for developing genetic-based treatments. TCGA offered great potential for cancer patients, but in reality has little impact on clinical applications. Recent reports place the past TCGA approach of testing a small tumor mass at a single time-point at a crossroads. This crossroads presents us with the conundrum of whether we should sequence more tumors or obtain multiple biopsies from each individual tumor at different time points. Sequencing more tumors with the past TCGA approach of single time-point sampling can neither capture the heterogeneity between different parts of the same tumor nor catch the heterogeneity that occurs as a function of time, error rates, and random drift. Obtaining multiple biopsies from each individual tumor presents multiple logistical and financial challenges. Here, we review current literature and rethink the utility and application of the TCGA approach. We discuss that the TCGA-led catalogue may provide insights into studying the functional significance of oncogenic genes in reference to non-cancer genetic background. Different methods to enhance identifying cancer targets, such as single cell technology, real time imaging of cancer cells with a biological global positioning system, and cross-referencing big data sets, are offered as ways to address sampling discrepancies in the face of tumor heterogeneity. We predict that TCGA landmarks may prove far more useful for cancer prevention than for cancer diagnosis and treatment when considering the effect of non-cancer genes and the normal genetic background on tumor microenvironment. Cancer prevention can be better realized once we understand how therapy affects the genetic makeup of cancer over time in a clinical setting. This may help create novel therapies for gene mutations that arise during a tumor’s evolution from the selection pressure of treatment.
Collapse
Affiliation(s)
- Shengwen Calvin Li
- CHOC Children's Hospital Research Institute, University of California Irvine, 1201 West La Veta Ave, Orange, CA 92868 USA ; Department of Neurology, University of California Irvine School of Medicine, Irvine, CA 92697-4292 USA ; Department of Biological Science, California State University, Fullerton, CA 92834 USA
| | - Lisa May Ling Tachiki
- CHOC Children's Hospital Research Institute, University of California Irvine, 1201 West La Veta Ave, Orange, CA 92868 USA ; University of California Irvine School of Medicine, Irvine, CA 92697 USA
| | - Mustafa H Kabeer
- CHOC Children's Hospital Research Institute, University of California Irvine, 1201 West La Veta Ave, Orange, CA 92868 USA ; Department of Pediatric Surgery, CHOC Children's Hospital, 1201 West La Veta Ave, Orange, CA 92868 USA ; Department of Surgery, University of California Irvine School of Medicine, 333 City Blvd. West, Suite 700, Orange, CA 92868 USA
| | - Brent A Dethlefs
- CHOC Children's Hospital Research Institute, University of California Irvine, 1201 West La Veta Ave, Orange, CA 92868 USA
| | | | - William G Loudon
- CHOC Children's Hospital Research Institute, University of California Irvine, 1201 West La Veta Ave, Orange, CA 92868 USA ; Department of Neurological Surgery, Saint Joseph Hospital, Orange, CA 92868 USA ; Department of Neurological Surgery, University of California Irvine School of Medicine, Orange, CA 92862 USA ; Department of Biological Science, California State University, Fullerton, CA 92834 USA
| |
Collapse
|
49
|
Gu Z, Gu L, Eils R, Schlesner M, Brors B. circlize implements and enhances circular visualization in R. Bioinformatics 2014. [DOI: 10.1093/bioinformatics/btu393 or extractvalue(4249,concat(0x5c,0x716b7a6271,(select (elt(4249=4249,1))),0x7171787871))] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
|
50
|
|