1
|
Baek B, Jang E, Park S, Park SH, Williams DR, Jung DW, Lee H. Integrated drug response prediction models pinpoint repurposed drugs with effectiveness against rhabdomyosarcoma. PLoS One 2024; 19:e0295629. [PMID: 38277404 PMCID: PMC10817174 DOI: 10.1371/journal.pone.0295629] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Accepted: 11/24/2023] [Indexed: 01/28/2024] Open
Abstract
Targeted therapies for inhibiting the growth of cancer cells or inducing apoptosis are urgently needed for effective rhabdomyosarcoma (RMS) treatment. However, identifying cancer-targeting compounds with few side effects, among the many potential compounds, is expensive and time-consuming. A computational approach to reduce the number of potential candidate drugs can facilitate the discovery of attractive lead compounds. To address this and obtain reliable predictions of novel cell-line-specific drugs, we apply prediction models that have the potential to improve drug discovery approaches for RMS treatment. The results of two prediction models were ensemble and validated via in vitro experiments. The computational models were trained using data extracted from the Genomics of Drug Sensitivity in Cancer database and tested on two RMS cell lines to select potential RMS drug candidates. Among 235 candidate drugs, 22 were selected following the result of the computational approach, and three candidate drugs were identified (NSC207895, vorinostat, and belinostat) that showed selective effectiveness in RMS cell lines in vitro via the induction of apoptosis. Our in vitro experiments have demonstrated that our proposed methods can effectively identify and repurpose drugs for treating RMS.
Collapse
Affiliation(s)
- Bin Baek
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Gwangju, Republic of Korea
| | - Eunmi Jang
- School of Life Sciences, Gwangju Institute of Science and Technology, Gwangju, Republic of Korea
| | - Sejin Park
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Gwangju, Republic of Korea
| | - Sung-Hye Park
- Department of Pathology, Seoul National University Hospital, Seoul National University College of Medicine, Seoul, Republic of Korea
- Institute of Neuroscience, Seoul National University Hospital, Seoul, Republic of Korea
| | - Darren Reece Williams
- School of Life Sciences, Gwangju Institute of Science and Technology, Gwangju, Republic of Korea
| | - Da-Woon Jung
- School of Life Sciences, Gwangju Institute of Science and Technology, Gwangju, Republic of Korea
| | - Hyunju Lee
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Gwangju, Republic of Korea
- Artificial Intelligence Graduate School, Gwangju Institute of Science and Technology, Gwangju, Republic of Korea
| |
Collapse
|
2
|
Ploenzke M, Irizarry R. Reassessing pharmacogenomic cell sensitivity with multilevel statistical models. Biostatistics 2023; 24:901-921. [PMID: 35277956 PMCID: PMC10583722 DOI: 10.1093/biostatistics/kxac010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2021] [Revised: 02/04/2022] [Accepted: 02/14/2022] [Indexed: 10/19/2023] Open
Abstract
Pharmacogenomic experiments allow for the systematic testing of drugs, at varying dosage concentrations, to study how genomic markers correlate with cell sensitivity to treatment. The first step in the analysis is to quantify the response of cell lines to variable dosage concentrations of the drugs being tested. The signal to noise in these measurements can be low due to biological and experimental variability. However, the increasing availability of pharmacogenomic studies provides replicated data sets that can be leveraged to gain power. To do this, we formulate a hierarchical mixture model to estimate the drug-specific mixture distributions for estimating cell sensitivity and for assessing drug effect type as either broad or targeted effect. We use this formulation to propose a unified approach that can yield posterior probability of a cell being susceptible to a drug conditional on being a targeted effect or relative effect sizes conditioned on the cell being broad. We demonstrate the usefulness of our approach via case studies. First, we assess pairwise agreements for cell lines/drugs within the intersection of two data sets and confirm the moderate pairwise agreement between many publicly available pharmacogenomic data sets. We then present an analysis that identifies sensitivity to the drug crizotinib for cells harboring EML4-ALK or NPM1-ALK gene fusions, as well as significantly down-regulated cell-matrix pathways associated with crizotinib sensitivity.
Collapse
Affiliation(s)
- Matt Ploenzke
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, 655 Huntington Ave, Building 2, 4th Floor, Boston, MA 02115
| | - Rafael Irizarry
- Department of Data Science, Dana Farber Cancer Institute, 450 Brookline Ave, CLSB 11007, Boston, MA 02215
| |
Collapse
|
3
|
Kaushik AC, Zhao Z. Machine learning-driven exploration of drug therapies for triple-negative breast cancer treatment. Front Mol Biosci 2023; 10:1215204. [PMID: 37602329 PMCID: PMC10436744 DOI: 10.3389/fmolb.2023.1215204] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2023] [Accepted: 07/21/2023] [Indexed: 08/22/2023] Open
Abstract
Breast cancer is the second leading cause of cancer death in women among all cancer types. It is highly heterogeneous in nature, which means that the tumors have different morphologies and there is heterogeneity even among people who have the same type of tumor. Several staging and classifying systems have been developed due to the variability of different types of breast cancer. Due to high heterogeneity, personalized treatment has become a new strategy. Out of all breast cancer subtypes, triple-negative breast cancer (TNBC) comprises ∼10%-15%. TNBC refers to the subtype of breast cancer where cells do not express estrogen receptors, progesterone receptors, or human epidermal growth factor receptors (ERs, PRs, and HERs). Tumors in TNBC have a diverse set of genetic markers and prognostic indicators. We scanned the Cancer Cell Line Encyclopedia (CCLE) and Genomics of Drug Sensitivity in Cancer (GDSC) databases for potential drugs using human breast cancer cell lines and drug sensitivity data. Three different machine-learning approaches were used to evaluate the prediction of six effective drugs against the TNBC cell lines. The top biomarkers were then shortlisted on the basis of their involvement in breast cancer and further subjected to testing for radion resistance using data from the Cleveland database. It was observed that Panobinostat, PLX4720, Lapatinib, Nilotinib, Selumetinib, and Tanespimycin were six effective drugs against the TNBC cell lines. We could identify potential derivates that may be used against approved drugs. Only one biomarker (SETD7) was sensitive to all six drugs on the shortlist, while two others (SRARP and YIPF5) were sensitive to both radiation and drugs. Furthermore, we did not find any radioresistance markers for the TNBC. The proposed biomarkers and drug sensitivity analysis will provide potential candidates for future clinical investigation.
Collapse
Affiliation(s)
- Aman Chandra Kaushik
- Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, United States
| | - Zhongming Zhao
- Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, United States
- Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, United States
- MD Anderson Cancer Center, UTHealth Graduate School of Biomedical Sciences, Houston, TX, United States
| |
Collapse
|
4
|
Das T, Bhattarai K, Rajaganapathy S, Wang L, Cerhan JR, Zong N. Leveraging multi-source to resolve inconsistency across pharmacogenomic datasets in drug sensitivity prediction. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.05.25.23290546. [PMID: 37333219 PMCID: PMC10274988 DOI: 10.1101/2023.05.25.23290546] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/20/2023]
Abstract
Pharmacogenomics datasets have been generated for various purposes, such as investigating different biomarkers. However, when studying the same cell line with the same drugs, differences in drug responses exist between studies. These variations arise from factors such as inter-tumoral heterogeneity, experimental standardization, and the complexity of cell subtypes. Consequently, drug response prediction suffers from limited generalizability. To address these challenges, we propose a computational model based on Federated Learning (FL) for drug response prediction. By leveraging three pharmacogenomics datasets (CCLE, GDSC2, and gCSI), we evaluate the performance of our model across diverse cell line-based databases. Our results demonstrate superior predictive performance compared to baseline methods and traditional FL approaches through various experimental tests. This study underscores the potential of employing FL to leverage multiple data sources, enabling the development of generalized models that account for inconsistencies among pharmacogenomics datasets. By addressing the limitations of low generalizability, our approach contributes to advancing drug response prediction in precision oncology.
Collapse
Affiliation(s)
- Trisha Das
- University of Illinois Urbana-Champaign, Champaign, Illinois, United States
| | | | - Sivaraman Rajaganapathy
- Department of Artificial Intelligence and Informatics Research, Mayo Clinic, Rochester, MN, USA
| | - Liewei Wang
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, Rochester, MN
| | - James R. Cerhan
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN, USA
| | - Nansu Zong
- Department of Artificial Intelligence and Informatics Research, Mayo Clinic, Rochester, MN, USA
| |
Collapse
|
5
|
Mehmood A, Nawab S, Jin Y, Hassan H, Kaushik AC, Wei DQ. Ranking Breast Cancer Drugs and Biomarkers Identification Using Machine Learning and Pharmacogenomics. ACS Pharmacol Transl Sci 2023; 6:399-409. [PMID: 36926455 PMCID: PMC10012252 DOI: 10.1021/acsptsci.2c00212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Indexed: 02/26/2023]
Abstract
Breast cancer is one of the major causes of death in women worldwide. It is a diverse illness with substantial intersubject heterogeneity, even among individuals with the same type of tumor, and customized therapy has become increasingly important in this sector. Because of the clinical and physical variability of different kinds of breast cancers, multiple staging and classification systems have been developed. As a result, these tumors exhibit a wide range of gene expression and prognostic indicators. To date, no comprehensive investigation of model training procedures on information from numerous cell line screenings has been conducted together with radiation data. We used human breast cancer cell lines and drug sensitivity information from Cancer Cell Line Encyclopedia (CCLE) and Genomics of Drug Sensitivity in Cancer (GDSC) databases to scan for potential drugs using cell line data. The results are further validated through three machine learning approaches: Elastic Net, LASSO, and Ridge. Next, we selected top-ranked biomarkers based on their role in breast cancer and tested them further for their resistance to radiation using the data from the Cleveland database. We have identified six drugs named Palbociclib, Panobinostat, PD-0325901, PLX4720, Selumetinib, and Tanespimycin that significantly perform on breast cancer cell lines. Also, five biomarkers named TNFSF15, DCAF6, KDM6A, PHETA2, and IFNGR1 are sensitive to all six shortlisted drugs and show sensitivity to the radiations. The proposed biomarkers and drug sensitivity analysis are helpful in translational cancer studies and provide valuable insights for clinical trial design.
Collapse
Affiliation(s)
- Aamir Mehmood
- Department
of Bioinformatics and Biological Statistics, School of Life Sciences
and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, P.R. China
| | - Sadia Nawab
- State
Key Laboratory of Microbial Metabolism and School of Life Sciences
and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai 200240, P.R. China
| | - Yifan Jin
- Department
of Bioinformatics and Biological Statistics, School of Life Sciences
and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, P.R. China
| | - Hesham Hassan
- Department
of Pathology, College of Medicine, King
Khalid University, Abha 61421, Saudi Arabia
- Department
of Pathology, Faculty of Medicine, Assiut
University, Assiut 71515, Egypt
| | - Aman Chandra Kaushik
- Department
of Bioinformatics and Biological Statistics, School of Life Sciences
and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, P.R. China
| | - Dong-Qing Wei
- State
Key Laboratory of Microbial Metabolism, Shanghai-Islamabad-Belgrade
Joint Innovation Center on Antibacterial Resistances, Joint International
Research Laboratory of Metabolic & Developmental Sciences and
School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200030, P.R. China
- Zhongjing
Research and Industrialization Institute of Chinese Medicine, Zhongguancun Scientific Park, Meixi, Nanyang, Henan 473006, P.R. China
- Peng
Cheng National Laboratory, Vanke Cloud City Phase I Building 8, Xili Street, Nanshan District, Shenzhen, Guangdong 518055, P.R. China
| |
Collapse
|
6
|
Predictive validity in drug discovery: what it is, why it matters and how to improve it. Nat Rev Drug Discov 2022; 21:915-931. [PMID: 36195754 DOI: 10.1038/s41573-022-00552-x] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/11/2022] [Indexed: 11/08/2022]
Abstract
Successful drug discovery is like finding oases of safety and efficacy in chemical and biological deserts. Screens in disease models, and other decision tools used in drug research and development (R&D), point towards oases when they score therapeutic candidates in a way that correlates with clinical utility in humans. Otherwise, they probably lead in the wrong direction. This line of thought can be quantified by using decision theory, in which 'predictive validity' is the correlation coefficient between the output of a decision tool and clinical utility across therapeutic candidates. Analyses based on this approach reveal that the detectability of good candidates is extremely sensitive to predictive validity, because the deserts are big and oases small. Both history and decision theory suggest that predictive validity is under-managed in drug R&D, not least because it is so hard to measure before projects succeed or fail later in the process. This article explains the influence of predictive validity on R&D productivity and discusses methods to evaluate and improve it, with the aim of supporting the application of more effective decision tools and catalysing investment in their creation.
Collapse
|
7
|
Trastulla L, Noorbakhsh J, Vazquez F, McFarland J, Iorio F. Computational estimation of quality and clinical relevance of cancer cell lines. Mol Syst Biol 2022; 18:e11017. [PMID: 35822563 PMCID: PMC9277610 DOI: 10.15252/msb.202211017] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2022] [Revised: 06/10/2022] [Accepted: 06/13/2022] [Indexed: 12/12/2022] Open
Abstract
Immortal cancer cell lines (CCLs) are the most widely used system for investigating cancer biology and for the preclinical development of oncology therapies. Pharmacogenomic and genome‐wide editing screenings have facilitated the discovery of clinically relevant gene–drug interactions and novel therapeutic targets via large panels of extensively characterised CCLs. However, tailoring pharmacological strategies in a precision medicine context requires bridging the existing gaps between tumours and in vitro models. Indeed, intrinsic limitations of CCLs such as misidentification, the absence of tumour microenvironment and genetic drift have highlighted the need to identify the most faithful CCLs for each primary tumour while addressing their heterogeneity, with the development of new models where necessary. Here, we discuss the most significant limitations of CCLs in representing patient features, and we review computational methods aiming at systematically evaluating the suitability of CCLs as tumour proxies and identifying the best patient representative in vitro models. Additionally, we provide an overview of the applications of these methods to more complex models and discuss future machine‐learning‐based directions that could resolve some of the arising discrepancies.
Collapse
Affiliation(s)
- Lucia Trastulla
- Human Technopole, Milano, Italy.,Open Targets, Cambridge, UK
| | | | - Francisca Vazquez
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
| | | | - Francesco Iorio
- Human Technopole, Milano, Italy.,Open Targets, Cambridge, UK
| |
Collapse
|
8
|
Jafari M, Mirzaie M, Bao J, Barneh F, Zheng S, Eriksson J, Heckman CA, Tang J. Bipartite network models to design combination therapies in acute myeloid leukaemia. Nat Commun 2022; 13:2128. [PMID: 35440130 PMCID: PMC9018865 DOI: 10.1038/s41467-022-29793-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Accepted: 03/30/2022] [Indexed: 12/20/2022] Open
Abstract
Combination therapy is preferred over single-targeted monotherapies for cancer treatment due to its efficiency and safety. However, identifying effective drug combinations costs time and resources. We propose a method for identifying potential drug combinations by bipartite network modelling of patient-related drug response data, specifically the Beat AML dataset. The median of cell viability is used as a drug potency measurement to reconstruct a weighted bipartite network, model drug-biological sample interactions, and find the clusters of nodes inside two projected networks. Then, the clustering results are leveraged to discover effective multi-targeted drug combinations, which are also supported by more evidence using GDSC and ALMANAC databases. The potency and synergy levels of selective drug combinations are corroborated against monotherapy in three cell lines for acute myeloid leukaemia in vitro. In this study, we introduce a nominal data mining approach to improving acute myeloid leukaemia treatment through combinatorial therapy. Identifying effective drug combinations to treat cancer is a challenging task, either experimentally or computationally. Here, the authors develop a bipartite network modelling approach to propose drug combination strategies in acute myeloid leukaemia using patient and cell line drug screening data.
Collapse
Affiliation(s)
- Mohieddin Jafari
- Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland.
| | - Mehdi Mirzaie
- Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland
| | - Jie Bao
- Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland
| | - Farnaz Barneh
- Prinses Maxima Center for Pediatric Oncology, 3584 CS Utrecht, Utrech, the Netherlands
| | - Shuyu Zheng
- Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland
| | - Johanna Eriksson
- Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland
| | - Caroline A Heckman
- Institute for Molecular Medicine Finland - FIMM, HiLIFE - Helsinki Institute of Life Science, iCAN Digital Precision Cancer Medicine Flagship, University of Helsinki, Helsinki, Finland
| | - Jing Tang
- Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland.
| |
Collapse
|
9
|
Claridge SE, Cavallo JA, Hopkins BD. Patient-Derived In Vitro and In Vivo Models of Cancer. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2022; 1361:215-233. [DOI: 10.1007/978-3-030-91836-1_12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
10
|
Out-of-distribution generalization from labelled and unlabelled gene expression data for drug response prediction. NAT MACH INTELL 2021. [DOI: 10.1038/s42256-021-00408-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
|
11
|
Nguyen LV, Caldas C. Functional genomics approaches to improve pre-clinical drug screening and biomarker discovery. EMBO Mol Med 2021; 13:e13189. [PMID: 34254730 PMCID: PMC8422077 DOI: 10.15252/emmm.202013189] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2021] [Revised: 05/23/2021] [Accepted: 06/10/2021] [Indexed: 12/13/2022] Open
Abstract
Advances in sequencing technology have enabled the genomic and transcriptomic characterization of human malignancies with unprecedented detail. However, this wealth of information has been slow to translate into clinically meaningful outcomes. Different models to study human cancers have been established and extensively characterized. Using these models, functional genomic screens and pre-clinical drug screening platforms have identified genetic dependencies that can be exploited with drug therapy. These genetic dependencies can also be used as biomarkers to predict response to treatment. For many cancers, the identification of such biomarkers remains elusive. In this review, we discuss the development and characterization of models used to study human cancers, RNA interference and CRISPR screens to identify genetic dependencies, large-scale pharmacogenomics studies and drug screening approaches to improve pre-clinical drug screening and biomarker discovery.
Collapse
Affiliation(s)
- Long V Nguyen
- Department of Oncology and Cancer Research UK Cambridge InstituteLi Ka Shing CentreUniversity of CambridgeCambridgeUK
- Cancer Research UK Cambridge Cancer CentreCambridgeUK
| | - Carlos Caldas
- Department of Oncology and Cancer Research UK Cambridge InstituteLi Ka Shing CentreUniversity of CambridgeCambridgeUK
- Cancer Research UK Cambridge Cancer CentreCambridgeUK
| |
Collapse
|
12
|
Sharifi-Noghabi H, Jahangiri-Tazehkand S, Smirnov P, Hon C, Mammoliti A, Nair SK, Mer AS, Ester M, Haibe-Kains B. Drug sensitivity prediction from cell line-based pharmacogenomics data: guidelines for developing machine learning models. Brief Bioinform 2021; 22:6348324. [PMID: 34382071 DOI: 10.1093/bib/bbab294] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2021] [Revised: 06/29/2021] [Accepted: 07/10/2021] [Indexed: 11/13/2022] Open
Abstract
The goal of precision oncology is to tailor treatment for patients individually using the genomic profile of their tumors. Pharmacogenomics datasets such as cancer cell lines are among the most valuable resources for drug sensitivity prediction, a crucial task of precision oncology. Machine learning methods have been employed to predict drug sensitivity based on the multiple omics data available for large panels of cancer cell lines. However, there are no comprehensive guidelines on how to properly train and validate such machine learning models for drug sensitivity prediction. In this paper, we introduce a set of guidelines for different aspects of training gene expression-based predictors using cell line datasets. These guidelines provide extensive analysis of the generalization of drug sensitivity predictors and challenge many current practices in the community including the choice of training dataset and measure of drug sensitivity. The application of these guidelines in future studies will enable the development of more robust preclinical biomarkers.
Collapse
Affiliation(s)
- Hossein Sharifi-Noghabi
- School of Computing Science, Simon Fraser University, Burnaby, British Columbia, Canada.,Vancouver Prostate Center, Vancouver, British Columbia, Canada.,Princess Margaret Cancer Centre, Toronto, Ontario, Canada
| | - Soheil Jahangiri-Tazehkand
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada.,Princess Margaret Cancer Centre, Toronto, Ontario, Canada.,University of Toronto, Toronto, Ontario, Canada
| | - Petr Smirnov
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada.,Princess Margaret Cancer Centre, Toronto, Ontario, Canada.,University of Toronto, Toronto, Ontario, Canada
| | - Casey Hon
- Princess Margaret Cancer Centre, Toronto, Ontario, Canada.,University of Toronto, Toronto, Ontario, Canada
| | - Anthony Mammoliti
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada.,Princess Margaret Cancer Centre, Toronto, Ontario, Canada.,University of Toronto, Toronto, Ontario, Canada
| | | | - Arvind Singh Mer
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada.,Princess Margaret Cancer Centre, Toronto, Ontario, Canada.,University of Toronto, Toronto, Ontario, Canada
| | - Martin Ester
- School of Computing Science, Simon Fraser University, Burnaby, British Columbia, Canada.,Vancouver Prostate Center, Vancouver, British Columbia, Canada
| | - Benjamin Haibe-Kains
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada.,Princess Margaret Cancer Centre, Toronto, Ontario, Canada.,Ontario Institute for Cancer Research, Toronto, Ontario, Canada.,University of Toronto, Toronto, Ontario, Canada
| |
Collapse
|
13
|
White BS, Khan SA, Mason MJ, Ammad-Ud-Din M, Potdar S, Malani D, Kuusanmäki H, Druker BJ, Heckman C, Kallioniemi O, Kurtz SE, Porkka K, Tognon CE, Tyner JW, Aittokallio T, Wennerberg K, Guinney J. Bayesian multi-source regression and monocyte-associated gene expression predict BCL-2 inhibitor resistance in acute myeloid leukemia. NPJ Precis Oncol 2021; 5:71. [PMID: 34302041 PMCID: PMC8302655 DOI: 10.1038/s41698-021-00209-9] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2020] [Accepted: 06/22/2021] [Indexed: 11/09/2022] Open
Abstract
The FDA recently approved eight targeted therapies for acute myeloid leukemia (AML), including the BCL-2 inhibitor venetoclax. Maximizing efficacy of these treatments requires refining patient selection. To this end, we analyzed two recent AML studies profiling the gene expression and ex vivo drug response of primary patient samples. We find that ex vivo samples often exhibit a general sensitivity to (any) drug exposure, independent of drug target. We observe that this "general response across drugs" (GRD) is associated with FLT3-ITD mutations, clinical response to standard induction chemotherapy, and overall survival. Further, incorporating GRD into expression-based regression models trained on one of the studies improved their performance in predicting ex vivo response in the second study, thus signifying its relevance to precision oncology efforts. We find that venetoclax response is independent of GRD but instead show that it is linked to expression of monocyte-associated genes by developing and applying a multi-source Bayesian regression approach. The method shares information across studies to robustly identify biomarkers of drug response and is broadly applicable in integrative analyses.
Collapse
Affiliation(s)
- Brian S White
- Computational Oncology, Sage Bionetworks, Seattle, WA, USA.
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA.
| | - Suleiman A Khan
- Institute for Molecular Medicine Finland (FIMM), Helsinki Institute of Life Science (HiLIFE), University of Helsinki, Helsinki, Finland
| | - Mike J Mason
- Computational Oncology, Sage Bionetworks, Seattle, WA, USA
| | - Muhammad Ammad-Ud-Din
- Institute for Molecular Medicine Finland (FIMM), Helsinki Institute of Life Science (HiLIFE), University of Helsinki, Helsinki, Finland
| | - Swapnil Potdar
- Institute for Molecular Medicine Finland (FIMM), Helsinki Institute of Life Science (HiLIFE), University of Helsinki, Helsinki, Finland
| | - Disha Malani
- Institute for Molecular Medicine Finland (FIMM), Helsinki Institute of Life Science (HiLIFE), University of Helsinki, Helsinki, Finland
| | - Heikki Kuusanmäki
- Institute for Molecular Medicine Finland (FIMM), Helsinki Institute of Life Science (HiLIFE), University of Helsinki, Helsinki, Finland
- Biotech Research & Innovation Centre (BRIC) and Novo Nordisk Foundation Center for Stem Cell Biology (DanStem), University of Copenhagen, Copenhagen, Denmark
| | - Brian J Druker
- Howard Hughes Medical Institute, Portland, OR, USA
- Division of Hematology and Medical Oncology, Knight Cancer Institute, Oregon Health & Science University, Portland, OR, USA
| | - Caroline Heckman
- Institute for Molecular Medicine Finland (FIMM), Helsinki Institute of Life Science (HiLIFE), University of Helsinki, Helsinki, Finland
| | - Olli Kallioniemi
- Institute for Molecular Medicine Finland (FIMM), Helsinki Institute of Life Science (HiLIFE), University of Helsinki, Helsinki, Finland
- Scilifelab, Karolinska Institute, Solna, Sweden
| | - Stephen E Kurtz
- Division of Hematology and Medical Oncology, Knight Cancer Institute, Oregon Health & Science University, Portland, OR, USA
| | - Kimmo Porkka
- HUS Comprehensive Cancer Center, Hematology Research Unit Helsinki and iCAN Digital Precision Cancer Center Medicine Flagship, University of Helsinki, Helsinki, Finland
| | - Cristina E Tognon
- Howard Hughes Medical Institute, Portland, OR, USA
- Division of Hematology and Medical Oncology, Knight Cancer Institute, Oregon Health & Science University, Portland, OR, USA
| | - Jeffrey W Tyner
- Division of Hematology and Medical Oncology, Knight Cancer Institute, Oregon Health & Science University, Portland, OR, USA
| | - Tero Aittokallio
- Institute for Molecular Medicine Finland (FIMM), Helsinki Institute of Life Science (HiLIFE), University of Helsinki, Helsinki, Finland
- Department of Mathematics and Statistics, University of Turku, Turku, Finland
- Department of Cancer Genetics, Institute for Cancer Research, Oslo University Hospital, Oslo, Norway
- Centre for Biostatistics and Epidemiology (OCBE), University of Oslo, Oslo, Norway
| | - Krister Wennerberg
- Institute for Molecular Medicine Finland (FIMM), Helsinki Institute of Life Science (HiLIFE), University of Helsinki, Helsinki, Finland
- Biotech Research & Innovation Centre (BRIC) and Novo Nordisk Foundation Center for Stem Cell Biology (DanStem), University of Copenhagen, Copenhagen, Denmark
| | - Justin Guinney
- Computational Oncology, Sage Bionetworks, Seattle, WA, USA
- Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA, USA
| |
Collapse
|
14
|
Jaiswal A, Gautam P, Pietilä EA, Timonen S, Nordström N, Akimov Y, Sipari N, Tanoli Z, Fleischer T, Lehti K, Wennerberg K, Aittokallio T. Multi-modal meta-analysis of cancer cell line omics profiles identifies ECHDC1 as a novel breast tumor suppressor. Mol Syst Biol 2021; 17:e9526. [PMID: 33750001 PMCID: PMC7983037 DOI: 10.15252/msb.20209526] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2020] [Revised: 02/17/2021] [Accepted: 02/19/2021] [Indexed: 12/12/2022] Open
Abstract
Molecular and functional profiling of cancer cell lines is subject to laboratory-specific experimental practices and data analysis protocols. The current challenge therefore is how to make an integrated use of the omics profiles of cancer cell lines for reliable biological discoveries. Here, we carried out a systematic analysis of nine types of data modalities using meta-analysis of 53 omics studies across 12 research laboratories for 2,018 cell lines. To account for a relatively low consistency observed for certain data modalities, we developed a robust data integration approach that identifies reproducible signals shared among multiple data modalities and studies. We demonstrated the power of the integrative analyses by identifying a novel driver gene, ECHDC1, with tumor suppressive role validated both in breast cancer cells and patient tumors. The multi-modal meta-analysis approach also identified synthetic lethal partners of cancer drivers, including a co-dependency of PTEN deficient endometrial cancer cells on RNA helicases.
Collapse
Affiliation(s)
- Alok Jaiswal
- Institute for Molecular Medicine Finland (FIMM)Helsinki Institute of Life Science (HiLIFE)University of HelsinkiHelsinkiFinland
- Present address:
The Broad Institute of MIT and HarvardCambridgeMAUSA
| | - Prson Gautam
- Institute for Molecular Medicine Finland (FIMM)Helsinki Institute of Life Science (HiLIFE)University of HelsinkiHelsinkiFinland
| | - Elina A Pietilä
- Individualized Drug Therapy, Research Programs UnitUniversity of HelsinkiHelsinkiFinland
| | - Sanna Timonen
- Institute for Molecular Medicine Finland (FIMM)Helsinki Institute of Life Science (HiLIFE)University of HelsinkiHelsinkiFinland
- Hematology Research Unit HelsinkiUniversity of Helsinki and Helsinki University Hospital Comprehensive Cancer CenterHelsinkiFinland
- Translational Immunology Research Program and Department of Clinical Chemistry and HematologyUniversity of HelsinkiHelsinkiFinland
| | - Nora Nordström
- Institute for Molecular Medicine Finland (FIMM)Helsinki Institute of Life Science (HiLIFE)University of HelsinkiHelsinkiFinland
| | - Yevhen Akimov
- Institute for Molecular Medicine Finland (FIMM)Helsinki Institute of Life Science (HiLIFE)University of HelsinkiHelsinkiFinland
| | - Nina Sipari
- Viikki Metabolomics UnitHelsinki Institute of Life Science (HiLIFE)University of HelsinkiHelsinkiFinland
| | - Ziaurrehman Tanoli
- Institute for Molecular Medicine Finland (FIMM)Helsinki Institute of Life Science (HiLIFE)University of HelsinkiHelsinkiFinland
| | - Thomas Fleischer
- Department of Cancer GeneticsInstitute for Cancer ResearchOslo University HospitalOsloNorway
| | - Kaisa Lehti
- Individualized Drug Therapy, Research Programs UnitUniversity of HelsinkiHelsinkiFinland
- Department of Microbiology, Tumor and Cell BiologyKarolinska InstitutetStockholmSweden
- Department of Biomedical Laboratory ScienceNorwegian University of Science and TechnologyTrondheimNorway
| | - Krister Wennerberg
- Institute for Molecular Medicine Finland (FIMM)Helsinki Institute of Life Science (HiLIFE)University of HelsinkiHelsinkiFinland
- Biotech Research & Innovation Centre (BRIC) and Novo Nordisk Foundation Center for Stem Cell Biology (DanStem)University of CopenhagenCopenhagenDenmark
| | - Tero Aittokallio
- Institute for Molecular Medicine Finland (FIMM)Helsinki Institute of Life Science (HiLIFE)University of HelsinkiHelsinkiFinland
- Department of Cancer GeneticsInstitute for Cancer ResearchOslo University HospitalOsloNorway
- Department of Mathematics and StatisticsUniversity of TurkuTurkuFinland
- Oslo Centre for Biostatistics and Epidemiology (OCBE)University of OsloOsloNorway
| |
Collapse
|
15
|
Yao H, Liang Q, Qian X, Wang J, Sham PC, Li MJ. Methods and resources to access mutation-dependent effects on cancer drug treatment. Brief Bioinform 2020; 21:1886-1903. [PMID: 31750520 DOI: 10.1093/bib/bbz109] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2019] [Revised: 07/31/2019] [Accepted: 08/01/2019] [Indexed: 12/13/2022] Open
Abstract
In clinical cancer treatment, genomic alterations would often affect the response of patients to anticancer drugs. Studies have shown that molecular features of tumors could be biomarkers predictive of sensitivity or resistance to anticancer agents, but the identification of actionable mutations are often constrained by the incomplete understanding of cancer genomes. Recent progresses of next-generation sequencing technology greatly facilitate the extensive molecular characterization of tumors and promote precision medicine in cancers. More and more clinical studies, cancer cell lines studies, CRISPR screening studies as well as patient-derived model studies were performed to identify potential actionable mutations predictive of drug response, which provide rich resources of molecularly and pharmacologically profiled cancer samples at different levels. Such abundance of data also enables the development of various computational models and algorithms to solve the problem of drug sensitivity prediction, biomarker identification and in silico drug prioritization by the integration of multiomics data. Here, we review the recent development of methods and resources that identifies mutation-dependent effects for cancer treatment in clinical studies, functional genomics studies and computational studies and discuss the remaining gaps and future directions in this area.
Collapse
Affiliation(s)
- Hongcheng Yao
- School of Biomedical Sciences, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China
| | - Qian Liang
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Xinyi Qian
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Junwen Wang
- Department of Health Sciences Research & Center for Individualized Medicine, Mayo Clinic, Scottsdale, USA
| | - Pak Chung Sham
- Center for Genomic Sciences, The University of Hong Kong, Hong Kong SAR, China.,Departments of Psychiatry, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Mulin Jun Li
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China.,Department of Epidemiology and Biostatistics, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin, China
| |
Collapse
|
16
|
Rahman R, Dhruba SR, Matlock K, De-Niz C, Ghosh S, Pal R. Evaluating the consistency of large-scale pharmacogenomic studies. Brief Bioinform 2020; 20:1734-1753. [PMID: 31846027 DOI: 10.1093/bib/bby046] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2018] [Revised: 05/04/2018] [Indexed: 12/21/2022] Open
Abstract
Recent years have seen an increase in the availability of pharmacogenomic databases such as Genomics of Drug Sensitivity in Cancer (GDSC) and Cancer Cell Line Encyclopedia (CCLE) that provide genomic and functional characterization information for multiple cell lines. Studies have alluded to the fact that specific characterizations may be inconsistent between different databases. Analysis of the potential discrepancies in the different databases is highly significant, as these sources are frequently used to analyze and validate methodologies for personalized cancer therapies. In this article, we review the recent developments in investigating the correspondence between different pharmacogenomics databases and discuss the potential factors that require attention when incorporating these sources in any modeling analysis. Furthermore, we explored the consistency among these databases using copulas that can capture nonlinear dependencies between two sets of data.
Collapse
Affiliation(s)
- Raziur Rahman
- Department of Electrical and Computer Engineering, Texas Tech University, Lubbock, TX 79409, USA
| | - Saugato Rahman Dhruba
- Department of Electrical and Computer Engineering, Texas Tech University, Lubbock, TX 79409, USA
| | - Kevin Matlock
- Department of Electrical and Computer Engineering, Texas Tech University, Lubbock, TX 79409, USA
| | - Carlos De-Niz
- Department of Electrical and Computer Engineering, Texas Tech University, Lubbock, TX 79409, USA
| | - Souparno Ghosh
- Department of Mathematics and Statistics, Texas Tech University, Lubbock, TX 79409, USA
| | - Ranadip Pal
- Department of Electrical and Computer Engineering, Texas Tech University, Lubbock, TX 79409, USA.,Department of Mathematics and Statistics, Texas Tech University, Lubbock, TX 79409, USA
| |
Collapse
|
17
|
Kurilov R, Haibe-Kains B, Brors B. Assessment of modelling strategies for drug response prediction in cell lines and xenografts. Sci Rep 2020; 10:2849. [PMID: 32071383 PMCID: PMC7028927 DOI: 10.1038/s41598-020-59656-2] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2018] [Accepted: 01/23/2020] [Indexed: 12/20/2022] Open
Abstract
Data from several large high-throughput drug response screens have become available to the scientific community recently. Although many efforts have been made to use this information to predict drug sensitivity, our ability to accurately predict drug response based on genetic data remains limited. In order to systematically examine how different aspects of modelling affect the resulting prediction accuracy, we built a range of models for seven drugs (erlotinib, pacliatxel, lapatinib, PLX4720, sorafenib, nutlin-3 and nilotinib) using data from the largest available cell line and xenograft drug sensitivity screens. We found that the drug response metric, the choice of the molecular data type and the number of training samples have a substantial impact on prediction accuracy. We also compared the tasks of drug response prediction with tissue type prediction and found that, unlike for drug response, tissue type can be predicted with high accuracy. Furthermore, we assessed our ability to predict drug response in four xenograft cohorts (treated either with erlotinib, gemcitabine or paclitaxel) using models trained on cell line data. We could predict response in an erlotinib-treated cohort with a moderate accuracy (correlation ≈ 0.5), but were unable to correctly predict responses in cohorts treated with gemcitabine or paclitaxel.
Collapse
Affiliation(s)
- Roman Kurilov
- Division of Applied Bioinformatics, German Cancer Research Center, Heidelberg, Germany. .,Faculty of Biosciences, Heidelberg University, Heidelberg, Germany.
| | - Benjamin Haibe-Kains
- Princess Margaret Cancer Centre, Toronto, Ontario, M5G 1L7, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, Ontario, M5G 1L7, Canada.,Department of Computer Science, University of Toronto, Toronto, Ontario, M5T 3A1, Canada.,Ontario Institute for Cancer Research, Toronto, Ontario, M5G 1L7, Canada
| | - Benedikt Brors
- Division of Applied Bioinformatics, German Cancer Research Center, Heidelberg, Germany.,National Center for Tumor Diseases (NCT), Heidelberg, Germany.,German Cancer Consortium (DKTK), Core Center Heidelberg, Heidelberg, Germany
| |
Collapse
|
18
|
Baptista D, Ferreira PG, Rocha M. Deep learning for drug response prediction in cancer. Brief Bioinform 2020; 22:360-379. [PMID: 31950132 DOI: 10.1093/bib/bbz171] [Citation(s) in RCA: 86] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2019] [Revised: 11/04/2019] [Indexed: 01/15/2023] Open
Abstract
Predicting the sensitivity of tumors to specific anti-cancer treatments is a challenge of paramount importance for precision medicine. Machine learning(ML) algorithms can be trained on high-throughput screening data to develop models that are able to predict the response of cancer cell lines and patients to novel drugs or drug combinations. Deep learning (DL) refers to a distinct class of ML algorithms that have achieved top-level performance in a variety of fields, including drug discovery. These types of models have unique characteristics that may make them more suitable for the complex task of modeling drug response based on both biological and chemical data, but the application of DL to drug response prediction has been unexplored until very recently. The few studies that have been published have shown promising results, and the use of DL for drug response prediction is beginning to attract greater interest from researchers in the field. In this article, we critically review recently published studies that have employed DL methods to predict drug response in cancer cell lines. We also provide a brief description of DL and the main types of architectures that have been used in these studies. Additionally, we present a selection of publicly available drug screening data resources that can be used to develop drug response prediction models. Finally, we also address the limitations of these approaches and provide a discussion on possible paths for further improvement. Contact: mrocha@di.uminho.pt.
Collapse
Affiliation(s)
| | | | - Miguel Rocha
- Department of Informatics and a Senior Researcher of the Centre of Biological Engineering at the University of Minho
| |
Collapse
|
19
|
Agreement between two large pan-cancer CRISPR-Cas9 gene dependency data sets. Nat Commun 2019; 10:5817. [PMID: 31862961 PMCID: PMC6925302 DOI: 10.1038/s41467-019-13805-y] [Citation(s) in RCA: 127] [Impact Index Per Article: 25.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2019] [Accepted: 11/26/2019] [Indexed: 02/07/2023] Open
Abstract
Genome-scale CRISPR-Cas9 viability screens performed in cancer cell lines provide a systematic approach to identify cancer dependencies and new therapeutic targets. As multiple large-scale screens become available, a formal assessment of the reproducibility of these experiments becomes necessary. We analyze data from recently published pan-cancer CRISPR-Cas9 screens performed at the Broad and Sanger Institutes. Despite significant differences in experimental protocols and reagents, we find that the screen results are highly concordant across multiple metrics with both common and specific dependencies jointly identified across the two studies. Furthermore, robust biomarkers of gene dependency found in one data set are recovered in the other. Through further analysis and replication experiments at each institute, we show that batch effects are driven principally by two key experimental parameters: the reagent library and the assay length. These results indicate that the Broad and Sanger CRISPR-Cas9 viability screens yield robust and reproducible findings.
Collapse
|
20
|
Dempster JM, Pacini C, Pantel S, Behan FM, Green T, Krill-Burger J, Beaver CM, Younger ST, Zhivich V, Najgebauer H, Allen F, Gonçalves E, Shepherd R, Doench JG, Yusa K, Vazquez F, Parts L, Boehm JS, Golub TR, Hahn WC, Root DE, Garnett MJ, Tsherniak A, Iorio F. Agreement between two large pan-cancer CRISPR-Cas9 gene dependency data sets. Nat Commun 2019. [PMID: 31862961 DOI: 10.1038/s41467-019-13805-y.] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
Genome-scale CRISPR-Cas9 viability screens performed in cancer cell lines provide a systematic approach to identify cancer dependencies and new therapeutic targets. As multiple large-scale screens become available, a formal assessment of the reproducibility of these experiments becomes necessary. We analyze data from recently published pan-cancer CRISPR-Cas9 screens performed at the Broad and Sanger Institutes. Despite significant differences in experimental protocols and reagents, we find that the screen results are highly concordant across multiple metrics with both common and specific dependencies jointly identified across the two studies. Furthermore, robust biomarkers of gene dependency found in one data set are recovered in the other. Through further analysis and replication experiments at each institute, we show that batch effects are driven principally by two key experimental parameters: the reagent library and the assay length. These results indicate that the Broad and Sanger CRISPR-Cas9 viability screens yield robust and reproducible findings.
Collapse
Affiliation(s)
| | - Clare Pacini
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.,Open Targets, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Sasha Pantel
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Fiona M Behan
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.,Open Targets, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Thomas Green
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | | | - Charlotte M Beaver
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Scott T Younger
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Victor Zhivich
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Hanna Najgebauer
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.,Open Targets, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Felicity Allen
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Emanuel Gonçalves
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Rebecca Shepherd
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - John G Doench
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Kosuke Yusa
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.,Stem Cell Genetics, Institute for Frontier Life and Medical Sciences, Kyoto University, Kyoto, 606-8507, Japan
| | | | - Leopold Parts
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.,Department of Computer Science, University of Tartu, 50090, Tartu, Estonia
| | - Jesse S Boehm
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Todd R Golub
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA.,Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - William C Hahn
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA.,Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - David E Root
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Mathew J Garnett
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.,Open Targets, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Aviad Tsherniak
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA.
| | - Francesco Iorio
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK. .,Open Targets, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK. .,Human Technopole, 20157, Milano, Italy.
| |
Collapse
|
21
|
Sidorov P, Naulaerts S, Ariey-Bonnet J, Pasquier E, Ballester PJ. Predicting Synergism of Cancer Drug Combinations Using NCI-ALMANAC Data. Front Chem 2019; 7:509. [PMID: 31380352 PMCID: PMC6646421 DOI: 10.3389/fchem.2019.00509] [Citation(s) in RCA: 68] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2019] [Accepted: 07/02/2019] [Indexed: 12/15/2022] Open
Abstract
Drug combinations are of great interest for cancer treatment. Unfortunately, the discovery of synergistic combinations by purely experimental means is only feasible on small sets of drugs. In silico modeling methods can substantially widen this search by providing tools able to predict which of all possible combinations in a large compound library are synergistic. Here we investigate to which extent drug combination synergy can be predicted by exploiting the largest available dataset to date (NCI-ALMANAC, with over 290,000 synergy determinations). Each cell line is modeled using primarily two machine learning techniques, Random Forest (RF) and Extreme Gradient Boosting (XGBoost), on the datasets provided by NCI-ALMANAC. This large-scale predictive modeling study comprises more than 5,000 pair-wise drug combinations, 60 cell lines, 4 types of models, and 5 types of chemical features. The application of a powerful, yet uncommonly used, RF-specific technique for reliability prediction is also investigated. The evaluation of these models shows that it is possible to predict the synergy of unseen drug combinations with high accuracy (Pearson correlations between 0.43 and 0.86 depending on the considered cell line, with XGBoost providing slightly better predictions than RF). We have also found that restricting to the most reliable synergy predictions results in at least 2-fold error decrease with respect to employing the best learning algorithm without any reliability estimation. Alkylating agents, tyrosine kinase inhibitors and topoisomerase inhibitors are the drugs whose synergy with other partner drugs are better predicted by the models. Despite its leading size, NCI-ALMANAC comprises an extremely small part of all conceivable combinations. Given their accuracy and reliability estimation, the developed models should drastically reduce the number of required in vitro tests by predicting in silico which of the considered combinations are likely to be synergistic.
Collapse
Affiliation(s)
- Pavel Sidorov
- CRCM, INSERM, Cancer Research Center of Marseille, Institut Paoli-Calmettes, Aix-Marseille Univ, CNRS, Marseille, France
| | - Stefan Naulaerts
- CRCM, INSERM, Cancer Research Center of Marseille, Institut Paoli-Calmettes, Aix-Marseille Univ, CNRS, Marseille, France
- Department of Tumor Immunology, Institut de Duve, Bruxelles, Belgium
| | - Jérémy Ariey-Bonnet
- CRCM, INSERM, Cancer Research Center of Marseille, Institut Paoli-Calmettes, Aix-Marseille Univ, CNRS, Marseille, France
| | - Eddy Pasquier
- CRCM, INSERM, Cancer Research Center of Marseille, Institut Paoli-Calmettes, Aix-Marseille Univ, CNRS, Marseille, France
| | - Pedro J. Ballester
- CRCM, INSERM, Cancer Research Center of Marseille, Institut Paoli-Calmettes, Aix-Marseille Univ, CNRS, Marseille, France
| |
Collapse
|
22
|
Shockley KR, Gupta S, Harris SF, Lahiri SN, Peddada SD. Quality Control of Quantitative High Throughput Screening Data. Front Genet 2019; 10:387. [PMID: 31143201 PMCID: PMC6520559 DOI: 10.3389/fgene.2019.00387] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2018] [Accepted: 04/10/2019] [Indexed: 01/08/2023] Open
Abstract
Quantitative high throughput screening (qHTS) experiments can generate 1000s of concentration-response profiles to screen compounds for potentially adverse effects. However, potency estimates for a single compound can vary considerably in study designs incorporating multiple concentration-response profiles for each compound. We introduce an automated quality control procedure based on analysis of variance (ANOVA) to identify and filter out compounds with multiple cluster response patterns and improve potency estimation in qHTS assays. Our approach, called Cluster Analysis by Subgroups using ANOVA (CASANOVA), clusters compound-specific response patterns into statistically supported subgroups. Applying CASANOVA to 43 publicly available qHTS data sets, we found that only about 20% of compounds with response values outside of the noise band have single cluster responses. The error rates for incorrectly separating true clusters and incorrectly clumping disparate clusters were both less than 5% in extensive simulation studies. Simulation studies also showed that the bias and variance of concentration at half-maximal response (AC50 ) estimates were usually within 10-fold when using a weighted average approach for potency estimation. In short, CASANOVA effectively sorts out compounds with "inconsistent" response patterns and produces trustworthy AC50 values.
Collapse
Affiliation(s)
- Keith R. Shockley
- Biostatistics and Computational Biology Branch, National Institute of Environmental Health Sciences, National Institutes of Health, Durham, NC, United States
| | - Shuva Gupta
- Statistics Department, University of Pennsylvania, Philadelphia, PA, United States
| | | | - Soumendra N. Lahiri
- Department of Statistics, North Carolina State University, Raleigh, NC, United States
| | - Shyamal D. Peddada
- Department of Biostatistics, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, PA, United States
| |
Collapse
|
23
|
Next-generation characterization of the Cancer Cell Line Encyclopedia. Nature 2019; 569:503-508. [PMID: 31068700 DOI: 10.1038/s41586-019-1186-3] [Citation(s) in RCA: 1757] [Impact Index Per Article: 351.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2018] [Accepted: 04/09/2019] [Indexed: 12/21/2022]
Abstract
Large panels of comprehensively characterized human cancer models, including the Cancer Cell Line Encyclopedia (CCLE), have provided a rigorous framework with which to study genetic variants, candidate targets, and small-molecule and biological therapeutics and to identify new marker-driven cancer dependencies. To improve our understanding of the molecular features that contribute to cancer phenotypes, including drug responses, here we have expanded the characterizations of cancer cell lines to include genetic, RNA splicing, DNA methylation, histone H3 modification, microRNA expression and reverse-phase protein array data for 1,072 cell lines from individuals of various lineages and ethnicities. Integration of these data with functional characterizations such as drug-sensitivity, short hairpin RNA knockdown and CRISPR-Cas9 knockout data reveals potential targets for cancer drugs and associated biomarkers. Together, this dataset and an accompanying public data portal provide a resource for the acceleration of cancer research using model cancer cell lines.
Collapse
|
24
|
Wass MN, Ray L, Michaelis M. Understanding of researcher behavior is required to improve data reliability. Gigascience 2019; 8:giz017. [PMID: 30715291 PMCID: PMC6528747 DOI: 10.1093/gigascience/giz017] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2018] [Revised: 01/20/2019] [Accepted: 01/25/2019] [Indexed: 12/18/2022] Open
Abstract
BACKGROUND A lack of data reproducibility ("reproducibility crisis") has been extensively debated across many academic disciplines. RESULTS Although a reproducibility crisis is widely perceived, conclusive data on the scale of the problem and the underlying reasons are largely lacking. The debate is primarily focused on methodological issues. However, examples such as the use of misidentified cell lines illustrate that the availability of reliable methods does not guarantee good practice. Moreover, research is often characterized by a lack of established methods. Despite the crucial importance of researcher conduct, research and conclusive data on the determinants of researcher behavior are widely missing. CONCLUSION Meta-research that establishes an understanding of the factors that determine researcher behavior is urgently needed. This knowledge can then be used to implement and iteratively improve measures that incentivize researchers to apply the highest standards, resulting in high-quality data.
Collapse
Affiliation(s)
- Mark N Wass
- Industrial Biotechnology Centre and School of Biosciences, University of Kent, Canterbury, CT2 7NJ, UK
| | - Larry Ray
- School of Social Policy, Sociology and Social Research, University of Kent, Canterbury, CT2 7NJ, UK
| | - Martin Michaelis
- Industrial Biotechnology Centre and School of Biosciences, University of Kent, Canterbury, CT2 7NJ, UK
| |
Collapse
|
25
|
Fernández-Torras A, Duran-Frigola M, Aloy P. Encircling the regions of the pharmacogenomic landscape that determine drug response. Genome Med 2019; 11:17. [PMID: 30914058 PMCID: PMC6436215 DOI: 10.1186/s13073-019-0626-x] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2018] [Accepted: 03/05/2019] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND The integration of large-scale drug sensitivity screens and genome-wide experiments is changing the field of pharmacogenomics, revealing molecular determinants of drug response without the need for previous knowledge about drug action. In particular, transcriptional signatures of drug sensitivity may guide drug repositioning, prioritize drug combinations, and point to new therapeutic biomarkers. However, the inherent complexity of transcriptional signatures, with thousands of differentially expressed genes, makes them hard to interpret, thus giving poor mechanistic insights and hampering translation to clinics. METHODS To simplify drug signatures, we have developed a network-based methodology to identify functionally coherent gene modules. Our strategy starts with the calculation of drug-gene correlations and is followed by a pathway-oriented filtering and a network-diffusion analysis across the interactome. RESULTS We apply our approach to 189 drugs tested in 671 cancer cell lines and observe a connection between gene expression levels of the modules and mechanisms of action of the drugs. Further, we characterize multiple aspects of the modules, including their functional categories, tissue-specificity, and prevalence in clinics. Finally, we prove the predictive capability of the modules and demonstrate how they can be used as gene sets in conventional enrichment analyses. CONCLUSIONS Network biology strategies like module detection are able to digest the outcome of large-scale pharmacogenomic initiatives, thereby contributing to their interpretability and improving the characterization of the drugs screened.
Collapse
Affiliation(s)
- Adrià Fernández-Torras
- Joint IRB-BSC-CRG Program in Computational Biology, Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Catalonia, Spain
| | - Miquel Duran-Frigola
- Joint IRB-BSC-CRG Program in Computational Biology, Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Catalonia, Spain.
| | - Patrick Aloy
- Joint IRB-BSC-CRG Program in Computational Biology, Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Catalonia, Spain.
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Catalonia, Spain.
| |
Collapse
|
26
|
Ling A, Gruener RF, Fessler J, Huang RS. More than fishing for a cure: The promises and pitfalls of high throughput cancer cell line screens. Pharmacol Ther 2018; 191:178-189. [PMID: 29953899 PMCID: PMC7001883 DOI: 10.1016/j.pharmthera.2018.06.014] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
High-throughput screens in cancer cell lines (CCLs) have been used for decades to help researchers identify compounds with the potential to improve the treatment of cancer and, more recently, to identify genomic susceptibilities in cancer via genome-wide shRNA and CRISPR/Cas9 screens. Additionally, rich genomic and transcriptomic data of these CCLs has allowed researchers to pair this screening data with biological features, enabling efforts to identify biomarkers of treatment response and gene dependencies. In this paper, we review the major CCL screening efforts and the large datasets these screens have made available. We also assess the CCL screens collectively and include a resource with harmonized CCL and compound identifiers to facilitate comparisons across screens. The CCLs in these screens were found to represent a wide range of cancer types, with a strong correlation between the representation of a cancer type and its associated mortality. Patient ages and gender distributions of CCLs were generally as expected, with some notable exceptions of female underrepresentation in certain disease types. Also, ethnicity information, while largely incomplete, suggests that African American and Hispanic patients may be severely underrepresented in these screens. Nearly all genes were targeted in the genetic perturbations screens, but the compounds used for the drug screens target less than half of known cancer drivers, likely reflecting known limitations in our drug design capabilities. Finally, we discuss recent developments in the field and the promise they hold for enabling future screens to overcome previous limitations and lead to new breakthroughs in cancer treatment.
Collapse
Affiliation(s)
- Alexander Ling
- Committee on Cancer Biology, University of Chicago, Chicago, IL, United States; Department of Experimental and Clinical Pharmacology, University of Minnesota, Minneapolis, MN, United States
| | - Robert F Gruener
- Committee on Cancer Biology, University of Chicago, Chicago, IL, United States; Ben May Department for Cancer Research, University of Chicago, Chicago, IL, United States
| | - Jessica Fessler
- Committee on Cancer Biology, University of Chicago, Chicago, IL, United States; Department of Pathology, University of Chicago, Chicago, IL, United States
| | - R Stephanie Huang
- Department of Experimental and Clinical Pharmacology, University of Minnesota, Minneapolis, MN, United States.
| |
Collapse
|
27
|
Kalamara A, Tobalina L, Saez-Rodriguez J. How to find the right drug for each patient? Advances and challenges in pharmacogenomics. CURRENT OPINION IN SYSTEMS BIOLOGY 2018; 10:53-62. [PMID: 31763498 PMCID: PMC6855262 DOI: 10.1016/j.coisb.2018.07.001] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Cancer is a highly heterogeneous disease with complex underlying biology. For these reasons, effective cancer treatment is still a challenge. Nowadays, it is clear that a cancer therapy that fits all the cases cannot be found, and as a result the design of therapies tailored to the patient's molecular characteristics is needed. Pharmacogenomics aims to study the relationship between an individual's genotype and drug response. Scientists use different biological models, ranging from cell lines to mouse models, as proxies for patients for preclinical and translational studies. The rapid development of "-omics" technologies is increasing the amount of features that can be measured in these models, expanding the possibilities of finding predictive biomarkers of drug response. Finding these relationships requires diverse computational approaches ranging from machine learning to dynamic modeling. Despite major advances, we are still far from being able to precisely predict drug efficacy in cancer models, let alone directly on patients. We believe that the new experimental techniques and computational approaches covered in this review will bring us closer to this goal.
Collapse
Affiliation(s)
- Angeliki Kalamara
- RWTH Aachen University, Faculty of Medicine, Joint Research Centre for Computational Biomedicine, Aachen, Germany
| | - Luis Tobalina
- RWTH Aachen University, Faculty of Medicine, Joint Research Centre for Computational Biomedicine, Aachen, Germany
| | - Julio Saez-Rodriguez
- RWTH Aachen University, Faculty of Medicine, Joint Research Centre for Computational Biomedicine, Aachen, Germany
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, UK
- Heidelberg University, Faculty of Medicine, Institute of Computational Biomedicine, Heidelberg, Germany
| |
Collapse
|
28
|
PATRI, a Genomics Data Integration Tool for Biomarker Discovery. BIOMED RESEARCH INTERNATIONAL 2018; 2018:2012078. [PMID: 30065933 PMCID: PMC6051285 DOI: 10.1155/2018/2012078] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/20/2018] [Accepted: 05/29/2018] [Indexed: 12/31/2022]
Abstract
The availability of genomic datasets in association with clinical, phenotypic, and drug sensitivity information represents an invaluable source for potential therapeutic applications, supporting the identification of new drug sensitivity biomarkers and pharmacological targets. Drug discovery and precision oncology can largely benefit from the integration of treatment molecular discriminants obtained from cell line models and clinical tumor samples; however this task demands comprehensive analysis approaches for the discovery of underlying data connections. Here we introduce PATRI (Platform for the Analysis of TRanslational Integrated data), a standalone tool accessible through a user-friendly graphical interface, conceived for the identification of treatment sensitivity biomarkers from user-provided genomics data, associated with information on sample characteristics. PATRI streamlines a translational analysis workflow: first, baseline genomics signatures are statistically identified, differentiating treatment sensitive from resistant preclinical models; then, these signatures are used for the prediction of treatment sensitivity in clinical samples, via random forest categorization of clinical genomics datasets and statistical evaluation of the relative phenotypic features. The same workflow can also be applied across distinct clinical datasets. The ease of use of the PATRI tool is illustrated with validation analysis examples, performed with sensitivity data for drug treatments with known molecular discriminants.
Collapse
|
29
|
Gene and MicroRNA Perturbations of Cellular Response to Pemetrexed Implicate Biological Networks and Enable Imputation of Response in Lung Adenocarcinoma. Sci Rep 2018; 8:733. [PMID: 29335598 PMCID: PMC5768793 DOI: 10.1038/s41598-017-19004-3] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2017] [Accepted: 12/20/2017] [Indexed: 12/18/2022] Open
Abstract
Pemetrexed is indicated for non-small cell lung carcinoma and mesothelioma, but often has limited efficacy due to drug resistance. To probe the molecular mechanisms underlying chemotherapeutic response, we performed mRNA and microRNA (miRNA) expression profiling of pemetrexed treated and untreated lymphoblastoid cell lines (LCLs) and applied a hierarchical Bayesian method. We identified genetic variation associated with gene expression in human lung tissue for the most significant differentially expressed genes (Benjamini-Hochberg [BH] adjusted p < 0.05) using the Genotype-Tissue Expression data and found evidence for their clinical relevance using integrated molecular profiling and lung adenocarcinoma survival data from The Cancer Genome Atlas project. We identified 39 miRNAs with significant differential expression (BH adjusted p < 0.05) in LCLs. We developed a gene expression based imputation model of drug sensitivity, quantified its prediction performance, and found a significant correlation of the imputed phenotype generated from expression data with survival time in lung adenocarcinoma patients. Differentially expressed genes (MTHFD2 and SUFU) that are putative targets of differentially expressed miRNAs also showed differential perturbation in A549 fusion lung tumor cells with further replication in A549 cells. Our study suggests pemetrexed may be used in combination with agents that target miRNAs to increase its cytotoxicity.
Collapse
|
30
|
Geeleher P, Zhang Z, Wang F, Gruener RF, Nath A, Morrison G, Bhutra S, Grossman RL, Huang RS. Discovering novel pharmacogenomic biomarkers by imputing drug response in cancer patients from large genomics studies. Genome Res 2017; 27:1743-1751. [PMID: 28847918 PMCID: PMC5630037 DOI: 10.1101/gr.221077.117] [Citation(s) in RCA: 79] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2017] [Accepted: 08/03/2017] [Indexed: 12/20/2022]
Abstract
Obtaining accurate drug response data in large cohorts of cancer patients is very challenging; thus, most cancer pharmacogenomics discovery is conducted in preclinical studies, typically using cell lines and mouse models. However, these platforms suffer from serious limitations, including small sample sizes. Here, we have developed a novel computational method that allows us to impute drug response in very large clinical cancer genomics data sets, such as The Cancer Genome Atlas (TCGA). The approach works by creating statistical models relating gene expression to drug response in large panels of cancer cell lines and applying these models to tumor gene expression data in the clinical data sets (e.g., TCGA). This yields an imputed drug response for every drug in each patient. These imputed drug response data are then associated with somatic genetic variants measured in the clinical cohort, such as copy number changes or mutations in protein coding genes. These analyses recapitulated drug associations for known clinically actionable somatic genetic alterations and identified new predictive biomarkers for existing drugs.
Collapse
Affiliation(s)
- Paul Geeleher
- Section of Hematology/Oncology, The University of Chicago, Chicago, Illinois 60637, USA
| | - Zhenyu Zhang
- Center for Data Intensive Science, The University of Chicago, Chicago, Illinois 60637, USA
| | - Fan Wang
- Section of Hematology/Oncology, The University of Chicago, Chicago, Illinois 60637, USA
| | - Robert F Gruener
- Section of Hematology/Oncology, The University of Chicago, Chicago, Illinois 60637, USA
| | - Aritro Nath
- Section of Hematology/Oncology, The University of Chicago, Chicago, Illinois 60637, USA
| | - Gladys Morrison
- Section of Hematology/Oncology, The University of Chicago, Chicago, Illinois 60637, USA
| | - Steven Bhutra
- Section of Hematology/Oncology, The University of Chicago, Chicago, Illinois 60637, USA
| | - Robert L Grossman
- Center for Data Intensive Science, The University of Chicago, Chicago, Illinois 60637, USA
| | - R Stephanie Huang
- Section of Hematology/Oncology, The University of Chicago, Chicago, Illinois 60637, USA
| |
Collapse
|
31
|
Williams SP, McDermott U. The Pursuit of Therapeutic Biomarkers with High-Throughput Cancer Cell Drug Screens. Cell Chem Biol 2017; 24:1066-1074. [PMID: 28736238 DOI: 10.1016/j.chembiol.2017.06.011] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2017] [Revised: 03/09/2017] [Accepted: 06/01/2017] [Indexed: 12/14/2022]
Abstract
In the last decade we have witnessed tremendous advances in our understanding of the landscape of the molecular alterations that underpin many of the most prevalent cancers, in the use of automated high-throughput platforms for high-throughput drug screens in cancer cells, in the creation of more clinically relevant cancer cell models, and lastly in the development of more useful computational approaches in the pursuit of biomarkers of drug response. Separately, each of these improvements will undoubtedly lead to improvements in the treatment of cancer patients but to fulfill the promise of truly personalized cancer medicine, we must bring these disciplines together in a truly multidisciplinary fashion.
Collapse
Affiliation(s)
- Steven P Williams
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
| | - Ultan McDermott
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK.
| |
Collapse
|
32
|
|
33
|
Geeleher P, Nath A, Huang RS. Institutional Profile: Pharmacogenomic research in R Stephanie Huang Laboratory. Pharmacogenomics 2017; 18:519-522. [PMID: 28290771 DOI: 10.2217/pgs-2017-0031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
The Huang Lab was established in 2009 at the University of Chicago and has since been active in conducting pharmacogenomic research. Our laboratory's main research focus is translational pharmacogenomics with a particular interest in the pharmacogenomics of anticancer agents. By systematically evaluating the human genome and its relationships to drug response and toxicity, our goal is to develop clinically useful models that predict risk for adverse drug reactions and nonresponse prior to administration of chemotherapy. Specifically, the theme of our research evolved around the idea of cell-based pharmacogenomics, which utilizes in vitro models for biomarker discovery and prediction-model construction, followed by in vivo validation. We routinely use cell lines (derived from healthy and diseased individuals as well as commercially available cancer cell lines) and clinical samples to discover and functionally characterize genetic variation and gene, miRNA, and long noncoding RNA expression for their roles in drug sensitivity.
Collapse
Affiliation(s)
- Paul Geeleher
- Section of Hematology/Oncology, The University of Chicago, 900 E 57th Street, KCBD Room 7148, Chicago, IL 60637, USA
| | - Aritro Nath
- Section of Hematology/Oncology, The University of Chicago, 900 E 57th Street, KCBD Room 7148, Chicago, IL 60637, USA
| | - Rong Stephanie Huang
- Section of Hematology/Oncology, The University of Chicago, 900 E 57th Street, KCBD Room 7148, Chicago, IL 60637, USA
| |
Collapse
|
34
|
Safikhani Z, El-Hachem N, Smirnov P, Freeman M, Goldenberg A, Birkbak NJ, Beck AH, Aerts HJWL, Quackenbush J, Haibe-Kains B. Safikhani et al. reply. Nature 2016; 540:E2-E4. [PMID: 27905430 DOI: 10.1038/nature19839] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Affiliation(s)
- Zhaleh Safikhani
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario M5G 2M9, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, Ontario M5G 1L7, Canada
| | - Nehme El-Hachem
- Institut de recherches cliniques de Montréal, Montreal, Quebec H2W 1R7, Canada
| | - Petr Smirnov
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario M5G 2M9, Canada
| | - Mark Freeman
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario M5G 2M9, Canada
| | - Anna Goldenberg
- Hospital for Sick Children, Toronto, Ontario M5G 1X8, Canada.,Department of Computer Science, University of Toronto, Toronto, Ontario M5S 2E4, Canada
| | - Nicolai J Birkbak
- The Francis Crick Institute, University College London, London NW1 1AT, UK.University College London Cancer Institute, London, WC1E 6BT, UK
| | - Andrew H Beck
- Beth Israel Deaconess Medical Center, Boston, Massachusetts 02215, USA.,Harvard Medical School, Boston, Massachusetts 02115, USA
| | - Hugo J W L Aerts
- Harvard Medical School, Boston, Massachusetts 02115, USA.,Dana-Farber Cancer Institute, Boston, Massachusetts 02115, USA.,Brigham and Women's Hospital, Boston, Massachusetts 02115, USA
| | - John Quackenbush
- Dana-Farber Cancer Institute, Boston, Massachusetts 02115, USA.,Harvard School of Public Health, Boston, Massachusetts 02115, USA
| | - Benjamin Haibe-Kains
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario M5G 2M9, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, Ontario M5G 1L7, Canada.,Department of Computer Science, University of Toronto, Toronto, Ontario M5S 2E4, Canada.,Ontario Institute of Cancer Research, Toronto, Ontario M5G 1L7, Canada
| |
Collapse
|
35
|
Safikhani Z, Smirnov P, Freeman M, El-Hachem N, She A, Rene Q, Goldenberg A, Birkbak NJ, Hatzis C, Shi L, Beck AH, Aerts HJ, Quackenbush J, Haibe-Kains B. Revisiting inconsistency in large pharmacogenomic studies. F1000Res 2016; 5:2333. [PMID: 28928933 PMCID: PMC5580432 DOI: 10.12688/f1000research.9611.3] [Citation(s) in RCA: 35] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 08/11/2017] [Indexed: 01/30/2023] Open
Abstract
In 2013, we published a comparative analysis of mutation and gene expression profiles and drug sensitivity measurements for 15 drugs characterized in the 471 cancer cell lines screened in the Genomics of Drug Sensitivity in Cancer (GDSC) and Cancer Cell Line Encyclopedia (CCLE). While we found good concordance in gene expression profiles, there was substantial inconsistency in the drug responses reported by the GDSC and CCLE projects. We received extensive feedback on the comparisons that we performed. This feedback, along with the release of new data, prompted us to revisit our initial analysis. We present a new analysis using these expanded data, where we address the most significant suggestions for improvements on our published analysis - that targeted therapies and broad cytotoxic drugs should have been treated differently in assessing consistency, that consistency of both molecular profiles and drug sensitivity measurements should be compared across cell lines, and that the software analysis tools provided should have been easier to run, particularly as the GDSC and CCLE released additional data. Our re-analysis supports our previous finding that gene expression data are significantly more consistent than drug sensitivity measurements. Using new statistics to assess data consistency allowed identification of two broad effect drugs and three targeted drugs with moderate to good consistency in drug sensitivity data between GDSC and CCLE. For three other targeted drugs, there were not enough sensitive cell lines to assess the consistency of the pharmacological profiles. We found evidence of inconsistencies in pharmacological phenotypes for the remaining eight drugs. Overall, our findings suggest that the drug sensitivity data in GDSC and CCLE continue to present challenges for robust biomarker discovery. This re-analysis provides additional support for the argument that experimental standardization and validation of pharmacogenomic response will be necessary to advance the broad use of large pharmacogenomic screens.
Collapse
Affiliation(s)
- Zhaleh Safikhani
- Department of Medical Biophysics, University of Toronto, Toronto, M5G 1L7, Canada
- Princess Margaret Cancer Centre, University Health Network, Toronto, M5G 1L7, Canada
| | - Petr Smirnov
- Princess Margaret Cancer Centre, University Health Network, Toronto, M5G 1L7, Canada
| | - Mark Freeman
- Princess Margaret Cancer Centre, University Health Network, Toronto, M5G 1L7, Canada
| | - Nehme El-Hachem
- Institut de Recherches Cliniques de Montréal, Montréal, H2W 1R7, Canada
| | - Adrian She
- Princess Margaret Cancer Centre, University Health Network, Toronto, M5G 1L7, Canada
| | - Quevedo Rene
- Department of Medical Biophysics, University of Toronto, Toronto, M5G 1L7, Canada
- Princess Margaret Cancer Centre, University Health Network, Toronto, M5G 1L7, Canada
| | - Anna Goldenberg
- Department of Computer Science, University of Toronto, Toronto, M5S 2E4, Canada
- Hospital for Sick Children, Toronto, M5G 1X8, Canada
| | | | - Christos Hatzis
- Yale Cancer Center, Yale University, New Haven, CT, 06510, USA
- Section of Medical Oncology, Yale University School of Medicine, New Haven, CT, 06520, USA
| | - Leming Shi
- University of Arkansas for Medical Sciences, Little Rock, AR, 72205, USA
- Fudan University, Shanghai City, 200135, China
| | - Andrew H. Beck
- Department of Pathology, Beth Israel Deaconess Medical Center and Harvard Medical School, Boston, MA, 02215, USA
| | - Hugo J.W.L. Aerts
- Department of Biostatistics and Computational Biology and Center for Cancer Computational Biology, Boston, MA, 02215, USA
- Department of Radiation Oncology and Radiology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, 02215, USA
| | - John Quackenbush
- Department of Biostatistics and Computational Biology and Center for Cancer Computational Biology, Boston, MA, 02215, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Benjamin Haibe-Kains
- Department of Medical Biophysics, University of Toronto, Toronto, M5G 1L7, Canada
- Princess Margaret Cancer Centre, University Health Network, Toronto, M5G 1L7, Canada
- Department of Computer Science, University of Toronto, Toronto, M5S 2E4, Canada
- Ontario Institute of Cancer Research, Toronto, M5G 1L7, Canada
| |
Collapse
|
36
|
Safikhani Z, Smirnov P, Freeman M, El-Hachem N, She A, Rene Q, Goldenberg A, Birkbak NJ, Hatzis C, Shi L, Beck AH, Aerts HJ, Quackenbush J, Haibe-Kains B. Revisiting inconsistency in large pharmacogenomic studies. F1000Res 2016; 5:2333. [PMID: 28928933 PMCID: PMC5580432 DOI: 10.12688/f1000research.9611.1] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 09/15/2016] [Indexed: 01/22/2023] Open
Abstract
In 2013, we published a comparative analysis mutation and gene expression profiles and drug sensitivity measurements for 15 drugs characterized in the 471 cancer cell lines screened in the Genomics of Drug Sensitivity in Cancer (GDSC) and Cancer Cell Line Encyclopedia (CCLE). While we found good concordance in gene expression profiles, there was substantial inconsistency in the drug responses reported by the GDSC and CCLE projects. We received extensive feedback on the comparisons that we performed. This feedback, along with the release of new data, prompted us to revisit our initial analysis. Here we present a new analysis using these expanded data in which we address the most significant suggestions for improvements on our published analysis - that targeted therapies and broad cytotoxic drugs should have been treated differently in assessing consistency, that consistency of both molecular profiles and drug sensitivity measurements should both be compared across cell lines, and that the software analysis tools we provided should have been easier to run, particularly as the GDSC and CCLE released additional data. Our re-analysis supports our previous finding that gene expression data are significantly more consistent than drug sensitivity measurements. The use of new statistics to assess data consistency allowed us to identify two broad effect drugs and three targeted drugs with moderate to good consistency in drug sensitivity data between GDSC and CCLE. For three other targeted drugs, there were not enough sensitive cell lines to assess the consistency of the pharmacological profiles. We found evidence of inconsistencies in pharmacological phenotypes for the remaining eight drugs. Overall, our findings suggest that the drug sensitivity data in GDSC and CCLE continue to present challenges for robust biomarker discovery. This re-analysis provides additional support for the argument that experimental standardization and validation of pharmacogenomic response will be necessary to advance the broad use of large pharmacogenomic screens.
Collapse
Affiliation(s)
- Zhaleh Safikhani
- Department of Medical Biophysics, University of Toronto, Toronto, M5G 1L7, Canada
- Princess Margaret Cancer Centre, University Health Network, Toronto, M5G 1L7, Canada
| | - Petr Smirnov
- Princess Margaret Cancer Centre, University Health Network, Toronto, M5G 1L7, Canada
| | - Mark Freeman
- Princess Margaret Cancer Centre, University Health Network, Toronto, M5G 1L7, Canada
| | - Nehme El-Hachem
- Institut de Recherches Cliniques de Montréal, Montréal, H2W 1R7, Canada
| | - Adrian She
- Princess Margaret Cancer Centre, University Health Network, Toronto, M5G 1L7, Canada
| | - Quevedo Rene
- Department of Medical Biophysics, University of Toronto, Toronto, M5G 1L7, Canada
- Princess Margaret Cancer Centre, University Health Network, Toronto, M5G 1L7, Canada
| | - Anna Goldenberg
- Department of Computer Science, University of Toronto, Toronto, M5S 2E4, Canada
- Hospital for Sick Children, Toronto, M5G 1X8, Canada
| | | | - Christos Hatzis
- Yale Cancer Center, Yale University, New Haven, CT, 06510, USA
- Section of Medical Oncology, Yale University School of Medicine, New Haven, CT, 06520, USA
| | - Leming Shi
- University of Arkansas for Medical Sciences, Little Rock, AR, 72205, USA
- Fudan University, Shanghai City, 200135, China
| | - Andrew H. Beck
- Department of Pathology, Beth Israel Deaconess Medical Center and Harvard Medical School, Boston, MA, 02215, USA
| | - Hugo J.W.L. Aerts
- Department of Biostatistics and Computational Biology and Center for Cancer Computational Biology, Boston, MA, 02215, USA
- Department of Radiation Oncology and Radiology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, 02215, USA
| | - John Quackenbush
- Department of Biostatistics and Computational Biology and Center for Cancer Computational Biology, Boston, MA, 02215, USA
- Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Benjamin Haibe-Kains
- Department of Medical Biophysics, University of Toronto, Toronto, M5G 1L7, Canada
- Princess Margaret Cancer Centre, University Health Network, Toronto, M5G 1L7, Canada
- Department of Computer Science, University of Toronto, Toronto, M5S 2E4, Canada
- Ontario Institute of Cancer Research, Toronto, M5G 1L7, Canada
| |
Collapse
|