1
|
Liu X, Zhang W. A subcomponent-guided deep learning method for interpretable cancer drug response prediction. PLoS Comput Biol 2023; 19:e1011382. [PMID: 37603576 PMCID: PMC10470940 DOI: 10.1371/journal.pcbi.1011382] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Revised: 08/31/2023] [Accepted: 07/24/2023] [Indexed: 08/23/2023] Open
Abstract
Accurate prediction of cancer drug response (CDR) is a longstanding challenge in modern oncology that underpins personalized treatment. Current computational methods implement CDR prediction by modeling responses between entire drugs and cell lines, without the consideration that response outcomes may primarily attribute to a few finer-level 'subcomponents', such as privileged substructures of the drug or gene signatures of the cancer cell, thus producing predictions that are hard to explain. Herein, we present SubCDR, a subcomponent-guided deep learning method for interpretable CDR prediction, to recognize the most relevant subcomponents driving response outcomes. Technically, SubCDR is built upon a line of deep neural networks that enables a set of functional subcomponents to be extracted from each drug and cell line profile, and breaks the CDR prediction down to identifying pairwise interactions between subcomponents. Such a subcomponent interaction form can offer a traceable path to explicitly indicate which subcomponents contribute more to the response outcome. We verify the superiority of SubCDR over state-of-the-art CDR prediction methods through extensive computational experiments on the GDSC dataset. Crucially, we found many predicted cases that demonstrate the strength of SubCDR in finding the key subcomponents driving responses and exploiting these subcomponents to discover new therapeutic drugs. These results suggest that SubCDR will be highly useful for biomedical researchers, particularly in anti-cancer drug design.
Collapse
Affiliation(s)
- Xuan Liu
- College of Informatics, Huazhong Agricultural University, Wuhan, China
| | - Wen Zhang
- College of Informatics, Huazhong Agricultural University, Wuhan, China
| |
Collapse
|
2
|
Trapotsi MA, Hosseini-Gerami L, Bender A. Computational analyses of mechanism of action (MoA): data, methods and integration. RSC Chem Biol 2022; 3:170-200. [PMID: 35360890 PMCID: PMC8827085 DOI: 10.1039/d1cb00069a] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Accepted: 12/09/2021] [Indexed: 12/15/2022] Open
Abstract
The elucidation of a compound's Mechanism of Action (MoA) is a challenging task in the drug discovery process, but it is important in order to rationalise phenotypic findings and to anticipate potential side-effects. Bioinformatic approaches, advances in machine learning techniques and the increasing deposition of high-throughput data in public databases have significantly contributed to recent advances in the field, but it is not straightforward to decide which data and methods are most suitable to use in a given case. In this review, we focus on these methods and data and their applications in generating MoA hypotheses for subsequent experimental validation. We discuss compound-specific data such as -omics, cell morphology and bioactivity data, as well as commonly used supplementary prior knowledge such as network and pathway data, and provide information on databases where this data can be accessed. In terms of methodologies, we discuss both well-established methods (connectivity mapping, pathway enrichment) as well as more developing methods (neural networks and multi-omics integration). Finally, we review case studies where the MoA of a compound was successfully suggested from computational analysis by incorporating multiple data modalities and/or methodologies. Our aim for this review is to provide researchers with insights into the benefits and drawbacks of both the data and methods in terms of level of understanding, biases and interpretation - and to highlight future avenues of investigation which we foresee will improve the field of MoA elucidation, including greater public access to -omics data and methodologies which are capable of data integration.
Collapse
Affiliation(s)
- Maria-Anna Trapotsi
- Centre for Molecular Informatics, Yusuf Hamied Department of Chemistry, University of Cambridge UK
| | - Layla Hosseini-Gerami
- Centre for Molecular Informatics, Yusuf Hamied Department of Chemistry, University of Cambridge UK
| | - Andreas Bender
- Centre for Molecular Informatics, Yusuf Hamied Department of Chemistry, University of Cambridge UK
| |
Collapse
|
3
|
Ferreira FS, Mihalik A, Adams RA, Ashburner J, Mourao-Miranda J. A hierarchical Bayesian model to find brain-behaviour associations in incomplete data sets. Neuroimage 2021; 249:118854. [PMID: 34971767 PMCID: PMC8861855 DOI: 10.1016/j.neuroimage.2021.118854] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2021] [Revised: 11/30/2021] [Accepted: 12/22/2021] [Indexed: 12/02/2022] Open
Abstract
Canonical Correlation Analysis (CCA) and its regularised versions have been widely used in the neuroimaging community to uncover multivariate associations between two data modalities (e.g., brain imaging and behaviour). However, these methods have inherent limitations: (1) statistical inferences about the associations are often not robust; (2) the associations within each data modality are not modelled; (3) missing values need to be imputed or removed. Group Factor Analysis (GFA) is a hierarchical model that addresses the first two limitations by providing Bayesian inference and modelling modality-specific associations. Here, we propose an extension of GFA that handles missing data, and highlight that GFA can be used as a predictive model. We applied GFA to synthetic and real data consisting of brain connectivity and non-imaging measures from the Human Connectome Project (HCP). In synthetic data, GFA uncovered the underlying shared and specific factors and predicted correctly the non-observed data modalities in complete and incomplete data sets. In the HCP data, we identified four relevant shared factors, capturing associations between mood, alcohol and drug use, cognition, demographics and psychopathological measures and the default mode, frontoparietal control, dorsal and ventral networks and insula, as well as two factors describing associations within brain connectivity. In addition, GFA predicted a set of non-imaging measures from brain connectivity. These findings were consistent in complete and incomplete data sets, and replicated previous findings in the literature. GFA is a promising tool that can be used to uncover associations between and within multiple data modalities in benchmark datasets (such as, HCP), and easily extended to more complex models to solve more challenging tasks.
Collapse
Affiliation(s)
- Fabio S Ferreira
- Centre for Medical Image Computing, Department of Computer Science, University College London, London, UK; Max Planck University College London Centre for Computational Psychiatry and Ageing Research, University College London, UK.
| | - Agoston Mihalik
- Centre for Medical Image Computing, Department of Computer Science, University College London, London, UK; Max Planck University College London Centre for Computational Psychiatry and Ageing Research, University College London, UK
| | - Rick A Adams
- Centre for Medical Image Computing, Department of Computer Science, University College London, London, UK; Max Planck University College London Centre for Computational Psychiatry and Ageing Research, University College London, UK; Wellcome Centre for Human Neuroimaging, University College London, London, UK
| | - John Ashburner
- Wellcome Centre for Human Neuroimaging, University College London, London, UK
| | - Janaina Mourao-Miranda
- Centre for Medical Image Computing, Department of Computer Science, University College London, London, UK; Max Planck University College London Centre for Computational Psychiatry and Ageing Research, University College London, UK
| |
Collapse
|
4
|
An X, Chen X, Yi D, Li H, Guan Y. Representation of molecules for drug response prediction. Brief Bioinform 2021; 23:6375515. [PMID: 34571534 DOI: 10.1093/bib/bbab393] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2021] [Revised: 08/28/2021] [Accepted: 08/30/2021] [Indexed: 12/18/2022] Open
Abstract
The rapid development of machine learning and deep learning algorithms in the recent decade has spurred an outburst of their applications in many research fields. In the chemistry domain, machine learning has been widely used to aid in drug screening, drug toxicity prediction, quantitative structure-activity relationship prediction, anti-cancer synergy score prediction, etc. This review is dedicated to the application of machine learning in drug response prediction. Specifically, we focus on molecular representations, which is a crucial element to the success of drug response prediction and other chemistry-related prediction tasks. We introduce three types of commonly used molecular representation methods, together with their implementation and application examples. This review will serve as a brief introduction of the broad field of molecular representations.
Collapse
Affiliation(s)
- Xin An
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Xi Chen
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Daiyao Yi
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Hongyang Li
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Yuanfang Guan
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| |
Collapse
|
5
|
Coffin AB, Boney R, Hill J, Tian C, Steyger PS. Detecting Novel Ototoxins and Potentiation of Ototoxicity by Disease Settings. Front Neurol 2021; 12:725566. [PMID: 34489859 PMCID: PMC8418111 DOI: 10.3389/fneur.2021.725566] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Accepted: 07/22/2021] [Indexed: 12/24/2022] Open
Abstract
Over 100 drugs and chemicals are associated with permanent hearing loss, tinnitus, and vestibular deficits, collectively known as ototoxicity. The ototoxic potential of drugs is rarely assessed in pre-clinical drug development or during clinical trials, so this debilitating side-effect is often discovered as patients begin to report hearing loss. Furthermore, drug-induced ototoxicity in adults, and particularly in elderly patients, may go unrecognized due to hearing loss from a variety of etiologies because of a lack of baseline assessments immediately prior to novel therapeutic treatment. During the current pandemic, there is an intense effort to identify new drugs or repurpose FDA-approved drugs to treat COVID-19. Several potential COVID-19 therapeutics are known ototoxins, including chloroquine (CQ) and lopinavir-ritonavir, demonstrating the necessity to identify ototoxic potential in existing and novel medicines. Furthermore, several factors are emerging as potentiators of ototoxicity, such as inflammation (a hallmark of COVID-19), genetic polymorphisms, and ototoxic synergy with co-therapeutics, increasing the necessity to evaluate a drug's potential to induce ototoxicity under varying conditions. Here, we review the potential of COVID-19 therapies to induce ototoxicity and factors that may compound their ototoxic effects. We then discuss two models for rapidly detecting the potential for ototoxicity: mammalian auditory cell lines and the larval zebrafish lateral line. These models offer considerable value for pre-clinical drug development, including development of COVID-19 therapies. Finally, we show the validity of in silico screening for ototoxic potential using a computational model that compares structural similarity of compounds of interest with a database of known ototoxins and non-ototoxins. Preclinical screening at in silico, in vitro, and in vivo levels can provide an earlier indication of the potential for ototoxicity and identify the subset of candidate therapeutics for treating COVID-19 that need to be monitored for ototoxicity as for other widely-used clinical therapeutics, like aminoglycosides and cisplatin.
Collapse
Affiliation(s)
| | | | - Jordan Hill
- Washington State University Vancouver, Vancouver, WA, United States
| | - Cong Tian
- Department of Biomedical Sciences, School of Medicine, Creighton University, Omaha, NE, United States
| | - Peter S. Steyger
- Department of Biomedical Sciences, School of Medicine, Creighton University, Omaha, NE, United States
- National Center for Rehabilitative Auditory Research, Portland, OR, United States
| |
Collapse
|
6
|
Sudhakar P, Verstockt B, Cremer J, Verstockt S, Sabino J, Ferrante M, Vermeire S. Understanding the Molecular Drivers of Disease Heterogeneity in Crohn's Disease Using Multi-omic Data Integration and Network Analysis. Inflamm Bowel Dis 2021; 27:870-886. [PMID: 33313682 PMCID: PMC8128416 DOI: 10.1093/ibd/izaa281] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/10/2020] [Indexed: 12/12/2022]
Abstract
Crohn's disease (CD), a form of inflammatory bowel disease (IBD), is characterized by heterogeneity along multiple clinical axes, which in turn impacts disease progression and treatment modalities. Using advanced data integration approaches and systems biology tools, we studied the contribution of CD susceptibility variants and gene expression in distinct peripheral immune cell subsets (CD14+ monocytes and CD4+ T cells) to relevant clinical traits. Our analyses revealed that most clinical traits capturing CD heterogeneity could be associated with CD14+ and CD4+ gene expression rather than disease susceptibility variants. By disentangling the sources of variation, we identified molecular features that could potentially be driving the heterogeneity of various clinical traits of CD patients. Further downstream analyses identified contextual hub proteins such as genes encoding barrier functions, antimicrobial peptides, chemokines, and their receptors, which are either targeted by drugs used in CD or other inflammatory diseases or are relevant to the biological functions implicated in disease pathology. These hubs could be used as cell type-specific targets to treat specific subtypes of CD patients in a more individualized approach based on the underlying biology driving their disease subtypes. Our study highlights the importance of data integration and systems approaches to investigate complex and heterogeneous diseases such as IBD.
Collapse
Affiliation(s)
- Padhmanand Sudhakar
- Department of Chronic Diseases, Metabolism and Ageing, Translational Research Center for Gastrointestinal Disorders (TARGID)
| | - Bram Verstockt
- Department of Chronic Diseases, Metabolism and Ageing, Translational Research Center for Gastrointestinal Disorders (TARGID)
- University Hospitals Leuven, Department of Gastroenterology and Hepatology
| | - Jonathan Cremer
- Department of Microbiology and Immunology, Laboratory of Clinical Immunology, Katholieke Universiteit Leuven, Leuven, Belgium
| | - Sare Verstockt
- Department of Chronic Diseases, Metabolism and Ageing, Translational Research Center for Gastrointestinal Disorders (TARGID)
| | - João Sabino
- Department of Chronic Diseases, Metabolism and Ageing, Translational Research Center for Gastrointestinal Disorders (TARGID)
- University Hospitals Leuven, Department of Gastroenterology and Hepatology
| | - Marc Ferrante
- Department of Chronic Diseases, Metabolism and Ageing, Translational Research Center for Gastrointestinal Disorders (TARGID)
- University Hospitals Leuven, Department of Gastroenterology and Hepatology
| | - Séverine Vermeire
- Department of Chronic Diseases, Metabolism and Ageing, Translational Research Center for Gastrointestinal Disorders (TARGID)
- University Hospitals Leuven, Department of Gastroenterology and Hepatology
| |
Collapse
|
7
|
Malik A, Khan MI, Karbasian H, Nieminen M, Ammad-Ud-Din M, Khan S. Modelling Public Sentiments about Juul Flavors on Twitter through Machine Learning. Nicotine Tob Res 2021; 23:1869-1879. [PMID: 33991191 DOI: 10.1093/ntr/ntab098] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2020] [Accepted: 05/10/2021] [Indexed: 11/14/2022]
Abstract
INTRODUCTION The availability of a variety of e-cigarettes flavors is one of the frequently cited reasons for their adoption. An active stream of discussion about flavoring can be observed online. Analyzing these real-time conversations offers nuanced insights into key factors related to the adoption of flavors, subsequently supporting public health interventions. METHODS Google's BERT, a state-of-the-art deep learning method was employed to model the first sentiment corpus on JUUL flavors. BERT, which is pre-trained with the complete English Wikipedia was fine-tuned by integrating a classification model, with human labeled Tweets, as training data. A collection of 30,075 Tweets about JUUL flavors was classified into positive and negative sentiments. Finally, using topic models, we identify and grouped thematic areas into positive and negative Tweets. RESULTS With an average of 89% cross-validation precision for classifying tweets, the finetuned BERT model classified 24,114 Tweets as positive and 5,961 Tweets as negative. Through the topic modeling approach 10 thematic topics were identified from the predicted positive and negative sentiments expressed in the Tweets. CONCLUSIONS JUUL flavors, notably mango, mint, and cucumber, provoke overwhelmingly positive sentiments indicating a strong likeness due to favoarble taste and odor. Negative discourse about JUUL flavors revolve around addictiveness, high nicotine content, and youth targeted marketing. IMPLICATIONS Limiting the content related to flavors and positive perceptions on social media is necessary to minimize exposure to youth. The novel methodology used in this study may be adopted to monitor e-cigarette discourse periodically, as well as other critical public health phenomena online.
Collapse
Affiliation(s)
- Aqdas Malik
- Department of Computer Science, Aalto University, Konemiehintie, Espoo, Finland Sultan Qaboos University, Muscat, Oman
| | - Muhammad Irfan Khan
- Department of Computer Science, Arcada University of Applied Sciences, Helsinki, Finland
| | - Habib Karbasian
- Department of Information Sciences & Technology, George Mason University, Fairfax, VA, United States
| | - Marko Nieminen
- Department of Computer Science, Aalto University, Espoo, Finland
| | - Muhammad Ammad-Ud-Din
- Helsinki Research Center, Europe Cloud Service Competence Center Huawei Technologies Oy (Finland) Co. Ltd., Helsinki, Finland
| | - Suleiman Khan
- FIMM Institute of Molecular Medicine, Helsinki University, Helsinki, Finland
| |
Collapse
|
8
|
Liu X, Zhang H, Xue Q, Pan W, Zhang A. In silico health effect prioritization of environmental chemicals through transcriptomics data exploration from a chemo-centric view. THE SCIENCE OF THE TOTAL ENVIRONMENT 2021; 762:143082. [PMID: 33143927 DOI: 10.1016/j.scitotenv.2020.143082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/09/2020] [Revised: 10/11/2020] [Accepted: 10/11/2020] [Indexed: 06/11/2023]
Abstract
With the explosive growth of synthetic compounds, the health effects caused by exogenous chemical exposure have attracted more and more public attention. The prediction of health effect is a never-ending story. Collective resource of transcriptomics data offers an opportunity to understand and identify the multiple health effects of small molecule. Inspired by the fact that environmental chemicals of high health risk frequently share both similar gene expression profile and common structural feature of certain drugs, we here propose a novel computational effect prioritization method for environmental chemicals through transcriptomics data exploration from a chemo-centric view. Specifically, non-negative matrix factorization (NMF) method has been adopted to get the association network linking structural features with transcriptomics characteristics of drugs with specific effects. The model yields 13 pivotal types of effects, so-called components, that represent drug categories with common chemo- and geno- type features. Moreover, the established model effectively prioritizes potential toxic effects for the external chemicals from the endocrine disruptor screening program (EDSP) for their potential estrogenicity and other verified risks. Even if only the highest priority is set for the estrogenic effect, the precision and recall can reach 0.76 and 0.77 respectively for these chemicals. Our effort provides a successful endeavor as to profile potential toxic effects simultaneously for environmental chemicals using both chemical and omics data.
Collapse
Affiliation(s)
- Xian Liu
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China; College of Resources and Environment, University of Chinese Academy of Sciences, Beijing 100190, PR China.
| | - Huazhou Zhang
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China; College of Resources and Environment, University of Chinese Academy of Sciences, Beijing 100190, PR China.
| | - Qiao Xue
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China.
| | - Wenxiao Pan
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China.
| | - Aiqian Zhang
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China; College of Resources and Environment, University of Chinese Academy of Sciences, Beijing 100190, PR China; Institute of Environment and Health, Jianghan University, Wuhan 430056, PR China.
| |
Collapse
|
9
|
Chang S, Chen JY, Chuang YJ, Chen BS. Systems Approach to Pathogenic Mechanism of Type 2 Diabetes and Drug Discovery Design Based on Deep Learning and Drug Design Specifications. Int J Mol Sci 2020; 22:ijms22010166. [PMID: 33375269 PMCID: PMC7795239 DOI: 10.3390/ijms22010166] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Revised: 12/21/2020] [Accepted: 12/21/2020] [Indexed: 12/16/2022] Open
Abstract
In this study, we proposed a systems biology approach to investigate the pathogenic mechanism for identifying significant biomarkers as drug targets and a systematic drug discovery strategy to design a potential multiple-molecule targeting drug for type 2 diabetes (T2D) treatment. We first integrated databases to construct the genome-wide genetic and epigenetic networks (GWGENs), which consist of protein–protein interaction networks (PPINs) and gene regulatory networks (GRNs) for T2D and non-T2D (health), respectively. Second, the relevant “real GWGENs” are identified by system identification and system order detection methods performed on the T2D and non-T2D RNA-seq data. To simplify network analysis, principal network projection (PNP) was thereby exploited to extract core GWGENs from real GWGENs. Then, with the help of KEGG pathway annotation, core signaling pathways were constructed to identify significant biomarkers. Furthermore, in order to discover potential drugs for the selected pathogenic biomarkers (i.e., drug targets) from the core signaling pathways, not only did we train a deep neural network (DNN)-based drug–target interaction (DTI) model to predict candidate drug’s binding with the identified biomarkers but also considered a set of design specifications, including drug regulation ability, toxicity, sensitivity, and side effects to sieve out promising drugs suitable for T2D.
Collapse
Affiliation(s)
- Shen Chang
- Laboratory of Automatic Control, Signal Processing and Systems Biology, Department of Electrical Engineering, National Tsing Hua University, Hsinchu 30013, Taiwan; (S.C.); (J.-Y.C.)
| | - Jian-You Chen
- Laboratory of Automatic Control, Signal Processing and Systems Biology, Department of Electrical Engineering, National Tsing Hua University, Hsinchu 30013, Taiwan; (S.C.); (J.-Y.C.)
| | - Yung-Jen Chuang
- Institute of Bioinformatics and Structural Biology, National Tsing Hua University, Hsinchu 30013, Taiwan;
| | - Bor-Sen Chen
- Laboratory of Automatic Control, Signal Processing and Systems Biology, Department of Electrical Engineering, National Tsing Hua University, Hsinchu 30013, Taiwan; (S.C.); (J.-Y.C.)
- Correspondence:
| |
Collapse
|
10
|
Kibble M, Khan SA, Ammad-ud-din M, Bollepalli S, Palviainen T, Kaprio J, Pietiläinen KH, Ollikainen M. An integrative machine learning approach to discovering multi-level molecular mechanisms of obesity using data from monozygotic twin pairs. ROYAL SOCIETY OPEN SCIENCE 2020; 7:200872. [PMID: 33204460 PMCID: PMC7657920 DOI: 10.1098/rsos.200872] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/10/2020] [Accepted: 09/29/2020] [Indexed: 05/19/2023]
Abstract
We combined clinical, cytokine, genomic, methylation and dietary data from 43 young adult monozygotic twin pairs (aged 22-36 years, 53% female), where 25 of the twin pairs were substantially weight discordant (delta body mass index > 3 kg m-2). These measurements were originally taken as part of the TwinFat study, a substudy of The Finnish Twin Cohort study. These five large multivariate datasets (comprising 42, 71, 1587, 1605 and 63 variables, respectively) were jointly analysed using an integrative machine learning method called group factor analysis (GFA) to offer new hypotheses into the multi-molecular-level interactions associated with the development of obesity. New potential links between cytokines and weight gain are identified, as well as associations between dietary, inflammatory and epigenetic factors. This encouraging case study aims to enthuse the research community to boldly attempt new machine learning approaches which have the potential to yield novel and unintuitive hypotheses. The source code of the GFA method is publically available as the R package GFA.
Collapse
Affiliation(s)
- Milla Kibble
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK
- Author for correspondence: Milla Kibble e-mail:
| | - Suleiman A. Khan
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland
| | - Muhammad Ammad-ud-din
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland
| | - Sailalitha Bollepalli
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland
| | - Teemu Palviainen
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland
| | - Jaakko Kaprio
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland
- Department of Public Health, University of Helsinki, Helsinki, Finland
| | - Kirsi H. Pietiläinen
- Obesity Research Unit, Helsinki University Central Hospital and University of Helsinki, Helsinki, Finland
| | - Miina Ollikainen
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland
| |
Collapse
|
11
|
Chang S, Wang LHC, Chen BS. Investigating Core Signaling Pathways of Hepatitis B Virus Pathogenesis for Biomarkers Identification and Drug Discovery via Systems Biology and Deep Learning Method. Biomedicines 2020; 8:biomedicines8090320. [PMID: 32878239 PMCID: PMC7555687 DOI: 10.3390/biomedicines8090320] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2020] [Revised: 08/19/2020] [Accepted: 08/21/2020] [Indexed: 12/17/2022] Open
Abstract
Hepatitis B Virus (HBV) infection is a major cause of morbidity and mortality worldwide. However, poor understanding of its pathogenesis often gives rise to intractable immune escape and prognosis recurrence. Thus, a valid systematic approach based on big data mining and genome-wide RNA-seq data is imperative to further investigate the pathogenetic mechanism and identify biomarkers for drug design. In this study, systems biology method was applied to trim false positives from the host/pathogen genetic and epigenetic interaction network (HPI-GEN) under HBV infection by two-side RNA-seq data. Then, via the principal network projection (PNP) approach and the annotation of KEGG (Kyoto Encyclopedia of Genes and Genomes) pathways, significant biomarkers related to cellular dysfunctions were identified from the core cross-talk signaling pathways as drug targets. Further, based on the pre-trained deep learning-based drug-target interaction (DTI) model and the validated pharmacological properties from databases, i.e., drug regulation ability, toxicity, and sensitivity, a combination of promising multi-target drugs was designed as a multiple-molecule drug to create more possibility for the treatment of HBV infection. Therefore, with the proposed systems medicine discovery and repositioning procedure, we not only shed light on the etiologic mechanism during HBV infection but also efficiently provided a potential drug combination for therapeutic treatment of Hepatitis B.
Collapse
Affiliation(s)
- Shen Chang
- Laboratory of Automatic Control, Signal Processing and Systems Biology, Department of Electrical Engineering, National Tsing Hua University, Hsinchu 30013, Taiwan;
| | - Lily Hui-Ching Wang
- Institute of Molecular and Cellular Biology, National Tsing Hua University, Hsinchu 30013, Taiwan;
| | - Bor-Sen Chen
- Laboratory of Automatic Control, Signal Processing and Systems Biology, Department of Electrical Engineering, National Tsing Hua University, Hsinchu 30013, Taiwan;
- Correspondence:
| |
Collapse
|
12
|
Du Y, Li K, Wang X, Kaushik AC, Junaid M, Wei D. Identification of chlorprothixene as a potential drug that induces apoptosis and autophagic cell death in acute myeloid leukemia cells. FEBS J 2019; 287:1645-1665. [PMID: 31625692 DOI: 10.1111/febs.15102] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2019] [Revised: 07/16/2019] [Accepted: 10/16/2019] [Indexed: 01/07/2023]
Abstract
Although acute myeloid leukemia (AML) is a highly heterogeneous malignance, the common molecular mechanisms shared by different AML subtypes play critical roles in AML development. It is possible to identify new drugs that are effective for various AML subtypes based on the common molecular mechanisms. Therefore, we developed a hypothesis-driven bioinformatic drug screening framework by integrating multiple omics data. In this study, we identified that chlorprothixene, a dopamine receptor antagonist, could effectively inhibit growth of AML cells from different subtypes. RNA-seq analysis suggested that chlorprothixene perturbed a series of crucial biological processes such as cell cycle, apoptosis, and autophagy in AML cells. Further investigations indicated that chlorprothixene could induce both apoptosis and autophagy in AML cells, and apoptosis and autophagy could act as partners to induce cell death cooperatively. Remarkably, chlorprothixene was found to inhibit tumor growth and induce in situ leukemic cell apoptosis in the murine xenograft model. Furthermore, chlorprothixene treatment could reduce the level of oncofusion proteins PML-RARα and AML1-ETO, thus elevate the expression of apoptosis-related genes, and lead to AML cell death. Our results provided new insights for drug repositioning of AML therapy and confirmed that chlorprothixene might be a potential candidate for treatment of different subtypes of AML by reducing expression of oncofusion proteins. DATABASE: RNA-seq data are available in GEO database under the accession number GSE124316.
Collapse
Affiliation(s)
- Yuxin Du
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, China.,State Key Laboratory of Medical Genomics, Shanghai Institute of Hematology, Rui-Jin Hospital, Shanghai Jiao Tong University, China
| | - Kening Li
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, China.,State Key Laboratory of Medical Genomics, Shanghai Institute of Hematology, Rui-Jin Hospital, Shanghai Jiao Tong University, China
| | - Xiangeng Wang
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, China
| | - Aman Chandra Kaushik
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, China
| | - Muhammad Junaid
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, China
| | - Dongqing Wei
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, China
| |
Collapse
|
13
|
Olaru OT, Zanfirescu A, Nitulescu GM, Nitulescu G, Dinu-Pirvu CE, Anuta V, Tsatsakis A, Spandidos DA, Margina D, Seremet OC. Predictive power of the Triticum root elongation test for the assessment of novel anti‑proliferative therapies. Int J Mol Med 2019; 44:16-24. [PMID: 31115484 PMCID: PMC6559296 DOI: 10.3892/ijmm.2019.4192] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2019] [Accepted: 05/02/2019] [Indexed: 02/03/2023] Open
Abstract
The use of alternative techniques to reduce the number of animals used in anticancer research is an issue of current interest. The aim of this study was to validate the use of a simple and efficient alternative tool for the assessment of the potential of novel anti‑proliferative agents. A set of 20 compounds with various mechanisms were tested in the Triticum aestivum root elongation assay, using aminophylline as negative control. Hierarchical cluster analyses were performed using the furthest neighbor method based on Euclidean distance measure, and the compounds were statistically analyzed in reference to their anti‑proliferative pattern registered in the NCI60 human tumor cell line anticancer drug screen. A correlation between the Triticum test results and the NCI60 anti‑proliferative profile was made for a number of human cells that we defined as the Triticum cell panel. Linear equations were computed that can be used to transform the inhibitory effect measured in any future Triticum assay in order to predict the effect on particular human cells. Of the tested anti‑proliferative agents, methotrexate, colchicine, cantharidin, cisplatin and verapamil produced a growth inhibition over 50%. On the whole, the findings of this study suggest that the Triticum test can be used to detect several types of anti‑proliferative mechanisms, particularly those targeting tubulin, rendering it a useful tool with which to identify novel mitotic spindle inhibitors.
Collapse
Affiliation(s)
- Octavian Tudorel Olaru
- Faculty of Pharmacy, 'Carol Davila' University of Medicine and Pharmacy, 020956 Bucharest, Romania
| | - Anca Zanfirescu
- Faculty of Pharmacy, 'Carol Davila' University of Medicine and Pharmacy, 020956 Bucharest, Romania
| | - George Mihai Nitulescu
- Faculty of Pharmacy, 'Carol Davila' University of Medicine and Pharmacy, 020956 Bucharest, Romania
| | - Georgiana Nitulescu
- Faculty of Pharmacy, 'Carol Davila' University of Medicine and Pharmacy, 020956 Bucharest, Romania
| | | | - Valentina Anuta
- Faculty of Pharmacy, 'Carol Davila' University of Medicine and Pharmacy, 020956 Bucharest, Romania
| | - Aristidis Tsatsakis
- Department of Forensic Sciences and Toxicology, Faculty of Medicine, University of Crete, 71003 Heraklion, Greece
| | - Demetrios A Spandidos
- Laboratory of Clinical Virology, School of Medicine, University of Crete, 71003 Heraklion, Greece
| | - Denisa Margina
- Faculty of Pharmacy, 'Carol Davila' University of Medicine and Pharmacy, 020956 Bucharest, Romania
| | - Oana Cristina Seremet
- Faculty of Pharmacy, 'Carol Davila' University of Medicine and Pharmacy, 020956 Bucharest, Romania
| |
Collapse
|
14
|
Mizuno T, Kinoshita S, Ito T, Maedera S, Kusuhara H. Development of Orthogonal Linear Separation Analysis (OLSA) to Decompose Drug Effects into Basic Components. Sci Rep 2019; 9:1824. [PMID: 30755704 PMCID: PMC6372619 DOI: 10.1038/s41598-019-38528-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2018] [Accepted: 12/27/2018] [Indexed: 01/06/2023] Open
Abstract
Drugs have multiple, not single, effects. Decomposition of drug effects into basic components helps us to understand the pharmacological properties of a drug and contributes to drug discovery. We have extended factor analysis and developed a novel profile data analysis method: orthogonal linear separation analysis (OLSA). OLSA contracted 11,911 genes to 118 factors from transcriptome data of MCF7 cells treated with 318 compounds in a Connectivity Map. Ontology of the main genes constituting the factors detected significant enrichment of the ontology in 65 of 118 factors and similar results were obtained in two other data sets. In further analysis of the Connectivity Map data set, one factor discriminated two Hsp90 inhibitors, geldanamycin and radicicol, while clustering analysis could not. Doxorubicin and other topoisomerase inhibitors were estimated to inhibit Na+/K+ ATPase, one of the suggested mechanisms of doxorubicin-induced cardiotoxicity. Based on the factor including PI3K/AKT/mTORC1 inhibition activity, 5 compounds were predicted to be novel inducers of autophagy, and other analyses including western blotting revealed that 4 of the 5 actually induced autophagy. These findings indicate the potential of OLSA to decompose the effects of a drug and identify its basic components.
Collapse
Affiliation(s)
- Tadahaya Mizuno
- Graduate School of Pharmaceutical Sciences, the University of Tokyo, Bunkyo-ku, Tokyo, 113-0033, Japan.
| | - Setsuo Kinoshita
- Graduate School of Pharmaceutical Sciences, the University of Tokyo, Bunkyo-ku, Tokyo, 113-0033, Japan
- ProMedico Co., Ltd., Ota-ku, Tokyo, 143-0023, Japan
| | - Takuya Ito
- Graduate School of Pharmaceutical Sciences, the University of Tokyo, Bunkyo-ku, Tokyo, 113-0033, Japan
| | - Shotaro Maedera
- Graduate School of Pharmaceutical Sciences, the University of Tokyo, Bunkyo-ku, Tokyo, 113-0033, Japan
| | - Hiroyuki Kusuhara
- Graduate School of Pharmaceutical Sciences, the University of Tokyo, Bunkyo-ku, Tokyo, 113-0033, Japan.
| |
Collapse
|
15
|
Argelaguet R, Velten B, Arnol D, Dietrich S, Zenz T, Marioni JC, Buettner F, Huber W, Stegle O. Multi-Omics Factor Analysis-a framework for unsupervised integration of multi-omics data sets. Mol Syst Biol 2018; 14:e8124. [PMID: 29925568 PMCID: PMC6010767 DOI: 10.15252/msb.20178124] [Citation(s) in RCA: 481] [Impact Index Per Article: 80.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2017] [Revised: 05/28/2018] [Accepted: 05/29/2018] [Indexed: 12/19/2022] Open
Abstract
Multi-omics studies promise the improved characterization of biological processes across molecular layers. However, methods for the unsupervised integration of the resulting heterogeneous data sets are lacking. We present Multi-Omics Factor Analysis (MOFA), a computational method for discovering the principal sources of variation in multi-omics data sets. MOFA infers a set of (hidden) factors that capture biological and technical sources of variability. It disentangles axes of heterogeneity that are shared across multiple modalities and those specific to individual data modalities. The learnt factors enable a variety of downstream analyses, including identification of sample subgroups, data imputation and the detection of outlier samples. We applied MOFA to a cohort of 200 patient samples of chronic lymphocytic leukaemia, profiled for somatic mutations, RNA expression, DNA methylation and ex vivo drug responses. MOFA identified major dimensions of disease heterogeneity, including immunoglobulin heavy-chain variable region status, trisomy of chromosome 12 and previously underappreciated drivers, such as response to oxidative stress. In a second application, we used MOFA to analyse single-cell multi-omics data, identifying coordinated transcriptional and epigenetic changes along cell differentiation.
Collapse
Affiliation(s)
- Ricard Argelaguet
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, UK
| | - Britta Velten
- European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Damien Arnol
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, UK
| | | | - Thorsten Zenz
- Heidelberg University Hospital, Heidelberg, Germany
- German Cancer Research Center (dkfz) and National Center for Tumor Diseases (NCT), Heidelberg, Germany
- Germany & Hematology, University Hospital Zurich and University of Zurich, Zurich, Switzerland
| | - John C Marioni
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, UK
- Cancer Research UK Cambridge Institute, University of Cambridge, Cambridge, UK
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, UK
| | - Florian Buettner
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, UK
- Helmholtz Zentrum München-German Research Center for Environmental Health, Institute of Computational Biology, Neuherberg, Germany
| | - Wolfgang Huber
- European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Oliver Stegle
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, UK
- European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| |
Collapse
|
16
|
Musa A, Ghoraie LS, Zhang SD, Glazko G, Yli-Harja O, Dehmer M, Haibe-Kains B, Emmert-Streib F. A review of connectivity map and computational approaches in pharmacogenomics. Brief Bioinform 2018; 19:506-523. [PMID: 28069634 PMCID: PMC5952941 DOI: 10.1093/bib/bbw112] [Citation(s) in RCA: 93] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Large-scale perturbation databases, such as Connectivity Map (CMap) or Library of Integrated Network-based Cellular Signatures (LINCS), provide enormous opportunities for computational pharmacogenomics and drug design. A reason for this is that in contrast to classical pharmacology focusing at one target at a time, the transcriptomics profiles provided by CMap and LINCS open the door for systems biology approaches on the pathway and network level. In this article, we provide a review of recent developments in computational pharmacogenomics with respect to CMap and LINCS and related applications.
Collapse
Affiliation(s)
- Aliyu Musa
- Predictive Medicine and Analytics Lab, Department of Signal Processing, Tampere University of Technology, Tampere, Finland
| | - Laleh Soltan Ghoraie
- Bioinformatics and Computational Genomics Laboratory, Princess Margaret Cancer Center, University Health Network, Toronto, ON, Canada
| | - Shu-Dong Zhang
- Northern Ireland Centre for Stratified Medicine, Biomedical Sciences Research Institute, University of Ulster, C-TRIC Building, Altnagelvin Area Hospital, Glenshane Road, Derry/Londonderry, Northern Ireland, UK
| | - Galina Glazko
- University of Rochester Department of Biostatistics and Computational Biology, Rochester, New York, USA
| | - Olli Yli-Harja
- Computational Systems Biology, Department of Signal Processing, Tampere University of Technology, Tampere, Finland
| | - Matthias Dehmer
- Institute for Bioinformatics and Translational Research, UMIT- The Health and Life Sciences University, Eduard Wallnoefer Zentrum 1, Hall in Tyrol, Austria
| | - Benjamin Haibe-Kains
- Bioinformatics and Computational Genomics Laboratory, Princess Margaret Cancer Center, University Health Network, Toronto, ON, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Ontario Institute of Cancer Research, Toronto, ON, Canada
| | - Frank Emmert-Streib
- Predictive Medicine and Analytics Lab, Department of Signal Processing, Tampere University of Technology, Tampere, Finland
| |
Collapse
|
17
|
Musa A, Ghoraie LS, Zhang SD, Glazko G, Yli-Harja O, Dehmer M, Haibe-Kains B, Emmert-Streib F. A review of connectivity map and computational approaches in pharmacogenomics. Brief Bioinform 2018. [PMID: 28069634 DOI: 10.1093/bib] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/14/2023] Open
Abstract
Large-scale perturbation databases, such as Connectivity Map (CMap) or Library of Integrated Network-based Cellular Signatures (LINCS), provide enormous opportunities for computational pharmacogenomics and drug design. A reason for this is that in contrast to classical pharmacology focusing at one target at a time, the transcriptomics profiles provided by CMap and LINCS open the door for systems biology approaches on the pathway and network level. In this article, we provide a review of recent developments in computational pharmacogenomics with respect to CMap and LINCS and related applications.
Collapse
Affiliation(s)
- Aliyu Musa
- Predictive Medicine and Analytics Lab, Department of Signal Processing, Tampere University of Technology, Tampere, Finland
| | - Laleh Soltan Ghoraie
- Bioinformatics and Computational Genomics Laboratory, Princess Margaret Cancer Center, University Health Network, Toronto, ON, Canada
| | - Shu-Dong Zhang
- Northern Ireland Centre for Stratified Medicine, Biomedical Sciences Research Institute, University of Ulster, C-TRIC Building, Altnagelvin Area Hospital, Glenshane Road, Derry/Londonderry BT47 6SB, Northern Ireland, UK
| | - Galina Glazko
- University of Rochester Department of Biostatistics and Computational Biology, Rochester, New York 14642, USA
| | - Olli Yli-Harja
- Computational Systems Biology, Department of Signal Processing, Tampere University of Technology, Tampere, Finland
| | - Matthias Dehmer
- Institute for Bioinformatics and Translational Research, UMIT- The Health and Life Sciences University, Eduard Wallnoefer Zentrum 1, 6060 Hall in Tyrol, Austria
| | - Benjamin Haibe-Kains
- Bioinformatics and Computational Genomics Laboratory, Princess Margaret Cancer Center, University Health Network, Toronto, ON, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Ontario Institute of Cancer Research, Toronto, ON, Canada
| | - Frank Emmert-Streib
- Predictive Medicine and Analytics Lab, Department of Signal Processing, Tampere University of Technology, Tampere, Finland
| |
Collapse
|
18
|
Hodos R, Zhang P, Lee HC, Duan Q, Wang Z, Clark NR, Ma'ayan A, Wang F, Kidd B, Hu J, Sontag D, Dudley J. Cell-specific prediction and application of drug-induced gene expression profiles. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2018; 23:32-43. [PMID: 29218867 PMCID: PMC5753597] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Gene expression profiling of in vitro drug perturbations is useful for many biomedical discovery applications including drug repurposing and elucidation of drug mechanisms. However, limited data availability across cell types has hindered our capacity to leverage or explore the cell-specificity of these perturbations. While recent efforts have generated a large number of drug perturbation profiles across a variety of human cell types, many gaps remain in this combinatorial drug-cell space. Hence, we asked whether it is possible to fill these gaps by predicting cell-specific drug perturbation profiles using available expression data from related conditions--i.e. from other drugs and cell types. We developed a computational framework that first arranges existing profiles into a three-dimensional array (or tensor) indexed by drugs, genes, and cell types, and then uses either local (nearest-neighbors) or global (tensor completion) information to predict unmeasured profiles. We evaluate prediction accuracy using a variety of metrics, and find that the two methods have complementary performance, each superior in different regions in the drug-cell space. Predictions achieve correlations of 0.68 with true values, and maintain accurate differentially expressed genes (AUC 0.81). Finally, we demonstrate that the predicted profiles add value for making downstream associations with drug targets and therapeutic classes.
Collapse
Affiliation(s)
- Rachel Hodos
- Institute for Next Generation Healthcare, Icahn School of Medicine at Mount Sinai (ISMMS), New York, NY, 10065; New York, USA, ²Department of Genetics and Genomic Sciences, ISMMS, New York, NY, 10029; New York, USA, ³Courant Institute of Mathematical Sciences, New York University, New York, NY, 10012; New York, USA
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
19
|
Sirci F, Napolitano F, Pisonero-Vaquero S, Carrella D, Medina DL, di Bernardo D. Comparing structural and transcriptional drug networks reveals signatures of drug activity and toxicity in transcriptional responses. NPJ Syst Biol Appl 2017; 3:23. [PMID: 28861278 PMCID: PMC5572457 DOI: 10.1038/s41540-017-0022-3] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2017] [Revised: 06/27/2017] [Accepted: 07/07/2017] [Indexed: 02/07/2023] Open
Abstract
We performed an integrated analysis of drug chemical structures and drug-induced transcriptional responses. We demonstrated that a network representing three-dimensional structural similarities among 5452 compounds can be used to automatically group together drugs with similar scaffolds, physicochemical parameters and mode-of-action. We compared the structural network to a network representing transcriptional similarities among a subset of 1309 drugs for which transcriptional response were available in the Connectivity Map data set. Analysis of structurally similar, but transcriptionally different drugs sharing the same MOA enabled us to detect and remove weak and noisy transcriptional responses, greatly enhancing the reliability of transcription-based approaches to drug discovery and drug repositioning. Cardiac glycosides exhibited the strongest transcriptional responses with a significant induction of pathways related to epigenetic regulation, which suggests an epigenetic mechanism of action for these drugs. Drug classes with the weakest transcriptional responses tended to induce expression of cytochrome P450 enzymes, hinting at drug-induced drug resistance. Analysis of transcriptionally similar, but structurally different drugs with unrelated MOA, led us to the identification of a 'toxic' transcriptional signature indicative of lysosomal stress (lysosomotropism) and lipid accumulation (phospholipidosis) partially masking the target-specific transcriptional effects of these drugs. We found that this transcriptional signature is shared by 258 compounds and it is associated to the activation of the transcription factor TFEB, a master regulator of lysosomal biogenesis and autophagy. Finally, we built a predictive Random Forest model of these 258 compounds based on 128 physicochemical parameters, which should help in the early identification of potentially toxic drug candidates. Transcriptional responses to drug treatment can reveal mechanism of action and off-target effects thus enabling drug repositioning, but only if measured in the appropriate cells at clinically relevant concentrations. A team led by Diego di Bernardo and Diego Medina generated a network representing structural similarities among compounds to automatically group together drugs with similar scaffolds and MOA. By comparing the structural drug network with a transcriptional drug network based on similarities in transcriptional response, the team observed broad differences between the two. This observation led to the identification of a transcriptional signature related lysosomal stress and phospholipidosis, and a physicochemical model to identify such compounds. These results provide general guidelines to prevent erroneous conclusion when using transcriptional responses of small molecules for drug discovery and drug repositioning
Collapse
Affiliation(s)
- Francesco Sirci
- Telethon Institute of Genetics and Medicine (TIGEM), System Biology and Bioinformatics lab. and High Content Screening facility, Via Campi Flegrei 34, 80078 Pozzuoli (NA), Italy
| | - Francesco Napolitano
- Telethon Institute of Genetics and Medicine (TIGEM), System Biology and Bioinformatics lab. and High Content Screening facility, Via Campi Flegrei 34, 80078 Pozzuoli (NA), Italy
| | - Sandra Pisonero-Vaquero
- Telethon Institute of Genetics and Medicine (TIGEM), System Biology and Bioinformatics lab. and High Content Screening facility, Via Campi Flegrei 34, 80078 Pozzuoli (NA), Italy
| | - Diego Carrella
- Telethon Institute of Genetics and Medicine (TIGEM), System Biology and Bioinformatics lab. and High Content Screening facility, Via Campi Flegrei 34, 80078 Pozzuoli (NA), Italy
| | - Diego L Medina
- Telethon Institute of Genetics and Medicine (TIGEM), System Biology and Bioinformatics lab. and High Content Screening facility, Via Campi Flegrei 34, 80078 Pozzuoli (NA), Italy
| | - Diego di Bernardo
- Telethon Institute of Genetics and Medicine (TIGEM), System Biology and Bioinformatics lab. and High Content Screening facility, Via Campi Flegrei 34, 80078 Pozzuoli (NA), Italy.,Department of Chemical, Materials and Industrial Production Engineering, University of Naples Federico II, Piazzale Tecchio 80, 80125 Naples, Italy
| |
Collapse
|
20
|
Islam S, Anand S, Hamid J, Thabane L, Beyene J. Comparing the performance of linear and nonlinear principal components in the context of high-dimensional genomic data integration. Stat Appl Genet Mol Biol 2017; 16:199-216. [PMID: 28727569 DOI: 10.1515/sagmb-2016-0066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Linear principal component analysis (PCA) is a widely used approach to reduce the dimension of gene or miRNA expression data sets. This method relies on the linearity assumption, which often fails to capture the patterns and relationships inherent in the data. Thus, a nonlinear approach such as kernel PCA might be optimal. We develop a copula-based simulation algorithm that takes into account the degree of dependence and nonlinearity observed in these data sets. Using this algorithm, we conduct an extensive simulation to compare the performance of linear and kernel principal component analysis methods towards data integration and death classification. We also compare these methods using a real data set with gene and miRNA expression of lung cancer patients. First few kernel principal components show poor performance compared to the linear principal components in this occasion. Reducing dimensions using linear PCA and a logistic regression model for classification seems to be adequate for this purpose. Integrating information from multiple data sets using either of these two approaches leads to an improved classification accuracy for the outcome.
Collapse
|
21
|
El-Hachem N, Gendoo DMA, Ghoraie LS, Safikhani Z, Smirnov P, Chung C, Deng K, Fang A, Birkwood E, Ho C, Isserlin R, Bader GD, Goldenberg A, Haibe-Kains B. Integrative Cancer Pharmacogenomics to Infer Large-Scale Drug Taxonomy. Cancer Res 2017; 77:3057-3069. [PMID: 28314784 DOI: 10.1158/0008-5472.can-17-0096] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2017] [Revised: 02/27/2017] [Accepted: 03/13/2017] [Indexed: 11/16/2022]
Abstract
Identification of drug targets and mechanism of action (MoA) for new and uncharacterized anticancer drugs is important for optimization of treatment efficacy. Current MoA prediction largely relies on prior information including side effects, therapeutic indication, and chemoinformatics. Such information is not transferable or applicable for newly identified, previously uncharacterized small molecules. Therefore, a shift in the paradigm of MoA predictions is necessary toward development of unbiased approaches that can elucidate drug relationships and efficiently classify new compounds with basic input data. We propose here a new integrative computational pharmacogenomic approach, referred to as Drug Network Fusion (DNF), to infer scalable drug taxonomies that rely only on basic drug characteristics toward elucidating drug-drug relationships. DNF is the first framework to integrate drug structural information, high-throughput drug perturbation, and drug sensitivity profiles, enabling drug classification of new experimental compounds with minimal prior information. DNF taxonomy succeeded in identifying pertinent and novel drug-drug relationships, making it suitable for investigating experimental drugs with potential new targets or MoA. The scalability of DNF facilitated identification of key drug relationships across different drug categories, providing a flexible tool for potential clinical applications in precision medicine. Our results support DNF as a valuable resource to the cancer research community by providing new hypotheses on compound MoA and potential insights for drug repurposing. Cancer Res; 77(11); 3057-69. ©2017 AACR.
Collapse
Affiliation(s)
- Nehme El-Hachem
- Integrative Computational Systems Biology, Institut de Recherches Cliniques de Montréal, Montreal, Quebec, Canada.,Department of Biomedical Sciences. Université de Montréal, Montreal, Quebec, Canada
| | - Deena M A Gendoo
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
| | - Laleh Soltan Ghoraie
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
| | - Zhaleh Safikhani
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.,Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
| | - Petr Smirnov
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada
| | - Christina Chung
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
| | - Kenan Deng
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
| | - Ailsa Fang
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
| | - Erin Birkwood
- School of Computer Science, McGill University, Montreal, Quebec, Canada
| | - Chantal Ho
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
| | - Ruth Isserlin
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
| | - Gary D Bader
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada.,The Donnelly Centre, Toronto, Ontario, Canada.,The Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Toronto, Ontario, Canada
| | - Anna Goldenberg
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada.,Hospital for Sick Children, Toronto, Ontario, Canada
| | - Benjamin Haibe-Kains
- Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada. .,Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada.,Department of Computer Science, University of Toronto, Toronto, Ontario, Canada.,Ontario Institute of Cancer Research, Toronto, Ontario, Canada
| |
Collapse
|
22
|
Chen HR, Sherr DH, Hu Z, DeLisi C. A network based approach to drug repositioning identifies plausible candidates for breast cancer and prostate cancer. BMC Med Genomics 2016; 9:51. [PMID: 27475327 PMCID: PMC4967295 DOI: 10.1186/s12920-016-0212-7] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2016] [Accepted: 07/20/2016] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The high cost and the long time required to bring drugs into commerce is driving efforts to repurpose FDA approved drugs-to find new uses for which they weren't intended, and to thereby reduce the overall cost of commercialization, and shorten the lag between drug discovery and availability. We report on the development, testing and application of a promising new approach to repositioning. METHODS Our approach is based on mining a human functional linkage network for inversely correlated modules of drug and disease gene targets. The method takes account of multiple information sources, including gene mutation, gene expression, and functional connectivity and proximity of within module genes. RESULTS The method was used to identify candidates for treating breast and prostate cancer. We found that (i) the recall rate for FDA approved drugs for breast (prostate) cancer is 20/20 (10/11), while the rates for drugs in clinical trials were 131/154 and 82/106; (ii) the ROC/AUC performance substantially exceeds that of comparable methods; (iii) preliminary in vitro studies indicate that 5/5 candidates have therapeutic indices superior to that of Doxorubicin in MCF7 and SUM149 cancer cell lines. We briefly discuss the biological plausibility of the candidates at a molecular level in the context of the biological processes that they mediate. CONCLUSIONS Our method appears to offer promise for the identification of multi-targeted drug candidates that can correct aberrant cellular functions. In particular the computational performance exceeded that of other CMap-based methods, and in vitro experiments indicate that 5/5 candidates have therapeutic indices superior to that of Doxorubicin in MCF7 and SUM149 cancer cell lines. The approach has the potential to provide a more efficient drug discovery pipeline.
Collapse
Affiliation(s)
- Hsiao-Rong Chen
- Bioinformatics Program, College of Engineering, Boston University, Boston, MA, USA.,Graduate Program in Translational Molecular Medicine, Boston University School of Medicine, Boston, MA, USA
| | - David H Sherr
- Department of Environmental Health, Boston University School of Public Health, Boston, MA, USA
| | - Zhenjun Hu
- Bioinformatics Program, College of Engineering, Boston University, Boston, MA, USA
| | - Charles DeLisi
- Bioinformatics Program, College of Engineering, Boston University, Boston, MA, USA. .,Department of Biomedical Engineering, Boston University, Boston, MA, USA.
| |
Collapse
|
23
|
Yadav B, Gopalacharyulu P, Pemovska T, Khan SA, Szwajda A, Tang J, Wennerberg K, Aittokallio T. From drug response profiling to target addiction scoring in cancer cell models. Dis Model Mech 2016; 8:1255-64. [PMID: 26438695 PMCID: PMC4610238 DOI: 10.1242/dmm.021105] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
Deconvoluting the molecular target signals behind observed drug response phenotypes is an important part of phenotype-based drug discovery and repurposing efforts. We demonstrate here how our network-based deconvolution approach, named target addiction score (TAS), provides insights into the functional importance of druggable protein targets in cell-based drug sensitivity testing experiments. Using cancer cell line profiling data sets, we constructed a functional classification across 107 cancer cell models, based on their common and unique target addiction signatures. The pan-cancer addiction correlations could not be explained by the tissue of origin, and only correlated in part with molecular and genomic signatures of the heterogeneous cancer cells. The TAS-based cancer cell classification was also shown to be robust to drug response data resampling, as well as predictive of the transcriptomic patterns in an independent set of cancer cells that shared similar addiction signatures with the 107 cancers. The critical protein targets identified by the integrated approach were also shown to have clinically relevant mutation frequencies in patients with various cancer subtypes, including not only well-established pan-cancer genes, such as PTEN tumor suppressor, but also a number of targets that are less frequently mutated in specific cancer types, including ABL1 oncoprotein in acute myeloid leukemia. An application to leukemia patient primary cell models demonstrated how the target deconvolution approach offers functional insights into patient-specific addiction patterns, such as those indicative of their receptor-type tyrosine-protein kinase FLT3 internal tandem duplication (FLT3-ITD) status and co-addiction partners, which may lead to clinically actionable, personalized drug treatment developments. To promote its application to the future drug testing studies, we have made available an open-source implementation of the TAS calculation in the form of a stand-alone R package.
Collapse
Affiliation(s)
- Bhagwan Yadav
- Institute for Molecular Medicine Finland (FIMM), Nordic EMBL Partnership for Molecular Medicine, University of Helsinki, FI-00014 Helsinki, Finland
| | - Peddinti Gopalacharyulu
- Institute for Molecular Medicine Finland (FIMM), Nordic EMBL Partnership for Molecular Medicine, University of Helsinki, FI-00014 Helsinki, Finland
| | - Tea Pemovska
- Institute for Molecular Medicine Finland (FIMM), Nordic EMBL Partnership for Molecular Medicine, University of Helsinki, FI-00014 Helsinki, Finland
| | - Suleiman A Khan
- Institute for Molecular Medicine Finland (FIMM), Nordic EMBL Partnership for Molecular Medicine, University of Helsinki, FI-00014 Helsinki, Finland
| | - Agnieszka Szwajda
- Institute for Molecular Medicine Finland (FIMM), Nordic EMBL Partnership for Molecular Medicine, University of Helsinki, FI-00014 Helsinki, Finland
| | - Jing Tang
- Institute for Molecular Medicine Finland (FIMM), Nordic EMBL Partnership for Molecular Medicine, University of Helsinki, FI-00014 Helsinki, Finland
| | - Krister Wennerberg
- Institute for Molecular Medicine Finland (FIMM), Nordic EMBL Partnership for Molecular Medicine, University of Helsinki, FI-00014 Helsinki, Finland
| | - Tero Aittokallio
- Institute for Molecular Medicine Finland (FIMM), Nordic EMBL Partnership for Molecular Medicine, University of Helsinki, FI-00014 Helsinki, Finland
| |
Collapse
|
24
|
|
25
|
Bunte K, Leppäaho E, Saarinen I, Kaski S. Sparse group factor analysis for biclustering of multiple data sources. ACTA ACUST UNITED AC 2016; 32:2457-63. [PMID: 27153643 DOI: 10.1093/bioinformatics/btw207] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2015] [Accepted: 04/10/2016] [Indexed: 11/13/2022]
Abstract
MOTIVATION Modelling methods that find structure in data are necessary with the current large volumes of genomic data, and there have been various efforts to find subsets of genes exhibiting consistent patterns over subsets of treatments. These biclustering techniques have focused on one data source, often gene expression data. We present a Bayesian approach for joint biclustering of multiple data sources, extending a recent method Group Factor Analysis to have a biclustering interpretation with additional sparsity assumptions. The resulting method enables data-driven detection of linear structure present in parts of the data sources. RESULTS Our simulation studies show that the proposed method reliably infers biclusters from heterogeneous data sources. We tested the method on data from the NCI-DREAM drug sensitivity prediction challenge, resulting in an excellent prediction accuracy. Moreover, the predictions are based on several biclusters which provide insight into the data sources, in this case on gene expression, DNA methylation, protein abundance, exome sequence, functional connectivity fingerprints and drug sensitivity. AVAILABILITY AND IMPLEMENTATION http://research.cs.aalto.fi/pml/software/GFAsparse/ CONTACTS : kerstin.bunte@googlemail.com or samuel.kaski@aalto.fi.
Collapse
Affiliation(s)
- Kerstin Bunte
- Helsinki Institute for Information Technology HIIT, Department of Computer Science, Aalto University, Finland
| | - Eemeli Leppäaho
- Helsinki Institute for Information Technology HIIT, Department of Computer Science, Aalto University, Finland
| | - Inka Saarinen
- Helsinki Institute for Information Technology HIIT, Department of Computer Science, Aalto University, Finland
| | - Samuel Kaski
- Helsinki Institute for Information Technology HIIT, Department of Computer Science, Aalto University, Finland
| |
Collapse
|
26
|
Kibble M, Khan SA, Saarinen N, Iorio F, Saez-Rodriguez J, Mäkelä S, Aittokallio T. Transcriptional response networks for elucidating mechanisms of action of multitargeted agents. Drug Discov Today 2016; 21:1063-75. [PMID: 26979547 DOI: 10.1016/j.drudis.2016.03.001] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2015] [Revised: 02/15/2016] [Accepted: 03/04/2016] [Indexed: 01/08/2023]
Abstract
Drug discovery is moving away from the single target-based approach towards harnessing the potential of polypharmacological agents that modulate the activity of multiple nodes in the complex networks of deregulations underlying disease phenotypes. Computational network pharmacology methods that use systems-level drug-response phenotypes, such as those originating from genome-wide transcriptomic profiles, have proved particularly effective for elucidating the mechanisms of action of multitargeted compounds. Here, we show, via the case study of the natural product pinosylvin, how the combination of two complementary network-based methods can provide novel, unexpected mechanistic insights. This case study also illustrates that elucidating the mechanism of action of multitargeted natural products through transcriptional response-based approaches is a challenging endeavor, often requiring multiple computational-experimental iterations.
Collapse
Affiliation(s)
- Milla Kibble
- Institute for Molecular Medicine Finland (FIMM), Biomedicum Helsinki 2U, Tukholmankatu 8, University of Helsinki, Helsinki 00014, Finland.
| | - Suleiman A Khan
- Institute for Molecular Medicine Finland (FIMM), Biomedicum Helsinki 2U, Tukholmankatu 8, University of Helsinki, Helsinki 00014, Finland
| | - Niina Saarinen
- Institute of Biomedicine, Turku Center for Disease Modeling & Functional Foods Forum, University of Turku, Turku 20014, Finland
| | - Francesco Iorio
- European Molecular Biology Laboratory - European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK
| | - Julio Saez-Rodriguez
- European Molecular Biology Laboratory - European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK; Joint Research Centre for Computational Biomedicine (JRC-COMBINE) - RWTH Aachen University, Faculty of Medicine, D-52074 Aachen, Germany
| | - Sari Mäkelä
- Institute of Biomedicine, Turku Center for Disease Modeling & Functional Foods Forum, University of Turku, Turku 20014, Finland
| | - Tero Aittokallio
- Institute for Molecular Medicine Finland (FIMM), Biomedicum Helsinki 2U, Tukholmankatu 8, University of Helsinki, Helsinki 00014, Finland; Department of Mathematics and Statistics, Quantum, University of Turku, Turku 20014, Finland
| |
Collapse
|
27
|
Kibble M, Saarinen N, Tang J, Wennerberg K, Mäkelä S, Aittokallio T. Network pharmacology applications to map the unexplored target space and therapeutic potential of natural products. Nat Prod Rep 2015; 32:1249-66. [PMID: 26030402 DOI: 10.1039/c5np00005j] [Citation(s) in RCA: 277] [Impact Index Per Article: 30.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
It is widely accepted that drug discovery often requires a systems-level polypharmacology approach to tackle problems such as lack of efficacy and emerging resistance of single-targeted compounds. Network pharmacology approaches are increasingly being developed and applied to find new therapeutic opportunities and to re-purpose approved drugs. However, these recent advances have been relatively slow to be translated into the field of natural products. Here, we argue that a network pharmacology approach would enable an effective mapping of the yet unexplored target space of natural products, hence providing a systematic means to extend the druggable space of proteins implicated in various complex diseases. We give an overview of the key network pharmacology concepts and recent experimental-computational approaches that have been successfully applied to natural product research, including unbiased elucidation of mechanisms of action as well as systematic prediction of effective therapeutic combinations. We focus specifically on anticancer applications that use in vivo and in vitro functional phenotypic measurements, such as genome-wide transcriptomic response profiles, which enable a global modelling of the multi-target activity at the level of the biological pathways and interaction networks. We also provide representative examples of other disease applications, databases and tools as well as existing and emerging resources, which may prove useful for future natural product research. Finally, we offer our personal view of the current limitations, prospective developments and open questions in this exciting field.
Collapse
Affiliation(s)
- Milla Kibble
- Institute for Molecular Medicine Finland (FIMM), Biomedicum Helsinki 2U, 00014 University of Helsinki, Finland.
| | | | | | | | | | | |
Collapse
|