1
|
Al-Taie Z, Liu D, Mitchem JB, Papageorgiou C, Kaifi JT, Warren WC, Shyu CR. Explainable artificial intelligence in high-throughput drug repositioning for subgroup stratifications with interventionable potential. J Biomed Inform 2021; 118:103792. [PMID: 33915273 DOI: 10.1016/j.jbi.2021.103792] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2020] [Revised: 03/26/2021] [Accepted: 04/21/2021] [Indexed: 01/02/2023]
Abstract
Enabling precision medicine requires developing robust patient stratification methods as well as drugs tailored to homogeneous subgroups of patients from a heterogeneous population. Developing de novo drugs is expensive and time consuming with an ultimately low FDA approval rate. These limitations make developing new drugs for a small portion of a disease population unfeasible. Therefore, drug repositioning is an essential alternative for developing new drugs for a disease subpopulation. This shows the importance of developing data-driven approaches that find druggable homogeneous subgroups within the disease population and reposition the drugs for these subgroups. In this study, we developed an explainable AI approach for patient stratification and drug repositioning. Contrast pattern mining and network analysis were used to discover homogeneous subgroups within a disease population. For each subgroup, a biomedical network analysis was done to find the drugs that are most relevant to a given subgroup of patients. The set of candidate drugs for each subgroup was ranked using an aggregated drug score assigned to each drug. The proposed method represents a human-in-the-loop framework, where medical experts use the data-driven results to generate hypotheses and obtain insights into potential therapeutic candidates for patients who belong to a subgroup. Colorectal cancer (CRC) was used as a case study. Patients' phenotypic and genotypic data was utilized with a heterogeneous knowledge base because it gives a multi-view perspective for finding new indications for drugs outside of their original use. Our analysis of the top candidate drugs for the subgroups identified by medical experts showed that most of these drugs are cancer-related, and most of them have the potential to be a CRC regimen based on studies in the literature.
Collapse
Affiliation(s)
- Zainab Al-Taie
- Institute for Data Science & Informatics, University of Missouri, Columbia, MO 65211, USA; Department of Computer Science, College of Science for Women, University of Baghdad, Baghdad, Iraq
| | - Danlu Liu
- Electrical Engineering and Computer Science Department, University of Missouri, Columbia, MO 65211, USA
| | - Jonathan B Mitchem
- Institute for Data Science & Informatics, University of Missouri, Columbia, MO 65211, USA; Department of Surgery, School of Medicine, University of Missouri, Columbia, MO 65212, USA; Harry S. Truman Memorial Veterans' Hospital, Columbia, MO 65201, USA.
| | - Christos Papageorgiou
- Department of Surgery, School of Medicine, University of Missouri, Columbia, MO 65212, USA
| | - Jussuf T Kaifi
- Department of Surgery, School of Medicine, University of Missouri, Columbia, MO 65212, USA; Harry S. Truman Memorial Veterans' Hospital, Columbia, MO 65201, USA
| | - Wesley C Warren
- Institute for Data Science & Informatics, University of Missouri, Columbia, MO 65211, USA; Department of Surgery, School of Medicine, University of Missouri, Columbia, MO 65212, USA; Department of Animal Sciences, Bond Life Sciences Center, University of Missouri, 1201 Rollins Street, Columbia, MO 65211, USA
| | - Chi-Ren Shyu
- Institute for Data Science & Informatics, University of Missouri, Columbia, MO 65211, USA; Electrical Engineering and Computer Science Department, University of Missouri, Columbia, MO 65211, USA; Department of Medicine, School of Medicine, University of Missouri, Columbia, MO 65212, USA.
| |
Collapse
|
2
|
Kumar Shukla P, Kumar Shukla P, Sharma P, Rawat P, Samar J, Moriwal R, Kaur M. Efficient prediction of drug-drug interaction using deep learning models. IET Syst Biol 2020; 14:211-216. [PMID: 32737279 PMCID: PMC8687321 DOI: 10.1049/iet-syb.2019.0116] [Citation(s) in RCA: 42] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2019] [Revised: 04/09/2020] [Accepted: 04/22/2020] [Indexed: 12/23/2022] Open
Abstract
A drug-drug interaction or drug synergy is extensively utilised for cancer treatment. However, prediction of drug-drug interaction is defined as an ill-posed problem, because manual testing is only implementable on small group of drugs. Predicting the drug-drug interaction score has been a popular research topic recently. Recently many machine learning models have proposed in the literature to predict the drug-drug interaction score efficiently. However, these models suffer from the over-fitting issue. Therefore, these models are not so-effective for predicting the drug-drug interaction score. In this work, an integrated convolutional mixture density recurrent neural network is proposed and implemented. The proposed model integrates convolutional neural networks, recurrent neural networks and mixture density networks. Extensive comparative analysis reveals that the proposed model significantly outperforms the competitive models.
Collapse
Affiliation(s)
- Prashant Kumar Shukla
- Department of Computer Science and Engineering, School of Engineering & Technology, Jagran Lake City University (JLU), Bhopal, MP, India
| | - Piyush Kumar Shukla
- Department of Computer Science and Engineering, University Institute of Technology, RGPV, Bhopal, MP, India
| | - Poonam Sharma
- Department of Computer Science & Engineering, Visvesvaraya National Institute of Technology, Nagpur, India
| | - Paresh Rawat
- Department of Electronics and Communication Engineering, Sagar Institute of Science & Technology (SISTec), Gandhi Nagar, Bhopal, MP, India
| | - Jashwant Samar
- Department of Computer Science and Engineering, University Institute of Technology, RGPV, Bhopal, MP, India
| | - Rahul Moriwal
- Department of Computer Science and Engineering-AITR Indore, MP, India
| | - Manjit Kaur
- Department of Computer and Communication Engineering, School of Computing and Information Technology, Manipal University Jaipur, Rajasthan, India.
| |
Collapse
|
3
|
Lau A, So HC. Turning genome-wide association study findings into opportunities for drug repositioning. Comput Struct Biotechnol J 2020; 18:1639-1650. [PMID: 32670504 PMCID: PMC7334463 DOI: 10.1016/j.csbj.2020.06.015] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2019] [Revised: 06/05/2020] [Accepted: 06/05/2020] [Indexed: 02/02/2023] Open
Abstract
Drug development is a very costly and lengthy process, while repositioned or repurposed drugs could be brought into clinical practice within a shorter time-frame and at a much reduced cost. Numerous computational approaches to drug repositioning have been developed, but methods utilizing genome-wide association studies (GWASs) data are less explored. The past decade has observed a massive growth in the amount of data from GWAS; the rich information contained in GWAS has great potential to guide drug repositioning or discovery. While multiple tools are available for finding the most relevant genes from GWAS hits, searching for top susceptibility genes is only one way to guide repositioning, which has its own limitations. Here we provide a comprehensive review of different computational approaches that employ GWAS data to guide drug repositioning. These methods include selecting top candidate genes from GWAS as drug targets, deducing drug candidates based on drug-drug and disease-disease similarities, searching for reversed expression profiles between drugs and diseases, pathway-based methods as well as approaches based on analysis of biological networks. Each method is illustrated with examples, and their respective strengths and limitations are discussed. We also discussed several areas for future research.
Collapse
Affiliation(s)
- Alexandria Lau
- School of Biomedical Sciences, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong SAR, China
| | - Hon-Cheong So
- School of Biomedical Sciences, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong SAR, China
- KIZ-CUHK Joint Laboratory of Bioresources and Molecular Research of Common Diseases, Kunming Zoology Institute of Zoology and The Chinese University of Hong Kong, Hong Kong SAR, China
- Department of Psychiatry, The Chinese University of Hong Kong, Hong Kong SAR, China
- Margaret K.L. Cheung Research Centre for Management of Parkinsonism, The Chinese University of Hong Kong, Hong Kong SAR, China
- Shenzhen Research Institute, The Chinese University of Hong Kong, Shenzhen, China
- Brain and Mind Institute, The Chinese University of Hong Kong, Hong Kong SAR, China
- Hong Kong Branch of the Chinese Academy of Sciences Center for Excellence in Animal Evolution and Genetics, The Chinese University of Hong Kong, Hong Kong SAR, China
- Corresponding author at: School of Biomedical Sciences, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong SAR, China.
| |
Collapse
|
4
|
Yoon S, Lee D. Meta-path Based Prioritization of Functional Drug Actions with Multi-Level Biological Networks. Sci Rep 2019; 9:5469. [PMID: 30940832 PMCID: PMC6445150 DOI: 10.1038/s41598-019-41814-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2018] [Accepted: 03/14/2019] [Indexed: 11/09/2022] Open
Abstract
Functional drug actions refer to drug-affected GO terms. They aid in the investigation of drug effects that are therapeutic or adverse. Previous studies have utilized the linkage information between drugs and functions in molecular level biological networks. Since the current knowledge of molecular level mechanisms of biological functions is still limited, such previous studies were incomplete. We expected that the multi-level biological networks would allow us to more completely investigate the functional drug actions. We constructed multi-level biological networks with genes, GO terms, and diseases. Meta-paths were utilized to extract the features of each GO term. We trained 39 SVM models to prioritize the functional drug actions of the various 39 drugs. Through the multi-level networks, more functional drug actions were utilized for the 39 models and inferred by the models. Multi-level based features improved the performance of the models, and the average AUROC value in the cross-validation was 0.86. Moreover, 60% of the candidates were true.
Collapse
Affiliation(s)
- Seyeol Yoon
- Department of Bio and Brain Engineering, KAIST, 291 Daehak-ro, Yuseong-gu, 34141, Daejeon, Republic of Korea
- Bio-Synergy Research Center, 291 Daehak-ro, Yuseong-gu, 34141, Daejeon, Republic of Korea
| | - Doheon Lee
- Department of Bio and Brain Engineering, KAIST, 291 Daehak-ro, Yuseong-gu, 34141, Daejeon, Republic of Korea.
- Bio-Synergy Research Center, 291 Daehak-ro, Yuseong-gu, 34141, Daejeon, Republic of Korea.
| |
Collapse
|
5
|
Singh H, Rana PS, Singh U. Prediction of drug synergy score using ensemble based differential evolution. IET Syst Biol 2019; 13:24-29. [PMID: 30774113 PMCID: PMC8687263 DOI: 10.1049/iet-syb.2018.5023] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2018] [Revised: 07/23/2018] [Accepted: 09/05/2018] [Indexed: 12/23/2022] Open
Abstract
Prediction of drug synergy score is an ill-posed problem. It plays an efficient role in the medical field for inhibiting specific cancer agents. An efficient regression-based machine learning technique has an ability to minimise the drug synergy prediction errors. Therefore, in this study, an efficient machine learning technique for drug synergy prediction technique is designed by using ensemble based differential evolution (DE) for optimising the support vector machine (SVM). Because the tuning of the attributes of SVM kernel regulates the prediction precision. The ensemble based DE employs two trial vector generation techniques and two control attributes settings. The initial generation technique has the best solution and the other is without the best solution. The proposed and existing competitive machine learning techniques are applied to drug synergy data. The extensive analysis demonstrates that the proposed technique outperforms others in terms of accuracy, root mean square error and coefficient of correlation.
Collapse
Affiliation(s)
- Harpreet Singh
- Computer Science and Engineering Department, Thapar Institute of Engineering and Technology, Patiala, Punjab, 147004, India.
| | - Prashant Singh Rana
- Computer Science and Engineering Department, Thapar Institute of Engineering and Technology, Patiala, Punjab, 147004, India
| | - Urvinder Singh
- Electronics & Communication Engineering Department, Thapar Institute of Engineering and Technology, Patiala, Punjab, 147004, India
| |
Collapse
|
6
|
Rare Diseases: Drug Discovery and Informatics Resource. Interdiscip Sci 2017; 10:195-204. [PMID: 29094320 DOI: 10.1007/s12539-017-0270-3] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2017] [Revised: 10/19/2017] [Accepted: 10/23/2017] [Indexed: 12/13/2022]
Abstract
A rare disease refers to any disease with very low prevalence individually. Although the impacted population is small for a single disease, more than 6000 rare diseases affect millions of people across the world. Due to the small market size, high cost and possibly low return on investment, only in recent years, the research and development of rare disease drugs have gradually risen globally, in several domains including gene therapy, enzyme replacement therapy, and drug repositioning. Due to the complex etiology and heterogeneous symptoms, there is a large gap between basic research and patient unmet needs for rare disease drug discovery. As computational biology increasingly arises researchers' awareness, the informatics database on rare disease have grown rapidly in the recent years, including drug targets, genetic variant and mutation, phenotype and ontology and patient registries. Along with the advances of informatics database and networks, new computational models will help accelerate the target identification and lead optimization process for rare disease pre-clinical drug development.
Collapse
|
7
|
Kibble M, Khan SA, Saarinen N, Iorio F, Saez-Rodriguez J, Mäkelä S, Aittokallio T. Transcriptional response networks for elucidating mechanisms of action of multitargeted agents. Drug Discov Today 2016; 21:1063-75. [PMID: 26979547 DOI: 10.1016/j.drudis.2016.03.001] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2015] [Revised: 02/15/2016] [Accepted: 03/04/2016] [Indexed: 01/08/2023]
Abstract
Drug discovery is moving away from the single target-based approach towards harnessing the potential of polypharmacological agents that modulate the activity of multiple nodes in the complex networks of deregulations underlying disease phenotypes. Computational network pharmacology methods that use systems-level drug-response phenotypes, such as those originating from genome-wide transcriptomic profiles, have proved particularly effective for elucidating the mechanisms of action of multitargeted compounds. Here, we show, via the case study of the natural product pinosylvin, how the combination of two complementary network-based methods can provide novel, unexpected mechanistic insights. This case study also illustrates that elucidating the mechanism of action of multitargeted natural products through transcriptional response-based approaches is a challenging endeavor, often requiring multiple computational-experimental iterations.
Collapse
Affiliation(s)
- Milla Kibble
- Institute for Molecular Medicine Finland (FIMM), Biomedicum Helsinki 2U, Tukholmankatu 8, University of Helsinki, Helsinki 00014, Finland.
| | - Suleiman A Khan
- Institute for Molecular Medicine Finland (FIMM), Biomedicum Helsinki 2U, Tukholmankatu 8, University of Helsinki, Helsinki 00014, Finland
| | - Niina Saarinen
- Institute of Biomedicine, Turku Center for Disease Modeling & Functional Foods Forum, University of Turku, Turku 20014, Finland
| | - Francesco Iorio
- European Molecular Biology Laboratory - European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK
| | - Julio Saez-Rodriguez
- European Molecular Biology Laboratory - European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK; Joint Research Centre for Computational Biomedicine (JRC-COMBINE) - RWTH Aachen University, Faculty of Medicine, D-52074 Aachen, Germany
| | - Sari Mäkelä
- Institute of Biomedicine, Turku Center for Disease Modeling & Functional Foods Forum, University of Turku, Turku 20014, Finland
| | - Tero Aittokallio
- Institute for Molecular Medicine Finland (FIMM), Biomedicum Helsinki 2U, Tukholmankatu 8, University of Helsinki, Helsinki 00014, Finland; Department of Mathematics and Statistics, Quantum, University of Turku, Turku 20014, Finland
| |
Collapse
|