251
|
Wong BCF, Chau CKL, Ao FK, Mo CH, Wong SY, Wong YH, So HC. Differential associations of depression-related phenotypes with cardiometabolic risks: Polygenic analyses and exploring shared genetic variants and pathways. Depress Anxiety 2019; 36:330-344. [PMID: 30521077 DOI: 10.1002/da.22861] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/06/2018] [Revised: 06/11/2018] [Accepted: 10/20/2018] [Indexed: 12/18/2022] Open
Abstract
BACKGROUND Numerous studies have suggested associations between depression and cardiometabolic (CM) diseases. However, little is known about the mechanism underlying this comorbidity, and whether the relationship differs by depression subtypes. METHODS Using polygenic risk scores (PRS) and linkage disequilibrium (LD) score regression, we investigated the genetic overlap of various depression-related phenotypes with a comprehensive panel of 20 CM traits. GWAS results for major depressive disorder (MDD) were taken from the PGC and CONVERGE studies, with the latter focusing on severe melancholic depression. GWAS results on general depressive symptoms (DS) and neuroticism were also included. We identified the shared genetic variants and inferred enriched pathways. We also looked for drugs over-represented among the top-shared genes, with an aim to finding repositioning opportunities for comorbidities. RESULTS We found significant genetic overlap between MDD, DS, and neuroticism with cardiometabolic traits. In general, positive polygenic associations with CM abnormalities were observed except for MDD-CONVERGE. Counterintuitively, PRS representing severe melancholic depression was associated with reduced CM risks. Enrichment analyses of shared SNPs revealed many interesting pathways such as those related to inflammation that underlie the comorbidity of depressive and CM traits. Using a gene-set analysis approach, we also revealed several repositioning candidates with literature support (e.g., bupropion). CONCLUSIONS Our study highlights shared genetic bases of depression with CM traits, and suggests the associations vary by depression subtypes, which may have implications in targeted prevention of cardiovascular events for patients. Identification of shared genetic factors may also guide drug discovery for the comorbidities.
Collapse
Affiliation(s)
- Brian Chi-Fung Wong
- School of Biomedical Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Carlos Kwan-Long Chau
- School of Biomedical Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Fu-Kiu Ao
- School of Biomedical Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Cheuk-Hei Mo
- Faculty of Medicine, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Sze-Yung Wong
- Faculty of Medicine, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Yui-Hang Wong
- School of Biomedical Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Hon-Cheong So
- School of Biomedical Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong.,KIZ-CUHK Joint Laboratory of Bioresources and Molecular Research of Common Diseases, Kunming Institute of Zoology and The Chinese University of Hong Kong, Shatin, Hong Kong
| |
Collapse
|
252
|
Abstract
One of the recommendations of the Cancer Moonshot Blue Ribbon Panel report from 2016 was the creation of a national cancer data ecosystem. We review some of the approaches for building cancer data ecosystems and some of the progress that has been made. A data commons is the colocation of data with cloud computing infrastructure and commonly used software services, tools, and applications for managing, integrating, analyzing, and sharing data to create an interoperable resource for the research community. We discuss data commons and their potential role in cancer data ecosystems and, in particular, how multiple data commons can interoperate to form part of the foundation for a cancer data ecosystem.
Collapse
|
253
|
Xuan P, Cao Y, Zhang T, Wang X, Pan S, Shen T. Drug repositioning through integration of prior knowledge and projections of drugs and diseases. Bioinformatics 2019; 35:4108-4119. [DOI: 10.1093/bioinformatics/btz182] [Citation(s) in RCA: 50] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2018] [Revised: 02/24/2019] [Accepted: 03/12/2019] [Indexed: 12/20/2022] Open
Abstract
Abstract
Motivation
Identifying and developing novel therapeutic effects for existing drugs contributes to reduction of drug development costs. Most of the previous methods focus on integration of the heterogeneous data of drugs and diseases from multiple sources for predicting the candidate drug–disease associations. However, they fail to take the prior knowledge of drugs and diseases and their sparse characteristic into account. It is essential to develop a method that exploits the more useful information to predict the reliable candidate associations.
Results
We present a method based on non-negative matrix factorization, DisDrugPred, to predict the drug-related candidate disease indications. A new type of drug similarity is firstly calculated based on their associated diseases. DisDrugPred completely integrates two types of disease similarities, the associations between drugs and diseases, and the various similarities between drugs from different levels including the chemical structures of drugs, the target proteins of drugs, the diseases associated with drugs and the side effects of drugs. The prior knowledge of drugs and diseases and the sparse characteristic of drug–disease associations provide a deep biological perspective for capturing the relationships between drugs and diseases. Simultaneously, the possibility that a drug is associated with a disease is also dependant on their projections in the low-dimension feature space. Therefore, DisDrugPred deeply integrates the diverse prior knowledge, the sparse characteristic of associations and the projections of drugs and diseases. DisDrugPred achieves superior prediction performance than several state-of-the-art methods for drug–disease association prediction. During the validation process, DisDrugPred also can retrieve more actual drug–disease associations in the top part of prediction result which often attracts more attention from the biologists. Moreover, case studies on five drugs further confirm DisDrugPred’s ability to discover potential candidate disease indications for drugs.
Availability and implementation
The fourth type of drug similarity and the predicted candidates for all the drugs are available at https://github.com/pingxuan-hlju/DisDrugPred.
Supplementary information
Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Ping Xuan
- School of Computer Science and Technology, Heilongjiang University, Harbin, China
| | - Yangkun Cao
- School of Computer Science and Technology, Heilongjiang University, Harbin, China
| | - Tiangang Zhang
- School of Mathematical Science, Heilongjiang University, Harbin, China
| | - Xiao Wang
- School of Computer Science, Beijing University of Posts and Telecommunications, Beijing, China
| | - Shuxiang Pan
- School of Computer Science and Technology, Heilongjiang University, Harbin, China
| | - Tonghui Shen
- School of Computer Science and Technology, Heilongjiang University, Harbin, China
| |
Collapse
|
254
|
Rutherford KD, Mazandu GK, Mulder NJ. A systems-level analysis of drug-target-disease associations for drug repositioning. Brief Funct Genomics 2019; 17:34-41. [PMID: 28968683 DOI: 10.1093/bfgp/elx015] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Drug repositioning is the process of finding new therapeutic uses for existing, approved drugs-a process thathas value when considering the exorbitant costs of novel drug development. Several computational strategies exist as a way to predict these alternative applications. In this study, we used datasets on: (1) human biological drug targets and (2) disease-associated genes and, based on a direct functional interaction between them, searched for potential opportunities for drug repositioning. From the set of 1125 unique drug targets and their 88 490 interactions with disease-associated genes, 30 drug targets were analyzed and (3) discussed in detail for the purpose of this article. The current indications of the drugs thattarget them were validated through the interactions, and new opportunities for repositioning were predicted. Among the set of drugs for potential repositioning werebenzodiazepines for the treatment of autism spectrum disorders; nortriptyline for the treatment of melanoma, glioma and other cancers; and vitamin B6 in prevention of spontaneous abortions and cleft palate birth defects. Special emphasis was also placed on those new potential indications that pertained to orphan diseases-these are diseases whose rarity means that development of novel treatment is not financially viable. This computational drug repositioning approach uses existing information on drugs and drug targets, and insights into the genetic basis of disease, as a means to systematically generate the most probable new uses for the drugs on offer, and in this way harness their true therapeutic power.
Collapse
|
255
|
Polamreddy P, Gattu N. The drug repurposing landscape from 2012 to 2017: evolution, challenges, and possible solutions. Drug Discov Today 2019; 24:789-795. [DOI: 10.1016/j.drudis.2018.11.022] [Citation(s) in RCA: 54] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2018] [Revised: 11/15/2018] [Accepted: 11/27/2018] [Indexed: 01/13/2023]
|
256
|
Arrouchi H, Lakhlili W, Ibrahimi A. Re-positioning of known drugs for Pim-1 kinase target using molecular docking analysis. Bioinformation 2019; 15:116-120. [PMID: 31435157 PMCID: PMC6677905 DOI: 10.6026/97320630015116] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2019] [Revised: 02/01/2019] [Accepted: 02/01/2019] [Indexed: 12/02/2022] Open
Abstract
The Concept of reusing existing drugs for new targets is gaining momentum in recent years because of cost-effectiveness as safety and toxicology data are already available. Therefore, it is of interest to re-profile known drugs against the Pim-1 kinase target using molecular docking analysis. Results show that known drugs such as nilotinib, vemurafenib, Idelalisib, and other small kinases inhibitors have high binding ability with Pim-1 kinase for consideration as potential inhibitors.
Collapse
Affiliation(s)
- Housna Arrouchi
- Biotechnology Laboratory (Medbiotech),BioInova Research center,Rabat Medical and Pharmacy School,Mohammed V University in Rabat,Rabat,Morroco
| | - Wiame Lakhlili
- Biotechnology Laboratory (Medbiotech),BioInova Research center,Rabat Medical and Pharmacy School,Mohammed V University in Rabat,Rabat,Morroco
| | - Azeddine Ibrahimi
- Biotechnology Laboratory (Medbiotech),BioInova Research center,Rabat Medical and Pharmacy School,Mohammed V University in Rabat,Rabat,Morroco
| |
Collapse
|
257
|
Regan-Fendt KE, Xu J, DiVincenzo M, Duggan MC, Shakya R, Na R, Carson WE, Payne PRO, Li F. Synergy from gene expression and network mining (SynGeNet) method predicts synergistic drug combinations for diverse melanoma genomic subtypes. NPJ Syst Biol Appl 2019; 5:6. [PMID: 30820351 PMCID: PMC6391384 DOI: 10.1038/s41540-019-0085-4] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2018] [Accepted: 01/23/2019] [Indexed: 12/31/2022] Open
Abstract
Systems biology perspectives are crucial for understanding the pathophysiology of complex diseases, and therefore hold great promise for the discovery of novel treatment strategies. Drug combinations have been shown to improve durability and reduce resistance to available first-line therapies in a variety of cancers; however, traditional drug discovery approaches are prohibitively cost and labor-intensive to evaluate large-scale matrices of potential drug combinations. Computational methods are needed to efficiently model complex interactions of drug target pathways and identify mechanisms underlying drug combination synergy. In this study, we employ a computational approach, SynGeNet (Synergy from Gene expression and Network mining), which integrates transcriptomics-based connectivity mapping and network centrality analysis to analyze disease networks and predict drug combinations. As an exemplar of a disease in which combination therapies demonstrate efficacy in genomic-specific contexts, we investigate malignant melanoma. We employed SynGeNet to generate drug combination predictions for each of the four major genomic subtypes of melanoma (BRAF, NRAS, NF1, and triple wild type) using publicly available gene expression and mutation data. We validated synergistic drug combinations predicted by our method across all genomic subtypes using results from a high-throughput drug screening study across. Finally, we prospectively validated the drug combination for BRAF-mutant melanoma that was top ranked by our approach, vemurafenib (BRAF inhibitor) + tretinoin (retinoic acid receptor agonist), using both in vitro and in vivo models of BRAF-mutant melanoma and RNA-sequencing analysis of drug-treated melanoma cells to validate the predicted mechanisms. Our approach is applicable to a wide range of disease domains, and, importantly, can model disease-relevant protein subnetworks in precision medicine contexts.
Collapse
Affiliation(s)
- Kelly E Regan-Fendt
- Department of Biomedical Informatics, The Ohio State University, Columbus, OH, USA
| | - Jielin Xu
- Department of Biomedical Informatics, The Ohio State University, Columbus, OH, USA
| | - Mallory DiVincenzo
- Comprehensive Cancer Center, The Ohio State University, Columbus, OH, USA
| | - Megan C Duggan
- Comprehensive Cancer Center, The Ohio State University, Columbus, OH, USA
| | - Reena Shakya
- Target Validation Shared Resource, The Ohio State University, Columbus, OH, USA
| | - Ryejung Na
- Target Validation Shared Resource, The Ohio State University, Columbus, OH, USA
| | - William E Carson
- Comprehensive Cancer Center, The Ohio State University, Columbus, OH, USA
| | - Philip R O Payne
- Institute for Informatics, Washington University in St. Louis, St. Louis, MO, USA
| | - Fuhai Li
- Institute for Informatics, Washington University in St. Louis, St. Louis, MO, USA.
- Department of Pediatrics, Washington University in St. Louis, St. Louis, MO, USA.
| |
Collapse
|
258
|
Computational Drug Repositioning for Gastric Cancer using Reversal Gene Expression Profiles. Sci Rep 2019; 9:2660. [PMID: 30804389 PMCID: PMC6389943 DOI: 10.1038/s41598-019-39228-9] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2018] [Accepted: 01/21/2019] [Indexed: 12/16/2022] Open
Abstract
Treatment of gastric cancer (GC) often produces poor outcomes. Moreover, predicting which GC treatments will be effective remains challenging. Computational drug repositioning using public databases is a promising and efficient tool for discovering new uses for existing drugs. Here we used a computational reversal of gene expression approach based on effects on gene expression signatures by GC disease and drugs to explore new GC drug candidates. Gene expression profiles for individual GC tumoral and normal gastric tissue samples were downloaded from the Gene Expression Omnibus (GEO) and differentially expressed genes (DEGs) in GC were determined with a meta-signature analysis. Profiles drug activity and drug-induced gene expression were downloaded from the ChEMBL and the LINCS databases, respectively. Candidate drugs to treat GC were predicted using reversal gene expression score (RGES). Drug candidates including sorafenib, olaparib, elesclomol, tanespimycin, selumetinib, and ponatinib were predicted to be active for treatment of GC. Meanwhile, GC-related genes such as PLOD3, COL4A1, UBE2C, MIF, and PRPF5 were identified as having gene expression profiles that can be reversed by drugs. These findings support the use of a computational reversal gene expression approach to identify new drug candidates that can be used to treat GC.
Collapse
|
259
|
Zhou X, Dai E, Song Q, Ma X, Meng Q, Jiang Y, Jiang W. In silico drug repositioning based on drug-miRNA associations. Brief Bioinform 2019; 21:498-510. [DOI: 10.1093/bib/bbz012] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2018] [Revised: 12/14/2018] [Accepted: 01/11/2019] [Indexed: 02/06/2023] Open
Abstract
Abstract
Drug repositioning has become a prevailing tactic as this strategy is efficient, economical and low risk for drug discovery. Meanwhile, recent studies have confirmed that small-molecule drugs can modulate the expression of disease-related miRNAs, which indicates that miRNAs are promising therapeutic targets for complex diseases. In this study, we put forward and verified the hypothesis that drugs with similar miRNA profiles may share similar therapeutic properties. Furthermore, a comprehensive drug–drug interaction network was constructed based on curated drug-miRNA associations. Through random network comparison, topological structure analysis and network module extraction, we found that the closely linked drugs in the network tend to treat the same diseases. Additionally, the curated drug–disease relationships (from the CTD) and random walk with restarts algorithm were utilized on the drug–drug interaction network to identify the potential drugs for a given disease. Both internal validation (leave-one-out cross-validation) and external validation (independent drug–disease data set from the ChEMBL) demonstrated the effectiveness of the proposed approach. Finally, by integrating drug-miRNA and miRNA-disease information, we also explain the modes of action of drugs in the view of miRNA regulation. In summary, our work could determine novel and credible drug indications and offer novel insights and valuable perspectives for drug repositioning.
Collapse
Affiliation(s)
- Xu Zhou
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, P. R. China
| | - Enyu Dai
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, P. R. China
| | - Qian Song
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, P. R. China
| | - Xueyan Ma
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, P. R. China
| | - Qianqian Meng
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, P. R. China
| | - Yongshuai Jiang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, P. R. China
| | - Wei Jiang
- Department of Biomedical Engineering, College of Automation Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing, P. R. China
| |
Collapse
|
260
|
Zhang C, Yan G. Synergistic drug combinations prediction by integrating pharmacological data. Synth Syst Biotechnol 2019; 4:67-72. [PMID: 30820478 PMCID: PMC6370570 DOI: 10.1016/j.synbio.2018.10.002] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2018] [Revised: 09/30/2018] [Accepted: 10/04/2018] [Indexed: 12/12/2022] Open
Abstract
There is compelling evidence that synergistic drug combinations have become promising strategies for combating complex diseases, and they have evident predominance comparing to traditional one drug - one disease approaches. In this paper, we develop a computational method, namely SyFFM, that takes pharmacological data into consideration and applies field-aware factorization machines to analyze and predict potential synergistic drug combinations. Firstly, features of drug pairs are constructed based on associations between drugs and target, and enzymes, and indication areas. Then, the synergistic scores of drug combinations are obtained by implementing field-aware factorization machines on latent vector space of these features. Finally, synergistic combinations can be predicted by introducing a threshold. We applied SyFFM to predict pairwise synergistic combinations and three-drug synergistic combinations, and the performance is good in terms of cross-validation. Besides, more than 90% combinations of the top ranked predictions are proved by literature and the analysis of parameters in model shows that our method can help to investigate and explain synergistic mechanisms underlying combinatorial therapy.
Collapse
Affiliation(s)
- Chengzhi Zhang
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, PR China.,School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing, 100049, PR China
| | - Guiying Yan
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, PR China.,School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing, 100049, PR China
| |
Collapse
|
261
|
Prediction of drug-target interaction by integrating diverse heterogeneous information source with multiple kernel learning and clustering methods. Comput Biol Chem 2019; 78:460-467. [DOI: 10.1016/j.compbiolchem.2018.11.028] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2018] [Revised: 11/30/2018] [Accepted: 11/30/2018] [Indexed: 02/08/2023]
|
262
|
Ozdemir ES, Halakou F, Nussinov R, Gursoy A, Keskin O. Methods for Discovering and Targeting Druggable Protein-Protein Interfaces and Their Application to Repurposing. Methods Mol Biol 2019; 1903:1-21. [PMID: 30547433 PMCID: PMC8185533 DOI: 10.1007/978-1-4939-8955-3_1] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/11/2023]
Abstract
Drug repurposing is a creative and resourceful approach to increase the number of therapies by exploiting available and approved drugs. However, identifying new protein targets for previously approved drugs is challenging. Although new strategies have been developed for drug repurposing, there is broad agreement that there is room for further improvements. In this chapter, we review protein-protein interaction (PPI) interface-targeting strategies for drug repurposing applications. We discuss certain features, such as hot spot residue and hot region prediction and their importance in drug repurposing, and illustrate common methods used in PPI networks to identify drug off-targets. We also collect available online resources for hot spot prediction, binding pocket identification, and interface clustering which are effective resources in polypharmacology. Finally, we provide case studies showing the significance of protein interfaces and hot spots in drug repurposing.
Collapse
Affiliation(s)
- E Sila Ozdemir
- Department of Chemical and Biological Engineering, Koc University, Istanbul, Turkey
| | - Farideh Halakou
- Department of Computer Engineering, Koc University, Istanbul, Turkey
| | - Ruth Nussinov
- Cancer and Inflammation Program, Leidos Biomedical Research, Inc., Frederick National Laboratory for Cancer Research, National Cancer Institute at Frederick, Frederick, MD, USA
- Department of Human Molecular Genetics and Biochemistry, Sackler School of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Attila Gursoy
- Department of Computer Engineering, Koc University, Istanbul, Turkey.
| | - Ozlem Keskin
- Department of Chemical and Biological Engineering, Koc University, Istanbul, Turkey.
| |
Collapse
|
263
|
Chakraborti S, Ramakrishnan G, Srinivasan N. Repurposing Drugs Based on Evolutionary Relationships Between Targets of Approved Drugs and Proteins of Interest. Methods Mol Biol 2019; 1903:45-59. [PMID: 30547435 DOI: 10.1007/978-1-4939-8955-3_3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022]
Abstract
Drug repurposing has garnered much interest as an effective method for drug development among biopharmaceutical companies. The availability of information on complete sequences of genomes and their associated biological data, genotype-phenotype-disease relationships, and properties of small molecules offers opportunities to explore the repurpose-able potential of existing pharmacopoeia. This method gains further importance, especially, in the context of development of drugs against infectious diseases, some of which pose serious complications due to emergence of drug-resistant pathogens. In this article, we describe computational means to achieve potential repurpose-able drug candidates that may be used against infectious diseases by exploring evolutionary relationships between established targets of FDA-approved drugs and proteins of pathogen of interest.
Collapse
Affiliation(s)
- Sohini Chakraborti
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, Karnataka, India
| | - Gayatri Ramakrishnan
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, Karnataka, India.,Indian Institute of Science Mathematics Initiative, Indian Institute of Science, Bangalore, India.,Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | | |
Collapse
|
264
|
Vitali F, Berghout J, Fan J, Li J, Li Q, Li H, Lussier YA. Precision drug repurposing via convergent eQTL-based molecules and pathway targeting independent disease-associated polymorphisms. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2019; 24:308-319. [PMID: 30864332 PMCID: PMC6425966] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Repurposing existing drugs for new therapeutic indications can improve success rates and streamline development. Use of large-scale biomedical data repositories, including eQTL regulatory relationships and genome-wide disease risk associations, offers opportunities to propose novel indications for drugs targeting common or convergent molecular candidates associated to two or more diseases. This proposed novel computational approach scales across 262 complex diseases, building a multi-partite hierarchical network integrating (i) GWAS-derived SNP-to-disease associations, (ii) eQTL-derived SNP-to-eGene associations incorporating both cis- and trans-relationships from 19 tissues, (iii) protein target-to-drug, and (iv) drug-to-disease indications with (iv) Gene Ontology-based information theoretic semantic (ITS) similarity calculated between protein target functions. Our hypothesis is that if two diseases are associated to a common or functionally similar eGene - and a drug targeting that eGene/protein in one disease exists - the second disease becomes a potential repurposing indication. To explore this, all possible pairs of independently segregating GWAS-derived SNPs were generated, and a statistical network of similarity within each SNP-SNP pair was calculated according to scale-free overrepresentation of convergent biological processes activity in regulated eGenes (ITSeGENE-eGENE) and scale-free overrepresentation of common eGene targets between the two SNPs (ITSSNP-SNP). Significance of ITSSNP-SNP was conservatively estimated using empirical scale-free permutation resampling keeping the node-degree constant for each molecule in each permutation. We identified 26 new drug repurposing indication candidates spanning 89 GWAS diseases, including a potential repurposing of the calcium-channel blocker Verapamil from coronary disease to gout. Predictions from our approach are compared to known drug indications using DrugBank as a gold standard (odds ratio=13.1, p-value=2.49x10-8). Because of specific disease-SNPs associations to candidate drug targets, the proposed method provides evidence for future precision drug repositioning to a patient's specific polymorphisms.
Collapse
Affiliation(s)
- Francesca Vitali
- Center for Biomedical Informatics and Biostatistics (CB2), The University of Arizona, Tucson, AZ 85721, USA2Department of Medicine COM-T, The University of Arizona, Tucson, AZ 85721, USA†Authors contributed equally to this work,
| | | | | | | | | | | | | |
Collapse
|
265
|
A Drug Repurposing Method Based on Drug-Drug Interaction Networks and Using Energy Model Layouts. Methods Mol Biol 2019; 1903:185-201. [PMID: 30547443 DOI: 10.1007/978-1-4939-8955-3_11] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Complex network representations of reported drug-drug interactions foster computational strategies that can infer pharmacological functions which, in turn, create incentives for drug repositioning. Here, we use Gephi (a platform for complex network visualization and analysis) to represent a drug-drug interaction network with drug interaction information from DrugBank 4.1. Both modularity class- and force-directed layout ForceAtlas2 are employed to generate drug clusters which correspond to nine specific drug properties. Most drugs comply with their cluster's dominant property; however, some of them seem not to be in a proper position (i.e., in accordance with their already known functions). Such cases, along with cases of drugs that are topologically placed in the overlapping or bordering zones between clusters, may indicate previously unaccounted pharmacologic functions, thus leading to potential repositionings. Out of the 1141 drugs with relevant information on their interactions in DrugBank 4.1, we confirm the predicted properties for 85% of the drugs. The high prediction rate of our methodology suggests that, at least for some of the 15% drugs that seem to be inconsistent with the predicted property, we can get very good repositioning hints. As such, we present illustrative examples of recovered well-known repositionings, as well as recently confirmed pharmacological properties.
Collapse
|
266
|
Tian Z, Teng Z, Cheng S, Guo M. Computational drug repositioning using meta-path-based semantic network analysis. BMC SYSTEMS BIOLOGY 2018; 12:134. [PMID: 30598084 PMCID: PMC6311940 DOI: 10.1186/s12918-018-0658-7] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
BACKGROUND Drug repositioning is a promising and efficient way to discover new indications for existing drugs, which holds the great potential for precision medicine in the post-genomic era. Many network-based approaches have been proposed for drug repositioning based on similarity networks, which integrate multiple sources of drugs and diseases. However, these methods may simply view nodes as the same-typed and neglect the semantic meanings of different meta-paths in the heterogeneous network. Therefore, it is urgent to develop a rational method to infer new indications for approved drugs. RESULTS In this study, we proposed a novel methodology named HeteSim_DrugDisease (HSDD) for the prediction of drug repositioning. Firstly, we build the drug-drug similarity network and disease-disease similarity network by integrating the information of drugs and diseases. Secondly, a drug-disease heterogeneous network is constructed, which combines the drug similarity network, disease similarity network as well as the known drug-disease association network. Finally, HSDD predicts novel drug-disease associations based on the HeteSim scores of different meta-paths. The experimental results show that HSDD performs significantly better than the existing state-of-the-art approaches. HSDD achieves an AUC score of 0.8994 in the leave-one-out cross validation experiment. Moreover, case studies for selected drugs further illustrate the practical usefulness of HSDD. CONCLUSIONS HSDD can be an effective and feasible way to infer the associations between drugs and diseases using on meta-path-based semantic network analysis.
Collapse
Affiliation(s)
- Zhen Tian
- School of Information Engineering, Zhengzhou University, Zhengzhou, 450001, People's Republic of China
| | - Zhixia Teng
- School of information and computer engineering, Northeast Forestry, Harbin, 150001, People's Republic of China
| | - Shuang Cheng
- Institute of Materials, China Academy of Engineering Physics, Jiang You, 621907, Sichuan, China
| | - Maozu Guo
- School of Electrical and Information Engineering, Beijing University of Civil Engineering and Architecture, Beijing, 100044, People's Republic of China. .,Beijing Key Laboratory of Intelligent Processing for Building Big Data, Beijing, 100044, China.
| |
Collapse
|
267
|
Olayan RS, Ashoor H, Bajic VB. DDR: efficient computational method to predict drug-target interactions using graph mining and machine learning approaches. Bioinformatics 2018; 34:1164-1173. [PMID: 29186331 PMCID: PMC5998943 DOI: 10.1093/bioinformatics/btx731] [Citation(s) in RCA: 107] [Impact Index Per Article: 17.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2017] [Accepted: 11/23/2017] [Indexed: 02/06/2023] Open
Abstract
Motivation Finding computationally drug–target interactions (DTIs) is a convenient strategy to identify new DTIs at low cost with reasonable accuracy. However, the current DTI prediction methods suffer the high false positive prediction rate. Results We developed DDR, a novel method that improves the DTI prediction accuracy. DDR is based on the use of a heterogeneous graph that contains known DTIs with multiple similarities between drugs and multiple similarities between target proteins. DDR applies non-linear similarity fusion method to combine different similarities. Before fusion, DDR performs a pre-processing step where a subset of similarities is selected in a heuristic process to obtain an optimized combination of similarities. Then, DDR applies a random forest model using different graph-based features extracted from the DTI heterogeneous graph. Using 5-repeats of 10-fold cross-validation, three testing setups, and the weighted average of area under the precision-recall curve (AUPR) scores, we show that DDR significantly reduces the AUPR score error relative to the next best start-of-the-art method for predicting DTIs by 31% when the drugs are new, by 23% when targets are new and by 34% when the drugs and the targets are known but not all DTIs between them are not known. Using independent sources of evidence, we verify as correct 22 out of the top 25 DDR novel predictions. This suggests that DDR can be used as an efficient method to identify correct DTIs. Availability and implementation The data and code are provided at https://bitbucket.org/RSO24/ddr/. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Rawan S Olayan
- King Abdullah University of Science and Technology (KAUST), Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, Thuwal, Saudi Arabia
| | - Haitham Ashoor
- The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut 06032, USA
| | - Vladimir B Bajic
- King Abdullah University of Science and Technology (KAUST), Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, Thuwal, Saudi Arabia
| |
Collapse
|
268
|
Su C, Tong J, Zhu Y, Cui P, Wang F. Network embedding in biomedical data science. Brief Bioinform 2018; 21:182-197. [PMID: 30535359 DOI: 10.1093/bib/bby117] [Citation(s) in RCA: 71] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2018] [Revised: 10/08/2018] [Accepted: 11/03/2018] [Indexed: 12/15/2022] Open
Abstract
AbstractOwning to the rapid development of computer technologies, an increasing number of relational data have been emerging in modern biomedical research. Many network-based learning methods have been proposed to perform analysis on such data, which provide people a deep understanding of topology and knowledge behind the biomedical networks and benefit a lot of applications for human healthcare. However, most network-based methods suffer from high computational and space cost. There remain challenges on handling high dimensionality and sparsity of the biomedical networks. The latest advances in network embedding technologies provide new effective paradigms to solve the network analysis problem. It converts network into a low-dimensional space while maximally preserves structural properties. In this way, downstream tasks such as link prediction and node classification can be done by traditional machine learning methods. In this survey, we conduct a comprehensive review of the literature on applying network embedding to advance the biomedical domain. We first briefly introduce the widely used network embedding models. After that, we carefully discuss how the network embedding approaches were performed on biomedical networks as well as how they accelerated the downstream tasks in biomedical science. Finally, we discuss challenges the existing network embedding applications in biomedical domains are faced with and suggest several promising future directions for a better improvement in human healthcare.
Collapse
Affiliation(s)
- Chang Su
- Department of Healthcare Policy and Research, Weill Cornell Medicine at Cornell University, New York, NY, USA
| | - Jie Tong
- Department of Mechanical and Aerospace Engineering at New York University, New York, NY, USA
| | - Yongjun Zhu
- Department of Library and Information Science, Sungkyunkwan University, Seoul, South Korea
| | - Peng Cui
- Department of Computer Science and Technology, Tsinghua University, Beijing, China
| | - Fei Wang
- Department of Healthcare Policy and Research, Weill Cornell Medicine at Cornell University, New York, NY, USA
| |
Collapse
|
269
|
Wang Q, Xu R. Disease comorbidity-guided drug repositioning: a case study in schizophrenia. AMIA ... ANNUAL SYMPOSIUM PROCEEDINGS. AMIA SYMPOSIUM 2018; 2018:1300-1309. [PMID: 30815174 PMCID: PMC6371343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
The key to any computational drug repositioning is the availability of relevant data in machine-understandable format. While large amount of genetic, genomic and chemical data are publicly available, large-scale higher-level disease and drug phenotypic data are limited. We recently constructed a large-scale disease-comorbidity relationship knowledge base (dCombKB) and a comprehensive drug-treatment relationship knowledge base (TreatKB) from 21 million biomedical research articles and other resources. In this study, we demonstrated the potential of dCombKB and TreatKB in drug repositioning for schizophrenia, one of the top ten illnesses contributing to the global burden of disease. dCombKB contains 121,359 unique disease-disease comorbidity pairs for 23,041 diseases. TreatKB contains 208,330 unique drug-disease treatment pairs for 2,484 drugs and 24,511 diseases. We constructed a phenotypic comorbidity disease network (PDN) of 14,645 disease nodes and 101,275 edges based on dCombKB. We applied standard network-based ranking algorithm to find diseases that are phenotypically related to SCZ. We developed a drug prioritization system, PhenoPredict-CDN, to systematically reposition drugs for SCZ from diseases phenotypically related to SCZ. PhenoPredict-CDN found all 18 FDA-approved SCZ drugs and ranked them highly as tested in a de-novo validation setting (recall: 1.0, mean ranking: top 6.05%, median ranking: top 1.65%). When compared to PREDICT, a comprehensive drug repositioning system, for novel predictions, Pheno-Predict-CDN outperformed PREDICT in Precision-Recall (PR) curves across three different evaluation datasets. Compared to PREDICT, PhenoPredict-CDN showed a significant 110.0-230.0% improvements in mean average precision. In summary, large-scale higher-level disease-comorbidity relationships data extracted from biomedical literature has potential in drug discovery for SCZ, a complex disease with unknown pathophysiological mechanisms. All the data are publicly available: dCombKB at http://nlp. CASE edu/public/data/dCombKB, TreatKB at http://nlp. CASE edu/public/data/treatKB/, and predictions for SCZ at http://nlp. CASE edu/public/data/SCZ_CDN/.
Collapse
Affiliation(s)
- QuanQiu Wang
- Department of Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, Cleveland OH 44106
| | - Rong Xu
- Department of Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, Cleveland OH 44106
| |
Collapse
|
270
|
Hankosky ER, Bush HM, Dwoskin LP, Harris DR, Henderson DW, Zhang GQ, Freeman PR, Talbert JC. Retrospective analysis of health claims to evaluate pharmacotherapies with potential for repurposing: Association of bupropion and stimulant use disorder remission. AMIA ... ANNUAL SYMPOSIUM PROCEEDINGS. AMIA SYMPOSIUM 2018; 2018:1292-1299. [PMID: 30815171 PMCID: PMC6371318] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Drug repurposing is the identification of novel indication(s) for existing medications. Health claims data provide a burgeoning resource to evaluate pharmacotherapies with repurposing potential. To demonstrate a workflow for drug repurposing using claims data, we assessed the association between prescription of bupropion and stimulant use disorder (StUD) remission. Using the Truven Marketscan database, 96,156 individuals with a StUD were identified. Logistic regression was used to model the association between new bupropion prescriptions and remission while controlling for age, sex, region, StUD severity, antidepressant co-prescriptions, and comorbid mood and attention disorders. Prescription of bupropion within 30 days offirst documented StUD diagnosis increased odds of a subsequent remission diagnosis by 2.1 times (99% confidence interval: 1.09-3.89) in individuals with an amphetamine use disorder, but not those with a cocaine use disorder. This work provides a framework for reverse-translational drug repurposing, which may be applied to many other medical conditions.
Collapse
|
271
|
Adverse Drug Reaction Predictions Using Stacking Deep Heterogeneous Information Network Embedding Approach. Molecules 2018; 23:molecules23123193. [PMID: 30518099 PMCID: PMC6320974 DOI: 10.3390/molecules23123193] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2018] [Revised: 11/30/2018] [Accepted: 11/30/2018] [Indexed: 12/14/2022] Open
Abstract
Inferring potential adverse drug reactions is an important and challenging task for the drug discovery and healthcare industry. Many previous studies in computational pharmacology have proposed utilizing multi-source drug information to predict drug side effects have and achieved initial success. However, most of the prediction methods mainly rely on direct similarities inferred from drug information and cannot fully utilize the drug information about the impact of protein–protein interactions (PPI) on potential drug targets. Moreover, most of the methods are designed for specific tasks. In this work, we propose a novel heterogeneous network embedding approach for learning drug representations called SDHINE, which integrates PPI information into drug embeddings and is generic for different adverse drug reaction (ADR) prediction tasks. To integrate heterogeneous drug information and learn drug representations, we first design different meta-path-based proximities to calculate drug similarities, especially target propagation meta-path-based proximity based on PPI network, and then construct a semi-supervised stacking deep neural network model that is jointly optimized by the defined meta-path proximities. Extensive experiments with three state-of-the-art network embedding methods on three ADR prediction tasks demonstrate the effectiveness of the SDHINE model. Furthermore, we compare the drug representations in terms of drug differentiation by mapping the representations into 2D space; the results show that the performance of our approach is superior to that of the comparison methods.
Collapse
|
272
|
Lee T, Yoon Y. Drug repositioning using drug-disease vectors based on an integrated network. BMC Bioinformatics 2018; 19:446. [PMID: 30463505 PMCID: PMC6249928 DOI: 10.1186/s12859-018-2490-x] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2018] [Accepted: 11/12/2018] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Diverse interactions occur between biomolecules, such as activation, inhibition, expression, or repression. However, previous network-based studies of drug repositioning have employed interaction on the binary protein-protein interaction (PPI) network without considering the characteristics of the interactions. Recently, some studies of drug repositioning using gene expression data found that associations between drug and disease genes are useful information for identifying novel drugs to treat diseases. However, the gene expression profiles for drugs and diseases are not always available. Although gene expression profiles of drugs and diseases are available, existing methods cannot use the drugs or diseases, when differentially expressed genes in the profiles are not included in their network. RESULTS We developed a novel method for identifying candidate indications of existing drugs considering types of interactions between biomolecules based on known drug-disease associations. To obtain associations between drug and disease genes, we constructed a directed network using protein interaction and gene regulation data obtained from various public databases providing diverse biological pathways. The network includes three types of edges depending on relationships between biomolecules. To quantify the association between a target gene and a disease gene, we explored the shortest paths from the target gene to the disease gene and calculated the types and weights of the shortest paths. For each drug-disease pair, we built a vector consisting of values for each disease gene influenced by the drug. Using the vectors and known drug-disease associations, we constructed classifiers to identify novel drugs for each disease. CONCLUSION We propose a method for exploring candidate drugs of diseases using associations between drugs and disease genes derived from a directed gene network instead of gene regulation data obtained from gene expression profiles. Compared to existing methods that require information on gene relationships and gene expression data, our method can be applied to a greater number of drugs and diseases. Furthermore, to validate our predictions, we compared the predictions with drug-disease pairs in clinical trials using the hypergeometric test, which showed significant results. Our method also showed better performance compared to existing methods for the area under the receiver operating characteristic curve (AUC).
Collapse
Affiliation(s)
- Taekeon Lee
- Department of Computer Engineering, Gachon University, 5-22Ho, IT college, 1324 Seongnam-daero, Seongnam-si, 13120 South Korea
| | - Youngmi Yoon
- Department of Computer Engineering, Gachon University, 5-22Ho, IT college, 1324 Seongnam-daero, Seongnam-si, 13120 South Korea
| |
Collapse
|
273
|
Wang F, Lei X, Wu FX. A Review of Drug Repositioning Based Chemical-induced Cell Line Expression Data. Curr Med Chem 2018; 27:5340-5350. [PMID: 30381060 DOI: 10.2174/0929867325666181101115801] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2017] [Revised: 08/10/2018] [Accepted: 10/21/2018] [Indexed: 12/14/2022]
Abstract
Drug repositioning is an important area of biomedical research. The drug repositioning studies have shifted to computational approaches. Large-scale perturbation databases, such as the Connectivity Map and the Library of Integrated Network-Based Cellular Signatures, contain a number of chemical-induced gene expression profiles and provide great opportunities for computational biology and drug repositioning. One reason is that the profiles provided by the Connectivity Map and the Library of Integrated Network-Based Cellular Signatures databases show an overall view of biological mechanism in drugs, diseases and genes. In this article, we provide a review of the two databases and their recent applications in drug repositioning.
Collapse
Affiliation(s)
- Fei Wang
- Division of Biomedical Engineering, University of Saskatchewan, Saskatoon, Canada
| | - Xiujuan Lei
- School of Computer Science, Shaanxi Normal University, Xi'an, Shaanxi, China
| | - Fang-Xiang Wu
- School of Computer Science, Shaanxi Normal University, Xi'an, Shaanxi, China
| |
Collapse
|
274
|
Brown AS, Patel CJ. A review of validation strategies for computational drug repositioning. Brief Bioinform 2018; 19:174-177. [PMID: 27881429 PMCID: PMC5862266 DOI: 10.1093/bib/bbw110] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2016] [Indexed: 12/15/2022] Open
Abstract
Repositioning of previously approved drugs is a promising methodology because it reduces the cost and duration of the drug development pipeline and reduces the likelihood of unforeseen adverse events. Computational repositioning is especially appealing because of the ability to rapidly screen candidates in silico and to reduce the number of possible repositioning candidates. What is unclear, however, is how useful such methods are in producing clinically efficacious repositioning hypotheses. Furthermore, there is no agreement in the field over the proper way to perform validation of in silico predictions, and in fact no systematic review of repositioning validation methodologies. To address this unmet need, we review the computational repositioning literature and capture studies in which authors claimed to have validated their work. Our analysis reveals widespread variation in the types of strategies, predictions made and databases used as ‘gold standards’. We highlight a key weakness of the most commonly used strategy and propose a path forward for the consistent analytic validation of repositioning techniques.
Collapse
Affiliation(s)
- Adam S Brown
- Department of Biomedical Informatics, Harvard Medical School, 10 Shattuck St, Boston, MA, USA
| | - Chirag J Patel
- Department of Biomedical Informatics, Harvard Medical School, 10 Shattuck St, Boston, MA, USA
- Corresponding author: Chirag J. Patel, Department of Biomedical Informatics, Harvard Medical School, 10 Shattuck St, Boston, MA 02115, USA. Tel.: (617) 432 1195; E-mail:
| |
Collapse
|
275
|
Fang J, Cai C, Chai Y, Zhou J, Huang Y, Gao L, Wang Q, Cheng F. Quantitative and systems pharmacology 4. Network-based analysis of drug pleiotropy on coronary artery disease. Eur J Med Chem 2018; 161:192-204. [PMID: 30359818 DOI: 10.1016/j.ejmech.2018.10.020] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2018] [Revised: 09/26/2018] [Accepted: 10/09/2018] [Indexed: 12/14/2022]
Abstract
Despite recent advance of therapeutic development, coronary artery disease (CAD) remains one of the major issues to public health. The use of genomics and systems biology approaches to inform drug discovery and development have offered the possibilities for new target identification and in silico drug repurposing. In this study, we propose a network-based, systems pharmacology framework for target identification and drug repurposing in pharmacologic treatment and chemoprevention of CAD. Specifically, we build in silico models by integrating known drug-target interactions, CAD genes derived from the genetic and genomic studies, and the human protein-protein interactome. We demonstrate that the proposed in silico models can successfully uncover approved drugs and novel natural products in potentially treating and preventing CAD. In case studies, we highlight several approved drugs (e.g., fasudil, parecoxib, and dexamethasone) or natural products (e.g., resveratrol, luteolin, daidzein and caffeic acid) with new mechanism-of-action in chemical intervention of CAD by network analysis. In summary, this study offers a powerful systems pharmacology approach for target identification and in silico drug repurposing on CAD.
Collapse
Affiliation(s)
- Jiansong Fang
- Institute of Clinical Pharmacology, Guangzhou University of Chinese Medicine, Guangzhou 510405, China
| | - Chuipu Cai
- Institute of Clinical Pharmacology, Guangzhou University of Chinese Medicine, Guangzhou 510405, China
| | - Yanting Chai
- Institute of Clinical Pharmacology, Guangzhou University of Chinese Medicine, Guangzhou 510405, China
| | - Jingwei Zhou
- Institute of Clinical Pharmacology, Guangzhou University of Chinese Medicine, Guangzhou 510405, China
| | - Yujie Huang
- Institute of Clinical Pharmacology, Guangzhou University of Chinese Medicine, Guangzhou 510405, China
| | - Li Gao
- Modern Research Center for Traditional Chinese Medicine, Shanxi University, Taiyuan, China
| | - Qi Wang
- Institute of Clinical Pharmacology, Guangzhou University of Chinese Medicine, Guangzhou 510405, China
| | - Feixiong Cheng
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44106, USA; Department of Molecular Medicine, Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, OH 44195, USA; CASE Comprehensive Cancer Center, Case Western Reserve University School of Medicine, Cleveland, OH 44106, USA.
| |
Collapse
|
276
|
Hosomi K, Fujimoto M, Ushio K, Mao L, Kato J, Takada M. An integrative approach using real-world data to identify alternative therapeutic uses of existing drugs. PLoS One 2018; 13:e0204648. [PMID: 30300381 PMCID: PMC6177143 DOI: 10.1371/journal.pone.0204648] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2018] [Accepted: 09/12/2018] [Indexed: 12/14/2022] Open
Abstract
Different computational approaches are employed to efficiently identify novel repositioning possibilities utilizing different sources of information and algorithms. It is critical to propose high-valued candidate-repositioning possibilities before conducting lengthy in vivo validation studies that consume significant resources. Here we report a novel multi-methodological approach to identify opportunities for drug repositioning. We performed analyses of real-world data (RWD) acquired from the United States Food and Drug Administration’s Adverse Event Reporting System (FAERS) and the claims database maintained by the Japan Medical Data Center (JMDC). These analyses were followed by cross-validation through bioinformatics analyses of gene expression data. Inverse associations revealed using disproportionality analysis (DPA) and sequence symmetry analysis (SSA) were used to detect potential drug-repositioning signals. To evaluate the validity of the approach, we conducted a feasibility study to identify marketed drugs with the potential for treating inflammatory bowel disease (IBD). Primary analyses of the FAERS and JMDC claims databases identified psycholeptics such as haloperidol, diazepam, and hydroxyzine as candidates that may improve the treatment of IBD. To further investigate the mechanistic relevance between hit compounds and disease pathology, we conducted bioinformatics analyses of the associations of the gene expression profiles of these compounds with disease. We identified common biological features among genes differentially expressed with or without compound treatment as well as disease-perturbation data available from open sources, which strengthened the mechanistic rationale of our initial findings. We further identified pathways such as cytokine signaling that are influenced by these drugs. These pathways are relevant to pathologies and can serve as alternative targets of therapy. Integrative analysis of RWD such as those available from adverse-event databases, claims databases, and transcriptome analyses represent an effective approach that adds value to efficiently identifying potential novel therapeutic opportunities.
Collapse
Affiliation(s)
- Kouichi Hosomi
- Division of Clinical Drug Informatics, School of Pharmacy, Kindai University, Kowakae, Higashi-osaka, Osaka, Japan
| | - Mai Fujimoto
- Division of Clinical Drug Informatics, School of Pharmacy, Kindai University, Kowakae, Higashi-osaka, Osaka, Japan
| | - Kazutaka Ushio
- Innovation and Entrepreneurship, Research, Takeda Pharmaceutical Company Limited, Muraoka-Higashi, 2- Chome, Fujisawa, Kanagawa, Japan
| | - Lili Mao
- Innovation and Entrepreneurship, Research, Takeda Pharmaceutical Company Limited, Muraoka-Higashi, 2- Chome, Fujisawa, Kanagawa, Japan
| | - Juran Kato
- Innovation and Entrepreneurship, Research, Takeda Pharmaceutical Company Limited, Muraoka-Higashi, 2- Chome, Fujisawa, Kanagawa, Japan
| | - Mitsutaka Takada
- Division of Clinical Drug Informatics, School of Pharmacy, Kindai University, Kowakae, Higashi-osaka, Osaka, Japan
- * E-mail:
| |
Collapse
|
277
|
Jang G, Lee T, Hwang S, Park C, Ahn J, Seo S, Hwang Y, Yoon Y. PISTON: Predicting drug indications and side effects using topic modeling and natural language processing. J Biomed Inform 2018; 87:96-107. [PMID: 30268842 DOI: 10.1016/j.jbi.2018.09.015] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2018] [Revised: 08/23/2018] [Accepted: 09/27/2018] [Indexed: 02/07/2023]
Abstract
The process of discovering novel drugs to treat diseases requires a long time and high cost. It is important to understand side effects of drugs as well as their therapeutic effects, because these can seriously damage the patients due to unexpected actions of the derived candidate drugs. In order to overcome these limitations, computational methods for predicting the therapeutic effects and side effects have been proposed. In particular, text mining is a widely used technique in the field of systems biology, because it can discover hidden relationships between drugs, genes and diseases from a large amount of literature data. Compared with in vivo/in vitro experiments, text mining derives meaningful results with less time and cost. In this study, we propose an algorithm for predicting novel drug-phenotype associations and drug-side effect associations using topic modeling and natural language processing (NLP). We extract sentences in which drugs and genes co-occur from the abstracts of the literature and identify words that describe the relationship between them using NLP. Considering the characteristics of the identified words, we determine if the drug has an up-regulation effect or a down-regulation effect on the gene. Based on genes that affect drugs and their regulatory relationships, we group the frequently occurring genes and regulatory relationships into topics, and build a drug-topic probability matrix by calculating the score that the drug will have a topic using topic modeling. Using the matrix, a classifier is constructed for predicting the novel indications and side effects of drugs considering the characteristics of known drug-phenotype associations or drug-side effect associations. The proposed method predicts both indications and side effects with a single algorithm, and it can exclude drugs with serious side effects or side effects that patients do not want to experience from among the candidate drugs provided for the treatment of the phenotype. Furthermore, lists of novel candidate drugs for phenotypes and side effects can be continuously updated with our algorithm every time a document is added. More than a thousand documents are produced per day, and it is possible for our algorithm to efficiently derive candidate drugs because it requires less cost than the existing drug repositioning methods. The resource of PISTON is available at databio.gachon.ac.kr/tools/PISTON.
Collapse
Affiliation(s)
- Giup Jang
- Department of IT Convergence Engineering, Gachon University, Seongnam, Republic of Korea
| | - Taekeon Lee
- Department of Computer Engineering, Gachon University, Seongnam, Republic of Korea
| | - Soyoun Hwang
- Department of IT Convergence Engineering, Gachon University, Seongnam, Republic of Korea
| | - Chihyun Park
- Department of Computer Science, Yonsei University, Seoul, Republic of Korea
| | - Jaegyoon Ahn
- Department of Computer Science and Engineering, Incheon University, Incheon, Republic of Korea
| | - Sukyung Seo
- Department of Computer Engineering, Gachon University, Seongnam, Republic of Korea
| | - Youhyeon Hwang
- Department of Computer Science, University of Southern California, Los Angeles, USA
| | - Youngmi Yoon
- Department of Computer Engineering, Gachon University, Seongnam, Republic of Korea.
| |
Collapse
|
278
|
Chen R, Liu X, Jin S, Lin J, Liu J. Machine Learning for Drug-Target Interaction Prediction. Molecules 2018; 23:E2208. [PMID: 30200333 PMCID: PMC6225477 DOI: 10.3390/molecules23092208] [Citation(s) in RCA: 120] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2018] [Revised: 08/27/2018] [Accepted: 08/27/2018] [Indexed: 12/18/2022] Open
Abstract
Identifying drug-target interactions will greatly narrow down the scope of search of candidate medications, and thus can serve as the vital first step in drug discovery. Considering that in vitro experiments are extremely costly and time-consuming, high efficiency computational prediction methods could serve as promising strategies for drug-target interaction (DTI) prediction. In this review, our goal is to focus on machine learning approaches and provide a comprehensive overview. First, we summarize a brief list of databases frequently used in drug discovery. Next, we adopt a hierarchical classification scheme and introduce several representative methods of each category, especially the recent state-of-the-art methods. In addition, we compare the advantages and limitations of methods in each category. Lastly, we discuss the remaining challenges and future outlook of machine learning in DTI prediction. This article may provide a reference and tutorial insights on machine learning-based DTI prediction for future researchers.
Collapse
Affiliation(s)
- Ruolan Chen
- Department of Computer Science, School of Information Science and Technology, Xiamen University, Xiamen 361005, China.
| | - Xiangrong Liu
- Department of Computer Science, School of Information Science and Technology, Xiamen University, Xiamen 361005, China.
| | - Shuting Jin
- Department of Computer Science, School of Information Science and Technology, Xiamen University, Xiamen 361005, China.
| | - Jiawei Lin
- Department of Computer Science, School of Information Science and Technology, Xiamen University, Xiamen 361005, China.
| | - Juan Liu
- Department of Instrumental and Electrical Engineering, School of Aerospace Engineering, Xiamen University, Xiamen 361005, China.
| |
Collapse
|
279
|
Wang C, Kurgan L. Review and comparative assessment of similarity-based methods for prediction of drug–protein interactions in the druggable human proteome. Brief Bioinform 2018; 20:2066-2087. [DOI: 10.1093/bib/bby069] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2018] [Revised: 06/26/2018] [Accepted: 07/10/2018] [Indexed: 12/18/2022] Open
Abstract
AbstractDrug–protein interactions (DPIs) underlie the desired therapeutic actions and the adverse side effects of a significant majority of drugs. Computational prediction of DPIs facilitates research in drug discovery, characterization and repurposing. Similarity-based methods that do not require knowledge of protein structures are particularly suitable for druggable genome-wide predictions of DPIs. We review 35 high-impact similarity-based predictors that were published in the past decade. We group them based on three types of similarities and their combinations that they use. We discuss and compare key aspects of these methods including source databases, internal databases and their predictive models. Using our novel benchmark database, we perform comparative empirical analysis of predictive performance of seven types of representative predictors that utilize each type of similarity individually and all possible combinations of similarities. We assess predictive quality at the database-wide DPI level and we are the first to also include evaluation over individual drugs. Our comprehensive analysis shows that predictors that use more similarity types outperform methods that employ fewer similarities, and that the model combining all three types of similarities secures area under the receiver operating characteristic curve of 0.93. We offer a comprehensive analysis of sensitivity of predictive performance to intrinsic and extrinsic characteristics of the considered predictors. We find that predictive performance is sensitive to low levels of similarities between sequences of the drug targets and several extrinsic properties of the input drug structures, drug profiles and drug targets. The benchmark database and a webserver for the seven predictors are freely available at http://biomine.cs.vcu.edu/servers/CONNECTOR/.
Collapse
Affiliation(s)
- Chen Wang
- Computer Science Department, Virginia Commonwealth University, Richmond, VA 23284, USA
| | - Lukasz Kurgan
- Computer Science Department, Virginia Commonwealth University, Richmond, VA 23284, USA
| |
Collapse
|
280
|
|
281
|
De Bastiani MA, Pfaffenseller B, Klamt F. Master Regulators Connectivity Map: A Transcription Factors-Centered Approach to Drug Repositioning. Front Pharmacol 2018; 9:697. [PMID: 30034338 PMCID: PMC6043797 DOI: 10.3389/fphar.2018.00697] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2018] [Accepted: 06/08/2018] [Indexed: 01/09/2023] Open
Abstract
Drug discovery is a very expensive and time-consuming endeavor. Fortunately, recent omics technologies and Systems Biology approaches introduced interesting new tools to achieve this task, facilitating the repurposing of already known drugs to new therapeutic assignments using gene expression data and bioinformatics. The inherent role of transcription factors in gene expression modulation makes them strong candidates for master regulators of phenotypic transitions. However, transcription factors expression itself usually does not reflect its activity changes due to post-transcriptional modifications and other complications. In this aspect, the use of high-throughput transcriptomic data may be employed to infer transcription factors-targets interactions and assess their activity through co-expression networks, which can be further used to search for drugs capable of reverting the gene expression profile of pathological phenotypes employing the connectivity maps paradigm. Following this idea, we argue that a module-oriented connectivity map approach using transcription factors-centered networks would aid the query for new repositioning candidates. Through a brief case study, we explored this idea in bipolar disorder, retrieving known drugs used in the usual clinical scenario as well as new candidates with potential therapeutic application in this disease. Indeed, the results of the case study indicate just how promising our approach may be to drug repositioning.
Collapse
Affiliation(s)
- Marco A De Bastiani
- Laboratory of Cellular Biochemistry, Department of Biochemistry, Federal University of Rio Grande do Sul, Porto Alegre, Brazil.,National Institute of Science and Technology for Translational Medicine, Porto Alegre, Brazil
| | - Bianca Pfaffenseller
- Laboratory of Cellular Biochemistry, Department of Biochemistry, Federal University of Rio Grande do Sul, Porto Alegre, Brazil.,Laboratory of Molecular Psychiatry, Clinicas Hospital of Porto Alegre, Federal University of Rio Grande do Sul, Porto Alegre, Brazil
| | - Fabio Klamt
- Laboratory of Cellular Biochemistry, Department of Biochemistry, Federal University of Rio Grande do Sul, Porto Alegre, Brazil.,National Institute of Science and Technology for Translational Medicine, Porto Alegre, Brazil
| |
Collapse
|
282
|
Qiao S, Koyuturk M, Ozsoyoglu MZ. Querying of Disparate Association and Interaction Data in Biomedical Applications. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 15:1052-1065. [PMID: 27959818 DOI: 10.1109/tcbb.2016.2637344] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
In biomedical applications, network models are commonly used to represent interactions and higher-level associations among biological entities. Integrated analyses of these interaction and association data has proven useful in extracting knowledge, and generating novel hypotheses for biomedical research. However, since most datasets provide their own schema and query interface, opportunities for exploratory and integrative querying of disparate data are currently limited. In this study, we utilize RDF-based representations of biomedical interaction and association data to develop a querying framework that enables flexible specification and efficient processing of graph template matching queries. The proposed framework enables integrative querying of biomedical databases to discover complex patterns of associations among a diverse range of biological entities, including biomolecules, biological processes, organisms, and phenotypes. Our experimental results on the UniProt dataset show that the proposed framework can be used to efficiently process complex queries, and identify biologically relevant patterns of associations that cannot be readily obtained by querying each dataset independently.
Collapse
|
283
|
Glaab E. Computational systems biology approaches for Parkinson's disease. Cell Tissue Res 2018; 373:91-109. [PMID: 29185073 PMCID: PMC6015628 DOI: 10.1007/s00441-017-2734-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2017] [Accepted: 11/06/2017] [Indexed: 12/26/2022]
Abstract
Parkinson's disease (PD) is a prime example of a complex and heterogeneous disorder, characterized by multifaceted and varied motor- and non-motor symptoms and different possible interplays of genetic and environmental risk factors. While investigations of individual PD-causing mutations and risk factors in isolation are providing important insights to improve our understanding of the molecular mechanisms behind PD, there is a growing consensus that a more complete understanding of these mechanisms will require an integrative modeling of multifactorial disease-associated perturbations in molecular networks. Identifying and interpreting the combinatorial effects of multiple PD-associated molecular changes may pave the way towards an earlier and reliable diagnosis and more effective therapeutic interventions. This review provides an overview of computational systems biology approaches developed in recent years to study multifactorial molecular alterations in complex disorders, with a focus on PD research applications. Strengths and weaknesses of different cellular pathway and network analyses, and multivariate machine learning techniques for investigating PD-related omics data are discussed, and strategies proposed to exploit the synergies of multiple biological knowledge and data sources. A final outlook provides an overview of specific challenges and possible next steps for translating systems biology findings in PD to new omics-based diagnostic tools and targeted, drug-based therapeutic approaches.
Collapse
Affiliation(s)
- Enrico Glaab
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, 7 avenue des Hauts Fourneaux, L-4362, Esch-sur-Alzette, Luxembourg.
| |
Collapse
|
284
|
Wakharde AA, Halbandge SD, Phule DB, Karuppayil SM. Anticancer Drugs as Antibiofilm Agents in Candida albicans: Potential Targets. Assay Drug Dev Technol 2018; 16:232-246. [DOI: 10.1089/adt.2017.826] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
Affiliation(s)
- Archana Anandrao Wakharde
- School of Life Sciences (DST-FIST and UGC-SAP Sponsored) SRTM University (NAAC Accredited with “A” grade), Nanded, Maharashtra, India
| | - Shivkrupa Devrao Halbandge
- School of Life Sciences (DST-FIST and UGC-SAP Sponsored) SRTM University (NAAC Accredited with “A” grade), Nanded, Maharashtra, India
| | - Datta Baburao Phule
- School of Life Sciences (DST-FIST and UGC-SAP Sponsored) SRTM University (NAAC Accredited with “A” grade), Nanded, Maharashtra, India
| | - Sankunny Mohan Karuppayil
- School of Life Sciences (DST-FIST and UGC-SAP Sponsored) SRTM University (NAAC Accredited with “A” grade), Nanded, Maharashtra, India
| |
Collapse
|
285
|
Yella JK, Yaddanapudi S, Wang Y, Jegga AG. Changing Trends in Computational Drug Repositioning. Pharmaceuticals (Basel) 2018; 11:E57. [PMID: 29874824 PMCID: PMC6027196 DOI: 10.3390/ph11020057] [Citation(s) in RCA: 105] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2018] [Revised: 06/01/2018] [Accepted: 06/02/2018] [Indexed: 12/12/2022] Open
Abstract
Efforts to maximize the indications potential and revenue from drugs that are already marketed are largely motivated by what Sir James Black, a Nobel Prize-winning pharmacologist advocated-"The most fruitful basis for the discovery of a new drug is to start with an old drug". However, rational design of drug mixtures poses formidable challenges because of the lack of or limited information about in vivo cell regulation, mechanisms of genetic pathway activation, and in vivo pathway interactions. Hence, most of the successfully repositioned drugs are the result of "serendipity", discovered during late phase clinical studies of unexpected but beneficial findings. The connections between drug candidates and their potential adverse drug reactions or new applications are often difficult to foresee because the underlying mechanism associating them is largely unknown, complex, or dispersed and buried in silos of information. Discovery of such multi-domain pharmacomodules-pharmacologically relevant sub-networks of biomolecules and/or pathways-from collection of databases by independent/simultaneous mining of multiple datasets is an active area of research. Here, while presenting some of the promising bioinformatics approaches and pipelines, we summarize and discuss the current and evolving landscape of computational drug repositioning.
Collapse
Affiliation(s)
- Jaswanth K Yella
- Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, 240 Albert Sabin Way MLC 7024, Cincinnati, OH 45229, USA.
| | - Suryanarayana Yaddanapudi
- Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, 240 Albert Sabin Way MLC 7024, Cincinnati, OH 45229, USA.
| | - Yunguan Wang
- Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, 240 Albert Sabin Way MLC 7024, Cincinnati, OH 45229, USA.
| | - Anil G Jegga
- Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, 240 Albert Sabin Way MLC 7024, Cincinnati, OH 45229, USA.
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH 45267, USA.
- Department of Computer Science, University of Cincinnati College of Engineering, Cincinnati, OH 45219, USA.
| |
Collapse
|
286
|
Bakal G, Talari P, Kakani EV, Kavuluru R. Exploiting semantic patterns over biomedical knowledge graphs for predicting treatment and causative relations. J Biomed Inform 2018; 82:189-199. [PMID: 29763706 PMCID: PMC6070294 DOI: 10.1016/j.jbi.2018.05.003] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2017] [Revised: 01/31/2018] [Accepted: 05/09/2018] [Indexed: 01/27/2023]
Abstract
BACKGROUND Identifying new potential treatment options for medical conditions that cause human disease burden is a central task of biomedical research. Since all candidate drugs cannot be tested with animal and clinical trials, in vitro approaches are first attempted to identify promising candidates. Likewise, identifying different causal relations between biomedical entities is also critical to understand biomedical processes. Generally, natural language processing (NLP) and machine learning are used to predict specific relations between any given pair of entities using the distant supervision approach. OBJECTIVE To build high accuracy supervised predictive models to predict previously unknown treatment and causative relations between biomedical entities based only on semantic graph pattern features extracted from biomedical knowledge graphs. METHODS We used 7000 treats and 2918 causes hand-curated relations from the UMLS Metathesaurus to train and test our models. Our graph pattern features are extracted from simple paths connecting biomedical entities in the SemMedDB graph (based on the well-known SemMedDB database made available by the U.S. National Library of Medicine). Using these graph patterns connecting biomedical entities as features of logistic regression and decision tree models, we computed mean performance measures (precision, recall, F-score) over 100 distinct 80-20% train-test splits of the datasets. For all experiments, we used a positive:negative class imbalance of 1:10 in the test set to model relatively more realistic scenarios. RESULTS Our models predict treats and causes relations with high F-scores of 99% and 90% respectively. Logistic regression model coefficients also help us identify highly discriminative patterns that have an intuitive interpretation. We are also able to predict some new plausible relations based on false positives that our models scored highly based on our collaborations with two physician co-authors. Finally, our decision tree models are able to retrieve over 50% of treatment relations from a recently created external dataset. CONCLUSIONS We employed semantic graph patterns connecting pairs of candidate biomedical entities in a knowledge graph as features to predict treatment/causative relations between them. We provide what we believe is the first evidence in direct prediction of biomedical relations based on graph features. Our work complements lexical pattern based approaches in that the graph patterns can be used as additional features for weakly supervised relation prediction.
Collapse
Affiliation(s)
- Gokhan Bakal
- Department of Computer Science, University of Kentucky, United States.
| | - Preetham Talari
- Division of Hospital Medicine, Department of Internal Medicine, University of Kentucky, United States.
| | - Elijah V Kakani
- Division of Hospital Medicine, Department of Internal Medicine, University of Kentucky, United States.
| | - Ramakanth Kavuluru
- Division of Biomedical Informatics, Department of Internal Medicine, University of Kentucky, United States; Department of Computer Science, University of Kentucky, United States.
| |
Collapse
|
287
|
Yu AZ, Ramsey SA. A Computational Systems Biology Approach for Identifying Candidate Drugs for Repositioning for Cardiovascular Disease. Interdiscip Sci 2018; 10:449-454. [PMID: 27778232 PMCID: PMC5403631 DOI: 10.1007/s12539-016-0194-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2016] [Revised: 10/10/2016] [Accepted: 10/13/2016] [Indexed: 10/20/2022]
Abstract
We report an in silico method to screen for receptors or pathways that could be targeted to elicit beneficial transcriptional changes in a cellular model of a disease of interest. In our method, we integrate: (1) a dataset of transcriptome responses of a cell line to a panel of drugs; (2) two sets of genes for the disease; and (3) mappings between drugs and the receptors or pathways that they target. We carried out a gene set enrichment analysis (GSEA) test for each of the two gene sets against a list of genes ordered by fold-change in response to a drug in a relevant cell line (HL60), with the overall score for a drug being the difference of the two enrichment scores. Next, we applied GSEA for drug targets based on drugs that have been ranked by their differential enrichment scores. The method ranks drugs by the degree of anti-correlation of their gene-level transcriptional effects on the cell line with the genes in the disease gene sets. We applied the method to data from (1) CMap 2.0; (2) gene sets from two transcriptome profiling studies of atherosclerosis; and (3) a combined dataset of drug/target information. Our analysis recapitulated known targets related to CVD (e.g., PPARγ; HMG-CoA reductase, HDACs) and novel targets (e.g., amine oxidase A, δ-opioid receptor). We conclude that combining disease-associated gene sets, drug-transcriptome-responses datasets and drug-target annotations can potentially be useful as a screening tool for diseases that lack an accepted cellular model for in vitro screening.
Collapse
Affiliation(s)
- Alvin Z Yu
- Department of Biomedical Sciences, Oregon State University, 106 Dryden Hall, Corvallis, OR, 97331, USA
| | - Stephen A Ramsey
- Department of Biomedical Sciences, Oregon State University, 106 Dryden Hall, Corvallis, OR, 97331, USA.
- School of Electrical Engineering and Computer Science, Oregon State University, 1148 Kelley Engineering Center, Corvallis, OR, 97331, USA.
| |
Collapse
|
288
|
Ferrero E, Agarwal P. Connecting genetics and gene expression data for target prioritisation and drug repositioning. BioData Min 2018; 11:7. [PMID: 29881461 PMCID: PMC5984374 DOI: 10.1186/s13040-018-0171-y] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2017] [Accepted: 05/14/2018] [Indexed: 12/18/2022] Open
Abstract
Developing new drugs continues to be a highly inefficient and costly business. By repurposing an existing compound for a different indication, drug repositioning offers an attractive alternative to traditional drug discovery. Most of these approaches work by matching transcriptional disease signatures to anti-correlated gene expression profiles of drug perturbations. Genome-wide association studies (GWASs) are of great interest to researchers in the pharmaceutical industry because drug programmes with supporting genetic evidence are more likely to successfully progress through the drug discovery pipeline. Here, we present a systematic approach to generate drug repositioning hypothesis based on disease genetics by mining public repositories of GWAS data and drug transcriptomic profiles. We find that genes genetically associated with a certain disease are more likely to be differentially expressed in the same disease (p-value = 1.54e-17 and AUC = 0.75) and that, in existing drug - disease combinations, genes significantly up- or down-regulated after drug treatment are enriched for genes genetically associated with that disease (p-value = 1.1e-79 and AUC = 0.64). Finally, we use this framework to generate and rank novel GWAS-driven drug repositioning predictions.
Collapse
Affiliation(s)
- Enrico Ferrero
- Computational Biology, Target Sciences, GSK, Gunnels Wood Road, Stevenage, SG1 2NY UK
- Present Address: Autoimmunity, Transplantation and Inflammation, Novartis Institutes for Biomedical Research, Fabrikstrasse 2, Basel, 4056 Switzerland
| | - Pankaj Agarwal
- Computational Biology, Target Sciences, GSK, 1250 S. Collegeville Road, UP12-100, Collegeville, PA 19426-0989 USA
| |
Collapse
|
289
|
Huang H, Zhang G, Zhou Y, Lin C, Chen S, Lin Y, Mai S, Huang Z. Reverse Screening Methods to Search for the Protein Targets of Chemopreventive Compounds. Front Chem 2018; 6:138. [PMID: 29868550 PMCID: PMC5954125 DOI: 10.3389/fchem.2018.00138] [Citation(s) in RCA: 78] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2018] [Accepted: 04/09/2018] [Indexed: 12/13/2022] Open
Abstract
This article is a systematic review of reverse screening methods used to search for the protein targets of chemopreventive compounds or drugs. Typical chemopreventive compounds include components of traditional Chinese medicine, natural compounds and Food and Drug Administration (FDA)-approved drugs. Such compounds are somewhat selective but are predisposed to bind multiple protein targets distributed throughout diverse signaling pathways in human cells. In contrast to conventional virtual screening, which identifies the ligands of a targeted protein from a compound database, reverse screening is used to identify the potential targets or unintended targets of a given compound from a large number of receptors by examining their known ligands or crystal structures. This method, also known as in silico or computational target fishing, is highly valuable for discovering the target receptors of query molecules from terrestrial or marine natural products, exploring the molecular mechanisms of chemopreventive compounds, finding alternative indications of existing drugs by drug repositioning, and detecting adverse drug reactions and drug toxicity. Reverse screening can be divided into three major groups: shape screening, pharmacophore screening and reverse docking. Several large software packages, such as Schrödinger and Discovery Studio; typical software/network services such as ChemMapper, PharmMapper, idTarget, and INVDOCK; and practical databases of known target ligands and receptor crystal structures, such as ChEMBL, BindingDB, and the Protein Data Bank (PDB), are available for use in these computational methods. Different programs, online services and databases have different applications and constraints. Here, we conducted a systematic analysis and multilevel classification of the computational programs, online services and compound libraries available for shape screening, pharmacophore screening and reverse docking to enable non-specialist users to quickly learn and grasp the types of calculations used in protein target fishing. In addition, we review the main features of these methods, programs and databases and provide a variety of examples illustrating the application of one or a combination of reverse screening methods for accurate target prediction.
Collapse
Affiliation(s)
- Hongbin Huang
- Key Laboratory for Medical Molecular Diagnostics of Guangdong Province, Dongguan Scientific Research Center, Guangdong Medical University Dongguan, China.,The Second School of Clinical Medicine, Guangdong Medical University Dongguan, China
| | - Guigui Zhang
- Key Laboratory for Medical Molecular Diagnostics of Guangdong Province, Dongguan Scientific Research Center, Guangdong Medical University Dongguan, China.,School of Pharmacy, Guangdong Medical University Dongguan, China
| | - Yuquan Zhou
- Key Laboratory for Medical Molecular Diagnostics of Guangdong Province, Dongguan Scientific Research Center, Guangdong Medical University Dongguan, China.,The Second School of Clinical Medicine, Guangdong Medical University Dongguan, China
| | - Chenru Lin
- Key Laboratory for Medical Molecular Diagnostics of Guangdong Province, Dongguan Scientific Research Center, Guangdong Medical University Dongguan, China.,School of Pharmacy, Guangdong Medical University Dongguan, China
| | - Suling Chen
- Key Laboratory for Medical Molecular Diagnostics of Guangdong Province, Dongguan Scientific Research Center, Guangdong Medical University Dongguan, China.,The Second School of Clinical Medicine, Guangdong Medical University Dongguan, China
| | - Yutong Lin
- Key Laboratory for Medical Molecular Diagnostics of Guangdong Province, Dongguan Scientific Research Center, Guangdong Medical University Dongguan, China.,School of Pharmacy, Guangdong Medical University Dongguan, China
| | - Shangkang Mai
- Key Laboratory for Medical Molecular Diagnostics of Guangdong Province, Dongguan Scientific Research Center, Guangdong Medical University Dongguan, China.,The Second School of Clinical Medicine, Guangdong Medical University Dongguan, China
| | - Zunnan Huang
- Key Laboratory for Medical Molecular Diagnostics of Guangdong Province, Dongguan Scientific Research Center, Guangdong Medical University Dongguan, China.,School of Pharmacy, Guangdong Medical University Dongguan, China
| |
Collapse
|
290
|
Wang Q, Li L, Xu R. A systems biology approach to predict and characterize human gut microbial metabolites in colorectal cancer. Sci Rep 2018; 8:6225. [PMID: 29670137 PMCID: PMC5906656 DOI: 10.1038/s41598-018-24315-0] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2017] [Accepted: 03/26/2018] [Indexed: 12/16/2022] Open
Abstract
Colorectal cancer (CRC) is the second leading cause of cancer-related deaths. It is estimated that about half the cases of CRC occurring today are preventable. Recent studies showed that human gut microbiota and their collective metabolic outputs play important roles in CRC. However, the mechanisms by which human gut microbial metabolites interact with host genetics in contributing CRC remain largely unknown. We hypothesize that computational approaches that integrate and analyze vast amounts of publicly available biomedical data have great potential in better understanding how human gut microbial metabolites are mechanistically involved in CRC. Leveraging vast amount of publicly available data, we developed a computational algorithm to predict human gut microbial metabolites for CRC. We validated the prediction algorithm by showing that previously known CRC-associated gut microbial metabolites ranked highly (mean ranking: top 10.52%; median ranking: 6.29%; p-value: 3.85E-16). Moreover, we identified new gut microbial metabolites likely associated with CRC. Through computational analysis, we propose potential roles for tartaric acid, the top one ranked metabolite, in CRC etiology. In summary, our data-driven computation-based study generated a large amount of associations that could serve as a starting point for further experiments to refute or validate these microbial metabolite associations in CRC cancer.
Collapse
Affiliation(s)
| | - Li Li
- Department of Family Medicine and Community Health, Case Comprehensive Cancer Center, School of Medicine, Case Western Reserve University, Cleveland, OH, USA
| | - Rong Xu
- Department of Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, 2103 Cornell Road, Cleveland, Ohio, 44106, USA.
| |
Collapse
|
291
|
Jean-Quartier C, Jeanquartier F, Jurisica I, Holzinger A. In silico cancer research towards 3R. BMC Cancer 2018; 18:408. [PMID: 29649981 PMCID: PMC5897933 DOI: 10.1186/s12885-018-4302-0] [Citation(s) in RCA: 64] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2017] [Accepted: 03/26/2018] [Indexed: 01/11/2023] Open
Abstract
BACKGROUND Improving our understanding of cancer and other complex diseases requires integrating diverse data sets and algorithms. Intertwining in vivo and in vitro data and in silico models are paramount to overcome intrinsic difficulties given by data complexity. Importantly, this approach also helps to uncover underlying molecular mechanisms. Over the years, research has introduced multiple biochemical and computational methods to study the disease, many of which require animal experiments. However, modeling systems and the comparison of cellular processes in both eukaryotes and prokaryotes help to understand specific aspects of uncontrolled cell growth, eventually leading to improved planning of future experiments. According to the principles for humane techniques milestones in alternative animal testing involve in vitro methods such as cell-based models and microfluidic chips, as well as clinical tests of microdosing and imaging. Up-to-date, the range of alternative methods has expanded towards computational approaches, based on the use of information from past in vitro and in vivo experiments. In fact, in silico techniques are often underrated but can be vital to understanding fundamental processes in cancer. They can rival accuracy of biological assays, and they can provide essential focus and direction to reduce experimental cost. MAIN BODY We give an overview on in vivo, in vitro and in silico methods used in cancer research. Common models as cell-lines, xenografts, or genetically modified rodents reflect relevant pathological processes to a different degree, but can not replicate the full spectrum of human disease. There is an increasing importance of computational biology, advancing from the task of assisting biological analysis with network biology approaches as the basis for understanding a cell's functional organization up to model building for predictive systems. CONCLUSION Underlining and extending the in silico approach with respect to the 3Rs for replacement, reduction and refinement will lead cancer research towards efficient and effective precision medicine. Therefore, we suggest refined translational models and testing methods based on integrative analyses and the incorporation of computational biology within cancer research.
Collapse
Affiliation(s)
- Claire Jean-Quartier
- Holzinger Group, Institute for Medical Informatics, Statistics and Documentation, Medical University Graz, Graz, Austria
| | - Fleur Jeanquartier
- Holzinger Group, Institute for Medical Informatics, Statistics and Documentation, Medical University Graz, Graz, Austria
- Institute of Interactive Systems and Data Science, Graz University of Technology, Graz, Austria
| | - Igor Jurisica
- Krembil Research Institute, University Health Network; Depts. of Medical Bioph. and Comp. Sci., University of Toronto; Institute of Neuroimmunology, Slovak Academy of Sciences, Toronto, Canada
| | - Andreas Holzinger
- Holzinger Group, Institute for Medical Informatics, Statistics and Documentation, Medical University Graz, Graz, Austria
- Institute of Interactive Systems and Data Science, Graz University of Technology, Graz, Austria
| |
Collapse
|
292
|
Hameed PN, Verspoor K, Kusljic S, Halgamuge S. A two-tiered unsupervised clustering approach for drug repositioning through heterogeneous data integration. BMC Bioinformatics 2018; 19:129. [PMID: 29642848 PMCID: PMC5896044 DOI: 10.1186/s12859-018-2123-4] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2017] [Accepted: 03/21/2018] [Indexed: 01/02/2023] Open
Abstract
Background Drug repositioning is the process of identifying new uses for existing drugs. Computational drug repositioning methods can reduce the time, costs and risks of drug development by automating the analysis of the relationships in pharmacology networks. Pharmacology networks are large and heterogeneous. Clustering drugs into small groups can simplify large pharmacology networks, these subgroups can also be used as a starting point for repositioning drugs. In this paper, we propose a two-tiered drug-centric unsupervised clustering approach for drug repositioning, integrating heterogeneous drug data profiles: drug-chemical, drug-disease, drug-gene, drug-protein and drug-side effect relationships. Results The proposed drug repositioning approach is threefold; (i) clustering drugs based on their homogeneous profiles using the Growing Self Organizing Map (GSOM); (ii) clustering drugs based on drug-drug relation matrices based on the previous step, considering three state-of-the-art graph clustering methods; and (iii) inferring drug repositioning candidates and assigning a confidence value for each identified candidate. In this paper, we compare our two-tiered clustering approach against two existing heterogeneous data integration approaches with reference to the Anatomical Therapeutic Chemical (ATC) classification, using GSOM. Our approach yields Normalized Mutual Information (NMI) and Standardized Mutual Information (SMI) of 0.66 and 36.11, respectively, while the two existing methods yield NMI of 0.60 and 0.64 and SMI of 22.26 and 33.59. Moreover, the two existing approaches failed to produce useful cluster separations when using graph clustering algorithms while our approach is able to identify useful clusters for drug repositioning. Furthermore, we provide clinical evidence for four predicted results (Chlorthalidone, Indomethacin, Metformin and Thioridazine) to support that our proposed approach can be reliably used to infer ATC code and drug repositioning. Conclusion The proposed two-tiered unsupervised clustering approach is suitable for drug clustering and enables heterogeneous data integration. It also enables identifying reliable repositioning drug candidates with reference to ATC therapeutic classification. The repositioning drug candidates identified consistently by multiple clustering algorithms and with high confidence have a higher possibility of being effective repositioning candidates. Electronic supplementary material The online version of this article (10.1186/s12859-018-2123-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Pathima Nusrath Hameed
- Department of Mechanical Engineering, University of Melbourne, Parkville, Melbourne, 3010, Australia. .,Data61, Victoria Research Lab, West Melbourne, 3003, Australia. .,Department of Computer Science, University of Ruhuna, Matara, 81000, Sri Lanka.
| | - Karin Verspoor
- Department of Computing and Information Systems, University of Melbourne, Parkville, Melbourne, 3010, Australia
| | - Snezana Kusljic
- Department of Nursing, University of Melbourne, Parkville, Melbourne, 3010, Australia.,The Florey Institute of Neuroscience and Mental Health, University of Melbourne, Parkville, Melbourne, 3010, Australia
| | - Saman Halgamuge
- Research School of Engineering, College of Engineering & Computer Science, The Australian National University, Canberra, ACT, 2601, Australia
| |
Collapse
|
293
|
Sagers JE, Brown AS, Vasilijic S, Lewis RM, Sahin MI, Landegger LD, Perlis RH, Kohane IS, Welling DB, Patel CJ, Stankovic KM. Computational repositioning and preclinical validation of mifepristone for human vestibular schwannoma. Sci Rep 2018; 8:5437. [PMID: 29615643 PMCID: PMC5882888 DOI: 10.1038/s41598-018-23609-7] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2017] [Accepted: 03/14/2018] [Indexed: 12/30/2022] Open
Abstract
The computational repositioning of existing drugs represents an appealing avenue for identifying effective compounds to treat diseases with no FDA-approved pharmacotherapies. Here we present the largest meta-analysis to date of differential gene expression in human vestibular schwannoma (VS), a debilitating intracranial tumor, and use these data to inform the first application of algorithm-based drug repositioning for this tumor class. We apply an open-source computational drug repositioning platform to gene expression data from 80 patient tumors and identify eight promising FDA-approved drugs with potential for repurposing in VS. Of these eight, mifepristone, a progesterone and glucocorticoid receptor antagonist, consistently and adversely affects the morphology, metabolic activity, and proliferation of primary human VS cells and HEI-193 human schwannoma cells. Mifepristone treatment reduces VS cell viability more significantly than cells derived from patient meningiomas, while healthy human Schwann cells remain unaffected. Our data recommend a Phase II clinical trial of mifepristone in VS.
Collapse
Affiliation(s)
- Jessica E Sagers
- Eaton-Peabody Laboratories, Department of Otolaryngology, Massachusetts Eye and Ear, Boston, MA, 02114, USA
- Program in Speech and Hearing Bioscience and Technology, Harvard Medical School, Boston, MA, 02115, USA
- Harvard Program in Therapeutic Science, Harvard Medical School, Boston, MA, 02115, USA
| | - Adam S Brown
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, 02115, USA
- Harvard Program in Therapeutic Science, Harvard Medical School, Boston, MA, 02115, USA
| | - Sasa Vasilijic
- Eaton-Peabody Laboratories, Department of Otolaryngology, Massachusetts Eye and Ear, Boston, MA, 02114, USA
- Department of Otolaryngology, Harvard Medical School, Boston, MA, 02114, USA
| | - Rebecca M Lewis
- Eaton-Peabody Laboratories, Department of Otolaryngology, Massachusetts Eye and Ear, Boston, MA, 02114, USA
- Department of Otolaryngology, Harvard Medical School, Boston, MA, 02114, USA
| | - Mehmet I Sahin
- Eaton-Peabody Laboratories, Department of Otolaryngology, Massachusetts Eye and Ear, Boston, MA, 02114, USA
- Department of Otolaryngology, Harvard Medical School, Boston, MA, 02114, USA
| | - Lukas D Landegger
- Eaton-Peabody Laboratories, Department of Otolaryngology, Massachusetts Eye and Ear, Boston, MA, 02114, USA
- Department of Otolaryngology, Harvard Medical School, Boston, MA, 02114, USA
- Department of Otolaryngology, Vienna General Hospital, Medical University of Vienna, Vienna, 1090, Austria
| | - Roy H Perlis
- Center for Experimental Drugs and Diagnostics, Department of Psychiatry and Center for Human Genetic Research, Massachusetts General Hospital, Harvard Medical School, Boston, MA, 02114, USA
| | - Isaac S Kohane
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, 02115, USA
| | - D Bradley Welling
- Eaton-Peabody Laboratories, Department of Otolaryngology, Massachusetts Eye and Ear, Boston, MA, 02114, USA
- Department of Otolaryngology, Harvard Medical School, Boston, MA, 02114, USA
| | - Chirag J Patel
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, 02115, USA
| | - Konstantina M Stankovic
- Eaton-Peabody Laboratories, Department of Otolaryngology, Massachusetts Eye and Ear, Boston, MA, 02114, USA.
- Program in Speech and Hearing Bioscience and Technology, Harvard Medical School, Boston, MA, 02115, USA.
- Department of Otolaryngology, Harvard Medical School, Boston, MA, 02114, USA.
| |
Collapse
|
294
|
Ching T, Himmelstein DS, Beaulieu-Jones BK, Kalinin AA, Do BT, Way GP, Ferrero E, Agapow PM, Zietz M, Hoffman MM, Xie W, Rosen GL, Lengerich BJ, Israeli J, Lanchantin J, Woloszynek S, Carpenter AE, Shrikumar A, Xu J, Cofer EM, Lavender CA, Turaga SC, Alexandari AM, Lu Z, Harris DJ, DeCaprio D, Qi Y, Kundaje A, Peng Y, Wiley LK, Segler MHS, Boca SM, Swamidass SJ, Huang A, Gitter A, Greene CS. Opportunities and obstacles for deep learning in biology and medicine. J R Soc Interface 2018; 15:20170387. [PMID: 29618526 PMCID: PMC5938574 DOI: 10.1098/rsif.2017.0387] [Citation(s) in RCA: 826] [Impact Index Per Article: 137.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2017] [Accepted: 03/07/2018] [Indexed: 11/12/2022] Open
Abstract
Deep learning describes a class of machine learning algorithms that are capable of combining raw inputs into layers of intermediate features. These algorithms have recently shown impressive results across a variety of domains. Biology and medicine are data-rich disciplines, but the data are complex and often ill-understood. Hence, deep learning techniques may be particularly well suited to solve problems of these fields. We examine applications of deep learning to a variety of biomedical problems-patient classification, fundamental biological processes and treatment of patients-and discuss whether deep learning will be able to transform these tasks or if the biomedical sphere poses unique challenges. Following from an extensive literature review, we find that deep learning has yet to revolutionize biomedicine or definitively resolve any of the most pressing challenges in the field, but promising advances have been made on the prior state of the art. Even though improvements over previous baselines have been modest in general, the recent progress indicates that deep learning methods will provide valuable means for speeding up or aiding human investigation. Though progress has been made linking a specific neural network's prediction to input features, understanding how users should interpret these models to make testable hypotheses about the system under study remains an open challenge. Furthermore, the limited amount of labelled data for training presents problems in some domains, as do legal and privacy constraints on work with sensitive health records. Nonetheless, we foresee deep learning enabling changes at both bench and bedside with the potential to transform several areas of biology and medicine.
Collapse
Affiliation(s)
- Travers Ching
- Molecular Biosciences and Bioengineering Graduate Program, University of Hawaii at Manoa, Honolulu, HI, USA
| | - Daniel S Himmelstein
- Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Brett K Beaulieu-Jones
- Genomics and Computational Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Alexandr A Kalinin
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
| | | | - Gregory P Way
- Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Enrico Ferrero
- Computational Biology and Stats, Target Sciences, GlaxoSmithKline, Stevenage, UK
| | | | - Michael Zietz
- Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Michael M Hoffman
- Princess Margaret Cancer Centre, Toronto, Ontario, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
| | - Wei Xie
- Electrical Engineering and Computer Science, Vanderbilt University, Nashville, TN, USA
| | - Gail L Rosen
- Ecological and Evolutionary Signal-processing and Informatics Laboratory, Department of Electrical and Computer Engineering, Drexel University, Philadelphia, PA, USA
| | - Benjamin J Lengerich
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Johnny Israeli
- Biophysics Program, Stanford University, Stanford, CA, USA
| | - Jack Lanchantin
- Department of Computer Science, University of Virginia, Charlottesville, VA, USA
| | - Stephen Woloszynek
- Ecological and Evolutionary Signal-processing and Informatics Laboratory, Department of Electrical and Computer Engineering, Drexel University, Philadelphia, PA, USA
| | - Anne E Carpenter
- Imaging Platform, Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Avanti Shrikumar
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Jinbo Xu
- Toyota Technological Institute at Chicago, Chicago, IL, USA
| | - Evan M Cofer
- Department of Computer Science, Trinity University, San Antonio, TX, USA
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA
| | - Christopher A Lavender
- Integrative Bioinformatics, National Institute of Environmental Health Sciences, National Institutes of Health, Research Triangle Park, NC, USA
| | - Srinivas C Turaga
- Howard Hughes Medical Institute, Janelia Research Campus, Ashburn, VA, USA
| | - Amr M Alexandari
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Zhiyong Lu
- National Center for Biotechnology Information and National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - David J Harris
- Department of Wildlife Ecology and Conservation, University of Florida, Gainesville, FL, USA
| | | | - Yanjun Qi
- Department of Computer Science, University of Virginia, Charlottesville, VA, USA
| | - Anshul Kundaje
- Department of Computer Science, Stanford University, Stanford, CA, USA
- Department of Genetics, Stanford University, Stanford, CA, USA
| | - Yifan Peng
- National Center for Biotechnology Information and National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Laura K Wiley
- Division of Biomedical Informatics and Personalized Medicine, University of Colorado School of Medicine, Aurora, CO, USA
| | - Marwin H S Segler
- Institute of Organic Chemistry, Westfälische Wilhelms-Universität Münster, Münster, Germany
| | - Simina M Boca
- Innovation Center for Biomedical Informatics, Georgetown University Medical Center, Washington, DC, USA
| | - S Joshua Swamidass
- Department of Pathology and Immunology, Washington University in Saint Louis, St Louis, MO, USA
| | - Austin Huang
- Department of Medicine, Brown University, Providence, RI, USA
| | - Anthony Gitter
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, USA
- Morgridge Institute for Research, Madison, WI, USA
| | - Casey S Greene
- Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| |
Collapse
|
295
|
Koudijs KKM, Terwisscha van Scheltinga AGT, Böhringer S, Schimmel KJM, Guchelaar HJ. Personalised drug repositioning for Clear Cell Renal Cell Carcinoma using gene expression. Sci Rep 2018; 8:5250. [PMID: 29588458 PMCID: PMC5869697 DOI: 10.1038/s41598-018-23195-8] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2017] [Accepted: 03/07/2018] [Indexed: 02/02/2023] Open
Abstract
Reversal of cancer gene expression is predictive of therapeutic potential and can be used to find new indications for existing drugs (drug repositioning). Gene expression reversal potential is currently calculated, in almost all studies, by pre-aggregating all tumour samples into a single group signature or a limited number of molecular subtype signatures. Here, we investigate whether drug repositioning based on individual tumour sample gene expression signatures outperforms the use of tumour group and subtype signatures. The tumour signatures were created using 534 tumour samples and 72 matched normal samples from 530 clear cell renal cell carcinoma (ccRCC) patients. More than 20,000 drug signatures were extracted from the CMAP and LINCS databases. We show that negative enrichment of individual tumour samples correlated (Spearman's rho = 0.15) much better with the amount of differentially expressed genes in drug signatures than with the tumour group signature (Rho = 0.08) and the 4 tumour subtype signatures (Rho 0.036-0.11). Targeted drugs used against ccRCC, such as sirolimus and temsirolimus, which could not be identified with the pre-aggregated tumour signatures could be recovered using individual sample analysis. Thus, drug repositioning can be personalized by taking into account the gene expression profile of the individual's tumour sample.
Collapse
Affiliation(s)
- Karel K M Koudijs
- Department of Clinical Pharmacy & Toxicology, Leiden University Medical Centre, Leiden, The Netherlands
| | | | - Stefan Böhringer
- Department of Medical Statistics, Leiden University Medical Centre, Leiden, The Netherlands
| | - Kirsten J M Schimmel
- Department of Clinical Pharmacy & Toxicology, Leiden University Medical Centre, Leiden, The Netherlands
| | - Henk-Jan Guchelaar
- Department of Clinical Pharmacy & Toxicology, Leiden University Medical Centre, Leiden, The Netherlands.
| |
Collapse
|
296
|
Large-scale computational drug repositioning to find treatments for rare diseases. NPJ Syst Biol Appl 2018; 4:13. [PMID: 29560273 PMCID: PMC5847522 DOI: 10.1038/s41540-018-0050-7] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2017] [Revised: 01/22/2018] [Accepted: 02/03/2018] [Indexed: 11/08/2022] Open
Abstract
Rare, or orphan, diseases are conditions afflicting a small subset of people in a population. Although these disorders collectively pose significant health care problems, drug companies require government incentives to develop drugs for rare diseases due to extremely limited individual markets. Computer-aided drug repositioning, i.e., finding new indications for existing drugs, is a cheaper and faster alternative to traditional drug discovery offering a promising venue for orphan drug research. Structure-based matching of drug-binding pockets is among the most promising computational techniques to inform drug repositioning. In order to find new targets for known drugs ultimately leading to drug repositioning, we recently developed eMatchSite, a new computer program to compare drug-binding sites. In this study, eMatchSite is combined with virtual screening to systematically explore opportunities to reposition known drugs to proteins associated with rare diseases. The effectiveness of this integrated approach is demonstrated for a kinase inhibitor, which is a confirmed candidate for repositioning to synapsin Ia. The resulting dataset comprises 31,142 putative drug-target complexes linked to 980 orphan diseases. The modeling accuracy is evaluated against the structural data recently released for tyrosine-protein kinase HCK. To illustrate how potential therapeutics for rare diseases can be identified, we discuss a possibility to repurpose a steroidal aromatase inhibitor to treat Niemann-Pick disease type C. Overall, the exhaustive exploration of the drug repositioning space exposes new opportunities to combat orphan diseases with existing drugs. DrugBank/Orphanet repositioning data are freely available to research community at https://osf.io/qdjup/.
Collapse
|
297
|
Brown N, Cambruzzi J, Cox PJ, Davies M, Dunbar J, Plumbley D, Sellwood MA, Sim A, Williams-Jones BI, Zwierzyna M, Sheppard DW. Big Data in Drug Discovery. PROGRESS IN MEDICINAL CHEMISTRY 2018; 57:277-356. [PMID: 29680150 DOI: 10.1016/bs.pmch.2017.12.003] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/02/2022]
Abstract
Interpretation of Big Data in the drug discovery community should enhance project timelines and reduce clinical attrition through improved early decision making. The issues we encounter start with the sheer volume of data and how we first ingest it before building an infrastructure to house it to make use of the data in an efficient and productive way. There are many problems associated with the data itself including general reproducibility, but often, it is the context surrounding an experiment that is critical to success. Help, in the form of artificial intelligence (AI), is required to understand and translate the context. On the back of natural language processing pipelines, AI is also used to prospectively generate new hypotheses by linking data together. We explain Big Data from the context of biology, chemistry and clinical trials, showcasing some of the impressive public domain sources and initiatives now available for interrogation.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Aaron Sim
- BenevolentAI, London, United Kingdom
| | | | - Magdalena Zwierzyna
- BenevolentAI, London, United Kingdom; Institute of Cardiovascular Science, University College London, London, United Kingdom
| | | |
Collapse
|
298
|
Ezzat A, Wu M, Li XL, Kwoh CK. Computational prediction of drug–target interactions using chemogenomic approaches: an empirical survey. Brief Bioinform 2018; 20:1337-1357. [DOI: 10.1093/bib/bby002] [Citation(s) in RCA: 117] [Impact Index Per Article: 19.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2017] [Revised: 12/21/2017] [Indexed: 01/18/2023] Open
Abstract
Abstract
Computational prediction of drug–target interactions (DTIs) has become an essential task in the drug discovery process. It narrows down the search space for interactions by suggesting potential interaction candidates for validation via wet-lab experiments that are well known to be expensive and time-consuming. In this article, we aim to provide a comprehensive overview and empirical evaluation on the computational DTI prediction techniques, to act as a guide and reference for our fellow researchers. Specifically, we first describe the data used in such computational DTI prediction efforts. We then categorize and elaborate the state-of-the-art methods for predicting DTIs. Next, an empirical comparison is performed to demonstrate the prediction performance of some representative methods under different scenarios. We also present interesting findings from our evaluation study, discussing the advantages and disadvantages of each method. Finally, we highlight potential avenues for further enhancement of DTI prediction performance as well as related research directions.
Collapse
|
299
|
Drug-Target Interaction Prediction in Drug Repositioning Based on Deep Semi-Supervised Learning. COMPUTATIONAL INTELLIGENCE AND ITS APPLICATIONS 2018. [DOI: 10.1007/978-3-319-89743-1_27] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
|
300
|
Abstract
No therapies have been shown to improve outcomes in patients with acute kidney injury (AKI). Given the high morbidity and mortality associated with AKI this represents an important unmet medical need. A common feature of all of the therapeutic development efforts for AKI is that none were driven by target selection or preclinical modeling that was based primarily on human data. This is important when considering a heterogeneous and dynamic condition such as AKI, in which in the absence of more accurate molecular classifications, clinical cohorts are likely to include patients with different types of injury at different stages in the injury and repair continuum. The National Institutes of Health precision medicine initiative offers an opportunity to address this. By creating a molecular tissue atlas of AKI, defining patient subgroups, and identifying critical cells and pathways involved in human AKI, this initiative has the potential to transform our current approach to therapeutic discovery. In this review, we discuss the opportunities and challenges that this initiative presents, with a specific focus on AKI, what additional efforts will be needed to apply these discoveries to therapeutic development, and how we believe this effort might lead to the development of new therapeutics for subsets of patients with AKI.
Collapse
Affiliation(s)
- Mark de Caestecker
- Nephrology Division, Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee.
| | - Raymond Harris
- Nephrology Division, Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee
| |
Collapse
|