1
|
E Z, Qiao G, Wang G, Li Y. GSL-DTI: Graph structure learning network for Drug-Target interaction prediction. Methods 2024; 223:136-145. [PMID: 38360082 DOI: 10.1016/j.ymeth.2024.01.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2023] [Revised: 12/23/2023] [Accepted: 01/29/2024] [Indexed: 02/17/2024] Open
Abstract
MOTIVATION Drug-target interaction prediction is an important area of research to predict whether there is an interaction between a drug molecule and its target protein. It plays a critical role in drug discovery and development by facilitating the identification of potential drug candidates and expediting the overall process. Given the time-consuming, expensive, and high-risk nature of traditional drug discovery methods, the prediction of drug-target interactions has become an indispensable tool. Using machine learning and deep learning to tackle this class of problems has become a mainstream approach, and graph-based models have recently received much attention in this field. However, many current graph-based Drug-Target Interaction (DTI) prediction methods rely on manually defined rules to construct the Drug-Protein Pair (DPP) network during the DPP representation learning process. However, these methods fail to capture the true underlying relationships between drug molecules and target proteins. RESULTS We propose GSL-DTI, an automatic graph structure learning model used for predicting drug-target interactions (DTIs). Initially, we integrate large-scale heterogeneous networks using a graph convolution network based on meta-paths, effectively learning the representations of drugs and target proteins. Subsequently, we construct drug-protein pairs based on these representations. In contrast to previous studies that construct DPP networks based on manual rules, our method introduces an automatic graph structure learning approach. This approach utilizes a filter gate on the affinity scores of DPPs and relies on the classification loss of downstream tasks to guide the learning of the underlying DPP network structure. Based on the learned DPP network, we transform the prediction of drug-target interactions into a node classification problem. The comprehensive experiments conducted on three public datasets have shown the superiority of GSL-DTI in the tasks of DTI prediction. Additionally, GSL-DTI provides a fresh perspective for advancing research in graph structure learning for DTI prediction.
Collapse
Affiliation(s)
- Zixuan E
- College of Computer and Control Engineering, Northeast Forestry University,Harbin 150006, China
| | - Guanyu Qiao
- College of Computer and Control Engineering, Northeast Forestry University,Harbin 150006, China
| | - Guohua Wang
- College of Computer and Control Engineering, Northeast Forestry University,Harbin 150006, China.
| | - Yang Li
- College of Computer and Control Engineering, Northeast Forestry University,Harbin 150006, China.
| |
Collapse
|
2
|
Li M, Cai X, Xu S, Ji H. Metapath-aggregated heterogeneous graph neural network for drug-target interaction prediction. Brief Bioinform 2023; 24:6966534. [PMID: 36592060 DOI: 10.1093/bib/bbac578] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2022] [Revised: 11/03/2022] [Accepted: 11/26/2022] [Indexed: 01/03/2023] Open
Abstract
Drug-target interaction (DTI) prediction is an essential step in drug repositioning. A few graph neural network (GNN)-based methods have been proposed for DTI prediction using heterogeneous biological data. However, existing GNN-based methods only aggregate information from directly connected nodes restricted in a drug-related or a target-related network and are incapable of capturing high-order dependencies in the biological heterogeneous graph. In this paper, we propose a metapath-aggregated heterogeneous graph neural network (MHGNN) to capture complex structures and rich semantics in the biological heterogeneous graph for DTI prediction. Specifically, MHGNN enhances heterogeneous graph structure learning and high-order semantics learning by modeling high-order relations via metapaths. Additionally, MHGNN enriches high-order correlations between drug-target pairs (DTPs) by constructing a DTP correlation graph with DTPs as nodes. We conduct extensive experiments on three biological heterogeneous datasets. MHGNN favorably surpasses 17 state-of-the-art methods over 6 evaluation metrics, which verifies its efficacy for DTI prediction. The code is available at https://github.com/Zora-LM/MHGNN-DTI.
Collapse
Affiliation(s)
- Mei Li
- Tianjin Key Laboratory of Network and Data Security Technology, China.,College of Computer Science, Nankai University, 300350, Tianjin, China
| | - Xiangrui Cai
- Tianjin Key Laboratory of Network and Data Security Technology, China.,College of Computer Science, Nankai University, 300350, Tianjin, China
| | - Sihan Xu
- Tianjin Key Laboratory of Network and Data Security Technology, China.,College of Cyber Science, Nankai University, 300350, Tianjin, China
| | - Hua Ji
- Tianjin Key Laboratory of Network and Data Security Technology, China.,College of Computer Science, Nankai University, 300350, Tianjin, China
| |
Collapse
|
3
|
Yang J, Zhang D, Cai Y, Yu K, Li M, Liu L, Chen X. Computational Prediction of Drug Phenotypic Effects Based on Substructure-Phenotype Associations. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:256-265. [PMID: 35239490 DOI: 10.1109/tcbb.2022.3155453] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Identifying drug phenotypic effects, including therapeutic effects and adverse drug reactions (ADRs), is an inseparable part for evaluating the potentiality of new drug candidates (NDCs). However, current computational methods for predicting phenotypic effects of NDCs are mainly based on the overall structure of an NDC or a related target. These approaches often lead to inconsistencies between the structures and functions and limit the prediction space of NDCs. In this study, first, we constructed quantitative associations of substructure-domain, domain-ADR, and domain-ATC (Anatomical Therapeutic Chemical Classification System code) through L1LOG and L1SVM machine learning models. These associations represent relationships between phenotypes (ADRs and ATCs) and local structures of drugs and proteins. Then, based on these established associations, substructure-phenotype relationships were constructed which were utilized to quantify drug-phenotype relationships. Thus, this approach could achieve high-throughput and effective evaluations of the druggability of NDCs by referring to the established substructure-phenotype relationships and structural information of NDCs without additional prior knowledge. Using this computational pipeline, 83,205 drug-ATC relationships (including 1,479 drugs and 178 ATCs) and 306,421 drug-ADR relationships (including 1,752 drugs and 454 ADRs) were predicted in total. The prediction results were validated at four levels: five-fold cross validation, public databases, literature, and molecular docking. Furthermore, three case studies demonstrated the feasibility of our method. 79 ATCs and 269 ADRs were predicted to be related to Maraviroc, an approved drug, including the existing antiviral effect in clinical use. Additionally, we also found risk substructures of severe ADRs, for example, SUB215 (>= 1, saturated or only aromatic carbon ring size 7) can result in shock. And we analyzed the mechanism of action (MOA) of interested drugs based on the established drug-substructure-domain-protein associations. In a word, this approach through establishing drug-substructure-phenotype relationships can achieve quantitative prediction of phenotypes for a given NDC or drug without any prior knowledge except its structure information. Using that way, we can directly obtain the relationships between substructure and phenotype of a compound, which is more convenient to analyze the phenotypic mechanism of drugs and accelerate the process of rational drug design.
Collapse
|
4
|
Dube M, Raphane B, Sethebe B, Seputhe N, Tiroyakgosi T, Imming P, Häberli C, Keiser J, Arnold N, Andrae-Marobela K. Medicinal Plant Preparations Administered by Botswana Traditional Health Practitioners for Treatment of Worm Infections Show Anthelmintic Activities. PLANTS (BASEL, SWITZERLAND) 2022; 11:2945. [PMID: 36365400 PMCID: PMC9658373 DOI: 10.3390/plants11212945] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Revised: 10/26/2022] [Accepted: 10/26/2022] [Indexed: 06/16/2023]
Abstract
Schistosomiasis and soil-transmitted helminths are some of the priority neglected tropical diseases (NTDs) targeted for elimination by the World Health Organization (WHO). They are prevalent in Botswana and although Botswana has begun mass drug administration with the hope of eliminating soil-transmitted helminths as a public health problem, the prevalence of schistosomiasis does not meet the threshold required to warrant large-scale interventions. Although Botswana has a modern healthcare system, many people in Botswana rely on traditional medicine to treat worm infections and schistosomiasis. In this study, ten plant species used by traditional health practitioners against worm infections were collected and tested against Ancylostoma ceylanicum (zoonotic hookworm), Heligmosomoides polygyrus (roundworm of rodents), Necator americanus (New World hookworm), Schistosoma mansoni (blood fluke) [adult and newly transformed schistosomula (NTS)], Strongyloides ratti (threadworm) and Trichuris muris (nematode parasite of mice) in vitro. Extracts of two plants, Laphangium luteoalbum and Commiphora pyaracanthoides, displayed promising anthelmintic activity against NTS and adult S. mansoni, respectively. L. luteoalbum displayed 85.4% activity at 1 μg/mL against NTS, while C. pyracanthoides displayed 78.5% activity against adult S. mansoni at 10 μg/mL.
Collapse
Affiliation(s)
- Mthandazo Dube
- Department of Bioorganic Chemistry, Leibniz Institute of Plant Biochemistry, Weinberg 3, D-06120 Halle (Saale), Germany
| | - Boingotlo Raphane
- Department of Biological Sciences, Faculty of Science, University of Botswana, Gaborone P.O. Box 0022, Botswana
| | - Bongani Sethebe
- Department of Biological Sciences, Faculty of Science, University of Botswana, Gaborone P.O. Box 0022, Botswana
| | | | | | - Peter Imming
- Institute of Pharmacy, Faculty of Natural Sciences, Martin-Luther-University Halle-Wittenberg, D-06120 Halle (Saale), Germany
| | - Cécile Häberli
- Swiss Tropical and Public Health Institute, Kreuzstrasse 2, CH-4123 Allschwil, Switzerland
- University of Basel, CH-4051 Basel, Switzerland
| | - Jennifer Keiser
- Swiss Tropical and Public Health Institute, Kreuzstrasse 2, CH-4123 Allschwil, Switzerland
- University of Basel, CH-4051 Basel, Switzerland
| | - Norbert Arnold
- Department of Bioorganic Chemistry, Leibniz Institute of Plant Biochemistry, Weinberg 3, D-06120 Halle (Saale), Germany
| | - Kerstin Andrae-Marobela
- Department of Biological Sciences, Faculty of Science, University of Botswana, Gaborone P.O. Box 0022, Botswana
| |
Collapse
|
5
|
Joshi P, Vedhanayagam M, Ramesh R. An Ensembled SVM Based Approach for Predicting Adverse Drug Reactions. Curr Bioinform 2021. [DOI: 10.2174/1574893615999200707141420] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Background:
Preventing adverse drug reactions (ADRs) is imperative for the safety of
the people. The problem of under-reporting the ADRs has been prevalent across the world, making it
difficult to develop the prediction models, which are unbiased. As a result, most of the models are
skewed to the negative samples leading to high accuracy but poor performance in other metrics such
as precision, recall, F1 score, and AUROC score.
Objective:
In this work, we have proposed a novel way of predicting the ADRs by balancing the dataset.
Method:
The whole data set has been partitioned into balanced smaller data sets. SVMs with
optimal kernel have been learned using each of the balanced data sets and the prediction of given
ADR for the given drug has been obtained by voting from the ensembled optimal SVMs learned.
Results:
We have found that results are encouraging and comparable with the competing methods in
the literature and obtained the average sensitivity of 0.97 for all the ADRs. The model has been
interpreted and explained with SHAP values by various plots.
Conclusion:
A novel way of predicting ADRs by balancing the dataset has been proposed thereby
reducing the effect of unbalanced datasets.
Collapse
Affiliation(s)
- Pratik Joshi
- Department of Computer Science and Engineering, IIITDM Kancheepuram, Chennai, India
| | | | | |
Collapse
|
6
|
Gao D, Chen Q, Zeng Y, Jiang M, Zhang Y. Applications of Machine Learning in Drug Target Discovery. Curr Drug Metab 2020; 21:790-803. [PMID: 32723266 DOI: 10.2174/1567201817999200728142023] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2020] [Revised: 03/12/2020] [Accepted: 05/13/2020] [Indexed: 12/15/2022]
Abstract
Drug target discovery is a critical step in drug development. It is the basis of modern drug development because it determines the target molecules related to specific diseases in advance. Predicting drug targets by computational methods saves a great deal of financial and material resources compared to in vitro experiments. Therefore, several computational methods for drug target discovery have been designed. Recently, machine learning (ML) methods in biomedicine have developed rapidly. In this paper, we present an overview of drug target discovery methods based on machine learning. Considering that some machine learning methods integrate network analysis to predict drug targets, network-based methods are also introduced in this article. Finally, the challenges and future outlook of drug target discovery are discussed.
Collapse
Affiliation(s)
- Dongrui Gao
- School of Computer Science, Chengdu University of Information Technology, Chengdu 610225, China
| | - Qingyuan Chen
- School of Computer Science, Chengdu University of Information Technology, Chengdu 610225, China
| | - Yuanqi Zeng
- School of Computer Science, Chengdu University of Information Technology, Chengdu 610225, China
| | - Meng Jiang
- School of Mechanical Automotive Engineering, Nanyang Institute of Technology, Nanyang 473000, China
| | - Yongqing Zhang
- School of Computer Science, Chengdu University of Information Technology, Chengdu 610225, China
| |
Collapse
|
7
|
Yang J, Zhang D, Liu L, Li G, Cai Y, Zhang Y, Jin H, Chen X. Computational drug repositioning based on the relationships between substructure-indication. Brief Bioinform 2020; 22:6032618. [PMID: 33313675 DOI: 10.1093/bib/bbaa348] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2020] [Revised: 10/13/2020] [Accepted: 10/30/2020] [Indexed: 12/15/2022] Open
Abstract
At present, computational methods for drug repositioning are mainly based on the whole structures of drugs, which limits the discovery of new functions due to the similarities between local structures of drugs. In this article, we, for the first time, integrated the features of chemical-genomics (substructure-domain) and pharmaco-genomics (domain-indication) based on the assumption that drug-target interactions are mediated by the substructures of drugs and the domains of proteins to identify the relationships between substructure-indication and establish a drug-substructure-indication network for predicting all therapeutic effects of tested drugs through only information on the substructures of drugs. In total, 83 205 drug-indication relationships with different correlation scores were obtained. We used three different verification methods to indicate the accuracy of the method and the reliability of the scoring system. We predicted all indications of olaparib using our method, including the known antitumor effect and unknown antiviral effect verified by literature, and we also discovered the inhibitory mechanism of olaparib toward DNA repair through its specific sub494 (o = C-C: C), as it participates in the low synthesis of the poly subfunction of the apoptosis pathway (hsa04210) by inhibiting the Inositol 1,4,5-trisphosphate receptor(s) (ITPRs) and hydrolyzing poly (ADP ribose) polymerases. ElectroCardioGrams of four drugs (quinidine, amiodarone, milrinone and fosinopril) demonstrated the effect of anti-arrhythmia. Unlike previous studies focusing on the overall structures of drugs, our research has great potential in the search for more therapeutic effects of drugs and in predicting all potential effects and mechanisms of a drug from the local structural similarity.
Collapse
Affiliation(s)
- Jingbo Yang
- College of Bioinformatics Science and Technology, Harbin Medical University
| | - Denan Zhang
- College of Bioinformatics Science and Technology, Harbin Medical University
| | - Lei Liu
- College of Bioinformatics Science and Technology, Harbin Medical University
| | - Guoqi Li
- College of Bioinformatics Science and Technology, Harbin Medical University
| | - Yiyang Cai
- College of Bioinformatics Science and Technology, Harbin Medical University
| | - Yan Zhang
- College of Bioinformatics Science and Technology, Harbin Medical University
| | - Hongbo Jin
- Department of Physiology, Harbin Medical University
| | - Xiujie Chen
- College of Bioinformatics Science and Technology, Harbin Medical University
| |
Collapse
|
8
|
Chu Y, Shan X, Chen T, Jiang M, Wang Y, Wang Q, Salahub DR, Xiong Y, Wei DQ. DTI-MLCD: predicting drug-target interactions using multi-label learning with community detection method. Brief Bioinform 2020; 22:5910189. [PMID: 32964234 DOI: 10.1093/bib/bbaa205] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2020] [Revised: 08/06/2020] [Accepted: 08/10/2020] [Indexed: 12/20/2022] Open
Abstract
Identifying drug-target interactions (DTIs) is an important step for drug discovery and drug repositioning. To reduce the experimental cost, a large number of computational approaches have been proposed for this task. The machine learning-based models, especially binary classification models, have been developed to predict whether a drug-target pair interacts or not. However, there is still much room for improvement in the performance of current methods. Multi-label learning can overcome some difficulties caused by single-label learning in order to improve the predictive performance. The key challenge faced by multi-label learning is the exponential-sized output space, and considering label correlations can help to overcome this challenge. In this paper, we facilitate multi-label classification by introducing community detection methods for DTI prediction, named DTI-MLCD. Moreover, we updated the gold standard data set by adding 15,000 more positive DTI samples in comparison to the data set, which has widely been used by most of previously published DTI prediction methods since 2008. The proposed DTI-MLCD is applied to both data sets, demonstrating its superiority over other machine learning methods and several existing methods. The data sets and source code of this study are freely available at https://github.com/a96123155/DTI-MLCD.
Collapse
Affiliation(s)
- Yanyi Chu
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University
| | - Xiaoqi Shan
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University
| | - Tianhang Chen
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University
| | - Mingming Jiang
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University
| | - Yanjing Wang
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University
| | - Qiankun Wang
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University
| | | | - Yi Xiong
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University
| | - Dong-Qing Wei
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University
| |
Collapse
|
9
|
Benchmarking Data Sets from PubChem BioAssay Data: Current Scenario and Room for Improvement. Int J Mol Sci 2020; 21:ijms21124380. [PMID: 32575564 PMCID: PMC7352161 DOI: 10.3390/ijms21124380] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2020] [Revised: 06/15/2020] [Accepted: 06/18/2020] [Indexed: 11/17/2022] Open
Abstract
Developing realistic data sets for evaluating virtual screening methods is a task that has been tackled by the cheminformatics community for many years. Numerous artificially constructed data collections were developed, such as DUD, DUD-E, or DEKOIS. However, they all suffer from multiple drawbacks, one of which is the absence of experimental results confirming the impotence of presumably inactive molecules, leading to possible false negatives in the ligand sets. In light of this problem, the PubChem BioAssay database, an open-access repository providing the bioactivity information of compounds that were already tested on a biological target, is now a recommended source for data set construction. Nevertheless, there exist several issues with the use of such data that need to be properly addressed. In this article, an overview of benchmarking data collections built upon experimental PubChem BioAssay input is provided, along with a thorough discussion of noteworthy issues that one must consider during the design of new ligand sets from this database. The points raised in this review are expected to guide future developments in this regard, in hopes of offering better evaluation tools for novel in silico screening procedures.
Collapse
|
10
|
Amano Y, Honda H, Sawada R, Nukada Y, Yamane M, Ikeda N, Morita O, Yamanishi Y. In silico systems for predicting chemical-induced side effects using known and potential chemical protein interactions, enabling mechanism estimation. J Toxicol Sci 2020; 45:137-149. [PMID: 32147637 DOI: 10.2131/jts.45.137] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
In silico models for predicting chemical-induced side effects have become increasingly important for the development of pharmaceuticals and functional food products. However, existing predictive models have difficulty in estimating the mechanisms of side effects in terms of molecular targets or they do not cover the wide range of pharmacological targets. In the present study, we constructed novel in silico models to predict chemical-induced side effects and estimate the underlying mechanisms with high general versatility by integrating the comprehensive prediction of potential chemical-protein interactions (CPIs) with machine learning. First, the potential CPIs were comprehensively estimated by chemometrics based on the known CPI data (1,179,848 interactions involving 3,905 proteins and 824,143 chemicals). Second, the predictive models for 61 side effects in the cardiovascular system (CVS), gastrointestinal system (GIS), and central nervous system (CNS) were constructed by sparsity-induced classifiers based on the known and potential CPI data. The cross validation experiments showed that the proposed CPI-based models had a higher or comparable performance than the traditional chemical structure-based models. Moreover, our enrichment analysis indicated that the highly weighted proteins derived from predictive models could be involved in the corresponding functions of the side effects. For example, in CVS, the carcinogenesis-related pathways (e.g., prostate cancer, PI3K-Akt signal pathway), which were recently reported to be involved in cardiovascular side effects, were enriched. Therefore, our predictive models are biologically valid and would be useful for predicting side effects and novel potential underlying mechanisms of chemical-induced side effects.
Collapse
Affiliation(s)
- Yuto Amano
- R&D Safety Science Research, Kao Corporation
| | | | - Ryusuke Sawada
- Department of Bioscience and Bioinformatics, Kyushu Institute of Technology
| | - Yuko Nukada
- R&D Safety Science Research, Kao Corporation
| | | | | | | | | |
Collapse
|
11
|
Jiang H, Qiu Y, Hou W, Cheng X, Yim MY, Ching WK. Drug Side-Effect Profiles Prediction: From Empirical to Structural Risk Minimization. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2020; 17:402-410. [PMID: 29994681 DOI: 10.1109/tcbb.2018.2850884] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
The identification of drug side-effects is considered to be an important step in drug design, which could not only shorten the time but also reduce the cost of drug development. In this paper, we investigate the relationship between the potential side-effects of drug candidates and their chemical structures. The preliminary Regularized Regression (RR) model for drug side-effects prediction has promising features in the efficiency of model training and the existence of a closed form solution. It performs better than other state-of-the-art methods, in terms of minimum accuracy and average accuracy. In order to dig inside how drug structure will associate with side effect, we further propose weighted GTS (Generalized T-Student Kernel: WGTS) SVM model from a structural risk minimization perspective. The SVM model proposed in this paper provides a better understanding of drug side-effects in the process of drug development. The usefulness of the WGTS model lies in the superior performance in a cross validation setting on 888 approved drugs with 1385 side-effects profiling from SIDER database. This work is expected to shed light on intriguing studies that predict potential un-identifying side-effects and suggest how we can avoid drug side-effects by the removal of some distinguished chemical structures.
Collapse
|
12
|
Zheng Y, Peng H, Zhang X, Zhao Z, Gao X, Li J. Old drug repositioning and new drug discovery through similarity learning from drug-target joint feature spaces. BMC Bioinformatics 2019; 20:605. [PMID: 31881829 PMCID: PMC6933655 DOI: 10.1186/s12859-019-3238-y] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2019] [Accepted: 11/14/2019] [Indexed: 12/24/2022] Open
Abstract
Background Detection of new drug-target interactions by computational algorithms is of crucial value to both old drug repositioning and new drug discovery. Existing machine-learning methods rely only on experimentally validated drug-target interactions (i.e., positive samples) for the predictions. Their performance is severely impeded by the lack of reliable negative samples. Results We propose a method to construct highly-reliable negative samples for drug target prediction by a pairwise drug-target similarity measurement and OCSVM with a high-recall constraint. On one hand, we measure the pairwise similarity between every two drug-target interactions by combining the chemical similarity between their drugs and the Gene Ontology-based similarity between their targets. Then we calculate the accumulative similarity with all known drug-target interactions for each unobserved drug-target interaction. On the other hand, we obtain the signed distance from OCSVM learned from the known interactions with high recall (≥0.95) for each unobserved drug-target interaction. After normalizing all accumulative similarities and signed distances to the range [0,1], we compute the score for each unobserved drug-target interaction via averaging its accumulative similarity and signed distance. Unobserved interactions with lower scores are preferentially served as reliable negative samples for the classification algorithms. The performance of the proposed method is evaluated on the interaction data between 1094 drugs and 1556 target proteins. Extensive comparison experiments using four classical classifiers and one domain predictive method demonstrate the superior performance of the proposed method. A better decision boundary has been learned from the constructed reliable negative samples. Conclusions Proper construction of highly-reliable negative samples can help the classification models learn a clear decision boundary which contributes to the performance improvement.
Collapse
Affiliation(s)
- Yi Zheng
- Faculty of Engineering and Information Technology, University of Technology Sydney, 15 Broadway Ultimo, Sydney, 2007, Australia
| | - Hui Peng
- Faculty of Engineering and Information Technology, University of Technology Sydney, 15 Broadway Ultimo, Sydney, 2007, Australia
| | - Xiaocai Zhang
- Faculty of Engineering and Information Technology, University of Technology Sydney, 15 Broadway Ultimo, Sydney, 2007, Australia
| | - Zhixun Zhao
- Faculty of Engineering and Information Technology, University of Technology Sydney, 15 Broadway Ultimo, Sydney, 2007, Australia
| | - Xiaoying Gao
- School of Engineering and Computer Science, Victoria University of Wellington, Cotton Building, Kelburn Campus, Wellington, 6140, New Zealand
| | - Jinyan Li
- Faculty of Engineering and Information Technology, University of Technology Sydney, 15 Broadway Ultimo, Sydney, 2007, Australia.
| |
Collapse
|
13
|
Mahmud SMH, Chen W, Meng H, Jahan H, Liu Y, Hasan SMM. Prediction of drug-target interaction based on protein features using undersampling and feature selection techniques with boosting. Anal Biochem 2019; 589:113507. [PMID: 31734254 DOI: 10.1016/j.ab.2019.113507] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2019] [Revised: 11/05/2019] [Accepted: 11/08/2019] [Indexed: 12/29/2022]
Abstract
Accurate identification of drug-target interaction (DTI) is a crucial and challenging task in the drug discovery process, having enormous benefit to the patients and pharmaceutical company. The traditional wet-lab experiments of DTI is expensive, time-consuming, and labor-intensive. Therefore, many computational techniques have been established for this purpose; although a huge number of interactions are still undiscovered. Here, we present pdti-EssB, a new computational model for identification of DTI using protein sequence and drug molecular structure. More specifically, each drug molecule is transformed as the molecular substructure fingerprint. For a protein sequence, different descriptors are utilized to represent its evolutionary, sequence, and structural information. Besides, our proposed method uses data balancing techniques to handle the imbalance problem and applies a novel feature eliminator to extract the best optimal features for accurate prediction. In this paper, four classes of DTI benchmark datasets are used to construct a predictive model with XGBoost. Here, the auROC is utilized as an evaluation metric to compare the performance of pdti-EssB method with recent methods, applying five-fold cross-validation. Finally, the experimental results indicate that our proposed method is able to outperform other approaches in predicting DTI, and introduces new drug-target interaction samples based on prediction probability scores. pdti-EssB webserver is available online at http://pdtiessb-uestc.com/.
Collapse
Affiliation(s)
- S M Hasan Mahmud
- School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, 611731, China.
| | - Wenyu Chen
- School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, 611731, China.
| | - Han Meng
- School of Political Science and Public Administration, University of Electronic Science and Technology of China, Chengdu, 611731, China.
| | - Hosney Jahan
- College of Computer Science, Sichuan University, Chengdu, 610065, China.
| | - Yongsheng Liu
- School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, 611731, China.
| | - S M Mamun Hasan
- Department of Internal Medicine, Rangpur Medical College, Rangpur, 5400, Bangladesh.
| |
Collapse
|
14
|
Jippo H, Matsuo T, Kikuchi R, Fukuda D, Matsuura A, Ohfuchi M. Graph Classification of Molecules Using Force Field Atom and Bond Types. Mol Inform 2019; 39:e1800155. [PMID: 31589809 DOI: 10.1002/minf.201800155] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2018] [Accepted: 09/23/2019] [Indexed: 11/08/2022]
Abstract
Classification of the biological activities of chemical substances is important for developing new medicines efficiently. Various machine learning methods are often employed to screen large libraries of compounds and predict the activities of new substances by training the molecular structure-activity relationships. One such method is graph classification, in which a molecular structure can be represented in terms of a labeled graph with nodes that correspond to atoms and edges that correspond to the bonds between these atoms. In a conventional graph definition, atomic symbols and bond orders are employed as node and edge labels, respectively. In this study, we developed new graph definitions using the assignment of atom and bond types in the force fields of molecular dynamics methods as node and edge labels, respectively. We found that these graph definitions improved the accuracies of activity classifications for chemical substances using graph kernels with support vector machines and deep neural networks. The higher accuracies obtained using our proposed definitions can enhance the development of the materials informatics using graph-based machine learning methods.
Collapse
Affiliation(s)
- Hideyuki Jippo
- Digital Annealer Unit, Fujitsu Laboratories Ltd., 10-1 Morinosato-Wakamiya, Atsugi, Kanagawa, 243-0197, Japan
| | - Tatsuru Matsuo
- Artificial Intelligence Laboratory, Fujitsu Laboratories Ltd., 4-1-1 Kamikodanaka, Nakahara-ku, Kawasaki, Kanagawa, 211-8588, Japan
| | - Ryota Kikuchi
- Artificial Intelligence Laboratory, Fujitsu Laboratories Ltd., 4-1-1 Kamikodanaka, Nakahara-ku, Kawasaki, Kanagawa, 211-8588, Japan
| | - Daisuke Fukuda
- Artificial Intelligence Laboratory, Fujitsu Laboratories Ltd., 4-1-1 Kamikodanaka, Nakahara-ku, Kawasaki, Kanagawa, 211-8588, Japan
| | - Azuma Matsuura
- Digital Annealer Unit, Fujitsu Laboratories Ltd., 10-1 Morinosato-Wakamiya, Atsugi, Kanagawa, 243-0197, Japan
| | - Mari Ohfuchi
- Digital Annealer Unit, Fujitsu Laboratories Ltd., 10-1 Morinosato-Wakamiya, Atsugi, Kanagawa, 243-0197, Japan
| |
Collapse
|
15
|
Zhang W, Huai Y, Miao Z, Qian A, Wang Y. Systems Pharmacology for Investigation of the Mechanisms of Action of Traditional Chinese Medicine in Drug Discovery. Front Pharmacol 2019; 10:743. [PMID: 31379563 PMCID: PMC6657703 DOI: 10.3389/fphar.2019.00743] [Citation(s) in RCA: 100] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2019] [Accepted: 06/07/2019] [Indexed: 01/01/2023] Open
Abstract
As a traditional medical intervention in Asia and a complementary and alternative medicine in western countries, traditional Chinese medicine (TCM) has attracted global attention in the life science field. TCM provides extensive natural resources for medicinal compounds, and these resources are generally regarded as effective and safe for use in drug discovery. However, owing to the complexity of compounds and their related multiple targets of TCM, it remains difficult to dissect the mechanisms of action of herbal medicines at a holistic level. To solve the issue, in the review, we proposed a novel approach of systems pharmacology to identify the bioactive compounds, predict their related targets, and illustrate the molecular mechanisms of action of TCM. With a predominant focus on the mechanisms of actions of TCM, we also highlighted the application of the systems pharmacology approach for the prediction of drug combination and dynamic analysis, the synergistic effects of TCMs, formula dissection, and theory analysis. In summary, the systems pharmacology method contributes to understand the complex interactions among biological systems, drugs, and complex diseases from a network perspective. Consequently, systems pharmacology provides a novel approach to promote drug discovery in a precise manner and a systems level, thus facilitating the modernization of TCM.
Collapse
Affiliation(s)
- Wenjuan Zhang
- Lab for Bone Metabolism, Key Lab for Space Biosciences and Biotechnology, School of Life Sciences, Northwestern Polytechnical University, Xi’an, China
- Research Center for Special Medicine and Health Systems Engineering, School of Life Sciences, Northwestern Polytechnical University, Xi’an, China
- NPU-UAB Joint Laboratory for Bone Metabolism, School of Life Sciences, Northwestern Polytechnical University, Xi’an, China
| | - Ying Huai
- Lab for Bone Metabolism, Key Lab for Space Biosciences and Biotechnology, School of Life Sciences, Northwestern Polytechnical University, Xi’an, China
- Research Center for Special Medicine and Health Systems Engineering, School of Life Sciences, Northwestern Polytechnical University, Xi’an, China
- NPU-UAB Joint Laboratory for Bone Metabolism, School of Life Sciences, Northwestern Polytechnical University, Xi’an, China
| | - Zhiping Miao
- Lab for Bone Metabolism, Key Lab for Space Biosciences and Biotechnology, School of Life Sciences, Northwestern Polytechnical University, Xi’an, China
- Research Center for Special Medicine and Health Systems Engineering, School of Life Sciences, Northwestern Polytechnical University, Xi’an, China
- NPU-UAB Joint Laboratory for Bone Metabolism, School of Life Sciences, Northwestern Polytechnical University, Xi’an, China
| | - Airong Qian
- Lab for Bone Metabolism, Key Lab for Space Biosciences and Biotechnology, School of Life Sciences, Northwestern Polytechnical University, Xi’an, China
- Research Center for Special Medicine and Health Systems Engineering, School of Life Sciences, Northwestern Polytechnical University, Xi’an, China
- NPU-UAB Joint Laboratory for Bone Metabolism, School of Life Sciences, Northwestern Polytechnical University, Xi’an, China
| | - Yonghua Wang
- Lab of Systems Pharmacology, College of Life Sciences, Northwest University, Xi’an, China
| |
Collapse
|
16
|
Fukuoka Y. Machine Learning Approach for Predicting New Uses of Existing Drugs and Evaluation of Their Reliabilities. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2019; 1903:269-279. [PMID: 30547448 DOI: 10.1007/978-1-4939-8955-3_16] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
In this chapter, a new method to evaluate the reliability of predicting new uses of existing drugs was proposed. The prediction was performed with a support vector machine (SVM) using various data. Because the reliability of prediction could not be evaluated based on the output of an SVM, which was binary, the proposed method evaluated the reliability as a product of a distance from the separating hyperplane of the SVM and a similarity between the disease targeted by the drug and a candidate disease. A validation using real data revealed that the performance of the proposed method was promising.
Collapse
|
17
|
CFSBoost: Cumulative feature subspace boosting for drug-target interaction prediction. J Theor Biol 2019; 464:1-8. [DOI: 10.1016/j.jtbi.2018.12.024] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2018] [Revised: 12/13/2018] [Accepted: 12/18/2018] [Indexed: 01/12/2023]
|
18
|
Wang CS, Lin PJ, Cheng CL, Tai SH, Kao Yang YH, Chiang JH. Detecting Potential Adverse Drug Reactions Using a Deep Neural Network Model. J Med Internet Res 2019; 21:e11016. [PMID: 30724742 PMCID: PMC6381404 DOI: 10.2196/11016] [Citation(s) in RCA: 43] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2018] [Revised: 10/03/2018] [Accepted: 11/04/2018] [Indexed: 01/17/2023] Open
Abstract
BACKGROUND Adverse drug reactions (ADRs) are common and are the underlying cause of over a million serious injuries and deaths each year. The most familiar method to detect ADRs is relying on spontaneous reports. Unfortunately, the low reporting rate of spontaneous reports is a serious limitation of pharmacovigilance. OBJECTIVE The objective of this study was to identify a method to detect potential ADRs of drugs automatically using a deep neural network (DNN). METHODS We designed a DNN model that utilizes the chemical, biological, and biomedical information of drugs to detect ADRs. This model aimed to fulfill two main purposes: identifying the potential ADRs of drugs and predicting the possible ADRs of a new drug. For improving the detection performance, we distributed representations of the target drugs in a vector space to capture the drug relationships using the word-embedding approach to process substantial biomedical literature. Moreover, we built a mapping function to address new drugs that do not appear in the dataset. RESULTS Using the drug information and the ADRs reported up to 2009, we predicted the ADRs of drugs recorded up to 2012. There were 746 drugs and 232 new drugs, which were only recorded in 2012 with 1325 ADRs. The experimental results showed that the overall performance of our model with mean average precision at top-10 achieved is 0.523 and the rea under the receiver operating characteristic curve (AUC) score achieved is 0.844 for ADR prediction on the dataset. CONCLUSIONS Our model is effective in identifying the potential ADRs of a drug and the possible ADRs of a new drug. Most importantly, it can detect potential ADRs irrespective of whether they have been reported in the past.
Collapse
Affiliation(s)
- Chi-Shiang Wang
- Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan, Taiwan
| | - Pei-Ju Lin
- Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan, Taiwan
| | - Ching-Lan Cheng
- School of Pharmacy, College of Medicine, National Cheng Kung University, Tainan, Taiwan.,Institute of Clinical Pharmacy and Pharmaceutical Sciences, College of Medicine, National Cheng Kung University, Tainan, Taiwan.,Department of Pharmacy, National Cheng Kung University Hospital, National Cheng Kung University, Tainan, Taiwan
| | - Shu-Hua Tai
- Department of Pharmacy, National Cheng Kung University Hospital, National Cheng Kung University, Tainan, Taiwan
| | - Yea-Huei Kao Yang
- School of Pharmacy, College of Medicine, National Cheng Kung University, Tainan, Taiwan.,Institute of Clinical Pharmacy and Pharmaceutical Sciences, College of Medicine, National Cheng Kung University, Tainan, Taiwan
| | - Jung-Hsien Chiang
- Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan, Taiwan.,Institute of Medical Informatics, National Cheng Kung University, Tainan, Taiwan
| |
Collapse
|
19
|
Zheng Y, Peng H, Zhang X, Zhao Z, Yin J, Li J. Predicting adverse drug reactions of combined medication from heterogeneous pharmacologic databases. BMC Bioinformatics 2018; 19:517. [PMID: 30598065 PMCID: PMC6311930 DOI: 10.1186/s12859-018-2520-8] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
BACKGROUND Early and accurate identification of potential adverse drug reactions (ADRs) for combined medication is vital for public health. Existing methods either rely on expensive wet-lab experiments or detecting existing associations from related records. Thus, they inevitably suffer under-reporting, delays in reporting, and inability to detect ADRs for new and rare drugs. The current application of machine learning methods is severely impeded by the lack of proper drug representation and credible negative samples. Therefore, a method to represent drugs properly and to select credible negative samples becomes vital in applying machine learning methods to this problem. RESULTS In this work, we propose a machine learning method to predict ADRs of combined medication from pharmacologic databases by building up highly-credible negative samples (HCNS-ADR). Specifically, we fuse heterogeneous information from different databases and represent each drug as a multi-dimensional vector according to its chemical substructures, target proteins, substituents, and related pathways first. Then, a drug-pair vector is obtained by appending the vector of one drug to the other. Next, we construct a drug-disease-gene network and devise a scoring method to measure the interaction probability of every drug pair via network analysis. Drug pairs with lower interaction probability are preferentially selected as negative samples. Following that, the validated positive samples and the selected credible negative samples are projected into a lower-dimensional space using the principal component analysis. Finally, a classifier is built for each ADR using its positive and negative samples with reduced dimensions. The performance of the proposed method is evaluated on simulative prediction for 1276 ADRs and 1048 drugs, comparing using four machine learning algorithms and with two baseline approaches. Extensive experiments show that the proposed way to represent drugs characterizes drugs accurately. With highly-credible negative samples selected by HCNS-ADR, the four machine learning algorithms achieve significant performance improvements. HCNS-ADR is also shown to be able to predict both known and novel drug-drug-ADR associations, outperforming two other baseline approaches significantly. CONCLUSIONS The results demonstrate that integration of different drug properties to represent drugs are valuable for ADR prediction of combined medication and the selection of highly-credible negative samples can significantly improve the prediction performance.
Collapse
Affiliation(s)
- Yi Zheng
- Advanced Analytics Institute, Faculty of Engineering and Information Technology, University of Technology Sydney, 15 Broadway Ultimo, Sydney, 2007 Australia
| | - Hui Peng
- Advanced Analytics Institute, Faculty of Engineering and Information Technology, University of Technology Sydney, 15 Broadway Ultimo, Sydney, 2007 Australia
| | - Xiaocai Zhang
- Advanced Analytics Institute, Faculty of Engineering and Information Technology, University of Technology Sydney, 15 Broadway Ultimo, Sydney, 2007 Australia
| | - Zhixun Zhao
- Advanced Analytics Institute, Faculty of Engineering and Information Technology, University of Technology Sydney, 15 Broadway Ultimo, Sydney, 2007 Australia
| | - Jie Yin
- Discipline of Business Analytics, The University of Sydney, Darlington, Sydney, 2006 Australia
| | - Jinyan Li
- Advanced Analytics Institute, Faculty of Engineering and Information Technology, University of Technology Sydney, 15 Broadway Ultimo, Sydney, 2007 Australia
| |
Collapse
|
20
|
Xiao T, Qi X, Chen Y, Jiang Y. Development of Ligand-based Big Data Deep Neural Network Models for Virtual Screening of Large Compound Libraries. Mol Inform 2018; 37:e1800031. [PMID: 29882343 DOI: 10.1002/minf.201800031] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2018] [Accepted: 05/20/2018] [Indexed: 11/08/2022]
Abstract
High-performance ligand-based virtual screening (VS) models have been developed using various computational methods, including the deep neural network (DNN) method. There are high expectations for exploration of the advanced capabilities of DNN to improve VS performance, and this capability has been optimally achieved using large data training datasets. However, their ability to screen large compound libraries has not been evaluated. There is a need for developing and evaluating ligand-based large data DNN VS models for large compound libraries. In this study, we developed ligand-based large data DNN VS models for inhibitors of six anticancer targets using 0.5 M training compounds. The developed VS models were evaluated by 10-fold cross-validation, achieving 77.9-97.8 % sensitivity, 99.9-100 % specificity, 0.82-0.98 Matthews correlation coefficient and 0.98-0.99 area under the curve, outperforming random forest models. Moreover, DNN VS models developed by pre-2015 inhibitors identified 50 % of post-2015 inhibitors with a 0.01-0.09 % false positive rate in screening 89 M PubChem compounds, also outperforming previous models. Experimental assays of the selected virtual hits of the EGFR inhibitor model led to reasonable novel structures of EGFR inhibitors. Our results confirmed the usefulness of the large data DNN model as a ligand-based VS tool to screen large compound libraries.
Collapse
Affiliation(s)
- Tao Xiao
- Department of Chemistry, Tsinghua University, Beijing, 100084, PR China.,The State Key Laboratory of Chemical Oncogenomics, the Graduate School at Shenzhen, Tsinghua University, Shenzhen, 518055, P. R. China
| | - Xingxing Qi
- The State Key Laboratory of Chemical Oncogenomics, the Graduate School at Shenzhen, Tsinghua University, Shenzhen, 518055, P. R. China
| | - Yuzong Chen
- Bioinformatics and Drug Design Group, Department of Pharmacy, National, University of Singapore, Singapore, 117543, Singapore.,Shenzhen Kivita Innovative Drug Institute, Shenzhen, 518055, P. R. China
| | - Yuyang Jiang
- The State Key Laboratory of Chemical Oncogenomics, the Graduate School at Shenzhen, Tsinghua University, Shenzhen, 518055, P. R. China.,School of Pharmaceutical Sciences, Tsinghua University, Beijing, 100084, P. R. China
| |
Collapse
|
21
|
Huang G, Li J, Zhao C. Computational Prediction and Analysis of Associations between Small Molecules and Binding-Associated S-Nitrosylation Sites. Molecules 2018; 23:molecules23040954. [PMID: 29671802 PMCID: PMC6017196 DOI: 10.3390/molecules23040954] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2018] [Revised: 03/30/2018] [Accepted: 04/09/2018] [Indexed: 01/12/2023] Open
Abstract
Interactions between drugs and proteins occupy a central position during the process of drug discovery and development. Numerous methods have recently been developed for identifying drug–target interactions, but few have been devoted to finding interactions between post-translationally modified proteins and drugs. We presented a machine learning-based method for identifying associations between small molecules and binding-associated S-nitrosylated (SNO-) proteins. Namely, small molecules were encoded by molecular fingerprint, SNO-proteins were encoded by the information entropy-based method, and the random forest was used to train a classifier. Ten-fold and leave-one-out cross validations achieved, respectively, 0.7235 and 0.7490 of the area under a receiver operating characteristic curve. Computational analysis of similarity suggested that SNO-proteins associated with the same drug shared statistically significant similarity, and vice versa. This method and finding are useful to identify drug–SNO associations and further facilitate the discovery and development of SNO-associated drugs.
Collapse
Affiliation(s)
- Guohua Huang
- Provincial Key Laboratory of Informational Service for Rural Area of Southwestern Hunan, Shaoyang University, Shaoyang 422000, China.
- College of Information Engineering, Shaoyang University, Shaoyang 422000, China.
| | - Jincheng Li
- Provincial Key Laboratory of Informational Service for Rural Area of Southwestern Hunan, Shaoyang University, Shaoyang 422000, China.
- College of Information Engineering, Shaoyang University, Shaoyang 422000, China.
| | - Chenglin Zhao
- Provincial Key Laboratory of Informational Service for Rural Area of Southwestern Hunan, Shaoyang University, Shaoyang 422000, China.
- College of Information Engineering, Shaoyang University, Shaoyang 422000, China.
| |
Collapse
|
22
|
Yamanishi Y. Linear and Kernel Model Construction Methods for Predicting Drug-Target Interactions in a Chemogenomic Framework. Methods Mol Biol 2018; 1825:355-368. [PMID: 30334213 DOI: 10.1007/978-1-4939-8639-2_12] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Identification of drug-target interactions is a crucial process in drug discovery. In this chapter, we present protocols for recent advancements in machine learning methods for predicting drug-target interactions from heterogeneous biological data in a chemogenomic framework, in which prediction is based on the chemical structure data of drug candidate compounds and translated genomic sequence data of target candidate proteins. Most existing methods are based on either linear modeling or kernel modeling. To illustrate linear modeling, we introduce sparsity-induced binary classifiers and sparse canonical correlation analysis. To illustrate kernel modeling, we introduce pairwise kernel-based support vector machines and kernel-based distance learning. Workflows for using these techniques are presented. We also discuss the characteristics of each method and suggest some directions for future research.
Collapse
Affiliation(s)
- Yoshihiro Yamanishi
- Department of Bioscience and Bioinformatics, Faculty of Computer Science and Systems Engineering, Kyushu Institute of Technology, Iizuka, Fukuoka, Japan.
- PRESTO, Japan Science and Technology Agency, Kawaguchi, Saitama, Japan.
| |
Collapse
|
23
|
Physicochemical Property Labels as Molecular Descriptors for Improved Analysis of Compound-Protein and Compound-Compound Networks. Methods Mol Biol 2018; 1825:211-225. [PMID: 30334207 DOI: 10.1007/978-1-4939-8639-2_6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Small molecules can be represented in various file formats, (1) one-line systems such as SMILES (Simplified Molecular Input Line Entry System) and InChI (International Chemical Identifier) and (2) table systems such as the molfiles, SDF (Structure Data File), and KCF (KEGG Chemical Function). KCF and KCF-S (KEGG Chemical Function-and-Substructures) apply physicochemical property labels on the representations of small molecules, and contribute to improved analysis of compound-protein networks including drug-target interaction, and compound-compound networks including metabolic pathways. In this chapter, the main concepts, usage, and some example applications of the KCFCO and KCF-S packages are explained.
Collapse
|
24
|
iDTI-ESBoost: Identification of Drug Target Interaction Using Evolutionary and Structural Features with Boosting. Sci Rep 2017; 7:17731. [PMID: 29255285 PMCID: PMC5735173 DOI: 10.1038/s41598-017-18025-2] [Citation(s) in RCA: 60] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2017] [Accepted: 12/05/2017] [Indexed: 02/07/2023] Open
Abstract
Prediction of new drug-target interactions is critically important as it can lead the researchers to find new uses for old drugs and to disclose their therapeutic profiles or side effects. However, experimental prediction of drug-target interactions is expensive and time-consuming. As a result, computational methods for predictioning new drug-target interactions have gained a tremendous interest in recent times. Here we present iDTI-ESBoost, a prediction model for identification of drug-target interactions using evolutionary and structural features. Our proposed method uses a novel data balancing and boosting technique to predict drug-target interaction. On four benchmark datasets taken from a gold standard data, iDTI-ESBoost outperforms the state-of-the-art methods in terms of area under receiver operating characteristic (auROC) curve. iDTI-ESBoost also outperforms the latest and the best-performing method found in the literature in terms of area under precision recall (auPR) curve. This is significant as auPR curves are argued as suitable metric for comparison for imbalanced datasets similar to the one studied here. Our reported results show the effectiveness of the classifier, balancing methods and the novel features incorporated in iDTI-ESBoost. iDTI-ESBoost is a novel prediction method that has for the first time exploited the structural features along with the evolutionary features to predict drug-protein interactions. We believe the excellent performance of iDTI-ESBoost both in terms of auROC and auPR would motivate the researchers and practitioners to use it to predict drug-target interactions. To facilitate that, iDTI-ESBoost is implemented and made publicly available at: http://farshidrayhan.pythonanywhere.com/iDTI-ESBoost/.
Collapse
|
25
|
Chen H, Bauer U, Engkvist O. Merged Multiple Ligands. METHODS AND PRINCIPLES IN MEDICINAL CHEMISTRY 2017. [DOI: 10.1002/9783527674381.ch9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Affiliation(s)
- Hongming Chen
- Discovery Sciences, Innovative Medicines and Early Development; AstraZeneca; Pepparedsleden 1 431 83 Mölndal Sweden
| | - Udo Bauer
- Cardiovascular and Metabolic Diseases, Innovative Medicines and Early Development; AstraZeneca; Pepparedsleden 1 431 83 Mölndal Sweden
| | - Ola Engkvist
- Discovery Sciences, Innovative Medicines and Early Development; AstraZeneca; Pepparedsleden 1 431 83 Mölndal Sweden
| |
Collapse
|
26
|
Wang J, Guo Z, Fu Y, Wu Z, Huang C, Zheng C, Shar PA, Wang Z, Xiao W, Wang Y. Weak-binding molecules are not drugs?-toward a systematic strategy for finding effective weak-binding drugs. Brief Bioinform 2017; 18:321-332. [PMID: 26962012 DOI: 10.1093/bib/bbw018] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2015] [Indexed: 12/16/2022] Open
Abstract
Designing maximally selective ligands that act on individual drug targets with high binding affinity has been the central dogma of drug discovery and development for the past two decades. However, many low-affinity drugs that aim for several targets at the same time are found more effective than the high-affinity binders when faced with complex disease conditions, such as cancers, Alzheimer's disease and cardiovascular diseases. The aim of this study was to appreciate the importance and reveal the features of weak-binding drugs and propose an integrated strategy for discovering them. Weak-binding drugs can be characterized by their high dissociation rates and transient interactions with their targets. In addition, network topologies and dynamics parameters involved in the targets of weak-binding drugs also influence the effects of the drugs. Here, we first performed a dynamics analysis for 33 elementary subgraphs to determine the desirable topology and dynamics parameters among targets. Then, by applying the elementary subgraphs to the mitogen-activated protein kinase (MAPK) pathway, several optimal target combinations were obtained. Combining drug-target interaction prediction with molecular dynamics simulation, we got two potential weak-binding drug candidates, luteolin and tanshinone IIA, acting on these targets. Further, the binding affinity of these two compounds to their targets and the anti-inflammatory effects of them were validated through in vitro experiments. In conclusion, weak-binding drugs have real opportunities for maximum efficiency and may show reduced adverse reactions, which can offer a bright and promising future for new drug discovery.
Collapse
Affiliation(s)
- Jinan Wang
- Lab of Systems Pharmacology, Center of Bioinformatics, College of Life Science, Northwest A&F University, Yangling, Shaanxi, China, School of Chemical engineering, Dalian University of Technology, Dalian, Liaoning, China, Beijing University of Chinese Medicine, ChaoYang District, Beijing, China and School of Chinese Medicine, Hong Kong Baptist University, Kowloon Tong, Hong Kong
| | - Zihu Guo
- Lab of Systems Pharmacology, Center of Bioinformatics, College of Life Science, Northwest A&F University, Yangling, Shaanxi, China, School of Chemical engineering, Dalian University of Technology, Dalian, Liaoning, China, Beijing University of Chinese Medicine, ChaoYang District, Beijing, China and School of Chinese Medicine, Hong Kong Baptist University, Kowloon Tong, Hong Kong
| | - Yingxue Fu
- Lab of Systems Pharmacology, Center of Bioinformatics, College of Life Science, Northwest A&F University, Yangling, Shaanxi, China, School of Chemical engineering, Dalian University of Technology, Dalian, Liaoning, China, Beijing University of Chinese Medicine, ChaoYang District, Beijing, China and School of Chinese Medicine, Hong Kong Baptist University, Kowloon Tong, Hong Kong
| | - Ziyin Wu
- Lab of Systems Pharmacology, Center of Bioinformatics, College of Life Science, Northwest A&F University, Yangling, Shaanxi, China, School of Chemical engineering, Dalian University of Technology, Dalian, Liaoning, China, Beijing University of Chinese Medicine, ChaoYang District, Beijing, China and School of Chinese Medicine, Hong Kong Baptist University, Kowloon Tong, Hong Kong
| | - Chao Huang
- Lab of Systems Pharmacology, Center of Bioinformatics, College of Life Science, Northwest A&F University, Yangling, Shaanxi, China, School of Chemical engineering, Dalian University of Technology, Dalian, Liaoning, China, Beijing University of Chinese Medicine, ChaoYang District, Beijing, China and School of Chinese Medicine, Hong Kong Baptist University, Kowloon Tong, Hong Kong
| | - Chunli Zheng
- Lab of Systems Pharmacology, Center of Bioinformatics, College of Life Science, Northwest A&F University, Yangling, Shaanxi, China, School of Chemical engineering, Dalian University of Technology, Dalian, Liaoning, China, Beijing University of Chinese Medicine, ChaoYang District, Beijing, China and School of Chinese Medicine, Hong Kong Baptist University, Kowloon Tong, Hong Kong
| | - Piar Ali Shar
- College of Life Science, Northwest A & F University, Yangling, Shaanxi, 712100, China; Center of Bioinformatics, Northwest A & F University, Yangling, Shaanxi, China
| | - Zhenzhong Wang
- Jiangsu Kanion Pharmaceutical Co. Ltd., Lianyungang, PR China
| | - Wei Xiao
- State Key Laboratory of New-Tech for Chinese Medicine Pharmaceutical Process, Lianyungang, Jiangsu, China
| | - Yonghua Wang
- Lab of Systems Pharmacology, Center of Bioinformatics, College of Life Science, Northwest A&F University, Yangling, Shaanxi, China, School of Chemical engineering, Dalian University of Technology, Dalian, Liaoning, China, Beijing University of Chinese Medicine, ChaoYang District, Beijing, China and School of Chinese Medicine, Hong Kong Baptist University, Kowloon Tong, Hong Kong
| |
Collapse
|
27
|
Chen FS, Jiang HY, Jiang Z. Prediction of drug–pathway interaction pairs with a disease-combined LSA-PU-KNN method. MOLECULAR BIOSYSTEMS 2017; 13:2583-2591. [DOI: 10.1039/c7mb00441a] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
This paper proposes a prediction of potential associations between drugs and pathways based on a disease-related LSA-PU-KNN method.
Collapse
Affiliation(s)
- Fan-Shu Chen
- Shanghai Key Laboratory of Multidimensional Information Processing
- East China Normal University
- Shanghai 200262
- China
- Department of Computer Science and Technology
| | - Hui-Yan Jiang
- Shanghai Key Laboratory of Multidimensional Information Processing
- East China Normal University
- Shanghai 200262
- China
- Department of Computer Science and Technology
| | - Zhenran Jiang
- Shanghai Key Laboratory of Multidimensional Information Processing
- East China Normal University
- Shanghai 200262
- China
- Department of Computer Science and Technology
| |
Collapse
|
28
|
Jiang QY, Zheng MS, Yang XJ, Sun XS. Analysis of molecular networks and targets mining of Chinese herbal medicines on anti-aging. BMC COMPLEMENTARY AND ALTERNATIVE MEDICINE 2016; 16:520. [PMID: 28031022 PMCID: PMC5198498 DOI: 10.1186/s12906-016-1513-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/16/2016] [Accepted: 12/09/2016] [Indexed: 12/22/2022]
Abstract
Background Many kidney-tonifying Chinese herbal medicines exert effects on anti-aging by comprehensive interactions of multiple targets. However, the interactions of multi-targets targeted by effective ingredients of kidney-tonifying Chinese herbal medicines are unknown. In this study, to explore the systems pharmacology mechanisms of kidney-tonifying Chinese medicines on anti-aging, we establish the molecular networks with the interactions of multi-targets, analyze bio-functions and pathways with IPA, and calculated the mutual interaction pairs of targets (target pairs) with data mining, respectively. Methods Kidney-tonifying Chinese medicines with anti-aging effects were screened from the Chinese Pharmacopoeia and the literatures. Target proteins of these herbal medicines were obtained from bioinformatics databases. Comparisons of molecular networks, bio-functions and pathways given by Ingenuity Pathway Analysis system showed the similarities and the differences between kidney Yin-tonifying herbal medicines and kidney Yang-tonifying herbal medicines. Target pairs with high correlation related to anti-aging were also discovered by data mining algorithm. And regulatory networks of targets were built based on the target pairs. Results Twenty-eight kidney-tonifying herbal medicines with anti-aging effects and 717 related target proteins were collected. The main bio-functions that all targets enriched in were “Cell Death and Survival”, “Free Radical Scavenging” and “Cellular Movement”, etc. The results of comparison analysis showed that kidney Yin-tonifying herbal medicines focused more on “Cancer related signaling”, “Apoptosis related signaling” and “Cardiovascular related signaling”. And kidney Yang-tonifying herbal medicines focused more on “Cellular stress and injury related signaling” and “Cellular growth, proliferation and development related signaling”. Moreover, the results of regulatory network showed that the anti-aging related target pairs with high correlated degrees of Kidney Yin-tonifying herbal medicines included TNF-PTGS2, TNF-CASP3, PTGS2-CASP3, CASP3-NOS2 and TNF-NOS2, and that of kidney Yang-tonifying herbal medicines included REAL-TNF, REAL-NFKBIA, REAL-JUN, PTGS2-SOD1 and TNF-IL6. Conclusions In this study, we achieved some important targets, target pairs and regulatory networks with bioinformatics and data mining, to discuss the systems pharmacology mechanisms of kidney-tonifying herbal medicines acting on anti-aging. Mutual target pairs related to anti-aging found in this study included TNF-PTGS2, TNF-CASP3, PTGS2-CASP3, CASP3-NOS2, TNF-NOS2, REAL-TNF, REAL-NFKBIA, REAL-JUN, PTGS2-SOD1 and TNF-IL6. Target pairs and regulatory networks of targets could reflect more potential interactions between targets and comprehensive effects on anti-aging. Compared with the existing researches, it was found that the kidney-tonifying herbal medicines may exert anti-aging effects in multiple pathways in this study. Electronic supplementary material The online version of this article (doi:10.1186/s12906-016-1513-2) contains supplementary material, which is available to authorized users.
Collapse
|
29
|
Wang C, Liu J, Luo F, Hu QN. Multi-fields model for predicting target–ligand interaction. Neurocomputing 2016. [DOI: 10.1016/j.neucom.2016.03.079] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
30
|
Abstract
INTRODUCTION With the emergence of the 'big data' era, the biomedical research community has great interest in exploiting publicly available chemical information for drug discovery. PubChem is an example of public databases that provide a large amount of chemical information free of charge. AREAS COVERED This article provides an overview of how PubChem's data, tools, and services can be used for virtual screening and reviews recent publications that discuss important aspects of exploiting PubChem for drug discovery. EXPERT OPINION PubChem offers comprehensive chemical information useful for drug discovery. It also provides multiple programmatic access routes, which are essential to build automated virtual screening pipelines that exploit PubChem data. In addition, PubChemRDF allows users to download PubChem data and load them into a local computing facility, facilitating data integration between PubChem and other resources. PubChem resources have been used in many studies for developing bioactivity and toxicity prediction models, discovering polypharmacologic (multi-target) ligands, and identifying new macromolecule targets of compounds (for drug-repurposing or off-target side effect prediction). These studies demonstrate the usefulness of PubChem as a key resource for computer-aided drug discovery and related area.
Collapse
Affiliation(s)
- Sunghwan Kim
- a National Center for Biotechnology Information, National Library of Medicine , National Institutes of Health , Department of Health and Human Services, Bethesda , MD , USA
| |
Collapse
|
31
|
Wang Z, Clark NR, Ma'ayan A. Drug-induced adverse events prediction with the LINCS L1000 data. Bioinformatics 2016; 32:2338-45. [PMID: 27153606 PMCID: PMC4965635 DOI: 10.1093/bioinformatics/btw168] [Citation(s) in RCA: 107] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2015] [Revised: 03/05/2016] [Accepted: 03/23/2016] [Indexed: 01/22/2023] Open
Abstract
MOTIVATION Adverse drug reactions (ADRs) are a central consideration during drug development. Here we present a machine learning classifier to prioritize ADRs for approved drugs and pre-clinical small-molecule compounds by combining chemical structure (CS) and gene expression (GE) features. The GE data is from the Library of Integrated Network-based Cellular Signatures (LINCS) L1000 dataset that measured changes in GE before and after treatment of human cells with over 20 000 small-molecule compounds including most of the FDA-approved drugs. Using various benchmarking methods, we show that the integration of GE data with the CS of the drugs can significantly improve the predictability of ADRs. Moreover, transforming GE features to enrichment vectors of biological terms further improves the predictive capability of the classifiers. The most predictive biological-term features can assist in understanding the drug mechanisms of action. Finally, we applied the classifier to all >20 000 small-molecules profiled, and developed a web portal for browsing and searching predictive small-molecule/ADR connections. AVAILABILITY AND IMPLEMENTATION The interface for the adverse event predictions for the >20 000 LINCS compounds is available at http://maayanlab.net/SEP-L1000/ CONTACT: avi.maayan@mssm.edu SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Zichen Wang
- Department of Pharmacology and Systems Therapeutics, One Gustave L. Levy Place Box 1215, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Neil R Clark
- Department of Pharmacology and Systems Therapeutics, One Gustave L. Levy Place Box 1215, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Avi Ma'ayan
- Department of Pharmacology and Systems Therapeutics, One Gustave L. Levy Place Box 1215, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| |
Collapse
|
32
|
Hao M, Bryant SH, Wang Y. Cheminformatics analysis of the AR agonist and antagonist datasets in PubChem. J Cheminform 2016; 8:37. [PMID: 27398098 PMCID: PMC4938998 DOI: 10.1186/s13321-016-0150-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2016] [Accepted: 07/01/2016] [Indexed: 01/17/2023] Open
Abstract
Background As one of the largest publicly accessible databases for hosting chemical structures and biological activities, PubChem has been processing bioassay submissions from the community since 2004. With the increase in volume for the deposited data in PubChem, the diversity and wealth of information content also grows. Recently, the Tox21 program, has deposited a series of pairwise data in PubChem regarding to different mechanism of actions (MOA), such as androgen receptor (AR) agonist and antagonist datasets, to study cell toxicity. To the best of our knowledge, little work has been reported from cheminformatics study for these especially pairwise datasets, which may provide insight into the mechanism of actions of the compounds and relationship between chemical structures and functions, as well as guidance for lead compound selection and optimization. Thus, to fill the gap, we performed a comprehensive cheminformatics analysis, including scaffold analysis, matched molecular pair (MMP) analysis as well as activity cliff analysis to investigate the structural characteristics and discontinued structure–activity relationship of the individual dataset (i.e., AR agonist dataset or AR antagonist dataset) and the combined dataset (i.e., the common compounds between the AR agonist and antagonist datasets). Results Scaffolds associated only with potential agonists or antagonists were identified. MMP-based activity cliffs, as well as a small group of compounds with dual MOA reported were recognized and analyzed. Moreover, MOA-cliff, a novel concept, was proposed to indicate one pair of structurally similar molecules which exhibit opposite MOA. Conclusions Cheminformatics methods were successfully applied to the pairwise AR datasets and the identified molecular scaffold characteristics, MMPs as well as activity cliffs might provide useful information when designing new lead compounds for the androgen receptor. Electronic supplementary material The online version of this article (doi:10.1186/s13321-016-0150-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Ming Hao
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894 USA
| | - Stephen H Bryant
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894 USA
| | - Yanli Wang
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894 USA
| |
Collapse
|
33
|
Nikolic K, Mavridis L, Djikic T, Vucicevic J, Agbaba D, Yelekci K, Mitchell JBO. Drug Design for CNS Diseases: Polypharmacological Profiling of Compounds Using Cheminformatic, 3D-QSAR and Virtual Screening Methodologies. Front Neurosci 2016; 10:265. [PMID: 27375423 PMCID: PMC4901078 DOI: 10.3389/fnins.2016.00265] [Citation(s) in RCA: 52] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2016] [Accepted: 05/25/2016] [Indexed: 11/13/2022] Open
Abstract
HIGHLIGHTSMany CNS targets are being explored for multi-target drug design New databases and cheminformatic methods enable prediction of primary pharmaceutical target and off-targets of compounds QSAR, virtual screening and docking methods increase the potential of rational drug design
The diverse cerebral mechanisms implicated in Central Nervous System (CNS) diseases together with the heterogeneous and overlapping nature of phenotypes indicated that multitarget strategies may be appropriate for the improved treatment of complex brain diseases. Understanding how the neurotransmitter systems interact is also important in optimizing therapeutic strategies. Pharmacological intervention on one target will often influence another one, such as the well-established serotonin-dopamine interaction or the dopamine-glutamate interaction. It is now accepted that drug action can involve plural targets and that polypharmacological interaction with multiple targets, to address disease in more subtle and effective ways, is a key concept for development of novel drug candidates against complex CNS diseases. A multi-target therapeutic strategy for Alzheimer‘s disease resulted in the development of very effective Multi-Target Designed Ligands (MTDL) that act on both the cholinergic and monoaminergic systems, and also retard the progression of neurodegeneration by inhibiting amyloid aggregation. Many compounds already in databases have been investigated as ligands for multiple targets in drug-discovery programs. A probabilistic method, the Parzen-Rosenblatt Window approach, was used to build a “predictor” model using data collected from the ChEMBL database. The model can be used to predict both the primary pharmaceutical target and off-targets of a compound based on its structure. Several multi-target ligands were selected for further study, as compounds with possible additional beneficial pharmacological activities. Based on all these findings, it is concluded that multipotent ligands targeting AChE/MAO-A/MAO-B and also D1-R/D2-R/5-HT2A-R/H3-R are promising novel drug candidates with improved efficacy and beneficial neuroleptic and procognitive activities in treatment of Alzheimer's and related neurodegenerative diseases. Structural information for drug targets permits docking and virtual screening and exploration of the molecular determinants of binding, hence facilitating the design of multi-targeted drugs. The crystal structures and models of enzymes of the monoaminergic and cholinergic systems have been used to investigate the structural origins of target selectivity and to identify molecular determinants, in order to design MTDLs.
Collapse
Affiliation(s)
- Katarina Nikolic
- Department of Pharmaceutical Chemistry, Faculty of Pharmacy, University of Belgrade Belgrade, Serbia
| | - Lazaros Mavridis
- School of Biological and Chemical Sciences, Queen Mary University of London London, UK
| | - Teodora Djikic
- Department of Bioinformatics and Genetics, Faculty of Engineering and Natural Sciences, Kadir Has University Istanbul, Turkey
| | - Jelica Vucicevic
- Department of Pharmaceutical Chemistry, Faculty of Pharmacy, University of Belgrade Belgrade, Serbia
| | - Danica Agbaba
- Department of Pharmaceutical Chemistry, Faculty of Pharmacy, University of Belgrade Belgrade, Serbia
| | - Kemal Yelekci
- Department of Bioinformatics and Genetics, Faculty of Engineering and Natural Sciences, Kadir Has University Istanbul, Turkey
| | - John B O Mitchell
- EaStCHEM School of Chemistry and Biomedical Sciences Research Complex, University of St Andrews St Andrews, UK
| |
Collapse
|
34
|
Lee H, Kang S, Kim W. Drug Repositioning for Cancer Therapy Based on Large-Scale Drug-Induced Transcriptional Signatures. PLoS One 2016; 11:e0150460. [PMID: 26954019 PMCID: PMC4783079 DOI: 10.1371/journal.pone.0150460] [Citation(s) in RCA: 50] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2015] [Accepted: 02/15/2016] [Indexed: 11/18/2022] Open
Abstract
An in silico chemical genomics approach is developed to predict drug repositioning (DR) candidates for three types of cancer: glioblastoma, lung cancer, and breast cancer. It is based on a recent large-scale dataset of ~20,000 drug-induced expression profiles in multiple cancer cell lines, which provides i) a global impact of transcriptional perturbation of both known targets and unknown off-targets, and ii) rich information on drug's mode-of-action. First, the drug-induced expression profile is shown more effective than other information, such as the drug structure or known target, using multiple HTS datasets as unbiased benchmarks. Particularly, the utility of our method was robustly demonstrated in identifying novel DR candidates. Second, we predicted 14 high-scoring DR candidates solely based on expression signatures. Eight of the fourteen drugs showed significant anti-proliferative activity against glioblastoma; i.e., ivermectin, trifluridine, astemizole, amlodipine, maprotiline, apomorphine, mometasone, and nortriptyline. Our DR score strongly correlated with that of cell-based experimental results; the top seven DR candidates were positive, corresponding to an approximately 20-fold enrichment compared with conventional HTS. Despite diverse original indications and known targets, the perturbed pathways of active DR candidates show five distinct patterns that form tight clusters together with one or more known cancer drugs, suggesting common transcriptome-level mechanisms of anti-proliferative activity.
Collapse
Affiliation(s)
- Haeseung Lee
- Ewha Research Center for Systems Biology, Division of Molecular & Life Sciences, Ewha Womans University, Seoul, Korea
| | - Seungmin Kang
- Ewha Research Center for Systems Biology, Division of Molecular & Life Sciences, Ewha Womans University, Seoul, Korea
| | - Wankyu Kim
- Ewha Research Center for Systems Biology, Division of Molecular & Life Sciences, Ewha Womans University, Seoul, Korea
- * E-mail:
| |
Collapse
|
35
|
Yamanishi Y, Tabei Y, Kotera M. Metabolome-scale de novo pathway reconstruction using regioisomer-sensitive graph alignments. Bioinformatics 2015; 31:i161-70. [PMID: 26072478 PMCID: PMC4765868 DOI: 10.1093/bioinformatics/btv224] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Recent advances in mass spectrometry and related metabolomics technologies have enabled the rapid and comprehensive analysis of numerous metabolites. However, biosynthetic and biodegradation pathways are only known for a small portion of metabolites, with most metabolic pathways remaining uncharacterized. RESULTS In this study, we developed a novel method for supervised de novo metabolic pathway reconstruction with an improved graph alignment-based approach in the reaction-filling framework. We proposed a novel chemical graph alignment algorithm, which we called PACHA (Pairwise Chemical Aligner), to detect the regioisomer-sensitive connectivities between the aligned substructures of two compounds. Unlike other existing graph alignment methods, PACHA can efficiently detect only one common subgraph between two compounds. Our results show that the proposed method outperforms previous descriptor-based methods or existing graph alignment-based methods in the enzymatic reaction-likeness prediction for isomer-enriched reactions. It is also useful for reaction annotation that assigns potential reaction characteristics such as EC (Enzyme Commission) numbers and PIERO (Enzymatic Reaction Ontology for Partial Information) terms to substrate-product pairs. Finally, we conducted a comprehensive enzymatic reaction-likeness prediction for all possible uncharacterized compound pairs, suggesting potential metabolic pathways for newly predicted substrate-product pairs.
Collapse
Affiliation(s)
- Yoshihiro Yamanishi
- Division of System Cohort, Medical Institute of Bioregulation, Kyushu University, Higashi-ku, Fukuoka, Fukuoka 812-8582, Institute for Advanced Study, Kyushu University, Higashi-ku, Fukuoka, Fukuoka 812-8581, PRESTO, Japan Science and Technology Agency, Kawaguchi, Saitama 332-0012 and Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, 2-12-1 Ookayama, Meguro-ku, Tokyo, 152-8550, Japan Division of System Cohort, Medical Institute of Bioregulation, Kyushu University, Higashi-ku, Fukuoka, Fukuoka 812-8582, Institute for Advanced Study, Kyushu University, Higashi-ku, Fukuoka, Fukuoka 812-8581, PRESTO, Japan Science and Technology Agency, Kawaguchi, Saitama 332-0012 and Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, 2-12-1 Ookayama, Meguro-ku, Tokyo, 152-8550, Japan
| | - Yasuo Tabei
- Division of System Cohort, Medical Institute of Bioregulation, Kyushu University, Higashi-ku, Fukuoka, Fukuoka 812-8582, Institute for Advanced Study, Kyushu University, Higashi-ku, Fukuoka, Fukuoka 812-8581, PRESTO, Japan Science and Technology Agency, Kawaguchi, Saitama 332-0012 and Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, 2-12-1 Ookayama, Meguro-ku, Tokyo, 152-8550, Japan
| | - Masaaki Kotera
- Division of System Cohort, Medical Institute of Bioregulation, Kyushu University, Higashi-ku, Fukuoka, Fukuoka 812-8582, Institute for Advanced Study, Kyushu University, Higashi-ku, Fukuoka, Fukuoka 812-8581, PRESTO, Japan Science and Technology Agency, Kawaguchi, Saitama 332-0012 and Graduate School of Bioscience and Biotechnology, Tokyo Institute of Technology, 2-12-1 Ookayama, Meguro-ku, Tokyo, 152-8550, Japan
| |
Collapse
|
36
|
Niu SY, Xin MY, Luo J, Liu MY, Jiang ZR. DSEP: A Tool Implementing Novel Method to Predict Side Effects of Drugs. J Comput Biol 2015; 22:1108-17. [DOI: 10.1089/cmb.2015.0129] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Affiliation(s)
- Shu-Yuan Niu
- Shanghai Key Laboratory of Regulatory Biology, Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai, China
| | - Ming-Yuan Xin
- Shanghai Key Laboratory of Regulatory Biology, Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai, China
| | - Jian Luo
- Shanghai Key Laboratory of Regulatory Biology, Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai, China
| | - Ming-Yao Liu
- Shanghai Key Laboratory of Regulatory Biology, Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai, China
| | - Zhen-Ran Jiang
- Shanghai Key Laboratory of Multidimensional Information Processing, Department of Computer Science and Technology, East China Normal University, Shanghai, China
| |
Collapse
|
37
|
Chen FS, Jiang ZR. Prediction of drug’s Anatomical Therapeutic Chemical (ATC) code by integrating drug–domain network. J Biomed Inform 2015; 58:80-88. [DOI: 10.1016/j.jbi.2015.09.016] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2015] [Revised: 09/14/2015] [Accepted: 09/22/2015] [Indexed: 10/22/2022]
|
38
|
Pérez-Nueno VI, Souchet M, Karaboga AS, Ritchie DW. GESSE: Predicting Drug Side Effects from Drug–Target Relationships. J Chem Inf Model 2015; 55:1804-23. [DOI: 10.1021/acs.jcim.5b00120] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Violeta I. Pérez-Nueno
- Harmonic Pharma, Espace Transfert, 615 rue du Jardin Botanique, 54600 Villers-les-Nancy, France
| | - Michel Souchet
- Harmonic Pharma, Espace Transfert, 615 rue du Jardin Botanique, 54600 Villers-les-Nancy, France
| | - Arnaud S. Karaboga
- Harmonic Pharma, Espace Transfert, 615 rue du Jardin Botanique, 54600 Villers-les-Nancy, France
| | - David W. Ritchie
- INRIA Nancy − Grand Est, Equipe Capsid, 615 rue du Jardin Botanique, 54600 Villers-les-Nancy, France
| |
Collapse
|
39
|
Naveed H, Hameed US, Harrus D, Bourguet W, Arold ST, Gao X. An integrated structure- and system-based framework to identify new targets of metabolites and known drugs. Bioinformatics 2015; 31:3922-9. [PMID: 26286808 PMCID: PMC4673972 DOI: 10.1093/bioinformatics/btv477] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2015] [Accepted: 08/08/2015] [Indexed: 02/07/2023] Open
Abstract
Motivation: The inherent promiscuity of small molecules towards protein targets impedes our understanding of healthy versus diseased metabolism. This promiscuity also poses a challenge for the pharmaceutical industry as identifying all protein targets is important to assess (side) effects and repositioning opportunities for a drug. Results: Here, we present a novel integrated structure- and system-based approach of drug-target prediction (iDTP) to enable the large-scale discovery of new targets for small molecules, such as pharmaceutical drugs, co-factors and metabolites (collectively called ‘drugs’). For a given drug, our method uses sequence order–independent structure alignment, hierarchical clustering and probabilistic sequence similarity to construct a probabilistic pocket ensemble (PPE) that captures promiscuous structural features of different binding sites on known targets. A drug’s PPE is combined with an approximation of its delivery profile to reduce false positives. In our cross-validation study, we use iDTP to predict the known targets of 11 drugs, with 63% sensitivity and 81% specificity. We then predicted novel targets for these drugs—two that are of high pharmacological interest, the peroxisome proliferator-activated receptor gamma and the oncogene B-cell lymphoma 2, were successfully validated through in vitro binding experiments. Our method is broadly applicable for the prediction of protein-small molecule interactions with several novel applications to biological research and drug development. Availability and implementation: The program, datasets and results are freely available to academic users at http://sfb.kaust.edu.sa/Pages/Software.aspx. Contact:xin.gao@kaust.edu.sa and stefan.arold@kaust.edu.sa Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Hammad Naveed
- Computer, Electrical and Mathematical Sciences and Engineering Division, Computational Bioscience Research Center
| | - Umar S Hameed
- Biological and Environmental Sciences and Engineering Division, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Deborah Harrus
- Inserm U1054, Centre de Biochimie Structurale and CNRS UMR5048, Universités Montpellier 1 & 2, Montpellier, France
| | - William Bourguet
- Inserm U1054, Centre de Biochimie Structurale and CNRS UMR5048, Universités Montpellier 1 & 2, Montpellier, France
| | - Stefan T Arold
- Computational Bioscience Research Center, Biological and Environmental Sciences and Engineering Division, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Xin Gao
- Computer, Electrical and Mathematical Sciences and Engineering Division, Computational Bioscience Research Center
| |
Collapse
|
40
|
In silico assessment of adverse drug reactions and associated mechanisms. Drug Discov Today 2015; 21:58-71. [PMID: 26272036 DOI: 10.1016/j.drudis.2015.07.018] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2015] [Revised: 07/15/2015] [Accepted: 07/31/2015] [Indexed: 12/31/2022]
Abstract
During recent years, various in silico approaches have been developed to estimate chemical and biological drug features, for example chemical fragments, protein targets, pathways, among others, that correlate with adverse drug reactions (ADRs) and explain the associated mechanisms. These features have also been used for the creation of predictive models that enable estimation of ADRs during the early stages of drug development. In this review, we discuss various in silico approaches to predict these features for a certain drug, estimate correlations with ADRs, establish causal relationships between selected features and ADR mechanisms and create corresponding predictive models.
Collapse
|
41
|
Iwata H, Sawada R, Mizutani S, Yamanishi Y. Systematic drug repositioning for a wide range of diseases with integrative analyses of phenotypic and molecular data. J Chem Inf Model 2015; 55:446-59. [PMID: 25602292 DOI: 10.1021/ci500670q] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Drug repositioning, or the application of known drugs to new indications, is a challenging issue in pharmaceutical science. In this study, we developed a new computational method to predict unknown drug indications for systematic drug repositioning in a framework of supervised network inference. We defined a descriptor for each drug-disease pair based on the phenotypic features of drugs (e.g., medicinal effects and side effects) and various molecular features of diseases (e.g., disease-causing genes, diagnostic markers, disease-related pathways, and environmental factors) and constructed a statistical model to predict new drug-disease associations for a wide range of diseases in the International Classification of Diseases. Our results show that the proposed method outperforms previous methods in terms of accuracy and applicability, and its performance does not depend on drug chemical structure similarity. Finally, we performed a comprehensive prediction of a drug-disease association network consisting of 2349 drugs and 858 diseases and described biologically meaningful examples of newly predicted drug indications for several types of cancers and nonhereditary diseases.
Collapse
Affiliation(s)
- Hiroaki Iwata
- Division of System Cohort, Multi-Scale Research Center for Medical Science, Medical Institute of Bioregulation, Kyushu University , 3-1-1 Maidashi, Higashi-ku, Fukuoka, Fukuoka 812-8582, Japan
| | | | | | | |
Collapse
|
42
|
Wang C, Liu J, Luo F, Deng Z, Hu QN. Predicting target-ligand interactions using protein ligand-binding site and ligand substructures. BMC SYSTEMS BIOLOGY 2015; 9 Suppl 1:S2. [PMID: 25707321 PMCID: PMC4331677 DOI: 10.1186/1752-0509-9-s1-s2] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Abstract
Background Cell proliferation, differentiation, Gene expression, metabolism, immunization and signal transduction require the participation of ligands and targets. It is a great challenge to identify rules governing molecular recognition between chemical topological substructures of ligands and the binding sites of the targets. Methods We suppose that the ligand-target interactions are determined by ligand substructures as well as the physical-chemical properties of the binding sites. Therefore, we propose a fragment interaction model (FIM) to describe the interactions between ligands and targets, with the purpose of facilitating the chemical interpretation of ligand-target binding. First we extract target-ligand complexes from sc-PDB database, based on which, we get the target binding sites and the ligands. Then we represent each binding site as a fragment vector based on a target fragment dictionary that is composed of 199 clusters (denoted as fragements in this work) obtained by clustering 4200 trimers according to their physical-chemical properties. And then, we represent each ligand as a substructure vector based on a dictionary containing 747 substructures. Finally, we build the FIM by generating the interaction matrix M (representing the fragment interaction network), and the FIM can later be used for predicting unknown ligand-target interactions as well as providing the binding details of the interactions. Results The five-fold cross validation results show that the proposed model can get higher AUC score (92%) than three prevalence algorithms CS-PD (80%), BLM-NII (85%) and RF (85%), demonstrating the remarkable predictive ability of FIM. We also show that the ligand binding sites (local information) overweight the sequence similarities (global information) in ligand-target binding, and introducing too much global information would be harmful to the predictive ability. Moreover, The derived fragment interaction network can provide the chemical insights on the interactions. Conclusions The target and ligand bindings are local events, and the local information dominate the binding ability. Though integrating of the global information can promote the predictive ability, the role is very limited. The fragment interaction network is helpful for understanding the mechanism of the ligand-target interaction.
Collapse
|
43
|
Liu X, Campillos M. Unveiling new biological relationships using shared hits of chemical screening assay pairs. ACTA ACUST UNITED AC 2015; 30:i579-86. [PMID: 25161250 PMCID: PMC4147921 DOI: 10.1093/bioinformatics/btu468] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Motivation: Although the integration and analysis of the activity of small molecules across multiple chemical screens is a common approach to determine the specificity and toxicity of hits, the suitability of these approaches to reveal novel biological information is less explored. Here, we test the hypothesis that assays sharing selective hits are biologically related. Results: We annotated the biological activities (i.e. biological processes or molecular activities) measured in assays and constructed chemical hit profiles with sets of compounds differing on their selectivity level for 1640 assays of ChemBank repository. We compared the similarity of chemical hit profiles of pairs of assays with their biological relationships and observed that assay pairs sharing non-promiscuous chemical hits tend to be biologically related. A detailed analysis of a network containing assay pairs with the highest hit similarity confirmed biological meaningful relationships. Furthermore, the biological roles of predicted molecular targets of the shared hits reinforced the biological associations between assay pairs. Contact:monica.campillos@helmholtz-muenchen.de Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Xueping Liu
- Institute of Bioinformatics and Systems Biology and German Center for Diabetes Research, Helmholtz Center Munich, 85764 Neuherberg, Germany Institute of Bioinformatics and Systems Biology and German Center for Diabetes Research, Helmholtz Center Munich, 85764 Neuherberg, Germany
| | - Monica Campillos
- Institute of Bioinformatics and Systems Biology and German Center for Diabetes Research, Helmholtz Center Munich, 85764 Neuherberg, Germany Institute of Bioinformatics and Systems Biology and German Center for Diabetes Research, Helmholtz Center Munich, 85764 Neuherberg, Germany
| |
Collapse
|
44
|
Sawada R, Kotera M, Yamanishi Y. Benchmarking a Wide Range of Chemical Descriptors for Drug-Target Interaction Prediction Using a Chemogenomic Approach. Mol Inform 2014; 33:719-31. [DOI: 10.1002/minf.201400066] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2014] [Accepted: 07/23/2014] [Indexed: 01/28/2023]
|
45
|
|
46
|
Cheng T, Pan Y, Hao M, Wang Y, Bryant SH. PubChem applications in drug discovery: a bibliometric analysis. Drug Discov Today 2014; 19:1751-1756. [PMID: 25168772 DOI: 10.1016/j.drudis.2014.08.008] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2014] [Revised: 07/17/2014] [Accepted: 08/18/2014] [Indexed: 12/18/2022]
Abstract
A bibliometric analysis of PubChem applications is presented by reviewing 1132 research articles. The massive volume of chemical structure and bioactivity data in PubChem and its online services have been used globally in various fields including chemical biology, medicinal chemistry and informatics research. PubChem supports drug discovery in many aspects such as lead identification and optimization, compound-target profiling, polypharmacology studies and unknown chemical identity elucidation. PubChem has also become a valuable resource for developing secondary databases, informatics tools and web services. The growing PubChem resource with its public availability offers support and great opportunities for the interrogation of pharmacological mechanisms and the genetic basis of diseases, which are vital for drug innovation and repurposing.
Collapse
Affiliation(s)
- Tiejun Cheng
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Yongmei Pan
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Ming Hao
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Yanli Wang
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, USA.
| | - Stephen H Bryant
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, USA.
| |
Collapse
|
47
|
Mousavian Z, Masoudi-Nejad A. Drug-target interaction prediction via chemogenomic space: learning-based methods. Expert Opin Drug Metab Toxicol 2014; 10:1273-87. [PMID: 25112457 DOI: 10.1517/17425255.2014.950222] [Citation(s) in RCA: 55] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
INTRODUCTION Identification of the interaction between drugs and target proteins is a crucial task in genomic drug discovery. The in silico prediction is an appropriate alternative for the laborious and costly experimental process of drug-target interaction prediction. Developing a variety of computational methods opens a new direction in analyzing and detecting new drug-target pairs. AREAS COVERED In this review, we will focus on chemogenomic methods which have established a learning framework for predicting drug-target interactions. Learning-based methods are classified into supervised and semi-supervised, and the supervised learning methods are studied as two separate parts including similarity-based methods and feature-based methods. EXPERT OPINION In spite of many improvements for pharmacology applications by learning-based methods, there are many over simplification settings in construction of predictive models that may lead to over-optimistic results on drug-target interaction prediction.
Collapse
Affiliation(s)
- Zaynab Mousavian
- University of Tehran, Institute of Biochemistry and Biophysics, Laboratory of Systems Biology and Bioinformatics (LBB) , Tehran , Iran +98 21 6695 9256 ; +98 21 6640 4680 ;
| | | |
Collapse
|
48
|
Shang N, Xu H, Rindflesch TC, Cohen T. Identifying plausible adverse drug reactions using knowledge extracted from the literature. J Biomed Inform 2014; 52:293-310. [PMID: 25046831 DOI: 10.1016/j.jbi.2014.07.011] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2014] [Revised: 06/06/2014] [Accepted: 07/10/2014] [Indexed: 01/08/2023]
Abstract
Pharmacovigilance involves continually monitoring drug safety after drugs are put to market. To aid this process; algorithms for the identification of strongly correlated drug/adverse drug reaction (ADR) pairs from data sources such as adverse event reporting systems or Electronic Health Records have been developed. These methods are generally statistical in nature, and do not draw upon the large volumes of knowledge embedded in the biomedical literature. In this paper, we investigate the ability of scalable Literature Based Discovery (LBD) methods to identify side effects of pharmaceutical agents. The advantage of LBD methods is that they can provide evidence from the literature to support the plausibility of a drug/ADR association, thereby assisting human review to validate the signal, which is an essential component of pharmacovigilance. To do so, we draw upon vast repositories of knowledge that has been extracted from the biomedical literature by two Natural Language Processing tools, MetaMap and SemRep. We evaluate two LBD methods that scale comfortably to the volume of knowledge available in these repositories. Specifically, we evaluate Reflective Random Indexing (RRI), a model based on concept-level co-occurrence, and Predication-based Semantic Indexing (PSI), a model that encodes the nature of the relationship between concepts to support reasoning analogically about drug-effect relationships. An evaluation set was constructed from the Side Effect Resource 2 (SIDER2), which contains known drug/ADR relations, and models were evaluated for their ability to "rediscover" these relations. In this paper, we demonstrate that both RRI and PSI can recover known drug-adverse event associations. However, PSI performed better overall, and has the additional advantage of being able to recover the literature underlying the reasoning pathways it used to make its predictions.
Collapse
Affiliation(s)
- Ning Shang
- School of Biomedical Informatics, The University of Texas Health Science Center, Houston, TX, United States.
| | - Hua Xu
- School of Biomedical Informatics, The University of Texas Health Science Center, Houston, TX, United States
| | | | - Trevor Cohen
- School of Biomedical Informatics, The University of Texas Health Science Center, Houston, TX, United States
| |
Collapse
|
49
|
Alderson RG, De Ferrari L, Mavridis L, McDonagh JL, Mitchell JBO, Nath N. Enzyme informatics. Curr Top Med Chem 2014; 12:1911-23. [PMID: 23116471 DOI: 10.2174/156802612804547353] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2012] [Revised: 09/12/2012] [Accepted: 09/15/2012] [Indexed: 12/18/2022]
Abstract
Over the last 50 years, sequencing, structural biology and bioinformatics have completely revolutionised biomolecular science, with millions of sequences and tens of thousands of three dimensional structures becoming available. The bioinformatics of enzymes is well served by, mostly free, online databases. BRENDA describes the chemistry, substrate specificity, kinetics, preparation and biological sources of enzymes, while KEGG is valuable for understanding enzymes and metabolic pathways. EzCatDB, SFLD and MACiE are key repositories for data on the chemical mechanisms by which enzymes operate. At the current rate of genome sequencing and manual annotation, human curation will never finish the functional annotation of the ever-expanding list of known enzymes. Hence there is an increasing need for automated annotation, though it is not yet widespread for enzyme data. In contrast, functional ontologies such as the Gene Ontology already profit from automation. Despite our growing understanding of enzyme structure and dynamics, we are only beginning to be able to design novel enzymes. One can now begin to trace the functional evolution of enzymes using phylogenetics. The ability of enzymes to perform secondary functions, albeit relatively inefficiently, gives clues as to how enzyme function evolves. Substrate promiscuity in enzymes is one example of imperfect specificity in protein-ligand interactions. Similarly, most drugs bind to more than one protein target. This may sometimes result in helpful polypharmacology as a drug modulates plural targets, but also often leads to adverse side-effects. Many chemoinformatics approaches can be used to model the interactions between druglike molecules and proteins in silico. We can even use quantum chemical techniques like DFT and QM/MM to compute the structural and energetic course of enzyme catalysed chemical reaction mechanisms, including a full description of bond making and breaking.
Collapse
Affiliation(s)
- Rosanna G Alderson
- Biomedical Sciences Research Complex and EaStCHEM School of Chemistry, Purdie Building, University of St Andrews, North Haugh, St Andrews, Scotland, UK
| | | | | | | | | | | |
Collapse
|
50
|
Swamidass SJ, Schillebeeckx CN, Matlock M, Hurle MR, Agarwal P. Combined Analysis of Phenotypic and Target-Based Screening in Assay Networks. ACTA ACUST UNITED AC 2014; 19:782-90. [DOI: 10.1177/1087057114523068] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2013] [Accepted: 01/17/2014] [Indexed: 01/09/2023]
Abstract
Small-molecule screens are an integral part of drug discovery. Public domain data in PubChem alone represent more than 158 million measurements, 1.2 million molecules, and 4300 assays. We conducted a global analysis of these data, building a network of assays and connecting the assays if they shared nonpromiscuous active molecules. This network spans both phenotypic and target-based screens, recapitulates known biology, and identifies new polypharmacology. Phenotypic screens are extremely important for drug discovery, contributing to the discovery of a large proportion of new drugs. Connections between phenotypic and biochemical, target-based screens can suggest strategies for repurposing both small-molecule and biologic drugs. For example, a screen for molecules that prevent cell death from a mutated version of superoxide-dismutase is linked with ALOX15. This connection suggests a therapeutic role for ALOX15 inhibitors in amyotrophic lateral sclerosis. An interactive version of the network is available online ( http://swami.wustl.edu/flow/assay_network.html ).
Collapse
Affiliation(s)
- S. Joshua Swamidass
- Division of Laboratory and Genomic Medicine, Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, MO, USA
| | - Constantino N. Schillebeeckx
- Division of Laboratory and Genomic Medicine, Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, MO, USA
| | - Matthew Matlock
- Division of Laboratory and Genomic Medicine, Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, MO, USA
| | - Mark R. Hurle
- Computational Biology, GlaxoSmithKline R&D, King of Prussia, PA, USA
| | - Pankaj Agarwal
- Computational Biology, GlaxoSmithKline R&D, King of Prussia, PA, USA
| |
Collapse
|