1
|
Carnivali GS, Borges CC. Method to link medicines to diseases using multiplex networks. Comput Methods Biomech Biomed Engin 2024:1-14. [PMID: 38907637 DOI: 10.1080/10255842.2024.2362860] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2024] [Accepted: 05/28/2024] [Indexed: 06/24/2024]
Abstract
The reuse of well-established medicines using computational modeling has gained a lot of attention due to its tremendous benefits. Based on this perspective, a new method for linking known medicines to diseases is proposed. The creation of a new treatment or medicine can be financially and temporally costly and the reuse of medicines is one possibility to accelerate this process efficiently. The main purpose of the reuse of medicines is to reduce some stages of the development of new medicines, motivating the proposition of several methods nowadays. In this work, a new method is developed aiming to connect known medicines to diseases based on available networks of protein interactions and available lists of medicines that affect protein action. The concepts of multiplex networks are used to connect subgraphs of vertices that represent medicines and proteins. The core of the procedure is determined by a weighting strategy constructed to define precisely the more relevant connections. The method was compared to other network link methods in the literature and a case study was presented and evaluated by the proposed method.
Collapse
|
2
|
Xu M, Li W, He J, Wang Y, Lv J, He W, Chen L, Zhi H. DDCM: A Computational Strategy for Drug Repositioning Based on Support-Vector Regression Algorithm. Int J Mol Sci 2024; 25:5267. [PMID: 38791306 PMCID: PMC11121335 DOI: 10.3390/ijms25105267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Revised: 04/25/2024] [Accepted: 05/06/2024] [Indexed: 05/26/2024] Open
Abstract
Computational drug-repositioning technology is an effective tool for speeding up drug development. As biological data resources continue to grow, it becomes more important to find effective methods to identify potential therapeutic drugs for diseases. The effective use of valuable data has become a more rational and efficient approach to drug repositioning. The disease-drug correlation method (DDCM) proposed in this study is a novel approach that integrates data from multiple sources and different levels to predict potential treatments for diseases, utilizing support-vector regression (SVR). The DDCM approach resulted in potential therapeutic drugs for neoplasms and cardiovascular diseases by constructing a correlation hybrid matrix containing the respective similarities of drugs and diseases, implementing the SVR algorithm to predict the correlation scores, and undergoing a randomized perturbation and stepwise screening pipeline. Some potential therapeutic drugs were predicted by this approach. The potential therapeutic ability of these drugs has been well-validated in terms of the literature, function, drug target, and survival-essential genes. The method's feasibility was confirmed by comparing the predicted results with the classical method and conducting a co-drug analysis of the sub-branch. Our method challenges the conventional approach to studying disease-drug correlations and presents a fresh perspective for understanding the pathogenesis of diseases.
Collapse
Affiliation(s)
- Manyi Xu
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150000, China; (M.X.); (W.L.); (J.H.); (Y.W.); (J.L.)
| | - Wan Li
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150000, China; (M.X.); (W.L.); (J.H.); (Y.W.); (J.L.)
| | - Jiaheng He
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150000, China; (M.X.); (W.L.); (J.H.); (Y.W.); (J.L.)
| | - Yahui Wang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150000, China; (M.X.); (W.L.); (J.H.); (Y.W.); (J.L.)
| | - Junjie Lv
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150000, China; (M.X.); (W.L.); (J.H.); (Y.W.); (J.L.)
| | - Weiming He
- Institute of Opto-Electronics, Harbin Institute of Technology, Harbin 150000, China;
| | - Lina Chen
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150000, China; (M.X.); (W.L.); (J.H.); (Y.W.); (J.L.)
| | - Hui Zhi
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150000, China; (M.X.); (W.L.); (J.H.); (Y.W.); (J.L.)
| |
Collapse
|
3
|
Chen H, King FJ, Zhou B, Wang Y, Canedy CJ, Hayashi J, Zhong Y, Chang MW, Pache L, Wong JL, Jia Y, Joslin J, Jiang T, Benner C, Chanda SK, Zhou Y. Drug target prediction through deep learning functional representation of gene signatures. Nat Commun 2024; 15:1853. [PMID: 38424040 PMCID: PMC10904399 DOI: 10.1038/s41467-024-46089-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Accepted: 02/14/2024] [Indexed: 03/02/2024] Open
Abstract
Many machine learning applications in bioinformatics currently rely on matching gene identities when analyzing input gene signatures and fail to take advantage of preexisting knowledge about gene functions. To further enable comparative analysis of OMICS datasets, including target deconvolution and mechanism of action studies, we develop an approach that represents gene signatures projected onto their biological functions, instead of their identities, similar to how the word2vec technique works in natural language processing. We develop the Functional Representation of Gene Signatures (FRoGS) approach by training a deep learning model and demonstrate that its application to the Broad Institute's L1000 datasets results in more effective compound-target predictions than models based on gene identities alone. By integrating additional pharmacological activity data sources, FRoGS significantly increases the number of high-quality compound-target predictions relative to existing approaches, many of which are supported by in silico and/or experimental evidence. These results underscore the general utility of FRoGS in machine learning-based bioinformatics applications. Prediction networks pre-equipped with the knowledge of gene functions may help uncover new relationships among gene signatures acquired by large-scale OMICs studies on compounds, cell types, disease models, and patient cohorts.
Collapse
Affiliation(s)
- Hao Chen
- Novartis Biomedical Research, 10675 John Jay Hopkins Drive, San Diego, CA, 92121, USA.
- Department of Computer Science and Engineering, University of California, Riverside, 900 University Avenue, Riverside, CA, 92521, USA.
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, 15213, USA.
| | - Frederick J King
- Novartis Biomedical Research, 10675 John Jay Hopkins Drive, San Diego, CA, 92121, USA
| | - Bin Zhou
- Novartis Biomedical Research, 10675 John Jay Hopkins Drive, San Diego, CA, 92121, USA
| | - Yu Wang
- Novartis Biomedical Research, 10675 John Jay Hopkins Drive, San Diego, CA, 92121, USA
| | - Carter J Canedy
- Novartis Biomedical Research, 10675 John Jay Hopkins Drive, San Diego, CA, 92121, USA
| | - Joel Hayashi
- Novartis Biomedical Research, 10675 John Jay Hopkins Drive, San Diego, CA, 92121, USA
| | - Yang Zhong
- Novartis Biomedical Research, 10675 John Jay Hopkins Drive, San Diego, CA, 92121, USA
| | - Max W Chang
- Department of Medicine, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA
| | - Lars Pache
- NCI Designated Cancer Center, Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA, 92037, USA
| | - Julian L Wong
- Novartis Biomedical Research, 10675 John Jay Hopkins Drive, San Diego, CA, 92121, USA
| | - Yong Jia
- Novartis Biomedical Research, 10675 John Jay Hopkins Drive, San Diego, CA, 92121, USA
| | - John Joslin
- Novartis Biomedical Research, 10675 John Jay Hopkins Drive, San Diego, CA, 92121, USA
| | - Tao Jiang
- Department of Computer Science and Engineering, University of California, Riverside, 900 University Avenue, Riverside, CA, 92521, USA
| | - Christopher Benner
- Department of Medicine, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA
| | - Sumit K Chanda
- Department of Immunology and Microbiology, Scripps Research, La Jolla, CA, 92037, USA
| | - Yingyao Zhou
- Novartis Biomedical Research, 10675 John Jay Hopkins Drive, San Diego, CA, 92121, USA.
| |
Collapse
|
4
|
Xu M, Abdullah NA, Md Sabri AQ. A method to improve the prediction performance of cancer-gene association by screening negative training samples through gene network data. Comput Biol Chem 2024; 108:107997. [PMID: 38154318 DOI: 10.1016/j.compbiolchem.2023.107997] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2023] [Revised: 11/03/2023] [Accepted: 12/03/2023] [Indexed: 12/30/2023]
Abstract
This work focuses on data sampling in cancer-gene association prediction. Currently, researchers are using machine learning methods to predict genes that are more likely to produce cancer-causing mutations. To improve the performance of machine learning models, methods have been proposed, one of which is to improve the quality of the training data. Existing methods focus mainly on positive data, i.e. cancer driver genes, for screening selection. This paper proposes a low-cancer-related gene screening method based on gene network and graph theory algorithms to improve the negative samples selection. Genetic data with low cancer correlation is used as negative training samples. After experimental verification, using the negative samples screened by this method to train the cancer gene classification model can improve prediction performance. The biggest advantage of this method is that it can be easily combined with other methods that focus on enhancing the quality of positive training samples. It has been demonstrated that significant improvement is achieved by combining this method with three state-of-the-arts cancer gene prediction methods.
Collapse
Affiliation(s)
- Mingzhe Xu
- Faculty of Computer Science & Information Technology, Universiti Malaya, Kuala Lumpur, 50603 Malaysia; School of Energy and Intelligence Engineering, Henan University of Animal Husbandry and Economy, #6 North Longzihu Rd, Zhengzhou 450000, China.
| | - Nor Aniza Abdullah
- Faculty of Computer Science & Information Technology, Universiti Malaya, Kuala Lumpur, 50603 Malaysia.
| | - Aznul Qalid Md Sabri
- Faculty of Computer Science & Information Technology, Universiti Malaya, Kuala Lumpur, 50603 Malaysia.
| |
Collapse
|
5
|
Hu L, Zhang M, Hu P, Zhang J, Niu C, Lu X, Jiang X, Ma Y. Dual-channel hypergraph convolutional network for predicting herb-disease associations. Brief Bioinform 2024; 25:bbae067. [PMID: 38426326 PMCID: PMC10939431 DOI: 10.1093/bib/bbae067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2023] [Revised: 01/26/2024] [Accepted: 02/05/2024] [Indexed: 03/02/2024] Open
Abstract
Herbs applicability in disease treatment has been verified through experiences over thousands of years. The understanding of herb-disease associations (HDAs) is yet far from complete due to the complicated mechanism inherent in multi-target and multi-component (MTMC) botanical therapeutics. Most of the existing prediction models fail to incorporate the MTMC mechanism. To overcome this problem, we propose a novel dual-channel hypergraph convolutional network, namely HGHDA, for HDA prediction. Technically, HGHDA first adopts an autoencoder to project components and target protein onto a low-dimensional latent space so as to obtain their embeddings by preserving similarity characteristics in their original feature spaces. To model the high-order relations between herbs and their components, we design a channel in HGHDA to encode a hypergraph that describes the high-order patterns of herb-component relations via hypergraph convolution. The other channel in HGHDA is also established in the same way to model the high-order relations between diseases and target proteins. The embeddings of drugs and diseases are then aggregated through our dual-channel network to obtain the prediction results with a scoring function. To evaluate the performance of HGHDA, a series of extensive experiments have been conducted on two benchmark datasets, and the results demonstrate the superiority of HGHDA over the state-of-the-art algorithms proposed for HDA prediction. Besides, our case study on Chuan Xiong and Astragalus membranaceus is a strong indicator to verify the effectiveness of HGHDA, as seven and eight out of the top 10 diseases predicted by HGHDA for Chuan-Xiong and Astragalus-membranaceus, respectively, have been reported in literature.
Collapse
Affiliation(s)
- Lun Hu
- The Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi China
- University of Chinese Academy of Sciences, Beijing, China
- Xinjiang Laboratory of Minority Speech and Language Information Processing, Urumqi, China
| | - Menglong Zhang
- The Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi China
- University of Chinese Academy of Sciences, Beijing, China
- Xinjiang Laboratory of Minority Speech and Language Information Processing, Urumqi, China
| | - Pengwei Hu
- The Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi China
- University of Chinese Academy of Sciences, Beijing, China
- Xinjiang Laboratory of Minority Speech and Language Information Processing, Urumqi, China
| | - Jun Zhang
- The Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi China
- University of Chinese Academy of Sciences, Beijing, China
- Xinjiang Laboratory of Minority Speech and Language Information Processing, Urumqi, China
| | - Chao Niu
- University of Chinese Academy of Sciences, Beijing, China
- State Key Laboratory Basis of Xinjiang Indigenous Medicinal Plants Resource Utilization, Key Laboratory of Chemistry of Plant Resources in Arid Regions, Xinjiang Technical Institute of Physicsand Chemistry,Chinese Academy of Sciences Urumqi, China
| | - Xueying Lu
- University of Chinese Academy of Sciences, Beijing, China
- State Key Laboratory Basis of Xinjiang Indigenous Medicinal Plants Resource Utilization, Key Laboratory of Chemistry of Plant Resources in Arid Regions, Xinjiang Technical Institute of Physicsand Chemistry,Chinese Academy of Sciences Urumqi, China
| | - Xiangrui Jiang
- State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica,Chinese Academy of Sciences Shanghai, China
| | - Yupeng Ma
- The Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi China
- University of Chinese Academy of Sciences, Beijing, China
- Xinjiang Laboratory of Minority Speech and Language Information Processing, Urumqi, China
| |
Collapse
|
6
|
Yu Z, Wu Z, Wang Z, Wang Y, Zhou M, Li W, Liu G, Tang Y. Network-Based Methods and Their Applications in Drug Discovery. J Chem Inf Model 2024; 64:57-75. [PMID: 38150548 DOI: 10.1021/acs.jcim.3c01613] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2023]
Abstract
Drug discovery is time-consuming, expensive, and predominantly follows the "one drug → one target → one disease" paradigm. With the rapid development of systems biology and network pharmacology, a novel drug discovery paradigm, "multidrug → multitarget → multidisease", has emerged. This new holistic paradigm of drug discovery aligns well with the essence of networks, leading to the emergence of network-based methods in the field of drug discovery. In this Perspective, we initially introduce the concept and data sources of networks and highlight classical methodologies employed in network-based methods. Subsequently, we focus on the practical applications of network-based methods across various areas of drug discovery, such as target prediction, virtual screening, prediction of drug therapeutic effects or adverse drug events, and elucidation of molecular mechanisms. In addition, we provide representative web servers for researchers to use network-based methods in specific applications. Finally, we discuss several challenges of network-based methods and the directions for future development. In a word, network-based methods could serve as powerful tools to accelerate drug discovery.
Collapse
Affiliation(s)
- Zhuohang Yu
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | - Zengrui Wu
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | - Ze Wang
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | - Yimeng Wang
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | - Moran Zhou
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | - Weihua Li
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | - Guixia Liu
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | - Yun Tang
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| |
Collapse
|
7
|
Nath A, Chaube R. Mining Chemogenomic Spaces for Prediction of Drug-Target Interactions. Methods Mol Biol 2024; 2714:155-169. [PMID: 37676598 DOI: 10.1007/978-1-0716-3441-7_9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/08/2023]
Abstract
The pipeline of drug discovery consists of a number of processes; drug-target interaction determination is one of the salient steps among them. Computational prediction of drug-target interactions can facilitate in reducing the search space of experimental wet lab-based verifications steps, thus considerably reducing time and other resources dedicated to the drug discovery pipeline. While machine learning-based methods are more widespread for drug-target interaction prediction, network-centric methods are also evolving. In this chapter, we focus on the process of the drug-target interaction prediction from the perspective of using machine learning algorithms and the various stages involved for developing an accurate predictor.
Collapse
Affiliation(s)
- Abhigyan Nath
- Department of Biochemistry, Pt. Jawahar Lal Nehru Memorial Medical College, Raipur, India
| | - Radha Chaube
- Department of Zoology, Institute of Science, Banaras Hindu University, Varanasi, India
| |
Collapse
|
8
|
Ke D, Pu J. HEM: An Improved Parametric Link Prediction Algorithm Based on Hybrid Network Evolution Mechanism. ENTROPY (BASEL, SWITZERLAND) 2023; 25:1416. [PMID: 37895537 PMCID: PMC10606480 DOI: 10.3390/e25101416] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Revised: 09/25/2023] [Accepted: 09/27/2023] [Indexed: 10/29/2023]
Abstract
Link prediction plays an important role in the research of complex networks. Its task is to predict missing links or possible new links in the future via existing information in the network. In recent years, many powerful link prediction algorithms have emerged, which have good results in prediction accuracy and interpretability. However, the existing research still cannot clearly point out the relationship between the characteristics of the network and the mechanism of link generation, and the predictability of complex networks with different features remains to be further analyzed. In view of this, this article proposes the corresponding link prediction indexes Reg, DFPA and LW on a regular network, scale-free network and small-world network, respectively, and studies their prediction properties on these three network models. At the same time, we propose a parametric hybrid index HEM and compare the prediction accuracies of HEM and many similarity-based indexes on real-world networks. The experimental results show that HEM performs better than other Birnbaum-Saunders. In addition, we study the factors that play a major role in the prediction of HEM and analyze their relationship with the characteristics of real-world networks. The results show that the predictive properties of factors are closely related to the features of networks.
Collapse
Affiliation(s)
- Dejing Ke
- Department of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China
| | - Jiansu Pu
- Big Data Visual Analysis Lab, University of Electronic Science and Technology of China, Chengdu 611731, China
| |
Collapse
|
9
|
Yue R, Dutta A. Computational systems biology in disease modeling and control, review and perspectives. NPJ Syst Biol Appl 2022; 8:37. [PMID: 36192551 PMCID: PMC9528884 DOI: 10.1038/s41540-022-00247-4] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Accepted: 09/05/2022] [Indexed: 02/02/2023] Open
Abstract
Omics-based approaches have become increasingly influential in identifying disease mechanisms and drug responses. Considering that diseases and drug responses are co-expressed and regulated in the relevant omics data interactions, the traditional way of grabbing omics data from single isolated layers cannot always obtain valuable inference. Also, drugs have adverse effects that may impair patients, and launching new medicines for diseases is costly. To resolve the above difficulties, systems biology is applied to predict potential molecular interactions by integrating omics data from genomic, proteomic, transcriptional, and metabolic layers. Combined with known drug reactions, the resulting models improve medicines' therapeutical performance by re-purposing the existing drugs and combining drug molecules without off-target effects. Based on the identified computational models, drug administration control laws are designed to balance toxicity and efficacy. This review introduces biomedical applications and analyses of interactions among gene, protein and drug molecules for modeling disease mechanisms and drug responses. The therapeutical performance can be improved by combining the predictive and computational models with drug administration designed by control laws. The challenges are also discussed for its clinical uses in this work.
Collapse
Affiliation(s)
- Rongting Yue
- Department of Electrical and Computer Engineering, University of Connecticut, 371 Fairfield Way, Storrs, CT, 06269, USA.
| | - Abhishek Dutta
- Department of Electrical and Computer Engineering, University of Connecticut, 371 Fairfield Way, Storrs, CT, 06269, USA
| |
Collapse
|
10
|
Scope of repurposed drugs against the potential targets of the latest variants of SARS-CoV-2. Struct Chem 2022; 33:1585-1608. [PMID: 35938064 PMCID: PMC9346052 DOI: 10.1007/s11224-022-02020-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2022] [Accepted: 07/19/2022] [Indexed: 11/21/2022]
Abstract
The unprecedented outbreak of the severe acute respiratory syndrome (SARS) Coronavirus-2, across the globe, triggered a worldwide uproar in the search for immediate treatment strategies. With no specific drug and not much data available, alternative approaches such as drug repurposing came to the limelight. To date, extensive research on the repositioning of drugs has led to the identification of numerous drugs against various important protein targets of the coronavirus strains, with hopes of the drugs working against the major variants of concerns (alpha, beta, gamma, delta, omicron) of the virus. Advancements in computational sciences have led to improved scope of repurposing via techniques such as structure-based approaches including molecular docking, molecular dynamic simulations and quantitative structure activity relationships, network-based approaches, and artificial intelligence-based approaches with other core machine and deep learning algorithms. This review highlights the various approaches to repurposing drugs from a computational biological perspective, with various mechanisms of action of the drugs against some of the major protein targets of SARS-CoV-2. Additionally, clinical trials data on potential COVID-19 repurposed drugs are also highlighted with stress on the major SARS-CoV-2 targets and the structural effect of variants on these targets. The interaction modelling of some important repurposed drugs has also been elucidated. Furthermore, the merits and demerits of drug repurposing are also discussed, with a focus on the scope and applications of the latest advancements in repurposing.
Collapse
|
11
|
Computational prediction and interpretation of druggable proteins using a stacked ensemble-learning framework. iScience 2022; 25:104883. [PMID: 36046193 PMCID: PMC9421381 DOI: 10.1016/j.isci.2022.104883] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2022] [Revised: 07/08/2022] [Accepted: 08/02/2022] [Indexed: 11/22/2022] Open
Abstract
Discovery of potential drugs requires rapid and precise identification of drug targets. Although traditional experimental methodologies can accurately identify drug targets, they are time-consuming and inappropriate for high-throughput screening. Computational approaches based on machine learning (ML) algorithms can expedite the prediction of druggable proteins; however, the performance of the existing computational methods remains unsatisfactory. This study proposes a computational tool, SPIDER, to enhance the accurate prediction of druggable proteins. SPIDER employs various feature descriptors pertaining to several aspects, including physicochemical properties, compositional information, and composition-transition-distribution information, coupled with well-known ML algorithms to facilitate the construction of the final meta-predictor. The experimental results showed that SPIDER enabled more precise and robust prediction of druggable proteins than the baseline models and current existing methods in terms of the independent test dataset. An online web server was established and made freely available online. Computational models can expedite the identification of potential druggable proteins SPIDER represents the first stacked model proposed for druggable protein prediction SPIDER enables more precise prediction of druggable proteins than existing methods The SPIDER web server is available at http://pmlabstack.pythonanywhere.com/SPIDER.
Collapse
|
12
|
A comprehensive review of Artificial Intelligence and Network based approaches to drug repurposing in Covid-19. Biomed Pharmacother 2022; 153:113350. [PMID: 35777222 PMCID: PMC9236981 DOI: 10.1016/j.biopha.2022.113350] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2022] [Revised: 06/22/2022] [Accepted: 06/24/2022] [Indexed: 11/26/2022] Open
Abstract
Conventional drug discovery and development is tedious and time-taking process; because of which it has failed to keep the required pace to mitigate threats and cater demands of viral and re-occurring diseases, such as Covid-19. The main reasons of this delay in traditional drug development are: high attrition rates, extensive time requirements, and huge financial investment with significant risk. The effective solution to de novo drug discovery is drug repurposing. Previous studies have shown that the network-based approaches and analysis are versatile platform for repurposing as the network biology is used to model the interactions between variety of biological concepts. Herein, we provide a comprehensive background of machine learning and deep learning in drug repurposing while specifically focusing on the applications of network-based approach to drug repurposing in Covid-19, data sources, and tools used. Furthermore, use of network proximity, network diffusion, and AI on network-based drug repurposing for Covid-19 is well-explained. Finally, limitations of network-based approaches in general and specific to network are stated along with future recommendations for better network-based models.
Collapse
|
13
|
DTIP-TC2A: An analytical framework for drug-target interactions prediction methods. Comput Biol Chem 2022; 99:107707. [DOI: 10.1016/j.compbiolchem.2022.107707] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2021] [Revised: 05/01/2022] [Accepted: 05/26/2022] [Indexed: 11/18/2022]
|
14
|
Drug-target interaction prediction via an ensemble of weighted nearest neighbors with interaction recovery. APPL INTELL 2022. [DOI: 10.1007/s10489-021-02495-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
15
|
Zhang H, Cui H, Zhang T, Cao Y, Xuan P. Learning multi-scale heterogenous network topologies and various pairwise attributes for drug-disease association prediction. Brief Bioinform 2022; 23:6523412. [PMID: 35136910 DOI: 10.1093/bib/bbac009] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2021] [Revised: 12/19/2021] [Accepted: 01/07/2022] [Indexed: 01/18/2023] Open
Abstract
MOTIVATION Identifying new therapeutic effects for the approved drugs is beneficial for effectively reducing the drug development cost and time. Most of the recent computational methods concentrate on exploiting multiple kinds of information about drugs and disease to predict the candidate associations between drugs and diseases. However, the drug and disease nodes have neighboring topologies with multiple scales, and the previous methods did not fully exploit and deeply integrate these topologies. RESULTS We present a prediction method, multi-scale topology learning for drug-disease (MTRD), to integrate and learn multi-scale neighboring topologies and the attributes of a pair of drug and disease nodes. First, for multiple kinds of drug similarities, multiple drug-disease heterogenous networks are constructed respectively to integrate the similarities and associations related to drugs and diseases. Moreover, each heterogenous network has its specific topology structure, which is helpful for learning the corresponding specific topology representation. We formulate the topology embeddings for each drug node and disease node by random walking on each heterogeneous network, and the embeddings cover the neighboring topologies with different scopes. Because the multi-scale topology embeddings have context relationships, we construct Bi-directional long short-term memory-based module to encode these embeddings and their relationships and learn the neighboring topology representation. We also design the attention mechanisms at feature level and at scale level to obtain the more informative pairwise features and topology embeddings. A module based on multi-layer convolutional networks is constructed to learn the representative attributes of the drug-disease node pair according to their related similarity and association information. Comprehensive experimental results indicate that MTRD achieves the superior performance than several state-of-the-art methods for predicting drug-disease associations. MTRD also retrieves more actual drug-disease associations in the top-ranked candidates of the prediction result. Case studies on five drugs further demonstrate MTRD's ability in discovering the potential candidate diseases for the interested drugs.
Collapse
Affiliation(s)
- Hongda Zhang
- School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China
| | - Hui Cui
- Department of Computer Science and Information Technology, La Trobe University, Melbourne 3083, Australia
| | - Tiangang Zhang
- School of Mathematical Science, Heilongjiang University, Harbin 150080, China
| | - Yangkun Cao
- School of Artificial Intelligence, Jilin University, Changchun 130012, China
| | - Ping Xuan
- School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China
| |
Collapse
|
16
|
Sorkhi AG, Abbasi Z, Mobarakeh MI, Pirgazi J. Drug-target interaction prediction using unifying of graph regularized nuclear norm with bilinear factorization. BMC Bioinformatics 2021; 22:555. [PMID: 34789169 PMCID: PMC8597250 DOI: 10.1186/s12859-021-04464-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2020] [Accepted: 10/29/2021] [Indexed: 12/27/2022] Open
Abstract
BACKGROUND Wet-lab experiments for identification of interactions between drugs and target proteins are time-consuming, costly and labor-intensive. The use of computational prediction of drug-target interactions (DTIs), which is one of the significant points in drug discovery, has been considered by many researchers in recent years. It also reduces the search space of interactions by proposing potential interaction candidates. RESULTS In this paper, a new approach based on unifying matrix factorization and nuclear norm minimization is proposed to find a low-rank interaction. In this combined method, to solve the low-rank matrix approximation, the terms in the DTI problem are used in such a way that the nuclear norm regularized problem is optimized by a bilinear factorization based on Rank-Restricted Soft Singular Value Decomposition (RRSSVD). In the proposed method, adjacencies between drugs and targets are encoded by graphs. Drug-target interaction, drug-drug similarity, target-target, and combination of similarities have also been used as input. CONCLUSIONS The proposed method is evaluated on four benchmark datasets known as Enzymes (E), Ion channels (ICs), G protein-coupled receptors (GPCRs) and nuclear receptors (NRs) based on AUC, AUPR, and time measure. The results show an improvement in the performance of the proposed method compared to the state-of-the-art techniques.
Collapse
Affiliation(s)
- Ali Ghanbari Sorkhi
- Faculty of Electrical and Computer Engineering, University of Science and Technology of Mazandaran, P.O. Box, 48518-78195, Behshahr, Iran.
| | - Zahra Abbasi
- School of Medicine, Faculty of Medical Biotechnology, Shahroud University of Medical Sciences, Shahroud, Iran
| | | | - Jamshid Pirgazi
- Faculty of Electrical and Computer Engineering, University of Science and Technology of Mazandaran, P.O. Box, 48518-78195, Behshahr, Iran.
| |
Collapse
|
17
|
Sonaye HV, Sheikh RY, Doifode CA. Drug repurposing: Iron in the fire for older drugs. Biomed Pharmacother 2021; 141:111638. [PMID: 34153846 DOI: 10.1016/j.biopha.2021.111638] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2020] [Revised: 04/19/2021] [Accepted: 04/19/2021] [Indexed: 12/22/2022] Open
Abstract
Repositioning or "repurposing" of existing therapies for indications of alternative disease is an attractive approach that can generate lower costs and require a shorter approval time than developing a de novo drug. The development of experimental drugs is time-consuming, expensive, and limited to a fairly small number of targets. The incorporation of separate and complementary data should be used, as each type of data set exposes a specific feature of organism knowledge Drug repurposing opportunities are often focused on sporadic findings or on time-consuming pre-clinical drug tests which are often not guided by hypothesis. In comparison, repurposing in-silico drugs is a new, hypothesis-driven method that takes advantage of big-data use. Nonetheless, the widespread use of omics technology, enhanced data storage, data sense, machine learning algorithms, and computational modeling all give unparalleled knowledge of the methods of action of biological processes and drugs, providing wide availability, for both disease-related data and drug-related data. This review has taken an in-depth look at the current state, possibilities, and limitations of further progress in the field of drug repositioning.
Collapse
Affiliation(s)
- H V Sonaye
- Shri Sachhidanand Shikshan Santh's Taywade College of Pharmacy, Nagpur 441111, India.
| | - R Y Sheikh
- K.E.M. Hospital Research Centre, Pune 411011, India.
| | - C A Doifode
- Shri Sachhidanand Shikshan Santh's Taywade College of Pharmacy, Nagpur 441111, India.
| |
Collapse
|
18
|
Dotolo S, Marabotti A, Facchiano A, Tagliaferri R. A review on drug repurposing applicable to COVID-19. Brief Bioinform 2021; 22:726-741. [PMID: 33147623 PMCID: PMC7665348 DOI: 10.1093/bib/bbaa288] [Citation(s) in RCA: 94] [Impact Index Per Article: 31.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2020] [Revised: 09/16/2020] [Accepted: 09/30/2020] [Indexed: 12/11/2022] Open
Abstract
Drug repurposing involves the identification of new applications for existing drugs at a lower cost and in a shorter time. There are different computational drug-repurposing strategies and some of these approaches have been applied to the coronavirus disease 2019 (COVID-19) pandemic. Computational drug-repositioning approaches applied to COVID-19 can be broadly categorized into (i) network-based models, (ii) structure-based approaches and (iii) artificial intelligence (AI) approaches. Network-based approaches are divided into two categories: network-based clustering approaches and network-based propagation approaches. Both of them allowed to annotate some important patterns, to identify proteins that are functionally associated with COVID-19 and to discover novel drug–disease or drug–target relationships useful for new therapies. Structure-based approaches allowed to identify small chemical compounds able to bind macromolecular targets to evaluate how a chemical compound can interact with the biological counterpart, trying to find new applications for existing drugs. AI-based networks appear, at the moment, less relevant since they need more data for their application.
Collapse
Affiliation(s)
| | | | | | - Roberto Tagliaferri
- Artificial Intelligence, Statistical Pattern Recognition, Clustering, Biomedical imaging and Bioinformatics
| |
Collapse
|
19
|
Mahmud SMH, Chen W, Liu Y, Awal MA, Ahmed K, Rahman MH, Moni MA. PreDTIs: prediction of drug-target interactions based on multiple feature information using gradient boosting framework with data balancing and feature selection techniques. Brief Bioinform 2021; 22:6168499. [PMID: 33709119 PMCID: PMC7989622 DOI: 10.1093/bib/bbab046] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2020] [Revised: 01/25/2021] [Accepted: 01/29/2021] [Indexed: 12/13/2022] Open
Abstract
Discovering drug–target (protein) interactions (DTIs) is of great significance for researching and developing novel drugs, having a tremendous advantage to pharmaceutical industries and patients. However, the prediction of DTIs using wet-lab experimental methods is generally expensive and time-consuming. Therefore, different machine learning-based methods have been developed for this purpose, but there are still substantial unknown interactions needed to discover. Furthermore, data imbalance and feature dimensionality problems are a critical challenge in drug-target datasets, which can decrease the classifier performances that have not been significantly addressed yet. This paper proposed a novel drug–target interaction prediction method called PreDTIs. First, the feature vectors of the protein sequence are extracted by the pseudo-position-specific scoring matrix (PsePSSM), dipeptide composition (DC) and pseudo amino acid composition (PseAAC); and the drug is encoded with MACCS substructure fingerings. Besides, we propose a FastUS algorithm to handle the class imbalance problem and also develop a MoIFS algorithm to remove the irrelevant and redundant features for getting the best optimal features. Finally, balanced and optimal features are provided to the LightGBM Classifier to identify DTIs, and the 5-fold CV validation test method was applied to evaluate the prediction ability of the proposed method. Prediction results indicate that the proposed model PreDTIs is significantly superior to other existing methods in predicting DTIs, and our model could be used to discover new drugs for unknown disorders or infections, such as for the coronavirus disease 2019 using existing drugs compounds and severe acute respiratory syndrome coronavirus 2 protein sequences.
Collapse
Affiliation(s)
- S M Hasan Mahmud
- School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China
| | - Wenyu Chen
- School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China
| | - Yongsheng Liu
- School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China
| | - Md Abdul Awal
- Electronics and Communication Engineering Discipline, Khulna University, Khulna 9208, Bangladesh
| | - Kawsar Ahmed
- Department of Information and Communication Technology, Mawlana Bhashani Science and Technology University, Santosh, Tangail-1902, Bangladesh
| | - Md Habibur Rahman
- Department of Computer Science and Engineering, Islamic University, Kushtia-7003, Bangladesh
| | - Mohammad Ali Moni
- UNSW Digital Health, WHO Center for eHealth, School of Public Health and Community Medicine, Faculty of Medicine, The University of New South Wales, Sydney, Australia
| |
Collapse
|
20
|
He S, Wen Y, Yang X, Liu Z, Song X, Huang X, Bo X. PIMD: An Integrative Approach for Drug Repositioning Using Multiple Characterization Fusion. GENOMICS PROTEOMICS & BIOINFORMATICS 2020; 18:565-581. [PMID: 33075523 PMCID: PMC8377380 DOI: 10.1016/j.gpb.2018.10.012] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/09/2018] [Revised: 09/21/2018] [Accepted: 10/10/2018] [Indexed: 11/28/2022]
Abstract
The accumulation of various types of drug informatics data and computational approaches for drug repositioning can accelerate pharmaceutical research and development. However, the integration of multi-dimensional drug data for precision repositioning remains a pressing challenge. Here, we propose a systematic framework named PIMD to predict drug therapeutic properties by integrating multi-dimensional data for drug repositioning. In PIMD, drug similarity networks (DSNs) based on chemical, pharmacological, and clinical data are fused into an integrated DSN (iDSN) composed of many clusters. Rather than simple fusion, PIMD offers a systematic way to annotate clusters. Unexpected drugs within clusters and drug pairs with a high iDSN similarity score are therefore identified to predict novel therapeutic uses. PIMD provides new insights into the universality, individuality, and complementarity of different drug properties by evaluating the contribution of each property data. To test the performance of PIMD, we use chemical, pharmacological, and clinical properties to generate an iDSN. Analyses of the contributions of each drug property indicate that this iDSN was driven by all data types and performs better than other DSNs. Within the top 20 recommended drug pairs, 7 drugs have been reported to be repurposed. The source code for PIMD is available at https://github.com/Sepstar/PIMD/.
Collapse
Affiliation(s)
- Song He
- Department of Biotechnology, Beijing Institute of Radiation Medicine, Beijing 100850, China
| | - Yuqi Wen
- Department of Biotechnology, Beijing Institute of Radiation Medicine, Beijing 100850, China
| | - Xiaoxi Yang
- Department of Biotechnology, Beijing Institute of Radiation Medicine, Beijing 100850, China
| | - Zhen Liu
- Department of Biotechnology, Beijing Institute of Radiation Medicine, Beijing 100850, China
| | - Xinyu Song
- Department of Biotechnology, Beijing Institute of Radiation Medicine, Beijing 100850, China
| | - Xin Huang
- Department of Biotechnology, Beijing Institute of Radiation Medicine, Beijing 100850, China
| | - Xiaochen Bo
- Department of Biotechnology, Beijing Institute of Radiation Medicine, Beijing 100850, China.
| |
Collapse
|
21
|
Usha T, Middha SK, Kukanur AA, Shravani RV, Anupama MN, Harshitha N, Rahamath A, Kukanuri SA, Goyal AK. Drug Repurposing Approaches: Existing Leads For Novel Threats And Drug Targets. Curr Protein Pept Sci 2020; 22:CPPS-EPUB-110124. [PMID: 32957901 DOI: 10.2174/1389203721666200921152853] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2020] [Revised: 07/29/2020] [Accepted: 08/07/2020] [Indexed: 11/22/2022]
Abstract
Drug Repurposing (DR) is an alternative to the traditional drug discovery process. It is cost and time effective, with high returns and low risk process that can tackle the increasing need for interventions for varied diseases and new outbreaks. Repurposing of old drugs for other diseases has gained a wider attention, as there have been several old drugs approved by FDA for new diseases. In the global emergency of COVID19 pandemic, this is one of the strategies implemented in repurposing of old anti-infective, anti-rheumatic and anti-thrombotic drugs. The goal of the current review is to elaborate the process of DR, its advantages, repurposed drugs for a plethora of disorders, and the evolution of related academic publications. Further, detailed are the computational approaches: literature mining and semantic inference, network-based drug repositioning, signature matching, retrospective clinical analysis, molecular docking and experimental phenotypic screening. We discuss the legal and economical potential barriers in DR, existent collaborative models and recommendations for overcoming these hurdles and leveraging the complete potential of DR in finding new indications.
Collapse
Affiliation(s)
- Talambedu Usha
- Department of Biochemistry, Bangalore University, Bengaluru, Karnataka. India
| | - Sushil K Middha
- DBT-BIF Centre, Department of Biotechnology, Maharani Lakshmi Ammanni College for Women(mLAC), Bengaluru, Karnataka. India
| | | | | | | | | | - Ameena Rahamath
- Department of Biochemistry, mLAC, Bengaluru, Karnataka. India
| | | | - Arvind K Goyal
- Department of Biotechnology, Bodoland University, Kokrajhar783370, BTAD, Assam. India
| |
Collapse
|
22
|
|
23
|
Kumar Shukla P, Kumar Shukla P, Sharma P, Rawat P, Samar J, Moriwal R, Kaur M. Efficient prediction of drug-drug interaction using deep learning models. IET Syst Biol 2020; 14:211-216. [PMID: 32737279 DOI: 10.1049/iet-syb.2019.0116] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
A drug-drug interaction or drug synergy is extensively utilised for cancer treatment. However, prediction of drug-drug interaction is defined as an ill-posed problem, because manual testing is only implementable on small group of drugs. Predicting the drug-drug interaction score has been a popular research topic recently. Recently many machine learning models have proposed in the literature to predict the drug-drug interaction score efficiently. However, these models suffer from the over-fitting issue. Therefore, these models are not so-effective for predicting the drug-drug interaction score. In this work, an integrated convolutional mixture density recurrent neural network is proposed and implemented. The proposed model integrates convolutional neural networks, recurrent neural networks and mixture density networks. Extensive comparative analysis reveals that the proposed model significantly outperforms the competitive models.
Collapse
Affiliation(s)
- Prashant Kumar Shukla
- Department of Computer Science and Engineering, School of Engineering & Technology, Jagran Lake City University (JLU), Bhopal, MP, India
| | - Piyush Kumar Shukla
- Department of Computer Science and Engineering, University Institute of Technology, RGPV, Bhopal, MP, India
| | - Poonam Sharma
- Department of Computer Science & Engineering, Visvesvaraya National Institute of Technology, Nagpur, India
| | - Paresh Rawat
- Department of Electronics and Communication Engineering, Sagar Institute of Science & Technology (SISTec), Gandhi Nagar, Bhopal, MP, India
| | - Jashwant Samar
- Department of Computer Science and Engineering, University Institute of Technology, RGPV, Bhopal, MP, India
| | - Rahul Moriwal
- Department of Computer Science and Engineering-AITR Indore, MP, India
| | - Manjit Kaur
- Department of Computer and Communication Engineering, School of Computing and Information Technology, Manipal University Jaipur, Rajasthan, India.
| |
Collapse
|
24
|
Eslami Manoochehri H, Nourani M. Drug-target interaction prediction using semi-bipartite graph model and deep learning. BMC Bioinformatics 2020; 21:248. [PMID: 32631230 PMCID: PMC7336396 DOI: 10.1186/s12859-020-3518-6] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
BACKGROUND Identifying drug-target interaction is a key element in drug discovery. In silico prediction of drug-target interaction can speed up the process of identifying unknown interactions between drugs and target proteins. In recent studies, handcrafted features, similarity metrics and machine learning methods have been proposed for predicting drug-target interactions. However, these methods cannot fully learn the underlying relations between drugs and targets. In this paper, we propose anew framework for drug-target interaction prediction that learns latent features from drug-target interaction network. RESULTS We present a framework to utilize the network topology and identify interacting and non-interacting drug-target pairs. We model the problem as a semi-bipartite graph in which we are able to use drug-drug and protein-protein similarity in a drug-protein network. We have then used a graph labeling method for vertex ordering in our graph embedding process. Finally, we employed deep neural network to learn the complex pattern of interacting pairs from embedded graphs. We show our approach is able to learn sophisticated drug-target topological features and outperforms other state-of-the-art approaches. CONCLUSIONS The proposed learning model on semi-bipartite graph model, can integrate drug-drug and protein-protein similarities which are semantically different than drug-protein information in a drug-target interaction network. We show our model can determine interaction likelihood for each drug-target pair and outperform other heuristics.
Collapse
Affiliation(s)
- Hafez Eslami Manoochehri
- Department of Electrical and Computer Engineering, The University of Texas at Dallas, 800 W Campbell Rd, Richardson, TX, 75080, USA
| | - Mehrdad Nourani
- Department of Electrical and Computer Engineering, The University of Texas at Dallas, 800 W Campbell Rd, Richardson, TX, 75080, USA.
| |
Collapse
|
25
|
Kaushik AC, Mehmood A, Dai X, Wei DQ. A comparative chemogenic analysis for predicting Drug-Target Pair via Machine Learning Approaches. Sci Rep 2020; 10:6870. [PMID: 32322011 PMCID: PMC7176722 DOI: 10.1038/s41598-020-63842-7] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2020] [Accepted: 04/04/2020] [Indexed: 12/26/2022] Open
Abstract
A computational technique for predicting the DTIs has now turned out to be an indispensable job during the process of drug finding. It tapers the exploration room for interactions by propounding possible interaction contenders for authentication through experiments of wet-lab which are known for their expensiveness and time consumption. Chemogenomics, an emerging research area focused on the systematic examination of the biological impact of a broad series of minute molecular-weighting ligands on a broad raiment of macromolecular target spots. Additionally, with the advancement in time, the complexity of the algorithms is increasing which may result in the entry of big data technologies like Spark in this field soon. In the presented work, we intend to offer an inclusive idea and realistic evaluation of the computational Drug Target Interaction projection approaches, to perform as a guide and reference for researchers who are carrying out work in a similar direction. Precisely, we first explain the data utilized in computational Drug Target Interaction prediction attempts like this. We then sort and explain the best and most modern techniques for the prediction of DTIs. Then, a realistic assessment is executed to show the projection performance of several illustrative approaches in various situations. Ultimately, we underline possible opportunities for additional improvement of Drug Target Interaction projection enactment and also linked study objectives.
Collapse
Affiliation(s)
- Aman Chandra Kaushik
- Wuxi School of Medicine, Jiangnan University, Wuxi, China.
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai, 200240, China.
| | - Aamir Mehmood
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai, 200240, China
| | - Xiaofeng Dai
- Wuxi School of Medicine, Jiangnan University, Wuxi, China
| | - Dong-Qing Wei
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai, 200240, China.
| |
Collapse
|
26
|
Dezső Z, Ceccarelli M. Machine learning prediction of oncology drug targets based on protein and network properties. BMC Bioinformatics 2020; 21:104. [PMID: 32171238 PMCID: PMC7071582 DOI: 10.1186/s12859-020-3442-9] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2019] [Accepted: 03/04/2020] [Indexed: 01/12/2023] Open
Abstract
Background The selection and prioritization of drug targets is a central problem in drug discovery. Computational approaches can leverage the growing number of large-scale human genomics and proteomics data to make in-silico target identification, reducing the cost and the time needed. Results We developed a machine learning approach to score proteins to generate a druggability score of novel targets. In our model we incorporated 70 protein features which included properties derived from the sequence, features characterizing protein functions as well as network properties derived from the protein-protein interaction network. The advantage of this approach is that it is unbiased and even less studied proteins with limited information about their function can score well as most of the features are independent of the accumulated literature. We build models on a training set which consist of targets with approved drugs and a negative set of non-drug targets. The machine learning techniques help to identify the most important combination of features differentiating validated targets from non-targets. We validated our predictions on an independent set of clinical trial drug targets, achieving a high accuracy characterized by an Area Under the Curve (AUC) of 0.89. Our most predictive features included biological function of proteins, network centrality measures, protein essentiality, tissue specificity, localization and solvent accessibility. Our predictions, based on a small set of 102 validated oncology targets, recovered the majority of known drug targets and identifies a novel set of proteins as drug target candidates. Conclusions We developed a machine learning approach to prioritize proteins according to their similarity to approved drug targets. We have shown that the method proposed is highly predictive on a validation dataset consisting of 277 targets of clinical trial drug confirming that our computational approach is an efficient and cost-effective tool for drug target discovery and prioritization. Our predictions were based on oncology targets and cancer relevant biological functions, resulting in significantly higher scores for targets of oncology clinical trial drugs compared to the scores of targets of trial drugs for other indications. Our approach can be used to make indication specific drug-target prediction by combining generic druggability features with indication specific biological functions.
Collapse
Affiliation(s)
- Zoltán Dezső
- Computational Biology-Genomic Research Center, ABBVIE, Redwood City, CA, USA.
| | - Michele Ceccarelli
- Computational Biology-Genomic Research Center, ABBVIE, Redwood City, CA, USA. .,Department of Electrical Engineering and Information Technology (DIETI), University of Naples "Federico II", 80128, Naples, Italy. .,Istituto di Ricerche Genetiche "G. Salvatore", Biogem s.c.ar.l, 83031, Ariano Irpino, Italy.
| |
Collapse
|
27
|
Zhang W, Lin W, Zhang D, Wang S, Shi J, Niu Y. Recent Advances in the Machine Learning-Based Drug-Target Interaction Prediction. Curr Drug Metab 2019; 20:194-202. [PMID: 30129407 DOI: 10.2174/1389200219666180821094047] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2017] [Revised: 01/18/2018] [Accepted: 03/19/2018] [Indexed: 12/28/2022]
Abstract
BACKGROUND The identification of drug-target interactions is a crucial issue in drug discovery. In recent years, researchers have made great efforts on the drug-target interaction predictions, and developed databases, software and computational methods. RESULTS In the paper, we review the recent advances in machine learning-based drug-target interaction prediction. First, we briefly introduce the datasets and data, and summarize features for drugs and targets which can be extracted from different data. Since drug-drug similarity and target-target similarity are important for many machine learning prediction models, we introduce how to calculate similarities based on data or features. Different machine learningbased drug-target interaction prediction methods can be proposed by using different features or information. Thus, we summarize, analyze and compare different machine learning-based prediction methods. CONCLUSION This study provides the guide to the development of computational methods for the drug-target interaction prediction.
Collapse
Affiliation(s)
- Wen Zhang
- School of Computer Science, Wuhan University, Wuhan 430072, China
| | - Weiran Lin
- School of Computer Science, Wuhan University, Wuhan 430072, China
| | - Ding Zhang
- School of Computer Science, Wuhan University, Wuhan 430072, China
| | - Siman Wang
- School of Computer Science, Wuhan University, Wuhan 430072, China
| | - Jingwen Shi
- School of Mathematics and Statistics, Wuhan University, Wuhan 430072, China
| | - Yanqing Niu
- School of Mathematics and Statistics, South-Central University for Nationalities, Wuhan 430074, China
| |
Collapse
|
28
|
A Bayesian machine learning approach for drug target identification using diverse data types. Nat Commun 2019; 10:5221. [PMID: 31745082 PMCID: PMC6863850 DOI: 10.1038/s41467-019-12928-6] [Citation(s) in RCA: 116] [Impact Index Per Article: 23.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2018] [Accepted: 10/10/2019] [Indexed: 11/18/2022] Open
Abstract
Drug target identification is a crucial step in development, yet is also among the most complex. To address this, we develop BANDIT, a Bayesian machine-learning approach that integrates multiple data types to predict drug binding targets. Integrating public data, BANDIT benchmarked a ~90% accuracy on 2000+ small molecules. Applied to 14,000+ compounds without known targets, BANDIT generated ~4,000 previously unknown molecule-target predictions. From this set we validate 14 novel microtubule inhibitors, including 3 with activity on resistant cancer cells. We applied BANDIT to ONC201—an anti-cancer compound in clinical development whose target had remained elusive. We identified and validated DRD2 as ONC201’s target, and this information is now being used for precise clinical trial design. Finally, BANDIT identifies connections between different drug classes, elucidating previously unexplained clinical observations and suggesting new drug repositioning opportunities. Overall, BANDIT represents an efficient and accurate platform to accelerate drug discovery and direct clinical application. Drug target identification is a crucial step in drug development. Here, the authors introduce a Bayesian machine learning framework that integrates multiple data types to predict the targets of small molecules, enabling identification of a new set of microtubule inhibitors and the target of the anti-cancer molecule ONC201.
Collapse
|
29
|
Lotfi Shahreza M, Ghadiri N, Mousavi SR, Varshosaz J, Green JR. A review of network-based approaches to drug repositioning. Brief Bioinform 2019; 19:878-892. [PMID: 28334136 DOI: 10.1093/bib/bbx017] [Citation(s) in RCA: 161] [Impact Index Per Article: 32.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2016] [Indexed: 01/17/2023] Open
Abstract
Experimental drug development is time-consuming, expensive and limited to a relatively small number of targets. However, recent studies show that repositioning of existing drugs can function more efficiently than de novo experimental drug development to minimize costs and risks. Previous studies have proven that network analysis is a versatile platform for this purpose, as the biological networks are used to model interactions between many different biological concepts. The present study is an attempt to review network-based methods in predicting drug targets for drug repositioning. For each method, the preferred type of data set is described, and their advantages and limitations are discussed. For each method, we seek to provide a brief description, as well as an evaluation based on its performance metrics.We conclude that integrating distinct and complementary data should be used because each type of data set reveals a unique aspect of information about an organism. We also suggest that applying a standard set of evaluation metrics and data sets would be essential in this fast-growing research domain.
Collapse
Affiliation(s)
- Maryam Lotfi Shahreza
- Department of Electrical and Computer Engineering, Isfahan University of Technology, Isfahan, Iran
| | - Nasser Ghadiri
- Department of Electrical and Computer Engineering, Isfahan University of Technology, Isfahan, Iran
| | | | - Jaleh Varshosaz
- Drug Delivery Systems Research Center of Isfahan University of Medical Sciences
| | - James R Green
- Department of Electrical and Computer Engineering, Isfahan University of Technology, Isfahan, Iran
| |
Collapse
|
30
|
Xuan P, Cao Y, Zhang T, Wang X, Pan S, Shen T. Drug repositioning through integration of prior knowledge and projections of drugs and diseases. Bioinformatics 2019; 35:4108-4119. [DOI: 10.1093/bioinformatics/btz182] [Citation(s) in RCA: 50] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2018] [Revised: 02/24/2019] [Accepted: 03/12/2019] [Indexed: 12/20/2022] Open
Abstract
Abstract
Motivation
Identifying and developing novel therapeutic effects for existing drugs contributes to reduction of drug development costs. Most of the previous methods focus on integration of the heterogeneous data of drugs and diseases from multiple sources for predicting the candidate drug–disease associations. However, they fail to take the prior knowledge of drugs and diseases and their sparse characteristic into account. It is essential to develop a method that exploits the more useful information to predict the reliable candidate associations.
Results
We present a method based on non-negative matrix factorization, DisDrugPred, to predict the drug-related candidate disease indications. A new type of drug similarity is firstly calculated based on their associated diseases. DisDrugPred completely integrates two types of disease similarities, the associations between drugs and diseases, and the various similarities between drugs from different levels including the chemical structures of drugs, the target proteins of drugs, the diseases associated with drugs and the side effects of drugs. The prior knowledge of drugs and diseases and the sparse characteristic of drug–disease associations provide a deep biological perspective for capturing the relationships between drugs and diseases. Simultaneously, the possibility that a drug is associated with a disease is also dependant on their projections in the low-dimension feature space. Therefore, DisDrugPred deeply integrates the diverse prior knowledge, the sparse characteristic of associations and the projections of drugs and diseases. DisDrugPred achieves superior prediction performance than several state-of-the-art methods for drug–disease association prediction. During the validation process, DisDrugPred also can retrieve more actual drug–disease associations in the top part of prediction result which often attracts more attention from the biologists. Moreover, case studies on five drugs further confirm DisDrugPred’s ability to discover potential candidate disease indications for drugs.
Availability and implementation
The fourth type of drug similarity and the predicted candidates for all the drugs are available at https://github.com/pingxuan-hlju/DisDrugPred.
Supplementary information
Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Ping Xuan
- School of Computer Science and Technology, Heilongjiang University, Harbin, China
| | - Yangkun Cao
- School of Computer Science and Technology, Heilongjiang University, Harbin, China
| | - Tiangang Zhang
- School of Mathematical Science, Heilongjiang University, Harbin, China
| | - Xiao Wang
- School of Computer Science, Beijing University of Posts and Telecommunications, Beijing, China
| | - Shuxiang Pan
- School of Computer Science and Technology, Heilongjiang University, Harbin, China
| | - Tonghui Shen
- School of Computer Science and Technology, Heilongjiang University, Harbin, China
| |
Collapse
|
31
|
Alaimo S, Pulvirenti A. Network-Based Drug Repositioning: Approaches, Resources, and Research Directions. Methods Mol Biol 2019; 1903:97-113. [PMID: 30547438 DOI: 10.1007/978-1-4939-8955-3_6] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
The wealth of knowledge and omic data available in drug research allowed the rising of several computational methods in drug discovery field yielding a novel and exciting application called drug repositioning. Several computational methods try to make a high-level integration of all the knowledge in order to discover unknown mechanisms. In this chapter we present an in-depth review of data resources and computational models for drug repositioning.
Collapse
Affiliation(s)
- Salvatore Alaimo
- Department of Clinical and Experimental Medicine, University of Catania, Catania, Italy
| | - Alfredo Pulvirenti
- Department of Clinical and Experimental Medicine, University of Catania, Catania, Italy.
| |
Collapse
|
32
|
Ezzat A, Wu M, Li X, Kwoh CK. Computational Prediction of Drug-Target Interactions via Ensemble Learning. Methods Mol Biol 2019; 1903:239-254. [PMID: 30547446 DOI: 10.1007/978-1-4939-8955-3_14] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Therapeutic effects of drugs are mediated via interactions between them and their intended targets. As such, prediction of drug-target interactions is of great importance. Drug-target interaction prediction is especially relevant in the case of drug repositioning where attempts are made to repurpose old drugs for new indications. While experimental wet-lab techniques exist for predicting such interactions, they are tedious and time-consuming. On the other hand, computational methods also exist for predicting interactions, and they do so with reasonable accuracy. In addition, computational methods can help guide their wet-lab counterparts by recommending interactions for further validation. In this chapter, a computational method for predicting drug-target interactions is presented. Specifically, we describe a machine learning method that utilizes ensemble learning to perform predictions. We also mention details pertaining to the preparation of the data required for the prediction effort and demonstrate how to evaluate and improve prediction performance.
Collapse
Affiliation(s)
- Ali Ezzat
- Biomedical Informatics Lab, School of Computer Science and Engineering, Nanyang Technological University, Singapore, Singapore.
| | - Min Wu
- Data Analytics Department, Institute for Infocomm Research, A-Star, Singapore, Singapore
| | - Xiaoli Li
- Data Analytics Department, Institute for Infocomm Research, A-Star, Singapore, Singapore
| | - Chee-Keong Kwoh
- Division of Software and Information Systems, School of Computer Science and Engineering, Nanyang Technological University, Singapore, Singapore
| |
Collapse
|
33
|
Sharma A, Rani R. BE-DTI': Ensemble framework for drug target interaction prediction using dimensionality reduction and active learning. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2018; 165:151-162. [PMID: 30337070 DOI: 10.1016/j.cmpb.2018.08.011] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/12/2018] [Revised: 08/03/2018] [Accepted: 08/17/2018] [Indexed: 06/08/2023]
Abstract
BACKGROUND AND OBJECTIVE Drug-target interaction prediction plays an intrinsic role in the drug discovery process. Prediction of novel drugs and targets helps in identifying optimal drug therapies for various stringent diseases. Computational prediction of drug-target interactions can help to identify potential drug-target pairs and speed-up the process of drug repositioning. In our present, work we have focused on machine learning algorithms for predicting drug-target interactions from the pool of existing drug-target data. The key idea is to train the classifier using existing DTI so as to predict new or unknown DTI. However, there are various challenges such as class imbalance and high dimensional nature of data that need to be addressed before developing optimal drug-target interaction model. METHODS In this paper, we propose a bagging based ensemble framework named BE-DTI' for drug-target interaction prediction using dimensionality reduction and active learning to deal with class-imbalanced data. Active learning helps to improve under-sampling bagging based ensembles. Dimensionality reduction is used to deal with high dimensional data. RESULTS Results show that the proposed technique outperforms the other five competing methods in 10-fold cross-validation experiments in terms of AUC=0.927, Sensitivity=0.886, Specificity=0.864, and G-mean=0.874. CONCLUSION Missing interactions and new interactions are predicted using the proposed framework. Some of the known interactions are removed from the original dataset and their interactions are recalculated to check the accuracy of the proposed framework. Moreover, validation of the proposed approach is performed using the external dataset. All these results show that structurally similar drugs tend to interact with similar targets.
Collapse
Affiliation(s)
- Aman Sharma
- Computer Science and Engineering Department, Thapar Institute of Engineering & Technology, Punjab, Patiala, India.
| | - Rinkle Rani
- Computer Science and Engineering Department, Thapar Institute of Engineering & Technology, Punjab, Patiala, India.
| |
Collapse
|
34
|
Ezzat A, Wu M, Li XL, Kwoh CK. Computational prediction of drug–target interactions using chemogenomic approaches: an empirical survey. Brief Bioinform 2018; 20:1337-1357. [DOI: 10.1093/bib/bby002] [Citation(s) in RCA: 117] [Impact Index Per Article: 19.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2017] [Revised: 12/21/2017] [Indexed: 01/18/2023] Open
Abstract
Abstract
Computational prediction of drug–target interactions (DTIs) has become an essential task in the drug discovery process. It narrows down the search space for interactions by suggesting potential interaction candidates for validation via wet-lab experiments that are well known to be expensive and time-consuming. In this article, we aim to provide a comprehensive overview and empirical evaluation on the computational DTI prediction techniques, to act as a guide and reference for our fellow researchers. Specifically, we first describe the data used in such computational DTI prediction efforts. We then categorize and elaborate the state-of-the-art methods for predicting DTIs. Next, an empirical comparison is performed to demonstrate the prediction performance of some representative methods under different scenarios. We also present interesting findings from our evaluation study, discussing the advantages and disadvantages of each method. Finally, we highlight potential avenues for further enhancement of DTI prediction performance as well as related research directions.
Collapse
|
35
|
Drug-target interaction prediction using ensemble learning and dimensionality reduction. Methods 2017; 129:81-88. [DOI: 10.1016/j.ymeth.2017.05.016] [Citation(s) in RCA: 71] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2017] [Revised: 04/03/2017] [Accepted: 05/18/2017] [Indexed: 11/23/2022] Open
|
36
|
Farnadi G, Bach SH, Moens MF, Getoor L, De Cock M. Soft quantification in statistical relational learning. Mach Learn 2017. [DOI: 10.1007/s10994-017-5647-3] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
37
|
Lotfi Shahreza M, Ghadiri N, Mousavi SR, Varshosaz J, Green JR. Heter-LP: A heterogeneous label propagation algorithm and its application in drug repositioning. J Biomed Inform 2017; 68:167-183. [DOI: 10.1016/j.jbi.2017.03.006] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2016] [Revised: 02/09/2017] [Accepted: 03/10/2017] [Indexed: 12/14/2022]
|
38
|
Abstract
The prediction of molecular targets is highly beneficial during the drug discovery process, be it for off-target elucidation or deconvolution of phenotypic screens. Here, we present OCEAN, a target prediction tool exclusively utilizing publically available ChEMBL data. OCEAN uses a heuristics approach based on a validation set containing almost 1000 drug ← → target relationships. New ChEMBL data (ChEMBL20 as well as ChEMBL21) released after the validation was used for a prospective OCEAN performance check. The success rates of OCEAN to predict correctly the targets within the TOP10 ranks are 77% for recently marketed drugs and 62% for all new ChEMBL20 compounds and 51% for all new ChEMBL21 compounds. OCEAN is also capable of identifying polypharmacological compounds; the success rate for molecules simultaneously hitting at least two targets is 64% to be correctly predicted within the TOP10 ranks. The source code of OCEAN can be found at http://www.github.com/rdkit/OCEAN.
Collapse
Affiliation(s)
- Paul Czodrowski
- Discovery Technologies, Merck Serono Research, Merck KGaA , Frankfurter Straße 250, 64293 Darmstadt, Germany
| | - Wolf-Guido Bolick
- Discovery Technologies, Merck Serono Research, Merck KGaA , Frankfurter Straße 250, 64293 Darmstadt, Germany
| |
Collapse
|
39
|
Mullen J, Cockell SJ, Tipney H, Woollard PM, Wipat A. Mining integrated semantic networks for drug repositioning opportunities. PeerJ 2016; 4:e1558. [PMID: 26844016 PMCID: PMC4736989 DOI: 10.7717/peerj.1558] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2015] [Accepted: 12/11/2015] [Indexed: 11/20/2022] Open
Abstract
Current research and development approaches to drug discovery have become less fruitful and more costly. One alternative paradigm is that of drug repositioning. Many marketed examples of repositioned drugs have been identified through serendipitous or rational observations, highlighting the need for more systematic methodologies to tackle the problem. Systems level approaches have the potential to enable the development of novel methods to understand the action of therapeutic compounds, but requires an integrative approach to biological data. Integrated networks can facilitate systems level analyses by combining multiple sources of evidence to provide a rich description of drugs, their targets and their interactions. Classically, such networks can be mined manually where a skilled person is able to identify portions of the graph (semantic subgraphs) that are indicative of relationships between drugs and highlight possible repositioning opportunities. However, this approach is not scalable. Automated approaches are required to systematically mine integrated networks for these subgraphs and bring them to the attention of the user. We introduce a formal framework for the definition of integrated networks and their associated semantic subgraphs for drug interaction analysis and describe DReSMin, an algorithm for mining semantically-rich networks for occurrences of a given semantic subgraph. This algorithm allows instances of complex semantic subgraphs that contain data about putative drug repositioning opportunities to be identified in a computationally tractable fashion, scaling close to linearly with network data. We demonstrate the utility of our approach by mining an integrated drug interaction network built from 11 sources. This work identified and ranked 9,643,061 putative drug-target interactions, showing a strong correlation between highly scored associations and those supported by literature. We discuss the 20 top ranked associations in more detail, of which 14 are novel and 6 are supported by the literature. We also show that our approach better prioritizes known drug-target interactions, than other state-of-the art approaches for predicting such interactions.
Collapse
Affiliation(s)
- Joseph Mullen
- Interdisciplinary Computing and Complex BioSystems Research Group, School of Computing Science, University of Newcastle-upon-Tyne , Newcastle upon Tyne , United Kingdom
| | - Simon J Cockell
- Bioinformatics Support Unit, University of Newcastle-upon-Tyne , United Kingdom
| | - Hannah Tipney
- Computational Biology, Target Sciences, GSK R&D, GlaxoSmithKline , Stevenage, Hertfordshire , United Kingdom
| | - Peter M Woollard
- Computational Biology, Target Sciences, GSK R&D, GlaxoSmithKline , Stevenage, Hertfordshire , United Kingdom
| | - Anil Wipat
- Interdisciplinary Computing and Complex BioSystems Research Group, School of Computing Science, University of Newcastle-upon-Tyne , Newcastle upon Tyne , United Kingdom
| |
Collapse
|
40
|
Peng L, Liao B, Zhu W, Li Z, Li K. Predicting Drug-Target Interactions With Multi-Information Fusion. IEEE J Biomed Health Inform 2015; 21:561-572. [PMID: 26731781 DOI: 10.1109/jbhi.2015.2513200] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Identifying potential associations between drugs and targets is a critical prerequisite for modern drug discovery and repurposing. However, predicting these associations is difficult because of the limitations of existing computational methods. Most models only consider chemical structures and protein sequences, and other models are oversimplified. Moreover, datasets used for analysis contain only true-positive interactions, and experimentally validated negative samples are unavailable. To overcome these limitations, we developed a semi-supervised based learning framework called NormMulInf through collaborative filtering theory by using labeled and unlabeled interaction information. The proposed method initially determines similarity measures, such as similarities among samples and local correlations among the labels of the samples, by integrating biological information. The similarity information is then integrated into a robust principal component analysis model, which is solved using augmented Lagrange multipliers. Experimental results on four classes of drug-target interaction networks suggest that the proposed approach can accurately classify and predict drug-target interactions. Part of the predicted interactions are reported in public databases. The proposed method can also predict possible targets for new drugs and can be used to determine whether atropine may interact with alpha1B- and beta1- adrenergic receptors. Furthermore, the developed technique identifies potential drugs for new targets and can be used to assess whether olanzapine and propiomazine may target 5HT2B. Finally, the proposed method can potentially address limitations on studies of multitarget drugs and multidrug targets.
Collapse
|