1
|
Feng X, Ma Z, Yu C, Xin R. MRNDR: Multihead Attention-Based Recommendation Network for Drug Repurposing. J Chem Inf Model 2024; 64:2654-2669. [PMID: 38373300 DOI: 10.1021/acs.jcim.3c01726] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/21/2024]
Abstract
As is well-known, the process of developing new drugs is extremely expensive, whereas drug repurposing represents a promising approach to augment the efficiency of new drug development. While this method can indeed spare us from expensive drug toxicity and safety experiments, it still demands a substantial amount of time to carry out precise efficacy experiments for specific diseases, thereby consuming a significant quantity of resources. Therefore, if we can prescreen potential other indications for selected drugs, it could result in substantial cost savings. In light of this, this paper introduces a drug repurposing recommendation model called MRNDR, which stands for Multi-head attention-based Recommendation Network for Drug Repurposing. This model serves as a prediction tool for drug-disease relationships, leveraging the multihead self-attention mechanism that demonstrates robust generalization capabilities. These capabilities stem not only from our extensive million-level training data set, BioRE (Biology Recommended Entity data), but also from the utilization of the WRDS (Weighted Representation Distance Score) algorithm proposed by us. The MRNDR model has achieved new state-of-the-art results on the GP-KG public data set, with an MRR (Mean Reciprocal Rank) score of 0.308 and a Hits@10 score of 0.628. This represents significant improvements of 4.7% (MRR) and 18.1% (Hits@10) over the current best-performing models. Additionally, to further validate the practical utility of the model, we examined results recommended by MRNDR that were not present in the training data set. Some of these recommendations have undergone clinical trials, as evidenced by their presence on ClinicalTrials.gov and the China Clinical Trials Center, indirectly confirming the applicability of MRNDR. The MRNDR model can predict the reusability of candidate drugs, reducing the need for manual expert assessments and enabling efficient drug repurposing.
Collapse
Affiliation(s)
- Xin Feng
- School of Science, Jilin Institute of Chemical Technology, Jilin 130000, P.R. China
- State Key Laboratory of Inorganic Synthesis and Preparative Chemistry, College of Chemistry, Jilin University, Changchun 130012, P.R. China
- Department of Epidemiology and Biostatistics, School of Public Health, Jilin University, Changchun 130012, P.R. China
| | - Zhansen Ma
- College of Information and Control Engineering, Jilin Institute of Chemical Technology, Jilin 130000, P.R. China
| | - Cuinan Yu
- College of Computer Science and Technology, and Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, P.R. China
| | - Ruihao Xin
- College of Information and Control Engineering, Jilin Institute of Chemical Technology, Jilin 130000, P.R. China
- College of Computer Science and Technology, and Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, P.R. China
| |
Collapse
|
2
|
Hay Mele B, Rossetti F, Cubellis MV, Monticelli M, Andreotti G. Drug Repurposing and Lysosomal Storage Disorders: A Trick to Treat. Genes (Basel) 2024; 15:290. [PMID: 38540351 PMCID: PMC10970111 DOI: 10.3390/genes15030290] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2024] [Revised: 02/19/2024] [Accepted: 02/22/2024] [Indexed: 06/14/2024] Open
Abstract
Rare diseases, or orphan diseases, are defined as diseases affecting a small number of people compared to the general population. Among these, we find lysosomal storage disorders (LSDs), a cluster of rare metabolic diseases characterized by enzyme mutations causing abnormal glycolipid storage. Drug repositioning involves repurposing existing approved drugs for new therapeutic applications, offering advantages in cost, time savings, and a lower risk of failure. We present a comprehensive analysis of existing drugs, their repurposing potential, and their clinical implications in the context of LSDs, highlighting the necessity of mutation-specific approaches. Our review systematically explores the landscape of drug repositioning as a means to enhance LSDs therapies. The findings advocate for the strategic repositioning of drugs, accentuating its role in expediting the discovery of effective treatments. We conclude that drug repurposing represents a viable pathway for accelerating therapeutic discovery for LSDs, emphasizing the need for the careful evaluation of drug efficacy and toxicity in disease-specific contexts.
Collapse
Affiliation(s)
- Bruno Hay Mele
- Department of Biology, University of Napoli “Federico II”, Complesso Universitario Monte Sant’Angelo, Via Cinthia, 80126 Napoli, Italy; (B.H.M.); (F.R.); (M.V.C.)
| | - Federica Rossetti
- Department of Biology, University of Napoli “Federico II”, Complesso Universitario Monte Sant’Angelo, Via Cinthia, 80126 Napoli, Italy; (B.H.M.); (F.R.); (M.V.C.)
| | - Maria Vittoria Cubellis
- Department of Biology, University of Napoli “Federico II”, Complesso Universitario Monte Sant’Angelo, Via Cinthia, 80126 Napoli, Italy; (B.H.M.); (F.R.); (M.V.C.)
- Institute of Biomolecular Chemistry ICB, CNR, Via Campi Flegrei 34, 80078 Pozzuoli, Italy;
- Stazione Zoologica “Anton Dohrn”, Villa Comunale, 80121 Naples, Italy
| | - Maria Monticelli
- Department of Biology, University of Napoli “Federico II”, Complesso Universitario Monte Sant’Angelo, Via Cinthia, 80126 Napoli, Italy; (B.H.M.); (F.R.); (M.V.C.)
- Institute of Biomolecular Chemistry ICB, CNR, Via Campi Flegrei 34, 80078 Pozzuoli, Italy;
| | - Giuseppina Andreotti
- Institute of Biomolecular Chemistry ICB, CNR, Via Campi Flegrei 34, 80078 Pozzuoli, Italy;
| |
Collapse
|
3
|
Liu X, Yang Z, Cheng J. Music recommendation algorithms based on knowledge graph and multi-task feature learning. Sci Rep 2024; 14:2055. [PMID: 38267571 PMCID: PMC10808181 DOI: 10.1038/s41598-024-52463-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Accepted: 01/18/2024] [Indexed: 01/26/2024] Open
Abstract
During music recommendation scenarios, sparsity and cold start problems are inevitable. Auxiliary information has been utilized in music recommendation algorithms to provide users with more accurate music recommendation results. This study proposes an end-to-end framework, MMSS_MKR, that uses a knowledge graph as a source of auxiliary information to serve the information obtained from it to the recommendation module. The framework exploits Cross & Compression Units to bridge the knowledge graph embedding task with recommendation task modules. We can obtain more realistic triple information and exclude false triple information as much as possible, because our model obtains triple information through the music knowledge graph, and the information obtained through the recommendation module is used to determine the truth of the triple information; thus, the knowledge graph embedding task is used to perform the recommendation task. In the recommendation module, multiple predictions are adopted to predict the recommendation accuracy. In the knowledge graph embedding module, multiple calculations are used to calculate the score. Finally, the loss function of the model is improved to help us to obtain more useful information for music recommendations. The MMSS_MKR model achieved significant improvements in music recommendations compared with many existing recommendation models.
Collapse
Affiliation(s)
- Xinqiao Liu
- School of Music, Qufu Normal University, Rizhao, 276826, China
| | - Zhisheng Yang
- Faculty of Computer Science and Technology, Qilu University of Technology (Shandong Academy of Sciences), Jinan, 250353, China
| | - Jinyong Cheng
- Faculty of Computer Science and Technology, Qilu University of Technology (Shandong Academy of Sciences), Jinan, 250353, China.
| |
Collapse
|
4
|
Gao Z, Ding P, Xu R. IUPHAR review - Data-driven computational drug repurposing approaches for opioid use disorder. Pharmacol Res 2024; 199:106960. [PMID: 37832859 DOI: 10.1016/j.phrs.2023.106960] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Revised: 10/08/2023] [Accepted: 10/10/2023] [Indexed: 10/15/2023]
Abstract
Opioid Use Disorder (OUD) is a chronic and relapsing condition characterized by the misuse of opioid drugs, causing significant morbidity and mortality in the United States. Existing medications for OUD are limited, and there is an immediate need to discover treatments with enhanced safety and efficacy. Drug repurposing aims to find new indications for existing medications, offering a time-saving and cost-efficient alternative strategy to traditional drug discovery. Computational approaches have been developed to further facilitate the drug repurposing process. In this paper, we reviewed state-of-the-art data-driven computational drug repurposing approaches for OUD and discussed their advantages and potential challenges. We also highlighted promising repurposed candidate drugs for OUD that were identified by computational drug repurposing techniques and reviewed studies supporting their potential mechanisms of action in treating OUD.
Collapse
Affiliation(s)
- Zhenxiang Gao
- Center for Artificial Intelligence in Drug Discovery, School of Medicine, Case Western Reserve University, Cleveland, OH, USA
| | - Pingjian Ding
- Center for Artificial Intelligence in Drug Discovery, School of Medicine, Case Western Reserve University, Cleveland, OH, USA
| | - Rong Xu
- Center for Artificial Intelligence in Drug Discovery, School of Medicine, Case Western Reserve University, Cleveland, OH, USA.
| |
Collapse
|
5
|
Banerjee J, Taroni JN, Allaway RJ, Prasad DV, Guinney J, Greene C. Machine learning in rare disease. Nat Methods 2023:10.1038/s41592-023-01886-z. [PMID: 37248386 DOI: 10.1038/s41592-023-01886-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2021] [Accepted: 04/22/2023] [Indexed: 05/31/2023]
Abstract
High-throughput profiling methods (such as genomics or imaging) have accelerated basic research and made deep molecular characterization of patient samples routine. These approaches provide a rich portrait of genes, molecular pathways and cell types involved in disease phenotypes. Machine learning (ML) can be a useful tool for extracting disease-relevant patterns from high-dimensional datasets. However, depending upon the complexity of the biological question, machine learning often requires many samples to identify recurrent and biologically meaningful patterns. Rare diseases are inherently limited in clinical cases, leading to few samples to study. In this Perspective, we outline the challenges and emerging solutions for using ML for small sample sets, specifically in rare diseases. Advances in ML methods for rare diseases are likely to be informative for applications beyond rare diseases for which few samples exist with high-dimensional data. We propose that the method community prioritize the development of ML techniques for rare disease research.
Collapse
Affiliation(s)
| | - Jaclyn N Taroni
- Childhood Cancer Data Lab, Alex's Lemonade Stand Foundation, Philadelphia, PA, USA
| | | | | | | | - Casey Greene
- Department of Biomedical Informatics, University of Colorado School of Medicine, Aurora, CO, USA.
| |
Collapse
|
6
|
Yue Y, McDonald D, Hao L, Lei H, Butler MS, He S. FLONE: fully Lorentz network embedding for inferring novel drug targets. BIOINFORMATICS ADVANCES 2023; 3:vbad066. [PMID: 37275772 PMCID: PMC10235194 DOI: 10.1093/bioadv/vbad066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Revised: 05/11/2023] [Accepted: 05/23/2023] [Indexed: 06/07/2023]
Abstract
Motivation To predict drug targets, graph-based machine-learning methods have been widely used to capture the relationships between drug, target and disease entities in drug-disease-target (DDT) networks. However, many methods cannot explicitly consider disease types at inference time and so will predict the same target for a given drug under any disease condition. Meanwhile, DDT networks are usually organized hierarchically carrying interactive relationships between involved entities, but these methods, especially those based on Euclidean embedding cannot fully utilize such topological information, which might lead to sub-optimal results. We hypothesized that, by importing hyperbolic embedding specifically for modeling hierarchical DDT networks, graph-based algorithms could better capture relationships between aforementioned entities, which ultimately improves target prediction performance. Results We formulated the target prediction problem as a knowledge graph completion task explicitly considering disease types. We proposed FLONE, a hyperbolic embedding-based method based on capturing hierarchical topological information in DDT networks. The experimental results on two DDT networks showed that by introducing hyperbolic space, FLONE generates more accurate target predictions than its Euclidean counterparts, which supports our hypothesis. We also devised hyperbolic encoders to fuse external domain knowledge, to make FLONE enable handling samples corresponding to previously unseen drugs and targets for more practical scenarios. Availability and implementation Source code and dataset information are at: https://github.com/arantir123/DDT_triple_prediction. Supplementary information Supplementary data are available at Bioinformatics Advances online.
Collapse
Affiliation(s)
- Yang Yue
- Centre for Computational Biology, School of Computer Science, The University of Birmingham, Edgbaston, Birmingham, B15 2TT, UK
| | - David McDonald
- AIA Insights Ltd., 71-75 Shelton Street, London, Greater London, WC2H 9JQ, UK
| | - Luoying Hao
- Centre for Computational Biology, School of Computer Science, The University of Birmingham, Edgbaston, Birmingham, B15 2TT, UK
| | - Huangshu Lei
- YaoPharma Co., Ltd., 100 Xingguang Avenue, Renhe Town, Yubei District, Chongqing, 401121, China
| | - Mark S Butler
- AIA Insights Ltd., 71-75 Shelton Street, London, Greater London, WC2H 9JQ, UK
| | - Shan He
- Centre for Computational Biology, School of Computer Science, The University of Birmingham, Edgbaston, Birmingham, B15 2TT, UK
| |
Collapse
|
7
|
Kang H, Hou L, Gu Y, Lu X, Li J, Li Q. Drug-disease association prediction with literature based multi-feature fusion. Front Pharmacol 2023; 14:1205144. [PMID: 37284317 PMCID: PMC10239876 DOI: 10.3389/fphar.2023.1205144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Accepted: 05/09/2023] [Indexed: 06/08/2023] Open
Abstract
Introduction: Exploring the potential efficacy of a drug is a valid approach for drug development with shorter development times and lower costs. Recently, several computational drug repositioning methods have been introduced to learn multi-features for potential association prediction. However, fully leveraging the vast amount of information in the scientific literature to enhance drug-disease association prediction is a great challenge. Methods: We constructed a drug-disease association prediction method called Literature Based Multi-Feature Fusion (LBMFF), which effectively integrated known drugs, diseases, side effects and target associations from public databases as well as literature semantic features. Specifically, a pre-training and fine-tuning BERT model was introduced to extract literature semantic information for similarity assessment. Then, we revealed drug and disease embeddings from the constructed fusion similarity matrix by a graph convolutional network with an attention mechanism. Results: LBMFF achieved superior performance in drug-disease association prediction with an AUC value of 0.8818 and an AUPR value of 0.5916. Discussion: LBMFF achieved relative improvements of 31.67% and 16.09%, respectively, over the second-best results, compared to single feature methods and seven existing state-of-the-art prediction methods on the same test datasets. Meanwhile, case studies have verified that LBMFF can discover new associations to accelerate drug development. The proposed benchmark dataset and source code are available at: https://github.com/kang-hongyu/LBMFF.
Collapse
Affiliation(s)
- Hongyu Kang
- Department of Biomedical Engineering, School of Life Science, Beijing Institute of Technology, Beijing, China
- Institute of Medical Information, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Li Hou
- Institute of Medical Information, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Yaowen Gu
- Institute of Medical Information, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Xiao Lu
- Department of Biomedical Engineering, School of Life Science, Beijing Institute of Technology, Beijing, China
| | - Jiao Li
- Institute of Medical Information, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Qin Li
- Department of Biomedical Engineering, School of Life Science, Beijing Institute of Technology, Beijing, China
| |
Collapse
|
8
|
Xu Z, Eichler B, Klausner EA, Duffy-Matzner J, Zheng W. Lead/Drug Discovery from Natural Resources. Molecules 2022; 27:molecules27238280. [PMID: 36500375 PMCID: PMC9736696 DOI: 10.3390/molecules27238280] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Revised: 11/18/2022] [Accepted: 11/18/2022] [Indexed: 11/29/2022] Open
Abstract
Natural products and their derivatives have been shown to be effective drug candidates against various diseases for many years. Over a long period of time, nature has produced an abundant and prosperous source pool for novel therapeutic agents with distinctive structures. Major natural-product-based drugs approved for clinical use include anti-infectives and anticancer agents. This paper will review some natural-product-related potent anticancer, anti-HIV, antibacterial and antimalarial drugs or lead compounds mainly discovered from 2016 to 2022. Structurally typical marine bioactive products are also included. Molecular modeling, machine learning, bioinformatics and other computer-assisted techniques that are very important in narrowing down bioactive core structural scaffolds and helping to design new structures to fight against key disease-associated molecular targets based on available natural products are considered and briefly reviewed.
Collapse
Affiliation(s)
- Zhihong Xu
- Department of Chemistry and Biochemistry, Augustana University, 2001 S Summit Ave., Sioux Falls, SD 57197, USA
- Institute of Interventional & Vascular Surgery, Tongji University, Shanghai 200072, China
- Department of Pharmaceutical Sciences, South College School of Pharmacy, 400 Goody’s Lane, Knoxville, TN 37922, USA
- Correspondence: ; Tel.: +1-(605)-274-5008
| | - Barrett Eichler
- Department of Chemistry and Biochemistry, Augustana University, 2001 S Summit Ave., Sioux Falls, SD 57197, USA
| | - Eytan A. Klausner
- Department of Pharmaceutical Sciences, South College School of Pharmacy, 400 Goody’s Lane, Knoxville, TN 37922, USA
| | - Jetty Duffy-Matzner
- Department of Chemistry and Biochemistry, Augustana University, 2001 S Summit Ave., Sioux Falls, SD 57197, USA
| | - Weifan Zheng
- Biomanufacturing Research Institute and Technology Enterprise, North Carolina Central University, 1801 Fayetteville St., Durham, NC 27707, USA
- Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| |
Collapse
|
9
|
Gao Z, Ding P, Xu R. KG-Predict: A knowledge graph computational framework for drug repurposing. J Biomed Inform 2022; 132:104133. [PMID: 35840060 PMCID: PMC9595135 DOI: 10.1016/j.jbi.2022.104133] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Revised: 06/18/2022] [Accepted: 07/03/2022] [Indexed: 11/26/2022]
Abstract
The emergence of large-scale phenotypic, genetic, and other multi-model biochemical data has offered unprecedented opportunities for drug discovery including drug repurposing. Various knowledge graph-based methods have been developed to integrate and analyze complex and heterogeneous data sources to find new therapeutic applications for existing drugs. However, existing methods have limitations in modeling and capturing context-sensitive inter-relationships among tens of thousands of biomedical entities. In this paper, we developed KG-Predict: a knowledge graph computational framework for drug repurposing. We first integrated multiple types of entities and relations from various genotypic and phenotypic databases to construct a knowledge graph termed GP-KG. GP-KG was composed of 1,246,726 associations between 61,146 entities. KG-Predict then aggregated the heterogeneous topological and semantic information from GP-KG to learn low-dimensional representations of entities and relations, and further utilized these representations to infer new drug-disease interactions. In cross-validation experiments, KG-Predict achieved high performances [AUROC (the area under receiver operating characteristic) = 0.981, AUPR (the area under precision-recall) = 0.409 and MRR (the mean reciprocal rank) = 0.261], outperforming other state-of-art graph embedding methods. We applied KG-Predict in identifying novel repositioned candidate drugs for Alzheimer's disease (AD) and showed that KG-Predict prioritized both FDA-approved and active clinical trial anti-AD drugs among the top (AUROC = 0.868 and AUPR = 0.364).
Collapse
Affiliation(s)
- Zhenxiang Gao
- Center for Artificial Intelligence in Drug Discovery, School of Medicine, Case Western Reserve University, Cleveland, 44106 OH, USA.
| | - Pingjian Ding
- Center for Artificial Intelligence in Drug Discovery, School of Medicine, Case Western Reserve University, Cleveland, 44106 OH, USA.
| | - Rong Xu
- Center for Artificial Intelligence in Drug Discovery, School of Medicine, Case Western Reserve University, Cleveland, 44106 OH, USA.
| |
Collapse
|
10
|
Sadeghi S, Lu J, Ngom A. An Integrative Heterogeneous Graph Neural Network-Based Method for Multi-Labeled Drug Repurposing. Front Pharmacol 2022; 13:908549. [PMID: 35873597 PMCID: PMC9298882 DOI: 10.3389/fphar.2022.908549] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Accepted: 05/09/2022] [Indexed: 12/01/2022] Open
Abstract
Drug repurposing is the process of discovering new indications (i.e., diseases or conditions) for already approved drugs. Many computational methods have been proposed for predicting new associations between drugs and diseases. In this article, we proposed a new method, called DR-HGNN, an integrative heterogeneous graph neural network-based method for multi-labeled drug repurposing, to discover new indications for existing drugs. For this purpose, we first used the DTINet dataset to construct a heterogeneous drug–protein–disease (DPD) network, which is a graph composed of four types of nodes (drugs, proteins, diseases, and drug side effects) and eight types of edges. Second, we labeled each drug–protein edge, dpi,j = (di, pj), of the DPD network with a set of diseases, {δi,j,1, … , δi,j,k} associated with both di and pj and then devised multi-label ranking approaches which incorporate neural network architecture that operates on the heterogeneous graph-structured data and which leverages both the interaction patterns and the features of drug and protein nodes. We used a derivative of the GraphSAGE algorithm, HinSAGE, on the heterogeneous DPD network to learn low-dimensional vector representation of features of drugs and proteins. Finally, we used the drug–protein network to learn the embeddings of the drug–protein edges and then predict the disease labels that act as bridges between drugs and proteins. The proposed method shows better results than existing methods applied to the DTINet dataset, with an AUC of 0.964.
Collapse
Affiliation(s)
| | - Jianguo Lu
- School of Computer Science, University of Windsor, Windsor, ON, Canada
| | - Alioune Ngom
- School of Computer Science, University of Windsor, Windsor, ON, Canada
| |
Collapse
|
11
|
Liang X, Li J, Fu Y, Qu L, Tan Y, Zhang P. A novel machine learning model based on sparse structure learning with adaptive graph regularization for predicting drug side effects. J Biomed Inform 2022; 132:104131. [PMID: 35840061 DOI: 10.1016/j.jbi.2022.104131] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2022] [Revised: 06/08/2022] [Accepted: 06/29/2022] [Indexed: 10/17/2022]
Abstract
Drug side effects are closely related to the success and failure of drug development. Here we present a novel machine learning method for side effect prediction. The proposed method treats side effect prediction as a multi-label learning problem and uses sparse structure learning to model the relationships between side effects. Additionally, the proposed method adopts the adaptive graph regularization strategy to explore the local structure in drug data and fuse multiple types of drug features. An alternating optimization algorithm is proposed to solve the optimization problem. We collected chemical structures and biological pathway features of drugs as the inputs of our method to predict drug side effects. The results of the cross-validation experiment showed that our method could significantly improve the prediction performance compared to the other state-of-the-art methods. Besides, our model is highly interpretable. It could learn the drug neighbourhood relationships, side effect relationships, and drug features related to side effects. We systematically validated the information extracted by the model with independent data. Some prediction results could also be supported by literature reports. The proposed method could be applied to integrate both chemical and biological data to predict side effects and helps improve drug safety.
Collapse
Affiliation(s)
- Xujun Liang
- NHC Key Laboratory of Cancer Proteomics, Department of Oncology, PR China; National Clinical Research Center for Gerontology, Xiangya Hospital, Central South University, PR China.
| | - Jun Li
- NHC Key Laboratory of Cancer Proteomics, Department of Oncology, PR China
| | - Ying Fu
- NHC Key Laboratory of Cancer Proteomics, Department of Oncology, PR China
| | - Lingzhi Qu
- NHC Key Laboratory of Cancer Proteomics, Department of Oncology, PR China
| | - Yuying Tan
- NHC Key Laboratory of Cancer Proteomics, Department of Oncology, PR China
| | - Pengfei Zhang
- NHC Key Laboratory of Cancer Proteomics, Department of Oncology, PR China; National Clinical Research Center for Gerontology, Xiangya Hospital, Central South University, PR China
| |
Collapse
|