1
|
Bai H, Lu S, Zhang T, Cui H, Nakaguchi T, Xuan P. Graph reasoning method enhanced by relational transformers and knowledge distillation for drug-related side effect prediction. iScience 2024; 27:109571. [PMID: 38799562 PMCID: PMC11126883 DOI: 10.1016/j.isci.2024.109571] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2023] [Revised: 09/29/2023] [Accepted: 03/22/2024] [Indexed: 05/29/2024] Open
Abstract
Identifying the side effects related to drugs is beneficial for reducing the risk of drug development failure and saving the drug development cost. We proposed a graph reasoning method, RKDSP, to fuse the semantics of multiple connection relationships, the local knowledge within each meta-path, the global knowledge among multiple meta-paths, and the attributes of the drug and side effect node pairs. We constructed drug-side effect heterogeneous graphs consisting of the drugs, side effects, and their similarity and association connections. Multiple relational transformers were established to learn node features from diverse meta-path semantic perspectives. A knowledge distillation module was constructed to learn local and global knowledge of multiple meta-paths. Finally, an adaptive convolutional neural network-based strategy was presented to adaptively encode the attributes of each drug-side effect node pair. The experimental results demonstrated that RKDSP outperforms the compared state-of-the-art prediction approaches.
Collapse
Affiliation(s)
- Honglei Bai
- School of Computer Science and Technology, Heilongjiang University, Harbin, China
| | - Siyuan Lu
- School of Computer Science and Technology, Heilongjiang University, Harbin, China
| | - Tiangang Zhang
- School of Computer Science and Technology, Heilongjiang University, Harbin, China
- School of Mathematical Science, Heilongjiang University, Harbin, China
| | - Hui Cui
- Department of Computer Science and Information Technology, La Trobe University, Melbourne, VIC, Australia
| | - Toshiya Nakaguchi
- Center for Frontier Medical Engineering, Chiba University, Chiba, Japan
| | - Ping Xuan
- Department of Computer Science and Technology, Shantou University, Shantou, China
| |
Collapse
|
2
|
Zhang H, Chen J, Liao B, Wu FX, Bi XA. Deep Canonical Correlation Fusion Algorithm Based on Denoising Autoencoder for ASD Diagnosis and Pathogenic Brain Region Identification. Interdiscip Sci 2024; 16:455-468. [PMID: 38573456 DOI: 10.1007/s12539-024-00625-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Revised: 02/22/2024] [Accepted: 02/25/2024] [Indexed: 04/05/2024]
Abstract
Autism Spectrum Disorder (ASD) is defined as a neurodevelopmental condition distinguished by unconventional neural activities. Early intervention is key to managing the progress of ASD, and current research primarily focuses on the use of structural magnetic resonance imaging (sMRI) or resting-state functional magnetic resonance imaging (rs-fMRI) for diagnosis. Moreover, the use of autoencoders for disease classification has not been sufficiently explored. In this study, we introduce a new framework based on autoencoder, the Deep Canonical Correlation Fusion algorithm based on Denoising Autoencoder (DCCF-DAE), which proves to be effective in handling high-dimensional data. This framework involves efficient feature extraction from different types of data with an advanced autoencoder, followed by the fusion of these features through the DCCF model. Then we utilize the fused features for disease classification. DCCF integrates functional and structural data to help accurately diagnose ASD and identify critical Regions of Interest (ROIs) in disease mechanisms. We compare the proposed framework with other methods by the Autism Brain Imaging Data Exchange (ABIDE) database and the results demonstrate its outstanding performance in ASD diagnosis. The superiority of DCCF-DAE highlights its potential as a crucial tool for early ASD diagnosis and monitoring.
Collapse
Affiliation(s)
- Huilian Zhang
- Key Laboratory of Data Science and Intelligence Education, Ministry of Education, Hainan Normal University, Haikou, 571126, China
- College of Mathematics and Statistics, Hainan Normal University, Haikou, 571126, China
| | - Jie Chen
- Key Laboratory of Data Science and Intelligence Education, Ministry of Education, Hainan Normal University, Haikou, 571126, China
- College of Mathematics and Statistics, Hainan Normal University, Haikou, 571126, China
| | - Bo Liao
- Key Laboratory of Data Science and Intelligence Education, Ministry of Education, Hainan Normal University, Haikou, 571126, China
- College of Mathematics and Statistics, Hainan Normal University, Haikou, 571126, China
| | - Fang-Xiang Wu
- Division of Biomedical Engineering, University of Saskatchewan, Saskatoon, S7N5A9, Canada
| | - Xia-An Bi
- Key Laboratory of Data Science and Intelligence Education, Ministry of Education, Hainan Normal University, Haikou, 571126, China.
- College of Mathematics and Statistics, Hainan Normal University, Haikou, 571126, China.
- College of Information Science and Engineering, Hunan Normal University, Changsha, Hunan, 410081, China.
| |
Collapse
|
3
|
Hu F, Zhang W, Huang H, Li W, Li Y, Yin P. A Transferability-Based Method for Evaluating the Protein Representation Learning. IEEE J Biomed Health Inform 2024; 28:3158-3166. [PMID: 38416611 DOI: 10.1109/jbhi.2024.3370680] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/01/2024]
Abstract
Self-supervised pre-trained language models have recently risen as a powerful approach in learning protein representations, showing exceptional effectiveness in various biological tasks, such as drug discovery. Amidst the evolving trend in protein language model development, there is an observable shift towards employing large-scale multimodal and multitask models. However, the predominant reliance on empirical assessments using specific benchmark datasets for evaluating these models raises concerns about the comprehensiveness and efficiency of current evaluation methods. Addressing this gap, our study introduces a novel quantitative approach for estimating the performance of transferring multi-task pre-trained protein representations to downstream tasks. This transferability-based method is designed to quantify the similarities in latent space distributions between pre-trained features and those fine-tuned for downstream tasks. It encompasses a broad spectrum, covering multiple domains and a variety of heterogeneous tasks. To validate this method, we constructed a diverse set of protein-specific pre-training tasks. The resulting protein representations were then evaluated across several downstream biological tasks. Our experimental results demonstrate a robust correlation between the transferability scores obtained using our method and the actual transfer performance observed. This significant correlation highlights the potential of our method as a more comprehensive and efficient tool for evaluating protein representation learning.
Collapse
|
4
|
Zhang Y, Liu C, Liu M, Liu T, Lin H, Huang CB, Ning L. Attention is all you need: utilizing attention in AI-enabled drug discovery. Brief Bioinform 2023; 25:bbad467. [PMID: 38189543 PMCID: PMC10772984 DOI: 10.1093/bib/bbad467] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2023] [Revised: 11/03/2023] [Accepted: 11/25/2023] [Indexed: 01/09/2024] Open
Abstract
Recently, attention mechanism and derived models have gained significant traction in drug development due to their outstanding performance and interpretability in handling complex data structures. This review offers an in-depth exploration of the principles underlying attention-based models and their advantages in drug discovery. We further elaborate on their applications in various aspects of drug development, from molecular screening and target binding to property prediction and molecule generation. Finally, we discuss the current challenges faced in the application of attention mechanisms and Artificial Intelligence technologies, including data quality, model interpretability and computational resource constraints, along with future directions for research. Given the accelerating pace of technological advancement, we believe that attention-based models will have an increasingly prominent role in future drug discovery. We anticipate that these models will usher in revolutionary breakthroughs in the pharmaceutical domain, significantly accelerating the pace of drug development.
Collapse
Affiliation(s)
- Yang Zhang
- Innovative Institute of Chinese Medicine and Pharmacy, Academy for Interdiscipline, Chengdu University of Traditional Chinese Medicine, Chengdu, China
| | - Caiqi Liu
- Department of Gastrointestinal Medical Oncology, Harbin Medical University Cancer Hospital, No.150 Haping Road, Nangang District, Harbin, Heilongjiang 150081, China
- Key Laboratory of Molecular Oncology of Heilongjiang Province, No.150 Haping Road, Nangang District, Harbin, Heilongjiang 150081, China
| | - Mujiexin Liu
- Chongqing Key Laboratory of Sichuan-Chongqing Co-construction for Diagnosis and Treatment of Infectious Diseases Integrated Traditional Chinese and Western Medicine, College of Medical Technology, Chengdu University of Traditional Chinese Medicine, Chengdu, China
| | - Tianyuan Liu
- Graduate School of Science and Technology, University of Tsukuba, Tsukuba, Japan
| | - Hao Lin
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Cheng-Bing Huang
- School of Computer Science and Technology, Aba Teachers University, Aba, China
| | - Lin Ning
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, Zhejiang, China
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu 611844, China
| |
Collapse
|
5
|
Ning G, Sun Y, Ling J, Chen J, He J. BDN-DDI: A bilinear dual-view representation learning framework for drug-drug interaction prediction. Comput Biol Med 2023; 165:107340. [PMID: 37603959 DOI: 10.1016/j.compbiomed.2023.107340] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Revised: 07/23/2023] [Accepted: 08/07/2023] [Indexed: 08/23/2023]
Abstract
Drug-drug interactions (DDIs) refer to the potential effects of two or more drugs interacting with each other when used simultaneously, which may lead to adverse reactions or reduced drug efficacy. Accurate prediction of DDIs is a significant concern in recent years. Currently, the drug chemical substructure-based learning method has substantially improved DDIs prediction. However, we notice that most related works ignore the detailed interaction among atoms when extracting the substructure information of drugs. This problem results in incomplete information extraction and may limit the model's predictive ability. In this work, we proposed a novel framework named BDN-DDI (a bilinear dual-view representation learning framework for drug-drug interaction prediction) to infer potential DDIs. In the proposed framework, the encoder consists of six stacked BDN blocks, each of which extracts the feature representation of drug molecules through a bilinear representation extraction layer. The extracted feature is then used to learn embeddings of drug substructures from the single drug learning layer (intra-layer) and the drug-pair learning layer (inter-layer). Finally, the learned embeddings are fed into a decoder to predict DDI events. Based on our experiments, BDN-DDI has an AUROC value of over 99% for the warm-start task. Additionally, it outperformed the state-of-the-art methods by an average of 3.4% for the cold-start tasks. Finally, our method's effectiveness is further validated by visualizing several case studies.
Collapse
Affiliation(s)
- Guoquan Ning
- School of Computer Science and Technology, Guangdong University of Technology, Guangzhou 510006, China
| | - Yuping Sun
- School of Computer Science and Technology, Guangdong University of Technology, Guangzhou 510006, China.
| | - Jie Ling
- School of Computer Science and Technology, Guangdong University of Technology, Guangzhou 510006, China
| | - Jijia Chen
- School of Computer Science and Technology, Guangdong University of Technology, Guangzhou 510006, China
| | - Jiaxi He
- School of Computer Science and Technology, Guangdong University of Technology, Guangzhou 510006, China
| |
Collapse
|
6
|
Masumshah R, Eslahchi C. DPSP: a multimodal deep learning framework for polypharmacy side effects prediction. BIOINFORMATICS ADVANCES 2023; 3:vbad110. [PMID: 37701676 PMCID: PMC10493180 DOI: 10.1093/bioadv/vbad110] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Revised: 08/01/2023] [Accepted: 08/15/2023] [Indexed: 09/14/2023]
Abstract
Motivation Because unanticipated drug-drug interactions (DDIs) can result in severe bodily harm, identifying the adverse effects of polypharmacy is one of the most important tasks in human health. Over the past few decades, computational methods for predicting the adverse effects of polypharmacy have been developed. Results This article presents DPSP, a framework for predicting polypharmacy side effects based on the construction of novel drug features and the application of a deep neural network to predict DDIs. In the first step, a variety of drug information is evaluated, and a feature extraction method and the Jaccard similarity are used to determine similarities between two drugs. By combining these similarities, a novel feature vector is generated for each drug. In the second step, the method predicts DDIs for specific DDI events using a multimodal framework and drug feature vectors. On three benchmark datasets, the performance of DPSP is measured by comparing its results to those of several well-known methods, such as GNN-DDI, MSTE, MDF-SA-DDI, NNPS, DDIMDL, DNN, DeepDDI, KNN, LR, and RF. DPSP outperforms these classification methods based on a variety of classification metrics. The results indicate that the use of diverse drug information is effective and efficient for identifying DDI adverse effects. Availability and implementation The source code and datasets are available at https://github.com/raziyehmasumshah/DPSP.
Collapse
Affiliation(s)
- Raziyeh Masumshah
- Department of Computer and Data Sciences, Faculty of Mathematical Sciences, Shahid Beheshti University, Tehran 1983969411, Iran
| | - Changiz Eslahchi
- Department of Computer and Data Sciences, Faculty of Mathematical Sciences, Shahid Beheshti University, Tehran 1983969411, Iran
- School of Biological Sciences, Institute for Research in Fundamental Sciences (IPM), Tehran 193955746, Iran
| |
Collapse
|
7
|
Han CD, Wang CC, Huang L, Chen X. MCFF-MTDDI: multi-channel feature fusion for multi-typed drug-drug interaction prediction. Brief Bioinform 2023; 24:bbad215. [PMID: 37291761 DOI: 10.1093/bib/bbad215] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2023] [Revised: 05/11/2023] [Accepted: 05/21/2023] [Indexed: 06/10/2023] Open
Abstract
Adverse drug-drug interactions (DDIs) have become an increasingly serious problem in the medical and health system. Recently, the effective application of deep learning and biomedical knowledge graphs (KGs) have improved the DDI prediction performance of computational models. However, the problems of feature redundancy and KG noise also arise, bringing new challenges for researchers. To overcome these challenges, we proposed a Multi-Channel Feature Fusion model for multi-typed DDI prediction (MCFF-MTDDI). Specifically, we first extracted drug chemical structure features, drug pairs' extra label features, and KG features of drugs. Then, these different features were effectively fused by a multi-channel feature fusion module. Finally, multi-typed DDIs were predicted through the fully connected neural network. To our knowledge, we are the first to integrate the extra label information into KG-based multi-typed DDI prediction; besides, we innovatively proposed a novel KG feature learning method and a State Encoder to obtain target drug pairs' KG-based features which contained more abundant and more key drug-related KG information with less noise; furthermore, a Gated Recurrent Unit-based multi-channel feature fusion module was proposed in an innovative way to yield more comprehensive feature information about drug pairs, effectively alleviating the problem of feature redundancy. We experimented with four datasets in the multi-class and the multi-label prediction tasks to comprehensively evaluate the performance of MCFF-MTDDI for predicting interactions of known-known drugs, known-new drugs and new-new drugs. In addition, we further conducted ablation studies and case studies. All the results fully demonstrated the effectiveness of MCFF-MTDDI.
Collapse
Affiliation(s)
- Chen-Di Han
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, 221116, China
| | - Chun-Chun Wang
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, 221116, China
- School of Science, Jiangnan University, Wuxi, 214122, China
| | - Li Huang
- The Future Laboratory, Tsinghua University, Beijing, 100084, China
| | - Xing Chen
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, 221116, China
- School of Science, Jiangnan University, Wuxi, 214122, China
- Artificial Intelligence Research Institute, China University of Mining and Technology, Xuzhou, 221116, China
| |
Collapse
|
8
|
Zhou L, Wang Y, Peng L, Li Z, Luo X. Identifying potential drug-target interactions based on ensemble deep learning. Front Aging Neurosci 2023; 15:1176400. [PMID: 37396659 PMCID: PMC10309650 DOI: 10.3389/fnagi.2023.1176400] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Accepted: 05/10/2023] [Indexed: 07/04/2023] Open
Abstract
Introduction Drug-target interaction prediction is one important step in drug research and development. Experimental methods are time consuming and laborious. Methods In this study, we developed a novel DTI prediction method called EnGDD by combining initial feature acquisition, dimensional reduction, and DTI classification based on Gradient boosting neural network, Deep neural network, and Deep Forest. Results EnGDD was compared with seven stat-of-the-art DTI prediction methods (BLM-NII, NRLMF, WNNGIP, NEDTP, DTi2Vec, RoFDT, and MolTrans) on the nuclear receptor, GPCR, ion channel, and enzyme datasets under cross validations on drugs, targets, and drug-target pairs, respectively. EnGDD computed the best recall, accuracy, F1-score, AUC, and AUPR under the majority of conditions, demonstrating its powerful DTI identification performance. EnGDD predicted that D00182 and hsa2099, D07871 and hsa1813, DB00599 and hsa2562, D00002 and hsa10935 have a higher interaction probabilities among unknown drug-target pairs and may be potential DTIs on the four datasets, respectively. In particular, D00002 (Nadide) was identified to interact with hsa10935 (Mitochondrial peroxiredoxin3) whose up-regulation might be used to treat neurodegenerative diseases. Finally, EnGDD was used to find possible drug targets for Parkinson's disease and Alzheimer's disease after confirming its DTI identification performance. The results show that D01277, D04641, and D08969 may be applied to the treatment of Parkinson's disease through targeting hsa1813 (dopamine receptor D2) and D02173, D02558, and D03822 may be the clues of treatment for patients with Alzheimer's disease through targeting hsa5743 (prostaglandinendoperoxide synthase 2). The above prediction results need further biomedical validation. Discussion We anticipate that our proposed EnGDD model can help discover potential therapeutic clues for various diseases including neurodegenerative diseases.
Collapse
Affiliation(s)
- Liqian Zhou
- School of Computer Science, Hunan University of Technology, Zhuzhou, China
| | - Yuzhuang Wang
- School of Computer Science, Hunan University of Technology, Zhuzhou, China
| | - Lihong Peng
- School of Computer Science, Hunan University of Technology, Zhuzhou, China
| | - Zejun Li
- School of Computer Science, Hunan Institute of Technology, Hengyang, China
| | - Xueming Luo
- School of Computer Science, Hunan University of Technology, Zhuzhou, China
| |
Collapse
|
9
|
Kha QH, Le VH, Hung TNK, Nguyen NTK, Le NQK. Development and Validation of an Explainable Machine Learning-Based Prediction Model for Drug-Food Interactions from Chemical Structures. SENSORS (BASEL, SWITZERLAND) 2023; 23:3962. [PMID: 37112302 PMCID: PMC10143839 DOI: 10.3390/s23083962] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/29/2023] [Revised: 03/26/2023] [Accepted: 04/12/2023] [Indexed: 06/19/2023]
Abstract
Possible drug-food constituent interactions (DFIs) could change the intended efficiency of particular therapeutics in medical practice. The increasing number of multiple-drug prescriptions leads to the rise of drug-drug interactions (DDIs) and DFIs. These adverse interactions lead to other implications, e.g., the decline in medicament's effect, the withdrawals of various medications, and harmful impacts on the patients' health. However, the importance of DFIs remains underestimated, as the number of studies on these topics is constrained. Recently, scientists have applied artificial intelligence-based models to study DFIs. However, there were still some limitations in data mining, input, and detailed annotations. This study proposed a novel prediction model to address the limitations of previous studies. In detail, we extracted 70,477 food compounds from the FooDB database and 13,580 drugs from the DrugBank database. We extracted 3780 features from each drug-food compound pair. The optimal model was eXtreme Gradient Boosting (XGBoost). We also validated the performance of our model on one external test set from a previous study which contained 1922 DFIs. Finally, we applied our model to recommend whether a drug should or should not be taken with some food compounds based on their interactions. The model can provide highly accurate and clinically relevant recommendations, especially for DFIs that may cause severe adverse events and even death. Our proposed model can contribute to developing more robust predictive models to help patients, under the supervision and consultants of physicians, avoid DFI adverse effects in combining drugs and foods for therapy.
Collapse
Affiliation(s)
- Quang-Hien Kha
- International Ph.D. Program in Medicine, College of Medicine, Taipei Medical University, Taipei 110, Taiwan
- AIBioMed Research Group, Taipei Medical University, Taipei 110, Taiwan
| | - Viet-Huan Le
- International Ph.D. Program in Medicine, College of Medicine, Taipei Medical University, Taipei 110, Taiwan
- AIBioMed Research Group, Taipei Medical University, Taipei 110, Taiwan
- Department of Thoracic Surgery, Khanh Hoa General Hospital, Nha Trang City 65000, Vietnam
| | | | - Ngan Thi Kim Nguyen
- Undergraduate Program of Nutrition Science, National Taiwan Normal University, Taipei 106, Taiwan
| | - Nguyen Quoc Khanh Le
- AIBioMed Research Group, Taipei Medical University, Taipei 110, Taiwan
- Professional Master Program in Artificial Intelligence in Medicine, College of Medicine, Taipei Medical University, Taipei 110, Taiwan
- Research Center for Artificial Intelligence in Medicine, Taipei Medical University, Taipei 110, Taiwan
- Translational Imaging Research Center, Taipei Medical University Hospital, Taipei 110, Taiwan
| |
Collapse
|