1
|
Li X, Yang Q, Luo G, Xu L, Dong W, Wang W, Dong S, Wang K, Xuan P, Gao X. SAGDTI: self-attention and graph neural network with multiple information representations for the prediction of drug-target interactions. BIOINFORMATICS ADVANCES 2023; 3:vbad116. [PMID: 38282612 PMCID: PMC10818136 DOI: 10.1093/bioadv/vbad116] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Revised: 07/31/2023] [Accepted: 08/24/2023] [Indexed: 01/30/2024]
Abstract
Motivation Accurate identification of target proteins that interact with drugs is a vital step in silico, which can significantly foster the development of drug repurposing and drug discovery. In recent years, numerous deep learning-based methods have been introduced to treat drug-target interaction (DTI) prediction as a classification task. The output of this task is binary identification suggesting the absence or presence of interactions. However, existing studies often (i) neglect the unique molecular attributes when embedding drugs and proteins, and (ii) determine the interaction of drug-target pairs without considering biological interaction information. Results In this study, we propose an end-to-end attention-derived method based on the self-attention mechanism and graph neural network, termed SAGDTI. The aim of this method is to overcome the aforementioned drawbacks in the identification of DTI. SAGDTI is the first method to sufficiently consider the unique molecular attribute representations for both drugs and targets in the input form of the SMILES sequences and three-dimensional structure graphs. In addition, our method aggregates the feature attributes of biological information between drugs and targets through multi-scale topologies and diverse connections. Experimental results illustrate that SAGDTI outperforms existing prediction models, which benefit from the unique molecular attributes embedded by atom-level attention and biological interaction information representation aggregated by node-level attention. Moreover, a case study on severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) shows that our model is a powerful tool for identifying DTIs in real life. Availability and implementation The data and codes underlying this article are available in Github at https://github.com/lixiaokun2020/SAGDTI.
Collapse
Affiliation(s)
- Xiaokun Li
- School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China
- Postdoctoral Program of Heilongjiang Hengxun Technology Co., Ltd., Harbin 150090, China
| | - Qiang Yang
- School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China
- Postdoctoral Program of Heilongjiang Hengxun Technology Co., Ltd., Harbin 150090, China
| | - Gongning Luo
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
| | - Long Xu
- School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China
- Postdoctoral Program of Heilongjiang Hengxun Technology Co., Ltd., Harbin 150090, China
| | - Weihe Dong
- Postdoctoral Program of Heilongjiang Hengxun Technology Co., Ltd., Harbin 150090, China
- College of Computer and Control Engineering, Northeast Forestry University, Harbin 150040, China
| | - Wei Wang
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
| | - Suyu Dong
- College of Computer and Control Engineering, Northeast Forestry University, Harbin 150040, China
| | - Kuanquan Wang
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China
| | - Ping Xuan
- School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China
- Department of Computer Science, School of Engineering, Shantou University, Shantou 515063, China
| | - Xin Gao
- Computer, Electrical and Mathematical Sciences & Engineering Division, King Abdullah University of Science and Technology, 4700 KAUST, Thuwal 23955, Saudi Arabia
| |
Collapse
|
2
|
Binatlı OC, Gönen M. MOKPE: drug-target interaction prediction via manifold optimization based kernel preserving embedding. BMC Bioinformatics 2023; 24:276. [PMID: 37407927 DOI: 10.1186/s12859-023-05401-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2023] [Accepted: 06/25/2023] [Indexed: 07/07/2023] Open
Abstract
BACKGROUND In many applications of bioinformatics, data stem from distinct heterogeneous sources. One of the well-known examples is the identification of drug-target interactions (DTIs), which is of significant importance in drug discovery. In this paper, we propose a novel framework, manifold optimization based kernel preserving embedding (MOKPE), to efficiently solve the problem of modeling heterogeneous data. Our model projects heterogeneous drug and target data into a unified embedding space by preserving drug-target interactions and drug-drug, target-target similarities simultaneously. RESULTS We performed ten replications of ten-fold cross validation on four different drug-target interaction network data sets for predicting DTIs for previously unseen drugs. The classification evaluation metrics showed better or comparable performance compared to previous similarity-based state-of-the-art methods. We also evaluated MOKPE on predicting unknown DTIs of a given network. Our implementation of the proposed algorithm in R together with the scripts that replicate the reported experiments is publicly available at https://github.com/ocbinatli/mokpe .
Collapse
Affiliation(s)
- Oğuz C Binatlı
- Graduate School of Sciences and Engineering, Koç University, 34450, Istanbul, Turkey
| | - Mehmet Gönen
- Department of Industrial Engineering, College of Engineering, Koç University, 34450, Istanbul, Turkey.
- School of Medicine, Koç University, 34450, Istanbul, Turkey.
| |
Collapse
|
3
|
Wen J, Gan H, Yang Z, Zhou R, Zhao J, Ye Z. Mutual-DTI: A mutual interaction feature-based neural network for drug-target protein interaction prediction. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023; 20:10610-10625. [PMID: 37322951 DOI: 10.3934/mbe.2023469] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
The prediction of drug-target protein interaction (DTI) is a crucial task in the development of new drugs in modern medicine. Accurately identifying DTI through computer simulations can significantly reduce development time and costs. In recent years, many sequence-based DTI prediction methods have been proposed, and introducing attention mechanisms has improved their forecasting performance. However, these methods have some shortcomings. For example, inappropriate dataset partitioning during data preprocessing can lead to overly optimistic prediction results. Additionally, only single non-covalent intermolecular interactions are considered in the DTI simulation, ignoring the complex interactions between their internal atoms and amino acids. In this paper, we propose a network model called Mutual-DTI that predicts DTI based on the interaction properties of sequences and a Transformer model. We use multi-head attention to extract the long-distance interdependent features of the sequence and introduce a module to extract the sequence's mutual interaction features in mining complex reaction processes of atoms and amino acids. We evaluate the experiments on two benchmark datasets, and the results show that Mutual-DTI outperforms the latest baseline significantly. In addition, we conduct ablation experiments on a label-inversion dataset that is split more rigorously. The results show that there is a significant improvement in the evaluation metrics after introducing the extracted sequence interaction feature module. This suggests that Mutual-DTI may contribute to modern medical drug development research. The experimental results show the effectiveness of our approach. The code for Mutual-DTI can be downloaded from https://github.com/a610lab/Mutual-DTI.
Collapse
Affiliation(s)
- Jiahui Wen
- School of Computer Science, Hubei University of Technology, Wuhan 430068, China
| | - Haitao Gan
- School of Computer Science, Hubei University of Technology, Wuhan 430068, China
- State Key Laboratory of Biocatalysis and Enzyme Engineering, Hubei University, Wuhan 430062, China
| | - Zhi Yang
- School of Computer Science, Hubei University of Technology, Wuhan 430068, China
- State Key Laboratory of Biocatalysis and Enzyme Engineering, Hubei University, Wuhan 430062, China
| | - Ran Zhou
- School of Computer Science, Hubei University of Technology, Wuhan 430068, China
| | - Jing Zhao
- State Key Laboratory of Biocatalysis and Enzyme Engineering, Hubei University, Wuhan 430062, China
| | - Zhiwei Ye
- School of Computer Science, Hubei University of Technology, Wuhan 430068, China
| |
Collapse
|
4
|
Iwabuchi S, Tsukahara T, Okayama T, Kitabatake M, Motobayashi H, Shichino S, Imafuku T, Yamaji K, Miyamoto K, Tamura S, Ueha S, Ito T, Murata SI, Kondo T, Ikeo K, Suzuki Y, Matsushima K, Kohara M, Torigoe T, Yamaue H, Hashimoto S. B cell receptor repertoire analysis from autopsy samples of COVID-19 patients. Front Immunol 2023; 14:1034978. [PMID: 36911681 PMCID: PMC9996338 DOI: 10.3389/fimmu.2023.1034978] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Accepted: 02/10/2023] [Indexed: 02/25/2023] Open
Abstract
Neutralizing antibodies against the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) are being developed world over. We investigated the possibility of producing artificial antibodies from the formalin fixation and paraffin-embedding (FFPE) lung lobes of a patient who died by coronavirus disease 2019 (COVID-19). The B-cell receptors repertoire in the lung tissue where SARS-CoV-2 was detected were considered to have highly sensitive virus-neutralizing activity, and artificial antibodies were produced by combining the most frequently detected heavy and light chains. Some neutralizing effects against the SARS-CoV-2 were observed, and mixing two different artificial antibodies had a higher tendency to suppress the virus. The neutralizing effects were similar to the immunoglobulin G obtained from healthy donors who had received a COVID-19 mRNA vaccine. Therefore, the use of FFPE lung tissue, which preserves the condition of direct virus sensitization, to generate artificial antibodies may be useful against future unknown infectious diseases.
Collapse
Affiliation(s)
- Sadahiro Iwabuchi
- Department of Molecular Pathophysiology, Institute of Advanced Medicine, Wakayama Medical University, Wakayama, Japan
| | - Tomohide Tsukahara
- Department of Pathology, Sapporo Medical University School of Medicine, Hokkaido, Japan
| | - Toshitugu Okayama
- Laboratory of DNA Data Analysis, National Institute of Genetics, Shizuoka, Japan
| | | | - Hideki Motobayashi
- Second Department of Surgery, Wakayama Medical University, Wakayama, Japan
| | - Shigeyuki Shichino
- Division of Molecular Regulation of Inflammatory and Immune Disease, Research Institute for Biomedical Sciences, Tokyo University of Science, Chiba, Japan
| | - Tadashi Imafuku
- Department of Molecular Pathophysiology, Institute of Advanced Medicine, Wakayama Medical University, Wakayama, Japan
| | - Kenzaburo Yamaji
- Department of Diseases and Infection, Tokyo Metropolitan Institute of Medical Science, Tokyo, Japan
| | - Kyohei Miyamoto
- Department of Emergency and Critical Care Medicine, Wakayama Medical University, Wakayama, Japan
| | - Shinobu Tamura
- Department of Emergency and Critical Care Medicine, Wakayama Medical University, Wakayama, Japan
| | - Satoshi Ueha
- Division of Molecular Regulation of Inflammatory and Immune Disease, Research Institute for Biomedical Sciences, Tokyo University of Science, Chiba, Japan
| | - Toshihiro Ito
- Department of Immunology, Nara Medical University, Nara, Japan
| | - Shin-Ichi Murata
- Departments of Human Pathology, Wakayama Medical University, Wakayama, Japan
| | - Toshikazu Kondo
- Department of Forensic Medicine, Wakayama Medical University, Wakayama, Japan
| | - Kazuho Ikeo
- Laboratory of DNA Data Analysis, National Institute of Genetics, Shizuoka, Japan
| | - Yutaka Suzuki
- Department of Computational Biology, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, Japan
| | - Kouji Matsushima
- Division of Molecular Regulation of Inflammatory and Immune Disease, Research Institute for Biomedical Sciences, Tokyo University of Science, Chiba, Japan
| | - Michinori Kohara
- Department of Diseases and Infection, Tokyo Metropolitan Institute of Medical Science, Tokyo, Japan
| | - Toshihiko Torigoe
- Department of Pathology, Sapporo Medical University School of Medicine, Hokkaido, Japan
| | - Hiroki Yamaue
- Second Department of Surgery, Wakayama Medical University, Wakayama, Japan.,Departments of Cancer Immunology, Wakayama Medical University, Wakayama, Japan
| | - Shinichi Hashimoto
- Department of Molecular Pathophysiology, Institute of Advanced Medicine, Wakayama Medical University, Wakayama, Japan
| |
Collapse
|
5
|
Deng L, Fan Z, Xiao X, Liu H, Zhang J. Dual-Channel Heterogeneous Graph Neural Network for Predicting microRNA-Mediated Drug Sensitivity. J Chem Inf Model 2022; 62:5929-5937. [PMID: 36413746 DOI: 10.1021/acs.jcim.2c01060] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Many studies have confirmed that microRNAs (miRNAs) are mediated in the sensitivity of tumor cells to anticancer drugs. MiRNAs are emerging as a type of promising therapeutic targets to overcome drug resistance. However, there is limited attention paid to the computational prediction of the associations between miRNAs and drug sensitivity. In this work, we proposed a heterogeneous network-based representation learning method to predict miRNA-drug sensitivity associations (DGNNMDA). An miRNA-drug heterogeneous network was constructed by integrating miRNA similarity network, drug similarity network, and experimentally validated miRNA-drug sensitivity associations. Next, we developed a dual-channel heterogeneous graph neural network model to perform feature propagation among the homogeneous and heterogeneous nodes so that our method can learn expressive representations for miRNA and drug nodes. On two benchmark datasets, our method outperformed other seven competitive methods. We also verified the effectiveness of the feature propagations on homogeneous and heterogeneous nodes. Moreover, we have conducted two case studies to verify the reliability of our methods and tried to reveal the regulatory mechanism of miRNAs mediated in drug sensitivity. The source code and datasets are freely available at https://github.com/19990915fzy/DGNNMDA.
Collapse
Affiliation(s)
- Lei Deng
- School of Computer Science and Engineering, Central South University, Changsha410083, China
| | - Ziyu Fan
- School of Computer Science and Engineering, Central South University, Changsha410083, China
| | - Xiaojun Xiao
- Software School, Xinjiang University, Urumqi830091, China
| | - Hui Liu
- School of Computer Science and Technology, Nanjing Tech University, Nanjing211816, China
| | - Jiaxuan Zhang
- Department of Electrical and Computer Engineering, University of California, San Diego, San Diego, California92161, United States
| |
Collapse
|
6
|
Drug-Disease Association Prediction Using Heterogeneous Networks for Computational Drug Repositioning. Biomolecules 2022; 12:biom12101497. [PMID: 36291706 PMCID: PMC9599692 DOI: 10.3390/biom12101497] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Revised: 10/10/2022] [Accepted: 10/13/2022] [Indexed: 11/18/2022] Open
Abstract
Drug repositioning, which involves the identification of new therapeutic indications for approved drugs, considerably reduces the time and cost of developing new drugs. Recent computational drug repositioning methods use heterogeneous networks to identify drug–disease associations. This review reveals existing network-based approaches for predicting drug–disease associations in three major categories: graph mining, matrix factorization or completion, and deep learning. We selected eleven methods from the three categories to compare their predictive performances. The experiment was conducted using two uniform datasets on the drug and disease sides, separately. We constructed heterogeneous networks using drug–drug similarities based on chemical structures and ATC codes, ontology-based disease–disease similarities, and drug–disease associations. An improved evaluation metric was used to reflect data imbalance as positive associations are typically sparse. The prediction results demonstrated that methods in the graph mining and matrix factorization or completion categories performed well in the overall assessment. Furthermore, prediction on the drug side had higher accuracy than on the disease side. Selecting and integrating informative drug features in drug–drug similarity measurement are crucial for improving disease-side prediction.
Collapse
|
7
|
Xuan P, Zhang X, Zhang Y, Hu K, Nakaguchi T, Zhang T. multi-type neighbors enhanced global topology and pairwise attribute learning for drug-protein interaction prediction. Brief Bioinform 2022; 23:6581435. [PMID: 35514190 DOI: 10.1093/bib/bbac120] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2022] [Revised: 03/07/2022] [Accepted: 03/15/2022] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Accurate identification of proteins interacted with drugs helps reduce the time and cost of drug development. Most of previous methods focused on integrating multisource data about drugs and proteins for predicting drug-target interactions (DTIs). There are both similarity connection and interaction connection between two drugs, and these connections reflect their relationships from different perspectives. Similarly, two proteins have various connections from multiple perspectives. However, most of previous methods failed to deeply integrate these connections. In addition, multiple drug-protein heterogeneous networks can be constructed based on multiple kinds of connections. The diverse topological structures of these networks are still not exploited completely. RESULTS We propose a novel model to extract and integrate multi-type neighbor topology information, diverse similarities and interactions related to drugs and proteins. Firstly, multiple drug-protein heterogeneous networks are constructed according to multiple kinds of connections among drugs and those among proteins. The multi-type neighbor node sequences of a drug node (or a protein node) are formed by random walks on each network and they reflect the hidden neighbor topological structure of the node. Secondly, a module based on graph neural network (GNN) is proposed to learn the multi-type neighbor topologies of each node. We propose attention mechanisms at neighbor node level and at neighbor type level to learn more informative neighbor nodes and neighbor types. A network-level attention is also designed to enhance the context dependency among multiple neighbor topologies of a pair of drug and protein nodes. Finally, the attribute embedding of the drug-protein pair is formulated by a proposed embedding strategy, and the embedding covers the similarities and interactions about the pair. A module based on three-dimensional convolutional neural networks (CNN) is constructed to deeply integrate pairwise attributes. Extensive experiments have been performed and the results indicate GCDTI outperforms several state-of-the-art prediction methods. The recall rate estimation over the top-ranked candidates and case studies on 5 drugs further demonstrate GCDTI's ability in discovering potential drug-protein interactions.
Collapse
Affiliation(s)
- Ping Xuan
- School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China.,School of Computer Science, Shaanxi Normal University, Xi'an 710062, China
| | - Xiaowen Zhang
- School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China
| | - Yu Zhang
- School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China
| | - Kaimiao Hu
- School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China
| | - Toshiya Nakaguchi
- Center for Frontier Medical Engineering, Chiba University, Chiba 2638522, Japan
| | - Tiangang Zhang
- School of Mathematical Science, Heilongjiang University, Harbin 150080, China
| |
Collapse
|
8
|
Shao K, Zhang Y, Wen Y, Zhang Z, He S, Bo X. DTI-HETA: prediction of drug-target interactions based on GCN and GAT on heterogeneous graph. Brief Bioinform 2022; 23:6563180. [PMID: 35380622 DOI: 10.1093/bib/bbac109] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2021] [Revised: 02/14/2022] [Accepted: 03/03/2022] [Indexed: 12/19/2022] Open
Abstract
Drug-target interaction (DTI) prediction plays an important role in drug repositioning, drug discovery and drug design. However, due to the large size of the chemical and genomic spaces and the complex interactions between drugs and targets, experimental identification of DTIs is costly and time-consuming. In recent years, the emerging graph neural network (GNN) has been applied to DTI prediction because DTIs can be represented effectively using graphs. However, some of these methods are only based on homogeneous graphs, and some consist of two decoupled steps that cannot be trained jointly. To further explore GNN-based DTI prediction by integrating heterogeneous graph information, this study regards DTI prediction as a link prediction problem and proposes an end-to-end model based on HETerogeneous graph with Attention mechanism (DTI-HETA). In this model, a heterogeneous graph is first constructed based on the drug-drug and target-target similarity matrices and the DTI matrix. Then, the graph convolutional neural network is utilized to obtain the embedded representation of the drugs and targets. To highlight the contribution of different neighborhood nodes to the central node in aggregating the graph convolution information, a graph attention mechanism is introduced into the node embedding process. Afterward, an inner product decoder is applied to predict DTIs. To evaluate the performance of DTI-HETA, experiments are conducted on two datasets. The experimental results show that our model is superior to the state-of-the-art methods. Also, the identification of novel DTIs indicates that DTI-HETA can serve as a powerful tool for integrating heterogeneous graph information to predict DTIs.
Collapse
Affiliation(s)
| | | | - Yuqi Wen
- Beijing Institute of Radiation Medicine, Beijing, China
| | | | - Song He
- Beijing Institute of Radiation Medicine, Beijing, China
| | - Xiaochen Bo
- Beijing Institute of Radiation Medicine, Beijing, China
| |
Collapse
|
9
|
Du BX, Qin Y, Jiang YF, Xu Y, Yiu SM, Yu H, Shi JY. Compound–protein interaction prediction by deep learning: Databases, descriptors and models. Drug Discov Today 2022; 27:1350-1366. [DOI: 10.1016/j.drudis.2022.02.023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2021] [Revised: 11/19/2021] [Accepted: 02/28/2022] [Indexed: 11/24/2022]
|