1
|
Ouyang D, Liang Y, Wang J, Li L, Ai N, Feng J, Lu S, Liao S, Liu X, Xie S. HGCLAMIR: Hypergraph contrastive learning with attention mechanism and integrated multi-view representation for predicting miRNA-disease associations. PLoS Comput Biol 2024; 20:e1011927. [PMID: 38652712 PMCID: PMC11037542 DOI: 10.1371/journal.pcbi.1011927] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Accepted: 02/19/2024] [Indexed: 04/25/2024] Open
Abstract
Existing studies have shown that the abnormal expression of microRNAs (miRNAs) usually leads to the occurrence and development of human diseases. Identifying disease-related miRNAs contributes to studying the pathogenesis of diseases at the molecular level. As traditional biological experiments are time-consuming and expensive, computational methods have been used as an effective complement to infer the potential associations between miRNAs and diseases. However, most of the existing computational methods still face three main challenges: (i) learning of high-order relations; (ii) insufficient representation learning ability; (iii) importance learning and integration of multi-view embedding representation. To this end, we developed a HyperGraph Contrastive Learning with view-aware Attention Mechanism and Integrated multi-view Representation (HGCLAMIR) model to discover potential miRNA-disease associations. First, hypergraph convolutional network (HGCN) was utilized to capture high-order complex relations from hypergraphs related to miRNAs and diseases. Then, we combined HGCN with contrastive learning to improve and enhance the embedded representation learning ability of HGCN. Moreover, we introduced view-aware attention mechanism to adaptively weight the embedded representations of different views, thereby obtaining the importance of multi-view latent representations. Next, we innovatively proposed integrated representation learning to integrate the embedded representation information of multiple views for obtaining more reasonable embedding information. Finally, the integrated representation information was fed into a neural network-based matrix completion method to perform miRNA-disease association prediction. Experimental results on the cross-validation set and independent test set indicated that HGCLAMIR can achieve better prediction performance than other baseline models. Furthermore, the results of case studies and enrichment analysis further demonstrated the accuracy of HGCLAMIR and unconfirmed potential associations had biological significance.
Collapse
Affiliation(s)
- Dong Ouyang
- Peng Cheng Laboratory, Shenzhen, China
- School of Biomedical Engineering, Guangdong Medical University, Dongguan, China
| | - Yong Liang
- Peng Cheng Laboratory, Shenzhen, China
- Pazhou Laboratory (Huangpu), Guangzhou, China
| | - Jinfeng Wang
- College of Mathematics and Informatics, South China Agricultural University, Guangzhou, China
| | - Le Li
- School of Computer Science and Engineering, Faculty of Innovation Engineering, Macau University of Science and Technology, Macau, China
| | - Ning Ai
- School of Computer Science and Engineering, Faculty of Innovation Engineering, Macau University of Science and Technology, Macau, China
| | - Junning Feng
- School of Computer Science and Engineering, Faculty of Innovation Engineering, Macau University of Science and Technology, Macau, China
| | - Shanghui Lu
- School of Computer Science and Engineering, Faculty of Innovation Engineering, Macau University of Science and Technology, Macau, China
| | - Shuilin Liao
- School of Computer Science and Engineering, Faculty of Innovation Engineering, Macau University of Science and Technology, Macau, China
| | - Xiaoying Liu
- Computer Engineering Technical College, Guangdong Polytechnic of Science and Technology, Zhuhai, China
| | - Shengli Xie
- Guangdong-HongKong-Macao Joint Laboratory for Smart Discrete Manufacturing, Guangzhou, China
| |
Collapse
|
2
|
Dong B, Sun W, Xu D, Wang G, Zhang T. MDformer: A transformer-based method for predicting miRNA-Disease associations using multi-source feature fusion and maximal meta-path instances encoding. Comput Biol Med 2023; 167:107585. [PMID: 37890424 DOI: 10.1016/j.compbiomed.2023.107585] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Revised: 09/15/2023] [Accepted: 10/15/2023] [Indexed: 10/29/2023]
Abstract
There is a growing body of evidence suggesting that microRNAs (miRNAs), small biological molecules, play a crucial role in the diagnosis, treatment, and prognostic assessment of diseases. However, it is often inefficient to verify the association between miRNAs and diseases (MDA) through traditional experimental methods. Based on this situation, researchers have proposed various computational-based methods, but the existing methods often have many drawbacks in terms of predictive effectiveness and accuracy. Therefore, in order to improve the prediction performance of computational methods, we propose a transformer-based prediction model (MDformer) for multi-source feature information. Specifically, first, we consider multiple features of miRNAs and diseases from the molecular biology perspective and utilize them in a fusion. Then high-quality node feature embeddings were generated using a feature encoder based on the transformer architecture and meta-path instances. Finally, a deep neural network was built for MDA prediction. To evaluate the performance of our model, we performed multiple 5-fold cross-validations as well as comparison experiments on HMDD v3.2 and HMDD v2.0 databases, and the experimental results of the average ROC area under the curve (AUC) were higher than the comparative methods for both databases at 0.9506 and 0.9369. We conducted case studies on five highly lethal cancers (breast, lung, colorectal, gastric, and hepatocellular cancers), and the first 30 predictions for these five diseases achieved 97.3% accuracy. In conclusion, MDformer is a reliable and scientifically sound tool that can be used to accurately predict MDA. In addition, the source code is available at https://github.com/Linda908/MDformer.
Collapse
Affiliation(s)
- Benzhi Dong
- College of Computer and Control Engineering, Northeast Forestry University, Harbin, 150040, China
| | - Weidong Sun
- College of Computer and Control Engineering, Northeast Forestry University, Harbin, 150040, China
| | - Dali Xu
- College of Computer and Control Engineering, Northeast Forestry University, Harbin, 150040, China
| | - Guohua Wang
- College of Computer and Control Engineering, Northeast Forestry University, Harbin, 150040, China.
| | - Tianjiao Zhang
- College of Computer and Control Engineering, Northeast Forestry University, Harbin, 150040, China.
| |
Collapse
|
3
|
Dong B, Sun W, Xu D, Wang G, Zhang T. DAEMDA: A Method with Dual-Channel Attention Encoding for miRNA-Disease Association Prediction. Biomolecules 2023; 13:1514. [PMID: 37892196 PMCID: PMC10604960 DOI: 10.3390/biom13101514] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Accepted: 10/08/2023] [Indexed: 10/29/2023] Open
Abstract
A growing number of studies have shown that aberrant microRNA (miRNA) expression is closely associated with the evolution and development of various complex human diseases. These key biomarkers' identification and observation are significant for gaining a deeper understanding of disease pathogenesis and therapeutic mechanisms. Consequently, pinpointing potential miRNA-disease associations (MDA) has become a prominent bioinformatics subject, encouraging several new computational methods given the advances in graph neural networks (GNN). Nevertheless, these existing methods commonly fail to exploit the network nodes' global feature information, leaving the generation of high-quality embedding representations using graph properties as a critical unsolved issue. Addressing these challenges, we introduce the DAEMDA, a computational method designed to optimize the current models' efficacy. First, we construct similarity and heterogeneous networks involving miRNAs and diseases, relying on experimentally corroborated miRNA-disease association data and analogous information. Then, a newly-fashioned parallel dual-channel feature encoder, designed to better comprehend the global information within the heterogeneous network and generate varying embedding representations, follows this. Ultimately, employing a neural network classifier, we merge the dual-channel embedding representations and undertake association predictions between miRNA and disease nodes. The experimental results of five-fold cross-validation and case studies of major diseases based on the HMDD v3.2 database show that this method can generate high-quality embedded representations and effectively improve the accuracy of MDA prediction.
Collapse
Affiliation(s)
| | | | | | - Guohua Wang
- College of Computer and Control Engineering, Northeast Forestry University, Harbin 150040, China; (B.D.)
| | - Tianjiao Zhang
- College of Computer and Control Engineering, Northeast Forestry University, Harbin 150040, China; (B.D.)
| |
Collapse
|
4
|
Chen M, Deng Y, Li Z, Ye Y, He Z. KATZNCP: a miRNA-disease association prediction model integrating KATZ algorithm and network consistency projection. BMC Bioinformatics 2023; 24:229. [PMID: 37268893 DOI: 10.1186/s12859-023-05365-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2022] [Accepted: 05/26/2023] [Indexed: 06/04/2023] Open
Abstract
BACKGROUND Clinical studies have shown that miRNAs are closely related to human health. The study of potential associations between miRNAs and diseases will contribute to a profound understanding of the mechanism of disease development, as well as human disease prevention and treatment. MiRNA-disease associations predicted by computational methods are the best complement to biological experiments. RESULTS In this research, a federated computational model KATZNCP was proposed on the basis of the KATZ algorithm and network consistency projection to infer the potential miRNA-disease associations. In KATZNCP, a heterogeneous network was initially constructed by integrating the known miRNA-disease association, integrated miRNA similarities, and integrated disease similarities; then, the KATZ algorithm was implemented in the heterogeneous network to obtain the estimated miRNA-disease prediction scores. Finally, the precise scores were obtained by the network consistency projection method as the final prediction results. KATZNCP achieved the reliable predictive performance in leave-one-out cross-validation (LOOCV) with an AUC value of 0.9325, which was better than the state-of-the-art comparable algorithms. Furthermore, case studies of lung neoplasms and esophageal neoplasms demonstrated the excellent predictive performance of KATZNCP. CONCLUSION A new computational model KATZNCP was proposed for predicting potential miRNA-drug associations based on KATZ and network consistency projections, which can effectively predict the potential miRNA-disease interactions. Therefore, KATZNCP can be used to provide guidance for future experiments.
Collapse
Affiliation(s)
- Min Chen
- School of Computer Science and Technology, Hunan Institute of Technology, Hengyang, 421002, China
| | - Yingwei Deng
- School of Computer Science and Technology, Hunan Institute of Technology, Hengyang, 421002, China.
| | - Zejun Li
- School of Computer Science and Technology, Hunan Institute of Technology, Hengyang, 421002, China
| | - Yifan Ye
- School of Computer Science and Technology, Hunan Institute of Technology, Hengyang, 421002, China
| | - Ziyi He
- School of Computer Science and Technology, Hunan Institute of Technology, Hengyang, 421002, China
| |
Collapse
|
5
|
Li Z, Zhang Y, Bai Y, Xie X, Zeng L. IMC-MDA: Prediction of miRNA-disease association based on induction matrix completion. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023; 20:10659-10674. [PMID: 37322953 DOI: 10.3934/mbe.2023471] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
To comprehend the etiology and pathogenesis of many illnesses, it is essential to identify disease-associated microRNAs (miRNAs). However, there are a number of challenges with current computational approaches, such as the lack of "negative samples", that is, confirmed irrelevant miRNA-disease pairs, and the poor performance in terms of predicting miRNAs related with "isolated diseases", i.e. illnesses with no known associated miRNAs, which presents the need for novel computational methods. In this study, for the purpose of predicting the connection between disease and miRNA, an inductive matrix completion model was designed, referred to as IMC-MDA. In the model of IMC-MDA, for each miRNA-disease pair, the predicted marks are calculated by combining the known miRNA-disease connection with the integrated disease similarities and miRNA similarities. Based on LOOCV, IMC-MDA had an AUC of 0.8034, which shows better performance than previous methods. Furthermore, experiments have validated the prediction of disease-related miRNAs for three major human diseases: colon cancer, kidney cancer, and lung cancer.
Collapse
Affiliation(s)
- Zejun Li
- School of Computer and Information Science, Hunan Institute of Technology, Hengyang 412002, China
| | - Yuxiang Zhang
- School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou, Henan, 450001, China
| | - Yuting Bai
- College of Information Science and Engineering, Hunan University, Changsha 410082, Hunan, China
| | - Xiaohui Xie
- School of Computer and Information Science, Hunan Institute of Technology, Hengyang 412002, China
| | - Lijun Zeng
- School of Computer and Information Science, Hunan Institute of Technology, Hengyang 412002, China
| |
Collapse
|
6
|
He Y, Yang Y, Su X, Zhao B, Xiong S, Hu L. Incorporating higher order network structures to improve miRNA-disease association prediction based on functional modularity. Brief Bioinform 2023; 24:6958503. [PMID: 36562706 DOI: 10.1093/bib/bbac562] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Revised: 10/29/2022] [Accepted: 11/19/2022] [Indexed: 12/24/2022] Open
Abstract
As microRNAs (miRNAs) are involved in many essential biological processes, their abnormal expressions can serve as biomarkers and prognostic indicators to prevent the development of complex diseases, thus providing accurate early detection and prognostic evaluation. Although a number of computational methods have been proposed to predict miRNA-disease associations (MDAs) for further experimental verification, their performance is limited primarily by the inadequacy of exploiting lower order patterns characterizing known MDAs to identify missing ones from MDA networks. Hence, in this work, we present a novel prediction model, namely HiSCMDA, by incorporating higher order network structures for improved performance of MDA prediction. To this end, HiSCMDA first integrates miRNA similarity network, disease similarity network and MDA network to preserve the advantages of all these networks. After that, it identifies overlapping functional modules from the integrated network by predefining several higher order connectivity patterns of interest. Last, a path-based scoring function is designed to infer potential MDAs based on network paths across related functional modules. HiSCMDA yields the best performance across all datasets and evaluation metrics in the cross-validation and independent validation experiments. Furthermore, in the case studies, 49 and 50 out of the top 50 miRNAs, respectively, predicted for colon neoplasms and lung neoplasms have been validated by well-established databases. Experimental results show that rich higher order organizational structures exposed in the MDA network gain new insight into the MDA prediction based on higher order connectivity patterns.
Collapse
Affiliation(s)
- Yizhou He
- School of Computer Science and Artificial Intelligence, Wuhan University of Technology, Wuhan, 430070, China
| | - Yue Yang
- School of Computer Science and Artificial Intelligence, Wuhan University of Technology, Wuhan, 430070, China
| | - Xiaorui Su
- Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi, 830011, China
| | - Bowei Zhao
- Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi, 830011, China
| | - Shengwu Xiong
- School of Computer Science and Artificial Intelligence, Wuhan University of Technology, Wuhan, 430070, China
| | - Lun Hu
- Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi, 830011, China
| |
Collapse
|
7
|
Feng H, Jin D, Li J, Li Y, Zou Q, Liu T. Matrix reconstruction with reliable neighbors for predicting potential MiRNA-disease associations. Brief Bioinform 2023; 24:6960615. [PMID: 36567252 DOI: 10.1093/bib/bbac571] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Revised: 10/16/2022] [Accepted: 11/23/2022] [Indexed: 12/27/2022] Open
Abstract
Numerous experimental studies have indicated that alteration and dysregulation in mircroRNAs (miRNAs) are associated with serious diseases. Identifying disease-related miRNAs is therefore an essential and challenging task in bioinformatics research. Computational methods are an efficient and economical alternative to conventional biomedical studies and can reveal underlying miRNA-disease associations for subsequent experimental confirmation with reasonable confidence. Despite the success of existing computational approaches, most of them only rely on the known miRNA-disease associations to predict associations without adding other data to increase the prediction accuracy, and they are affected by issues of data sparsity. In this paper, we present MRRN, a model that combines matrix reconstruction with node reliability to predict probable miRNA-disease associations. In MRRN, the most reliable neighbors of miRNA and disease are used to update the original miRNA-disease association matrix, which significantly reduces data sparsity. Unknown miRNA-disease associations are reconstructed by aggregating the most reliable first-order neighbors to increase prediction accuracy by representing the local and global structure of the heterogeneous network. Five-fold cross-validation of MRRN produced an area under the curve (AUC) of 0.9355 and area under the precision-recall curve (AUPR) of 0.2646, values that were greater than those produced by comparable models. Two different types of case studies using three diseases were conducted to demonstrate the accuracy of MRRN, and all top 30 predicted miRNAs were verified.
Collapse
Affiliation(s)
- Hailin Feng
- School of mathematics and computer science, Zhejiang A&F University, No.666 Wusu Street,Lin'an District, 311300, Hangzhou, China
| | - Dongdong Jin
- School of mathematics and computer science, Zhejiang A&F University, No.666 Wusu Street,Lin'an District, 311300, Hangzhou, China
| | - Jian Li
- School of mathematics and computer science, Zhejiang A&F University, No.666 Wusu Street,Lin'an District, 311300, Hangzhou, China
| | - Yane Li
- School of mathematics and computer science, Zhejiang A&F University, No.666 Wusu Street,Lin'an District, 311300, Hangzhou, China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, No. 2006, Xiyuan Avenue, West District, high tech Zone, 611731, Chengdu, China
| | - Tongcun Liu
- School of mathematics and computer science, Zhejiang A&F University, No.666 Wusu Street,Lin'an District, 311300, Hangzhou, China
| |
Collapse
|
8
|
Han G, Kuang Z, Deng L. MSCNE:Predict miRNA-Disease Associations Using Neural Network Based on Multi-Source Biological Information. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:2926-2937. [PMID: 34410928 DOI: 10.1109/tcbb.2021.3106006] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
The important role of microRNA (miRNA) in human diseases has been confirmed by some studies. However, only using biological experiments has greater blindness, leading to higher experimental costs. In this paper a high-efficiency algorithm based on a variety of biological source information and applying a combination of a convolutional neural network (CNN) feature extractor and an extreme learning machine (ELM) classifier is proposed. Specifically, the semantic similarity of diseases, the gaussian interaction profile kernel similarity of the four biological information of miRNA, disease, long non-coding RNA (lncRNA) and environmental factors (EFs), and the similarities of miRNAs are fused together. Among them, miRNAs similarity is composed of miRNA target information, sequence information, family information, and function information. Then, the dimensionality of the data set is reduced by the autoencoder (AE). Finally, deep features are extracted through CNN, and then the association between miRNA and disease is predicted by ELM. The experimental results show that the average AUC value based on the multi-biological source information (MSCNE) model is 0.9630, which can reach higher performance than the other classic classifier, feature extractor mentioned and the other existing algorithms. The results show the MSCNE algorithm is effective to predict the correlation of miRNA-disease.
Collapse
|
9
|
Wang W, Chen H. Predicting miRNA-disease associations based on graph attention networks and dual Laplacian regularized least squares. Brief Bioinform 2022; 23:6645486. [PMID: 35849099 DOI: 10.1093/bib/bbac292] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2022] [Revised: 06/23/2022] [Accepted: 06/26/2022] [Indexed: 01/05/2023] Open
Abstract
Increasing biomedical evidence has proved that the dysregulation of miRNAs is associated with human complex diseases. Identification of disease-related miRNAs is of great importance for disease prevention, diagnosis and remedy. To reduce the time and cost of biomedical experiments, there is a strong incentive to develop efficient computational methods to infer potential miRNA-disease associations. Although many computational approaches have been proposed to address this issue, the prediction accuracy needs to be further improved. In this study, we present a computational framework MKGAT to predict possible associations between miRNAs and diseases through graph attention networks (GATs) using dual Laplacian regularized least squares. We use GATs to learn embeddings of miRNAs and diseases on each layer from initial input features of known miRNA-disease associations, intra-miRNA similarities and intra-disease similarities. We then calculate kernel matrices of miRNAs and diseases based on Gaussian interaction profile (GIP) with the learned embeddings. We further fuse the kernel matrices of each layer and initial similarities with attention mechanism. Dual Laplacian regularized least squares are finally applied for new miRNA-disease association predictions with the fused miRNA and disease kernels. Compared with six state-of-the-art methods by 5-fold cross-validations, our method MKGAT receives the highest AUROC value of 0.9627 and AUPR value of 0.7372. We use MKGAT to predict related miRNAs for three cancers and discover that all the top 50 predicted results in the three diseases are confirmed by existing databases. The excellent performance indicates that MKGAT would be a useful computational tool for revealing disease-related miRNAs.
Collapse
Affiliation(s)
- Wengang Wang
- School of Software, East China Jiaotong University, Nanchang 330013, China
| | - Hailin Chen
- School of Software, East China Jiaotong University, Nanchang 330013, China
| |
Collapse
|
10
|
Yu L, Zheng Y, Ju B, Ao C, Gao L. Research progress of miRNA-disease association prediction and comparison of related algorithms. Brief Bioinform 2022; 23:6542222. [PMID: 35246678 DOI: 10.1093/bib/bbac066] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2021] [Revised: 01/30/2022] [Accepted: 02/08/2022] [Indexed: 11/13/2022] Open
Abstract
With an in-depth understanding of noncoding ribonucleic acid (RNA), many studies have shown that microRNA (miRNA) plays an important role in human diseases. Because traditional biological experiments are time-consuming and laborious, new calculation methods have recently been developed to predict associations between miRNA and diseases. In this review, we collected various miRNA-disease association prediction models proposed in recent years and used two common data sets to evaluate the performance of the prediction models. First, we systematically summarized the commonly used databases and similarity data for predicting miRNA-disease associations, and then divided the various calculation models into four categories for summary and detailed introduction. In this study, two independent datasets (D5430 and D6088) were compiled to systematically evaluate 11 publicly available prediction tools for miRNA-disease associations. The experimental results indicate that the methods based on information dissemination and the method based on scoring function require shorter running time. The method based on matrix transformation often requires a longer running time, but the overall prediction result is better than the previous two methods. We hope that the summary of work related to miRNA and disease will provide comprehensive knowledge for predicting the relationship between miRNA and disease and contribute to advanced computation tools in the future.
Collapse
Affiliation(s)
- Liang Yu
- School of Computer Science and Technology, Xidian University, Xi'an, China
| | - Yujia Zheng
- School of Computer Science and Technology, Xidian University, Xi'an, China
| | - Bingyi Ju
- School of Computer Science and Technology, Xidian University, Xi'an, China
| | - Chunyan Ao
- School of Computer Science and Technology, Xidian University, Xi'an, China
| | - Lin Gao
- School of Computer Science and Technology, Xidian University, Xi'an, China
| |
Collapse
|
11
|
Li J, Liu T, Wang J, Li Q, Ning C, Yang Y. MvKFN-MDA: Multi-view Kernel Fusion Network for miRNA-disease association prediction. Artif Intell Med 2021; 118:102115. [PMID: 34412838 DOI: 10.1016/j.artmed.2021.102115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2020] [Revised: 05/13/2021] [Accepted: 05/21/2021] [Indexed: 12/01/2022]
Abstract
Predicting the associations between microRNAs (miRNAs) and diseases is of great significance for identifying miRNAs related to human diseases. Since it is time-consuming and costly to identify the association between miRNA and disease through biological experiments, computational methods are currently used as an effective supplement to identify the potential association between disease and miRNA. This paper presents a Multi-view Kernel Fusion Network (MvKFN) based prediction method (MvKFN-MDA) to address the problem of miRNA-disease associations prediction. A novel multiple kernel fusion framework Multi-view Kernel Fusion Network (MvKFN) is first proposed to effectively fuse different views similarity kernels constructed from different data sources in a highly nonlinear way. Using MvKFNs, both different base similarity kernels for miRNA, such as sequence, functional, semantic, Gaussian profile kernels and different base similarity kernels for diseases, such as semantic, Gaussian profile kernel are nonlinearly fused into two integrated similarity kernels, one for miRNA, another for disease. Then, miRNA and disease feature representations are extracted from the miRNA and disease integrated similarity kernels respectively. These features are then fed into a neural matrix completion framework which finally outputs the association prediction scores. The parameters of MvKFN-MDA are learned based on the known miRNA-disease association matrix in a supervised end-to-end way. We compare the proposed method with other state-of-the-art methods. The AUCs of our proposed method were superior to the existing methods in both 5-FCV and LOOCV on two open experimental datasets. Furthermore, 49, 48, and 47 of the top 50 predicted miRNAs for three high-risk human diseases, namely, colon cancer, lymphoma, and kidney cancer, are verified respectively using experimental literature. Finally, 100% accuracy from the top 50 predicted miRNAs is achieved when breast cancer is used as a case study to evaluate the ability of MvKFN-MDA for predicting a new disease without any known related miRNAs.
Collapse
Affiliation(s)
- Jin Li
- School of Software, Yunnan University, Kunming, China; Kunming Key Laboratory of Data Science and Intelligent Computing, Kunming, China
| | - Tao Liu
- School of Software, Yunnan University, Kunming, China
| | - Jingru Wang
- School of Software, Yunnan University, Kunming, China
| | - Qing Li
- First Affiliated Hospital of Kunming Medical University, Kunming, China
| | - Chenxi Ning
- School of Software, Yunnan University, Kunming, China
| | - Yun Yang
- School of Software, Yunnan University, Kunming, China; Kunming Key Laboratory of Data Science and Intelligent Computing, Kunming, China.
| |
Collapse
|
12
|
Cai L, Lu C, Xu J, Meng Y, Wang P, Fu X, Zeng X, Su Y. Drug repositioning based on the heterogeneous information fusion graph convolutional network. Brief Bioinform 2021; 22:6347207. [PMID: 34378011 DOI: 10.1093/bib/bbab319] [Citation(s) in RCA: 61] [Impact Index Per Article: 20.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2021] [Revised: 06/30/2021] [Accepted: 07/21/2021] [Indexed: 11/13/2022] Open
Abstract
In silico reuse of old drugs (also known as drug repositioning) to treat common and rare diseases is increasingly becoming an attractive proposition because it involves the use of de-risked drugs, with potentially lower overall development costs and shorter development timelines. Therefore, there is a pressing need for computational drug repurposing methodologies to facilitate drug discovery. In this study, we propose a new method, called DRHGCN (Drug Repositioning based on the Heterogeneous information fusion Graph Convolutional Network), to discover potential drugs for a certain disease. To make full use of different topology information in different domains (i.e. drug-drug similarity, disease-disease similarity and drug-disease association networks), we first design inter- and intra-domain feature extraction modules by applying graph convolution operations to the networks to learn the embedding of drugs and diseases, instead of simply integrating the three networks into a heterogeneous network. Afterwards, we parallelly fuse the inter- and intra-domain embeddings to obtain the more representative embeddings of drug and disease. Lastly, we introduce a layer attention mechanism to combine embeddings from multiple graph convolution layers for further improving the prediction performance. We find that DRHGCN achieves high performance (the average AUROC is 0.934 and the average AUPR is 0.539) in four benchmark datasets, outperforming the current approaches. Importantly, we conducted molecular docking experiments on DRHGCN-predicted candidate drugs, providing several novel approved drugs for Alzheimer's disease (e.g. benzatropine) and Parkinson's disease (e.g. trihexyphenidyl and haloperidol).
Collapse
Affiliation(s)
- Lijun Cai
- Hunan University, Changsha, Hunan, 410082, China
| | | | - Junlin Xu
- Hunan University, Changsha, Hunan, 410082, China
| | - Yajie Meng
- Hunan University, Changsha, Hunan, 410082, China
| | - Peng Wang
- Hunan University, Changsha, Hunan, 410082, China
| | | | | | - Yansen Su
- Anhui University, Changsha, Hunan, 410082, China
| |
Collapse
|
13
|
Li A, Deng Y, Tan Y, Chen M. A novel miRNA-disease association prediction model using dual random walk with restart and space projection federated method. PLoS One 2021; 16:e0252971. [PMID: 34138933 PMCID: PMC8211179 DOI: 10.1371/journal.pone.0252971] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2021] [Accepted: 05/26/2021] [Indexed: 12/27/2022] Open
Abstract
A large number of studies have shown that the variation and disorder of miRNAs are important causes of diseases. The recognition of disease-related miRNAs has become an important topic in the field of biological research. However, the identification of disease-related miRNAs by biological experiments is expensive and time consuming. Thus, computational prediction models that predict disease-related miRNAs must be developed. A novel network projection-based dual random walk with restart (NPRWR) was used to predict potential disease-related miRNAs. The NPRWR model aims to estimate and accurately predict miRNA-disease associations by using dual random walk with restart and network projection technology, respectively. The leave-one-out cross validation (LOOCV) was adopted to evaluate the prediction performance of NPRWR. The results show that the area under the receiver operating characteristic curve(AUC) of NPRWR was 0.9029, which is superior to that of other advanced miRNA-disease associated prediction methods. In addition, lung and kidney neoplasms were selected to present a case study. Among the first 50 miRNAs predicted, 50 and 49 miRNAs have been proven by in databases or relevant literature. Moreover, NPRWR can be used to predict isolated diseases and new miRNAs. LOOCV and the case study achieved good prediction results. Thus, NPRWR will become an effective and accurate disease-miRNA association prediction model.
Collapse
Affiliation(s)
- Ang Li
- Hunan Institute of Technology, School of Computer Science and Technology, Hengyang, China
| | - Yingwei Deng
- Hunan Institute of Technology, School of Computer Science and Technology, Hengyang, China
- Hainan Key Laboratory for Computational Science and Application, Haikou, China
| | - Yan Tan
- Hunan Institute of Technology, School of Computer Science and Technology, Hengyang, China
| | - Min Chen
- Hunan Institute of Technology, School of Computer Science and Technology, Hengyang, China
| |
Collapse
|
14
|
Chu Y, Wang X, Dai Q, Wang Y, Wang Q, Peng S, Wei X, Qiu J, Salahub DR, Xiong Y, Wei DQ. MDA-GCNFTG: identifying miRNA-disease associations based on graph convolutional networks via graph sampling through the feature and topology graph. Brief Bioinform 2021; 22:6261915. [PMID: 34009265 DOI: 10.1093/bib/bbab165] [Citation(s) in RCA: 40] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2021] [Revised: 04/02/2021] [Accepted: 04/08/2021] [Indexed: 11/13/2022] Open
Abstract
Accurate identification of the miRNA-disease associations (MDAs) helps to understand the etiology and mechanisms of various diseases. However, the experimental methods are costly and time-consuming. Thus, it is urgent to develop computational methods towards the prediction of MDAs. Based on the graph theory, the MDA prediction is regarded as a node classification task in the present study. To solve this task, we propose a novel method MDA-GCNFTG, which predicts MDAs based on Graph Convolutional Networks (GCNs) via graph sampling through the Feature and Topology Graph to improve the training efficiency and accuracy. This method models both the potential connections of feature space and the structural relationships of MDA data. The nodes of the graphs are represented by the disease semantic similarity, miRNA functional similarity and Gaussian interaction profile kernel similarity. Moreover, we considered six tasks simultaneously on the MDA prediction problem at the first time, which ensure that under both balanced and unbalanced sample distribution, MDA-GCNFTG can predict not only new MDAs but also new diseases without known related miRNAs and new miRNAs without known related diseases. The results of 5-fold cross-validation show that the MDA-GCNFTG method has achieved satisfactory performance on all six tasks and is significantly superior to the classic machine learning methods and the state-of-the-art MDA prediction methods. Moreover, the effectiveness of GCNs via the graph sampling strategy and the feature and topology graph in MDA-GCNFTG has also been demonstrated. More importantly, case studies for two diseases and three miRNAs are conducted and achieved satisfactory performance.
Collapse
Affiliation(s)
- Yanyi Chu
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, China
| | - Xuhong Wang
- School of Electronic, Information and Electrical Engineering (SEIEE), Shanghai Jiao Tong University, China
| | - Qiuying Dai
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, China
| | - Yanjing Wang
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, China
| | - Qiankun Wang
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, China
| | - Shaoliang Peng
- College of Computer Science and Electronic Engineering, Hunan University, China
| | | | | | - Dennis Russell Salahub
- Department of Chemistry, University of Calgary, Fellow Royal Society of Canada and Fellow of the American Association for the Advancement of Science, China
| | - Yi Xiong
- State Key Laboratory of Microbial Metabolism, Shanghai-Islamabad-Belgrade Joint Innovation Center on Antibacterial Resistances, Joint International Research Laboratory of Metabolic & Developmental Sciences and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200030, P.R. China
| | - Dong-Qing Wei
- State Key Laboratory of Microbial Metabolism, Shanghai-Islamabad-Belgrade Joint Innovation Center on Antibacterial Resistances, Joint International Research Laboratory of Metabolic & Developmental Sciences and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200030, P.R. China
| |
Collapse
|
15
|
Ding Y, Tian LP, Lei X, Liao B, Wu FX. Variational graph auto-encoders for miRNA-disease association prediction. Methods 2020; 192:25-34. [PMID: 32798654 DOI: 10.1016/j.ymeth.2020.08.004] [Citation(s) in RCA: 49] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2020] [Revised: 08/03/2020] [Accepted: 08/08/2020] [Indexed: 02/07/2023] Open
Abstract
Cumulative experimental studies have demonstrated the critical roles of microRNAs (miRNAs) in the diverse fundamental and important biological processes, and in the development of numerous complex human diseases. Thus, exploring the relationships between miRNAs and diseases is helpful with understanding the mechanisms, the detection, diagnosis, and treatment of complex diseases. As the identification of miRNA-disease associations via traditional biological experiments is time-consuming and expensive, an effective computational prediction method is appealing. In this study, we present a deep learning framework with variational graph auto-encoder for miRNA-disease association prediction (VGAE-MDA). VGAE-MDA first gets the representations of miRNAs and diseases from the heterogeneous networks constructed by miRNA-miRNA similarity, disease-disease similarity, and known miRNA-disease associations. Then, VGAE-MDA constructs two sub-networks: miRNA-based network and disease-based network. Combining the representations based on the heterogeneous network, two variational graph auto-encoders (VGAE) are deployed for calculating the miRNA-disease association scores from two sub-networks, respectively. Lastly, VGAE-MDA obtains the final predicted association score for a miRNA-disease pair by integrating the scores from these two trained networks. Unlike the previous model, the VGAE-MDA can mitigate the effect of noises from random selection of negative samples. Besides, the use of graph convolutional neural (GCN) network can naturally incorporate the node features from the graph structure while the variational autoencoder (VAE) makes use of latent variables to predict associations from the perspective of data distribution. The experimental results show that VGAE-MDA outperforms the state-of-the-art approaches in miRNA-disease association prediction. Besides, the effectiveness of our model has been further demonstrated by case studies.
Collapse
Affiliation(s)
- Yulian Ding
- Division of Biomedical Engineering, University of Saskatchewan, Saskatoon, SK S7N 5A9, Canada
| | - Li-Ping Tian
- School of Information, Beijing Wuzi University, Beijing 101125, China
| | - Xiujuan Lei
- School of Computer Science, Shaanxi Normal University, Xi'an 710119, China
| | - Bo Liao
- School of Mathematics and Statistics, Hainan Normal University, Haikou 571158, China
| | - Fang-Xiang Wu
- Division of Biomedical Engineering, University of Saskatchewan, Saskatoon, SK S7N 5A9, Canada; Department of Mechanical Engineering, University of Saskatchewan, Saskatoon, SK S7N 5A9, Canada; Department of Computer Science, University of Saskatchewan, Saskatoon, SK S7N 5A9, Canada.
| |
Collapse
|
16
|
Zhang Y, Chen M, Cheng X, Wei H. MSFSP: A Novel miRNA-Disease Association Prediction Model by Federating Multiple-Similarities Fusion and Space Projection. Front Genet 2020; 11:389. [PMID: 32425980 PMCID: PMC7204399 DOI: 10.3389/fgene.2020.00389] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2020] [Accepted: 03/27/2020] [Indexed: 12/11/2022] Open
Abstract
Growing evidences have indicated that microRNAs (miRNAs) play a significant role relating to many important bioprocesses; their mutations and disorders will cause the occurrence of various complex diseases. The prediction of miRNAs associated with underlying diseases via computational approaches is beneficial to identify biomarkers and discover specific medicine, which can greatly reduce the cost of diagnosis, cure, prognosis, and prevention of human diseases. However, how to further achieve a more reliable prediction of potential miRNA-disease associations with effective integration of different biological data is a challenge for researchers. In this study, we proposed a computational model by using a federated method of combined multiple-similarities fusion and space projection (MSFSP). MSFSP firstly fused the integrated disease similarity (composed of disease semantic similarity, disease functional similarity, and disease Hamming similarity) with the integrated miRNA similarity (composed of miRNA functional similarity, miRNA sequence similarity, and miRNA Hamming similarity). Secondly, it constructed the weighted network of miRNA-disease associations from the experimentally verified Boolean network of miRNA-disease associations by using similarity networks. Finally, it calculated the prediction results by weighting miRNA space projection scores and the disease space projection scores. Leave-one-out cross-validation demonstrated that MSFSP has the distinguished predictive accuracy with area under the receiver operating characteristics curve (AUC) of 0.9613 better than that of five other existing models. In case studies, the predictive ability of MSFSP was further confirmed as 96 and 98% of the top 50 predictions for prostatic neoplasms and lung neoplasms were successfully validated by experimental evidences and supporting experimental evidences were also found for 100% of the top 50 predictions for isolated diseases.
Collapse
Affiliation(s)
- Yi Zhang
- School of Information Science and Engineering, Guilin University of Technology, Guilin, China
| | - Min Chen
- School of Computer Science and Technology, Hunan Institute of Technology, Hengyang, China
| | - Xiaohui Cheng
- School of Information Science and Engineering, Guilin University of Technology, Guilin, China
| | - Hanyan Wei
- School of Pharmacy, Guilin Medical University, Guilin, China
| |
Collapse
|
17
|
Zhang Y, Chen M, Li A, Cheng X, Jin H, Liu Y. LDAI-ISPS: LncRNA-Disease Associations Inference Based on Integrated Space Projection Scores. Int J Mol Sci 2020; 21:E1508. [PMID: 32098405 PMCID: PMC7073162 DOI: 10.3390/ijms21041508] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2019] [Revised: 02/18/2020] [Accepted: 02/19/2020] [Indexed: 12/14/2022] Open
Abstract
Long non-coding RNAs (long ncRNAs, lncRNAs) of all kinds have been implicated in a range of cell developmental processes and diseases, while they are not translated into proteins. Inferring diseases associated lncRNAs by computational methods can be helpful to understand the pathogenesis of diseases, but those current computational methods still have not achieved remarkable predictive performance: such as the inaccurate construction of similarity networks and inadequate numbers of known lncRNA-disease associations. In this research, we proposed a lncRNA-disease associations inference based on integrated space projection scores (LDAI-ISPS) composed of the following key steps: changing the Boolean network of known lncRNA-disease associations into the weighted networks via combining all the global information (e.g., disease semantic similarities, lncRNA functional similarities, and known lncRNA-disease associations); obtaining the space projection scores via vector projections of the weighted networks to form the final prediction scores without biases. The leave-one-out cross validation (LOOCV) results showed that, compared with other methods, LDAI-ISPS had a higher accuracy with area-under-the-curve (AUC) value of 0.9154 for inferring diseases, with AUC value of 0.8865 for inferring new lncRNAs (whose associations related to diseases are unknown), with AUC value of 0.7518 for inferring isolated diseases (whose associations related to lncRNAs are unknown). A case study also confirmed the predictive performance of LDAI-ISPS as a helper for traditional biological experiments in inferring the potential LncRNA-disease associations and isolated diseases.
Collapse
Affiliation(s)
- Yi Zhang
- School of Information Science and Engineering, Guilin University of Technology, Guilin 541004, China
| | - Min Chen
- Hunan Institute of Technology, School of Computer Science and Technology, Hengyang 421002, China
| | - Ang Li
- Hunan Institute of Technology, School of Computer Science and Technology, Hengyang 421002, China
| | - Xiaohui Cheng
- School of Information Science and Engineering, Guilin University of Technology, Guilin 541004, China
| | - Hong Jin
- School of Information Science and Engineering, Guilin University of Technology, Guilin 541004, China
| | - Yarong Liu
- School of Information Science and Engineering, Guilin University of Technology, Guilin 541004, China
| |
Collapse
|
18
|
Wang Y, Yang C, Liu X, Zheng J, Zhang F, Wang D, Xue Y, Li X, Shen S, Shao L, Yang Y, Liu L, Ma J, Liu Y. Transcription factor AP-4 (TFAP4)-upstream ORF coding 66 aa inhibits the malignant behaviors of glioma cells by suppressing the TFAP4/long noncoding RNA 00520/microRNA-520f-3p feedback loop. Cancer Sci 2020; 111:891-906. [PMID: 31943575 PMCID: PMC7060482 DOI: 10.1111/cas.14308] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2019] [Revised: 12/27/2019] [Accepted: 01/02/2020] [Indexed: 02/06/2023] Open
Abstract
Upstream ORF (uORF) is a translational initiation element located in the 5′UTR of eukaryotic mRNAs. Studies have found that uORFs play an important regulatory role in many diseases. Based on The Cancer Genome Atlas database, the results of our experiments and previous research evidence, we investigated transcription factor AP‐4 (TFAP4) and its uORF, LIM and SH3 protein 1 (LASP1), long noncoding RNA 00520 (LINC00520), and microRNA (miR)‐520f‐3p as candidates involved in glioma malignancy, which is a poorly understood process. Both TFAP4‐66aa‐uORF and miR‐520f‐3p were downregulated, and TFAP4, LASP1, and LINC00520 were highly expressed in glioma tissues and cells. TFAP4‐66aa‐uORF or miR‐520f‐3p overexpression or TFAP4, LASP1, or LINC00520 knockdown inhibited glioma cell proliferation, migration, and invasion, but promoted apoptosis. TFAP4‐66aa‐uORF inhibited the translation of TFAP4 by binding to the TFAP4 mRNA. MicroRNA‐520f‐3p inhibited TFAP4 expression by binding to its 3′UTR. However, LINC00520 could promote the expression of TFAP4 by competitively binding to miR‐520f‐3p. In addition, TFAP4 transcriptionally activated LASP1 and LINC00520 expression by binding to their promoter regions, forming a positive feedback loop of TFAP4/LINC00520/miR‐520f‐3p. Our findings together indicated that TFAP4‐66aa‐uORF inhibited the TFAP4/LINC00520/miR‐520f‐3p feedback loop by directly inhibiting TFAP4 expression, subsequently leading to inhibition of glioma malignancy. This provides a basis for developing new therapeutic approaches for glioma treatment.
Collapse
Affiliation(s)
- Yipeng Wang
- Department of Neurosurgery, Shengjing Hospital of China Medical University, Shenyang, China.,Liaoning Clinical Medical Research Center in Nervous System Disease, Shenyang, China.,Key Laboratory of Neuro-oncology in Liaoning Province, Shenyang, China
| | - Chunqing Yang
- Department of Neurosurgery, Shengjing Hospital of China Medical University, Shenyang, China.,Liaoning Clinical Medical Research Center in Nervous System Disease, Shenyang, China.,Key Laboratory of Neuro-oncology in Liaoning Province, Shenyang, China
| | - Xiaobai Liu
- Department of Neurosurgery, Shengjing Hospital of China Medical University, Shenyang, China.,Liaoning Clinical Medical Research Center in Nervous System Disease, Shenyang, China.,Key Laboratory of Neuro-oncology in Liaoning Province, Shenyang, China
| | - Jian Zheng
- Department of Neurosurgery, Shengjing Hospital of China Medical University, Shenyang, China.,Liaoning Clinical Medical Research Center in Nervous System Disease, Shenyang, China.,Key Laboratory of Neuro-oncology in Liaoning Province, Shenyang, China
| | - Fangfang Zhang
- Department of Neurobiology, School of Life Sciences, China Medical University, Shenyang, China.,Key Laboratory of Cell Biology, Ministry of Public Health of China, China Medical University, Shenyang, China.,Key Laboratory of Medical Cell Biology, Ministry of Education of China, China Medical University, Shenyang, China
| | - Di Wang
- Department of Neurosurgery, Shengjing Hospital of China Medical University, Shenyang, China.,Liaoning Clinical Medical Research Center in Nervous System Disease, Shenyang, China.,Key Laboratory of Neuro-oncology in Liaoning Province, Shenyang, China
| | - Yixue Xue
- Department of Neurobiology, School of Life Sciences, China Medical University, Shenyang, China.,Key Laboratory of Cell Biology, Ministry of Public Health of China, China Medical University, Shenyang, China.,Key Laboratory of Medical Cell Biology, Ministry of Education of China, China Medical University, Shenyang, China
| | - Xiaozhi Li
- Department of Neurosurgery, Shengjing Hospital of China Medical University, Shenyang, China.,Liaoning Clinical Medical Research Center in Nervous System Disease, Shenyang, China.,Key Laboratory of Neuro-oncology in Liaoning Province, Shenyang, China
| | - Shuyuan Shen
- Department of Neurobiology, School of Life Sciences, China Medical University, Shenyang, China.,Key Laboratory of Cell Biology, Ministry of Public Health of China, China Medical University, Shenyang, China.,Key Laboratory of Medical Cell Biology, Ministry of Education of China, China Medical University, Shenyang, China
| | - Lianqi Shao
- Department of Neurobiology, School of Life Sciences, China Medical University, Shenyang, China.,Key Laboratory of Cell Biology, Ministry of Public Health of China, China Medical University, Shenyang, China.,Key Laboratory of Medical Cell Biology, Ministry of Education of China, China Medical University, Shenyang, China
| | - Yang Yang
- Department of Neurosurgery, Shengjing Hospital of China Medical University, Shenyang, China.,Liaoning Clinical Medical Research Center in Nervous System Disease, Shenyang, China.,Key Laboratory of Neuro-oncology in Liaoning Province, Shenyang, China
| | - Libo Liu
- Department of Neurobiology, School of Life Sciences, China Medical University, Shenyang, China.,Key Laboratory of Cell Biology, Ministry of Public Health of China, China Medical University, Shenyang, China.,Key Laboratory of Medical Cell Biology, Ministry of Education of China, China Medical University, Shenyang, China
| | - Jun Ma
- Department of Neurobiology, School of Life Sciences, China Medical University, Shenyang, China.,Key Laboratory of Cell Biology, Ministry of Public Health of China, China Medical University, Shenyang, China.,Key Laboratory of Medical Cell Biology, Ministry of Education of China, China Medical University, Shenyang, China
| | - Yunhui Liu
- Department of Neurosurgery, Shengjing Hospital of China Medical University, Shenyang, China.,Liaoning Clinical Medical Research Center in Nervous System Disease, Shenyang, China.,Key Laboratory of Neuro-oncology in Liaoning Province, Shenyang, China
| |
Collapse
|
19
|
Li J, Zhang S, Liu T, Ning C, Zhang Z, Zhou W. Neural inductive matrix completion with graph convolutional networks for miRNA-disease association prediction. Bioinformatics 2020; 36:2538-2546. [DOI: 10.1093/bioinformatics/btz965] [Citation(s) in RCA: 100] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2019] [Revised: 12/17/2019] [Accepted: 12/31/2019] [Indexed: 12/26/2022] Open
Abstract
AbstractMotivationPredicting the association between microRNAs (miRNAs) and diseases plays an import role in identifying human disease-related miRNAs. As identification of miRNA-disease associations via biological experiments is time-consuming and expensive, computational methods are currently used as effective complements to determine the potential associations between disease and miRNA.ResultsWe present a novel method of neural inductive matrix completion with graph convolutional network (NIMCGCN) for predicting miRNA-disease association. NIMCGCN first uses graph convolutional networks to learn miRNA and disease latent feature representations from the miRNA and disease similarity networks. Then, learned features were input into a novel neural inductive matrix completion (NIMC) model to generate an association matrix completion. The parameters of NIMCGCN were learned based on the known miRNA-disease association data in a supervised end-to-end way. We compared the proposed method with other state-of-the-art methods. The area under the receiver operating characteristic curve results showed that our method is significantly superior to existing methods. Furthermore, 50, 47 and 48 of the top 50 predicted miRNAs for three high-risk human diseases, namely, colon cancer, lymphoma and kidney cancer, were verified using experimental literature. Finally, 100% prediction accuracy was achieved when breast cancer was used as a case study to evaluate the ability of NIMCGCN for predicting a new disease without any known related miRNAs.Availability and implementationhttps://github.com/ljatynu/NIMCGCN/Supplementary informationSupplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jin Li
- School of Software, Yunnan University, Kunming 650091, China
| | - Sai Zhang
- School of Software, Yunnan University, Kunming 650091, China
| | - Tao Liu
- School of Software, Yunnan University, Kunming 650091, China
| | - Chenxi Ning
- School of Software, Yunnan University, Kunming 650091, China
| | - Zhuoxuan Zhang
- School of Software, Yunnan University, Kunming 650091, China
| | - Wei Zhou
- School of Software, Yunnan University, Kunming 650091, China
| |
Collapse
|
20
|
Zhang Y, Chen M, Cheng X, Chen Z. LSGSP: a novel miRNA-disease association prediction model using a Laplacian score of the graphs and space projection federated method. RSC Adv 2019; 9:29747-29759. [PMID: 35531537 PMCID: PMC9071959 DOI: 10.1039/c9ra05554a] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2019] [Accepted: 09/09/2019] [Indexed: 12/31/2022] Open
Abstract
Lots of research findings have indicated that miRNAs (microRNAs) are involved in many important biological processes; their mutations and disorders are closely related to diseases, therefore, determining the associations between human diseases and miRNAs is key to understand pathogenic mechanisms. Existing biological experimental methods for identifying miRNA-disease associations are usually expensive and time consuming. Therefore, the development of efficient and reliable computational methods for identifying disease-related miRNAs has become an important topic in the field of biological research in recent years. In this study, we developed a novel miRNA-disease association prediction model using a Laplacian score of the graphs and space projection federated method (LSGSP). This integrates experimentally validated miRNA-disease associations, disease semantic similarity scores, miRNA functional scores, and miRNA family information to build a new disease similarity network and miRNA similarity network, and then obtains the global similarities of these networks through calculating the Laplacian score of the graphs, based on which the miRNA-disease weighted network can be constructed through combination with the miRNA-disease Boolean network. Finally, the miRNA-disease score was obtained via projecting the miRNA space and disease space onto the miRNA-disease weighted network. Compared with several other state-of-the-art methods, using leave-one-out cross validation (LOOCV) to evaluate the accuracy of LSGSP with respect to a benchmark dataset, prediction dataset and compare dataset, LSGSP showed excellent predictive performance with high AUC values of 0.9221, 0.9745 and 0.9194, respectively. In addition, for prostate neoplasms and lung neoplasms, the consistencies between the top 50 predicted miRNAs (obtained from LSGSP) and the results (confirmed from the updated HMDD, miR2Disease, and dbDEMC databases) reached 96% and 100%, respectively. Similarly, for isolated diseases (diseases not associated with any miRNAs), the consistencies between the top 50 predicted miRNAs (obtained from LSGSP) and the results (confirmed from the above-mentioned three databases) reached 98% and 100%, respectively. These results further indicate that LSGSP can effectively predict potential associations between miRNAs and diseases.
Collapse
Affiliation(s)
- Yi Zhang
- School of Information Science and Engineering, Guilin University of Technology 541004 Guilin China
| | - Min Chen
- School of Computer Science and Technology, Hunan Institute of Technology 421002 Hengyang China
| | - Xiaohui Cheng
- School of Information Science and Engineering, Guilin University of Technology 541004 Guilin China
| | - Zheng Chen
- School of Computer Science and Technology, Hunan Institute of Technology 421002 Hengyang China
| |
Collapse
|