1
|
Yang X, Zhang M, Koidis A, Liu X, Guo C, Xu Z, Wei X, Lei H. Identification and quantitation of multiplex camellia oil adulteration based on 11 characteristic lipids using UPLC-Q-Orbitrap-MS. Food Chem 2025; 468:142370. [PMID: 39689496 DOI: 10.1016/j.foodchem.2024.142370] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2024] [Revised: 10/07/2024] [Accepted: 12/02/2024] [Indexed: 12/19/2024]
Abstract
Currently, the identification and quantification of complex adulteration of high-value vegetable oils are still challenging. In this study, the extreme vertex design method was adopted to design representative multivariate adulterated camellia oil samples. Thereafter, 11 characteristic lipid species were identified by considering the statistically significant difference and categorical contribution. A discriminant method and linear regression model were established based on 11 key lipids. The accuracy rate of the model was 100 %, which could correctly discriminate the camellia oil with an adulteration ratio as low as 2.5 %. The root mean square error (RMSE) of the regression equation was close to 0, and the coefficient of determination (R2P) was 0.9054. This method has been successfully applied to commercially available samples for validation purposes, detecting 27.3 % of camellia oil adulteration. Overall, the results indicate that the discriminating method and content prediction model can provide a potential strategy for authentication of multivariate vegetable oils.
Collapse
Affiliation(s)
- Xiaomin Yang
- Guangdong Provincial Key Laboratory of Food Quality and Safety/ National-Local Joint Engineering Research Center for Precision Machining and Safety of Livestock and Poultry Products, College of Food Science, South China Agricultural University, Guangzhou 510642, China.
| | - Mengjie Zhang
- Guangdong Provincial Key Laboratory of Food Quality and Safety/ National-Local Joint Engineering Research Center for Precision Machining and Safety of Livestock and Poultry Products, College of Food Science, South China Agricultural University, Guangzhou 510642, China.
| | - Anastasios Koidis
- Institute for Global Food Security, School of Biological Sciences, Queen's University Belfast, Belfast, UK.
| | - Xiaodong Liu
- Guangdong Provincial Key Laboratory of Food Quality and Safety/ National-Local Joint Engineering Research Center for Precision Machining and Safety of Livestock and Poultry Products, College of Food Science, South China Agricultural University, Guangzhou 510642, China.
| | - Chuangzhong Guo
- Guangdong Shuncheng Alwayseal Technology Ltd, Jieyang 522095, China.
| | - Zhenlin Xu
- Guangdong Provincial Key Laboratory of Food Quality and Safety/ National-Local Joint Engineering Research Center for Precision Machining and Safety of Livestock and Poultry Products, College of Food Science, South China Agricultural University, Guangzhou 510642, China; Heyuan Branch, Guangdong Laboratory for Lingnan Modern Agriculture, 517541, China.
| | - Xiaoqun Wei
- Guangdong Provincial Key Laboratory of Food Quality and Safety/ National-Local Joint Engineering Research Center for Precision Machining and Safety of Livestock and Poultry Products, College of Food Science, South China Agricultural University, Guangzhou 510642, China; Heyuan Branch, Guangdong Laboratory for Lingnan Modern Agriculture, 517541, China.
| | - Hongtao Lei
- Guangdong Provincial Key Laboratory of Food Quality and Safety/ National-Local Joint Engineering Research Center for Precision Machining and Safety of Livestock and Poultry Products, College of Food Science, South China Agricultural University, Guangzhou 510642, China.
| |
Collapse
|
2
|
Chuang KC, Cheng PS, Tsai YH, Tsai MH. Establishing a GRU-GCN coordination-based prediction model for miRNA-disease associations. BMC Genom Data 2025; 26:4. [PMID: 39810100 PMCID: PMC11734345 DOI: 10.1186/s12863-024-01293-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2024] [Accepted: 12/19/2024] [Indexed: 01/16/2025] Open
Abstract
BACKGROUND miRNAs (microRNAs) are endogenous RNAs with lengths of 18 to 24 nucleotides and play critical roles in gene regulation and disease progression. Although traditional wet-lab experiments provide direct evidence for miRNA-disease associations, they are often time-consuming and complicated to analyze by current bioinformatics tools. In recent years, machine learning (ML) and deep learning (DL) techniques are powerful tools to analyze large-scale biological data. Hence, developing a model to predict, identify, and rank connections in miRNAs and diseases can significantly enhance the precision and efficiency in investigating the relationships between miRNAs and diseases. RESULTS In this study, we utilized miRNA-disease association data obtained by biotechnological experiments to develop a DL model for miRNA-disease associations. To improve the accuracy of prediction in this model, we introduced two labeling strategies, weight-based and majority-based definitions, to classify miRNA-disease associations. After preprocessing, data was trained with a novel model combining gated recurrent units (GRU) and graph convolutional network (GCN) to predict the level of miRNA-disease associations. The miRNA-disease association datasets were from HMDD (the Human miRNA Disease Database) and categorized by two distinct labeling approaches, weight-based definitions and majority-based definitions. We classified the miRNA-disease associations into three groups, "upregulated", "downregulated" and "nonspecific", by regression analysis and multiclass classification. This GRU-GCN coordinated model achieved a robust area under the curve (AUC) score of 0.8 in all datasets, demonstrating the efficacy in predicting potential miRNA-disease relationships. CONCLUSIONS By introducing innovative label-preprocessing methods, this study addressed the relationships between miRNAs and diseases, and improved the ambiguity of the results in different experiments. Based on these refined label definitions, we developed a DL-based model to refine and predict the results of associations between miRNAs and diseases. This model offers a valuable tool for complementing traditional experimental methods and enhancing our understanding of miRNA-related disease mechanisms.
Collapse
Affiliation(s)
- Kai-Cheng Chuang
- Department of Life Sciences, National Chung Hsing University, Taichung, 402, Taiwan
| | - Ping-Sung Cheng
- Department of Management Information Systems, National Chung Hsing University, Taichung, 402, Taiwan
| | - Yu-Hung Tsai
- Department of Management Information Systems, National Chung Hsing University, Taichung, 402, Taiwan
| | - Meng-Hsiun Tsai
- Department of Management Information Systems, National Chung Hsing University, Taichung, 402, Taiwan.
- Institute of Genomics and Bioinformatics, National Chung Hsing University, Taichung, 402, Taiwan.
| |
Collapse
|
3
|
Cao X, Lu P. DCSGMDA: A dual-channel convolutional model based on stacked deep learning collaborative gradient decomposition for predicting miRNA-disease associations. Comput Biol Chem 2024; 113:108201. [PMID: 39255626 DOI: 10.1016/j.compbiolchem.2024.108201] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2024] [Revised: 08/17/2024] [Accepted: 08/31/2024] [Indexed: 09/12/2024]
Abstract
Numerous studies have shown that microRNAs (miRNAs) play a key role in human diseases as critical biomarkers. Its abnormal expression is often accompanied by the emergence of specific diseases. Therefore, studying the relationship between miRNAs and diseases can deepen the insights of their pathogenesis, grasp the process of disease onset and development, and promote drug research of specific diseases. However, many undiscovered relationships between miRNAs and diseases remain, significantly limiting research on miRNA-disease correlations. To explore more potential correlations, we propose a dual-channel convolutional model based on stacked deep learning collaborative gradient decomposition for predicting miRNA-disease associations (DCSGMDA). Firstly, we constructed similarity networks for miRNAs and diseases, as well as an association relationship network. Secondly, potential features were fully mined using stacked deep learning and gradient decomposition networks, along with dual-channel convolutional neural networks. Finally, correlations were scored by a multilayer perceptron. We performed 5-fold and 10-fold cross-validation experiments on DCSGMDA using two datasets based on the Human MicroRNA Disease Database (HMDD). Additionally, parametric, ablation, and comparative experiments, along with case studies, were conducted. The experimental results demonstrate that DCSGMDA performs well in predicting miRNA-disease associations.
Collapse
Affiliation(s)
- Xu Cao
- School of Computer and Communication, Lanzhou University of Technology, Lanzhou 730050, Gansu, China.
| | - Pengli Lu
- School of Computer and Communication, Lanzhou University of Technology, Lanzhou 730050, Gansu, China.
| |
Collapse
|
4
|
Gu M, Li C, Ren Y, Chen L, Li S, Zhang D, Zheng X. Exploring the effect of part differences on metabolite molecule changes in refrigerated pork: Identifying key metabolite compounds and their conversion pathways. Food Chem 2024; 460:140308. [PMID: 39024809 DOI: 10.1016/j.foodchem.2024.140308] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Revised: 06/28/2024] [Accepted: 06/30/2024] [Indexed: 07/20/2024]
Abstract
Effect of part differences on metabolite molecule alterations in refrigerated pork was investigated. A metabolomics methodology combined with chemometric analysis was successfully established to identify key compounds and their conversion pathways, including precursors and volatile metabolites, in the Longissimus lumborum as well as the breast and flank stored for 11 days. In total, 12 discriminative precursors were identified using the Short Time-series Expression Miner. In tandem with Random Forest and ANOVA analyses, nine volatile metabolites were identified as key compounds that could be attributed to differences in pork sections. Bidirectional orthogonal partial least squares analysis revealed a potential correlation between these key metabolites and discriminative precursors. Metabolic pathway enrichment analysis demonstrated that the primary metabolic process affected by variations in pork sections is linoleic acid metabolism, which participates in the metabolism of cysteine and glutamic acid to produce methoxy-phenyl-oxime. This study provides valuable insights into the identification of differential metabolites.
Collapse
Affiliation(s)
- Minghui Gu
- Institute of Food Science and Technology, Chinese Academy of Agricultural Sciences, Key Laboratory of Agro-products Quality and Safety Control in Storage and Transport Process, Ministry of Agriculture and Rural Affairs, Beijing 100193, China
| | - Cheng Li
- Institute of Food Science and Technology, Chinese Academy of Agricultural Sciences, Key Laboratory of Agro-products Quality and Safety Control in Storage and Transport Process, Ministry of Agriculture and Rural Affairs, Beijing 100193, China
| | - Yuqing Ren
- Institute of Food Science and Technology, Chinese Academy of Agricultural Sciences, Key Laboratory of Agro-products Quality and Safety Control in Storage and Transport Process, Ministry of Agriculture and Rural Affairs, Beijing 100193, China
| | - Li Chen
- Institute of Food Science and Technology, Chinese Academy of Agricultural Sciences, Key Laboratory of Agro-products Quality and Safety Control in Storage and Transport Process, Ministry of Agriculture and Rural Affairs, Beijing 100193, China
| | - Shaobo Li
- Institute of Food Science and Technology, Chinese Academy of Agricultural Sciences, Key Laboratory of Agro-products Quality and Safety Control in Storage and Transport Process, Ministry of Agriculture and Rural Affairs, Beijing 100193, China
| | - Dequan Zhang
- Institute of Food Science and Technology, Chinese Academy of Agricultural Sciences, Key Laboratory of Agro-products Quality and Safety Control in Storage and Transport Process, Ministry of Agriculture and Rural Affairs, Beijing 100193, China
| | - Xiaochun Zheng
- Institute of Food Science and Technology, Chinese Academy of Agricultural Sciences, Key Laboratory of Agro-products Quality and Safety Control in Storage and Transport Process, Ministry of Agriculture and Rural Affairs, Beijing 100193, China.
| |
Collapse
|
5
|
Qiu Y, Chen W, Ching WK, Cai H, Jiang H, Zou Q. AGML: Adaptive Graph-Based Multi-Label Learning for Prediction of RBP and as Event Associations During EMT. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2024; 21:2113-2122. [PMID: 39133592 DOI: 10.1109/tcbb.2024.3440913] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/05/2025]
Abstract
Increasing evidence has indicated that RNA-binding proteins (RBPs) play an essential role in mediating alternative splicing (AS) events during epithelial-mesenchymal transition (EMT). However, due to the substantial cost and complexity of biological experiments, how AS events are regulated and influenced remains largely unknown. Thus, it is important to construct effective models for inferring hidden RBP-AS event associations during EMT process. In this paper, a novel and efficient model was developed to identify AS event-related candidate RBPs based on Adaptive Graph-based Multi-Label learning (AGML). In particular, we propose to adaptively learn a new affinity graph to capture the intrinsic structure of data for both RBPs and AS events. Multi-view similarity matrices are employed for maintaining the intrinsic structure and guiding the adaptive graph learning. We then simultaneously update the RBP and AS event associations that are predicted from both spaces by applying multi-label learning. The experimental results have shown that our AGML achieved AUC values of 0.9521 and 0.9873 by 5-fold and leave-one-out cross-validations, respectively, indicating the superiority and effectiveness of our proposed model. Furthermore, AGML can serve as an efficient and reliable tool for uncovering novel AS events-associated RBPs and is applicable for predicting the associations between other biological entities.
Collapse
|
6
|
Long S, Tang X, Si X, Kong T, Zhu Y, Wang C, Qi C, Mu Z, Liu J. TriFusion enables accurate prediction of miRNA-disease association by a tri-channel fusion neural network. Commun Biol 2024; 7:1067. [PMID: 39215090 PMCID: PMC11364641 DOI: 10.1038/s42003-024-06734-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2024] [Accepted: 08/13/2024] [Indexed: 09/04/2024] Open
Abstract
The identification of miRNA-disease associations is crucial for early disease prevention and treatment. However, it is still a computational challenge to accurately predict such associations due to improper information encoding. Previous methods characterize miRNA-disease associations only from single levels, causing the loss of multi-level association information. In this study, we propose TriFusion, a powerful and interpretable deep learning framework for miRNA-disease association prediction. It develops a tri-channel architecture to encode the association features of miRNAs and diseases from different levels and designs a feature fusion encoder to smoothly fuse these features. After training and testing, TriFusion outperforms other leading methods and offers strong interpretability through its learned representations. Furthermore, TriFusion is applied to three high-risk sexually associated cancers (ovarian, breast, and prostate cancers) and exhibits remarkable ability in the identification of miRNAs associated with the three diseases.
Collapse
Affiliation(s)
- Sheng Long
- School of Mathematics and Statistics, Shandong University, Weihai, China
| | - Xiaoran Tang
- School of Mathematics and Statistics, Shandong University, Weihai, China
| | - Xinyi Si
- School of Mathematics and Statistics, Shandong University, Weihai, China
| | - Tongxin Kong
- School of Mathematics and Statistics, Shandong University, Weihai, China
| | - Yanhao Zhu
- School of Mathematics and Statistics, Shandong University, Weihai, China
| | - Chuanzhi Wang
- School of Mathematics and Statistics, Shandong University, Weihai, China
| | - Chenqing Qi
- School of Mathematics and Statistics, Shandong University, Weihai, China
| | - Zengchao Mu
- School of Mathematics and Statistics, Shandong University, Weihai, China.
| | - Juntao Liu
- School of Mathematics and Statistics, Shandong University, Weihai, China.
| |
Collapse
|
7
|
Lu D, Li J, Zheng C, Liu J, Zhang Q. HGTMDA: A Hypergraph Learning Approach with Improved GCN-Transformer for miRNA-Disease Association Prediction. Bioengineering (Basel) 2024; 11:680. [PMID: 39061762 PMCID: PMC11273495 DOI: 10.3390/bioengineering11070680] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2024] [Revised: 06/14/2024] [Accepted: 07/02/2024] [Indexed: 07/28/2024] Open
Abstract
Accumulating scientific evidence highlights the pivotal role of miRNA-disease association research in elucidating disease pathogenesis and developing innovative diagnostics. Consequently, accurately identifying disease-associated miRNAs has emerged as a prominent research topic in bioinformatics. Advances in graph neural networks (GNNs) have catalyzed methodological breakthroughs in this field. However, existing methods are often plagued by data noise and struggle to effectively integrate local and global information, which hinders their predictive performance. To address this, we introduce HGTMDA, an innovative hypergraph learning framework that incorporates random walk with restart-based association masking and an enhanced GCN-Transformer model to infer miRNA-disease associations. HGTMDA starts by constructing multiple homogeneous similarity networks. A novel enhancement of our approach is the introduction of a restart-based random walk association masking strategy. By stochastically masking a subset of association data and integrating it with a GCN enhanced by an attention mechanism, this strategy enables better capture of key information, leading to improved information utilization and reduced impact of noisy data. Next, we build an miRNA-disease heterogeneous hypergraph and adopt an improved GCN-Transformer encoder to effectively solve the effective extraction of local and global information. Lastly, we utilize a combined Dice cross-entropy (DCE) loss function to guide the model training and optimize its performance. To evaluate the performance of HGTMDA, comprehensive comparisons were conducted with state-of-the-art methods. Additionally, in-depth case studies on lung cancer and colorectal cancer were performed. The results demonstrate HGTMDA's outstanding performance across various metrics and its exceptional effectiveness in real-world application scenarios, highlighting the advantages and value of this method.
Collapse
Affiliation(s)
- Daying Lu
- School of Cyber Science and Engineering, Qufu Normal University, Qufu 273165, China
| | | | | | | | | |
Collapse
|
8
|
Sheng N, Xie X, Wang Y, Huang L, Zhang S, Gao L, Wang H. A Survey of Deep Learning for Detecting miRNA- Disease Associations: Databases, Computational Methods, Challenges, and Future Directions. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2024; 21:328-347. [PMID: 38194377 DOI: 10.1109/tcbb.2024.3351752] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/11/2024]
Abstract
MicroRNAs (miRNAs) are an important class of non-coding RNAs that play an essential role in the occurrence and development of various diseases. Identifying the potential miRNA-disease associations (MDAs) can be beneficial in understanding disease pathogenesis. Traditional laboratory experiments are expensive and time-consuming. Computational models have enabled systematic large-scale prediction of potential MDAs, greatly improving the research efficiency. With recent advances in deep learning, it has become an attractive and powerful technique for uncovering novel MDAs. Consequently, numerous MDA prediction methods based on deep learning have emerged. In this review, we first summarize publicly available databases related to miRNAs and diseases for MDA prediction. Next, we outline commonly used miRNA and disease similarity calculation and integration methods. Then, we comprehensively review the 48 existing deep learning-based MDA computation methods, categorizing them into classical deep learning and graph neural network-based techniques. Subsequently, we investigate the evaluation methods and metrics that are frequently used to assess MDA prediction performance. Finally, we discuss the performance trends of different computational methods, point out some problems in current research, and propose 9 potential future research directions. Data resources and recent advances in MDA prediction methods are summarized in the GitHub repository https://github.com/sheng-n/DL-miRNA-disease-association-methods.
Collapse
|
9
|
Daniel Thomas S, Vijayakumar K, John L, Krishnan D, Rehman N, Revikumar A, Kandel Codi JA, Prasad TSK, S S V, Raju R. Machine Learning Strategies in MicroRNA Research: Bridging Genome to Phenome. OMICS : A JOURNAL OF INTEGRATIVE BIOLOGY 2024; 28:213-233. [PMID: 38752932 DOI: 10.1089/omi.2024.0047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2024]
Abstract
MicroRNAs (miRNAs) have emerged as a prominent layer of regulation of gene expression. This article offers the salient and current aspects of machine learning (ML) tools and approaches from genome to phenome in miRNA research. First, we underline that the complexity in the analysis of miRNA function ranges from their modes of biogenesis to the target diversity in diverse biological conditions. Therefore, it is imperative to first ascertain the miRNA coding potential of genomes and understand the regulatory mechanisms of their expression. This knowledge enables the efficient classification of miRNA precursors and the identification of their mature forms and respective target genes. Second, and because one miRNA can target multiple mRNAs and vice versa, another challenge is the assessment of the miRNA-mRNA target interaction network. Furthermore, long-noncoding RNA (lncRNA)and circular RNAs (circRNAs) also contribute to this complexity. ML has been used to tackle these challenges at the high-dimensional data level. The present expert review covers more than 100 tools adopting various ML approaches pertaining to, for example, (1) miRNA promoter prediction, (2) precursor classification, (3) mature miRNA prediction, (4) miRNA target prediction, (5) miRNA- lncRNA and miRNA-circRNA interactions, (6) miRNA-mRNA expression profiling, (7) miRNA regulatory module detection, (8) miRNA-disease association, and (9) miRNA essentiality prediction. Taken together, we unpack, critically examine, and highlight the cutting-edge synergy of ML approaches and miRNA research so as to develop a dynamic and microlevel understanding of human health and diseases.
Collapse
Affiliation(s)
- Sonet Daniel Thomas
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to Be University), Manglore, Karnataka, India
- Centre for Systems Biology and Molecular Medicine (CSBMM), Yenepoya (Deemed to Be University), Manglore, Karnataka, India
| | - Krithika Vijayakumar
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to Be University), Manglore, Karnataka, India
| | - Levin John
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to Be University), Manglore, Karnataka, India
| | - Deepak Krishnan
- Centre for Systems Biology and Molecular Medicine (CSBMM), Yenepoya (Deemed to Be University), Manglore, Karnataka, India
| | - Niyas Rehman
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to Be University), Manglore, Karnataka, India
| | - Amjesh Revikumar
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to Be University), Manglore, Karnataka, India
- Kerala Genome Data Centre, Kerala Development and Innovation Strategic Council, Thiruvananthapuram, Kerala, India
| | - Jalaluddin Akbar Kandel Codi
- Department of Surgical Oncology, Yenepoya Medical College, Yenepoya (Deemed to Be University), Manglore, Karnataka, India
| | | | - Vinodchandra S S
- Department of Computer Science, University of Kerala, Thiruvananthapuram, Kerala, India
| | - Rajesh Raju
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to Be University), Manglore, Karnataka, India
- Centre for Systems Biology and Molecular Medicine (CSBMM), Yenepoya (Deemed to Be University), Manglore, Karnataka, India
| |
Collapse
|
10
|
Jin Z, Wang M, Tang C, Zheng X, Zhang W, Sha X, An S. Predicting miRNA-disease association via graph attention learning and multiplex adaptive modality fusion. Comput Biol Med 2024; 169:107904. [PMID: 38181611 DOI: 10.1016/j.compbiomed.2023.107904] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Revised: 12/12/2023] [Accepted: 12/23/2023] [Indexed: 01/07/2024]
Abstract
miRNAs are a class of small non-coding RNA molecules that play important roles in gene regulation. They are crucial for maintaining normal cellular functions, and dysregulation or dysfunction of miRNAs which are linked to the onset and advancement of multiple human diseases. Research on miRNAs has unveiled novel avenues in the realm of the diagnosis, treatment, and prevention of human diseases. However, clinical trials pose challenges and drawbacks, such as complexity and time-consuming processes, which create obstacles for many researchers. Graph Attention Network (GAT) has shown excellent performance in handling graph-structured data for tasks such as link prediction. Some studies have successfully applied GAT to miRNA-disease association prediction. However, there are several drawbacks to existing methods. Firstly, most of the previous models rely solely on concatenation operations to merge features of miRNAs and diseases, which results in the deprivation of significant modality-specific information and even the inclusion of redundant information. Secondly, as the number of layers in GAT increases, there is a possibility of excessive smoothing in the feature extraction process, which significantly affects the prediction accuracy. To address these issues and effectively complete miRNA disease prediction tasks, we propose an innovative model called Multiplex Adaptive Modality Fusion Graph Attention Network (MAMFGAT). MAMFGAT utilizes GAT as the main structure for feature aggregation and incorporates a multi-modal adaptive fusion module to extract features from three interconnected networks: the miRNA-disease association network, the miRNA similarity network, and the disease similarity network. It employs adaptive learning and cross-modality contrastive learning to fuse more effective miRNA and disease feature embeddings as well as incorporates multi-modal residual feature fusion to tackle the problem of excessive feature smoothing in GATs. Finally, we employ a Multi-Layer Perceptron (MLP) model that takes the embeddings of miRNA and disease features as input to anticipate the presence of potential miRNA-disease associations. Extensive experimental results provide evidence of the superior performance of MAMFGAT in comparison to other state-of-the-art methods. To validate the significance of various modalities and assess the efficacy of the designed modules, we performed an ablation analysis. Furthermore, MAMFGAT shows outstanding performance in three cancer case studies, indicating that it is a reliable method for studying the association between miRNA and diseases. The implementation of MAMFGAT can be accessed at the following GitHub repository: https://github.com/zixiaojin66/MAMFGAT-master.
Collapse
Affiliation(s)
- Zixiao Jin
- School of Computer, China University of Geosciences, Wuhan, 430074, China.
| | - Minhui Wang
- Department of Pharmacy, Lianshui People's Hospital of Kangda College Affiliated to Nanjing Medical University, Huai'an 223300, China.
| | - Chang Tang
- School of Computer, China University of Geosciences, Wuhan, 430074, China.
| | - Xiao Zheng
- School of Computer, National University of Defense Technology, Changsha, 410073, China.
| | - Wen Zhang
- College of Informatics, Huazhong Agricultural University, Wuhan, 430070, China.
| | - Xiaofeng Sha
- Department of Oncology, Huai'an Hongze District People's Hospital, Huai'an, 223100, China.
| | - Shan An
- JD Health International Inc., China.
| |
Collapse
|
11
|
Han Y, Zhou Q, Liu L, Li J, Zhou Y. DNI-MDCAP: improvement of causal MiRNA-disease association prediction based on deep network imputation. BMC Bioinformatics 2024; 25:22. [PMID: 38216907 PMCID: PMC10785389 DOI: 10.1186/s12859-024-05644-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Accepted: 01/08/2024] [Indexed: 01/14/2024] Open
Abstract
BACKGROUND MiRNAs are involved in the occurrence and development of many diseases. Extensive literature studies have demonstrated that miRNA-disease associations are stratified and encompass ~ 20% causal associations. Computational models that predict causal miRNA-disease associations provide effective guidance in identifying novel interpretations of disease mechanisms and potential therapeutic targets. Although several predictive models for miRNA-disease associations exist, it is still challenging to discriminate causal miRNA-disease associations from non-causal ones. Hence, there is a pressing need to develop an efficient prediction model for causal miRNA-disease association prediction. RESULTS We developed DNI-MDCAP, an improved computational model that incorporated additional miRNA similarity metrics, deep graph embedding learning-based network imputation and semi-supervised learning framework. Through extensive predictive performance evaluation, including tenfold cross-validation and independent test, DNI-MDCAP showed excellent performance in identifying causal miRNA-disease associations, achieving an area under the receiver operating characteristic curve (AUROC) of 0.896 and 0.889, respectively. Regarding the challenge of discriminating causal miRNA-disease associations from non-causal ones, DNI-MDCAP exhibited superior predictive performance compared to existing models MDCAP and LE-MDCAP, reaching an AUROC of 0.870. Wilcoxon test also indicated significantly higher prediction scores for causal associations than for non-causal ones. Finally, the potential causal miRNA-disease associations predicted by DNI-MDCAP, exemplified by diabetic nephropathies and hsa-miR-193a, have been validated by recently published literature, further supporting the reliability of the prediction model. CONCLUSIONS DNI-MDCAP is a dedicated tool to specifically distinguish causal miRNA-disease associations with substantially improved accuracy. DNI-MDCAP is freely accessible at http://www.rnanut.net/DNIMDCAP/ .
Collapse
Affiliation(s)
- Yu Han
- Department of Biomedical Informatics, School of Basic Medical Sciences, Peking University, Beijing, China
| | - Qiong Zhou
- Department of Biomedical Informatics, School of Basic Medical Sciences, Peking University, Beijing, China
| | - Leibo Liu
- Department of Biomedical Informatics, School of Basic Medical Sciences, Peking University, Beijing, China
| | - Jianwei Li
- Institute of Computational Medicine, School of Artificial Intelligence, Hebei University of Technology, Tianjin, China
| | - Yuan Zhou
- Department of Biomedical Informatics, School of Basic Medical Sciences, Peking University, Beijing, China.
- State Key Laboratory of Vascular Homeostasis and Remodeling, Peking University, Beijing, China.
| |
Collapse
|
12
|
Yang F, Zhao R, Suo J, Ding Y, Tan J, Zhu Q, Ma Y. Understanding quality differences between kiwifruit varieties during softening. Food Chem 2024; 430:136983. [PMID: 37527582 DOI: 10.1016/j.foodchem.2023.136983] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2023] [Revised: 07/17/2023] [Accepted: 07/22/2023] [Indexed: 08/03/2023]
Abstract
Research into variations between kiwifruit varieties particularly their softening quality during storage is important in improving kiwifruit quality. The potential reasons for ripening quality differences between 'Cuixiang' (CX) and 'Hayward' (HWD) kiwifruit were analyzed by physiology and metabolomic data combined with the random forests learning algorithm. The results showed that the storability difference between the two varieties mainly resulted from differences in polygalacturonase (PG) and β-galactosidase activities. The 1 °C slowed the fruit softening process of both varieties by decreasing their PG activities. A total of 368 metabolites were identified and amino acid, carbohydrate, cofactors and vitamins, as well as nucleotide metabolism are key metabolic modules that affect the ripening differences of CX and HWD kiwifruit. A total of 30 metabolites showed remarkable ability in distinguish the ripening quality of CX and HWD kiwifruit, in which d-glucose, d-maltose, 2-hydroxybutyric acid, phenyllactate, and vitamin B2 were noteworthy for their potential application on the evaluation of kiwifruit taste and nutritional value. These findings provide positive insights into the underlying mechanism of ripening quality differences between CX and HWD kiwifruit and new ideas for identifying key metabolic markers in kiwifruit.
Collapse
Affiliation(s)
- Fan Yang
- College of Forestry, Northwest A&F University, Yangling, Shaanxi 712100, PR China
| | - Renkai Zhao
- College of Forestry, Northwest A&F University, Yangling, Shaanxi 712100, PR China
| | - Jiangtao Suo
- Shaanxi Bairui Kiwi Research Institute Co., Ltd., in China, Xi'an, Shaanxi 710000, PR China
| | - Yuduan Ding
- College of Horticulture, Northwest A&F University, Yangling, Shaanxi 712100, PR China
| | - Jiawei Tan
- College of Forestry, Northwest A&F University, Yangling, Shaanxi 712100, PR China
| | - Qinggang Zhu
- College of Horticulture, Northwest A&F University, Yangling, Shaanxi 712100, PR China.
| | - Yanping Ma
- College of Forestry, Northwest A&F University, Yangling, Shaanxi 712100, PR China.
| |
Collapse
|
13
|
Chen L, Zhang T, Ding H, Xie X, Zhu Y, Dai G, Gao Y, Zhang G, Xie K. Identification of metabolite biomarkers in Salmonella enteritidis-contaminated chickens using UHPLC-QTRAP-MS-based targeted metabolomics. Food Chem X 2023; 20:100966. [PMID: 38144757 PMCID: PMC10740086 DOI: 10.1016/j.fochx.2023.100966] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2023] [Revised: 10/21/2023] [Accepted: 10/24/2023] [Indexed: 12/26/2023] Open
Abstract
This study aimed to characterize the metabolic profile of Salmonella enteritidis (S. enteritidis) in chicken matrix and to identify metabolic biomarkers of S. enteritidis in chicken. The UHPLC-QTRAP-MS high-throughput targeted metabolomics approach was employed to analyze the metabolic profiles of contaminated and control group chickens. A total of 348 metabolites were quantified, and the application of deep learning least absolute shrinkage and selection operator (LASSO) modelling analysis obtained eight potential metabolite biomarkers for S. enteritidis. Metabolic abundance change analysis revealed significantly enriched abundances of anthranilic acid, l-pyroglutamic acid, 5-hydroxylysine, n,n-dimethylarginine, 4-hydroxybenzoic acid, and menatetrenone in contaminated chicken samples. The receiver operating characteristic (ROC) curve analysis demonstrated the strong ability of these six metabolites as biomarkers to distinguish S. enteritidis contaminated and fresh chicken samples. The findings presented in this study offer a theoretical foundation for developing an innovative approach to identify and detect foodborne contamination caused by S. enteritidis.
Collapse
Affiliation(s)
- Lan Chen
- College of Veterinary Medicine, Yangzhou University, Yangzhou 225009, China
- Joint International Research Laboratory of Agriculture & Agri-Product Safety of MOE, Yangzhou University, Yangzhou 225009, China
- Poultry Institute, Chinese Academy of Agricultural Sciences, Yangzhou 225009, China
| | - Tao Zhang
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China
- Joint International Research Laboratory of Agriculture & Agri-Product Safety of MOE, Yangzhou University, Yangzhou 225009, China
| | - Hao Ding
- College of Veterinary Medicine, Yangzhou University, Yangzhou 225009, China
- Joint International Research Laboratory of Agriculture & Agri-Product Safety of MOE, Yangzhou University, Yangzhou 225009, China
| | - Xing Xie
- Institute of Veterinary Medicine, Jiangsu Academy of Agricultural Sciences, Key Laboratory of Veterinary Biological Engineering and Technology, Ministry of Agriculture, Nanjing 210000 China
| | - Yali Zhu
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China
- Joint International Research Laboratory of Agriculture & Agri-Product Safety of MOE, Yangzhou University, Yangzhou 225009, China
| | - Guojun Dai
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China
- Joint International Research Laboratory of Agriculture & Agri-Product Safety of MOE, Yangzhou University, Yangzhou 225009, China
| | - Yushi Gao
- College of Veterinary Medicine, Yangzhou University, Yangzhou 225009, China
- Institute of Veterinary Medicine, Jiangsu Academy of Agricultural Sciences, Key Laboratory of Veterinary Biological Engineering and Technology, Ministry of Agriculture, Nanjing 210000 China
| | - Genxi Zhang
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China
- Joint International Research Laboratory of Agriculture & Agri-Product Safety of MOE, Yangzhou University, Yangzhou 225009, China
| | - Kaizhou Xie
- College of Veterinary Medicine, Yangzhou University, Yangzhou 225009, China
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China
- Joint International Research Laboratory of Agriculture & Agri-Product Safety of MOE, Yangzhou University, Yangzhou 225009, China
| |
Collapse
|
14
|
Sheng N, Wang Y, Huang L, Gao L, Cao Y, Xie X, Fu Y. Multi-task prediction-based graph contrastive learning for inferring the relationship among lncRNAs, miRNAs and diseases. Brief Bioinform 2023; 24:bbad276. [PMID: 37529914 DOI: 10.1093/bib/bbad276] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Revised: 07/09/2023] [Accepted: 07/11/2023] [Indexed: 08/03/2023] Open
Abstract
MOTIVATION Identifying the relationships among long non-coding RNAs (lncRNAs), microRNAs (miRNAs) and diseases is highly valuable for diagnosing, preventing, treating and prognosing diseases. The development of effective computational prediction methods can reduce experimental costs. While numerous methods have been proposed, they often to treat the prediction of lncRNA-disease associations (LDAs), miRNA-disease associations (MDAs) and lncRNA-miRNA interactions (LMIs) as separate task. Models capable of predicting all three relationships simultaneously remain relatively scarce. Our aim is to perform multi-task predictions, which not only construct a unified framework, but also facilitate mutual complementarity of information among lncRNAs, miRNAs and diseases. RESULTS In this work, we propose a novel unsupervised embedding method called graph contrastive learning for multi-task prediction (GCLMTP). Our approach aims to predict LDAs, MDAs and LMIs by simultaneously extracting embedding representations of lncRNAs, miRNAs and diseases. To achieve this, we first construct a triple-layer lncRNA-miRNA-disease heterogeneous graph (LMDHG) that integrates the complex relationships between these entities based on their similarities and correlations. Next, we employ an unsupervised embedding model based on graph contrastive learning to extract potential topological feature of lncRNAs, miRNAs and diseases from the LMDHG. The graph contrastive learning leverages graph convolutional network architectures to maximize the mutual information between patch representations and corresponding high-level summaries of the LMDHG. Subsequently, for the three prediction tasks, multiple classifiers are explored to predict LDA, MDA and LMI scores. Comprehensive experiments are conducted on two datasets (from older and newer versions of the database, respectively). The results show that GCLMTP outperforms other state-of-the-art methods for the disease-related lncRNA and miRNA prediction tasks. Additionally, case studies on two datasets further demonstrate the ability of GCLMTP to accurately discover new associations. To ensure reproducibility of this work, we have made the datasets and source code publicly available at https://github.com/sheng-n/GCLMTP.
Collapse
Affiliation(s)
- Nan Sheng
- Key laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, 130012 Changchun, China
| | - Yan Wang
- Key laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, 130012 Changchun, China
- School of Artificial Intelligence, Jilin University, 130012 Changchun, China
| | - Lan Huang
- Key laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, 130012 Changchun, China
| | - Ling Gao
- Key laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, 130012 Changchun, China
| | - Yangkun Cao
- School of Artificial Intelligence, Jilin University, 130012 Changchun, China
| | - Xuping Xie
- Key laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, 130012 Changchun, China
| | - Yuan Fu
- Institute of Biological, Environmental and Rural Sciences, Aberystwyth University, Aberystwyth, Ceredigion, UK
| |
Collapse
|
15
|
Bai T, Yan K, Liu B. DAmiRLocGNet: miRNA subcellular localization prediction by combining miRNA-disease associations and graph convolutional networks. Brief Bioinform 2023:bbad212. [PMID: 37332057 DOI: 10.1093/bib/bbad212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2023] [Revised: 05/17/2023] [Accepted: 05/18/2023] [Indexed: 06/20/2023] Open
Abstract
MicroRNAs (miRNAs) are human post-transcriptional regulators in humans, which are involved in regulating various physiological processes by regulating the gene expression. The subcellular localization of miRNAs plays a crucial role in the discovery of their biological functions. Although several computational methods based on miRNA functional similarity networks have been presented to identify the subcellular localization of miRNAs, it remains difficult for these approaches to effectively extract well-referenced miRNA functional representations due to insufficient miRNA-disease association representation and disease semantic representation. Currently, there has been a significant amount of research on miRNA-disease associations, making it possible to address the issue of insufficient miRNA functional representation. In this work, a novel model is established, named DAmiRLocGNet, based on graph convolutional network (GCN) and autoencoder (AE) for identifying the subcellular localizations of miRNA. The DAmiRLocGNet constructs the features based on miRNA sequence information, miRNA-disease association information and disease semantic information. GCN is utilized to gather the information of neighboring nodes and capture the implicit information of network structures from miRNA-disease association information and disease semantic information. AE is employed to capture sequence semantics from sequence similarity networks. The evaluation demonstrates that the performance of DAmiRLocGNet is superior to other competing computational approaches, benefiting from implicit features captured by using GCNs. The DAmiRLocGNet has the potential to be applied to the identification of subcellular localization of other non-coding RNAs. Moreover, it can facilitate further investigation into the functional mechanisms underlying miRNA localization. The source code and datasets are accessed at http://bliulab.net/DAmiRLocGNet.
Collapse
Affiliation(s)
- Tao Bai
- School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China
- School of Mathematics & Computer Science, Yan'an University, Shaanxi 716000, China
| | - Ke Yan
- School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China
| | - Bin Liu
- School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China
- Advanced Research Institute of Multidisciplinary Science, Beijing Institute of Technology, Beijing 100081, China
| |
Collapse
|
16
|
Gu M, Li C, Chen L, Li S, Xiao N, Zhang D, Zheng X. Insight from untargeted metabolomics: Revealing the potential marker compounds changes in refrigerated pork based on random forests machine learning algorithm. Food Chem 2023; 424:136341. [PMID: 37216778 DOI: 10.1016/j.foodchem.2023.136341] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Revised: 04/16/2023] [Accepted: 05/08/2023] [Indexed: 05/24/2023]
Abstract
Data on changes in non-volatile components and metabolic pathways during pork storage were inadequately investigated. Herein, an untargeted metabolomics coupled with random forests machine learning algorithm was proposed to identify the potential marker compounds and their effects on non-volatile production during pork storage by ultra-high-performance liquid chromatography-mass spectrometry (UHPLC-MS/MS). A total of 873 differential metabolites were identified based on analysis of variance (ANOVA). Bioinformatics analysis shows that the key metabolic pathways for protein degradation and amino acid transport are amino acid metabolism and nucleotide metabolism. Finally, 40 potential marker compounds were screened using the random forest regression model, innovatively proposing the key role of pentose-related metabolism in pork spoilage. Multiple linear regression analysis revealed that d-xylose, xanthine, and pyruvaldehyde could be key marker compounds related to the freshness of refrigerated pork. Therefore, this study could provide new ideas for the identification of marker compounds in refrigerated pork.
Collapse
Affiliation(s)
- Minghui Gu
- Institute of Food Science and Technology, Chinese Academy of Agricultural Sciences, Key Laboratory of Agro-products Quality and Safety Control in Storage and Transport Process, Ministry of Agriculture and Rural Affairs, Beijing 100193, China
| | - Cheng Li
- Institute of Food Science and Technology, Chinese Academy of Agricultural Sciences, Key Laboratory of Agro-products Quality and Safety Control in Storage and Transport Process, Ministry of Agriculture and Rural Affairs, Beijing 100193, China
| | - Li Chen
- Institute of Food Science and Technology, Chinese Academy of Agricultural Sciences, Key Laboratory of Agro-products Quality and Safety Control in Storage and Transport Process, Ministry of Agriculture and Rural Affairs, Beijing 100193, China
| | - Shaobo Li
- Institute of Food Science and Technology, Chinese Academy of Agricultural Sciences, Key Laboratory of Agro-products Quality and Safety Control in Storage and Transport Process, Ministry of Agriculture and Rural Affairs, Beijing 100193, China
| | - Naiyu Xiao
- College of Light Industry and Food Science, Zhongkai University of Agriculture and Engineering, Guangzhou, Guangdong 510225, China
| | - Dequan Zhang
- Institute of Food Science and Technology, Chinese Academy of Agricultural Sciences, Key Laboratory of Agro-products Quality and Safety Control in Storage and Transport Process, Ministry of Agriculture and Rural Affairs, Beijing 100193, China.
| | - Xiaochun Zheng
- Institute of Food Science and Technology, Chinese Academy of Agricultural Sciences, Key Laboratory of Agro-products Quality and Safety Control in Storage and Transport Process, Ministry of Agriculture and Rural Affairs, Beijing 100193, China.
| |
Collapse
|
17
|
S S, E R V, Krishnakumar U. Improving miRNA Disease Association Prediction Accuracy Using Integrated Similarity Information and Deep Autoencoders. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:1125-1136. [PMID: 35914051 DOI: 10.1109/tcbb.2022.3195514] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
MicroRNAs (miRNAs) are short endogenous non-encoding RNA molecules (22nt) that have a vital role in many biological and molecular processes inside the human body. Abnormal and dysregulated expressions of miRNAs are correlated with many complex disorders. Time-consuming wet-lab biological experiments are costly and labour-intensive. So, the situation demands feasible and efficient computational approaches for predicting promising miRNAs associated with diseases. Here a two-stage feature pruning approach based on miRNA feature similarity fusion that uses deep attention autoencoder and recursive feature elimination with cross-validation (RFECV) is proposed for predicting unknown miRNA-disease associations. In the first stage, an attention autoencoder captures highly influential features from the fused feature vector. For further pruning of features, RFECV is applied. The resultant features were given to a Random Forest classifier for association prediction. The Highest AUC of 94.41% is attained when all miRNA similarity measures are merged with disease similarities. Case studies were done on two diseases-lymphoma and leukaemia, to examine the reliability of the approach. Comparative analysis shows that the proposed approach outperforms recent methodologies for predicting miRNA-disease associations.
Collapse
|
18
|
Zhang H, Fang J, Sun Y, Xie G, Lin Z, Gu G. Predicting miRNA-Disease Associations via Node-Level Attention Graph Auto-Encoder. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:1308-1318. [PMID: 35503834 DOI: 10.1109/tcbb.2022.3170843] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Previous studies have confirmed microRNA (miRNA), small single-stranded non-coding RNA, participates in various biological processes and plays vital roles in many complex human diseases. Therefore, developing an efficient method to infer potential miRNA disease associations could greatly help understand operational mechanisms for diseases at the molecular level. However, during these early stages for miRNA disease prediction, traditional biological experiments are laborious and expensive. Therefore, this study proposes a novel method called AGAEMD (node-level Attention Graph Auto-Encoder to predict potential MiRNA Disease associations). We first create a heterogeneous matrix incorporating miRNA similarity, disease similarity, and known miRNA-disease associations. Then these matrixes are input into a node-level attention encoder-decoder network which utilizes low dimensional dense embeddings to represent nodes and calculate association scores. To verify the effectiveness of the proposed method, we conduct a series of experiments on two benchmark datasets (the Human MicroRNA Disease Database v2.0 and v3.2) and report the averages over 10 runs in comparison with several state-of-the-art methods. Experimental results have demonstrated the excellent performance of AGAEMD in comparison with other methods. Three important diseases (Colon Neoplasms, Lung Neoplasms, Lupus Vulgaris) were applied in case studies. The results comfirm the reliable predictive performance of AGAEMD.
Collapse
|
19
|
Luo Y, Peng L, Shan W, Sun M, Luo L, Liang W. Machine learning in the development of targeting microRNAs in human disease. Front Genet 2023; 13:1088189. [PMID: 36685965 PMCID: PMC9845262 DOI: 10.3389/fgene.2022.1088189] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Accepted: 12/12/2022] [Indexed: 01/05/2023] Open
Abstract
A microRNA is a small, single-stranded, non-coding ribonucleic acid that plays a crucial role in RNA silencing and can regulate gene expression. With the in-depth study of miRNA in development and disease, miRNA has become an attractive target for novel therapeutic strategies. Exploring miRNA targeting therapy only through experiments is expensive and laborious, so it is essential to develop novel and efficient computational methods to narrow down the search. Recent advances in machine learning applied in biomedical informatics provide opportunities to explore miRNA-targeting drugs, thus promoting miRNA therapeutics. This review provides an overview of recent advancements in miRNA targeting therapeutic using machine learning. First, we mainly describe the basics of predicting miRNA targeting drugs, including pharmacogenomic data resources and data preprocessing. Then we present primary machine learning algorithms and elaborate their application in discovering relationships among miRNAs, drugs, and diseases. Along with the progress of miRNA targeting therapeutics, we finally analyze and discuss the current challenges and opportunities that machine learning confronts.
Collapse
Affiliation(s)
- Yuxun Luo
- School of Computer Science and Engineering, Hunan University of Science and Technology, Xiangtan, China,Hunan Key Laboratory for Service computing and Novel Software Technology, Xiangtan, China
| | - Li Peng
- School of Computer Science and Engineering, Hunan University of Science and Technology, Xiangtan, China,Hunan Key Laboratory for Service computing and Novel Software Technology, Xiangtan, China
| | - Wenyu Shan
- School of Computer Science, University of South China, Hengyang, China
| | - Mengyue Sun
- School of Polymer Science and Polymer Engineering, The University of Akron, Akron, OH, United States
| | - Lingyun Luo
- School of Computer Science, University of South China, Hengyang, China
| | - Wei Liang
- School of Computer Science and Engineering, Hunan University of Science and Technology, Xiangtan, China,Hunan Key Laboratory for Service computing and Novel Software Technology, Xiangtan, China,*Correspondence: Wei Liang,
| |
Collapse
|
20
|
Yu L, Ju B, Ren S. HLGNN-MDA: Heuristic Learning Based on Graph Neural Networks for miRNA-Disease Association Prediction. Int J Mol Sci 2022; 23:13155. [PMID: 36361945 PMCID: PMC9657597 DOI: 10.3390/ijms232113155] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2022] [Revised: 10/23/2022] [Accepted: 10/26/2022] [Indexed: 01/12/2024] Open
Abstract
Identifying disease-related miRNAs can improve the understanding of complex diseases. However, experimentally finding the association between miRNAs and diseases is expensive in terms of time and resources. The computational screening of reliable miRNA-disease associations has thus become a necessary tool to guide biological experiments. "Similar miRNAs will be associated with the same disease" is the assumption on which most current miRNA-disease association prediction methods rely; however, biased prior knowledge, and incomplete and inaccurate miRNA similarity data and disease similarity data limit the performance of the model. Here, we propose heuristic learning based on graph neural networks to predict microRNA-disease associations (HLGNN-MDA). We learn the local graph topology features of the predicted miRNA-disease node pairs using graph neural networks. In particular, our improvements to the graph convolution layer of the graph neural network enable it to learn information among homogeneous nodes and among heterogeneous nodes. We illustrate the performance of HLGNN-MDA by performing tenfold cross-validation against excellent baseline models. The results show that we have promising performance in multiple metrics. We also focus on the role of the improvements to the graph convolution layer in the model. The case studies are supported by evidence on breast cancer, hepatocellular carcinoma and renal cell carcinoma. Given the above, the experiments demonstrate that HLGNN-MDA can serve as a reliable method to identify novel miRNA-disease associations.
Collapse
Affiliation(s)
- Liang Yu
- School of Computer Science and Technology, Xidian University, Xi’an 710071, China
| | | | | |
Collapse
|
21
|
Zhang Y, Ye F, Gao X. MCA-Net: Multi-Feature Coding and Attention Convolutional Neural Network for Predicting lncRNA-Disease Association. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:2907-2919. [PMID: 34283719 DOI: 10.1109/tcbb.2021.3098126] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
With the advent of the era of big data, it is troublesome to accurately predict the associations between lncRNAs and diseases based on traditional biological experiments due to its time-consuming and subjective. In this paper, we propose a novel deep learning method for predicting lncRNA-disease associations using multi-feature coding and attention convolutional neural network (MCA-Net). We first calculate six similarity features to extract different types of lncRNA and disease feature information. Second, a multi-feature coding method is proposed to construct the feature vectors of lncRNA-disease association samples by integrating the six similarity features. Furthermore, an attention convolutional neural network is developed to identify lncRNA-disease associations under 10-fold cross-validation. Finally, we evaluate the performance of MCA-Net from different perspectives including the effects of the model parameters, distinct deep learning models, and the necessity of attention mechanism. We also compare MCA-Net with several state-of-the-art methods on three publicly available datasets, i.e., LncRNADisease, Lnc2Cancer, and LncRNADisease2.0. The results show that our MCA-Net outperforms the state-of-the-art methods on all three dataset. Besides, case studies on breast cancer and lung cancer further verify that MCA-Net is effective and accurate for the lncRNA-disease association prediction.
Collapse
|
22
|
Yang Y, Shang J, Sun Y, Li F, Zhang Y, Kong XZ, Li S, Liu JX. TLNPMD: Prediction of miRNA-Disease Associations Based on miRNA-Drug-Disease Three-Layer Heterogeneous Network. Molecules 2022; 27:4371. [PMID: 35889243 PMCID: PMC9324587 DOI: 10.3390/molecules27144371] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2022] [Accepted: 07/06/2022] [Indexed: 12/10/2022] Open
Abstract
Many microRNAs (miRNAs) have been confirmed to be associated with the generation of human diseases. Capturing miRNA-disease associations (M-DAs) provides an effective way to understand the etiology of diseases. Many models for predicting M-DAs have been constructed; nevertheless, there are still several limitations, such as generally considering direct information between miRNAs and diseases, usually ignoring potential knowledge hidden in isolated miRNAs or diseases. To overcome these limitations, in this study a novel method for predicting M-DAs was developed named TLNPMD, highlights of which are the introduction of drug heuristic information and a bipartite network reconstruction strategy. Specifically, three bipartite networks, including drug-miRNA, drug-disease, and miRNA-disease, were reconstructed as weighted ones using such reconstruction strategy. Based on these weighted bipartite networks, as well as three corresponding similarity networks of drugs, miRNAs and diseases, the miRNA-drug-disease three-layer heterogeneous network was constructed. Then, this heterogeneous network was converted into three two-layer heterogeneous networks, for each of which the network path computational model was employed to predict association scores. Finally, both direct and indirect miRNA-disease paths were used to predict M-DAs. Comparative experiments of TLNPMD and other four models were performed and evaluated by five-fold and global leave-one-out cross validations, results of which show that TLNPMD has the highest AUC values among those of compared methods. In addition, case studies of two common diseases were carried out to validate the effectiveness of the TLNPMD. These experiments demonstrate that the TLNPMD may serve as a promising alternative to existing methods for predicting M-DAs.
Collapse
Affiliation(s)
- Yi Yang
- School of Computer Science, Qufu Normal University, Rizhao 276826, China; (Y.Y.); (Y.S.); (F.L.); (X.-Z.K.); (S.L.); (J.-X.L.)
| | - Junliang Shang
- School of Computer Science, Qufu Normal University, Rizhao 276826, China; (Y.Y.); (Y.S.); (F.L.); (X.-Z.K.); (S.L.); (J.-X.L.)
| | - Yan Sun
- School of Computer Science, Qufu Normal University, Rizhao 276826, China; (Y.Y.); (Y.S.); (F.L.); (X.-Z.K.); (S.L.); (J.-X.L.)
| | - Feng Li
- School of Computer Science, Qufu Normal University, Rizhao 276826, China; (Y.Y.); (Y.S.); (F.L.); (X.-Z.K.); (S.L.); (J.-X.L.)
| | - Yuanyuan Zhang
- School of Information and Control Engineering, Qingdao University of Technology, Qingdao 266520, China;
| | - Xiang-Zhen Kong
- School of Computer Science, Qufu Normal University, Rizhao 276826, China; (Y.Y.); (Y.S.); (F.L.); (X.-Z.K.); (S.L.); (J.-X.L.)
| | - Shengjun Li
- School of Computer Science, Qufu Normal University, Rizhao 276826, China; (Y.Y.); (Y.S.); (F.L.); (X.-Z.K.); (S.L.); (J.-X.L.)
| | - Jin-Xing Liu
- School of Computer Science, Qufu Normal University, Rizhao 276826, China; (Y.Y.); (Y.S.); (F.L.); (X.-Z.K.); (S.L.); (J.-X.L.)
| |
Collapse
|
23
|
Liu Y, Yu Y, Zhao S. Dual Attention Mechanisms and Feature Fusion Networks Based Method for Predicting LncRNA-Disease Associations. Interdiscip Sci 2022; 14:358-371. [PMID: 35067893 DOI: 10.1007/s12539-021-00492-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2021] [Revised: 11/02/2021] [Accepted: 11/07/2021] [Indexed: 11/30/2022]
Abstract
LncRNAs play a part in numerous momentous processes of biology such as disease diagnoses, preventions and treatments. The associations between various diseases and lncRNAs are one of the crucial approaches to learn the role and status of lncRNAs in human diseases. With the researches on lncRNA and diseases, multiple methods based on neural network have been employed to predict these associations. However, the deep and complicated characteristic representations of lncRNA-disease associations were failed to be extracted, and the discriminative contributions of the interactions, correlations, and similarities among miRNAs diseases, and lncRNAs for the correlation predictions were ignored. In this paper, based on the multibiology premise of lncRNAs, miRNAs, and diseases, a dual attention network was proposed to predict the model of lncRNA-disease associations for miRNAs, the disease characteristic matrix, and lncRNAs. Through two attention modules, we enable the model to learn the nonlinear, more complex and useful features of lncRNA, miRNA, and disease characteristic matrix. For the feature embedding matrix composed of lncRNA-disease, the connection between lncRNA-disease feature embedding matrix and lncRNA, miRNA, and disease characteristic matrix was enhanced through deconvolution and feature fusion layer. Compared with several latest methods, the method proposed in this paper can produce better performance. Researches on the cases of osteosarcoma, lung cancer, and gastric cancer have confirmed the effective recognition of potential lncRNA-disease associations.
Collapse
Affiliation(s)
- Yu Liu
- Dalian Key Lab of Digital Technology for National Culture, Dalian Minzu University, Dalian, 116600, China. .,Universiti Putra Malaysia, 43400 UPM Serdang, Selangor, Darul Ehsan, Malaysia.
| | - Yingying Yu
- Dalian Key Lab of Digital Technology for National Culture, Dalian Minzu University, Dalian, 116600, China
| | - Shimin Zhao
- Guangxi Vocational and Technical College, Nanning, 530000, Guangxi, China
| |
Collapse
|
24
|
Zhong J, Zhou W, Kang J, Fang Z, Xie M, Xiao Q, Peng W. DNRLCNN: A CNN Framework for Identifying MiRNA-Disease Associations Using Latent Feature Matrix Extraction with Positive Samples. Interdiscip Sci 2022; 14:607-622. [PMID: 35428965 DOI: 10.1007/s12539-022-00509-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Revised: 02/24/2022] [Accepted: 03/01/2022] [Indexed: 06/14/2023]
Abstract
Emerging evidence indicates that miRNAs have strong relationships with many human diseases. Investigating the associations will contribute to elucidating the activities of miRNAs and pathogenesis mechanisms, and providing new opportunities for disease diagnosis and drug discovery. Therefore, it is of significance to identify potential associations between miRNAs and diseases. The existing databases about the miRNA-disease associations (MDAs) only provide the known MDAs, which can be regarded as positive samples. However, the unknown MDAs are not sufficient to regard as reliable negative samples. To deal with this uncertainty, we proposed a convolutional neural network (CNN) framework, named DNRLCNN, based on a latent feature matrix extracted by only positive samples to predict MDAs. First, by only considering the positive samples into the calculation process, we captured the latent feature matrix for complex interactions between miRNAs and diseases in low-dimensional space. Then, we constructed a feature vector for each miRNA and disease pair based on the feature representation. Finally, we adopted a modified CNN for the feature vector to predict MDAs. As a result, our model achieves better performance than other state-of-the-art methods which based CNN in fivefold cross-validation on both miRNA-disease association prediction task (average AUC of 0.9030) and miRNA-phenotype association prediction task (average AUC of 0. 9442). In addition, we carried out case studies on two human diseases, and all the top-50 predicted miRNAs for lung neoplasms are confirmed by HMDD v3.2 and dbDEMC 2.0 databases, 98% of the top-50 predicted miRNAs for heart failure are confirmed. The experiment results show that our model has the capability of inferring potential disease-related miRNAs.
Collapse
Affiliation(s)
- Jiancheng Zhong
- College of Information Science and Engineering, Hunan Normal University, Changsha, 410083, China
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha, 410083, China
| | - Wubin Zhou
- College of Information Science and Engineering, Hunan Normal University, Changsha, 410083, China
| | - Jiedong Kang
- College of Information Science and Engineering, Hunan Normal University, Changsha, 410083, China
| | - Zhuo Fang
- College of Information Science and Engineering, Hunan Normal University, Changsha, 410083, China
| | - Minzhu Xie
- College of Information Science and Engineering, Hunan Normal University, Changsha, 410083, China
| | - Qiu Xiao
- College of Information Science and Engineering, Hunan Normal University, Changsha, 410083, China.
| | - Wei Peng
- Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, 650500, China.
| |
Collapse
|
25
|
Yan C, Duan G, Li N, Zhang L, Wu FX, Wang J. PDMDA: predicting deep-level miRNA-disease associations with graph neural networks and sequence features. Bioinformatics 2022; 38:2226-2234. [PMID: 35150255 DOI: 10.1093/bioinformatics/btac077] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2021] [Revised: 01/18/2022] [Accepted: 02/05/2022] [Indexed: 02/03/2023] Open
Abstract
MOTIVATION Many studies have shown that microRNAs (miRNAs) play a key role in human diseases. Meanwhile, traditional experimental methods for miRNA-disease association identification are extremely costly, time-consuming and challenging. Therefore, many computational methods have been developed to predict potential associations between miRNAs and diseases. However, those methods mainly predict the existence of miRNA-disease associations, and they cannot predict the deep-level miRNA-disease association types. RESULTS In this study, we propose a new end-to-end deep learning method (called PDMDA) to predict deep-level miRNA-disease associations with graph neural networks (GNNs) and miRNA sequence features. Based on the sequence and structural features of miRNAs, PDMDA extracts the miRNA feature representations by a fully connected network (FCN). The disease feature representations are extracted from the disease-gene network and gene-gene interaction network by GNN model. Finally, a multilayer with three fully connected layers and a softmax layer is designed to predict the final miRNA-disease association scores based on the concatenated feature representations of miRNAs and diseases. Note that PDMDA does not take the miRNA-disease association matrix as input to compute the Gaussian interaction profile similarity. We conduct three experiments based on six association type samples (including circulations, epigenetics, target, genetics, known association of which their types are unknown and unknown association samples). We conduct fivefold cross-validation validation to assess the prediction performance of PDMDA. The area under the receiver operating characteristic curve scores is used as metric. The experiment results show that PDMDA can accurately predict the deep-level miRNA-disease associations. AVAILABILITY AND IMPLEMENTATION Data and source codes are available at https://github.com/27167199/PDMDA.
Collapse
Affiliation(s)
- Cheng Yan
- School of Information Science and Engineering, Hunan University of Chinese Medicine, Changsha 410208, China.,School of Computer Science and Engineering, Hunan Provincial Key Lab on Bioinformatics, Central South University, Changsha 410083, China
| | - Guihua Duan
- School of Computer Science and Engineering, Hunan Provincial Key Lab on Bioinformatics, Central South University, Changsha 410083, China
| | - Na Li
- School of Computer Science and Engineering, Hunan Provincial Key Lab on Bioinformatics, Central South University, Changsha 410083, China
| | - Lishen Zhang
- School of Computer Science and Engineering, Hunan Provincial Key Lab on Bioinformatics, Central South University, Changsha 410083, China
| | - Fang-Xiang Wu
- Division of Biomedical Engineering and Department of Mechanical Engineering, University of Saskatchewan, Saskatoon SK S7N5A9, Canada
| | - Jianxin Wang
- School of Computer Science and Engineering, Hunan Provincial Key Lab on Bioinformatics, Central South University, Changsha 410083, China
| |
Collapse
|
26
|
Yu S, Wang H, Liu T, Liang C, Luo J. A knowledge-driven network for fine-grained relationship detection between miRNA and disease. Brief Bioinform 2022; 23:6551111. [PMID: 35323892 DOI: 10.1093/bib/bbac058] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Revised: 01/30/2022] [Accepted: 02/04/2022] [Indexed: 12/12/2022] Open
Abstract
Increasing biological evidence indicated that microRNAs (miRNAs) play a vital role in exploring the pathogenesis of various human diseases (especially in tumors). Mining disease-related miRNAs is of great significance for the clinical diagnosis and treatment of diseases. Compared with the traditional experimental methods with the significant limitations of high cost, long cycle and small scale, the methods based on computing have the advantages of being cost-effective. However, although the current methods based on computational biology can accurately predict the correlation between miRNAs and disease, they can not predict the detailed association information at a fine level. We propose a knowledge-driven approach to the fine-grained prediction of disease-related miRNAs (KDFGMDA). Different from the previous methods, this method can finely predict the clear associations between miRNA and disease, such as upregulation, downregulation or dysregulation. Specifically, KDFGMDA extracts triple information from massive experimental data and existing datasets to construct a knowledge graph and then trains a depth graph representation learning model based on knowledge graph to complete fine-grained prediction tasks. Experimental results show that KDFGMDA can predict the relationship between miRNA and disease accurately, which is of far-reaching significance for medical clinical research and early diagnosis, prevention and treatment of diseases. Additionally, the results of case studies on three types of cancers, Kaplan-Meier survival analysis and expression difference analysis further provide the effectiveness and feasibility of KDFGMDA to detect potential candidate miRNAs. Availability: Our work can be downloaded from https://github.com/ShengPengYu/KDFGMDA.
Collapse
Affiliation(s)
- Shengpeng Yu
- School of Information Science and Engineering, Shandong Normal University, Jinan, 250358, China
| | - Hong Wang
- School of Information Science and Engineering, Shandong Normal University, Jinan, 250358, China
| | - Tianyu Liu
- School of Information Science and Engineering, Shandong Normal University, Jinan, 250358, China
| | - Cheng Liang
- School of Information Science and Engineering, Shandong Normal University, Jinan, 250358, China
| | - Jiawei Luo
- School of Information Science and Engineering, Hunan University, Changsha, 410082, China
| |
Collapse
|
27
|
Yu L, Zheng Y, Ju B, Ao C, Gao L. Research progress of miRNA-disease association prediction and comparison of related algorithms. Brief Bioinform 2022; 23:6542222. [PMID: 35246678 DOI: 10.1093/bib/bbac066] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2021] [Revised: 01/30/2022] [Accepted: 02/08/2022] [Indexed: 11/13/2022] Open
Abstract
With an in-depth understanding of noncoding ribonucleic acid (RNA), many studies have shown that microRNA (miRNA) plays an important role in human diseases. Because traditional biological experiments are time-consuming and laborious, new calculation methods have recently been developed to predict associations between miRNA and diseases. In this review, we collected various miRNA-disease association prediction models proposed in recent years and used two common data sets to evaluate the performance of the prediction models. First, we systematically summarized the commonly used databases and similarity data for predicting miRNA-disease associations, and then divided the various calculation models into four categories for summary and detailed introduction. In this study, two independent datasets (D5430 and D6088) were compiled to systematically evaluate 11 publicly available prediction tools for miRNA-disease associations. The experimental results indicate that the methods based on information dissemination and the method based on scoring function require shorter running time. The method based on matrix transformation often requires a longer running time, but the overall prediction result is better than the previous two methods. We hope that the summary of work related to miRNA and disease will provide comprehensive knowledge for predicting the relationship between miRNA and disease and contribute to advanced computation tools in the future.
Collapse
Affiliation(s)
- Liang Yu
- School of Computer Science and Technology, Xidian University, Xi'an, China
| | - Yujia Zheng
- School of Computer Science and Technology, Xidian University, Xi'an, China
| | - Bingyi Ju
- School of Computer Science and Technology, Xidian University, Xi'an, China
| | - Chunyan Ao
- School of Computer Science and Technology, Xidian University, Xi'an, China
| | - Lin Gao
- School of Computer Science and Technology, Xidian University, Xi'an, China
| |
Collapse
|
28
|
Fu L, Cao Y, Wu J, Peng Q, Nie Q, Xie X. UFold: fast and accurate RNA secondary structure prediction with deep learning. Nucleic Acids Res 2022; 50:e14. [PMID: 34792173 PMCID: PMC8860580 DOI: 10.1093/nar/gkab1074] [Citation(s) in RCA: 82] [Impact Index Per Article: 27.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2021] [Revised: 09/15/2021] [Accepted: 10/19/2021] [Indexed: 11/13/2022] Open
Abstract
For many RNA molecules, the secondary structure is essential for the correct function of the RNA. Predicting RNA secondary structure from nucleotide sequences is a long-standing problem in genomics, but the prediction performance has reached a plateau over time. Traditional RNA secondary structure prediction algorithms are primarily based on thermodynamic models through free energy minimization, which imposes strong prior assumptions and is slow to run. Here, we propose a deep learning-based method, called UFold, for RNA secondary structure prediction, trained directly on annotated data and base-pairing rules. UFold proposes a novel image-like representation of RNA sequences, which can be efficiently processed by Fully Convolutional Networks (FCNs). We benchmark the performance of UFold on both within- and cross-family RNA datasets. It significantly outperforms previous methods on within-family datasets, while achieving a similar performance as the traditional methods when trained and tested on distinct RNA families. UFold is also able to predict pseudoknots accurately. Its prediction is fast with an inference time of about 160 ms per sequence up to 1500 bp in length. An online web server running UFold is available at https://ufold.ics.uci.edu. Code is available at https://github.com/uci-cbcl/UFold.
Collapse
Affiliation(s)
- Laiyi Fu
- Systems Engineering Institute, School of Electronic and Information Engineering, Xi’an Jiaotong University, Xi’an, Shaanxi 710049, China
- Department of Computer Science, University of California, Irvine, CA 92697, USA
| | - Yingxin Cao
- Department of Computer Science, University of California, Irvine, CA 92697, USA
- Center for Complex Biological Systems, University of California, Irvine, CA 92697, USA
- NSF-Simons Center for Multiscale Cell Fate Research, University of California, Irvine, CA 92697, USA
| | - Jie Wu
- Department of Biological Chemistry, University of California, Irvine, CA 92697, USA
| | - Qinke Peng
- Systems Engineering Institute, School of Electronic and Information Engineering, Xi’an Jiaotong University, Xi’an, Shaanxi 710049, China
| | - Qing Nie
- Department of Mathematics, University of California, Irvine, CA 92697, USA
- Center for Complex Biological Systems, University of California, Irvine, CA 92697, USA
- NSF-Simons Center for Multiscale Cell Fate Research, University of California, Irvine, CA 92697, USA
| | - Xiaohui Xie
- Department of Computer Science, University of California, Irvine, CA 92697, USA
| |
Collapse
|
29
|
Yu L, Zheng Y, Gao L. MiRNA-disease association prediction based on meta-paths. Brief Bioinform 2022; 23:6501422. [PMID: 35018405 DOI: 10.1093/bib/bbab571] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Revised: 12/02/2021] [Accepted: 12/11/2021] [Indexed: 01/09/2023] Open
Abstract
Since miRNAs can participate in the posttranscriptional regulation of gene expression, they may provide ideas for the development of new drugs or become new biomarkers for drug targets or disease diagnosis. In this work, we propose an miRNA-disease association prediction method based on meta-paths (MDPBMP). First, an miRNA-disease-gene heterogeneous information network was constructed, and seven symmetrical meta-paths were defined according to different semantics. After constructing the initial feature vector for the node, the vector information carried by all nodes on the meta-path instance is extracted and aggregated to update the feature vector of the starting node. Then, the vector information obtained by the nodes on different meta-paths is aggregated. Finally, miRNA and disease embedding feature vectors are used to calculate their associated scores. Compared with the other methods, MDPBMP obtained the highest AUC value of 0.9214. Among the top 50 predicted miRNAs for lung neoplasms, esophageal neoplasms, colon neoplasms and breast neoplasms, 49, 48, 49 and 50 have been verified. Furthermore, for breast neoplasms, we deleted all the known associations between breast neoplasms and miRNAs from the training set. These results also show that for new diseases without known related miRNA information, our model can predict their potential miRNAs. Code and data are available at https://github.com/LiangYu-Xidian/MDPBMP.
Collapse
Affiliation(s)
- Liang Yu
- School of Computer Science and Technology, Xidian University, Xi'an 710071, P.R. China
| | - Yujia Zheng
- School of Computer Science and Technology, Xidian University, Xi'an 710071, P.R. China
| | - Lin Gao
- School of Computer Science and Technology, Xidian University, Xi'an 710071, P.R. China
| |
Collapse
|
30
|
MicroRNA-146-5p Promotes Pulmonary Artery Endothelial Cell Proliferation under Hypoxic Conditions through Regulating USP3. DISEASE MARKERS 2021; 2021:3668422. [PMID: 34917199 PMCID: PMC8670967 DOI: 10.1155/2021/3668422] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/13/2021] [Accepted: 11/19/2021] [Indexed: 12/12/2022]
Abstract
Objective MicroRNAs play a pivotal role in the progression of pulmonary hypertension (PAH). Although microRNA-146-5p is specifically expressed in many diseases, but in PAH, its role remains elusive. Patients and Methods. 30 patients with PAH and 20 healthy volunteers in our hospital were enrolled, and their serum samples were extracted for the detection of microRNA-146-5p and ubiquitin specific protease 3 (USP3) expression. In addition, the interaction between microRNA-146-5p and USP3 was examined by luciferase reporting assay. Furthermore, the potential mechanism was explored by cell counting kit-8 (CCK-8), 5-ethynyl-2′-deoxyuridine (EdU), and Western blotting experiments. Results It was found that microRNA-146-5p was higher in PAH patients than in healthy volunteers. Meanwhile, in hypoxia-induced human pulmonary artery endothelial cell lines (HPAECs), microRNA-146-5p expression was dramatically downregulated while USP3 protein expression was conversely upregulated. Under hypoxic conditions, microRNA-146-5p mimics was able to prompt the growth of HPAECs. In addition, after overexpression of microRNA-146-5p, luciferase reporting assay revealed a reduced luciferase activity of the reporter gene containing the USP3 3′-untranslated region, and a reduction of USP3 protein expression was also confirmed. However, USP3 overexpression partially attenuated the impact of upregulated microRNA-146-5p on the proliferation capacity of HPAECs. Conclusions MicroRNA-146-5p was able to enhance the proliferation ability of HPAEC cells under hypoxic conditions through targeting USP3, suggesting the microRNA-146-5p/USP3 axis may act as a target for PAH treatment.
Collapse
|
31
|
Zhou F, Yin MM, Jiao CN, Cui Z, Zhao JX, Liu JX. Bipartite graph-based collaborative matrix factorization method for predicting miRNA-disease associations. BMC Bioinformatics 2021; 22:573. [PMID: 34837953 PMCID: PMC8627000 DOI: 10.1186/s12859-021-04486-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2020] [Accepted: 11/17/2021] [Indexed: 01/15/2023] Open
Abstract
BACKGROUND With the rapid development of various advanced biotechnologies, researchers in related fields have realized that microRNAs (miRNAs) play critical roles in many serious human diseases. However, experimental identification of new miRNA-disease associations (MDAs) is expensive and time-consuming. Practitioners have shown growing interest in methods for predicting potential MDAs. In recent years, an increasing number of computational methods for predicting novel MDAs have been developed, making a huge contribution to the research of human diseases and saving considerable time. In this paper, we proposed an efficient computational method, named bipartite graph-based collaborative matrix factorization (BGCMF), which is highly advantageous for predicting novel MDAs. RESULTS By combining two improved recommendation methods, a new model for predicting MDAs is generated. Based on the idea that some new miRNAs and diseases do not have any associations, we adopt the bipartite graph based on the collaborative matrix factorization method to complete the prediction. The BGCMF achieves a desirable result, with AUC of up to 0.9514 ± (0.0007) in the five-fold cross-validation experiments. CONCLUSIONS Five-fold cross-validation is used to evaluate the capabilities of our method. Simulation experiments are implemented to predict new MDAs. More importantly, the AUC value of our method is higher than those of some state-of-the-art methods. Finally, many associations between new miRNAs and new diseases are successfully predicted by performing simulation experiments, indicating that BGCMF is a useful method to predict more potential miRNAs with roles in various diseases.
Collapse
Affiliation(s)
- Feng Zhou
- The School of Computer Science, Qufu Normal University, Rizhao, 276826, China
| | - Meng-Meng Yin
- The School of Computer Science, Qufu Normal University, Rizhao, 276826, China
| | - Cui-Na Jiao
- The School of Computer Science, Qufu Normal University, Rizhao, 276826, China
| | - Zhen Cui
- The School of Computer Science, Qufu Normal University, Rizhao, 276826, China
| | - Jing-Xiu Zhao
- The School of Computer Science, Qufu Normal University, Rizhao, 276826, China
| | - Jin-Xing Liu
- The School of Computer Science, Qufu Normal University, Rizhao, 276826, China.
| |
Collapse
|
32
|
Kaczmarek E, Pyman B, Nanayakkara J, Tuschl T, Tyryshkin K, Renwick N, Mousavi P. Discriminating Neoplastic from Nonneoplastic Tissues Using an miRNA-Based Deep Cancer Classifier. THE AMERICAN JOURNAL OF PATHOLOGY 2021; 192:344-352. [PMID: 34774515 DOI: 10.1016/j.ajpath.2021.10.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/15/2021] [Revised: 10/07/2021] [Accepted: 10/13/2021] [Indexed: 10/19/2022]
Abstract
Next-generation sequencing has enabled the collection of large biological data sets, allowing novel molecular-based classification methods to be developed for increased understanding of disease. miRNAs are small regulatory RNA molecules that can be quantified using next-generation sequencing and are excellent classificatory markers. Herein, we adapt a deep cancer classifier (DCC) to differentiate neoplastic from nonneoplastic samples using comprehensive miRNA expression profiles from 1031 human breast and skin tissue samples. The classifier was fine-tuned and evaluated using 750 neoplastic and 281 nonneoplastic breast and skin tissue samples. Performance of the DCC was compared with two machine-learning classifiers: support vector machine and random forests. In addition, performance of feature extraction through the DCC was also compared with a developed feature selection algorithm, cancer specificity. The DCC had the highest performance of area under the receiver operating curve and high performance in both sensitivity and specificity, unlike machine-learning and feature selection models, which often performed well in one metric compared with the other. In particular, deep learning was shown to have noticeable advantages with highly heterogeneous data sets. In addition, our cancer specificity algorithm identified candidate biomarkers for differentiating neoplastic and nonneoplastic tissue samples (eg, miR-144 and miR-375 in breast cancer and miR-375 and miR-451 in skin cancer).
Collapse
Affiliation(s)
- Emily Kaczmarek
- Medical Informatics Laboratory, School of Computing, Queen's University, Kingston, Ontario, Canada.
| | - Blake Pyman
- Medical Informatics Laboratory, School of Computing, Queen's University, Kingston, Ontario, Canada
| | - Jina Nanayakkara
- Laboratory of Translational RNA Biology, Department of Pathology and Molecular Medicine, Queen's University, Kingston, Ontario, Canada
| | - Thomas Tuschl
- Laboratory of RNA Molecular Biology, Rockefeller University, New York, New York
| | - Kathrin Tyryshkin
- Laboratory of Translational RNA Biology, Department of Pathology and Molecular Medicine, Queen's University, Kingston, Ontario, Canada
| | - Neil Renwick
- Laboratory of Translational RNA Biology, Department of Pathology and Molecular Medicine, Queen's University, Kingston, Ontario, Canada.
| | - Parvin Mousavi
- Medical Informatics Laboratory, School of Computing, Queen's University, Kingston, Ontario, Canada
| |
Collapse
|
33
|
Wu Y, Zhu D, Wang X, Zhang S. An ensemble learning framework for potential miRNA-disease association prediction with positive-unlabeled data. Comput Biol Chem 2021; 95:107566. [PMID: 34534906 DOI: 10.1016/j.compbiolchem.2021.107566] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2021] [Revised: 08/13/2021] [Accepted: 08/18/2021] [Indexed: 11/17/2022]
Abstract
To explore the pathogenic mechanisms of MicroRNA (miRNA) on diverse diseases, many researchers have concentrated on discovering the potential associations between miRNA and disease using machine learning methods. However, the prediction accuracy of supervised machine learning methods is limited by lacking of experimentally-validated uncorrelated miRNA-disease pairs. Without these negative samples, training a highly accurate model is much more difficult. Different from traditional miRNA-disease prediction models using randomly selected unknown samples as negative training samples, we propose an ensemble learning framework to solve this positive-unlabeled (PU) learning problem. The framework incorporates two steps, i.e., a novel semi-supervised Kmeans (SS-Kmeans) to extract reliable negative samples from unknown miRNA-disease pairs and subagging method to generate diverse training sample sets to make full use of those reliable negative samples for ensemble learning. Combined with effective random vector functional link (RVFL) network as prediction model, the proposed framework showed superior prediction accuracy comparing with other popular approaches. A case study on lung and gastric neoplasms further confirms the framework's efficacy at identifying miRNA disease associations.
Collapse
Affiliation(s)
- Yao Wu
- School of Management and Economics, Beijing Institute of Technology, Beijing 100081, China
| | - Donghua Zhu
- School of Management and Economics, Beijing Institute of Technology, Beijing 100081, China
| | - Xuefeng Wang
- School of Management and Economics, Beijing Institute of Technology, Beijing 100081, China.
| | - Shuo Zhang
- School of Management and Economics, Beijing Institute of Technology, Beijing 100081, China
| |
Collapse
|
34
|
Ji C, Wang Y, Ni J, Zheng C, Su Y. Predicting miRNA-Disease Associations Based on Heterogeneous Graph Attention Networks. Front Genet 2021; 12:727744. [PMID: 34512733 PMCID: PMC8424198 DOI: 10.3389/fgene.2021.727744] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2021] [Accepted: 08/02/2021] [Indexed: 11/23/2022] Open
Abstract
In recent years, more and more evidence has shown that microRNAs (miRNAs) play an important role in the regulation of post-transcriptional gene expression, and are closely related to human diseases. Many studies have also revealed that miRNAs can be served as promising biomarkers for the potential diagnosis and treatment of human diseases. The interactions between miRNA and human disease have rarely been demonstrated, and the underlying mechanism of miRNA is not clear. Therefore, computational approaches has attracted the attention of researchers, which can not only save time and money, but also improve the efficiency and accuracy of biological experiments. In this work, we proposed a Heterogeneous Graph Attention Networks (GAT) based method for miRNA-disease associations prediction, named HGATMDA. We constructed a heterogeneous graph for miRNAs and diseases, introduced weighted DeepWalk and GAT methods to extract features of miRNAs and diseases from the graph. Moreover, a fully-connected neural networks is used to predict correlation scores between miRNA-disease pairs. Experimental results under five-fold cross validation (five-fold CV) showed that HGATMDA achieved better prediction performance than other state-of-the-art methods. In addition, we performed three case studies on breast neoplasms, lung neoplasms and kidney neoplasms. The results showed that for the three diseases mentioned above, 50 out of top 50 candidates were confirmed by the validation datasets. Therefore, HGATMDA is suitable as an effective tool to identity potential diseases-related miRNAs.
Collapse
Affiliation(s)
- Cunmei Ji
- School of Cyber Science and Engineering, Qufu Normal University, Qufu, China
| | - Yutian Wang
- School of Cyber Science and Engineering, Qufu Normal University, Qufu, China
| | - Jiancheng Ni
- School of Cyber Science and Engineering, Qufu Normal University, Qufu, China
| | - Chunhou Zheng
- School of Artificial Intelligence, Anhui University, Hefei, China
| | - Yansen Su
- School of Artificial Intelligence, Anhui University, Hefei, China
| |
Collapse
|
35
|
Tastsoglou S, Miliotis M, Kavakiotis I, Alexiou A, Gkotsi EC, Lambropoulou A, Lygnos V, Kotsira V, Maroulis V, Zisis D, Skoufos G, Hatzigeorgiou AG. PlasmiR: A Manual Collection of Circulating microRNAs of Prognostic and Diagnostic Value. Cancers (Basel) 2021; 13:cancers13153680. [PMID: 34359584 PMCID: PMC8345031 DOI: 10.3390/cancers13153680] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Revised: 07/11/2021] [Accepted: 07/19/2021] [Indexed: 11/16/2022] Open
Abstract
Only recently, microRNAs (miRNAs) were found to exist in traceable and distinctive amounts in the human circulatory system, bringing forth the intriguing possibility of using them as minimally invasive biomarkers. miRNAs are short non-coding RNAs that act as potent post-transcriptional regulators of gene expression. Extensive studies in cancer and other disease landscapes investigate the protective/pathogenic functions of dysregulated miRNAs, as well as their biomarker potential. A specialized resource amassing experimentally verified, circulating miRNA biomarkers does not exist. We queried the existing literature to identify articles assessing diagnostic/prognostic roles of miRNAs in blood, serum, or plasma samples. Articles were scrutinized in order to exclude instances lacking sufficient experimental documentation or employing no biomarker assessment methods. We incorporated information from more than 200 biomedical articles, annotating crucial meta-information including cohort sizes, inclusion-exclusion criteria, disease/healthy confirmation methods and quantification details. miRNAs and diseases were systematically characterized using reference resources. Our circulating miRNA biomarker collection is provided as an online database, plasmiR. It consists of 1021 entries regarding 251 miRNAs and 112 diseases. More than half of plasmiR's entries refer to cancerous and neoplastic conditions, 183 of them (32%) describing prognostic associations. plasmiR facilitates smart queries, emphasizing visualization and exploratory modes for all researchers.
Collapse
Affiliation(s)
- Spyros Tastsoglou
- Hellenic Pasteur Institute, 11521 Athens, Greece; (M.M.); (A.A.); (A.L.); (V.L.); (D.Z.); (G.S.)
- DIANA-Lab, Department of Computer Science and Biomedical Informatics, University of Thessaly, 35131 Lamia, Greece; (I.K.); (V.K.)
- Correspondence: (S.T.); (A.G.H.)
| | - Marios Miliotis
- Hellenic Pasteur Institute, 11521 Athens, Greece; (M.M.); (A.A.); (A.L.); (V.L.); (D.Z.); (G.S.)
- DIANA-Lab, Department of Computer Science and Biomedical Informatics, University of Thessaly, 35131 Lamia, Greece; (I.K.); (V.K.)
| | - Ioannis Kavakiotis
- DIANA-Lab, Department of Computer Science and Biomedical Informatics, University of Thessaly, 35131 Lamia, Greece; (I.K.); (V.K.)
| | - Athanasios Alexiou
- Hellenic Pasteur Institute, 11521 Athens, Greece; (M.M.); (A.A.); (A.L.); (V.L.); (D.Z.); (G.S.)
- DIANA-Lab, Department of Computer Science and Biomedical Informatics, University of Thessaly, 35131 Lamia, Greece; (I.K.); (V.K.)
| | - Eleni C. Gkotsi
- Department of Informatics and Telecommunications, Postgraduate Program: ‘Information Technologies in Medicine and Biology’, University of Athens, 15784 Athens, Greece; (E.C.G.); (V.M.)
| | - Anastasia Lambropoulou
- Hellenic Pasteur Institute, 11521 Athens, Greece; (M.M.); (A.A.); (A.L.); (V.L.); (D.Z.); (G.S.)
| | - Vasileios Lygnos
- Hellenic Pasteur Institute, 11521 Athens, Greece; (M.M.); (A.A.); (A.L.); (V.L.); (D.Z.); (G.S.)
| | - Vasiliki Kotsira
- DIANA-Lab, Department of Computer Science and Biomedical Informatics, University of Thessaly, 35131 Lamia, Greece; (I.K.); (V.K.)
| | - Vasileios Maroulis
- Department of Informatics and Telecommunications, Postgraduate Program: ‘Information Technologies in Medicine and Biology’, University of Athens, 15784 Athens, Greece; (E.C.G.); (V.M.)
| | - Dimitrios Zisis
- Hellenic Pasteur Institute, 11521 Athens, Greece; (M.M.); (A.A.); (A.L.); (V.L.); (D.Z.); (G.S.)
| | - Giorgos Skoufos
- Hellenic Pasteur Institute, 11521 Athens, Greece; (M.M.); (A.A.); (A.L.); (V.L.); (D.Z.); (G.S.)
- Department of Electrical and Computer Engineering, University of Thessaly, 38221 Volos, Greece
| | - Artemis G. Hatzigeorgiou
- Hellenic Pasteur Institute, 11521 Athens, Greece; (M.M.); (A.A.); (A.L.); (V.L.); (D.Z.); (G.S.)
- DIANA-Lab, Department of Computer Science and Biomedical Informatics, University of Thessaly, 35131 Lamia, Greece; (I.K.); (V.K.)
- Correspondence: (S.T.); (A.G.H.)
| |
Collapse
|
36
|
Classification of Breast Cancer and Breast Neoplasm Scenarios Based on Machine Learning and Sequence Features from lncRNAs-miRNAs-Diseases Associations. Interdiscip Sci 2021; 13:572-581. [PMID: 34152557 DOI: 10.1007/s12539-021-00451-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2021] [Revised: 04/28/2021] [Accepted: 06/09/2021] [Indexed: 10/21/2022]
Abstract
The influence of non-coding RNAs, such as lncRNAs (long non-coding RNAs) and miRNAs (microRNAs), is undeniable in several diseases, for example, in the formation of neoplasms and cancer scenarios. However, there are challenges due to the scarcity of validated datasets and the imbalance in the data. We found that the research of associations between miRNAs-lncRNAs and diseases is limited or done separately. In addition, those investigations, which use Machine Learning models joined with genomic sequence features extracted from miRNAs and lncRNAs, are few compared with using some methods such as genomic expression or Deep Learning techniques. In this paper, we propose a structure of using supervised and unsupervised machine learning models with genomic sequence features, such as k-mers, sequence alignments, and energy folding values, to validate miRNAs and lncRNAs association with breast cancer and neoplasms scenarios. Using One-Class SVM for outlier detection and comparing two supervised models such as SVM and Random Forest, we manage to obtain accuracy results of 95.44% for the One-class model, with 88.79% and 99.65% for the SVM and Random Forest models, respectively. The results showed a promising path for the study of sequence features interactions joined with Machine Learning models comparable to those found in the existing literature.
Collapse
|
37
|
Chu Y, Wang X, Dai Q, Wang Y, Wang Q, Peng S, Wei X, Qiu J, Salahub DR, Xiong Y, Wei DQ. MDA-GCNFTG: identifying miRNA-disease associations based on graph convolutional networks via graph sampling through the feature and topology graph. Brief Bioinform 2021; 22:6261915. [PMID: 34009265 DOI: 10.1093/bib/bbab165] [Citation(s) in RCA: 45] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2021] [Revised: 04/02/2021] [Accepted: 04/08/2021] [Indexed: 11/13/2022] Open
Abstract
Accurate identification of the miRNA-disease associations (MDAs) helps to understand the etiology and mechanisms of various diseases. However, the experimental methods are costly and time-consuming. Thus, it is urgent to develop computational methods towards the prediction of MDAs. Based on the graph theory, the MDA prediction is regarded as a node classification task in the present study. To solve this task, we propose a novel method MDA-GCNFTG, which predicts MDAs based on Graph Convolutional Networks (GCNs) via graph sampling through the Feature and Topology Graph to improve the training efficiency and accuracy. This method models both the potential connections of feature space and the structural relationships of MDA data. The nodes of the graphs are represented by the disease semantic similarity, miRNA functional similarity and Gaussian interaction profile kernel similarity. Moreover, we considered six tasks simultaneously on the MDA prediction problem at the first time, which ensure that under both balanced and unbalanced sample distribution, MDA-GCNFTG can predict not only new MDAs but also new diseases without known related miRNAs and new miRNAs without known related diseases. The results of 5-fold cross-validation show that the MDA-GCNFTG method has achieved satisfactory performance on all six tasks and is significantly superior to the classic machine learning methods and the state-of-the-art MDA prediction methods. Moreover, the effectiveness of GCNs via the graph sampling strategy and the feature and topology graph in MDA-GCNFTG has also been demonstrated. More importantly, case studies for two diseases and three miRNAs are conducted and achieved satisfactory performance.
Collapse
Affiliation(s)
- Yanyi Chu
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, China
| | - Xuhong Wang
- School of Electronic, Information and Electrical Engineering (SEIEE), Shanghai Jiao Tong University, China
| | - Qiuying Dai
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, China
| | - Yanjing Wang
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, China
| | - Qiankun Wang
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, China
| | - Shaoliang Peng
- College of Computer Science and Electronic Engineering, Hunan University, China
| | | | | | - Dennis Russell Salahub
- Department of Chemistry, University of Calgary, Fellow Royal Society of Canada and Fellow of the American Association for the Advancement of Science, China
| | - Yi Xiong
- State Key Laboratory of Microbial Metabolism, Shanghai-Islamabad-Belgrade Joint Innovation Center on Antibacterial Resistances, Joint International Research Laboratory of Metabolic & Developmental Sciences and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200030, P.R. China
| | - Dong-Qing Wei
- State Key Laboratory of Microbial Metabolism, Shanghai-Islamabad-Belgrade Joint Innovation Center on Antibacterial Resistances, Joint International Research Laboratory of Metabolic & Developmental Sciences and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200030, P.R. China
| |
Collapse
|
38
|
Ji C, Gao Z, Ma X, Wu Q, Ni J, Zheng C. AEMDA: inferring miRNA-disease associations based on deep autoencoder. Bioinformatics 2021; 37:66-72. [PMID: 32726399 DOI: 10.1093/bioinformatics/btaa670] [Citation(s) in RCA: 35] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2020] [Revised: 05/27/2020] [Accepted: 07/20/2020] [Indexed: 12/19/2022] Open
Abstract
MOTIVATION MicroRNAs (miRNAs) are a class of non-coding RNAs that play critical roles in various biological processes. Many studies have shown that miRNAs are closely related to the occurrence, development and diagnosis of human diseases. Traditional biological experiments are costly and time consuming. As a result, effective computational models have become increasingly popular for predicting associations between miRNAs and diseases, which could effectively boost human disease diagnosis and prevention. RESULTS We propose a novel computational framework, called AEMDA, to identify associations between miRNAs and diseases. AEMDA applies a learning-based method to extract dense and high-dimensional representations of diseases and miRNAs from integrated disease semantic similarity, miRNA functional similarity and heterogeneous related interaction data. In addition, AEMDA adopts a deep autoencoder that does not need negative samples to retrieve the underlying associations between miRNAs and diseases. Furthermore, the reconstruction error is used as a measurement to predict disease-associated miRNAs. Our experimental results indicate that AEMDA can effectively predict disease-related miRNAs and outperforms state-of-the-art methods. AVAILABILITY AND IMPLEMENTATION The source code and data are available at https://github.com/CunmeiJi/AEMDA. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Cunmei Ji
- School of Software, Qufu Normal University, Qufu 273165, China
| | - Zhen Gao
- School of Software, Qufu Normal University, Qufu 273165, China
| | - Xu Ma
- School of Software, Qufu Normal University, Qufu 273165, China
| | - Qingwen Wu
- School of Software, Qufu Normal University, Qufu 273165, China
| | - Jiancheng Ni
- School of Software, Qufu Normal University, Qufu 273165, China
| | - Chunhou Zheng
- School of Software, Qufu Normal University, Qufu 273165, China.,School of Computer Science and Technology, Anhui University, Hefei 230601, China
| |
Collapse
|
39
|
Liu D, Huang Y, Nie W, Zhang J, Deng L. SMALF: miRNA-disease associations prediction based on stacked autoencoder and XGBoost. BMC Bioinformatics 2021; 22:219. [PMID: 33910505 PMCID: PMC8082881 DOI: 10.1186/s12859-021-04135-2] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2021] [Accepted: 04/14/2021] [Indexed: 12/17/2022] Open
Abstract
BACKGROUND Identifying miRNA and disease associations helps us understand disease mechanisms of action from the molecular level. However, it is usually blind, time-consuming, and small-scale based on biological experiments. Hence, developing computational methods to predict unknown miRNA and disease associations is becoming increasingly important. RESULTS In this work, we develop a computational framework called SMALF to predict unknown miRNA-disease associations. SMALF first utilizes a stacked autoencoder to learn miRNA latent feature and disease latent feature from the original miRNA-disease association matrix. Then, SMALF obtains the feature vector of representing miRNA-disease by integrating miRNA functional similarity, miRNA latent feature, disease semantic similarity, and disease latent feature. Finally, XGBoost is utilized to predict unknown miRNA-disease associations. We implement cross-validation experiments. Compared with other state-of-the-art methods, SAMLF achieved the best AUC value. We also construct three case studies, including hepatocellular carcinoma, colon cancer, and breast cancer. The results show that 10, 10, and 9 out of the top ten predicted miRNAs are verified in MNDR v3.0 or miRCancer, respectively. CONCLUSION The comprehensive experimental results demonstrate that SMALF is effective in identifying unknown miRNA-disease associations.
Collapse
Affiliation(s)
- Dayun Liu
- School of Computer Science and Engineering, Central South University, Hunan, 410083, China
| | - Yibiao Huang
- School of Computer Science and Engineering, Central South University, Hunan, 410083, China
| | - Wenjuan Nie
- School of Computer Science and Engineering, Central South University, Hunan, 410083, China
| | - Jiaxuan Zhang
- Department of Cognitive Science, University of California San Diego, La Jolla, 92093, USA
| | - Lei Deng
- School of Computer Science and Engineering, Central South University, Hunan, 410083, China.
| |
Collapse
|
40
|
Qiu Y, Ching WK, Zou Q. Prediction of RNA-binding protein and alternative splicing event associations during epithelial-mesenchymal transition based on inductive matrix completion. Brief Bioinform 2021; 22:6124915. [PMID: 33517359 DOI: 10.1093/bib/bbaa440] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2020] [Revised: 12/20/2020] [Accepted: 12/29/2020] [Indexed: 12/13/2022] Open
Abstract
MOTIVATION The developmental process of epithelial-mesenchymal transition (EMT) is abnormally activated during breast cancer metastasis. Transcriptional regulatory networks that control EMT have been well studied; however, alternative RNA splicing plays a vital regulatory role during this process and the regulating mechanism needs further exploration. Because of the huge cost and complexity of biological experiments, the underlying mechanisms of alternative splicing (AS) and associated RNA-binding proteins (RBPs) that regulate the EMT process remain largely unknown. Thus, there is an urgent need to develop computational methods for predicting potential RBP-AS event associations during EMT. RESULTS We developed a novel model for RBP-AS target prediction during EMT that is based on inductive matrix completion (RAIMC). Integrated RBP similarities were calculated based on RBP regulating similarity, and RBP Gaussian interaction profile (GIP) kernel similarity, while integrated AS event similarities were computed based on AS event module similarity and AS event GIP kernel similarity. Our primary objective was to complete missing or unknown RBP-AS event associations based on known associations and on integrated RBP and AS event similarities. In this paper, we identify significant RBPs for AS events during EMT and discuss potential regulating mechanisms. Our computational results confirm the effectiveness and superiority of our model over other state-of-the-art methods. Our RAIMC model achieved AUC values of 0.9587 and 0.9765 based on leave-one-out cross-validation (CV) and 5-fold CV, respectively, which are larger than the AUC values from the previous models. RAIMC is a general matrix completion framework that can be adopted to predict associations between other biological entities. We further validated the prediction performance of RAIMC on the genes CD44 and MAP3K7. RAIMC can identify the related regulating RBPs for isoforms of these two genes. AVAILABILITY AND IMPLEMENTATION The source code for RAIMC is available at https://github.com/yushanqiu/RAIMC. CONTACT zouquan@nclab.net online.
Collapse
Affiliation(s)
- Yushan Qiu
- College of Mathematics and Statistics, Shenzhen University, 518000, Guangdong, China
| | - Wai-Ki Ching
- Department of Mathematics, The University of Hong Kong, Pokfulam Road, Hong Kong
| | - Quan Zou
- College of Mathematics and Statistics, Shenzhen University, 518000, Guangdong, China.,Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China
| |
Collapse
|
41
|
Ding Y, Jiang L, Tang J, Guo F. Identification of human microRNA-disease association via hypergraph embedded bipartite local model. Comput Biol Chem 2020; 89:107369. [DOI: 10.1016/j.compbiolchem.2020.107369] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2020] [Revised: 08/03/2020] [Accepted: 08/31/2020] [Indexed: 12/16/2022]
|
42
|
Deepthi K, Jereesh A. An ensemble approach for CircRNA-disease association prediction based on autoencoder and deep neural network. Gene 2020; 762:145040. [DOI: 10.1016/j.gene.2020.145040] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2020] [Revised: 06/28/2020] [Accepted: 08/04/2020] [Indexed: 01/26/2023]
|
43
|
Dong Y, Sun Y, Qin C, Zhu W. EPMDA: Edge Perturbation Based Method for miRNA-Disease Association Prediction. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2020; 17:2170-2175. [PMID: 31514148 DOI: 10.1109/tcbb.2019.2940182] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
In the recent few years, plenty of research has shown that microRNA (miRNA) is likely to be involved in the formation of many human diseases. So effectively predicting potential associations between miRNAs and diseases helps to understand the development and treatment of diseases. In this study, an edge perturbation based method is proposed for predicting potential miRNA-disease association (EPMDA). Different from the previous studies, we design an feature vector to describe each edge of a graph by structural Hamiltonian information. Moreover, the extracted features are used to train a multi-layer perception model to predict the candidate disease-miRNA associations. The experimental results on the HMDD dataset show that EPMDA achieves the AUC value of 0.9818 through 5-fold cross-validation, which improves the AUC values by approximately 3.5 percent compared to the latest method DeepMDA. For the leave-one-disease-out cross-validation, EPMDA achieves the AUC value of 0.9371, which improves the AUC values by approximately 7.4 percent compared to DeepMDA. In the case study, we verify the prediction performance of EPMDA on three human diseases. As a result, there are 42, 46, and 41 of the top 50 predicted miRNAs for these three diseases which are confirmed by the published experimental discoveries, respectively.
Collapse
|
44
|
Li J, Chen X, Huang Q, Wang Y, Xie Y, Dai Z, Zou X, Li Z. Seq-SymRF: a random forest model predicts potential miRNA-disease associations based on information of sequences and clinical symptoms. Sci Rep 2020; 10:17901. [PMID: 33087810 PMCID: PMC7578641 DOI: 10.1038/s41598-020-75005-9] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2020] [Accepted: 10/09/2020] [Indexed: 12/24/2022] Open
Abstract
Increasing evidence indicates that miRNAs play a vital role in biological processes and are closely related to various human diseases. Research on miRNA-disease associations is helpful not only for disease prevention, diagnosis and treatment, but also for new drug identification and lead compound discovery. A novel sequence- and symptom-based random forest algorithm model (Seq-SymRF) was developed to identify potential associations between miRNA and disease. Features derived from sequence information and clinical symptoms were utilized to characterize miRNA and disease, respectively. Moreover, the clustering method by calculating the Euclidean distance was adopted to construct reliable negative samples. Based on the fivefold cross-validation, Seq-SymRF achieved the accuracy of 98.00%, specificity of 99.43%, sensitivity of 96.58%, precision of 99.40% and Matthews correlation coefficient of 0.9604, respectively. The areas under the receiver operating characteristic curve and precision recall curve were 0.9967 and 0.9975, respectively. Additionally, case studies were implemented with leukemia, breast neoplasms and hsa-mir-21. Most of the top-25 predicted disease-related miRNAs (19/25 for leukemia; 20/25 for breast neoplasms) and 15 of top-25 predicted miRNA-related diseases were verified by literature and dbDEMC database. It is anticipated that Seq-SymRF could be regarded as a powerful high-throughput virtual screening tool for drug research and development. All source codes can be downloaded from https://github.com/LeeKamlong/Seq-SymRF.
Collapse
Affiliation(s)
- Jinlong Li
- School of Chemistry and Chemical Engineering, Guangdong Pharmaceutical University, Guangzhou, 510006, People's Republic of China
| | - Xingyu Chen
- School of Chemistry and Chemical Engineering, Guangdong Pharmaceutical University, Guangzhou, 510006, People's Republic of China
| | - Qixing Huang
- School of Chemistry and Chemical Engineering, Guangdong Pharmaceutical University, Guangzhou, 510006, People's Republic of China
| | - Yang Wang
- School of Chemistry, Sun Yat-Sen University, Guangzhou, 510275, People's Republic of China
| | - Yun Xie
- School of Chemistry and Chemical Engineering, Guangdong Pharmaceutical University, Guangzhou, 510006, People's Republic of China
| | - Zong Dai
- School of Chemistry, Sun Yat-Sen University, Guangzhou, 510275, People's Republic of China
| | - Xiaoyong Zou
- School of Chemistry, Sun Yat-Sen University, Guangzhou, 510275, People's Republic of China.
| | - Zhanchao Li
- School of Chemistry and Chemical Engineering, Guangdong Pharmaceutical University, Guangzhou, 510006, People's Republic of China. .,Key Laboratory of Digital Quality Evaluation of Chinese Materia Medica of State Administration of Traditional Chinese Medicine, Guangzhou, 510006, People's Republic of China.
| |
Collapse
|
45
|
Zhang T, Zhang S, Chen L, Ding H, Wu P, Zhang G, Xie K, Dai G, Wang J. UHPLC-MS/MS-Based Nontargeted Metabolomics Analysis Reveals Biomarkers Related to the Freshness of Chilled Chicken. Foods 2020; 9:foods9091326. [PMID: 32962264 PMCID: PMC7555583 DOI: 10.3390/foods9091326] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2020] [Revised: 09/11/2020] [Accepted: 09/13/2020] [Indexed: 12/17/2022] Open
Abstract
To identify metabolic biomarkers related to the freshness of chilled chicken, ultra-high-performance liquid chromatography-mass spectrometry (UHPLC-MS/MS) was used to obtain profiles of the metabolites present in chilled chicken stored for different lengths of time. Random forest regression analysis and stepwise multiple linear regression were used to identify key metabolic biomarkers related to the freshness of chilled chicken. A total of 265 differential metabolites were identified during storage of chilled chicken. Of these various metabolites, 37 were selected as potential biomarkers by random forest regression analysis. Receiver operating characteristic (ROC) curve analysis indicated that the biomarkers identified using random forest regression analysis showed a strong correlation with the freshness of chilled chicken. Subsequently, stepwise multiple linear regression analysis based on the biomarkers identified by using random forest regression analysis identified indole-3-carboxaldehyde, uridine monophosphate, s-phenylmercapturic acid, gluconic acid, tyramine, and Serylphenylalanine as key metabolic biomarkers. In conclusion, our study characterized the metabolic profiles of chilled chicken stored for different lengths of time and identified six key metabolic biomarkers related to the freshness of chilled chicken. These findings can contribute to a better understanding of the changes in the metabolic profiles of chilled chicken during storage and provide a basis for the further development of novel detection methods for the freshness of chilled chicken.
Collapse
Affiliation(s)
- Tao Zhang
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China; (T.Z.); (S.Z.); (L.C.); (H.D.); (P.W.); (G.Z.); (K.X.); (G.D.)
- Joint International Research Laboratory of Agriculture and Agri-Product Safety, Ministry of Education, Yangzhou University, Yangzhou 225009, China
| | - Shanshan Zhang
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China; (T.Z.); (S.Z.); (L.C.); (H.D.); (P.W.); (G.Z.); (K.X.); (G.D.)
- Joint International Research Laboratory of Agriculture and Agri-Product Safety, Ministry of Education, Yangzhou University, Yangzhou 225009, China
| | - Lan Chen
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China; (T.Z.); (S.Z.); (L.C.); (H.D.); (P.W.); (G.Z.); (K.X.); (G.D.)
- Joint International Research Laboratory of Agriculture and Agri-Product Safety, Ministry of Education, Yangzhou University, Yangzhou 225009, China
| | - Hao Ding
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China; (T.Z.); (S.Z.); (L.C.); (H.D.); (P.W.); (G.Z.); (K.X.); (G.D.)
- Joint International Research Laboratory of Agriculture and Agri-Product Safety, Ministry of Education, Yangzhou University, Yangzhou 225009, China
| | - Pengfei Wu
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China; (T.Z.); (S.Z.); (L.C.); (H.D.); (P.W.); (G.Z.); (K.X.); (G.D.)
- Joint International Research Laboratory of Agriculture and Agri-Product Safety, Ministry of Education, Yangzhou University, Yangzhou 225009, China
| | - Genxi Zhang
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China; (T.Z.); (S.Z.); (L.C.); (H.D.); (P.W.); (G.Z.); (K.X.); (G.D.)
- Joint International Research Laboratory of Agriculture and Agri-Product Safety, Ministry of Education, Yangzhou University, Yangzhou 225009, China
| | - Kaizhou Xie
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China; (T.Z.); (S.Z.); (L.C.); (H.D.); (P.W.); (G.Z.); (K.X.); (G.D.)
- Joint International Research Laboratory of Agriculture and Agri-Product Safety, Ministry of Education, Yangzhou University, Yangzhou 225009, China
| | - Guojun Dai
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China; (T.Z.); (S.Z.); (L.C.); (H.D.); (P.W.); (G.Z.); (K.X.); (G.D.)
- Joint International Research Laboratory of Agriculture and Agri-Product Safety, Ministry of Education, Yangzhou University, Yangzhou 225009, China
| | - Jinyu Wang
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China; (T.Z.); (S.Z.); (L.C.); (H.D.); (P.W.); (G.Z.); (K.X.); (G.D.)
- Joint International Research Laboratory of Agriculture and Agri-Product Safety, Ministry of Education, Yangzhou University, Yangzhou 225009, China
- Correspondence: ; Tel.: +86-0514-87979075
| |
Collapse
|
46
|
Zhu R, Ji C, Wang Y, Cai Y, Wu H. Heterogeneous Graph Convolutional Networks and Matrix Completion for miRNA-Disease Association Prediction. Front Bioeng Biotechnol 2020; 8:901. [PMID: 32974293 PMCID: PMC7468400 DOI: 10.3389/fbioe.2020.00901] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2020] [Accepted: 05/13/2020] [Indexed: 01/21/2023] Open
Abstract
Due to the cost and complexity of biological experiments, many computational methods have been proposed to predict potential miRNA-disease associations by utilizing known miRNA-disease associations and other related information. However, there are some challenges for these computational methods. First, the relationships between miRNAs and diseases are complex. The computational network should consider the local and global influence of neighborhoods from the network. Furthermore, predicting disease-related miRNAs without any known associations is also very important. This study presents a new computational method that constructs a heterogeneous network composed of a miRNA similarity network, disease similarity network, and known miRNA-disease association network. The miRNA similarity considers the miRNAs and their possible families and clusters. The information of each node in heterogeneous network is obtained by aggregating neighborhood information with graph convolutional networks (GCNs), which can pass the information of a node to its intermediate and distant neighbors. Disease-related miRNAs with no known associations can be predicted with the reconstructed heterogeneous matrix. We apply 5-fold cross-validation, leave-one-disease-out cross-validation, and global and local leave-one-out cross-validation to evaluate our method. The corresponding areas under the curves (AUCs) are 0.9616, 0.9946, 0.9656, and 0.9532, confirming that our approach significantly outperforms the state-of-the-art methods. Case studies show that this approach can effectively predict new diseases without any known miRNAs.
Collapse
Affiliation(s)
- Rongxiang Zhu
- Joint Engineering Research Center for Health Big Data Intelligent Analysis Technology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China.,Shenzhen College of Advanced Technology, University of Chinese Academy of Sciences, Shenzhen, China
| | - Chaojie Ji
- Joint Engineering Research Center for Health Big Data Intelligent Analysis Technology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Yingying Wang
- Department of Neurology and Stroke Center, The First Affiliated Hospital of Jinan University, Guangzhou, China.,Clinical Neuroscience Institute, The First Affiliated Hospital of Jinan University, Guangzhou, China
| | - Yunpeng Cai
- Joint Engineering Research Center for Health Big Data Intelligent Analysis Technology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Hongyan Wu
- Joint Engineering Research Center for Health Big Data Intelligent Analysis Technology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| |
Collapse
|
47
|
Association extraction from biomedical literature based on representation and transfer learning. J Theor Biol 2020; 488:110112. [DOI: 10.1016/j.jtbi.2019.110112] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2019] [Accepted: 12/08/2019] [Indexed: 12/17/2022]
|
48
|
Wang H, Wang J, Dong C, Lian Y, Liu D, Yan Z. A Novel Approach for Drug-Target Interactions Prediction Based on Multimodal Deep Autoencoder. Front Pharmacol 2020; 10:1592. [PMID: 32047432 PMCID: PMC6997437 DOI: 10.3389/fphar.2019.01592] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2019] [Accepted: 12/09/2019] [Indexed: 01/09/2023] Open
Abstract
Drug targets are biomacromolecules or biomolecular structures that bind to specific drugs and produce therapeutic effects. Therefore, the prediction of drug-target interactions (DTIs) is important for disease therapy. Incorporating multiple similarity measures for drugs and targets is of essence for improving the accuracy of prediction of DTIs. However, existing studies with multiple similarity measures ignored the global structure information of similarity measures, and required manual extraction features of drug-target pairs, ignoring the non-linear relationship among features. In this paper, we proposed a novel approach MDADTI for DTIs prediction based on MDA. MDADTI applied random walk with restart method and positive pointwise mutual information to calculate the topological similarity matrices of drugs and targets, capturing the global structure information of similarity measures. Then, MDADTI applied multimodal deep autoencoder to fuse multiple topological similarity matrices of drugs and targets, automatically learned the low-dimensional features of drugs and targets, and applied deep neural network to predict DTIs. The results of 5-repeats of 10-fold cross-validation under three different cross-validation settings indicated that MDADTI is superior to the other four baseline methods. In addition, we validated the predictions of the MDADTI in six drug-target interactions reference databases, and the results showed that MDADTI can effectively identify unknown DTIs.
Collapse
Affiliation(s)
- Huiqing Wang
- College of Information and Computer, Taiyuan University of Technology, Taiyuan, China
| | - Jingjing Wang
- College of Information and Computer, Taiyuan University of Technology, Taiyuan, China
| | - Chunlin Dong
- Dryland Agriculture Research Center, Shanxi Academy of Agricultural Sciences, Taiyuan, China
| | - Yuanyuan Lian
- College of Information and Computer, Taiyuan University of Technology, Taiyuan, China
| | - Dan Liu
- College of Information and Computer, Taiyuan University of Technology, Taiyuan, China
| | - Zhiliang Yan
- College of Information and Computer, Taiyuan University of Technology, Taiyuan, China
| |
Collapse
|
49
|
Osone T, Yoshida N. The Relationship Between the miRNA Sequence and Disease May be Revealed by Focusing on Hydrogen Bonding Sites in RNA-RNA Interactions. Cells 2019; 8:cells8121615. [PMID: 31835885 PMCID: PMC6952923 DOI: 10.3390/cells8121615] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2019] [Revised: 11/26/2019] [Accepted: 12/09/2019] [Indexed: 11/18/2022] Open
Abstract
MicroRNAs are important genes in biological processes. Although the function of microRNAs has been elucidated, the relationship between the sequence and the disease is not sufficiently clear. It is important to clarify the relationship between the sequence and the disease because it is possible to clarify the meaning of the microRNA genetic code consisting of four nucleobases. Since seed theory is based on sequences, its development can be expected to reveal the meaning of microRNA sequences. However, this method has many false positives and false negatives. On the other hand, disease-related microRNA searches using network analysis are not based on sequences, so it is difficult to clarify the relationship between sequences and diseases. Therefore, RNA–RNA interactions which are caused by hydrogen bonding were focused on. As a result, it was clarified that sequences and diseases were highly correlated by calculating the electric field in microRNA which is considered as the torus. It was also suggested that four diseases with different major classifications can be distinguished. Conventionally, RNA was interpreted as a one-dimensional array of four nucleobases, but a new approach to RNA from this study can be expected to provide a new perspective on RNA-RNA interactions.
Collapse
Affiliation(s)
- Tatsunori Osone
- School of Materials and Chemical Technology, Tokyo Institute of Technology, Yokohama 226-8502, Japan
- Correspondence: ; Tel.: +81-50-3568-0281
| | - Naohiro Yoshida
- School of Materials and Chemical Technology, Tokyo Institute of Technology, Yokohama 226-8502, Japan
- Earth-Life Science Institute, Tokyo Institute of Technology, Tokyo 152-8550, Japan;
| |
Collapse
|
50
|
Drug-Drug Interaction Predicting by Neural Network Using Integrated Similarity. Sci Rep 2019; 9:13645. [PMID: 31541145 PMCID: PMC6754439 DOI: 10.1038/s41598-019-50121-3] [Citation(s) in RCA: 78] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2019] [Accepted: 09/06/2019] [Indexed: 01/04/2023] Open
Abstract
Drug-Drug Interaction (DDI) prediction is one of the most critical issues in drug development and health. Proposing appropriate computational methods for predicting unknown DDI with high precision is challenging. We proposed "NDD: Neural network-based method for drug-drug interaction prediction" for predicting unknown DDIs using various information about drugs. Multiple drug similarities based on drug substructure, target, side effect, off-label side effect, pathway, transporter, and indication data are calculated. At first, NDD uses a heuristic similarity selection process and then integrates the selected similarities with a nonlinear similarity fusion method to achieve high-level features. Afterward, it uses a neural network for interaction prediction. The similarity selection and similarity integration parts of NDD have been proposed in previous studies of other problems. Our novelty is to combine these parts with new neural network architecture and apply these approaches in the context of DDI prediction. We compared NDD with six machine learning classifiers and six state-of-the-art graph-based methods on three benchmark datasets. NDD achieved superior performance in cross-validation with AUPR ranging from 0.830 to 0.947, AUC from 0.954 to 0.994 and F-measure from 0.772 to 0.902. Moreover, cumulative evidence in case studies on numerous drug pairs, further confirm the ability of NDD to predict unknown DDIs. The evaluations corroborate that NDD is an efficient method for predicting unknown DDIs. The data and implementation of NDD are available at https://github.com/nrohani/NDD.
Collapse
|