Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Chen X, Yan GY. Novel human lncRNA-disease association inference based on lncRNA expression profiles. ACTA ACUST UNITED AC 2013;29:2617-24. [PMID: 24002109 DOI: 10.1093/bioinformatics/btt426] [Citation(s) in RCA: 433] [Impact Index Per Article: 39.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]

For:	Chen X, Yan GY. Novel human lncRNA-disease association inference based on lncRNA expression profiles. ACTA ACUST UNITED AC 2013;29:2617-24. [PMID: 24002109 DOI: 10.1093/bioinformatics/btt426] [Citation(s) in RCA: 433] [Impact Index Per Article: 39.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]

Number

Cited by Other Article(s)

Dong H, Huang D, Zhang J, Xu D, Jiao X, Wang W. Exploring the innate immune system of Urechis unicinctus: Insights from full-length transcriptome analysis. Gene 2024;928:148784. [PMID: 39047957 DOI: 10.1016/j.gene.2024.148784] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2024] [Revised: 07/16/2024] [Accepted: 07/18/2024] [Indexed: 07/27/2024]

Abstract

The Echiura worm Urechis unicinctus refers to a common benthic invertebrate found in the intertidal zone of Huanghai as well as Bohai Bay. U. unicinctus is known to contain various physiologically active substances, making it highly valuable in terms of its edibility, medicinal properties, and economic potential. Nonetheless, the limited study on the immune system of U. unicinctus poses difficulties for its aquaculture and artificial reproduction. Marine invertebrates, including shellfish and U. unicinctus, are thought to primarily depend on their innate immune system for disease protection, owing to the severalinnate immune molecules they possess. Herein, we employed PacBio single-molecule real-time (SMRT) sequencing technology to perform the full-length transcriptome analysis of U. unicinctus individuals under five different conditions (room temperature (RT), low temperature (LT), high temperature (HT), without water (DRY), ultraviolet irradiation (UV)). Concequently, we identified 59,371 unigenes that had a 2,779 bp average length, 2,613 long non-coding RNAs (lncRNAs), 59,190 coding sequences (CDSs), 35,166 simple sequence repeats (SSRs), and 1,733 transcription factors (TFs), successfully annotating 90.58 % (53,778) of the unigenes. Subsequently, key factors associated with immune-related processes, such as non-self-recognition, cellular immune defenses, and humoral immune defenses, were searched. Our study also identified pattern recognition receptors (PRRs) that included 17 peptidoglycan recognition proteins (PGRPs), 13 Gram-negative binding proteins (GNBPs), 18 scavenger receptors (SRs), 74 toll-like receptors (TLRs), and 89 C-type lectins (CLTs). Altogether, the high-quality transcriptome obtained data will offer valuable insights for further investigations into U. unicinctus innate immune response, laying the foundation for subsequent molecular biology studies and aquaculture.

Collapse

Wen S, Liu Y, Yang G, Chen W, Wu H, Zhu X, Wang Y. A method for miRNA diffusion association prediction using machine learning decoding of multi-level heterogeneous graph Transformer encoded representations. Sci Rep 2024;14:20490. [PMID: 39227405 PMCID: PMC11371806 DOI: 10.1038/s41598-024-68897-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2024] [Accepted: 07/29/2024] [Indexed: 09/05/2024] Open

Abstract

MicroRNAs (miRNAs) are a key class of endogenous non-coding RNAs that play a pivotal role in regulating diseases. Accurately predicting the intricate relationships between miRNAs and diseases carries profound implications for disease diagnosis, treatment, and prevention. However, these prediction tasks are highly challenging due to the complexity of the underlying relationships. While numerous effective prediction models exist for validating these associations, they often encounter information distortion due to limitations in efficiently retaining information during the encoding-decoding process. Inspired by Multi-layer Heterogeneous Graph Transformer and Machine Learning XGboost classifier algorithm, this study introduces a novel computational approach based on multi-layer heterogeneous encoder-machine learning decoder structure for miRNA-disease association prediction (MHXGMDA). First, we employ the multi-view similarity matrices as the input coding for MHXGMDA. Subsequently, we utilize the multi-layer heterogeneous encoder to capture the embeddings of miRNAs and diseases, aiming to capture the maximum amount of relevant features. Finally, the information from all layers is concatenated to serve as input to the machine learning classifier, ensuring maximal preservation of encoding details. We conducted a comprehensive comparison of seven different classifier models and ultimately selected the XGBoost algorithm as the decoder. This algorithm leverages miRNA embedding features and disease embedding features to decode and predict the association scores between miRNAs and diseases. We applied MHXGMDA to predict human miRNA-disease associations on two benchmark datasets. Experimental findings demonstrate that our approach surpasses several leading methods in terms of both the area under the receiver operating characteristic curve and the area under the precision-recall curve.

Collapse

Xuan P, Wang W, Cui H, Wang S, Nakaguchi T, Zhang T. Mask-Guided Target Node Feature Learning and Dynamic Detailed Feature Enhancement for lncRNA-Disease Association Prediction. J Chem Inf Model 2024;64:6662-6675. [PMID: 39112431 DOI: 10.1021/acs.jcim.4c00652] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/27/2024]

Abstract

Identifying new relevant long noncoding RNAs (lncRNAs) for various human diseases can facilitate the exploration of the causes and progression of these diseases. Recently, several graph inference methods have been proposed to predict disease-related lncRNAs by exploiting the topological structure and node attributes within graphs. However, these methods did not prioritize the target lncRNA and disease nodes over auxiliary nodes like miRNA nodes, potentially limiting their ability to fully utilize the features of the target nodes. We propose a new method, mask-guided target node feature learning and dynamic detailed feature enhancement for lncRNA-disease association prediction (MDLD), to enhance node feature learning for improved lncRNA-disease association prediction. First, we designed a heterogeneous graph masked transformer autoencoder to guide feature learning, focusing more on the features of target lncRNA (disease) nodes. The target nodes were increasingly masked as training progressed, which helps develop a more robust prediction model. Second, we developed a graph convolutional network with dynamic residuals (GCNDR) to learn and integrate the heterogeneous topology and features of all lncRNA, disease, and miRNA nodes. GCNDR employs an interlayer residual strategy and a residual evolution strategy to mitigate oversmoothing caused by multilayer graph convolution. The interlayer residual strategy estimates the importance of node features learned in the previous GCN encoding layer for nodes in the current encoding layer. Additionally, since there are dependencies in the importance of features of individual lncRNA (disease, miRNA) nodes across multiple encoding layers, a gated recurrent unit-based strategy is proposed to encode these dependencies. Finally, we designed a perspective-level attention mechanism to obtain more informative features of lncRNA and disease node pairs from the perspectives of mask-enhanced and dynamic-enhanced node features. Cross-validation experimental results demonstrated that MDLD outperformed 10 other state-of-the-art prediction methods. Ablation experiments and case studies on candidate lncRNAs for three diseases further proved the technical contributions of MDLD and its capability to discover disease-related lncRNAs.

Collapse

Yao D, Zhang B, Zhan X, Zhang B, Li XK. Predicting lncRNA-Disease Associations Based on a Dual-Path Feature Extraction Network with Multiple Sources of Information Integration. ACS OMEGA 2024;9:35100-35112. [PMID: 39157140 PMCID: PMC11325412 DOI: 10.1021/acsomega.4c05365] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/08/2024] [Revised: 07/04/2024] [Accepted: 07/22/2024] [Indexed: 08/20/2024]

Abstract

Identifying the associations between long noncoding RNAs (lncRNAs) and disease is critical for disease prevention, diagnosis and treatment. However, conducting wet experiments to discover these associations is time-consuming and costly. Therefore, computational modeling for predicting lncRNA-disease associations (LDAs) has become an important alternative. To enhance the accuracy of LDAs prediction and alleviate the issue of node feature oversmoothing when exploring the potential features of nodes using graph neural networks, we introduce DPFELDA, a dual-path feature extraction network that leverages the integration of information from multiple sources to predict LDA. Initially, we establish a dual-view structure of lncRNAs and disease and a heterogeneous network of lncRNA-disease-microRNA (miRNA) interactions. Subsequently, features are extracted using a dual-path feature extraction network. In particular, we employ a combination of a graph convolutional network, a convolutional block attention module, and a node aggregation layer to perform multilayer topology feature extraction for the dual-view structure of lncRNAs and diseases. Additionally, we utilize a Transformer model to construct the node topology feature residual network for obtaining node-specific features in heterogeneous networks. Finally, XGBoost is employed for LDA prediction. The experimental results demonstrate that DPFELDA outperforms the benchmark model on various benchmark data sets. In the course of model exploration, it becomes evident that DPFELDA successfully alleviates the issue of node feature oversmoothing induced by graph-based learning. Ablation experiments confirm the effectiveness of the innovative module, and a case study substantiates the accuracy of DPFELDA model in predicting novel LDAs for characteristic diseases.

Collapse

Chen X, Yang L, Aslam MF, Tao J, Zhang X, Ren P, Wang Y, Chao P. Functional analysis, virtual screening, and molecular dynamics revealed potential novel drug targets and their inhibitors against cardiovascular disease in human. J Biomol Struct Dyn 2024;42:6982-6996. [PMID: 37608602 DOI: 10.1080/07391102.2023.2239926] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Accepted: 07/11/2023] [Indexed: 08/24/2023]

Abstract

Cardiovascular disease (CVD) is a group of diseases, affecting the human heart and accounting for 30% of deaths worldwide. Major CVDs include heart failure, hypertension, stroke, etc. Various therapeutics are available against CVD, still there is a dire need to find out potential protein drug targets to reduce economic burden and mortality rate. Goal of the current study was to utilize sequential computational techniques to find the best cardiovascular drug targets and their inhibitors. Common human cardiovascular targets of both databases (GeneCards and Uniprot) were subjected to bioinformatics analyses. Purpose was to validate putative therapeutic targets employing the structure-based bioinformatics methods to determine their physiochemical properties and biological processes. Three stable proteins, that have 0 transmembrane helices, and possess biological processes were screened as potential protein-based therapeutic targets: Hemoglobin subunit beta (HBB), Gamma-enolase (ENO2), and Cholesteryl ester transfer protein (CETP). Tertiary structures of target proteins were retrieved from PDB, and molecular docking technique was utilized to evaluate a library of 5000 phytochemicals against the interacting residues of the target protein as well as their respective standard drugs through MOE and Pyrx software. Top five phytochemicals (d-Sesamin, 1,3-benzodioxole, Sativanone, Thiamine, and Cajanol) were identified based on their RMSD and docking scores as compared to their standard drugs. The docking studies were also validated by MM-GBSA binding free energy and molecular dynamics simulations. According to the study's findings, these phytochemicals may eventually be used as drugs to treat CVD. Further in vitro testing is required to confirm their efficacy and drug potency.Communicated by Ramaswamy H. Sarma.

Collapse

Xie G, Li D, Lin Z, Gu G, Li W, Chen R, Liu Z. HPTRMF: Collaborative Matrix Factorization-Based Prediction Method for LncRNA-Disease Associations Using High-Order Perturbation and Flexible Trifactor Regularization. J Chem Inf Model 2024. [PMID: 39058598 DOI: 10.1021/acs.jcim.4c01070] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/28/2024]

Calazans MAA, Ferreira FABS, Santos FAN, Madeiro F, Lima JB. Machine Learning and Graph Signal Processing Applied to Healthcare: A Review. Bioengineering (Basel) 2024;11:671. [PMID: 39061753 PMCID: PMC11273494 DOI: 10.3390/bioengineering11070671] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2024] [Revised: 06/20/2024] [Accepted: 06/26/2024] [Indexed: 07/28/2024] Open

Chini A, Guha P, Rishi A, Obaid M, Udden SN, Mandal SS. Discovery and functional characterization of LncRNAs associated with inflammation and macrophage activation. Methods 2024;227:1-16. [PMID: 38703879 DOI: 10.1016/j.ymeth.2024.05.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Revised: 04/24/2024] [Accepted: 05/01/2024] [Indexed: 05/06/2024] Open

Peng L, Ren M, Huang L, Chen M. GEnDDn: An lncRNA-Disease Association Identification Framework Based on Dual-Net Neural Architecture and Deep Neural Network. Interdiscip Sci 2024;16:418-438. [PMID: 38733474 DOI: 10.1007/s12539-024-00619-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2023] [Revised: 02/02/2024] [Accepted: 02/03/2024] [Indexed: 05/13/2024]

Nie Z, Gao M, Jin X, Rao Y, Zhang X. MFPINC: prediction of plant ncRNAs based on multi-source feature fusion. BMC Genomics 2024;25:531. [PMID: 38816689 PMCID: PMC11137975 DOI: 10.1186/s12864-024-10439-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2023] [Accepted: 05/21/2024] [Indexed: 06/01/2024] Open

Bonomo M, Rombo SE. Neighborhood based computational approaches for the prediction of lncRNA-disease associations. BMC Bioinformatics 2024;25:187. [PMID: 38741200 DOI: 10.1186/s12859-024-05777-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Accepted: 04/11/2024] [Indexed: 05/16/2024] Open

Abstract

MOTIVATION

Long non-coding RNAs (lncRNAs) are a class of molecules involved in important biological processes. Extensive efforts have been provided to get deeper understanding of disease mechanisms at the lncRNA level, guiding towards the detection of biomarkers for disease diagnosis, treatment, prognosis and prevention. Unfortunately, due to costs and time complexity, the number of possible disease-related lncRNAs verified by traditional biological experiments is very limited. Computational approaches for the prediction of disease-lncRNA associations allow to identify the most promising candidates to be verified in laboratory, reducing costs and time consuming.

RESULTS

We propose novel approaches for the prediction of lncRNA-disease associations, all sharing the idea of exploring associations among lncRNAs, other intermediate molecules (e.g., miRNAs) and diseases, suitably represented by tripartite graphs. Indeed, while only a few lncRNA-disease associations are still known, plenty of interactions between lncRNAs and other molecules, as well as associations of the latters with diseases, are available. A first approach presented here, NGH, relies on neighborhood analysis performed on a tripartite graph, built upon lncRNAs, miRNAs and diseases. A second approach (CF) relies on collaborative filtering; a third approach (NGH-CF) is obtained boosting NGH by collaborative filtering. The proposed approaches have been validated on both synthetic and real data, and compared against other methods from the literature. It results that neighborhood analysis allows to outperform competitors, and when it is combined with collaborative filtering the prediction accuracy further improves, scoring a value of AUC equal to 0966.

AVAILABILITY

Source code and sample datasets are available at: https://github.com/marybonomo/LDAsPredictionApproaches.git.

Collapse

Xuan P, Lu S, Cui H, Wang S, Nakaguchi T, Zhang T. Learning Association Characteristics by Dynamic Hypergraph and Gated Convolution Enhanced Pairwise Attributes for Prediction of Disease-Related lncRNAs. J Chem Inf Model 2024;64:3569-3578. [PMID: 38523267 DOI: 10.1021/acs.jcim.4c00245] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/26/2024]

Abstract

As the long non-coding RNAs (lncRNAs) play important roles during the incurrence and development of various human diseases, identifying disease-related lncRNAs can contribute to clarifying the pathogenesis of diseases. Most of the recent lncRNA-disease association prediction methods utilized the multi-source data about the lncRNAs and diseases. A single lncRNA may participate in multiple disease processes, and multiple lncRNAs usually are involved in the same disease process synergistically. However, the previous methods did not completely exploit the biological characteristics to construct the informative prediction models. We construct a prediction model based on adaptive hypergraph and gated convolution for lncRNA-disease association prediction (AGLDA), to embed and encode the biological characteristics about lncRNA-disease associations, the topological features from the entire heterogeneous graph perspective, and the gated enhanced pairwise features. First, the strategy for constructing hyperedges is designed to reflect the biological characteristic that multiple lncRNAs are involved in multiple disease processes. Furthermore, each hyperedge has its own biological perspective, and multiple hyperedges are beneficial for revealing the diverse relationships among multiple lncRNAs and diseases. Second, we encode the biological features of each lncRNA (disease) node using a strategy based on dynamic hypergraph convolutional networks. The strategy may adaptively learn the features of the hyperedges and formulate the dynamically evolved hypergraph topological structure. Third, a group convolutional network is established to integrate the entire heterogeneous topological structure and multiple types of node attributes within an lncRNA-disease-miRNA graph. Finally, a gated convolutional strategy is proposed to enhance the informative features of the lncRNA-disease node pairs. The comparison experiments indicate that AGLDA outperforms seven advanced prediction methods. The ablation studies confirm the effectiveness of major innovations, and the case studies validate AGLDA's ability in application for discovering potential disease-related lncRNA candidates.

Collapse

Wasson MCD, Venkatesh J, Cahill HF, McLean ME, Dean CA, Marcato P. LncRNAs exhibit subtype-specific expression, survival associations, and cancer-promoting effects in breast cancer. Gene 2024;901:148165. [PMID: 38219875 DOI: 10.1016/j.gene.2024.148165] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2023] [Revised: 12/25/2023] [Accepted: 01/11/2024] [Indexed: 01/16/2024]

Liu Y, Zhang R, Dong X, Yang H, Li J, Cao H, Tian J, Zhang Y. DAE-CFR: detecting microRNA-disease associations using deep autoencoder and combined feature representation. BMC Bioinformatics 2024;25:139. [PMID: 38553698 PMCID: PMC10981315 DOI: 10.1186/s12859-024-05757-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Accepted: 03/20/2024] [Indexed: 04/01/2024] Open

Zhou L, Peng X, Zeng L, Peng L. Finding potential lncRNA-disease associations using a boosting-based ensemble learning model. Front Genet 2024;15:1356205. [PMID: 38495672 PMCID: PMC10940470 DOI: 10.3389/fgene.2024.1356205] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Accepted: 02/01/2024] [Indexed: 03/19/2024] Open

Yao HB, Hou ZJ, Zhang WG, Li H, Chen Y. Prediction of MicroRNA-Disease Potential Association Based on Sparse Learning and Multilayer Random Walks. J Comput Biol 2024;31:241-256. [PMID: 38377572 DOI: 10.1089/cmb.2023.0266] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/22/2024] Open

Peng L, Yang Y, Yang C, Li Z, Cheong N. HRGCNLDA: Forecasting of lncRNA-disease association based on hierarchical refinement graph convolutional neural network. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2024;21:4814-4834. [PMID: 38872515 DOI: 10.3934/mbe.2024212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2024]

Rinaldi S, Moroni E, Rozza R, Magistrato A. Frontiers and Challenges of Computing ncRNAs Biogenesis, Function and Modulation. J Chem Theory Comput 2024;20:993-1018. [PMID: 38287883 DOI: 10.1021/acs.jctc.3c01239] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2024]

Ahvaz S, Amini M, Yari A, Baradaran B, Jebelli A, Mokhtarzadeh A. Downregulation of long noncoding RNA B4GALT1-AS1 is associated with breast cancer development. Sci Rep 2024;14:3114. [PMID: 38326326 PMCID: PMC10850139 DOI: 10.1038/s41598-023-51124-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Accepted: 12/31/2023] [Indexed: 02/09/2024] Open

Chen Z, Zhang L, Li J, Fu M. MLFLHMDA: predicting human microbe-disease association based on multi-view latent feature learning. Front Microbiol 2024;15:1353278. [PMID: 38371933 PMCID: PMC10869561 DOI: 10.3389/fmicb.2024.1353278] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2023] [Accepted: 01/17/2024] [Indexed: 02/20/2024] Open

Abstract

Introduction

A growing body of research indicates that microorganisms play a crucial role in human health. Imbalances in microbial communities are closely linked to human diseases, and identifying potential relationships between microbes and diseases can help elucidate the pathogenesis of diseases. However, traditional methods based on biological or clinical experiments are costly, so the use of computational models to predict potential microbe-disease associations is of great importance.

Methods

In this paper, we present a novel computational model called MLFLHMDA, which is based on a Multi-View Latent Feature Learning approach to predict Human potential Microbe-Disease Associations. Specifically, we compute Gaussian interaction profile kernel similarity between diseases and microbes based on the known microbe-disease associations from the Human Microbe-Disease Association Database and perform a preprocessing step on the resulting microbe-disease association matrix, namely, weighting K nearest known neighbors (WKNKN) to reduce the sparsity of the microbe-disease association matrix. To obtain unobserved associations in the microbe and disease views, we extract different latent features based on the geometrical structure of microbes and diseases, and project multi-modal latent features into a common subspace. Next, we introduce graph regularization to preserve the local manifold structure of Gaussian interaction profile kernel similarity and add L p , q -norms to the projection matrix to ensure the interpretability and sparsity of the model.

Results

The AUC values for global leave-one-out cross-validation and 5-fold cross validation implemented by MLFLHMDA are 0.9165 and 0.8942+/-0.0041, respectively, which perform better than other existing methods. In addition, case studies of different diseases have demonstrated the superiority of the predictive power of MLFLHMDA. The source code of our model and the data are available on https://github.com/LiangzheZhang/MLFLHMDA_master.

Collapse

Jiao CN, Zhou F, Liu BM, Zheng CH, Liu JX, Gao YL. Multi-Kernel Graph Attention Deep Autoencoder for MiRNA-Disease Association Prediction. IEEE J Biomed Health Inform 2024;28:1110-1121. [PMID: 38055359 DOI: 10.1109/jbhi.2023.3336247] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/08/2023]

Yao D, Deng Y, Zhan X, Zhan X. Predicting lncRNA-disease associations using multiple metapaths in hierarchical graph attention networks. BMC Bioinformatics 2024;25:46. [PMID: 38287236 PMCID: PMC11271052 DOI: 10.1186/s12859-024-05672-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2023] [Accepted: 01/23/2024] [Indexed: 01/31/2024] Open

Abstract

BACKGROUND

Many biological studies have shown that lncRNAs regulate the expression of epigenetically related genes. The study of lncRNAs has helped to deepen our understanding of the pathogenesis of complex diseases at the molecular level. Due to the large number of lncRNAs and the complex and time-consuming nature of biological experiments, applying computer techniques to predict potential lncRNA-disease associations is very effective. To explore information between complex network structures, existing methods rely mainly on lncRNA and disease information. Metapaths have been applied to network models as an effective method for exploring information in heterogeneous graphs. However, existing methods are dominated by lncRNAs or disease nodes and tend to ignore the paths provided by intermediate nodes.

METHODS

We propose a deep learning model based on hierarchical graphical attention networks to predict unknown lncRNA-disease associations using multiple types of metapaths to extract features. We have named this model the MMHGAN. First, the model constructs a lncRNA-disease-miRNA heterogeneous graph based on known associations and two homogeneous graphs of lncRNAs and diseases. Second, for homogeneous graphs, the features of neighboring nodes are aggregated using a multihead attention mechanism. Third, for the heterogeneous graph, metapaths of different intermediate nodes are selected to construct subgraphs, and the importance of different types of metapaths is calculated and aggregated to obtain the final embedded features. Finally, the features are reconstructed using a fully connected layer to obtain the prediction results.

RESULTS

We used a fivefold cross-validation method and obtained an average AUC value of 96.07% and an average AUPR value of 93.23%. Additionally, ablation experiments demonstrated the role of homogeneous graphs and different intermediate node path weights. In addition, we studied lung cancer, esophageal carcinoma, and breast cancer. Among the 15 lncRNAs associated with these diseases, 15, 12, and 14 lncRNAs were validated by the lncRNA Disease Database and the Lnc2Cancer Database, respectively.

CONCLUSION

We compared the MMHGAN model with six existing models with better performance, and the case study demonstrated that the model was effective in predicting the correlation between potential lncRNAs and diseases.

Collapse

Zhang Y, Chu Y, Lin S, Xiong Y, Wei DQ. ReHoGCNES-MDA: prediction of miRNA-disease associations using homogenous graph convolutional networks based on regular graph with random edge sampler. Brief Bioinform 2024;25:bbae103. [PMID: 38517693 PMCID: PMC10959163 DOI: 10.1093/bib/bbae103] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Revised: 02/04/2024] [Accepted: 02/23/2024] [Indexed: 03/24/2024] Open

Zhang Y, Cai G, Li X, Chen M. GCN-Based Heterogeneous Complex Feature Learning to Enhance Predictability for LncRNA-Disease Associations. ACS OMEGA 2024;9:1472-1484. [PMID: 38222651 PMCID: PMC10785310 DOI: 10.1021/acsomega.3c07923] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 11/20/2023] [Accepted: 11/28/2023] [Indexed: 01/16/2024]

Abstract

Using computational models to predict potential lncRNA-disease associations (LDAs) has emerged as an effective supplement to bioexperiments for exploring the pathogenesis of diseases. However, current computational models still face limitations in their ability to learn the complex features of bionetworks. In this study, HGCNLDA, a model which combines graph convolutional network (GCN)-based aggregation, heterogeneous information fusion, and a bilinear-decoder to infer LDAs was proposed. Recognizing the need to extract essential features during data processing, our HGCNLDA explored four key steps for uncovering interaction patterns within the bionetwork: (1) a novel type of tripartite heterogeneous network, known as the lncRNA-disease-miRNA network (LDMN), was constructed using computed similarities and known associations. (2) Homogeneous and heterogeneous features of nodes were extracted from domains within the LDMN by a GCN-based encoder. (3) Feature fusions, including bipolymerization operations and attention mechanism, were employed to capture a more accurate and comprehensive representation of nodes. (4) Bilinear-decoder was used to rebuild the edge type (or rating type) for a specific node pair, resulting in the predicted association score. Through a 5-fold cross-validation on two data sets, namely, data set1 and data set2, our HGCNLDA consistently demonstrated superior performance compared to five related models. It almost achieved the highest AUROC and AUPR values on both data sets, especially on data set2 where the results obtained were more challenging and objective. Case studies involving three real cancer scenarios further validated the practicality of HGCNLDA in identifying potential LDAs in real-world contexts. The source code and data for this study are available at https://github.com/zywait/HGCNLDA.

Collapse

Yao D, Zhang B, Li X, Zhan X, Zhan X, Zhang B. Applying negative sample denoising and multi-view feature for lncRNA-disease association prediction. Front Genet 2024;14:1332273. [PMID: 38264213 PMCID: PMC10803626 DOI: 10.3389/fgene.2023.1332273] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Accepted: 12/22/2023] [Indexed: 01/25/2024] Open

Yao D, Li B, Zhan X, Zhan X, Yu L. GCNFORMER: graph convolutional network and transformer for predicting lncRNA-disease associations. BMC Bioinformatics 2024;25:5. [PMID: 38166659 PMCID: PMC10763317 DOI: 10.1186/s12859-023-05625-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2023] [Accepted: 12/18/2023] [Indexed: 01/05/2024] Open

Cai J, Wang R, Chen Y, Zhang C, Fu L, Fan C. LncRNA FIRRE regulated endometrial cancer radiotherapy sensitivity via the miR-199b-5p/SIRT1/BECN1 axis-mediated autophagy. Genomics 2024;116:110750. [PMID: 38052260 DOI: 10.1016/j.ygeno.2023.110750] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2023] [Revised: 11/13/2023] [Accepted: 11/27/2023] [Indexed: 12/07/2023]

Qu J, Ni J, Ni TG, Bian ZK, Liang JZ. Prediction of Human Microbe-Drug Association based on Layer Attention Graph Convolutional Network. Curr Med Chem 2024;31:5097-5109. [PMID: 39225188 DOI: 10.2174/0109298673249941231108091326] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2023] [Revised: 08/20/2023] [Accepted: 10/19/2023] [Indexed: 09/04/2024]

Abstract

Human microbes are closely associated with a variety of complex diseases and have emerged as drug targets. Identification of microbe-related drugs is becoming a key issue in drug development and precision medicine. It can also provide guidance for solving the increasingly serious problem of drug resistance enhancement in viruses.

METHODS

In this paper, we have proposed a novel model of layer attention graph convolutional network for microbe-drug association prediction. First, multiple biological data have been integrated into a heterogeneous network. Then, the heterogeneous network has been incorporated into a graph convolutional network to determine the embedded microbe and drug. Finally, the microbe-drug association scores have been obtained by decoding the embedding of microbe and drug based on the layer attention mechanism.

RESULTS

To evaluate the performance of our proposed model, leave-one-out crossvalidation (LOOCV) and 5-fold cross-validation have been implemented on the two datasets of aBiofilm and MDAD. As a result, based on the aBiofilm dataset, our proposed model has attained areas under the curve (AUC) of 0.9178 and 0.9022 on global LOOCV and local LOOCV, respectively. Based on aBiofilm dataset, the proposed model has attained an AUC value of 0.9018 and 0.8902 on global LOOCV and local LOOCV, respectively. In addition, the average AUC and standard deviation of the proposed model for 5- fold cross-validation on the aBiofilm and MDAD datasets were 0.9141±6.8556e-04 and 0.8982±7.5868e-04, respectively. Also, two kinds of case studies have been further conducted to evaluate the proposed models.

CONCLUSION

Traditional methods for microbe-drug association prediction are timeconsuming and laborious. Therefore, the computational model proposed was used to predict new microbe-drug associations. Several evaluation results have shown the proposed model to achieve satisfactory results and that it can play a role in drug development and precision medicine.

Collapse

Zhu H, Hao H, Yu L. Identifying disease-related microbes based on multi-scale variational graph autoencoder embedding Wasserstein distance. BMC Biol 2023;21:294. [PMID: 38115088 PMCID: PMC10731776 DOI: 10.1186/s12915-023-01796-8] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2023] [Accepted: 12/05/2023] [Indexed: 12/21/2023] Open

Yu J, Yang G, Li S, Li M, Ji C, Liu G, Wang Y, Chen N, Lei C, Dang R. Identification of Dezhou donkey muscle development-related genes and long non-coding RNA based on differential expression analysis. Anim Biotechnol 2023;34:2313-2323. [PMID: 35736796 DOI: 10.1080/10495398.2022.2088549] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/01/2022]

Wu X, Cao S, Zou Y, Wu F. Traditional Chinese Medicine studies for Alzheimer's disease via network pharmacology based on entropy and random walk. PLoS One 2023;18:e0294772. [PMID: 38019798 PMCID: PMC10686466 DOI: 10.1371/journal.pone.0294772] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2023] [Accepted: 11/08/2023] [Indexed: 12/01/2023] Open

Abstract

Alzheimer's disease (AD) is a common neurodegenerative disease having complex pathogenesis, approved drugs can only alleviate symptoms of AD for a period of time. Traditional Chinese medicine (TCM) contains multiple active ingredients that can act on multiple targets simultaneously. In this paper, a novel algorithm based on entropy and random walk with the restart of heterogeneous network (RWRHE) is proposed for predicting active ingredients for AD and screening out the effective TCMs for AD. First, Six TCM compounds containing 20 herbs from the AD drug reviews in the CNKI (China National Knowledge Internet) are collected, their active ingredients and targets are retrieved from different databases. Then, comprehensive similarity networks of active ingredients and targets are constructed based on different aspects and entropy weight, respectively. A comprehensive heterogeneous network is constructed by integrating the known active ingredient-target association information and two comprehensive similarity networks. Subsequently, bi-random walks are applied on the heterogeneous network to predict active ingredient-target associations. AD related targets are selected as the seed nodes, a random walk is carried out on the target similarity network to predict the AD-target associations, and the associations of AD-active ingredients are inferred and scored. The effective herbs and compounds for AD are screened out based on their active ingredients' scores. The results measured by machine learning and bioinformatics show that the RWRHE algorithm achieves better prediction accuracy, the top 15 active ingredients may act as multi-target agents in the prevention and treatment of AD, Danshen, Gouteng and Chaihu are recommended as effective TCMs for AD, Yiqitongyutang is recommended as effective compound for AD.

Collapse

Peng L, Huang L, Su Q, Tian G, Chen M, Han G. LDA-VGHB: identifying potential lncRNA-disease associations with singular value decomposition, variational graph auto-encoder and heterogeneous Newton boosting machine. Brief Bioinform 2023;25:bbad466. [PMID: 38127089 PMCID: PMC10734633 DOI: 10.1093/bib/bbad466] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2023] [Revised: 10/05/2023] [Accepted: 11/25/2023] [Indexed: 12/23/2023] Open

Wang S, Hui C, Zhang T, Wu P, Nakaguchi T, Xuan P. Graph Reasoning Method Based on Affinity Identification and Representation Decoupling for Predicting lncRNA-Disease Associations. J Chem Inf Model 2023;63:6947-6958. [PMID: 37906529 DOI: 10.1021/acs.jcim.3c01214] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2023]

Ning Z, Wu J, Ding Y, Wang Y, Peng Q, Fu L. BertNDA: A Model Based on Graph-Bert and Multi-Scale Information Fusion for ncRNA-Disease Association Prediction. IEEE J Biomed Health Inform 2023;27:5655-5664. [PMID: 37669210 DOI: 10.1109/jbhi.2023.3311808] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/07/2023]

Rahni Z, Hosseini SM, Shahrokh S, Saeedi Niasar M, Shoraka S, Mirjalali H, Nazemalhosseini-Mojarad E, Rostami-Nejad M, Malekpour H, Zali MR, Mohebbi SR. Long non-coding RNAs ANRIL, THRIL, and NEAT1 as potential circulating biomarkers of SARS-CoV-2 infection and disease severity. Virus Res 2023;336:199214. [PMID: 37657511 PMCID: PMC10502354 DOI: 10.1016/j.virusres.2023.199214] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2023] [Revised: 08/23/2023] [Accepted: 08/29/2023] [Indexed: 09/03/2023]

Abstract

The current outbreak of coronavirus disease 2019 (COVID-19) is a global emergency, as its rapid spread and high mortality rate, which poses a significant threat to public health. Innate immunity plays a crucial role in the primary defense against infections, and recent studies have highlighted the pivotal regulatory function of long non-coding RNAs (lncRNAs) in innate immune responses. This study aims to assess the circulating levels of lncRNAs namely ANRIL, THRIL, NEAT1, and MALAT1 in the blood of moderate and severe SARS-CoV-2 infected patients, in comparison to healthy individuals. Additionally, it aims to explore the potential of these lncRNAs as biomarkers for determining the severity of the disease. The blood samples were collected from a total of 38 moderate and 25 severe COVID-19 patients, along with 30 healthy controls. The total RNA was extracted and qPCR was performed to evaluate the blood levels of the lncRNAs. The results indicate significantly higher expression levels of lncRNAs ANRIL and THRIL in severe patients when compared to moderate patients (P value = 0.0307, P value = 0.0059, respectively). Moreover, the expression levels of lncRNAs ANRIL and THRIL were significantly up-regulated in both moderate and severe patients in comparison to the control group (P value < 0.001, P value < 0.001, P value = 0.001, P value < 0.001, respectively). The expression levels of lncRNA NEAT1 were found to be significantly higher in both moderate and severe COVID-19 patients compared to the healthy group (P value < 0.001, P value < 0.001, respectively), and there was no significant difference in the expression levels of NEAT1 between moderate and severe patients (P value = 0.6979). The expression levels of MALAT1 in moderate and severe patients did not exhibit a significant difference compared to the control group (P value = 0.677, P value = 0.764, respectively). Furthermore, the discriminative power of ANRIL and THRIL was significantly higher in the severe patient group than the moderate group (Area under curve (AUC) = 0.6879; P-value = 0.0122, AUC = 0.6947; P-value = 0.0093, respectively). In conclusion, the expression levels of the lncRNAs ANRIL and THRIL are correlated with the severity of COVID-19 and can be regarded as circulating biomarkers for disease progression.

Collapse

Affiliation(s)

Zeynab Rahni Basic and Molecular Epidemiology of Gastrointestinal Disorders Research Center, Research Institute for Gastroenterology and Liver Diseases, Shahid Beheshti University of Medical Sciences, Tehran, Iran; Department of Microbiology and Microbial Biotechnology, Faculty of Life Sciences and Biotechnology, Shahid Beheshti University, Tehran, Iran
Seyed Masoud Hosseini Department of Microbiology and Microbial Biotechnology, Faculty of Life Sciences and Biotechnology, Shahid Beheshti University, Tehran, Iran
Shabnam Shahrokh Gastroenterology and Liver Diseases Research Center, Research Institute for Gastroenterology and Liver Diseases, Shahid Beheshti University of Medical Sciences, Tehran, Iran
Mahsa Saeedi Niasar Gastroenterology and Liver Diseases Research Center, Research Institute for Gastroenterology and Liver Diseases, Shahid Beheshti University of Medical Sciences, Tehran, Iran
Shahrzad Shoraka Gastroenterology and Liver Diseases Research Center, Research Institute for Gastroenterology and Liver Diseases, Shahid Beheshti University of Medical Sciences, Tehran, Iran
Hamed Mirjalali Foodborne and Waterborne Diseases Research Center, Research Institute for Gastroenterology and Liver Diseases, Shahid Beheshti University of Medical Sciences, Tehran, Iran
Ehsan Nazemalhosseini-Mojarad Gastroenterology and Liver Diseases Research Center, Research Institute for Gastroenterology and Liver Diseases, Shahid Beheshti University of Medical Sciences, Tehran, Iran
Mohammad Rostami-Nejad Gastroenterology and Liver Diseases Research Center, Research Institute for Gastroenterology and Liver Diseases, Shahid Beheshti University of Medical Sciences, Tehran, Iran
Habib Malekpour Research and Development Center, Imam Hossein Hospital, Shahid Beheshti University of Medical Sciences, Tehran, Iran
Mohammad Reza Zali Gastroenterology and Liver Diseases Research Center, Research Institute for Gastroenterology and Liver Diseases, Shahid Beheshti University of Medical Sciences, Tehran, Iran
Seyed Reza Mohebbi Gastroenterology and Liver Diseases Research Center, Research Institute for Gastroenterology and Liver Diseases, Shahid Beheshti University of Medical Sciences, Tehran, Iran.

Collapse

Xie GB, Liu SG, Gu GS, Lin ZY, Yu JR, Chen RB, Xie WJ, Xu HJ. LUNCRW: Prediction of potential lncRNA-disease associations based on unbalanced neighborhood constraint random walk. Anal Biochem 2023;679:115297. [PMID: 37619903 DOI: 10.1016/j.ab.2023.115297] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2023] [Revised: 08/14/2023] [Accepted: 08/18/2023] [Indexed: 08/26/2023]

Khan R, Riaz A, Abbasi SA, Sadaf T, Baig RM, Mansoor Q. Identification of transcriptional level variations in microRNA-221 and microRNA-222 as alternate players in the thyroid cancer tumor microenvironment. Sci Rep 2023;13:15800. [PMID: 37737255 PMCID: PMC10516937 DOI: 10.1038/s41598-023-42941-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2022] [Accepted: 09/16/2023] [Indexed: 09/23/2023] Open

Sheng N, Wang Y, Huang L, Gao L, Cao Y, Xie X, Fu Y. Multi-task prediction-based graph contrastive learning for inferring the relationship among lncRNAs, miRNAs and diseases. Brief Bioinform 2023;24:bbad276. [PMID: 37529914 DOI: 10.1093/bib/bbad276] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Revised: 07/09/2023] [Accepted: 07/11/2023] [Indexed: 08/03/2023] Open

Abstract

MOTIVATION

Identifying the relationships among long non-coding RNAs (lncRNAs), microRNAs (miRNAs) and diseases is highly valuable for diagnosing, preventing, treating and prognosing diseases. The development of effective computational prediction methods can reduce experimental costs. While numerous methods have been proposed, they often to treat the prediction of lncRNA-disease associations (LDAs), miRNA-disease associations (MDAs) and lncRNA-miRNA interactions (LMIs) as separate task. Models capable of predicting all three relationships simultaneously remain relatively scarce. Our aim is to perform multi-task predictions, which not only construct a unified framework, but also facilitate mutual complementarity of information among lncRNAs, miRNAs and diseases.

RESULTS

In this work, we propose a novel unsupervised embedding method called graph contrastive learning for multi-task prediction (GCLMTP). Our approach aims to predict LDAs, MDAs and LMIs by simultaneously extracting embedding representations of lncRNAs, miRNAs and diseases. To achieve this, we first construct a triple-layer lncRNA-miRNA-disease heterogeneous graph (LMDHG) that integrates the complex relationships between these entities based on their similarities and correlations. Next, we employ an unsupervised embedding model based on graph contrastive learning to extract potential topological feature of lncRNAs, miRNAs and diseases from the LMDHG. The graph contrastive learning leverages graph convolutional network architectures to maximize the mutual information between patch representations and corresponding high-level summaries of the LMDHG. Subsequently, for the three prediction tasks, multiple classifiers are explored to predict LDA, MDA and LMI scores. Comprehensive experiments are conducted on two datasets (from older and newer versions of the database, respectively). The results show that GCLMTP outperforms other state-of-the-art methods for the disease-related lncRNA and miRNA prediction tasks. Additionally, case studies on two datasets further demonstrate the ability of GCLMTP to accurately discover new associations. To ensure reproducibility of this work, we have made the datasets and source code publicly available at https://github.com/sheng-n/GCLMTP.

Collapse

Xuan P, Bai H, Cui H, Zhang X, Nakaguchi T, Zhang T. Specific topology and topological connection sensitivity enhanced graph learning for lncRNA-disease association prediction. Comput Biol Med 2023;164:107265. [PMID: 37531860 DOI: 10.1016/j.compbiomed.2023.107265] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Revised: 06/26/2023] [Accepted: 07/16/2023] [Indexed: 08/04/2023]

Abstract

Predicting disease-related candidate long noncoding RNAs (lncRNAs) is beneficial for exploring disease pathogenesis due to the close relations between lncRNAs and the occurrence and development of human diseases. It is a long-term and challenging task to adequately extract specific and local topologies in individual lncRNA network and individual disease network, and integrate the information of the connection relationships. We propose a new graph learning-based prediction method to encode specific and local topologies from each individual network, neighbor topologies with different connection relationships, and pairwise attributes. We first construct a lncRNA network composed of all the lncRNA nodes and their similarities, and a single disease network that contains all the disease nodes and disease similarities. Then, a network-aware graph convolutional autoencoder is constructed to encode the specific and local topologies of each network. Secondly, a heterogeneous network is established to embed all lncRNA, disease, and miRNA nodes and their various connections. Afterwards, a connection-sensitive graph neural network is designed to deeply integrate the neighbor node attributes and connection characteristics in the heterogeneous network and learn neighbor topological representations. We also construct both connection-level and topology representation-level attention mechanisms to extract informative connections and topological representations. Finally, we build a multi-layer convolutional neural networks with weighted residuals to adaptively complement the detailed features to pairwise attribute encoding. Comprehensive experiments and comparison results demonstrated that NCPred outperforms seven state-of-the-art prediction methods. The ablation studies demonstrated the importance of local topology learning, neighbor topology learning, and pairwise attribute encoding. Case studies on prostate, lung, and breast cancers further revealed NCPred's capacity to screen potential candidate disease-related lncRNAs.

Collapse

Li Y, Zhang M, Shang J, Li F, Ren Q, Liu JX. iLncDA-RSN: identification of lncRNA-disease associations based on reliable similarity networks. Front Genet 2023;14:1249171. [PMID: 37614816 PMCID: PMC10442839 DOI: 10.3389/fgene.2023.1249171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Accepted: 07/27/2023] [Indexed: 08/25/2023] Open

Biyu H, GuangWen T, Ming Z, Lixin G, Mengshan L. A lncRNA-disease association prediction model based on the two-step PU learning and fully connected neural networks. Heliyon 2023;9:e17726. [PMID: 37539215 PMCID: PMC10395133 DOI: 10.1016/j.heliyon.2023.e17726] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2023] [Revised: 06/13/2023] [Accepted: 06/26/2023] [Indexed: 08/05/2023] Open

Abstract

Long non-coding RNAs (lncRNAs) have been shown to play a regulatory role in various processes of human diseases. However, lncRNA experiments are inefficient, time-consuming and highly subjective, so that the number of experimentally verified associations between lncRNA and diseases is limited. In the era of big data, numerous machine learning methods have been proposed to predict the potential association between lncRNA and diseases, but the characteristics of the associated data were seldom explored. In these methods, negative samples are randomly selected for model training and the model is prone to learn the potential positive association error, thus affecting the prediction accuracy. In this paper, we proposed a cyclic optimization model of predicting lncRNA-disease associations (COPTLDA in short). In COPTLDA, the two-step training strategy is adopted to search for the samples with the greater probability of being negative examples from unlabeled samples and the determined samples are treated as negative samples, which are combined together with known positive samples to train the model. The searching and training steps are repeated until the best model is obtained as the final prediction model. In order to evaluate the performance of the model, 30% of the known positive samples are used to calculate the model accuracy and 10% of positive samples are used to calculate the recall rate of the model. The sampling strategy used in this paper can improve the accuracy and the AUC value reaches 0.9348. The results of case studies showed that the model could predict the potential associations between lncRNA and malignant tumors such as colorectal cancer, gastric cancer, and breast cancer. The predicted top 20 associated lncRNAs included 10 colorectal cancer lncRNAs, 2 gastric cancer lncRNAs, and 8 breast cancer lncRNAs.

Collapse

Hu X, Yin Z, Zeng Z, Peng Y. Prediction of miRNA-Disease Associations by Cascade Forest Model Based on Stacked Autoencoder. Molecules 2023;28:5013. [PMID: 37446675 DOI: 10.3390/molecules28135013] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Revised: 06/23/2023] [Accepted: 06/24/2023] [Indexed: 07/15/2023] Open

Lu C, Xie M. LDAEXC: LncRNA-Disease Associations Prediction with Deep Autoencoder and XGBoost Classifier. Interdiscip Sci 2023:10.1007/s12539-023-00573-z. [PMID: 37308797 DOI: 10.1007/s12539-023-00573-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2022] [Revised: 05/14/2023] [Accepted: 05/15/2023] [Indexed: 06/14/2023]

Abstract

Numerous scientific evidences have revealed that long non-coding RNAs (lncRNAs) are involved in the progression of human complex diseases and biological life activities. Therefore, identifying novel and potential disease-related lncRNAs is helpful to diagnosis, prognosis and therapy of many human complex diseases. Since traditional laboratory experiments are cost and time-consuming, a great quantity of computer algorithms have been proposed for predicting the relationships between lncRNAs and diseases. However, there are still much room for the improvement. In this paper, we introduce an accurate framework named LDAEXC to infer LncRNA-Disease Associations with deep autoencoder and XGBoost Classifier. LDAEXC utilizes different similarity views of lncRNAs and human diseases to construct features for each data sources. Then, the reduced features are obtained by feeding the constructed feature vectors into a deep autoencoder, and at last an XGBoost classifier is leveraged to calculate the latent lncRNA-disease-associated scores using reduced features. The fivefold cross-validation experiments on four datasets showed that LDAEXC reached AUC scores of 0.9676 ± 0.0043, 0.9449 ± 0.022, 0.9375 ± 0.0331 and 0.9556 ± 0.0134, respectively, significantly higher than other advanced similar computer methods. Extensive experiment results and case studies of two complex diseases (colon and breast cancers) further indicated the practicability and excellent prediction performance of LDAEXC in inferring unknown lncRNA-disease associations. TLDAEXC utilizes disease semantic similarity, lncRNA expression similarity, and Gaussian interaction profile kernel similarity of lncRNAs and diseases for feature construction. The constructed features are fed to a deep autoencoder to extract reduced features, and an XGBoost classifier is used to predict the lncRNA-disease associations based on the reduced features. The fivefold and tenfold cross-validation experiments on a benchmark dataset showed that LDAEXC could achieve AUC scores of 0.9676 and 0.9682, respectively, significantly higher than other state-of-the-art similar methods.

Collapse

Zhong H, Luo J, Tang L, Liao S, Lu Z, Lin G, Murphy RW, Liu L. Association filtering and generative adversarial networks for predicting lncRNA-associated disease. BMC Bioinformatics 2023;24:234. [PMID: 37277721 DOI: 10.1186/s12859-023-05368-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2023] [Accepted: 05/29/2023] [Indexed: 06/07/2023] Open

Abstract

BACKGROUND

Long non-coding RNA (lncRNA) closely associates with numerous biological processes, and with many diseases. Therefore, lncRNA-disease association prediction helps obtain relevant biological information and understand pathogenesis, and thus better diagnose preventable diseases.

RESULTS

Herein, we offer the LDAF_GAN method for predicting lncRNA-associated disease based on association filtering and generative adversarial networks. Experimentation used two types of data: lncRNA-disease associated data without lncRNA sequence features, and fused lncRNA sequence features. LDAF_GAN uses a generator and discriminator, and differs from the original GAN by the addition of a filtering operation and negative sampling. Filtering allows the generator output to filter out unassociated diseases before being fed into the discriminator. Thus, the results generated by the model focuses only on lncRNAs associated with disease. Negative sampling takes a portion of disease terms with 0 from the association matrix as negative samples, which are assumed to be unassociated with lncRNA. A regular term is added to the loss function to avoid producing a vector with all values of 1, which can fool the discriminator. Thus, the model requires that generated positive samples are close to 1, and negative samples are close to 0. The model achieved a superior fitting effect; LDAF_GAN had superior performance in predicting fivefold cross-validations on the two datasets with AUC values of 0.9265 and 0.9278, respectively. In the case study, LDAF_GAN predicted disease association for six lncRNAs-H19, MALAT1, XIST, ZFAS1, UCA1, and ZEB1-AS1-and with the top ten predictions of 100%, 80%, 90%, 90%, 100%, and 90%, respectively, which were reported by previous studies.

CONCLUSION

LDAF_GAN efficiently predicts the potential association of existing lncRNAs and the potential association of new lncRNAs with diseases. The results of fivefold cross-validation, tenfold cross-validation, and case studies suggest that the model has great predictive potential for lncRNA-disease association prediction.

Collapse

Kumar R, Yadav G, Kuddus M, Ashraf GM, Singh R. Unlocking the microbial studies through computational approaches: how far have we reached? ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023;30:48929-48947. [PMID: 36920617 PMCID: PMC10016191 DOI: 10.1007/s11356-023-26220-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Accepted: 02/24/2023] [Indexed: 04/16/2023]

Zhang GZ, Gao YL. BRWMC: Predicting lncRNA-disease associations based on bi-random walk and matrix completion on disease and lncRNA networks. Comput Biol Chem 2023;103:107833. [PMID: 36812824 DOI: 10.1016/j.compbiolchem.2023.107833] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2022] [Revised: 12/29/2022] [Accepted: 02/15/2023] [Indexed: 02/19/2023]

Feng JL, Zheng WJ, Xu L, Zhou QY, Chen J. Identification of potential LncRNAs as papillary thyroid carcinoma biomarkers based on integrated bioinformatics analysis using TCGA and RNA sequencing data. Sci Rep 2023;13:4350. [PMID: 36928327 PMCID: PMC10020161 DOI: 10.1038/s41598-023-30086-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2023] [Accepted: 02/15/2023] [Indexed: 03/18/2023] Open

Yalimaimaiti S, Liang X, Zhao H, Dou H, Liu W, Yang Y, Ning L. Establishment of a prognostic signature for lung adenocarcinoma using cuproptosis-related lncRNAs. BMC Bioinformatics 2023;24:81. [PMID: 36879187 PMCID: PMC9990240 DOI: 10.1186/s12859-023-05192-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2022] [Accepted: 02/20/2023] [Indexed: 03/08/2023] Open

Abstract

OBJECTIVE

To establish a prognostic signature for lung adenocarcinoma (LUAD) based on cuproptosis-related long non-coding RNAs (lncRNAs), and to study the immune-related functions of LUAD.

METHODS

First, transcriptome data and clinical data related to LUAD were downloaded from the Cancer Genome Atlas (TCGA), and cuproptosis-related genes were analyzed to identify cuproptosis-related lncRNAs. Univariate COX analysis, least absolute shrinkage and selection operator (LASSO) analysis, and multivariate COX analysis were performed to analyze the cuproptosis-related lncRNAs, and a prognostic signature was established. Second, univariate COX analysis and multivariate COX analysis were performed for independent prognostic analyses. Receiver operating characteristic (ROC) curves, C index, survival curve, nomogram, and principal component analysis (PCA) were performed to evaluate the results of the independent prognostic analyses. Finally, gene enrichment analyses and immune-related function analyses were also carried out.

RESULTS

(1) A total of 1,297 cuproptosis-related lncRNAs were screened. (2) A LUAD prognostic signature containing 13 cuproptosis-related lncRNAs was constructed (NIFK-AS1, AC026355.2, SEPSECS-AS1, AL360270.1, AC010999.2, ABCA9-AS1, AC032011.1, AL162632.3, LINC02518, LINC0059, AL031600.2, AP000346.1, AC012409.4). (3) The area under the multi-indicator ROC curves at 1, 3, and 5 years were AUC1 = 0.742, AUC2 = 0.708, and AUC3 = 0.762, respectively. The risk score of the prognostic signature could be used as an independent prognostic factor that was independent of other clinical indicators. (4) The results of gene enrichment analyses showed that 13 biomarkers were primarily related to amoebiasis, the wnt signaling pathway, hematopoietic cell lineage. The ssGSEA volcano map showed significant differences between high- and low-risk groups in immune-related functions, such as human leukocyte antigen (HLA), Type_II_IFN_Reponse, MHC_class_I, and Parainflammation (P < 0.001).

CONCLUSIONS

Thirteen cuproptosis-related lncRNAs may be clinical molecular biomarkers for the prognosis of LUAD.

Collapse

Akbarzadeh S, Tayefeh-Gholami S, Najari P, Rajabi A, Ghasemzadeh T, Hosseinpour Feizi M, Safaralizadeh R. The expression profile of HAR1A and HAR1B in the peripheral blood cells of multiple sclerosis patients. Mol Biol Rep 2023;50:2391-2398. [PMID: 36583781 DOI: 10.1007/s11033-022-08182-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2022] [Accepted: 12/06/2022] [Indexed: 12/31/2022]

Ha J, Park S. NCMD: Node2vec-Based Neural Collaborative Filtering for Predicting MiRNA-Disease Association. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023;20:1257-1268. [PMID: 35849666 DOI: 10.1109/tcbb.2022.3191972] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]