101
|
Liu Y, Yang H, Zheng C, Wang K, Yan J, Cao H, Zhang Y. NCP-BiRW: A Hybrid Approach for Predicting Long Noncoding RNA-Disease Associations by Network Consistency Projection and Bi-Random Walk. Front Genet 2022; 13:862272. [PMID: 35495166 PMCID: PMC9043107 DOI: 10.3389/fgene.2022.862272] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2022] [Accepted: 03/21/2022] [Indexed: 12/06/2022] Open
Abstract
Long non-coding RNAs (lncRNAs) play significant roles in the disease process. Understanding the pathological mechanisms of lncRNAs during the course of various diseases will help clinicians prevent and treat diseases. With the emergence of high-throughput techniques, many biological experiments have been developed to study lncRNA-disease associations. Because experimental methods are costly, slow, and laborious, a growing number of computational models have emerged. Here, we present a new approach using network consistency projection and bi-random walk (NCP-BiRW) to infer hidden lncRNA-disease associations. First, integrated similarity networks for lncRNAs and diseases were constructed by merging similarity information. Subsequently, network consistency projection was applied to calculate space projection scores for lncRNAs and diseases, which were then introduced into a bi-random walk method for association prediction. To test model performance, we employed 5- and 10-fold cross-validation, with the area under the receiver operating characteristic curve as the evaluation indicator. The computational results showed that our method outperformed the other five advanced algorithms. In addition, the novel method was applied to another dataset in the Mammalian ncRNA-Disease Repository (MNDR) database and showed excellent performance. Finally, case studies were carried out on atherosclerosis and leukemia to confirm the effectiveness of our method in practice. In conclusion, we could infer lncRNA-disease associations using the NCP-BiRW model, which may benefit biomedical studies in the future.
Collapse
Affiliation(s)
- Yanling Liu
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, China
- Department of Mathematics, Changzhi Medical College, Changzhi, China
| | - Hong Yang
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, China
| | - Chu Zheng
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, China
| | - Ke Wang
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, China
| | - Jingjing Yan
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, China
| | - Hongyan Cao
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, China
| | - Yanbo Zhang
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, China
- Shanxi Provincial Key Laboratory of Major Diseases Risk Assessment, Taiyuan, China
- School of Health and Service Management, Shanxi University of Chinese Medicine, Taiyuan, China
- *Correspondence:Yanbo Zhang,
| |
Collapse
|
102
|
Bonomo M, Giancarlo R, Greco D, Rombo SE. Topological ranks reveal functional knowledge encoded in biological networks: a comparative analysis. Brief Bioinform 2022; 23:6563936. [PMID: 35381599 DOI: 10.1093/bib/bbac101] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2021] [Revised: 01/31/2022] [Accepted: 02/28/2022] [Indexed: 12/21/2022] Open
Abstract
MOTIVATION Biological networks topology yields important insights into biological function, occurrence of diseases and drug design. In the last few years, different types of topological measures have been introduced and applied to infer the biological relevance of network components/interactions, according to their position within the network structure. Although comparisons of such measures have been previously proposed, to what extent the topology per se may lead to the extraction of novel biological knowledge has never been critically examined nor formalized in the literature. RESULTS We present a comparative analysis of nine outstanding topological measures, based on compact views obtained from the rank they induce on a given input biological network. The goal is to understand their ability in correctly positioning nodes/edges in the rank, according to the functional knowledge implicitly encoded in biological networks. To this aim, both internal and external (gold standard) validation criteria are taken into account, and six networks involving three different organisms (yeast, worm and human) are included in the comparison. The results show that a distinct handful of best-performing measures can be identified for each of the considered organisms, independently from the reference gold standard. AVAILABILITY Input files and code for the computation of the considered topological measures and K-haus distance are available at https://gitlab.com/MaryBonomo/ranking. CONTACT simona.rombo@unipa.it. SUPPLEMENTARY INFORMATION Supplementary data are available at Briefings in Bioinformatics online.
Collapse
Affiliation(s)
- Mariella Bonomo
- Department of Engineering, University of Palermo, Palermo, 90121, Italy, Palermo
| | - Raffaele Giancarlo
- Department of Mathematics and Computer Science, University of Palermo, Palermo, 90121, Italy, Palermo
| | - Daniele Greco
- Department of Mathematics and Computer Science, University of Palermo, Palermo, 90121, Italy, Palermo
| | - Simona E Rombo
- Department of Mathematics and Computer Science, University of Palermo, Palermo, 90121, Italy, Palermo
| |
Collapse
|
103
|
Xuan P, Gong Z, Cui H, Li B, Zhang T. Fully connected autoencoder and convolutional neural network with attention-based method for inferring disease-related lncRNAs. Brief Bioinform 2022; 23:6561435. [PMID: 35362511 DOI: 10.1093/bib/bbac089] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2021] [Revised: 01/17/2022] [Accepted: 02/23/2022] [Indexed: 11/14/2022] Open
Abstract
Since abnormal expression of long noncoding RNAs (lncRNAs) is often closely related to various human diseases, identification of disease-associated lncRNAs is helpful for exploring the complex pathogenesis. Most of recent methods concentrate on exploiting multiple kinds of data related to lncRNAs and diseases for predicting candidate disease-related lncRNAs. These methods, however, failed to deeply integrate the topology information from the meta-paths that are composed of lncRNA, disease and microRNA (miRNA) nodes. We proposed a new method based on fully connected autoencoders and convolutional neural networks, called ACLDA, for inferring potential disease-related lncRNA candidates. A heterogeneous graph that consists of lncRNA, disease and miRNA nodes were firstly constructed to integrate similarities, associations and interactions among them. Fully connected autoencoder-based module was established to extract the low-dimensional features of lncRNA, disease and miRNA nodes in the heterogeneous graph. We designed the attention mechanisms at the node feature level and at the meta-path level to learn more informative features and meta-paths. A module based on convolutional neural networks was constructed to encode the local topologies of lncRNA and disease nodes from multiple meta-path perspectives. The comprehensive experimental results demonstrated ACLDA achieves superior performance than several state-of-the-art prediction methods. Case studies on breast, lung and colon cancers demonstrated that ACLDA is able to discover the potential disease-related lncRNAs.
Collapse
Affiliation(s)
- Ping Xuan
- School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China
| | - Zhe Gong
- School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China
| | - Hui Cui
- Department of Computer Science and Information Technology, La Trobe University, Melbourne 3083, Australia
| | - Bochong Li
- Center for Frontier Medical Engineering, Chiba University, Chiba 2638522, Japan
| | - Tiangang Zhang
- School of Mathematical Science, Heilongjiang University, Harbin 150080, China
| |
Collapse
|
104
|
Liu W, Lin H, Huang L, Peng L, Tang T, Zhao Q, Yang L. Identification of miRNA-disease associations via deep forest ensemble learning based on autoencoder. Brief Bioinform 2022; 23:6553934. [PMID: 35325038 DOI: 10.1093/bib/bbac104] [Citation(s) in RCA: 51] [Impact Index Per Article: 25.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Revised: 02/18/2022] [Accepted: 03/01/2022] [Indexed: 12/31/2022] Open
Abstract
Increasing evidences show that the occurrence of human complex diseases is closely related to microRNA (miRNA) variation and imbalance. For this reason, predicting disease-related miRNAs is essential for the diagnosis and treatment of complex human diseases. Although some current computational methods can effectively predict potential disease-related miRNAs, the accuracy of prediction should be further improved. In our study, a new computational method via deep forest ensemble learning based on autoencoder (DFELMDA) is proposed to predict miRNA-disease associations. Specifically, a new feature representation strategy is proposed to obtain different types of feature representations (from miRNA and disease) for each miRNA-disease association. Then, two types of low-dimensional feature representations are extracted by two deep autoencoders for predicting miRNA-disease associations. Finally, two prediction scores of the miRNA-disease associations are obtained by the deep random forest and combined to determine the final results. DFELMDA is compared with several classical methods on the The Human microRNA Disease Database (HMDD) dataset. Results reveal that the performance of this method is superior. The area under receiver operating characteristic curve (AUC) values obtained by DFELMDA through 5-fold and 10-fold cross-validation are 0.9552 and 0.9560, respectively. In addition, case studies on colon, breast and lung tumors of different disease types further demonstrate the excellent ability of DFELMDA to predict disease-associated miRNA-disease. Performance analysis shows that DFELMDA can be used as an effective computational tool for predicting miRNA-disease associations.
Collapse
Affiliation(s)
- Wei Liu
- Key Laboratory of Intelligent Computing and Information Processing of Ministry of Education, Xiangtan University, Xiangtan, 411105, China.,School of Computer Science, Xiangtan University, Xiangtan, 411105, China
| | - Hui Lin
- Key Laboratory of Intelligent Computing and Information Processing of Ministry of Education, Xiangtan University, Xiangtan, 411105, China.,School of Computer Science, Xiangtan University, Xiangtan, 411105, China
| | - Li Huang
- Academy of Arts and Design, Tsinghua University, Beijing, 10084, China.,The Future Laboratory, Tsinghua University, Beijing, 10084, China
| | - Li Peng
- School of Computer Science and Engineering, Hunan University of Science and Technology, Xiangtan, 411201, China
| | - Ting Tang
- Key Laboratory of Intelligent Computing and Information Processing of Ministry of Education, Xiangtan University, Xiangtan, 411105, China.,School of Computer Science, Xiangtan University, Xiangtan, 411105, China
| | - Qi Zhao
- School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan, 114051, China
| | - Li Yang
- Key Laboratory of Intelligent Computing and Information Processing of Ministry of Education, Xiangtan University, Xiangtan, 411105, China
| |
Collapse
|
105
|
Revealing key lncRNAs in cytogenetically normal acute myeloid leukemia by reconstruction of the lncRNA-miRNA-mRNA network. Sci Rep 2022; 12:4973. [PMID: 35322118 PMCID: PMC8942983 DOI: 10.1038/s41598-022-08930-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2021] [Accepted: 03/14/2022] [Indexed: 11/09/2022] Open
Abstract
Cytogenetically normal acute myeloid leukemia (CN-AML) is a heterogeneous disease with different prognoses. Researches on prognostic biomarkers and therapy targets of CN-AML are still ongoing. Instead of protein-coding genes, more and more researches were focused on the non-coding RNAs especially long non-coding RNAs (lncRNAs) which may play an important role in the development of AML. Although a large number of lncRNAs have been found, our knowledge of their functions and pathological process is still in its infancy. The purpose of this research is to identify the key lncRNAs and explore their functions in CN-AML by reconstructing the lncRNA-miRNA-mRNA network based on the competitive endogenous RNA (ceRNA) theory. We reconstructed a global triple network based on the ceRNA theory using the data from National Center for Biotechnology Information Gene Expression Omnibus and published literature. According to the topological algorithm, we identified the key lncRNAs which had both the higher node degrees and the higher numbers of lncRNA-miRNA pairs and total pairs in the ceRNA network. Meanwhile, Gene Ontology (GO) and pathway analysis were performed using databases such as DAVID, KOBAS and Cytoscape plug-in ClueGO respectively. The lncRNA-miRNA-mRNA network was composed of 90 lncRNAs,33mRNAs,26 miRNAs and 259 edges in the lncRNA upregulated group, and 18 lncRNAs,11 mRNAs,6 miRNAs and 45 edges in the lncRNA downregulated group. The functional assay showed that 53 pathways and 108 GO terms were enriched. Three lncRNAs (XIST, TUG1, GABPB1-AS1) could possibly be selected as key lncRNAs which may play an important role in the development of CN-AML. Particularly, GABPB1-AS1 was highly expressed in CN-AML by both bioinformatic analysis and experimental verification in AML cell line (THP-1) with quantitative real-time polymerase chain reaction. In addition, GABPB1-AS1 was also negatively correlated with overall survival of AML patients. The lncRNA-miRNA-mRNA network revealed key lncRNAs and their functions in CN-AML. Particularly, lncRNA GABPB1-AS1 was firstly proposed in AML. We believe that GABPB1-AS1 is expected to become a candidate prognostic biomarker or a potential therapeutic target.
Collapse
|
106
|
Guo ZH, Chen ZH, You ZH, Wang YB, Yi HC, Wang MN. A learning-based method to predict LncRNA-disease associations by combining CNN and ELM. BMC Bioinformatics 2022; 22:622. [PMID: 35317723 PMCID: PMC8941737 DOI: 10.1186/s12859-022-04611-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2021] [Accepted: 10/07/2021] [Indexed: 11/10/2022] Open
Abstract
Background lncRNAs play a critical role in numerous biological processes and life activities, especially diseases. Considering that traditional wet experiments for identifying uncovered lncRNA-disease associations is limited in terms of time consumption and labor cost. It is imperative to construct reliable and efficient computational models as addition for practice. Deep learning technologies have been proved to make impressive contributions in many areas, but the feasibility of it in bioinformatics has not been adequately verified. Results In this paper, a machine learning-based model called LDACE was proposed to predict potential lncRNA-disease associations by combining Extreme Learning Machine (ELM) and Convolutional Neural Network (CNN). Specifically, the representation vectors are constructed by integrating multiple types of biology information including functional similarity and semantic similarity. Then, CNN is applied to mine both local and global features. Finally, ELM is chosen to carry out the prediction task to detect the potential lncRNA-disease associations. The proposed method achieved remarkable Area Under Receiver Operating Characteristic Curve of 0.9086 in Leave-one-out cross-validation and 0.8994 in fivefold cross-validation, respectively. In addition, 2 kinds of case studies based on lung cancer and endometrial cancer indicate the robustness and efficiency of LDACE even in a real environment. Conclusions Substantial results demonstrated that the proposed model is expected to be an auxiliary tool to guide and assist biomedical research, and the close integration of deep learning and biology big data will provide life sciences with novel insights.
Collapse
Affiliation(s)
- Zhen-Hao Guo
- School of Electronics and Information Engineering, Tongji University, No. 4800 Cao'an Road, Shanghai, 201804, China
| | - Zhan-Heng Chen
- College of Computer Science and Engineering, Shenzhen University, Shenzhen, 518060, China.
| | - Zhu-Hong You
- School of Computer Science, Northwestern Polytechnical University, Xi'an, 710129, China
| | - Yan-Bin Wang
- College of Information Science and Engineering, Zaozhuang University, Zaozhuang, 277100, Shandong, China.
| | - Hai-Cheng Yi
- Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi, 830011, China.,University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Mei-Neng Wang
- School of Mathematics and Computer Science, Yichun University, Yichun, 336000, Jiangxi, China
| |
Collapse
|
107
|
Yu L, Zheng Y, Ju B, Ao C, Gao L. Research progress of miRNA-disease association prediction and comparison of related algorithms. Brief Bioinform 2022; 23:6542222. [PMID: 35246678 DOI: 10.1093/bib/bbac066] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2021] [Revised: 01/30/2022] [Accepted: 02/08/2022] [Indexed: 11/13/2022] Open
Abstract
With an in-depth understanding of noncoding ribonucleic acid (RNA), many studies have shown that microRNA (miRNA) plays an important role in human diseases. Because traditional biological experiments are time-consuming and laborious, new calculation methods have recently been developed to predict associations between miRNA and diseases. In this review, we collected various miRNA-disease association prediction models proposed in recent years and used two common data sets to evaluate the performance of the prediction models. First, we systematically summarized the commonly used databases and similarity data for predicting miRNA-disease associations, and then divided the various calculation models into four categories for summary and detailed introduction. In this study, two independent datasets (D5430 and D6088) were compiled to systematically evaluate 11 publicly available prediction tools for miRNA-disease associations. The experimental results indicate that the methods based on information dissemination and the method based on scoring function require shorter running time. The method based on matrix transformation often requires a longer running time, but the overall prediction result is better than the previous two methods. We hope that the summary of work related to miRNA and disease will provide comprehensive knowledge for predicting the relationship between miRNA and disease and contribute to advanced computation tools in the future.
Collapse
Affiliation(s)
- Liang Yu
- School of Computer Science and Technology, Xidian University, Xi'an, China
| | - Yujia Zheng
- School of Computer Science and Technology, Xidian University, Xi'an, China
| | - Bingyi Ju
- School of Computer Science and Technology, Xidian University, Xi'an, China
| | - Chunyan Ao
- School of Computer Science and Technology, Xidian University, Xi'an, China
| | - Lin Gao
- School of Computer Science and Technology, Xidian University, Xi'an, China
| |
Collapse
|
108
|
Yu Y, Zhang Q, Sun K, Xiu Y, Wang X, Wang K, Yan L. Long non-coding RNA BBOX1 antisense RNA 1 increases the apoptosis of granulosa cells in premature ovarian failure by sponging miR-146b. Bioengineered 2022; 13:6092-6099. [PMID: 35188872 PMCID: PMC8973711 DOI: 10.1080/21655979.2022.2031675] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2021] [Revised: 01/17/2022] [Accepted: 01/17/2022] [Indexed: 11/02/2022] Open
Abstract
Long non-coding RNA (lncRNA) BBOX1 antisense RNA 1 (BBOX1-AS1) was reported to participate in ovarian cancer, while its role in other ovarian disorders is unclear. We speculated that BBOX1-AS1 could interact with microRNA(miR)-146b, which is involved in premature ovarian failure (POF). This study was therefore carried out to explore its role in POF. In this study, 60 patients with POF and 60 controls were enrolled. The expression of BBOX1-AS1 and miR-146b were analyzed by RT-qPCRs. The direct interaction between miR-146b and the wild type BBOX1-AS1 (BBOX1-AS1-WT) or mutant BBOX1-AS1 (BBOX1-AS1-mut) was explored with RNA-RNA pulldown assay. Subcellular location of BBOX1-AS1 in COV434 granulosa cells was detected by subcellular fractionation. The role of BBOX1-AS1 and miR-146b in the apoptosis of COV434 cells was evaluated by cell apoptosis assay. Overexpression assay was applied to explore the relationship between BBOX1-AS1 and miR-146b. We found that the expression levels of BBOX1-AS1 were increased, while the expression levels of miR-146b were decreased in POF patients. BBOX1-AS1-WT, but not BBOX1-AS1-mut, directly interacted with miR-146b. BBOX1-AS1 was detected in both nucleus and cytoplasm, while they did not affect the expression of each other. BBOX1-AS1 suppressed the role of miR-146b in cell apoptosis. Therefore, BBOX1-AS1 may increase the apoptosis of granulosa cells in POF by sponging miR-146b.
Collapse
Affiliation(s)
- Yuexin Yu
- Department of Reproductive Medicine, General Hospital of Northern Theater Command, Shenyang, Liaoning province, PR. China
| | - Qian Zhang
- Department of Reproductive Medicine, General Hospital of Northern Theater Command, Shenyang, Liaoning province, PR. China
| | - Kaixuan Sun
- Department of Reproductive Medicine, General Hospital of Northern Theater Command, Shenyang, Liaoning province, PR. China
| | - Yinling Xiu
- Department of Reproductive Medicine, General Hospital of Northern Theater Command, Shenyang, Liaoning province, PR. China
| | - Xiliang Wang
- Department of Reproductive Medicine, General Hospital of Northern Theater Command, Shenyang, Liaoning province, PR. China
| | - Kaiyue Wang
- Department of Reproductive Medicine, General Hospital of Northern Theater Command, Shenyang, Liaoning province, PR. China
| | - Li Yan
- Department of Reproductive Medicine, General Hospital of Northern Theater Command, Shenyang, Liaoning province, PR. China
| |
Collapse
|
109
|
Xie G, Jiang J, Sun Y. LDA-LNSUBRW: lncRNA-Disease Association Prediction Based on Linear Neighborhood Similarity and Unbalanced bi-Random Walk. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:989-997. [PMID: 32870798 DOI: 10.1109/tcbb.2020.3020595] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Increasing number of experiments show that lncRNAs are involved in many biological processes, and their mutations and disorders are associated with many diseases. However, verifying the relationships between lncRNAs and diseases is time consuming and laborio. Searching for effective computational methods will contribute to our understanding of the underlying mechanisms of disease and identifying biomarkers of diseases. Therefore, we proposed a method called lncRNA-disease association prediction based on linear neighborhood similarity and unbalanced bi-random walk (LDA-LNSUBRW). Given that the known lncRNA-disease associations are rare, a pretreatment step should be performed to obtain the interaction possibility of unknown cases, so as to help us predict the potential associations. In the framework of leave-one-out cross-validation (LOOCV)and fivefold cross-validation (5-fold CV), LDA-LNSUBRW achieved effective performance with AUC of 0.8874 and 0.8632 ± 0.0051, respectively. The experimental results in this paper show that the proposed method is superior to five other state-of-the-art methods. In addition, case studies of three diseases (lung cancer, breast cancer, and osteosarcoma)were carried out to illustrate that LDA-LNSUBRW could predict the relevant lncRNAs.
Collapse
|
110
|
Yang L, Li LP, Yi HC. DeepWalk based method to predict lncRNA-miRNA associations via lncRNA-miRNA-disease-protein-drug graph. BMC Bioinformatics 2022; 22:621. [PMID: 35216549 PMCID: PMC8875942 DOI: 10.1186/s12859-022-04579-0] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2021] [Accepted: 01/18/2022] [Indexed: 01/12/2023] Open
Abstract
BACKGROUND Long non-coding RNAs (lncRNAs) play a crucial role in diverse biological processes and have been confirmed to be concerned with various diseases. Largely uncharacterized of the physiological role and functions of lncRNA remains. MicroRNAs (miRNAs), which are usually 20-24 nucleotides, have several critical regulatory parts in cells. LncRNA can be regarded as a sponge to adsorb miRNA and indirectly regulate transcription and translation. Thus, the identification of lncRNA-miRNA associations is essential and valuable. RESULTS In our work, we present DWLMI to infer the potential associations between lncRNAs and miRNAs by representing them as vectors via a lncRNA-miRNA-disease-protein-drug graph. Specifically, DeepWalk can be used to learn the behavior representation of vertices. The methods of fingerprint, k-mer and MeSH descriptors were mainly used to learn the attribute representation of vertices. By combining the above two kinds of information, unknown lncRNA-miRNA associations can be predicted by the random forest classifier. Under the five-fold cross-validation, the proposed DWLMI model obtained an average prediction accuracy of 95.22% with a sensitivity of 94.35% at the AUC of 98.56%. CONCLUSIONS The experimental results demonstrated that DWLMI can effectively predict the potential lncRNA-miRNA associated pairs, and the results can provide a new insight for related non-coding RNA researchers in the field of combing biology big data with deep learning.
Collapse
Affiliation(s)
- Long Yang
- Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi, 830011, China.,University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Li-Ping Li
- College of Grassland and Environmental Science, Xinjiang Agricultural University, Urumqi, 830052, China.
| | - Hai-Cheng Yi
- Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi, 830011, China.,University of Chinese Academy of Sciences, Beijing, 100049, China
| |
Collapse
|
111
|
Tyagi S, Chan EC, Barker D, McElduff P, Taylor KA, Riveros C, Singh E, Smith R. Transcriptomic analysis reveals myometrial topologically associated domains linked to onset of human term labor. Mol Hum Reprod 2022; 28:6527642. [PMID: 35150271 PMCID: PMC8903000 DOI: 10.1093/molehr/gaac003] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2021] [Revised: 01/10/2022] [Indexed: 11/13/2022] Open
Abstract
Changes in cell phenotype are thought to occur through the expression of groups of co-regulated genes within topologically associated domains (TADs). In this paper we allocate genes expressed within the myometrium of the human uterus during the onset of term labor into TADs. Transformation of the myometrial cells of the uterus into a contractile phenotype during term human labor is the result of a complex interaction of different epigenomic and genomic layers. Recent work suggests that the transcription factor RELA lies at the top of this regulatory network. Using deep RNA sequencing (RNAseq) analysis of myometrial samples (n = 16) obtained at term from women undergoing Caesarean section prior to or after the onset of labor we have identified evidence for how other gene expression regulatory elements interact with transcription factors in the labor phenotype transition. Gene set enrichment analysis of our RNAseq data identified three modules of enriched genes (M1, M2 and M3), which in gene ontology studies are linked to matrix degradation, smooth muscle and immune gene signatures, respectively. These genes were predominantly located within chromosomal TADs suggesting co-regulation of expression. Our transcriptomic analysis also identified significant differences in the expression of long non-coding RNAs (lncRNA), microRNAs (miRNA) and transcription factors that were predicted to target genes within the TADs. Additionally, network analysis revealed 15 new lncRNA (MCM3AP-AS1, TUG1, MIR29B2CHG, HCG18, LINC00963, KCNQ1OT1, NEAT1, HELLPAR, SNHG16, NUTM2B-AS1, MALAT1, PSMA3-AS1, GABPB1-AS1, NORAD, NKILA) and four miRNA (mir-145, mir-223, mir-let-7a, mir-132) as top gene hubs with three transcription factors (NFKB1, RELA, ESR1) as master regulators. Together, these factors are likely to be involved in co-regulatory networks driving a myometrial transformation to generate an estrogen sensitive phenotype. We conclude that lncRNA and miRNA targeting the estrogen receptor 1 and nuclear factor kappa B pathways play a key role in the initiation of human labor. For the first time we perform an integrative analysis to present a multi-level genomic signature made of mRNA, ncRNA and transcription factors in the myometrium for spontaneous term labor.
Collapse
Affiliation(s)
- Sonika Tyagi
- Central Clinical School, Monash University and the Alfred Hospital, Melbourne, VIC, Australia
| | - Eng-Cheng Chan
- Mothers and Babies Research Centre, HMRI University of Newcastle, NSW, Australia
| | | | | | - Kelly A Taylor
- Mothers and Babies Research Centre, HMRI University of Newcastle, NSW, Australia
| | | | - Esha Singh
- Department of Biotechnology and Biochemical Engineering, Indian Institute of Technology, New Delhi, India
| | - Roger Smith
- Mothers and Babies Research Centre, HMRI University of Newcastle, NSW, Australia.,University of Newcastle, Newcastle, NSW, Australia
| |
Collapse
|
112
|
Chen M, Deng Y, Li A, Tan Y. Inferring Latent Disease-lncRNA Associations by Label-Propagation Algorithm and Random Projection on a Heterogeneous Network. Front Genet 2022; 13:798632. [PMID: 35186029 PMCID: PMC8854791 DOI: 10.3389/fgene.2022.798632] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Accepted: 01/18/2022] [Indexed: 11/13/2022] Open
Abstract
Long noncoding RNA (lncRNA), a type of more than 200 nucleotides non-coding RNA, is related to various complex diseases. To precisely identify the potential lncRNA–disease association is important to understand the disease pathogenesis, to develop new drugs, and to design individualized diagnosis and treatment methods for different human diseases. Compared with the complexity and high cost of biological experiments, computational methods can quickly and effectively predict potential lncRNA–disease associations. Thus, it is a promising avenue to develop computational methods for lncRNA-disease prediction. However, owing to the low prediction accuracy ofstate of the art methods, it is vastly challenging to accurately and effectively identify lncRNA-disease at present. This article proposed an integrated method called LPARP, which is based on label-propagation algorithm and random projection to address the issue. Specifically, the label-propagation algorithm is initially used to obtain the estimated scores of lncRNA–disease associations, and then random projections are used to accurately predict disease-related lncRNAs.The empirical experiments showed that LAPRP achieved good prediction on three golddatasets, which is superior to existing state-of-the-art prediction methods. It can also be used to predict isolated diseases and new lncRNAs. Case studies of bladder cancer, esophageal squamous-cell carcinoma, and colorectal cancer further prove the reliability of the method. The proposed LPARP algorithm can predict the potential lncRNA–disease interactions stably and effectively with fewer data. LPARP can be used as an effective and reliable tool for biomedical research.
Collapse
|
113
|
Sheng N, Huang L, Wang Y, Zhao J, Xuan P, Gao L, Cao Y. Multi-channel graph attention autoencoders for disease-related lncRNAs prediction. Brief Bioinform 2022; 23:6519791. [PMID: 35108355 DOI: 10.1093/bib/bbab604] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2021] [Revised: 12/08/2021] [Accepted: 12/27/2021] [Indexed: 12/31/2022] Open
Abstract
MOTIVATION Predicting disease-related long non-coding RNAs (lncRNAs) can be used as the biomarkers for disease diagnosis and treatment. The development of effective computational prediction approaches to predict lncRNA-disease associations (LDAs) can provide insights into the pathogenesis of complex human diseases and reduce experimental costs. However, few of the existing methods use microRNA (miRNA) information and consider the complex relationship between inter-graph and intra-graph in complex-graph for assisting prediction. RESULTS In this paper, the relationships between the same types of nodes and different types of nodes in complex-graph are introduced. We propose a multi-channel graph attention autoencoder model to predict LDAs, called MGATE. First, an lncRNA-miRNA-disease complex-graph is established based on the similarity and correlation among lncRNA, miRNA and diseases to integrate the complex association among them. Secondly, in order to fully extract the comprehensive information of the nodes, we use graph autoencoder networks to learn multiple representations from complex-graph, inter-graph and intra-graph. Thirdly, a graph-level attention mechanism integration module is adopted to adaptively merge the three representations, and a combined training strategy is performed to optimize the whole model to ensure the complementary and consistency among the multi-graph embedding representations. Finally, multiple classifiers are explored, and Random Forest is used to predict the association score between lncRNA and disease. Experimental results on the public dataset show that the area under receiver operating characteristic curve and area under precision-recall curve of MGATE are 0.964 and 0.413, respectively. MGATE performance significantly outperformed seven state-of-the-art methods. Furthermore, the case studies of three cancers further demonstrate the ability of MGATE to identify potential disease-correlated candidate lncRNAs. The source code and supplementary data are available at https://github.com/sheng-n/MGATE. CONTACT huanglan@jlu.edu.cn, wy6868@jlu.edu.cn.
Collapse
Affiliation(s)
- Nan Sheng
- Key laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, Changchun 130012, China
| | - Lan Huang
- Key laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, Changchun 130012, China
| | - Yan Wang
- Key laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, Changchun 130012, China.,School of Artificial Intelligence, Jilin University, Changchun 130012, China
| | - Jing Zhao
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus OH 43210, USA
| | - Ping Xuan
- School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China
| | - Ling Gao
- School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China
| | - Yangkun Cao
- School of Artificial Intelligence, Jilin University, Changchun 130012, China
| |
Collapse
|
114
|
Heydarnezhad Asl M, Pasban Khelejani F, Bahojb Mahdavi SZ, Emrahi L, Jebelli A, Mokhtarzadeh A. The various regulatory functions of long noncoding RNAs in apoptosis, cell cycle, and cellular senescence. J Cell Biochem 2022; 123:995-1024. [PMID: 35106829 DOI: 10.1002/jcb.30221] [Citation(s) in RCA: 28] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2021] [Revised: 12/28/2021] [Accepted: 01/11/2022] [Indexed: 12/12/2022]
Abstract
Long noncoding RNAs (lncRNAs) are a group of noncoding cellular RNAs involved in significant biological phenomena such as differentiation, cell development, genomic imprinting, adjusting the enzymatic activity, regulating chromosome conformation, apoptosis, cell cycle, and cellular senescence. The misregulation of lncRNAs interrupting normal biological processes has been implicated in tumor formation and metastasis, resulting in cancer. Apoptosis and cell cycle, two main biological phenomena, are highly conserved and intimately coupled mechanisms. Hence, some cell cycle regulators can influence both programmed cell death and cell division. Apoptosis eliminates defective and unwanted cells, and the cell cycle enables cells to replicate themselves. The improper regulation of apoptosis and cell cycle contributes to numerous disorders such as neurodegenerative and autoimmune diseases, viral infection, anemia, and mainly cancer. Cellular senescence is a tumor-suppressing response initiated by environmental and internal stress factors. This phenomenon has recently attained more attention due to its therapeutic implications in the field of senotherapy. In this review, the regulatory roles of lncRNAs on apoptosis, cell cycle, and senescence will be discussed. First, the role of lncRNAs in mitochondrial dynamics and apoptosis is addressed. Next, the interaction between lncRNAs and caspases, pro/antiapoptotic proteins, and also EGFR/PI3K/PTEN/AKT/mTORC1 signaling pathway will be investigated. Furthermore, the effect of lncRNAs in the cell cycle is surveyed through interaction with cyclins, cdks, p21, and wnt/β-catenin/c-myc pathway. Finally, the function of essential lncRNAs in cellular senescence is mentioned.
Collapse
Affiliation(s)
| | - Faezeh Pasban Khelejani
- Department of Cell and Molecular Biology, Faculty of Basic Sciences, University of Maragheh, Maragheh, Iran
| | | | - Leila Emrahi
- Department of Medical Genetics, Faculty of Medical Science, Tarbiat Modares University, Tehran, Iran
| | - Asiyeh Jebelli
- Department of Biological Science, Faculty of Basic Science, Higher Education Institute of Rab-Rashid, Tabriz, Iran.,Tuberculosis and Lung Disease Research Center, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Ahad Mokhtarzadeh
- Immunology Research Center, Tabriz University of Medical Sciences, Tabriz, Iran
| |
Collapse
|
115
|
Li J, Kong M, Wang D, Yang Z, Hao X. Prediction of lncRNA-Disease Associations via Closest Node Weight Graphs of the Spatial Neighborhood Based on the Edge Attention Graph Convolutional Network. Front Genet 2022; 12:808962. [PMID: 35058974 PMCID: PMC8763691 DOI: 10.3389/fgene.2021.808962] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2021] [Accepted: 11/29/2021] [Indexed: 11/24/2022] Open
Abstract
Accumulated evidence of biological clinical trials has shown that long non-coding RNAs (lncRNAs) are closely related to the occurrence and development of various complex human diseases. Research works on lncRNA–disease relations will benefit to further understand the pathogenesis of human complex diseases at the molecular level, but only a small proportion of lncRNA–disease associations has been confirmed. Considering the high cost of biological experiments, exploring potential lncRNA–disease associations with computational approaches has become very urgent. In this study, a model based on closest node weight graph of the spatial neighborhood (CNWGSN) and edge attention graph convolutional network (EAGCN), LDA-EAGCN, was developed to uncover potential lncRNA–disease associations by integrating disease semantic similarity, lncRNA functional similarity, and known lncRNA–disease associations. Inspired by the great success of the EAGCN method on the chemical molecule property recognition problem, the prediction of lncRNA–disease associations could be regarded as a component recognition problem of lncRNA–disease characteristic graphs. The CNWGSN features of lncRNA–disease associations combined with known lncRNA–disease associations were introduced to train EAGCN, and correlation scores of input data were predicted with EAGCN for judging whether the input lncRNAs would be associated with the input diseases. LDA-EAGCN achieved a reliable AUC value of 0.9853 in the ten-fold cross-over experiments, which was the highest among five state-of-the-art models. Furthermore, case studies of renal cancer, laryngeal carcinoma, and liver cancer were implemented, and most of the top-ranking lncRNA–disease associations have been proven by recently published experimental literature works. It can be seen that LDA-EAGCN is an effective model for predicting potential lncRNA–disease associations. Its source code and experimental data are available at https://github.com/HGDKMF/LDA-EAGCN.
Collapse
Affiliation(s)
- Jianwei Li
- Institute of Computational Medicine, School of Artificial Intelligence, Hebei University of Technology, Tianjin, China.,Hebei Province Key Laboratory of Big Data Calculation, Hebei University of Technology, Tianjin, China
| | - Mengfan Kong
- Institute of Computational Medicine, School of Artificial Intelligence, Hebei University of Technology, Tianjin, China
| | - Duanyang Wang
- Institute of Computational Medicine, School of Artificial Intelligence, Hebei University of Technology, Tianjin, China
| | - Zhenwu Yang
- Institute of Computational Medicine, School of Artificial Intelligence, Hebei University of Technology, Tianjin, China
| | - Xiaoke Hao
- Institute of Computational Medicine, School of Artificial Intelligence, Hebei University of Technology, Tianjin, China
| |
Collapse
|
116
|
Li J, Yang Z, Wang D, Li Z. WAFNRLTG: A Novel Model for Predicting LncRNA Target Genes Based on Weighted Average Fusion Network Representation Learning Method. Front Cell Dev Biol 2022; 9:820342. [PMID: 35127729 PMCID: PMC8807548 DOI: 10.3389/fcell.2021.820342] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Accepted: 12/14/2021] [Indexed: 11/29/2022] Open
Abstract
Long non-coding RNAs (lncRNAs) do not encode proteins, yet they have been well established to be involved in complex regulatory functions, and lncRNA regulatory dysfunction can lead to a variety of human complex diseases. LncRNAs mostly exert their functions by regulating the expressions of target genes, and accurate prediction of potential lncRNA target genes would be helpful to further understanding the functional annotations of lncRNAs. Considering the limitations in traditional computational methods for predicting lncRNA target genes, a novel model which was named Weighted Average Fusion Network Representation learning for predicting LncRNA Target Genes (WAFNRLTG) was proposed. First, a novel heterogeneous network was constructed by integrating lncRNA sequence similarity network, mRNA sequence similarity network, lncRNA-mRNA interaction network, lncRNA-miRNA interaction network and mRNA-miRNA interaction network. Next, four popular network representation learning methods were utilized to gain the representation vectors of lncRNA and mRNA nodes. Then, the representations of lncRNAs and target genes in the heterogeneous network were obtained with the weighted average fusion network representation learning method. Finally, we merged the representations of lncRNAs and related target genes to form lncRNA-gene pairs, trained the XGBoost classifier and predicted potential lncRNA target genes. In five-cross validations on the training and independent datasets, the experimental results demonstrated that WAFNRLTG obtained better AUC scores (0.9410, 0.9350) and AUPR scores (0.9391, 0.9350). Moreover, case studies of three common lncRNAs were performed for predicting their potential lncRNA target genes and the results confirmed the effectiveness of WAFNRLTG. The source codes and all data of WAFNRLTG can be freely downloaded at https://github.com/HGDYZW/WAFNRLTG.
Collapse
Affiliation(s)
- Jianwei Li
- School of Artificial Intelligence, Institute of Computational Medicine, Hebei University of Technology, Tianjin, China
- Hebei Province Key Laboratory of Big Data Calculation, Hebei University of Technology, Tianjin, China
- *Correspondence: Jianwei Li,
| | - Zhenwu Yang
- School of Artificial Intelligence, Institute of Computational Medicine, Hebei University of Technology, Tianjin, China
| | - Duanyang Wang
- School of Artificial Intelligence, Institute of Computational Medicine, Hebei University of Technology, Tianjin, China
| | - Zhiguang Li
- School of Artificial Intelligence, Institute of Computational Medicine, Hebei University of Technology, Tianjin, China
| |
Collapse
|
117
|
Evaluation of the Prognostic Value of Long Noncoding RNAs in Lung Squamous Cell Carcinoma. JOURNAL OF ONCOLOGY 2022; 2022:9273628. [PMID: 35069738 PMCID: PMC8776467 DOI: 10.1155/2022/9273628] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Accepted: 12/16/2021] [Indexed: 12/24/2022]
Abstract
Lung squamous cell carcinoma (LUSC) is the most common type of lung cancer accounting for 40% to 51%. Long noncoding RNAs (lncRNAs) have been reported to play a significant role in the invasion, migration, and proliferation of lung cancer tissue cells. However, systematic identification of lncRNA signatures and evaluation of the prognostic value for LUSC are still an urgent problem. In this work, LUSC RNA-seq data were collected from TCGA database, and the limma R package was used to screen differentially expressed lncRNAs (DElncRNAs). In total, 216 DElncRNAs were identified between the LUSC and normal samples. lncRNAs associated with prognosis were calculated using univariate Cox regression analysis. The overall survival (OS) prognostic model containing 10 lncRNAs and the disease-free survival (DFS) prognostic model consisting of 11 lncRNAs were constructed using a machine learning-based algorithm, systematic LASSO-Cox regression analysis. We found that the survival rate of samples in the high-risk group was lower than that in the low-risk group. Results of ROC curves showed that both the OS and DFS risk score had better prognostic effects than the clinical characteristics, including age, stage, gender, and TNM. Two lncRNAs (LINC00519 and FAM83A-AS1) that were commonly identified as prognostic factors in both models could be further investigated for their clinical significance and therapeutic value. In conclusion, we constructed lncRNA prognostic models with considerable prognostic effect for both OS and DFS of LUSC.
Collapse
|
118
|
Xiong D, Wang C, Yang Z, Han F, Zhan H. Clinical Significance of Serum-Derived Exosomal LINC00917 in Patients With Non-Small Cell Lung Cancer. Front Genet 2022; 12:728763. [PMID: 35003204 PMCID: PMC8739916 DOI: 10.3389/fgene.2021.728763] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2021] [Accepted: 11/01/2021] [Indexed: 11/13/2022] Open
Abstract
Background: In this study, we aimed to explore the diagnostic potential of serum-based exosomal long intergenic noncoding RNA 917 (LINC00917) in non-small cell lung cancer (NSCLC). Methods: Exosomes were extracted from NSCLC patients’ serum samples. Exosomal LINC00917 expression levels were compared, by qRT-PCR, between cancer patients and healthy controls, as well as sub-populations of cancer patients. The association between exosomal LINC00917 expression and NSCLC patients’ clinicopathologic factors were investigated, and receiver operating characteristic (ROC) curves were drawn. In addition, NSCLC patients’ overall survivals (OSs) was examined based on exosomal LINC00917 expression and further evaluated by the cox regression analysis. Results: Serum-derived exosomal LINC00917 was highly expressed in NSCLC patients, and further upregulated in stage III/IV cancer patients. Exosomal LINC00917 yielded modestly good under the curve (AUC) values. Upregulated exosomal LINC00917 expression was closely associated with cancer patients’ advanced stages and shorter OSs. Conclusion: Serum-derived exosomal LINC00917 may hold diagnostic potential for patients with non-small cell lung cancer.
Collapse
Affiliation(s)
- Dani Xiong
- Department of Respiratory Disease, Qingdao Huangdao District Central Hospital, Qingdao, China
| | - Chuanlin Wang
- Department of Respiratory Disease, Qingdao Huangdao District Central Hospital, Qingdao, China
| | - Zhaohui Yang
- Department of Respiratory Disease, Qingdao Huangdao District Central Hospital, Qingdao, China
| | - Fusen Han
- Department of Respiratory Disease, Qingdao Huangdao District Central Hospital, Qingdao, China
| | - Huaibing Zhan
- Department of Respiratory Disease, Qingdao Huangdao District Central Hospital, Qingdao, China
| |
Collapse
|
119
|
Wang F, Zhang F, Zheng F. lncRNA Kcnq1ot1 promotes bone formation by inhibiting miR‑98‑5p/Tbx5 axis in MC3T3‑E1 cells. Exp Ther Med 2022; 23:194. [PMID: 35126697 PMCID: PMC8794546 DOI: 10.3892/etm.2022.11117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2021] [Accepted: 07/30/2021] [Indexed: 11/23/2022] Open
Abstract
Long non-coding (lnc)RNA KCNQ1 opposite strand/antisense transcript 1 (Kcnq1ot1) has been shown to regulate multiple biological processes. However, the functional role of Kcnq1ot1 in osteoporosis and the underlying mechanism are still unclear. The present study aimed to investigate the function of lncRNA Kcnq1ot1 in osteogenic differentiation. Alkaline phosphatase (ALP) activity was measured using an ALP assay kit. Western blotting was performed to assess the expression levels of osteogenic differentiation-associated proteins. Reverse transcription-quantitative PCR was performed to detect Kcnq1ot1, microRNA (miR)-98-5p and T-box transcription factor 5 (Tbx5) expression levels. The binding of Kcnq1ot1 with miR-98-5p and that of miR-98-5p with Tbx5 were predicted by starBase and TargetScan databases, respectively, and verified using dual luciferase reporter assays. The mineralization of MC3T3-E1 cells was observed using an Alizarin red S staining assay. The results revealed that expression of Kcnq1ot1 was increased and that of miR-98-5p was decreased during osteogenic differentiation. Additionally, Kcnq1ot1 was shown to target miR-98-5p and inhibit its expression. Inhibiting miR-98-5p reversed the inhibitory effect of Kcnq1ot1 knockdown on osteogenic differentiation and mineralization of MC3T3-E1 cells. Furthermore, Kcnq1ot1 regulated Tbx5 expression via miR-98-5p. Overexpressing miR-98-5p or downregulating Tbx5 expression reversed the promotive effect of Kcnq1ot1 overexpression on osteogenic differentiation and mineralization of MC3T3-E1 cells. In conclusion, these findings suggested that Kcnq1ot1 may promote bone formation by inhibiting miR-98-5p and upregulating Tbx5. Kcnq1ot1, miR-98-5p and Tbx5 may therefore serve as promising targets for the treatment of osteoporosis.
Collapse
Affiliation(s)
- Furong Wang
- Department of Orthopedics, Qinghai Provincial People's Hospital, Chengdong, Xining, Qinghai 810007, P.R. China
| | - Fucai Zhang
- Department of Orthopedics, Qinghai Provincial People's Hospital, Chengdong, Xining, Qinghai 810007, P.R. China
| | - Feng Zheng
- Department of Orthopedics, Qinghai Provincial People's Hospital, Chengdong, Xining, Qinghai 810007, P.R. China
| |
Collapse
|
120
|
Wang L, Zhong C. gGATLDA: lncRNA-disease association prediction based on graph-level graph attention network. BMC Bioinformatics 2022; 23:11. [PMID: 34983363 PMCID: PMC8729153 DOI: 10.1186/s12859-021-04548-z] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2021] [Accepted: 12/21/2021] [Indexed: 01/20/2023] Open
Abstract
Background Long non-coding RNAs (lncRNAs) are related to human diseases by regulating gene expression. Identifying lncRNA-disease associations (LDAs) will contribute to diagnose, treatment, and prognosis of diseases. However, the identification of LDAs by the biological experiments is time-consuming, costly and inefficient. Therefore, the development of efficient and high-accuracy computational methods for predicting LDAs is of great significance. Results In this paper, we propose a novel computational method (gGATLDA) to predict LDAs based on graph-level graph attention network. Firstly, we extract the enclosing subgraphs of each lncRNA-disease pair. Secondly, we construct the feature vectors by integrating lncRNA similarity and disease similarity as node attributes in subgraphs. Finally, we train a graph neural network (GNN) model by feeding the subgraphs and feature vectors to it, and use the trained GNN model to predict lncRNA-disease potential association scores. The experimental results show that our method can achieve higher area under the receiver operation characteristic curve (AUC), area under the precision recall curve (AUPR), accuracy and F1-Score than the state-of-the-art methods in five fold cross-validation. Case studies show that our method can effectively identify lncRNAs associated with breast cancer, gastric cancer, prostate cancer, and renal cancer. Conclusion The experimental results indicate that our method is a useful approach for predicting potential LDAs.
Collapse
Affiliation(s)
- Li Wang
- School of Computer Science and Engineering, South China University of Technology, Guangzhou, China.,School of Computer, Electronics and Information, Guangxi University, Nanning, China
| | - Cheng Zhong
- School of Computer, Electronics and Information, Guangxi University, Nanning, China. .,Key Laboratory of Parallel and Distributed Computing in Guangxi Colleges and Universities, Guangxi University, Nanning, China.
| |
Collapse
|
121
|
Wang L, Shang M, Dai Q, He PA. Prediction of lncRNA-disease association based on a Laplace normalized random walk with restart algorithm on heterogeneous networks. BMC Bioinformatics 2022; 23:5. [PMID: 34983367 PMCID: PMC8729064 DOI: 10.1186/s12859-021-04538-1] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Accepted: 12/15/2021] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND More and more evidence showed that long non-coding RNAs (lncRNAs) play important roles in the development and progression of human sophisticated diseases. Therefore, predicting human lncRNA-disease associations is a challenging and urgently task in bioinformatics to research of human sophisticated diseases. RESULTS In the work, a global network-based computational framework called as LRWRHLDA were proposed which is a universal network-based method. Firstly, four isomorphic networks include lncRNA similarity network, disease similarity network, gene similarity network and miRNA similarity network were constructed. And then, six heterogeneous networks include known lncRNA-disease, lncRNA-gene, lncRNA-miRNA, disease-gene, disease-miRNA, and gene-miRNA associations network were applied to design a multi-layer network. Finally, the Laplace normalized random walk with restart algorithm in this global network is suggested to predict the relationship between lncRNAs and diseases. CONCLUSIONS The ten-fold cross validation is used to evaluate the performance of LRWRHLDA. As a result, LRWRHLDA achieves an AUC of 0.98402, which is higher than other compared methods. Furthermore, LRWRHLDA can predict isolated disease-related lnRNA (isolated lnRNA related disease). The results for colorectal cancer, lung adenocarcinoma, stomach cancer and breast cancer have been verified by other researches. The case studies indicated that our method is effective.
Collapse
Affiliation(s)
- Liugen Wang
- School of Science, Zhejiang Sci-Tech University, Hangzhou, 310018, China
| | - Min Shang
- School of Science, Zhejiang Sci-Tech University, Hangzhou, 310018, China
| | - Qi Dai
- College of Life Science, Zhejiang Sci-Tech University, Hangzhou, 310018, China
| | - Ping-An He
- School of Science, Zhejiang Sci-Tech University, Hangzhou, 310018, China.
| |
Collapse
|
122
|
Predicting miRNA-Disease Association Based on Neural Inductive Matrix Completion with Graph Autoencoders and Self-Attention Mechanism. Biomolecules 2022; 12:biom12010064. [PMID: 35053212 PMCID: PMC8774034 DOI: 10.3390/biom12010064] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2021] [Revised: 12/29/2021] [Accepted: 12/31/2021] [Indexed: 02/06/2023] Open
Abstract
Many studies have clarified that microRNAs (miRNAs) are associated with many human diseases. Therefore, it is essential to predict potential miRNA-disease associations for disease pathogenesis and treatment. Numerous machine learning and deep learning approaches have been adopted to this problem. In this paper, we propose a Neural Inductive Matrix completion-based method with Graph Autoencoders (GAE) and Self-Attention mechanism for miRNA-disease associations prediction (NIMGSA). Some of the previous works based on matrix completion ignore the importance of label propagation procedure for inferring miRNA-disease associations, while others cannot integrate matrix completion and label propagation effectively. Varying from previous studies, NIMGSA unifies inductive matrix completion and label propagation via neural network architecture, through the collaborative training of two graph autoencoders. This neural inductive matrix completion-based method is also an implementation of self-attention mechanism for miRNA-disease associations prediction. This end-to-end framework can strengthen the robustness and preciseness of both matrix completion and label propagation. Cross validations indicate that NIMGSA outperforms current miRNA-disease prediction methods. Case studies demonstrate that NIMGSA is competent in detecting potential miRNA-disease associations.
Collapse
|
123
|
Diagnostic value of long noncoding RNA LINC01485 in patients with colorectal cancer. Clin Biochem 2022; 102:34-43. [DOI: 10.1016/j.clinbiochem.2022.01.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Revised: 01/05/2022] [Accepted: 01/07/2022] [Indexed: 12/14/2022]
|
124
|
|
125
|
Gong Y, Zhu W, Sun M, Shi L. Bioinformatics Analysis of Long Non-coding RNA and Related Diseases: An Overview. Front Genet 2021; 12:813873. [PMID: 34956340 PMCID: PMC8692768 DOI: 10.3389/fgene.2021.813873] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Accepted: 11/26/2021] [Indexed: 12/30/2022] Open
Abstract
Long non-coding RNAs (lncRNAs) are usually located in the nucleus and cytoplasm of cells. The transcripts of lncRNAs are >200 nucleotides in length and do not encode proteins. Compared with small RNAs, lncRNAs have longer sequences, more complex spatial structures, and more diverse and complex mechanisms involved in the regulation of gene expression. LncRNAs are widely involved in the biological processes of cells, and in the occurrence and development of many human diseases. Many studies have shown that lncRNAs can induce the occurrence of diseases, and some lncRNAs undergo specific changes in tumor cells. Research into the roles of lncRNAs has covered the diagnosis of, for example, cardiovascular, cerebrovascular, and central nervous system diseases. The bioinformatics of lncRNAs has gradually become a research hotspot and has led to the discovery of a large number of lncRNAs and associated biological functions, and lncRNA databases and recognition models have been developed. In this review, the research progress of lncRNAs is discussed, and lncRNA-related databases and the mechanisms and modes of action of lncRNAs are described. In addition, disease-related lncRNA methods and the relationships between lncRNAs and human lung adenocarcinoma, rectal cancer, colon cancer, heart disease, and diabetes are discussed. Finally, the significance and existing problems of lncRNA research are considered.
Collapse
Affiliation(s)
- Yuxin Gong
- School of Mathematics and Statistics, Hainan Normal University, Haikou, China.,Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, China.,Key Laboratory of Computational Science and Application of Hainan Province, Haikou, China.,Key Laboratory of Data Science and Smart Education, Hainan Normal University, Ministry of Education, Haikou, China
| | - Wen Zhu
- School of Mathematics and Statistics, Hainan Normal University, Haikou, China
| | - Meili Sun
- Beidahuang Industry Group General Hospital, Harbin, China
| | - Lei Shi
- Department of Spine Surgery, Changzheng Hospital, Naval Medical University, Shanghai, China
| |
Collapse
|
126
|
Duan T, Kuang Z, Wang J, Ma Z. GBDTLRL2D Predicts LncRNA-Disease Associations Using MetaGraph2Vec and K-Means Based on Heterogeneous Network. Front Cell Dev Biol 2021; 9:753027. [PMID: 34977011 PMCID: PMC8718797 DOI: 10.3389/fcell.2021.753027] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2021] [Accepted: 11/22/2021] [Indexed: 12/16/2022] Open
Abstract
In recent years, the long noncoding RNA (lncRNA) has been shown to be involved in many disease processes. The prediction of the lncRNA-disease association is helpful to clarify the mechanism of disease occurrence and bring some new methods of disease prevention and treatment. The current methods for predicting the potential lncRNA-disease association seldom consider the heterogeneous networks with complex node paths, and these methods have the problem of unbalanced positive and negative samples. To solve this problem, a method based on the Gradient Boosting Decision Tree (GBDT) and logistic regression (LR) to predict the lncRNA-disease association (GBDTLRL2D) is proposed in this paper. MetaGraph2Vec is used for feature learning, and negative sample sets are selected by using K-means clustering. The innovation of the GBDTLRL2D is that the clustering algorithm is used to select a representative negative sample set, and the use of MetaGraph2Vec can better retain the semantic and structural features in heterogeneous networks. The average area under the receiver operating characteristic curve (AUC) values of GBDTLRL2D obtained on the three datasets are 0.98, 0.98, and 0.96 in 10-fold cross-validation.
Collapse
Affiliation(s)
| | - Zhufang Kuang
- School of Computer and Information Engineering, Central South University of Forestry and Technology, Changsha, China
| | | | | |
Collapse
|
127
|
Yao Y, Shu F, Wang F, Wang X, Guo Z, Wang H, Li L, Lv H. Long noncoding RNA LINC01189 is associated with HCV-hepatocellular carcinoma and regulates cancer cell proliferation and chemoresistance through hsa-miR-155-5p. Ann Hepatol 2021; 22:100269. [PMID: 33059056 DOI: 10.1016/j.aohep.2020.09.013] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/25/2020] [Revised: 09/22/2020] [Accepted: 09/22/2020] [Indexed: 02/07/2023]
Abstract
INTRODUCTION AND OBJECTIVES Emerging evidence has demonstrated that long noncoding RNAs (lncRNAs) may be closely associated with Hepatitis C virus (HCV) infection and the development of hepatocellular carcinoma (HCC). In this study, we investigated the expression and functions of a lncRNA, LINC01189, in HCV-associated HCC. PATIENTS OR MATERIALS AND METHODS LINC01189 expression was measured in HCC tumors, HCV-infected HCC tumors and HCV-infected HCC cells. LINC01189 was overexpressed in HCV-infected HepG2 cells to measure its function on HCV-correlated cancer proliferation. In HCC cell lines of Huh7 and Hep3B, LINC01189 was upregulated to investigate its effects on cancer cell proliferation and 5-FU chemoresistance. The competing endogenous RNA (ceRNA) target of LINC01189, human microRNA-155-5p (hsa-miR-155-5p) was probed by dual-luciferase assay and qRT-PCR. Hsa-miR-155-5p was upregulated in LINC01189-overexpessed Huh7 and Hep3B cells to investigate their epigenetic correlation on HCC development regulation. RESULTS LINC01189 is downregulated in HCV-infected HCC tumors and cell lines. LINC01189 overexpression inhibited HCC cancer cell proliferation and 5-FU chemoresistance. Hsa-miR-155-5p was confirmed to be a ceRNA target of LINC01189 in HCC. Upregulating hsa-miR-155-5p reversed the LINC01189-mediated inhibition on HCC proliferation and 5-FU chemoresistance. CONCLUSIONS LINC01189 downregulation is associated with HCV infection in HCC, and it has tumor-suppressing effects on HCC development through hsa-miR-155-5p.
Collapse
Affiliation(s)
- Ying Yao
- Clinical laboratory, Honghui Hospital, Xi'an Jiaotong University, Xi'an, 710054 Shaanxi, China
| | - Fang Shu
- Clinical laboratory, Third Hospital of Xi'an, 710000, Shaanxi, China
| | - Fang Wang
- Anaesthesiology department, Honghui Hospital, Xi'an Jiaotong University, Xi'an, 710054, Shaanxi, China
| | - Xiaoqiang Wang
- Department of Cancer Biology, City of Hope National Medical Center, Duarte, CA, 91010, USA
| | - Zhengshe Guo
- Anaesthesiology department, Honghui Hospital, Xi'an Jiaotong University, Xi'an, 710054, Shaanxi, China
| | - Haili Wang
- Anaesthesiology department, Honghui Hospital, Xi'an Jiaotong University, Xi'an, 710054, Shaanxi, China
| | - Lu Li
- Anaesthesiology department, Honghui Hospital, Xi'an Jiaotong University, Xi'an, 710054, Shaanxi, China
| | - Haigang Lv
- Anaesthesiology department, Honghui Hospital, Xi'an Jiaotong University, Xi'an, 710054, Shaanxi, China.
| |
Collapse
|
128
|
GCAEMDA: Predicting miRNA-disease associations via graph convolutional autoencoder. PLoS Comput Biol 2021; 17:e1009655. [PMID: 34890410 PMCID: PMC8694430 DOI: 10.1371/journal.pcbi.1009655] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2021] [Revised: 12/22/2021] [Accepted: 11/17/2021] [Indexed: 01/02/2023] Open
Abstract
microRNAs (miRNAs) are small non-coding RNAs related to a number of complicated biological processes. A growing body of studies have suggested that miRNAs are closely associated with many human diseases. It is meaningful to consider disease-related miRNAs as potential biomarkers, which could greatly contribute to understanding the mechanisms of complex diseases and benefit the prevention, detection, diagnosis and treatment of extraordinary diseases. In this study, we presented a novel model named Graph Convolutional Autoencoder for miRNA-Disease Association Prediction (GCAEMDA). In the proposed model, we utilized miRNA-miRNA similarities, disease-disease similarities and verified miRNA-disease associations to construct a heterogeneous network, which is applied to learn the embeddings of miRNAs and diseases. In addition, we separately constructed miRNA-based and disease-based sub-networks. Combining the embeddings of miRNAs and diseases, graph convolutional autoencoder (GCAE) was utilized to calculate association scores of miRNA-disease on two sub-networks, respectively. Furthermore, we obtained final prediction scores between miRNAs and diseases by adopting an average ensemble way to integrate the prediction scores from two types of subnetworks. To indicate the accuracy of GCAEMDA, we applied different cross validation methods to evaluate our model whose performances were better than the state-of-the-art models. Case studies on a common human diseases were also implemented to prove the effectiveness of GCAEMDA. The results demonstrated that GCAEMDA was beneficial to infer potential associations of miRNA-disease. Numerous studies have demonstrated that miRNAs are closely related to several common human diseases, so observing unverified associations between miRNAs and diseases is conducive to the diagnose and treatment of complex diseases. Considerable models proposed to infer potential miRNA-disease associations have made the prediction more effective and productive. We constructed GCAEMDA model to acquire more accuracy prediction result by integrating graph convolutional network and autoencoder to make prediction based on multi-source miRNA and disease information. The five-fold cross validation and global leave-one-out cross validation were implemented to evaluate the performance of our model. Consequently, GCAEMDA reached AUCs of 0.9415 and 0.9505 respectively that were distinctly higher than AUCs of other comparative models. Furthermore, we carried out case studies on lung neoplasms and breast neoplasms to demonstrate the practical application of the model, 47 and 47 of top-50 candidate miRNAs were confirmed by experimental reports. In summary, GCAEMDA could be considered as an effective and accuracy model to reveal relationship between miRNAs and diseases.
Collapse
|
129
|
Peng L, Yuan R, Shen L, Gao P, Zhou L. LPI-EnEDT: an ensemble framework with extra tree and decision tree classifiers for imbalanced lncRNA-protein interaction data classification. BioData Min 2021; 14:50. [PMID: 34861891 PMCID: PMC8642957 DOI: 10.1186/s13040-021-00277-4] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2021] [Accepted: 08/22/2021] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND Long noncoding RNAs (lncRNAs) have dense linkages with various biological processes. Identifying interacting lncRNA-protein pairs contributes to understand the functions and mechanisms of lncRNAs. Wet experiments are costly and time-consuming. Most computational methods failed to observe the imbalanced characterize of lncRNA-protein interaction (LPI) data. More importantly, they were measured based on a unique dataset, which produced the prediction bias. RESULTS In this study, we develop an Ensemble framework (LPI-EnEDT) with Extra tree and Decision Tree classifiers to implement imbalanced LPI data classification. First, five LPI datasets are arranged. Second, lncRNAs and proteins are separately characterized based on Pyfeat and BioTriangle and concatenated as a vector to represent each lncRNA-protein pair. Finally, an ensemble framework with Extra tree and decision tree classifiers is developed to classify unlabeled lncRNA-protein pairs. The comparative experiments demonstrate that LPI-EnEDT outperforms four classical LPI prediction methods (LPI-BLS, LPI-CatBoost, LPI-SKF, and PLIPCOM) under cross validations on lncRNAs, proteins, and LPIs. The average AUC values on the five datasets are 0.8480, 0,7078, and 0.9066 under the three cross validations, respectively. The average AUPRs are 0.8175, 0.7265, and 0.8882, respectively. Case analyses suggest that there are underlying associations between HOTTIP and Q9Y6M1, NRON and Q15717. CONCLUSIONS Fusing diverse biological features of lncRNAs and proteins and exploiting an ensemble learning model with Extra tree and decision tree classifiers, this work focus on imbalanced LPI data classification as well as interaction information inference for a new lncRNA (or protein).
Collapse
Affiliation(s)
- Lihong Peng
- School of Computer Science, Hunan University of Technology, No.88, Taishan West Road, Tianyuan District, Zhuzhou, China.,College of Life Sciences and Chemistry, Hunan University of Technology, No.88, Taishan West Road, Tianyuan District, Zhuzhou, China
| | - Ruya Yuan
- School of Computer Science, Hunan University of Technology, No.88, Taishan West Road, Tianyuan District, Zhuzhou, China
| | - Ling Shen
- School of Computer Science, Hunan University of Technology, No.88, Taishan West Road, Tianyuan District, Zhuzhou, China
| | - Pengfei Gao
- College of Life Sciences and Chemistry, Hunan University of Technology, No.88, Taishan West Road, Tianyuan District, Zhuzhou, China
| | - Liqian Zhou
- School of Computer Science, Hunan University of Technology, No.88, Taishan West Road, Tianyuan District, Zhuzhou, China.
| |
Collapse
|
130
|
Chen X, Jiang Z. ISFMDA: Learning Interactions of Selected Features-Based Method for Predicting Potential MicroRNA-Disease Associations. J Comput Biol 2021; 28:1219-1227. [PMID: 34847740 DOI: 10.1089/cmb.2021.0149] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Prediction of potential microRNA-disease associations is one of the important tasks in computational biology fields. Mining more sophisticated features can improve the performance of the prediction methods. This article proposes a novel algorithm (ISFMDA) that can effectively learn low- or high-order interactions of recursive feature elimination selected features by an extreme gradient boosting, a factorization machine, and a deep neural network. As a result, ISFMDA can obtain an area under receiver operating characteristic curve (AUROC) of 0.9342 ± 0.0007 in fivefold cross-validation tests with 51.25% of original features, which verifies the effectiveness of the methods.
Collapse
Affiliation(s)
- Xuejun Chen
- School of Computer Science and Technology, East China Normal University, Shanghai, China
| | - Zhenran Jiang
- School of Computer Science and Technology, East China Normal University, Shanghai, China
| |
Collapse
|
131
|
Zhou D, He S, Zhang D, Lv Z, Yu J, Li Q, Li M, Guo W, Qi F. LINC00857 promotes colorectal cancer progression by sponging miR-150-5p and upregulating HMGB3 (high mobility group box 3) expression. Bioengineered 2021; 12:12107-12122. [PMID: 34753396 PMCID: PMC8810051 DOI: 10.1080/21655979.2021.2003941] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2021] [Revised: 11/02/2021] [Accepted: 11/03/2021] [Indexed: 01/06/2023] Open
Abstract
Colorectal cancer (CRC) is the third most commonly diagnosed malignant tumor worldwide. LINC00857 has been reported as a dysregulated long non-coding RNAs (lncRNAs) involved in the genesis and development of different cancers. In CRC, accumulating evidence indicates that high mobility group box 3 (HMGB3) is over-expressed and contributes to CRC development. However, the mechanism underlying HMGB3 upregulation in CRC remains unclear. The present work aims to investigate the role of LINC00857 and its functional interaction with HMGB3 in regulating CRC progression. Differential expression of LINC00857 between CRC tissues and normal tissues was identified in TCGA (The Cancer Genome Atlas) database. In vitro functional assays were performed to explore the biological functions of LINC00857 in CRC cells. In vivo xenograft model was employed to investigate the role of LINC00857 in CRC tumorigenesis. We found that LINC00857 was significant upregulated in CRC tissues and cell lines. LINC00857 knockdown significantly inhibited the proliferation, migration and invasion of CRC cells, and also induced apoptosis. Moreover, LINC00857 knockdown suppressed CRC tumorigenesis in vivo. We further demonstrated that the effects of LINC00857 in CRC cells were mediated through miR-150-5p/HMGB3 axis. LINC00857 negatively regulates the activity of miR-150-5p, which releases its inhibition on HMGB3 expression. Our data indicate that LINC00857/miR-150-5p/HMGB3 axis plays a fundamental role in regulating the malignant phenotype and tumorigenesis of CRC. Targeting this axis may serve as novel therapeutic strategies for CRC treatment.
Collapse
Affiliation(s)
- Dongbing Zhou
- Department of General Surgery, Tianjin Medical University General Hospital, Tianjin, China
- Department of Gastrointestinal Surgery, Nanchong Central Hospital, the Second Clinical Institute of North Sichuan Medical College, Nanchong, Sichuan, China
| | - Sijia He
- Department of Medical Imaging, Nanchong Central Hospital, the Second Clinical Institute of North Sichuan Medical College, Nanchong, Sichuan, China
| | - Daquan Zhang
- Department of Gastrointestinal Surgery, Nanchong Central Hospital, the Second Clinical Institute of North Sichuan Medical College, Nanchong, Sichuan, China
| | - Zhenbing Lv
- Department of Gastrointestinal Surgery, Nanchong Central Hospital, the Second Clinical Institute of North Sichuan Medical College, Nanchong, Sichuan, China
| | - Jing Yu
- Department of Gastrointestinal Surgery, Nanchong Central Hospital, the Second Clinical Institute of North Sichuan Medical College, Nanchong, Sichuan, China
| | - Quanlin Li
- Department of Gastrointestinal Surgery, Nanchong Central Hospital, the Second Clinical Institute of North Sichuan Medical College, Nanchong, Sichuan, China
| | - Min Li
- Department of Gastrointestinal Surgery, Nanchong Central Hospital, the Second Clinical Institute of North Sichuan Medical College, Nanchong, Sichuan, China
| | - Wei Guo
- Department of Gastrointestinal Surgery, Nanchong Central Hospital, the Second Clinical Institute of North Sichuan Medical College, Nanchong, Sichuan, China
| | - Feng Qi
- Department of General Surgery, Tianjin Medical University General Hospital, Tianjin, China
| |
Collapse
|
132
|
Xuan P, Zhan L, Cui H, Zhang T, Nakaguchi T, Zhang W. Graph Triple-Attention Network for Disease-related LncRNA Prediction. IEEE J Biomed Health Inform 2021; 26:2839-2849. [PMID: 34813484 DOI: 10.1109/jbhi.2021.3130110] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Abnormal expressions of long non-coding RNAs (lncRNAs) are associated with various human diseases. Identifying disease-related lncRNAs can help clarify complex disease pathogeneses. The latest methods for lncRNA-disease association prediction rely on diverse data about lncRNAs and diseases. These methods, however, cannot adequately integrate the neighbour topological information of lncRNA and disease nodes. Moreover, more intrinsic features of lncRNA-disease node pairs can be explored to better predict the latent associations between lncRNAs and diseases. We developed a novel method, named GTAN, to predict the association propensities between lncRNAs and diseases. GTAN integrates various information about lncRNAs and diseases, including similarities, associations and interactions among lncRNAs, diseases and miRNAs, and exploits neighbour topology and attribute representations of a pair of lncRNA-disease nodes. We adopted in GTAN a graph neural network architecture with three attention mechanisms and multi-layer convolutional neural networks. First, a neighbour-level self-attention mechanism is constructed to learn the importance of each neighbour for an interested lncRNA or disease node. Second, topology-level attention is proposed to enhance contextual dependencies among multiple local topology representations of the lncRNA or disease node. An attention-enhanced graph neural network framework is then established to learn a topology representation of top-ranked neighbours for a pair of lncRNA-disease nodes. GTAN also has attribute-level attention to distinguish various contributions of attributes of the lncRNA-disease pair. Finally, attribute representation is learned by multi-layer CNN to integrate detailed features and representative features of the pair. Extensive experimental results demonstrated that GTAN outperformed state-of-the-art methods. The improved recall rates also showed GTANs capacity for retrieving more actual lncRNA-disease associations in the top-ranked candidates. The ablation studies confirmed the important contributions of three attention mechanisms. Case studies on lung cancer, prostate cancer and colon cancer further showed GTANs ability in discovering potential lncRNA candidates related to diseases.
Collapse
|
133
|
Yang C, Wu J, Lu X, Xiong S, Xu X. Identification of novel biomarkers for intracerebral hemorrhage via long noncoding RNA-associated competing endogenous RNA network. Mol Omics 2021; 18:71-82. [PMID: 34807207 DOI: 10.1039/d1mo00298h] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Intracerebral hemorrhage (ICH) is a leading cause of death and disability worldwide. This study aimed to examine the involvement of long non-coding RNAs (lncRNAs), a group of non-coding transcripts, in ICH as potential biomarkers. An expression profile of patients with ICH using four contralateral grey matter controls (GM) and four contralateral white matter controls (WM) was downloaded from the Gene Expression Omnibus (GEO) database. Co-expressed lncRNAs and mRNAs were selected to create competing endogenous RNA (ceRNA) networks. Key lncRNAs were identified in ceRNA networks, which were validated through Real-time qPCR (RT-qPCR) with peripheral blood samples from patients with ICH. A total of 49 differentially expressed lncRNAs were discovered in different brain regions. The ceRNA network in GM included 9 lncRNAs, 40 mRNAs, and 20 microRNAs (miRNAs), while the one in WM covered 6 lncRNAs, 25 mRNAs, and 14 miRNAs. Six hub lncRNAs were observed and RT-qPCR results showed that LY86-AS1, DLX6-AS1, RRN3P2, and CRNDE were down-regulated, while HCP5 and MIAT were up-regulated in patients with ICH. Receiver Operating Characteristic (ROC) assessments demonstrated the diagnostic value of these lncRNAs. Our findings highlight the potential roles of lncRNA in ICH pathogenesis. Moreover, the hub lncRNAs discovered here might become novel biomarkers and promising targets for ICH drug development.
Collapse
Affiliation(s)
- Chunyu Yang
- Department of Neurology, the First Hospital of China Medical University, No 155, Nanjing Street, Heping District, Shenyang, Liaoning, 110001, China. .,Department of Pharmacy, The Fourth Hospital of China Medical University, Shenyang, China
| | - Jiao Wu
- Department of Neurology, The People's Hospital of Liaoning Province, Shenyang, China
| | - Xi Lu
- Department of Public Health, Faculty of Life Sciences, Kumamoto University, Kumamoto, Japan
| | - Shuang Xiong
- Liaoning Academy of Analytic Science, Construction Engineering Center of Important Technology Innovation and Research and Development Base in Liaoning Province, Shenyang, China
| | - Xiaoxue Xu
- Department of Neurology, the First Hospital of China Medical University, No 155, Nanjing Street, Heping District, Shenyang, Liaoning, 110001, China.
| |
Collapse
|
134
|
Graph convolutional network approach to discovering disease-related circRNA-miRNA-mRNA axes. Methods 2021; 198:45-55. [PMID: 34758394 DOI: 10.1016/j.ymeth.2021.10.006] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2021] [Revised: 10/07/2021] [Accepted: 10/19/2021] [Indexed: 02/05/2023] Open
Abstract
Non-coding RNAs are gaining prominence in biology and medicine, as they play major roles in cellular homeostasis among which the circRNA-miRNA-mRNA axes are involved in a series of disease-related pathways, such as apoptosis, cell invasion and metastasis. Recently, many computational methods have been developed for the prediction of the relationship between ncRNAs and diseases, which can alleviate the time-consuming and labor-intensive exploration involved with biological experiments. However, these methods handle ncRNAs separately, ignoring the impact of the interactions among ncRNAs on the diseases. In this paper we present a novel approach to discovering disease-related circRNA-miRNA-mRNA axes from the disease-RNA information network. Our method, using graph convolutional network, learns the characteristic representation of each biological entity by propagating and aggregating local neighbor information based on the global structure of the network. The approach is evaluated using the real-world datasets and the results show that it outperforms other state-of-the-art baselines on most of the metrics.
Collapse
|
135
|
Bamunu Mudiyanselage T, Lei X, Senanayake N, Zhang Y, Pan Y. Predicting CircRNA disease associations using novel node classification and link prediction models on Graph Convolutional Networks. Methods 2021; 198:32-44. [PMID: 34748953 DOI: 10.1016/j.ymeth.2021.10.008] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2021] [Revised: 09/21/2021] [Accepted: 10/22/2021] [Indexed: 12/17/2022] Open
Abstract
Accumulated studies have discovered that circular RNAs (CircRNAs) are closely related to many complex human diseases. Due to this close relationship, CircRNAs can be used as good biomarkers for disease diagnosis and therapeutic targets for treatments. However, the number of experimentally verified circRNA-disease associations are still fewer and also conducting wet-lab experiments are constrained by the small scale and cost of time and labour. Therefore, effective computational methods are required to predict associations between circRNAs and diseases which will be promising candidates for small scale biological and clinical experiments. In this paper, we propose novel computational models based on Graph Convolution Networks (GCN) for the potential circRNA-disease association prediction. Currently most of the existing prediction methods use shallow learning algorithms. Instead, the proposed models combine the strengths of deep learning and graphs for the computation. First, they integrate multi-source similarity information into the association network. Next, models predict potential associations using graph convolution which explore this important relational knowledge of that network structure. Two circRNA-disease association prediction models, GCN based Node Classification (GCN-NC) and GCN based Link Prediction (GCN-LP) are introduced in this work and they demonstrate promising results in various experiments and outperforms other existing methods. Further, a case study proves that some of the predicted results of the novel computational models were confirmed by published literature and all top results could be verified using gene-gene interaction networks.
Collapse
Affiliation(s)
| | - Xiujuan Lei
- School of Computer Science, Shaanxi Normal University, Xi'an 710119, China.
| | - Nipuna Senanayake
- Department of Computer Science, Georgia State University, Atlanta, USA.
| | - Yanqing Zhang
- Department of Computer Science, Georgia State University, Atlanta, USA.
| | - Yi Pan
- Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China.
| |
Collapse
|
136
|
Wang CC, Han CD, Zhao Q, Chen X. Circular RNAs and complex diseases: from experimental results to computational models. Brief Bioinform 2021; 22:bbab286. [PMID: 34329377 PMCID: PMC8575014 DOI: 10.1093/bib/bbab286] [Citation(s) in RCA: 104] [Impact Index Per Article: 34.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2021] [Revised: 06/23/2021] [Accepted: 07/03/2021] [Indexed: 12/13/2022] Open
Abstract
Circular RNAs (circRNAs) are a class of single-stranded, covalently closed RNA molecules with a variety of biological functions. Studies have shown that circRNAs are involved in a variety of biological processes and play an important role in the development of various complex diseases, so the identification of circRNA-disease associations would contribute to the diagnosis and treatment of diseases. In this review, we summarize the discovery, classifications and functions of circRNAs and introduce four important diseases associated with circRNAs. Then, we list some significant and publicly accessible databases containing comprehensive annotation resources of circRNAs and experimentally validated circRNA-disease associations. Next, we introduce some state-of-the-art computational models for predicting novel circRNA-disease associations and divide them into two categories, namely network algorithm-based and machine learning-based models. Subsequently, several evaluation methods of prediction performance of these computational models are summarized. Finally, we analyze the advantages and disadvantages of different types of computational models and provide some suggestions to promote the development of circRNA-disease association identification from the perspective of the construction of new computational models and the accumulation of circRNA-related data.
Collapse
Affiliation(s)
- Chun-Chun Wang
- School of Information and Control Engineering, China University of Mining and Technology
| | - Chen-Di Han
- School of Information and Control Engineering, China University of Mining and Technology
| | - Qi Zhao
- School of Computer Science and Software Engineering, University of Science and Technology Liaoning
| | - Xing Chen
- China University of Mining and Technology
| |
Collapse
|
137
|
Yang X, Kuang L, Chen Z, Wang L. Multi-Similarities Bilinear Matrix Factorization-Based Method for Predicting Human Microbe-Disease Associations. Front Genet 2021; 12:754425. [PMID: 34721543 PMCID: PMC8551558 DOI: 10.3389/fgene.2021.754425] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Accepted: 09/21/2021] [Indexed: 11/13/2022] Open
Abstract
Accumulating studies have shown that microbes are closely related to human diseases. In this paper, a novel method called MSBMFHMDA was designed to predict potential microbe-disease associations by adopting multi-similarities bilinear matrix factorization. In MSBMFHMDA, a microbe multiple similarities matrix was constructed first based on the Gaussian interaction profile kernel similarity and cosine similarity for microbes. Then, we use the Gaussian interaction profile kernel similarity, cosine similarity, and symptom similarity for diseases to compose the disease multiple similarities matrix. Finally, we integrate these two similarity matrices and the microbe-disease association matrix into our model to predict potential associations. The results indicate that our method can achieve reliable AUCs of 0.9186 and 0.9043 ± 0.0048 in the framework of leave-one-out cross validation (LOOCV) and fivefold cross validation, respectively. What is more, experimental results indicated that there are 10, 10, and 8 out of the top 10 related microbes for asthma, inflammatory bowel disease, and type 2 diabetes mellitus, respectively, which were confirmed by experiments and literatures. Therefore, our model has favorable performance in predicting potential microbe-disease associations.
Collapse
Affiliation(s)
- Xiaoyu Yang
- Key Laboratory of Hunan Province for Internet of Things and Information Security, Xiangtan University, Xiangtan, China.,College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, China
| | - Linai Kuang
- Key Laboratory of Hunan Province for Internet of Things and Information Security, Xiangtan University, Xiangtan, China
| | - Zhiping Chen
- College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, China
| | - Lei Wang
- Key Laboratory of Hunan Province for Internet of Things and Information Security, Xiangtan University, Xiangtan, China.,College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, China
| |
Collapse
|
138
|
SVDNVLDA: predicting lncRNA-disease associations by Singular Value Decomposition and node2vec. BMC Bioinformatics 2021; 22:538. [PMID: 34727886 PMCID: PMC8561941 DOI: 10.1186/s12859-021-04457-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2021] [Accepted: 10/18/2021] [Indexed: 11/10/2022] Open
Abstract
Background Numerous studies on discovering the roles of long non-coding RNAs (lncRNAs) in the occurrence, development and prognosis progresses of various human diseases have drawn substantial attentions. Since only a tiny portion of lncRNA-disease associations have been properly annotated, an increasing number of computational methods have been proposed for predicting potential lncRNA-disease associations. However, traditional predicting models lack the ability to precisely extract features of biomolecules, it is urgent to find a model which can identify potential lncRNA-disease associations with both efficiency and accuracy. Results In this study, we proposed a novel model, SVDNVLDA, which gained the linear and non-linear features of lncRNAs and diseases with Singular Value Decomposition (SVD) and node2vec methods respectively. The integrated features were constructed from connecting the linear and non-linear features of each entity, which could effectively enhance the semantics contained in ultimate representations. And an XGBoost classifier was employed for identifying potential lncRNA-disease associations eventually. Conclusions We propose a novel model to predict lncRNA-disease associations. This model is expected to identify potential relationships between lncRNAs and diseases and further explore the disease mechanisms at the lncRNA molecular level. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-021-04457-1.
Collapse
|
139
|
Zhao X, Yang Y, Yin M. MHRWR: Prediction of lncRNA-Disease Associations Based on Multiple Heterogeneous Networks. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:2577-2585. [PMID: 32086216 DOI: 10.1109/tcbb.2020.2974732] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
In the last few years, accumulating evidences had demonstrated that long non-coding RNAs (lncRNAs) participated in the regulation of target gene expression and played an important role in biological processes and human disease development. Thus, prediction of the associations between lncRNAs and disease had become a hot research in the fields of human sophisticated diseases. Most of these methods considered the information of two networks (lncRNA, disease) while neglected other networks. In this study, we designed a multi-layer network by integrating the similarity networks of lncRNAs, diseases and genes, and the known association networks of lncRNA-disease, lncRNAs-gene, and disease-gene, and then we developed a model called MHRWR for predicting the lncRNA-disease potential associations based on random walk with restart. The performance of MHRWR was evaluated by experimentally verified lncRNA-disease associations based on leave-one-out cross validation. MHRWR obtained a reliable AUC value of 0.91344, which significantly outperformed some previous methods. To further validate the reproducibility of performance, we used the model of MHRWR to verify related lncRNAs of colon cancer, colorectal cancer and lung adenocarcinoma in the case studies. The codes of MHRWR is available on: https://github.com/yangyq505/MHRWR.
Collapse
|
140
|
Zeng M, Lu C, Fei Z, Wu FX, Li Y, Wang J, Li M. DMFLDA: A Deep Learning Framework for Predicting lncRNA-Disease Associations. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:2353-2363. [PMID: 32248123 DOI: 10.1109/tcbb.2020.2983958] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
A growing amount of evidence suggests that long non-coding RNAs (lncRNAs) play important roles in the regulation of biological processes in many human diseases. However, the number of experimentally verified lncRNA-disease associations is very limited. Thus, various computational approaches are proposed to predict lncRNA-disease associations. Current matrix factorization-based methods cannot capture the complex non-linear relationship between lncRNAs and diseases, and traditional machine learning-based methods are not sufficiently powerful to learn the representation of lncRNAs and diseases. Considering these limitations in existing computational methods, we propose a deep matrix factorization model to predict lncRNA-disease associations (DMFLDA in short). DMFLDA uses a cascade of non-linear hidden layers to learn latent representation to represent lncRNAs and diseases. By using non-linear hidden layers, DMFLDA captures the more complex non-linear relationship between lncRNAs and diseases than traditional matrix factorization-based methods. In addition, DMFLDA learns features directly from the lncRNA-disease interaction matrix and thus can obtain more accurate representation learning for lncRNAs and diseases than traditional machine learning methods. The low dimensional representations of the lncRNAs and diseases are fused to estimate the new interaction value. To evaluate the performance of DMFLDA, we perform leave-one-out cross-validation and 5-fold cross-validation on known experimentally verified lncRNA-disease associations. The experimental results show that DMFLDA performs better than the existing methods. The case studies show that many predicted interactions of colorectal cancer, prostate cancer, and renal cancer have been verified by recent biomedical literature. The source code and datasets can be obtained from https://github.com/CSUBioGroup/DMFLDA.
Collapse
|
141
|
Li H, Wang Y, Zhang Z, Tan Y, Chen Z, Wang X, Pei T, Wang L. Identifying Microbe-Disease Association Based on a Novel Back-Propagation Neural Network Model. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:2502-2513. [PMID: 32305935 DOI: 10.1109/tcbb.2020.2986459] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Over the years, numerous evidences have demonstrated that microbes living in the human body are closely related to human life activities and human diseases. However, traditional biological experiments are time-consuming and expensive, so it has become a research topic in bioinformatics to predict potential microbe-disease associations by adopting computational methods. In this study, a novel calculative method called BPNNHMDA is proposed to identify potential microbe-disease associations. In BPNNHMDA, a novel neural network model is first designed to infer potential microbe-disease associations, its input signal is a matrix of known microbe-disease associations, and its output signal is matrix of potential microbe-disease associations probabilities. And moreover, in the novel neural network model, a new activation function is designed to activate the hidden layer and the output layer based on the hyperbolic tangent function, and its initial connection weights are optimized by adopting Gaussian Interaction Profile kernel (GIP) similarity for microbes, which can improve the training speed of BPNNHMDA efficiently. Finally, in order to verify the performance of our prediction model, different frameworks such as the Leave-One-Out Cross Validation (LOOCV) and k-Fold Cross Validation ( k-Fold CV) are implemented on BPNNHMDA respectively. Simulation results illustrate that BPNNHMDA can achieve reliable AUCs of 0.9242, 0.9127 ± 0.0009 and 0.8955 ± 0.0018 in LOOCV, 5-Fold CV and 2-Fold CV separately, which are superior to previous state-of-the-art methods. Furthermore, case studies of inflammatory bowel disease (IBD), asthma and obesity demonstrate that BPNNHMDA has excellent prediction ability in practical applications as well.
Collapse
|
142
|
Waters E, Pucci P, Hirst M, Chapman S, Wang Y, Crea F, Heath CJ. HAR1: an insight into lncRNA genetic evolution. Epigenomics 2021; 13:1831-1843. [PMID: 34676772 DOI: 10.2217/epi-2021-0069] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Long noncoding RNAs (lncRNAs) have a wide range of functions in health and disease, but many remain uncharacterized because of their complex expression patterns and structures. The genetic loci encoding lncRNAs can be subject to accelerated evolutionary changes within the human lineage. HAR1 is a region that has a significantly altered sequence compared to other primates and is a component of two overlapping lncRNA loci, HAR1A and HAR1B. Although the functions of these lncRNAs are unknown, they have been associated with neurological disorders and cancer. Here, we explore the current state of understanding of evolution in human lncRNA genes, using the HAR1 locus as the case study.
Collapse
Affiliation(s)
- Ella Waters
- School of Life, Health & Chemical Sciences, The Open University, Milton Keynes, MK7 6AA, UK
| | - Perla Pucci
- School of Life, Health & Chemical Sciences, The Open University, Milton Keynes, MK7 6AA, UK.,Division of Cellular & Molecular Pathology, Department of Pathology, University of Cambridge, Cambridge, CB2 0QQ, UK
| | - Mark Hirst
- School of Life, Health & Chemical Sciences, The Open University, Milton Keynes, MK7 6AA, UK
| | - Simon Chapman
- School of Life, Health & Chemical Sciences, The Open University, Milton Keynes, MK7 6AA, UK
| | - Yuzhuo Wang
- Department of Urologic Sciences, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
| | - Francesco Crea
- School of Life, Health & Chemical Sciences, The Open University, Milton Keynes, MK7 6AA, UK
| | - Christopher J Heath
- School of Life, Health & Chemical Sciences, The Open University, Milton Keynes, MK7 6AA, UK
| |
Collapse
|
143
|
He S, Arikin A, Chen J, Huang T, Wu Z, Wang L, Yang F, Li Y, Yang Y, Wang R, Lian M, Zhong Q, Fang J. Transcriptome Analysis Identified 2 New lncRNAs Associated with the Metastasis of Papillary Thyroid Carcinoma. ORL J Otorhinolaryngol Relat Spec 2021; 84:247-254. [PMID: 34818244 DOI: 10.1159/000518085] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2021] [Accepted: 06/20/2021] [Indexed: 11/19/2022]
Abstract
INTRODUCTION Papillary thyroid microcarcinoma (PTMC) is a specific subgroup of papillary thyroid carcinoma and defined with the dimension ≤1 cm by the WHO. Although it shows a relatively high 10-year livability, the metastasis of PTMC into other tissues and organs seriously affects the daily life of patients with relatively high mortality. Therefore, the genetic basis for the metastasis of PTMC needs to be explored for effective therapeutic targets. Here, we conducted a series of comparative analysis of the transcriptional expression profile between PTMC patients with and without lymph node metastasis. METHODS Gene expression profile and gene function were analyzed using RNA extracted from pathological tissues of 12 patients with PTMC, and the core biomarkers closely related to its metastasis were identified. RESULTS Our results showed that 7,507 genes and 42 RNAs showed remarkably different expression patterns. More sophisticated analysis showed that the high expression of 2 lncRNAs (T077499 and T004533) resulted in the metastasis of PTMC, which suggests that the expression pattern of the 2 lncRNAs may act as a potential biomarker for pathogenesis and prognosis of PTMC metastasis. CONCLUSION Our findings preliminarily reveal the molecular mechanisms for PTMC metastasis, which will provide vital reference for subsequent studies about the genetic basis and molecular targeted therapy for PTMC metastasis.
Collapse
Affiliation(s)
- Shizhi He
- Department of Otolaryngology Head Neck Surgery, Beijing Tongren Hospital, Capital Medical University, Beijing, China
| | - Abdeyrim Arikin
- Department of Otorhinolaryngology Head and Neck Surgery, The People's Hospital of Xinjiang Uygur Autonomous Region, Ürümqi, China
| | - Jiaming Chen
- Department of Otolaryngology Head Neck Surgery, Beijing Tongren Hospital, Capital Medical University, Beijing, China
| | - Tianqiao Huang
- Department of Otolaryngology Head Neck Surgery, Beijing Tongren Hospital, Capital Medical University, Beijing, China
| | - Zhen Wu
- Department of Otolaryngology Head Neck Surgery, Beijing Tongren Hospital, Capital Medical University, Beijing, China
| | - Lingwa Wang
- Department of Otolaryngology Head Neck Surgery, Beijing Tongren Hospital, Capital Medical University, Beijing, China
| | - Fan Yang
- Department of Otolaryngology Head Neck Surgery, Beijing Anzhen Hospital, Capital Medical University, Beijing, China
| | - Yunxia Li
- Department of Otolaryngology Head Neck Surgery, Beijing Tongren Hospital, Capital Medical University, Beijing, China
| | - Yifan Yang
- Department of Otolaryngology Head Neck Surgery, Beijing Tongren Hospital, Capital Medical University, Beijing, China
| | - Ru Wang
- Department of Otolaryngology Head Neck Surgery, Beijing Tongren Hospital, Capital Medical University, Beijing, China
| | - Meng Lian
- Department of Otolaryngology Head Neck Surgery, Beijing Tongren Hospital, Capital Medical University, Beijing, China
| | - Qi Zhong
- Department of Otolaryngology Head Neck Surgery, Beijing Tongren Hospital, Capital Medical University, Beijing, China
| | - Jugao Fang
- Department of Otolaryngology Head Neck Surgery, Beijing Tongren Hospital, Capital Medical University, Beijing, China
| |
Collapse
|
144
|
Yang Z, Hu S, He Y, Ji L. LINC00665 rescues bupivacaine induced neurotoxicity in human neural cell of SH-SY5Y through has-miR-34a-5p. Brain Res Bull 2021; 177:210-216. [PMID: 34626694 DOI: 10.1016/j.brainresbull.2021.10.004] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2021] [Revised: 09/30/2021] [Accepted: 10/04/2021] [Indexed: 10/20/2022]
Abstract
BACKGROUND Excessive application of local anesthetics, bupivacaine (BUP) may induce neurotoxicity and lead to neurologic dysfunctions in human brains. Yet, the exact molecular mechanisms underlying BUP-induced neurotoxicity was not fully understood. In this study, we utilized an in vitro SH-SY5Y cell culture model to explore the functional mechanism of long intergenic non-protein coding RNA 665 (LINC00665) in regulating BUP-induced neurotoxicity. METHODS SH-SY5Y cells were induced with BUP in vitro, and their viability and apoptosis were monitored. BUP-induced LINC00665 expression was also monitored, by qRT-PCR. LINC00665 was then overexpressed in SH-SY5Y cells, and its effects on BUP-induced neurotoxicity were investigated. The downstream target transcript of LINC00665, human mature microRNA-34a-5p (hsa-miR-34a-5p) was investigated in BUP-induced SH-SY5Y cells. Co-regulation of LINC00665 / hsa-miR-132-3p epigenetic axis was further examined on BUP-induced apoptosis in SH-SY5Y cells. RESULTS BUP reduced cell viability, induced apoptosis and downregulated LINC00665 in SH-SY5Y cells. LINC00665 overexpression rescued BUP-induced neurotoxicity in SH-SY5Y cells. Hsa-miR-34a-5p expression was directly correlated with BUP treatment and LINC00665 overexpression in SH-SY5Y cells. Upregulating hsa-miR-34a-5p reversed the rescuing effects of LINC00665 on BUP-induced SH-SY5Y apoptosis. CONCLUSIONS BUP-induced neurotoxicity in human neural cells may be regulated by the epigenetic axis of LINC00665 / hsa-miR-34a-5p.
Collapse
Affiliation(s)
- Zhoujing Yang
- Anesthesiology & Perioperative Medicine Centre, Xi'an People's Hospital, Xi'an 710004, Shaanxi Province, China
| | - Sheng Hu
- Anesthesiology & Perioperative Medicine Centre, Xi'an People's Hospital, Xi'an 710004, Shaanxi Province, China
| | - Yinbin He
- Anesthesiology & Perioperative Medicine Centre, Xi'an People's Hospital, Xi'an 710004, Shaanxi Province, China
| | - Ling Ji
- Anesthesiology & Perioperative Medicine Centre, Xi'an People's Hospital, Xi'an 710004, Shaanxi Province, China.
| |
Collapse
|
145
|
Wang B, Zhang C, Du XX, Zhang JF. lncRNA-disease association prediction based on latent factor model and projection. Sci Rep 2021; 11:19965. [PMID: 34620945 PMCID: PMC8497550 DOI: 10.1038/s41598-021-99493-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2021] [Accepted: 09/27/2021] [Indexed: 02/08/2023] Open
Abstract
Computer aided research of lncRNA-disease association is an important way to study the development of lncRNA-disease. The correlation analysis of existing data, the establishment of prediction model, prediction of unknown lncRNA-disease association, can make the biological experiment targeted, improve the accuracy of biological experiment. In this paper, a lncRNA-disease association prediction model based on latent factor model and projection is proposed (LFMP). This method uses lncRNA-miRNA association data and miRNA-disease association data to predict the unknown lncRNA-disease association, so this method does not need lncRNA-disease association data. The simulation results show that under the LOOCV framework, the AUC of LFMP can reach 0.8964. Better than the latest results. Through the case study of lung and colorectal tumors, LFMP can effectively infer the undetected lncRNA-disease association.
Collapse
Affiliation(s)
- Bo Wang
- grid.412616.60000 0001 0002 2355College of Computer and Control Engineering, Qiqihar University, Qiqihar, 161006 People’s Republic of China
| | - Chao Zhang
- grid.412616.60000 0001 0002 2355College of Computer and Control Engineering, Qiqihar University, Qiqihar, 161006 People’s Republic of China
| | - Xiao-xin Du
- grid.412616.60000 0001 0002 2355College of Computer and Control Engineering, Qiqihar University, Qiqihar, 161006 People’s Republic of China
| | - Jian-fei Zhang
- grid.412616.60000 0001 0002 2355College of Computer and Control Engineering, Qiqihar University, Qiqihar, 161006 People’s Republic of China
| |
Collapse
|
146
|
Zhou L, Wang Z, Tian X, Peng L. LPI-deepGBDT: a multiple-layer deep framework based on gradient boosting decision trees for lncRNA-protein interaction identification. BMC Bioinformatics 2021; 22:479. [PMID: 34607567 PMCID: PMC8489074 DOI: 10.1186/s12859-021-04399-8] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2021] [Accepted: 07/14/2021] [Indexed: 12/31/2022] Open
Abstract
Background Long noncoding RNAs (lncRNAs) play important roles in various biological and pathological processes. Discovery of lncRNA–protein interactions (LPIs) contributes to understand the biological functions and mechanisms of lncRNAs. Although wet experiments find a few interactions between lncRNAs and proteins, experimental techniques are costly and time-consuming. Therefore, computational methods are increasingly exploited to uncover the possible associations. However, existing computational methods have several limitations. First, majority of them were measured based on one simple dataset, which may result in the prediction bias. Second, few of them are applied to identify relevant data for new lncRNAs (or proteins). Finally, they failed to utilize diverse biological information of lncRNAs and proteins. Results Under the feed-forward deep architecture based on gradient boosting decision trees (LPI-deepGBDT), this work focuses on classify unobserved LPIs. First, three human LPI datasets and two plant LPI datasets are arranged. Second, the biological features of lncRNAs and proteins are extracted by Pyfeat and BioProt, respectively. Thirdly, the features are dimensionally reduced and concatenated as a vector to represent an lncRNA–protein pair. Finally, a deep architecture composed of forward mappings and inverse mappings is developed to predict underlying linkages between lncRNAs and proteins. LPI-deepGBDT is compared with five classical LPI prediction models (LPI-BLS, LPI-CatBoost, PLIPCOM, LPI-SKF, and LPI-HNM) under three cross validations on lncRNAs, proteins, lncRNA–protein pairs, respectively. It obtains the best average AUC and AUPR values under the majority of situations, significantly outperforming other five LPI identification methods. That is, AUCs computed by LPI-deepGBDT are 0.8321, 0.6815, and 0.9073, respectively and AUPRs are 0.8095, 0.6771, and 0.8849, respectively. The results demonstrate the powerful classification ability of LPI-deepGBDT. Case study analyses show that there may be interactions between GAS5 and Q15717, RAB30-AS1 and O00425, and LINC-01572 and P35637. Conclusions Integrating ensemble learning and hierarchical distributed representations and building a multiple-layered deep architecture, this work improves LPI prediction performance as well as effectively probes interaction data for new lncRNAs/proteins.
Collapse
Affiliation(s)
- Liqian Zhou
- School of Computer Science, Hunan University of Technology, No. 88, Taishan West Road, Tianyuan District, Zhuzhou, China
| | - Zhao Wang
- School of Computer Science, Hunan University of Technology, No. 88, Taishan West Road, Tianyuan District, Zhuzhou, China
| | - Xiongfei Tian
- School of Computer Science, Hunan University of Technology, No. 88, Taishan West Road, Tianyuan District, Zhuzhou, China
| | - Lihong Peng
- School of Computer Science, Hunan University of Technology, No. 88, Taishan West Road, Tianyuan District, Zhuzhou, China. .,College of Life Sciences and Chemistry, Hunan University of Technology, No. 88, Taishan West Road, Tianyuan District, Zhuzhou, China.
| |
Collapse
|
147
|
Zhao X, Zhao X, Yin M. Heterogeneous graph attention network based on meta-paths for lncRNA-disease association prediction. Brief Bioinform 2021; 23:6377515. [PMID: 34585231 DOI: 10.1093/bib/bbab407] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2021] [Revised: 08/26/2021] [Indexed: 12/15/2022] Open
Abstract
MOTIVATION Discovering long noncoding RNA (lncRNA)-disease associations is a fundamental and critical part in understanding disease etiology and pathogenesis. However, only a few lncRNA-disease associations have been identified because of the time-consuming and expensive biological experiments. As a result, an efficient computational method is of great importance and urgently needed for identifying potential lncRNA-disease associations. With the ability of exploiting node features and relationships in network, graph-based learning models have been commonly utilized by these biomolecular association predictions. However, the capability of these methods in comprehensively fusing node features, heterogeneous topological structures and semantic information is distant from optimal or even satisfactory. Moreover, there are still limitations in modeling complex associations between lncRNAs and diseases. RESULTS In this paper, we develop a novel heterogeneous graph attention network framework based on meta-paths for predicting lncRNA-disease associations, denoted as HGATLDA. At first, we conduct a heterogeneous network by incorporating lncRNA and disease feature structural graphs, and lncRNA-disease topological structural graph. Then, for the heterogeneous graph, we conduct multiple metapath-based subgraphs and then utilize graph attention network to learn node embeddings from neighbors of these homogeneous and heterogeneous subgraphs. Next, we implement attention mechanism to adaptively assign weights to multiple metapath-based subgraphs and get more semantic information. In addition, we combine neural inductive matrix completion to reconstruct lncRNA-disease associations, which is applied for capturing complicated associations between lncRNAs and diseases. Moreover, we incorporate cost-sensitive neural network into the loss function to tackle the commonly imbalance problem in lncRNA-disease association prediction. Finally, extensive experimental results demonstrate the effectiveness of our proposed framework.
Collapse
Affiliation(s)
- Xiaosa Zhao
- School of Information Science and Technology, Northeast Normal University, Changchun 130117, China
| | - Xiaowei Zhao
- School of Information Science and Technology, Northeast Normal University, Changchun 130117, China
| | - Minghao Yin
- School of Information Science and Technology, Northeast Normal University, Changchun 130117, China
| |
Collapse
|
148
|
Li X, Han X, Sun C, Li G, Wang K, Li X, Qiao R. Analysis of mRNA and Long Non-Coding RNA Expression Profiles in Developing Yorkshire Pig Spleens. Animals (Basel) 2021; 11:ani11102768. [PMID: 34679790 PMCID: PMC8532824 DOI: 10.3390/ani11102768] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Revised: 09/16/2021] [Accepted: 09/18/2021] [Indexed: 12/19/2022] Open
Abstract
Simple Summary Epidemic disease is a prominent problem in intensive pig production. The spleen is a blood bank and the largest immune organ, and most of the diseases in pig farms will be reflected as spleen abnormality. The results showed how the mRNA–lncRNA expression profiles in Yorkshire spleens varied with age (seven, 90, and 180 days after birth). Our study shows that 90 days after birth the gene-expression profile of pig spleen no longer changes significantly. The results are helpful for a better understanding of the transcriptome and functional genomics of spleen tissue in farm animals and could provide reference for precision pig disease research and prevention and control in pig farms. Abstract Epidemic diseases cause great economic loss in pig farms each year; some of these diseases are characterized mainly in the spleen, but mRNA and lncRNA (long non-coding RNA) expression networks in developing Yorkshire pig spleens remain obscure. Here, we profiled the systematic characters of mRNA and lncRNA repertoires in three groups of spleens from nine Yorkshire pigs, each three aged at seven days, 90 days, and 180 days. By using a precise mRNA and lncRNA identification pipeline, we identified 19,647 genes and 219 known and 3219 putative lncRNA transcripts; 1729 genes and 64 lncRNAs therein were found to express differentially. The gene expression characteristics of genes and lncRNAs were found to be basically fixed before 90 days after birth. Three large gene expression modules were detected. The enrichment analyses of differentially expressed genes and the potential target genes of differentially expressed lncRNAs both displayed the crucial roles of up-regulation in immune activation and hematopoiesis, and down-regulation in cell replication and division in 90 days and 180 days compared to seven days. ENSSSCT00000001325 was the only lncRNA transcript that existed in the three groups. CDK1, PCNA, and PLK were detected to be node genes that varied with age. This study contributes to a further understanding of mRNA and lncRNA expression in different developmental pig spleens.
Collapse
|
149
|
Ding P, Ouyang W, Luo J, Kwoh CK. Heterogeneous information network and its application to human health and disease. Brief Bioinform 2021; 21:1327-1346. [PMID: 31566212 DOI: 10.1093/bib/bbz091] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2019] [Revised: 06/29/2019] [Accepted: 06/30/2019] [Indexed: 12/11/2022] Open
Abstract
The molecular components with the functional interdependencies in human cell form complicated biological network. Diseases are mostly caused by the perturbations of the composite of the interaction multi-biomolecules, rather than an abnormality of a single biomolecule. Furthermore, new biological functions and processes could be revealed by discovering novel biological entity relationships. Hence, more and more biologists focus on studying the complex biological system instead of the individual biological components. The emergence of heterogeneous information network (HIN) offers a promising way to systematically explore complicated and heterogeneous relationships between various molecules for apparently distinct phenotypes. In this review, we first present the basic definition of HIN and the biological system considered as a complex HIN. Then, we discuss the topological properties of HIN and how these can be applied to detect network motif and functional module. Afterwards, methodologies of discovering relationships between disease and biomolecule are presented. Useful insights on how HIN aids in drug development and explores human interactome are provided. Finally, we analyze the challenges and opportunities for uncovering combinatorial patterns among pharmacogenomics and cell-type detection based on single-cell genomic data.
Collapse
Affiliation(s)
- Pingjian Ding
- School of Computer Science, University of South China, Hengyang, China
| | - Wenjue Ouyang
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, China
| | - Jiawei Luo
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, China
| | - Chee-Keong Kwoh
- School of Computer Science and Engineering, Nanyang Technological University, Singapore, Singapore
| |
Collapse
|
150
|
Yao Y, Ji B, Lv Y, Li L, Xiang J, Liao B, Gao W. Predicting LncRNA-Disease Association by a Random Walk With Restart on Multiplex and Heterogeneous Networks. Front Genet 2021; 12:712170. [PMID: 34490041 PMCID: PMC8417042 DOI: 10.3389/fgene.2021.712170] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2021] [Accepted: 07/23/2021] [Indexed: 02/05/2023] Open
Abstract
Studies have found that long non-coding RNAs (lncRNAs) play important roles in many human biological processes, and it is critical to explore potential lncRNA–disease associations, especially cancer-associated lncRNAs. However, traditional biological experiments are costly and time-consuming, so it is of great significance to develop effective computational models. We developed a random walk algorithm with restart on multiplex and heterogeneous networks of lncRNAs and diseases to predict lncRNA–disease associations (MHRWRLDA). First, multiple disease similarity networks are constructed by using different approaches to calculate similarity scores between diseases, and multiple lncRNA similarity networks are also constructed by using different approaches to calculate similarity scores between lncRNAs. Then, a multiplex and heterogeneous network was constructed by integrating multiple disease similarity networks and multiple lncRNA similarity networks with the lncRNA–disease associations, and a random walk with restart on the multiplex and heterogeneous network was performed to predict lncRNA–disease associations. The results of Leave-One-Out cross-validation (LOOCV) showed that the value of Area under the curve (AUC) was 0.68736, which was improved compared with the classical algorithm in recent years. Finally, we confirmed a few novel predicted lncRNAs associated with specific diseases like colon cancer by literature mining. In summary, MHRWRLDA contributes to predict lncRNA–disease associations.
Collapse
Affiliation(s)
- Yuhua Yao
- School of Mathematics and Statistics, Hainan Normal University, Haikou, China.,Key Laboratory of Data Science and Intelligence Education, Ministry of Education, Hainan Normal University, Haikou, China.,Key Laboratory of Computational Science and Application of Hainan Province, Hainan Normal University, Haikou, China
| | - Binbin Ji
- Geneis Beijing Co., Ltd., Beijing, China
| | - Yaping Lv
- School of Mathematics and Statistics, Hainan Normal University, Haikou, China
| | - Ling Li
- Basic Courses Department, Zhejiang Shuren University, Hangzhou, China
| | - Ju Xiang
- School of Computer Science and Engineering, Central South University, Changsha, China.,Department of Basic Medical Sciences, Changsha Medical University, Changsha, China.,Department of Computer Science, Changsha Medical University, Changsha, China
| | - Bo Liao
- School of Mathematics and Statistics, Hainan Normal University, Haikou, China
| | - Wei Gao
- Departments of Internal Medicine-Oncology, Fujian Cancer Hospital & Fujian Medical University Cancer Hospital, Fuzhou, China
| |
Collapse
|