1
|
Wang Z, Chen Y, Ma H, Gao H, Zhu Y, Wang H, Zhang N. Accurate identification of snoRNA targets using variational graph autoencoder to advance the redevelopment of traditional medicines. Front Pharmacol 2025; 15:1529128. [PMID: 39834830 PMCID: PMC11743687 DOI: 10.3389/fphar.2024.1529128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2024] [Accepted: 12/11/2024] [Indexed: 01/22/2025] Open
Abstract
Existing studies indicate that dysregulation or abnormal expression of small nucleolar RNA (snoRNA) is closely associated with various diseases, including lung cancer. Furthermore, these diseases often involve multiple targets, making the redevelopment of traditional medicines highly promising. Accurate prediction of potential snoRNA therapeutic targets is essential for early disease intervention and the redevelopment of traditional medicines. Additionally, researchers have developed artificial intelligence (AI)-based methods to screen and predict potential snoRNA therapeutic targets, thereby advancing traditional drug redevelopment. However, existing methods face challenges such as imbalanced datasets and the dominance of high-degree nodes in graph neural networks (GNNs), which compromise the accuracy of node representations. To address these challenges, we propose an AI model based on variational graph autoencoders (VGAEs) that integrates decoupling and Kolmogorov-Arnold Network (KAN) technologies. The model reconstructs snoRNA-disease graphs by learning snoRNA and disease representations, accurately identifying potential snoRNA therapeutic targets. By decoupling similarity from node degree, the model mitigates the dominance of high-degree nodes, enhances prediction accuracy in scenarios like lung cancer, and leverages KAN technology to improve adaptability and flexibility to new data. Case studies revealed that snoRNA SNORA21 and SNORD33 are abnormally expressed in lung cancer patients and are strong candidates for potential therapeutic targets. These findings validate the proposed model's effectiveness in identifying therapeutic targets for diseases like lung cancer, supporting early screening and treatment, and advancing the redevelopment of traditional medicines. Data and experimental findings are archived in: https://github.com/shmildsj/data.
Collapse
Affiliation(s)
- Zhina Wang
- Department of Pulmonary and Critical Care Medicine II, Emergency General Hospital, Beijing, China
- Department of Oncology, Emergency General Hospital, Beijing, China
| | - Yangyuan Chen
- School of Data Science and Artificial Intelligence, Wenzhou University of Technology, Wenzhou, China
| | - Hongming Ma
- Department of Pulmonary and Critical Care Medicine II, Emergency General Hospital, Beijing, China
- Department of Oncology, Emergency General Hospital, Beijing, China
| | - Hong Gao
- Department of Pulmonary and Critical Care Medicine II, Emergency General Hospital, Beijing, China
- Department of Oncology, Emergency General Hospital, Beijing, China
| | - Yangbin Zhu
- School of Data Science and Artificial Intelligence, Wenzhou University of Technology, Wenzhou, China
| | - Hongwu Wang
- Respiratory Disease Center, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Nan Zhang
- Department of Pulmonary and Critical Care Medicine II, Emergency General Hospital, Beijing, China
- Department of Oncology, Emergency General Hospital, Beijing, China
| |
Collapse
|
2
|
Thakur A, Kumar M. Computational Resources for lncRNA Functions and Targetome. Methods Mol Biol 2025; 2883:299-323. [PMID: 39702714 DOI: 10.1007/978-1-0716-4290-0_13] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2024]
Abstract
Long non-coding RNAs (lncRNAs) are a type of non-coding RNA molecules exceeding 200 nucleotides in length and that do not encode proteins. The dysregulated expression of lncRNAs has been identified in various diseases, holding therapeutic significance. Over the past decade, numerous computational resources have been published in the field of lncRNA. In this chapter, we have provided a comprehensive review of the databases as well as predictive tools, that is, lncRNA databases, machine learning based algorithms, and tools predicting lncRNAs utilizing different techniques. The chapter will focus on the importance of lncRNA resources developed for different organisms specifically for humans, mouse, plants, and other model organisms. We have enlisted important databases, primarily focusing on comprehensive information related to lncRNA registries, associations with diseases, differential expression, lncRNA transcriptome, target regulations, and all-in-one resources. Further, we have also included the updated version of lncRNA resources. Additionally, computational identification of lncRNAs using algorithms like Deep learning, Support Vector Machine (SVM), and Random Forest (RF) was also discussed. In conclusion, this comprehensive overview concludes by summarizing vital in silico resources, empowering biologists to choose the most suitable tools for their lncRNA research endeavors. This chapter serves as a valuable guide, emphasizing the significance of computational approaches in understanding lncRNAs and their implications in various biological contexts.
Collapse
Affiliation(s)
- Anamika Thakur
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39A, Chandigarh, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India
| | - Manoj Kumar
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39A, Chandigarh, India.
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India.
| |
Collapse
|
3
|
Pooresmaeil F, Azadi S, Hasannejad-Asl B, Takamoli S, Bolhassani A. Pivotal Role of miRNA-lncRNA Interactions in Human Diseases. Mol Biotechnol 2024:10.1007/s12033-024-01343-y. [PMID: 39673006 DOI: 10.1007/s12033-024-01343-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2024] [Accepted: 11/25/2024] [Indexed: 12/15/2024]
Abstract
New technologies have shown that most of the genome comprises transcripts that cannot code for proteins and are referred to as non-coding RNAs (ncRNAs). Some ncRNAs, like long non-coding RNAs (lncRNAs) and microRNAs (miRNAs), are of substantial interest because of their critical function in controlling genes and numerous biological activities. The expression levels and function of miRNAs and lncRNAs are rigorously monitored throughout developmental processes and the maintenance of physiological homeostasis. Due to their critical roles, any dysregulation or changes in their expression can significantly influence the pathogenesis of various human diseases. The interactions between miRNAs and lncRNAs have been found to influence gene expression in various ways. These interactions significantly influence the understanding of disease etiology, cellular processes, and potential therapeutic targets. Different experimental and in silico methods can be used to investigate miRNA-lncRNA interactions. By aiding the elucidation of miRNA-lncRNA interactions and deepening the understanding of post-transcriptional gene regulation, researchers can open a new window for designing hypotheses, conducting experiments, and discovering methods for diagnosing and treating complex human diseases. This review briefly summarizes miRNA and lncRNA functions, discusses their interaction mechanisms, and examines the experimental and computational methods used to study these interactions. Additionally, we highlight significant studies on lncRNA and miRNA interactions in various diseases from 2000 to 2024, using the academic research databases such as PubMed, Google Scholar, ScienceDirect, and Scopus.
Collapse
Affiliation(s)
- Farkhondeh Pooresmaeil
- Department of Medical Biotechnology, School of Allied Medicine, Iran University of Medical Science, Tehran, Iran
- Department of Hepatitis & AIDS, Pasteur Institute of Iran, Tehran, Iran
| | - Sareh Azadi
- Department of Medical Biotechnology, School of Allied Medicine, Iran University of Medical Science, Tehran, Iran
| | - Behnam Hasannejad-Asl
- Department of Biotechnology, School of Advanced Technologies in Medicine, Shahid Beheshti, University of Medical Sciences, Tehran, Iran
| | - Shahla Takamoli
- Department of Biology, Faculty of Science, University of Guilan, Rasht, Iran
| | - Azam Bolhassani
- Department of Hepatitis & AIDS, Pasteur Institute of Iran, Tehran, Iran.
| |
Collapse
|
4
|
Zheng H, Xu L, Xie H, Xie J, Ma Y, Hu Y, Wu L, Chen J, Wang M, Yi Y, Huang Y, Wang D. RIscoper 2.0: A deep learning tool to extract RNA biomedical relation sentences from literature. Comput Struct Biotechnol J 2024; 23:1469-1476. [PMID: 38623560 PMCID: PMC11016866 DOI: 10.1016/j.csbj.2024.03.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Revised: 03/15/2024] [Accepted: 03/21/2024] [Indexed: 04/17/2024] Open
Abstract
RNA plays an extensive role in a multi-dimensional regulatory system, and its biomedical relationships are scattered across numerous biological studies. However, text mining works dedicated to the extraction of RNA biomedical relations remain limited. In this study, we established a comprehensive and reliable corpus of RNA biomedical relations, recruiting over 30,000 sentences manually curated from more than 15,000 biomedical literature. We also updated RIscoper 2.0, a BERT-based deep learning tool to extract RNA biomedical relation sentences from literature. Benefiting from approximately 100,000 annotated named entities, we integrated the text classification and named entity recognition tasks in this tool. Additionally, RIscoper 2.0 outperformed the original tool in both tasks and can discover new RNA biomedical relations. Additionally, we provided a user-friendly online search tool that enables rapid scanning of RNA biomedical relationships using local and online resources. Both the online tools and data resources of RIscoper 2.0 are available at http://www.rnainter.org/riscoper.
Collapse
Affiliation(s)
- Hailong Zheng
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, 510515 Guangzhou, China
| | - Linfu Xu
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, 510515 Guangzhou, China
| | - Hailong Xie
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, 510515 Guangzhou, China
| | - Jiajing Xie
- National Institute for Data Science in Health and Medicine, Xiamen University, 361102 Xiamen, China
| | - Yapeng Ma
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, 510515 Guangzhou, China
| | - Yongfei Hu
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, 510515 Guangzhou, China
| | - Le Wu
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, 510515 Guangzhou, China
| | - Jia Chen
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, 510515 Guangzhou, China
| | - Meiyi Wang
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, 510515 Guangzhou, China
| | - Ying Yi
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, 510515 Guangzhou, China
| | - Yan Huang
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, 510515 Guangzhou, China
| | - Dong Wang
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, 510515 Guangzhou, China
- Guangdong Province Key Laboratory of Molecular Tumor Pathology, 510515, Guangzhou, China
| |
Collapse
|
5
|
Hu W, Hu X, Zhu Y, Li M, Meng H, Zhao H. Oncogenic role of SKA2 and its ceRNA network in hepatocellular carcinoma based on a comprehensive analysis. Transl Cancer Res 2024; 13:5190-5201. [PMID: 39525023 PMCID: PMC11543042 DOI: 10.21037/tcr-24-833] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2024] [Accepted: 09/06/2024] [Indexed: 11/16/2024]
Abstract
Background Genetic alterations have important roles in cancer development and progression. SKA2 (spindle and kinetochore associated complex subunit 2) is a mitotic component that plays a critical role in maintaining the silence of the metaphase plate and spindle checkpoint. However, the exact role of SKA2 in hepatocellular carcinoma (HCC) remains unclear. The current study aimed to comprehensively identify the function of SKA2 in HCC. Methods We utilized various databases and bioinformatics tools, such as The Cancer Genome Atlas (TCGA), survminer package, Tumor Immune Estimation Resource (TIMER), cBioPortal website, clusterProfiler package, gene set enrichment analysis (GSEA), miRWalk, TargetScanHuman8.0, miRDB, DIANA and Cytoscape to identify the role of SKA2 in HCC. Results Our results showed that patients with HCC exhibited a high SKA2 expression. Further, the SKA2 high expression group had a worse overall survival (OS). And SKA2 was associated with tumor stage and the immune system. In addition, 188 co-expression genes of SKA2 participated in some processes including cell cycle, DNA replication and so on. The tumor had a lower hsa-miR-19b-1-5p and hsa-miR-378a-5p expression, and these two microRNAs (miRNAs) were also correlated with OS. SNHG14, SNHG15, and SPCA6P-AS were significantly negatively correlated with hsa-378a-5p, and these three long non-coding RNAs (lncRNAs) showed a positive correlation with SKA2 (P<0.05). SKA2 is a member of competing endogenous RNA (ceRNA). Moreover, it is related to SPACA6P-AS/hsa-miR-378a-5p/SKA2, SNHG14/hsa-miR-378a-5p/SKA2, and SNHG15/hsa-miR-378a-5p/SKA2, which play significant roles in tumor progression. Conclusions SKA2 is associated with OS, tumor stage, and immune infiltrating cells in HCC. Thus, we propose that SKA2 functions as a ceRNA and influences tumorigenesis. These findings lay the foundation for future research in the field of HCC.
Collapse
Affiliation(s)
- Wanxue Hu
- Department of Clinical Laboratory, Hua County People’s Hospital, Anyang, China
| | - Xiaoyi Hu
- Department of Nursing, Henan Health Cadre College, Zhengzhou, China
| | - Yongchao Zhu
- Department of Clinical Laboratory, Hua County People’s Hospital, Anyang, China
| | - Min Li
- Department of Clinical Laboratory, Hua County People’s Hospital, Anyang, China
| | - Hongyu Meng
- Department of Clinical Laboratory, Hua County People’s Hospital, Anyang, China
| | - Hongbo Zhao
- Department of Laboratory Animal Science, Kunming Medical University, Kunming, China
| |
Collapse
|
6
|
Sun J, Chen Y, Bi R, Yuan Y, Yu H. Bioinformatic approaches of liquid-liquid phase separation in human disease. Chin Med J (Engl) 2024; 137:1912-1925. [PMID: 39033393 PMCID: PMC11332758 DOI: 10.1097/cm9.0000000000003249] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2024] [Indexed: 07/23/2024] Open
Abstract
ABSTRACT Biomolecular aggregation within cellular environments via liquid-liquid phase separation (LLPS) spontaneously forms droplet-like structures, which play pivotal roles in diverse biological processes. These structures are closely associated with a range of diseases, including neurodegenerative disorders, cancer and infectious diseases, highlighting the significance of understanding LLPS mechanisms for elucidating disease pathogenesis, and exploring potential therapeutic interventions. In this review, we delineate recent advancements in LLPS research, emphasizing its pathological relevance, therapeutic considerations, and the pivotal role of bioinformatic tools and databases in facilitating LLPS investigations. Additionally, we undertook a comprehensive analysis of bioinformatic resources dedicated to LLPS research in order to elucidate their functionality and applicability. By providing comprehensive insights into current LLPS-related bioinformatics resources, this review highlights its implications for human health and disease.
Collapse
Affiliation(s)
- Jun Sun
- Department of Thoracic Surgery and West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, Sichuan 610041, China
- Med-X Center for Informatics, Sichuan University, Chengdu, Sichuan 610041, China
| | - Yilong Chen
- Department of Thoracic Surgery and West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, Sichuan 610041, China
- Med-X Center for Informatics, Sichuan University, Chengdu, Sichuan 610041, China
| | - Ruiye Bi
- Department of Orthognathic and TMJ Surgery, State Key Laboratory of Oral Diseases, National Clinical Research Center for Oral Diseases, West China Hospital of Stomatology, Sichuan University, Chengdu, Sichuan 610041, China
| | - Yong Yuan
- Department of Thoracic Surgery and West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, Sichuan 610041, China
- Med-X Center for Informatics, Sichuan University, Chengdu, Sichuan 610041, China
| | - Haopeng Yu
- Department of Thoracic Surgery and West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, Sichuan 610041, China
- Med-X Center for Informatics, Sichuan University, Chengdu, Sichuan 610041, China
| |
Collapse
|
7
|
Adjeroh DA, Zhou X, Paschoal AR, Dimitrova N, Derevyanchuk EG, Shkurat TP, Loeb JA, Martinez I, Lipovich L. Challenges in LncRNA Biology: Views and Opinions. Noncoding RNA 2024; 10:43. [PMID: 39195572 DOI: 10.3390/ncrna10040043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2024] [Revised: 06/26/2024] [Accepted: 07/04/2024] [Indexed: 08/29/2024] Open
Abstract
This is a mini-review capturing the views and opinions of selected participants at the 2021 IEEE BIBM 3rd Annual LncRNA Workshop, held in Dubai, UAE. The views and opinions are expressed on five broad themes related to problems in lncRNA, namely, challenges in the computational analysis of lncRNAs, lncRNAs and cancer, lncRNAs in sports, lncRNAs and COVID-19, and lncRNAs in human brain activity.
Collapse
Affiliation(s)
- Donald A Adjeroh
- Lane Department of Computer Science and Electrical Engineering, West Virginia University (WVU), Morgantown, WV 26506, USA
| | - Xiaobo Zhou
- Department of Bioinformatics and Systems Medicine, University of Texas Health Science Center, Houston, TX 77030, USA
| | - Alexandre Rossi Paschoal
- Department of Computer Science, Bioinformatics and Pattern Recognition Group, Federal University of Technology-Paraná-UTFPR, Curitiba 86300-000, Brazil
- Rosalind Franklin Institute, Harwell Science and Innovation Campus, Didcot OX11 0FA, UK
| | - Nadya Dimitrova
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, CT 06520, USA
| | | | - Tatiana P Shkurat
- Department of Genetics, Southern Federal University, Rostov-on-Don 344090, Russia
| | - Jeffrey A Loeb
- Department of Neurology and Rehabilitation, The Center for Clinical and Translational Science, The University of Illinois NeuroRepository, University of Illinois, Chicago, IL 60607, USA
| | - Ivan Martinez
- Department of Microbiology, Immunology & Cell Biology, WVU Cancer Institute, West Virginia University (WVU) School of Medicine, Morgantown, WV 26505, USA
| | - Leonard Lipovich
- Shenzhen Huayuan Biological Science Research Institute, Shenzhen Huayuan Biotechnology Co., Ltd., Shenzhen 518000, China
- Center for Molecular Medicine and Genetics, School of Medicine, Wayne State University, Detroit, MI 48201, USA
- College of Science, Mathematics and Technology, Wenzhou-Kean University, Wenzhou 325060, China
| |
Collapse
|
8
|
Zhang Y, Yang Y, Ren L, Ning L, Zou Q, Luo N, Zhang Y, Liu R. RDscan: Extracting RNA-disease relationship from the literature based on pre-training model. Methods 2024; 228:48-54. [PMID: 38789016 DOI: 10.1016/j.ymeth.2024.05.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2024] [Revised: 05/02/2024] [Accepted: 05/16/2024] [Indexed: 05/26/2024] Open
Abstract
With the rapid advancements in molecular biology and genomics, a multitude of connections between RNA and diseases has been unveiled, making the efficient and accurate extraction of RNA-disease (RD) relationships from extensive biomedical literature crucial for advancing research in this field. This study introduces RDscan, a novel text mining method developed based on the pre-training and fine-tuning strategy, aimed at automatically extracting RD-related information from a vast corpus of literature using pre-trained biomedical large language models (LLM). Initially, we constructed a dedicated RD corpus by manually curating from literature, comprising 2,082 positive and 2,000 negative sentences, alongside an independent test dataset (comprising 500 positive and 500 negative sentences) for training and evaluating RDscan. Subsequently, by fine-tuning the Bioformer and BioBERT pre-trained models, RDscan demonstrated exceptional performance in text classification and named entity recognition (NER) tasks. In 5-fold cross-validation, RDscan significantly outperformed traditional machine learning methods (Support Vector Machine, Logistic Regression and Random Forest). In addition, we have developed an accessible webserver that assists users in extracting RD relationships from text. In summary, RDscan represents the first text mining tool specifically designed for RD relationship extraction, and is poised to emerge as an invaluable tool for researchers dedicated to exploring the intricate interactions between RNA and diseases. Webserver of RDscan is free available at https://cellknowledge.com.cn/RDscan/.
Collapse
Affiliation(s)
- Yang Zhang
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu 611731, China; School of Healthcare Technology, Chengdu Neusoft University, Chengdu 611844, China.
| | - Yu Yang
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu 611844, China
| | - Liping Ren
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu 611844, China
| | - Lin Ning
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu 611844, China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu 611731, China
| | - Nanchao Luo
- School of Computer Science and Technology, Aba Teachers College, WenChuan, Sichuan, 623002, China
| | - Yinghui Zhang
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu 611844, China.
| | - Ruijun Liu
- School of Software, Beihang University, Beijing 100191, China.
| |
Collapse
|
9
|
Diao B, Luo J, Guo Y. A comprehensive survey on deep learning-based identification and predicting the interaction mechanism of long non-coding RNAs. Brief Funct Genomics 2024; 23:314-324. [PMID: 38576205 DOI: 10.1093/bfgp/elae010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Revised: 02/25/2024] [Accepted: 03/14/2024] [Indexed: 04/06/2024] Open
Abstract
Long noncoding RNAs (lncRNAs) have been discovered to be extensively involved in eukaryotic epigenetic, transcriptional, and post-transcriptional regulatory processes with the advancements in sequencing technology and genomics research. Therefore, they play crucial roles in the body's normal physiology and various disease outcomes. Presently, numerous unknown lncRNA sequencing data require exploration. Establishing deep learning-based prediction models for lncRNAs provides valuable insights for researchers, substantially reducing time and costs associated with trial and error and facilitating the disease-relevant lncRNA identification for prognosis analysis and targeted drug development as the era of artificial intelligence progresses. However, most lncRNA-related researchers lack awareness of the latest advancements in deep learning models and model selection and application in functional research on lncRNAs. Thus, we elucidate the concept of deep learning models, explore several prevalent deep learning algorithms and their data preferences, conduct a comprehensive review of recent literature studies with exemplary predictive performance over the past 5 years in conjunction with diverse prediction functions, critically analyze and discuss the merits and limitations of current deep learning models and solutions, while also proposing prospects based on cutting-edge advancements in lncRNA research.
Collapse
Affiliation(s)
- Biyu Diao
- Department of Breast Surgery, The First Affiliated Hospital of Ningbo University, No. 59, Liuting Street, Haishu District, Ningbo 315000, China
| | - Jin Luo
- Department of Breast Surgery, The First Affiliated Hospital of Ningbo University, No. 59, Liuting Street, Haishu District, Ningbo 315000, China
| | - Yu Guo
- Department of Breast Surgery, The First Affiliated Hospital of Ningbo University, No. 59, Liuting Street, Haishu District, Ningbo 315000, China
| |
Collapse
|
10
|
Chaudhary U, Banerjee S. Decoding the Non-coding: Tools and Databases Unveiling the Hidden World of "Junk" RNAs for Innovative Therapeutic Exploration. ACS Pharmacol Transl Sci 2024; 7:1901-1915. [PMID: 39022352 PMCID: PMC11249652 DOI: 10.1021/acsptsci.3c00388] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2023] [Revised: 05/15/2024] [Accepted: 05/27/2024] [Indexed: 07/20/2024]
Abstract
Non-coding RNAs are pivotal regulators of gene and protein expression, exerting crucial influences on diverse biological processes. Their dysregulation is frequently implicated in the onset and progression of diseases, notably cancer. A profound comprehension of the intricate mechanisms governing ncRNAs is imperative for devising innovative therapeutic interventions against these debilitating conditions. Significantly, nearly 80% of our genome comprises ncRNAs, underscoring their centrality in cellular processes. The elucidation of ncRNA functions is pivotal for grasping the complexities of gene regulation and its implications for human health. Modern genome sequencing techniques yield vast datasets, stored in specialized databases. To harness this wealth of information and to understand the crosstalk of non-coding RNAs, knowledge of available databases is required, and many new sophisticated computational tools have emerged. These tools play a pivotal role in the identification, prediction, and annotation of ncRNAs, thereby facilitating their experimental validation. This Review succinctly outlines the current understanding of ncRNAs, emphasizing their involvement in disease development. It also highlights the databases and tools instrumental in classifying, annotating, and evaluating ncRNAs. By extracting meaningful biological insights from seemingly "junk" data, these tools empower scientists to unravel the intricate roles of ncRNAs in shaping human health.
Collapse
Affiliation(s)
- Uma Chaudhary
- Department of Biotechnology,
School of Biosciences and Technology, Vellore
Institute of Technology (VIT), Vellore, Tamil Nadu 632014, India
| | - Satarupa Banerjee
- Department of Biotechnology,
School of Biosciences and Technology, Vellore
Institute of Technology (VIT), Vellore, Tamil Nadu 632014, India
| |
Collapse
|
11
|
Zhang Y, Yang Y, Ren L, Zhan M, Sun T, Zou Q, Zhang Y. Predicting intercellular communication based on metabolite-related ligand-receptor interactions with MRCLinkdb. BMC Biol 2024; 22:152. [PMID: 38978014 PMCID: PMC11232326 DOI: 10.1186/s12915-024-01950-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2024] [Accepted: 07/03/2024] [Indexed: 07/10/2024] Open
Abstract
BACKGROUND Metabolite-associated cell communications play critical roles in maintaining human biological function. However, most existing tools and resources focus only on ligand-receptor interaction pairs where both partners are proteinaceous, neglecting other non-protein molecules. To address this gap, we introduce the MRCLinkdb database and algorithm, which aggregates and organizes data related to non-protein L-R interactions in cell-cell communication, providing a valuable resource for predicting intercellular communication based on metabolite-related ligand-receptor interactions. RESULTS Here, we manually curated the metabolite-ligand-receptor (ML-R) interactions from the literature and known databases, ultimately collecting over 790 human and 670 mouse ML-R interactions. Additionally, we compiled information on over 1900 enzymes and 260 transporter entries associated with these metabolites. We developed Metabolite-Receptor based Cell Link Database (MRCLinkdb) to store these ML-R interactions data. Meanwhile, the platform also offers extensive information for presenting ML-R interactions, including fundamental metabolite information and the overall expression landscape of metabolite-associated gene sets (such as receptor, enzymes, and transporter proteins) based on single-cell transcriptomics sequencing (covering 35 human and 26 mouse tissues, 52 human and 44 mouse cell types) and bulk RNA-seq/microarray data (encompassing 62 human and 39 mouse tissues). Furthermore, MRCLinkdb introduces a web server dedicated to the analysis of intercellular communication based on ML-R interactions. MRCLinkdb is freely available at https://www.cellknowledge.com.cn/mrclinkdb/ . CONCLUSIONS In addition to supplementing ligand-receptor databases, MRCLinkdb may provide new perspectives for decoding the intercellular communication and advancing related prediction tools based on ML-R interactions.
Collapse
Affiliation(s)
- Yuncong Zhang
- Guangdong Provincial Key Laboratory of Tumor Interventional Diagnosis and Treatment, Zhuhai Institute of Translational Medicine, Zhuhai People's Hospital (Zhuhai Clinical Medical College of Jinan University), Jinan University, Zhuhai, Guangdong, China
| | - Yu Yang
- Innovative Institute of Chinese Medicine and Pharmacy, Academy for Interdiscipline, Chengdu University of Traditional Chinese Medicine, Chengdu, China
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu, China
| | - Liping Ren
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu, China
| | - Meixiao Zhan
- Guangdong Provincial Key Laboratory of Tumor Interventional Diagnosis and Treatment, Zhuhai Institute of Translational Medicine, Zhuhai People's Hospital (Zhuhai Clinical Medical College of Jinan University), Jinan University, Zhuhai, Guangdong, China
| | - Taoping Sun
- Guangdong Provincial Key Laboratory of Tumor Interventional Diagnosis and Treatment, Zhuhai Institute of Translational Medicine, Zhuhai People's Hospital (Zhuhai Clinical Medical College of Jinan University), Jinan University, Zhuhai, Guangdong, China.
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China.
| | - Yang Zhang
- Innovative Institute of Chinese Medicine and Pharmacy, Academy for Interdiscipline, Chengdu University of Traditional Chinese Medicine, Chengdu, China.
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China.
| |
Collapse
|
12
|
Liu X, Yao X, Chen L. Expanding roles of circRNAs in cardiovascular diseases. Noncoding RNA Res 2024; 9:429-436. [PMID: 38511061 PMCID: PMC10950605 DOI: 10.1016/j.ncrna.2024.02.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2023] [Revised: 02/01/2024] [Accepted: 02/04/2024] [Indexed: 03/22/2024] Open
Abstract
CircRNAs are a class of single-stranded RNAs characterized by covalently looped structures. Emerging advances have promoted our understanding of circRNA biogenesis, nuclear export, biological functions, and functional mechanisms. Roles of circRNAs in diverse diseases have been increasingly recognized in the past decade, with novel approaches in bioinformatics analysis and new strategies in modulating circRNA levels, which have made circRNAs the hot spot for therapeutic applications. Moreover, due to the intrinsic features of circRNAs such as high stability, conservation, and tissue-/stage-specific expression, circRNAs are believed to be promising prognostic and diagnostic markers for diseases. Aiming cardiovascular disease (CVD), one of the leading causes of mortality worldwide, we briefly summarize the current understanding of circRNAs, provide the recent progress in circRNA functions and functional mechanisms in CVD, and discuss the future perspectives both in circRNA research and therapeutics based on existing knowledge.
Collapse
Affiliation(s)
- Xu Liu
- Department of Cardiology, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, 230001, China
| | - Xuelin Yao
- Department of Endocrinology, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, 230001, China
- Department of Endocrinology, The First Affiliated Hospital of Anhui Medical University, Hefei, 230022, China
| | - Liang Chen
- Department of Cardiology, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, 230001, China
| |
Collapse
|
13
|
Mahajan A, Gunewardena S, Morris A, Clauss M, Dhillon NK. Analysis of MicroRNA Cargo in Circulating Extracellular Vesicles from HIV-Infected Individuals with Pulmonary Hypertension. Cells 2024; 13:886. [PMID: 38891019 PMCID: PMC11172129 DOI: 10.3390/cells13110886] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Revised: 04/26/2024] [Accepted: 05/05/2024] [Indexed: 06/20/2024] Open
Abstract
The risk of developing pulmonary hypertension (PH) in people living with HIV is at least 300-fold higher than in the general population, and illicit drug use further potentiates the development of HIV-associated PH. The relevance of extracellular vesicles (EVs) containing both coding as well as non-coding RNAs in PH secondary to HIV infection and drug abuse is yet to be explored. We here compared the miRNA cargo of plasma-derived EVs from HIV-infected stimulant users with (HIV + Stimulants + PH) and without PH (HIV + Stimulants) using small RNA sequencing. The data were compared with 12 PH datasets available in the GEO database to identify potential candidate gene targets for differentially altered miRNAs using the following functional analysis tools: ingenuity pathway analysis (IPA), over-representation analysis (ORA), and gene set enrichment analysis (GSEA). MiRNAs involved in promoting cell proliferation and inhibition of intrinsic apoptotic signaling pathways were among the top upregulated miRNAs identified in EVs from the HIV + Stimulants + PH group compared to the HIV + Stimulants group. Alternatively, the downregulated miRNAs in the HIV + Stimulants + PH group suggested an association with the negative regulation of smooth muscle cell proliferation, IL-2 mediated signaling, and transmembrane receptor protein tyrosine kinase signaling pathways. The validation of significantly differentially expressed miRNAs in an independent set of HIV-infected (cocaine users and nondrug users) with and without PH confirmed the upregulation of miR-32-5p, 92-b-3p, and 301a-3p positively regulating cellular proliferation and downregulation of miR-5571, -4670 negatively regulating smooth muscle proliferation in EVs from HIV-PH patients. This increase in miR-301a-3p and decrease in miR-4670 were negatively correlated with the CD4 count and FEV1/FVC ratio, and positively correlated with viral load. Collectively, this data suggest the association of alterations in the miRNA cargo of circulating EVs with HIV-PH.
Collapse
Affiliation(s)
- Aatish Mahajan
- Division of Pulmonary and Critical Care Medicine, Department of Medicine, University of Kansas Medical Center, Mail Stop 3007, 3901 Rainbow Blvd, Kansas City, KS 66160, USA
| | - Sumedha Gunewardena
- Department of Molecular & Integrative Physiology, University of Kansas Medical Center, Kansas City, KS 66160, USA
| | - Alison Morris
- Department of Medicine, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA;
| | - Matthias Clauss
- Pulmonary, Critical Care, Sleep and Occupational Medicine, Indiana University School of Medicine, Indianapolis, IN 46202, USA
| | - Navneet K. Dhillon
- Division of Pulmonary and Critical Care Medicine, Department of Medicine, University of Kansas Medical Center, Mail Stop 3007, 3901 Rainbow Blvd, Kansas City, KS 66160, USA
| |
Collapse
|
14
|
Wei H, Gao L, Wu S, Jiang Y, Liu B. DiSMVC: a multi-view graph collaborative learning framework for measuring disease similarity. Bioinformatics 2024; 40:btae306. [PMID: 38715444 PMCID: PMC11256965 DOI: 10.1093/bioinformatics/btae306] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2024] [Revised: 04/19/2024] [Accepted: 05/05/2024] [Indexed: 05/30/2024] Open
Abstract
MOTIVATION Exploring potential associations between diseases can help in understanding pathological mechanisms of diseases and facilitating the discovery of candidate biomarkers and drug targets, thereby promoting disease diagnosis and treatment. Some computational methods have been proposed for measuring disease similarity. However, these methods describe diseases without considering their latent multi-molecule regulation and valuable supervision signal, resulting in limited biological interpretability and efficiency to capture association patterns. RESULTS In this study, we propose a new computational method named DiSMVC. Different from existing predictors, DiSMVC designs a supervised graph collaborative framework to measure disease similarity. Multiple bio-entity associations related to genes and miRNAs are integrated via cross-view graph contrastive learning to extract informative disease representation, and then association pattern joint learning is implemented to compute disease similarity by incorporating phenotype-annotated disease associations. The experimental results show that DiSMVC can draw discriminative characteristics for disease pairs, and outperform other state-of-the-art methods. As a result, DiSMVC is a promising method for predicting disease associations with molecular interpretability. AVAILABILITY AND IMPLEMENTATION Datasets and source codes are available at https://github.com/Biohang/DiSMVC.
Collapse
Affiliation(s)
- Hang Wei
- School of Computer Science and Technology, Xidian University, Xi’an, Shaanxi 710126, China
| | - Lin Gao
- School of Computer Science and Technology, Xidian University, Xi’an, Shaanxi 710126, China
| | - Shuai Wu
- School of Computer Science and Technology, Xidian University, Xi’an, Shaanxi 710126, China
| | - Yina Jiang
- Department of Basic Medicine, Shaanxi University of Chinese Medicine, Xianyang, Shaanxi 712046, China
| | - Bin Liu
- Faculty of Engineering, Shenzhen MSU-BIT University, Shenzhen, Guangdong 518172, China
- School of Computer Science and Technology, Beijing Institute of Technology, Beijing, 100081, China
| |
Collapse
|
15
|
Ma Y, Shi Y, Chen X, Zhang B, Wu H, Gao J. NFMCLDA: Predicting miRNA-based lncRNA-disease associations by network fusion and matrix completion. Comput Biol Med 2024; 174:108403. [PMID: 38582002 DOI: 10.1016/j.compbiomed.2024.108403] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Revised: 03/28/2024] [Accepted: 04/01/2024] [Indexed: 04/08/2024]
Abstract
In recent years, emerging evidence has revealed a strong association between dysregulations of long non-coding RNAs (lncRNAs) and sophisticated human diseases. Biological experiments are adequate to identify such associations, but they are costly and time-consuming. Therefore, developing high-quality computational methods is a challenging and urgent task in the field of bioinformatics. This paper proposes a new lncRNA-disease association inference approach NFMCLDA (Network Fusion and Matrix Completion lncRNA-Disease Association), which can effectively integrate multi-source association data. In this approach, miRNA information is used as the transition path, and an unbalanced random walk method on three-layer heterogeneous network is adopted in the preprocessing. Therefore, more effective information between networks can be mined and the sparsity problem of the association matrix can be solved. Finally, the matrix completion method accurately predicts associations. The results show that NFMCLDA can provide more accurate lncRNA-disease associations than state-of-the-art methods. The areas under the receiver operating characteristic curves are 0.9648 and 0.9713, respectively, through the cross-validation of 5-fold and 10-fold. Data from published case studies on four diseases - lung cancer, osteosarcoma, cervical cancer, and colon cancer - have confirmed the reliable predictive potential of NFMCLDA model.
Collapse
Affiliation(s)
- Yibing Ma
- School of Science, Jiangnan University, Wuxi, Jiangsu, 214122, China
| | - Yongle Shi
- School of Science, Jiangnan University, Wuxi, Jiangsu, 214122, China
| | - Xiang Chen
- School of Science, Jiangnan University, Wuxi, Jiangsu, 214122, China
| | - Bai Zhang
- School of Science, Jiangnan University, Wuxi, Jiangsu, 214122, China
| | - Hanwen Wu
- School of Science, Jiangnan University, Wuxi, Jiangsu, 214122, China
| | - Jie Gao
- School of Science, Jiangnan University, Wuxi, Jiangsu, 214122, China.
| |
Collapse
|
16
|
Abedpoor N, Taghian F, Jalali Dehkordi K, Safavi K. Sparassis latifolia and exercise training as complementary medicine mitigated the 5-fluorouracil potent side effects in mice with colorectal cancer: bioinformatics approaches, novel monitoring pathological metrics, screening signatures, and innovative management tactic. Cancer Cell Int 2024; 24:141. [PMID: 38637796 PMCID: PMC11027426 DOI: 10.1186/s12935-024-03328-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2023] [Accepted: 04/12/2024] [Indexed: 04/20/2024] Open
Abstract
BACKGROUND Prompt identification and assessment of the disease are essential for reducing the death rate associated with colorectal cancer (COL). Identifying specific causal or sensitive components, such as coding RNA (cRNA) and non-coding RNAs (ncRNAs), may greatly aid in the early detection of colorectal cancer. METHODS For this purpose, we gave natural chemicals obtained from Sparassis latifolia (SLPs) either alone or in conjunction with chemotherapy (5-Fluorouracil to a mouse colorectal tumor model induced by AOM-DSS. The transcription profile of non-coding RNAs (ncRNAs) and their target hub genes was evaluated using qPCR Real-Time, and ELISA techniques. RESULTS MSX2, MMP7, ITIH4, and COL1A2 were identified as factors in inflammation and oxidative stress, leading to the development of COL. The hub genes listed, upstream regulatory factors such as lncRNA PVT1, NEAT1, KCNQ1OT1, SNHG16, and miR-132-3p have been discovered as biomarkers for prognosis and diagnosis of COL. The SLPs and exercise, effectively decreased the size and quantity of tumors. CONCLUSIONS This effect may be attributed to the modulation of gene expression levels, including MSX2, MMP7, ITIH4, COL1A2, PVT1, NEAT1, KCNQ1OT1, SNHG16, and miR-132-3p. Ultimately, SLPs and exercise have the capacity to be regarded as complementing and enhancing chemotherapy treatments, owing to their efficacious components.
Collapse
Affiliation(s)
- Navid Abedpoor
- Department of Sports Physiology, Faculty of Sports Sciences, School of Sports Sciences, Isfahan (Khorasgan) Branch, Islamic Azad University, Isfahan, Iran
| | - Farzaneh Taghian
- Department of Sports Physiology, Faculty of Sports Sciences, School of Sports Sciences, Isfahan (Khorasgan) Branch, Islamic Azad University, Isfahan, Iran.
| | - Khosro Jalali Dehkordi
- Department of Sports Physiology, Faculty of Sports Sciences, School of Sports Sciences, Isfahan (Khorasgan) Branch, Islamic Azad University, Isfahan, Iran
| | - Kamran Safavi
- Department of Plant Biotechnology, Medicinal Plants Research Centre, Isfahan (Khorasgan) Branch, Islamic Azad University, Isfahan, Iran
| |
Collapse
|
17
|
He J, Li M, Qiu J, Pu X, Guo Y. HOPEXGB: A Consensual Model for Predicting miRNA/lncRNA-Disease Associations Using a Heterogeneous Disease-miRNA-lncRNA Information Network. J Chem Inf Model 2024; 64:2863-2877. [PMID: 37604142 DOI: 10.1021/acs.jcim.3c00856] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/23/2023]
Abstract
Predicting disease-related microRNAs (miRNAs) and long noncoding RNAs (lncRNAs) is crucial to find new biomarkers for the prevention, diagnosis, and treatment of complex human diseases. Computational predictions for miRNA/lncRNA-disease associations are of great practical significance, since traditional experimental detection is expensive and time-consuming. In this paper, we proposed a consensual machine-learning technique-based prediction approach to identify disease-related miRNAs and lncRNAs by high-order proximity preserved embedding (HOPE) and eXtreme Gradient Boosting (XGB), named HOPEXGB. By connecting lncRNA, miRNA, and disease nodes based on their correlations and relationships, we first created a heterogeneous disease-miRNA-lncRNA (DML) information network to achieve an effective fusion of information on similarities, correlations, and interactions among miRNAs, lncRNAs, and diseases. In addition, a more rational negative data set was generated based on the similarities of unknown associations with the known ones, so as to effectively reduce the false negative rate in the data set for model construction. By 10-fold cross-validation, HOPE shows better performance than other graph embedding methods. The final consensual HOPEXGB model yields robust performance with a mean prediction accuracy of 0.9569 and also demonstrates high sensitivity and specificity advantages compared to lncRNA/miRNA-specific predictions. Moreover, it is superior to other existing methods and gives promising performance on the external testing data, indicating that integrating the information on lncRNA-miRNA interactions and the similarities of lncRNAs/miRNAs is beneficial for improving the prediction performance of the model. Finally, case studies on lung, stomach, and breast cancers indicate that HOPEXGB could be a powerful tool for preclinical biomarker detection and bioexperiment preliminary screening for the diagnosis and prognosis of cancers. HOPEXGB is publicly available at https://github.com/airpamper/HOPEXGB.
Collapse
Affiliation(s)
- Jian He
- College of Chemistry, Sichuan University, Chengdu 610064, China
| | - Menglong Li
- College of Chemistry, Sichuan University, Chengdu 610064, China
| | - Jiangguo Qiu
- College of Chemistry, Sichuan University, Chengdu 610064, China
| | - Xuemei Pu
- College of Chemistry, Sichuan University, Chengdu 610064, China
| | - Yanzhi Guo
- College of Chemistry, Sichuan University, Chengdu 610064, China
| |
Collapse
|
18
|
Zhong B, Dai Y, Chen L, Xu X, Lan Y, Deng L, Ren L, Luo N, Ning L. ncRS: A resource of non-coding RNAs in sepsis. Comput Biol Med 2024; 172:108256. [PMID: 38489989 DOI: 10.1016/j.compbiomed.2024.108256] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2023] [Revised: 02/10/2024] [Accepted: 03/06/2024] [Indexed: 03/17/2024]
Abstract
Sepsis, a life-threatening condition triggered by the body's response to infection, presents a significant global healthcare challenge characterized by disarrayed host responses, widespread inflammation, organ impairment, and heightened mortality rates. This study introduces the ncRS database (http://www.ncrdb.cn), a meticulously curated repository housing 1144 experimentally validated non-coding RNAs (ncRNAs) intricately linked with sepsis. ncRS offers comprehensive RNA data, exhaustive experimental insights, and integrated annotations from diverse databases. This resource empowers researchers and clinicians to decipher ncRNAs' roles in sepsis pathogenesis, potentially identifying vital biomarkers for early diagnosis and prognosis, thus facilitating personalized treatments.
Collapse
Affiliation(s)
- Baocai Zhong
- School of Computer and Software, Chengdu Neusoft University, Chengdu, China
| | - Yongfang Dai
- School of Computer and Software, Chengdu Neusoft University, Chengdu, China
| | - Li Chen
- School of Computer and Software, Chengdu Neusoft University, Chengdu, China.
| | - Xinying Xu
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu, China
| | - Yuxi Lan
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu, China
| | - Leyao Deng
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu, China
| | - Liping Ren
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu, China
| | - Nanchao Luo
- School of Computer Science and Technology, A Ba Teachers University, Wenchuan, China.
| | - Lin Ning
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu, China; Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, Zhejiang, China.
| |
Collapse
|
19
|
Morishita EC, Nakamura S. Recent applications of artificial intelligence in RNA-targeted small molecule drug discovery. Expert Opin Drug Discov 2024; 19:415-431. [PMID: 38321848 DOI: 10.1080/17460441.2024.2313455] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Accepted: 01/30/2024] [Indexed: 02/08/2024]
Abstract
INTRODUCTION Targeting RNAs with small molecules offers an alternative to the conventional protein-targeted drug discovery and can potentially address unmet and emerging medical needs. The recent rise of interest in the strategy has already resulted in large amounts of data on disease associated RNAs, as well as on small molecules that bind to such RNAs. Artificial intelligence (AI) approaches, including machine learning and deep learning, present an opportunity to speed up the discovery of RNA-targeted small molecules by improving decision-making efficiency and quality. AREAS COVERED The topics described in this review include the recent applications of AI in the identification of RNA targets, RNA structure determination, screening of chemical compound libraries, and hit-to-lead optimization. The impact and limitations of the recent AI applications are discussed, along with an outlook on the possible applications of next-generation AI tools for the discovery of novel RNA-targeted small molecule drugs. EXPERT OPINION Key areas for improvement include developing AI tools for understanding RNA dynamics and RNA - small molecule interactions. High-quality and comprehensive data still need to be generated especially on the biological activity of small molecules that target RNAs.
Collapse
|
20
|
Li G, Bai P, Liang C, Luo J. Node-adaptive graph Transformer with structural encoding for accurate and robust lncRNA-disease association prediction. BMC Genomics 2024; 25:73. [PMID: 38233788 PMCID: PMC10795365 DOI: 10.1186/s12864-024-09998-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Accepted: 01/09/2024] [Indexed: 01/19/2024] Open
Abstract
BACKGROUND Long noncoding RNAs (lncRNAs) are integral to a plethora of critical cellular biological processes, including the regulation of gene expression, cell differentiation, and the development of tumors and cancers. Predicting the relationships between lncRNAs and diseases can contribute to a better understanding of the pathogenic mechanisms of disease and provide strong support for the development of advanced treatment methods. RESULTS Therefore, we present an innovative Node-Adaptive Graph Transformer model for predicting unknown LncRNA-Disease Associations, named NAGTLDA. First, we utilize the node-adaptive feature smoothing (NAFS) method to learn the local feature information of nodes and encode the structural information of the fusion similarity network of diseases and lncRNAs using Structural Deep Network Embedding (SDNE). Next, the Transformer module is used to capture potential association information between the network nodes. Finally, we employ a Transformer module with two multi-headed attention layers for learning global-level embedding fusion. Network structure coding is added as the structural inductive bias of the network to compensate for the missing message-passing mechanism in Transformer. NAGTLDA achieved an average AUC of 0.9531 and AUPR of 0.9537 significantly higher than state-of-the-art methods in 5-fold cross validation. We perform case studies on 4 diseases; 55 out of 60 associations between lncRNAs and diseases have been validated in the literatures. The results demonstrate the enormous potential of the graph Transformer structure to incorporate graph structural information for uncovering lncRNA-disease unknown correlations. CONCLUSIONS Our proposed NAGTLDA model can serve as a highly efficient computational method for predicting biological information associations.
Collapse
Affiliation(s)
- Guanghui Li
- School of Information Engineering, East China Jiaotong University, Nanchang, China.
| | - Peihao Bai
- School of Information Engineering, East China Jiaotong University, Nanchang, China
| | - Cheng Liang
- School of Information Science and Engineering, Shandong Normal University, Jinan, China
| | - Jiawei Luo
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, China.
| |
Collapse
|
21
|
Zhang G, Song C, Fan S, Yin M, Wang X, Zhang Y, Huang X, Li Y, Shang D, Li C, Wang Q. LncSEA 2.0: an updated platform for long non-coding RNA related sets and enrichment analysis. Nucleic Acids Res 2024; 52:D919-D928. [PMID: 37986229 PMCID: PMC10767924 DOI: 10.1093/nar/gkad1008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 10/12/2023] [Accepted: 10/19/2023] [Indexed: 11/22/2023] Open
Abstract
Long non-coding RNAs (lncRNAs) possess a wide range of biological functions, and research has demonstrated their significance in regulating major biological processes such as development, differentiation, and immune response. The accelerating accumulation of lncRNA research has greatly expanded our understanding of lncRNA functions. Here, we introduce LncSEA 2.0 (http://bio.liclab.net/LncSEA/index.php), aiming to provide a more comprehensive set of functional lncRNAs and enhanced enrichment analysis capabilities. Compared with LncSEA 1.0, we have made the following improvements: (i) We updated the lncRNA sets for 11 categories and extremely expanded the lncRNA scopes for each set. (ii) We newly introduced 15 functional lncRNA categories from multiple resources. This update not only included a significant amount of downstream regulatory data for lncRNAs, but also covered numerous epigenetic regulatory data sets, including lncRNA-related transcription co-factor binding, chromatin regulator binding, and chromatin interaction data. (iii) We incorporated two new lncRNA set enrichment analysis functions based on GSEA and GSVA. (iv) We adopted the snakemake analysis pipeline to track data processing and analysis. In summary, LncSEA 2.0 offers a more comprehensive collection of lncRNA sets and a greater variety of enrichment analysis modules, assisting researchers in a more comprehensive study of the functional mechanisms of lncRNAs.
Collapse
Affiliation(s)
- Guorui Zhang
- The First Affiliated Hospital & National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Department of Cell Biology and Genetics, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Chao Song
- The First Affiliated Hospital & National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Institute of Cardiovascular Disease, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Shifan Fan
- The First Affiliated Hospital & National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- School of Computer, University of South China, Hengyang, Hunan 421001, China
| | - Mingxue Yin
- The First Affiliated Hospital & National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Department of Cell Biology and Genetics, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Xinyue Wang
- School of Medical Informatics, Daqing Campus, Harbin Medical University. Daqing, 163319, China
| | - Yuexin Zhang
- The First Affiliated Hospital & National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Institute of Cardiovascular Disease, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Xuemei Huang
- The First Affiliated Hospital & National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- School of Computer, University of South China, Hengyang, Hunan 421001, China
| | - Ye Li
- The First Affiliated Hospital & National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Department of Cell Biology and Genetics, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
| | - Desi Shang
- The First Affiliated Hospital & National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Department of Cell Biology and Genetics, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Institute of Cardiovascular Disease, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Department of Cardiology, Hengyang Medical School, University of South China, Hengyang, China
| | - Chunquan Li
- The First Affiliated Hospital & National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Department of Cell Biology and Genetics, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Institute of Cardiovascular Disease, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Department of Cardiology, Hengyang Medical School, University of South China, Hengyang, China
- MOE Key Lab of Rare Pediatric Diseases, University of South China, Hengyang, Hunan 421001, China
| | - Qiuyu Wang
- The First Affiliated Hospital & National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- Department of Cell Biology and Genetics, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Institute of Cardiovascular Disease, Hengyang Medical School, University of South China, Hengyang, Hunan 421001, China
- The First Affiliated Hospital, Department of Cardiology, Hengyang Medical School, University of South China, Hengyang, China
| |
Collapse
|
22
|
Momanyi BM, Zhou YW, Grace-Mercure BK, Temesgen SA, Basharat A, Ning L, Tang L, Gao H, Lin H, Tang H. SAGESDA: Multi-GraphSAGE networks for predicting SnoRNA-disease associations. Curr Res Struct Biol 2023; 7:100122. [PMID: 38188542 PMCID: PMC10771890 DOI: 10.1016/j.crstbi.2023.100122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Revised: 11/30/2023] [Accepted: 12/24/2023] [Indexed: 01/09/2024] Open
Abstract
Over the years, extensive research has highlighted the functional roles of small nucleolar RNAs in various biological processes associated with the development of complex human diseases. Therefore, understanding the existing relationships between different snoRNAs and diseases is crucial for advancing disease diagnosis and treatment. However, classical biological experiments for identifying snoRNA-disease associations are expensive and time-consuming. Therefore, there is an urgent need for cost-effective computational techniques that can enhance the efficiency and accuracy of prediction. While several computational models have already been proposed, many suffer from limitations and suboptimal performance. In this study, we introduced a novel Graph Neural Network-based (GNN) classification model, called SAGESDA, which is implemented through the GraphSAGE architecture with attention for the prediction of snoRNA-disease associations. The classifier leverages local neighbouring nodes in a heterogeneous network to generate new node embeddings through message passing. The mini-batch gradient descent technique was applied to divide the graph into smaller sub-graphs, which enhances the model's accuracy, speed and scalability. With these advancements, SAGESDA attained an area under the receiver operating characteristic (ROC) curve (AUC) of 0.92 using the standard dot product classifier, surpassing previous related studies. This notable performance demonstrates that SAGESDA is a promising model for predicting unknown snoRNA-disease associations with high accuracy. The SAGESDA implementation details can be obtained from https://github.com/momanyibiffon/SAGESDA.git.
Collapse
Affiliation(s)
- Biffon Manyura Momanyi
- School of Computer Science and Engineering, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Yu-Wei Zhou
- School of Health Care Technology, Chengdu Neusoft University, Chengdu, China
| | - Bakanina Kissanga Grace-Mercure
- School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, 610054, China
| | - Sebu Aboma Temesgen
- School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, 610054, China
| | - Ahmad Basharat
- School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, 610054, China
| | - Lin Ning
- School of Health Care Technology, Chengdu Neusoft University, Chengdu, China
- School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, 610054, China
| | - Lixia Tang
- School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, 610054, China
| | - Hui Gao
- School of Computer Science and Engineering, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Hao Lin
- School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, 610054, China
| | - Hua Tang
- School of Basic Medical Sciences, Southwest Medical University, Luzhou, 646000, China
- Basic Medicine Research Innovation Center for Cardiometabolic Diseases, Ministry of Education, Luzhou, 646000, China
- Central Nervous System Drug Key Laboratory of Sichuan Province, Luzhou, 646000, China
| |
Collapse
|
23
|
Kaviani E, Hajibabaie F, Abedpoor N, Safavi K, Ahmadi Z, Karimy A. System biology analysis to develop diagnostic biomarkers, monitoring pathological indexes, and novel therapeutic approaches for immune targeting based on maggot bioactive compounds and polyphenolic cocktails in mice with gastric cancer. ENVIRONMENTAL RESEARCH 2023; 238:117168. [PMID: 37742751 DOI: 10.1016/j.envres.2023.117168] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Revised: 08/26/2023] [Accepted: 09/15/2023] [Indexed: 09/26/2023]
Abstract
Early diagnosis and prognosis are prerequisites for mitigating mortality in gastric cancer (GaCa). Identifying some causative or sensitive elements (coding RNA (cRNA)-non-cRNAs (ncRNAs)) can be very helpful in the early diagnosis of GaCa. Notably, despite significant development in the GaCa treatment, the outcome of patients does not remain satisfactory due to limitations such as multi-drug resistance and tumor relapse. Therefore, more attention has been drawn to complementary therapies and the use of supplements. In this regard, Polyphenol natural compounds (PNC) and maggot larvae (MaLa) alone or in combination were administered along with chemotherapy (paclitaxel) to N-methyl-N-nitrosourea (MNU)- induced murine tumor model. In addition, in order to identify potential diagnostic or prognostic biomarkers, transcriptomics analysis was performed through a bioinformatics approach. Then transcription profile of ncRNAs with their target hub genes was assessed through qPCR Real-Time, Western blot, and ELISA. According to the bioinformatics results, 17 hub genes (e.g., IL-6, CXCL8, MKI67, IL-2, IL-4, IL-10, IL-1β, SPP1, LOX, COL1A1, and IFN-γ) were explored that contribute towards inflammation and oxidative stress and ultimately GaCa development. Upstream of the mentioned hub genes, regulatory factors (lncRNA XIST and NEAT1) were also identified and introduced as prognosis and diagnosis biomarkers for GaCa. Our results showed that PNC alone and in combination with MaLa was able to reduce the size and number of tumors, which is related to the reduction of genes expression levels (including IL-6, CXCL8, MKI67, IL-2, IL-4, IL-10, IL-1β, SPP1, LOX, COL1A1, IFN-γ, NEAT1, and XIST). In conclusion, PNC and MaLa have the potential to be considered as complementary and improving chemotherapy due to their effective compounds. Also, the introduced hub gene and lncRNA in addition to diagnostic and prognostic biomarkers can be used as druggable proteins for novel therapeutic targeting of GaCa.
Collapse
Affiliation(s)
- Elina Kaviani
- Isfahan Endocrine and Metabolism Research Center, Isfahan University of Medical Sciences, Isfahan, Iran.
| | - Fatemeh Hajibabaie
- Department of Physiology, Medicinal Plants Research Center, Isfahan (Khorasgan) Branch, Islamic Azad University, Isfahan, Iran; Department of Biology, Faculty of Basic Sciences, Shahrekord Branch, Islamic Azad University, Shahrekord, Iran.
| | - Navid Abedpoor
- Department of Physiology, Medicinal Plants Research Center, Isfahan (Khorasgan) Branch, Islamic Azad University, Isfahan, Iran; Department of Sports Physiology, Faculty of Sports Sciences, Isfahan (Khorasgan) Branch, Islamic Azad University, Isfahan, Iran.
| | - Kamran Safavi
- Department of Plant Biotechnology, Medicinal Plants Research Centre, Isfahan (Khorasgan) Branch, Islamic Azad University, Isfahan, Iran.
| | - Zahra Ahmadi
- Department of Physiology, Medicinal Plants Research Center, Isfahan (Khorasgan) Branch, Islamic Azad University, Isfahan, Iran; Department of Sports Physiology, Faculty of Sports Sciences, Isfahan (Khorasgan) Branch, Islamic Azad University, Isfahan, Iran.
| | - Azadeh Karimy
- Department of Plant Biotechnology, Medicinal Plants Research Centre, Isfahan (Khorasgan) Branch, Islamic Azad University, Isfahan, Iran.
| |
Collapse
|
24
|
Tao S, Hou Y, Diao L, Hu Y, Xu W, Xie S, Xiao Z. Long noncoding RNA study: Genome-wide approaches. Genes Dis 2023; 10:2491-2510. [PMID: 37554208 PMCID: PMC10404890 DOI: 10.1016/j.gendis.2022.10.024] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2022] [Revised: 10/09/2022] [Accepted: 10/23/2022] [Indexed: 11/30/2022] Open
Abstract
Long noncoding RNAs (lncRNAs) have been confirmed to play a crucial role in various biological processes across several species. Though many efforts have been devoted to the expansion of the lncRNAs landscape, much about lncRNAs is still unknown due to their great complexity. The development of high-throughput technologies and the constantly improved bioinformatic methods have resulted in a rapid expansion of lncRNA research and relevant databases. In this review, we introduced genome-wide research of lncRNAs in three parts: (i) novel lncRNA identification by high-throughput sequencing and computational pipelines; (ii) functional characterization of lncRNAs by expression atlas profiling, genome-scale screening, and the research of cancer-related lncRNAs; (iii) mechanism research by large-scale experimental technologies and computational analysis. Besides, primary experimental methods and bioinformatic pipelines related to these three parts are summarized. This review aimed to provide a comprehensive and systemic overview of lncRNA genome-wide research strategies and indicate a genome-wide lncRNA research system.
Collapse
Affiliation(s)
- Shuang Tao
- The Biotherapy Center, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong 510630, China
| | - Yarui Hou
- The Biotherapy Center, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong 510630, China
| | - Liting Diao
- The Biotherapy Center, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong 510630, China
| | - Yanxia Hu
- The Biotherapy Center, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong 510630, China
| | - Wanyi Xu
- The Biotherapy Center, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong 510630, China
| | - Shujuan Xie
- The Biotherapy Center, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong 510630, China
- Institute of Vaccine, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong 510630, China
| | - Zhendong Xiao
- The Biotherapy Center, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong 510630, China
| |
Collapse
|
25
|
Hu X, Liu D, Zhang J, Fan Y, Ouyang T, Luo Y, Zhang Y, Deng L. A comprehensive review and evaluation of graph neural networks for non-coding RNA and complex disease associations. Brief Bioinform 2023; 24:bbad410. [PMID: 37985451 DOI: 10.1093/bib/bbad410] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Revised: 10/07/2023] [Accepted: 10/25/2023] [Indexed: 11/22/2023] Open
Abstract
Non-coding RNAs (ncRNAs) play a critical role in the occurrence and development of numerous human diseases. Consequently, studying the associations between ncRNAs and diseases has garnered significant attention from researchers in recent years. Various computational methods have been proposed to explore ncRNA-disease relationships, with Graph Neural Network (GNN) emerging as a state-of-the-art approach for ncRNA-disease association prediction. In this survey, we present a comprehensive review of GNN-based models for ncRNA-disease associations. Firstly, we provide a detailed introduction to ncRNAs and GNNs. Next, we delve into the motivations behind adopting GNNs for predicting ncRNA-disease associations, focusing on data structure, high-order connectivity in graphs and sparse supervision signals. Subsequently, we analyze the challenges associated with using GNNs in predicting ncRNA-disease associations, covering graph construction, feature propagation and aggregation, and model optimization. We then present a detailed summary and performance evaluation of existing GNN-based models in the context of ncRNA-disease associations. Lastly, we explore potential future research directions in this rapidly evolving field. This survey serves as a valuable resource for researchers interested in leveraging GNNs to uncover the complex relationships between ncRNAs and diseases.
Collapse
Affiliation(s)
- Xiaowen Hu
- School of Computer Science and Engineering, Central South University,410075 Changsha, China
| | - Dayun Liu
- School of Computer Science and Engineering, Central South University,410075 Changsha, China
| | - Jiaxuan Zhang
- Department of Electrical and Computer Engineering, University of California, San Diego,92093 CA, USA
| | - Yanhao Fan
- School of Computer Science and Engineering, Central South University,410075 Changsha, China
| | - Tianxiang Ouyang
- School of Computer Science and Engineering, Central South University,410075 Changsha, China
| | - Yue Luo
- School of Computer Science and Engineering, Central South University,410075 Changsha, China
| | - Yuanpeng Zhang
- school of software, Xinjiang University, 830046 Urumqi, China
| | - Lei Deng
- School of Computer Science and Engineering, Central South University,410075 Changsha, China
| |
Collapse
|
26
|
Szakats S, McAtamney A, Cross H, Wilson MJ. Sex-biased gene and microRNA expression in the developing mouse brain is associated with neurodevelopmental functions and neurological phenotypes. Biol Sex Differ 2023; 14:57. [PMID: 37679839 PMCID: PMC10486049 DOI: 10.1186/s13293-023-00538-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Accepted: 08/18/2023] [Indexed: 09/09/2023] Open
Abstract
BACKGROUND Sex differences pose a challenge and an opportunity in biomedical research. Understanding how sex chromosomes and hormones affect disease-causing mechanisms will shed light on the mechanisms underlying predominantly idiopathic sex-biased neurodevelopmental disorders such as ADHD, schizophrenia, and autism. Gene expression is a crucial conduit for the influence of sex on developmental processes; therefore, this study focused on sex differences in gene expression and the regulation of gene expression. The increasing interest in microRNAs (miRNAs), small, non-coding RNAs, for their contribution to normal and pathological neurodevelopment prompted us to test how miRNA expression differs between the sexes in the developing brain. METHODS High-throughput sequencing approaches were used to identify transcripts, including miRNAs, that showed significantly different expression between male and female brains on day 15.5 of development (E15.5). RESULTS Robust sex differences were identified for some genes and miRNAs, confirming the influence of biological sex on RNA. Many miRNAs that exhibit the greatest differences between males and females have established roles in neurodevelopment, implying that sex-biased expression may drive sex differences in developmental processes. In addition to highlighting sex differences for individual miRNAs, gene ontology analysis suggested several broad categories in which sex-biased RNAs might act to establish sex differences in the embryonic mouse brain. Finally, mining publicly available SNP data indicated that some sex-biased miRNAs reside near the genomic regions associated with neurodevelopmental disorders. CONCLUSIONS Together, these findings reinforce the importance of cataloguing sex differences in molecular biology research and highlight genes, miRNAs, and pathways of interest that may be important for sexual differentiation in the mouse and possibly the human brain.
Collapse
Affiliation(s)
- Susanna Szakats
- Developmental Genomics Laboratory, Department of Anatomy, School of Biomedical Sciences, University of Otago, P.O. Box 56, Dunedin, 9054, New Zealand
| | - Alice McAtamney
- Developmental Genomics Laboratory, Department of Anatomy, School of Biomedical Sciences, University of Otago, P.O. Box 56, Dunedin, 9054, New Zealand
| | - Hugh Cross
- Developmental Genomics Laboratory, Department of Anatomy, School of Biomedical Sciences, University of Otago, P.O. Box 56, Dunedin, 9054, New Zealand
| | - Megan J Wilson
- Developmental Genomics Laboratory, Department of Anatomy, School of Biomedical Sciences, University of Otago, P.O. Box 56, Dunedin, 9054, New Zealand.
| |
Collapse
|
27
|
Qiao LJ, Gao Z, Ji CM, Liu ZH, Zheng CH, Wang YT. Potential circRNA-Disease Association Prediction Using DeepWalk and Nonnegative Matrix Factorization. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:3154-3162. [PMID: 37018084 DOI: 10.1109/tcbb.2023.3264466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Circular RNAs (circRNAs) are a category of noncoding RNAs that exist in great numbers in eukaryotes. They have recently been discovered to be crucial in the growth of tumors. Therefore, it is important to explore the association of circRNAs with disease. This paper proposes a new method based on DeepWalk and nonnegative matrix factorization (DWNMF) to predict circRNA-disease association. Based on the known circRNA-disease association, we calculate the topological similarity of circRNA and disease via the DeepWalk-based method to learn the node features on the association network. Next, the functional similarity of the circRNAs and the semantic similarity of the diseases are fused with their respective topological similarities at different scales. Then, we use the improved weighted K-nearest neighbor (IWKNN) method to preprocess the circRNA-disease association network and correct nonnegative associations by setting different parameters K1 and K2 in the circRNA and disease matrices. Finally, the L2,1-norm, dual-graph regularization term and Frobenius norm regularization term are introduced into the nonnegative matrix factorization model to predict the circRNA-disease correlation. We perform cross-validation on circR2Disease, circRNADisease, and MNDR. The numerical results show that DWNMF is an efficient tool for forecasting potential circRNA-disease relationships, outperforming other state-of-the-art approaches in terms of predictive performance.
Collapse
|
28
|
Zhang W, Liu B. iSnoDi-MDRF: Identifying snoRNA-Disease Associations Based on Multiple Biological Data by Ranking Framework. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:3013-3019. [PMID: 37030816 DOI: 10.1109/tcbb.2023.3258448] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Accumulating evidence indicates that the dysregulation of small nucleolar RNAs (snoRNAs) is relevant with diseases. Identifying snoRNA-disease associations by computational methods is desired for biologists, which can save considerable costs and time compared biological experiments. However, it still faces some challenges as followings: (i) Many snoRNAs are detected in recent years, but only a few snoRNAs have been proved to be associated with diseases; (ii) Computational predictors trained with only a few known snoRNA-disease associations fail to accurately identify the snoRNA-disease associations. In this study, we propose a ranking framework, called iSnoDi-MDRF, to identify potential snoRNA-disease associations based on multiple biological data, which has the following highlights: (i) iSnoDi-MDRF integrates ranking framework, which is not only able to identify potential associations between known snoRNAs and diseases, but also can identify diseases associated with new snoRNAs. (ii) Known gene-disease associations are employed to help train a mature model for predicting snoRNA-disease association. Experimental results illustrate that iSnoDi-MDRF is very suitable for identifying potential snoRNA-disease associations. The web server of iSnoDi-MDRF predictor is freely available at http://bliulab.net/iSnoDi-MDRF/.
Collapse
|
29
|
Rasmussen A, Okholm T, Knudsen M, Vang S, Dyrskjøt L, Hansen T, Pedersen J. Circular stable intronic RNAs possess distinct biological features and are deregulated in bladder cancer. NAR Cancer 2023; 5:zcad041. [PMID: 37554968 PMCID: PMC10405568 DOI: 10.1093/narcan/zcad041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2022] [Accepted: 08/02/2023] [Indexed: 08/10/2023] Open
Abstract
Until recently, intronic lariats were regarded as short-lasting splicing byproducts with no apparent function; however, increasing evidence of stable derivatives suggests regulatory roles. Yet little is known about their characteristics, functions, distribution, and expression in healthy and tumor tissue. Here, we profiled and characterized circular stable intronic sequence RNAs (sisRNAs) using total RNA-Seq data from bladder cancer (BC; n = 457, UROMOL cohort), healthy tissue (n = 46), and fractionated cell lines (n = 5). We found that the recently-discovered full-length intronic circles and the stable lariats formed distinct subclasses, with a surprisingly high intronic circle fraction in BC (∼45%) compared to healthy tissues (0-20%). The stable lariats and their host introns were characterized by small transcript sizes, highly conserved BP regions, enriched BP motifs, and localization in multiple cell fractions. Additionally, circular sisRNAs showed tissue-specific expression patterns. We found nine circular sisRNAs as differentially expressed across early-stage BC patients with different prognoses, and sisHNRNPK expression correlated with progression-free survival. In conclusion, we identify distinguishing biological features of circular sisRNAs and point to specific candidates (incl. sisHNRNPK, sisWDR13 and sisMBNL1) that were highly expressed, had evolutionary conserved sequences, or had clinical correlations, which may facilitate future studies and further insights into their functional roles.
Collapse
Affiliation(s)
- Asta M Rasmussen
- Department of Clinical Medicine, Aarhus University, Aarhus 8000, Denmark
- Department of Molecular Medicine (MOMA), Aarhus University Hospital, Aarhus N 8200, Denmark
- Bioinformatics Research Center (BiRC), Aarhus University, Aarhus 8000, Denmark
| | - Trine Line H Okholm
- Departments of Otolaryngology-Head and Neck Surgery and Microbiology & Immunology, University of California, San Francisco, CA, USA
| | - Michael Knudsen
- Department of Clinical Medicine, Aarhus University, Aarhus 8000, Denmark
- Department of Molecular Medicine (MOMA), Aarhus University Hospital, Aarhus N 8200, Denmark
| | - Søren Vang
- Department of Clinical Medicine, Aarhus University, Aarhus 8000, Denmark
- Department of Molecular Medicine (MOMA), Aarhus University Hospital, Aarhus N 8200, Denmark
| | - Lars Dyrskjøt
- Department of Clinical Medicine, Aarhus University, Aarhus 8000, Denmark
- Department of Molecular Medicine (MOMA), Aarhus University Hospital, Aarhus N 8200, Denmark
| | - Thomas B Hansen
- Department of Molecular Biology and Genetics (MBG), Aarhus University, Aarhus 8000, Denmark
| | - Jakob S Pedersen
- Department of Clinical Medicine, Aarhus University, Aarhus 8000, Denmark
- Department of Molecular Medicine (MOMA), Aarhus University Hospital, Aarhus N 8200, Denmark
- Bioinformatics Research Center (BiRC), Aarhus University, Aarhus 8000, Denmark
| |
Collapse
|
30
|
Shakhpazyan NK, Mikhaleva LM, Bedzhanyan AL, Sadykhov NK, Midiber KY, Konyukova AK, Kontorschikov AS, Maslenkina KS, Orekhov AN. Long Non-Coding RNAs in Colorectal Cancer: Navigating the Intersections of Immunity, Intercellular Communication, and Therapeutic Potential. Biomedicines 2023; 11:2411. [PMID: 37760852 PMCID: PMC10525929 DOI: 10.3390/biomedicines11092411] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Revised: 08/18/2023] [Accepted: 08/23/2023] [Indexed: 09/29/2023] Open
Abstract
This comprehensive review elucidates the intricate roles of long non-coding RNAs (lncRNAs) within the colorectal cancer (CRC) microenvironment, intersecting the domains of immunity, intercellular communication, and therapeutic potential. lncRNAs, which are significantly involved in the pathogenesis of CRC, immune evasion, and the treatment response to CRC, have crucial implications in inflammation and serve as promising candidates for novel therapeutic strategies and biomarkers. This review scrutinizes the interaction of lncRNAs with the Consensus Molecular Subtypes (CMSs) of CRC, their complex interplay with the tumor stroma affecting immunity and inflammation, and their conveyance via extracellular vesicles, particularly exosomes. Furthermore, we delve into the intricate relationship between lncRNAs and other non-coding RNAs, including microRNAs and circular RNAs, in mediating cell-to-cell communication within the CRC microenvironment. Lastly, we propose potential strategies to manipulate lncRNAs to enhance anti-tumor immunity, thereby underlining the significance of lncRNAs in devising innovative therapeutic interventions in CRC.
Collapse
Affiliation(s)
- Nikolay K. Shakhpazyan
- Avtsyn Research Institute of Human Morphology, Petrovsky National Research Center of Surgery, 119435 Moscow, Russia; (L.M.M.); (N.K.S.); (K.Y.M.); (A.K.K.); (A.S.K.); (K.S.M.); (A.N.O.)
| | - Liudmila M. Mikhaleva
- Avtsyn Research Institute of Human Morphology, Petrovsky National Research Center of Surgery, 119435 Moscow, Russia; (L.M.M.); (N.K.S.); (K.Y.M.); (A.K.K.); (A.S.K.); (K.S.M.); (A.N.O.)
| | - Arcady L. Bedzhanyan
- Department of Abdominal Surgery and Oncology II (Coloproctology and Uro-Gynecology), Petrovsky National Research Center of Surgery, 119435 Moscow, Russia;
| | - Nikolay K. Sadykhov
- Avtsyn Research Institute of Human Morphology, Petrovsky National Research Center of Surgery, 119435 Moscow, Russia; (L.M.M.); (N.K.S.); (K.Y.M.); (A.K.K.); (A.S.K.); (K.S.M.); (A.N.O.)
| | - Konstantin Y. Midiber
- Avtsyn Research Institute of Human Morphology, Petrovsky National Research Center of Surgery, 119435 Moscow, Russia; (L.M.M.); (N.K.S.); (K.Y.M.); (A.K.K.); (A.S.K.); (K.S.M.); (A.N.O.)
| | - Alexandra K. Konyukova
- Avtsyn Research Institute of Human Morphology, Petrovsky National Research Center of Surgery, 119435 Moscow, Russia; (L.M.M.); (N.K.S.); (K.Y.M.); (A.K.K.); (A.S.K.); (K.S.M.); (A.N.O.)
| | - Andrey S. Kontorschikov
- Avtsyn Research Institute of Human Morphology, Petrovsky National Research Center of Surgery, 119435 Moscow, Russia; (L.M.M.); (N.K.S.); (K.Y.M.); (A.K.K.); (A.S.K.); (K.S.M.); (A.N.O.)
| | - Ksenia S. Maslenkina
- Avtsyn Research Institute of Human Morphology, Petrovsky National Research Center of Surgery, 119435 Moscow, Russia; (L.M.M.); (N.K.S.); (K.Y.M.); (A.K.K.); (A.S.K.); (K.S.M.); (A.N.O.)
| | - Alexander N. Orekhov
- Avtsyn Research Institute of Human Morphology, Petrovsky National Research Center of Surgery, 119435 Moscow, Russia; (L.M.M.); (N.K.S.); (K.Y.M.); (A.K.K.); (A.S.K.); (K.S.M.); (A.N.O.)
- Laboratory of Angiopathology, Institute of General Pathology and Pathophysiology, 125315 Moscow, Russia
- Institute for Atherosclerosis Research, 121096 Moscow, Russia
| |
Collapse
|
31
|
Zhu W, Yuan SS, Li J, Huang CB, Lin H, Liao B. A First Computational Frame for Recognizing Heparin-Binding Protein. Diagnostics (Basel) 2023; 13:2465. [PMID: 37510209 PMCID: PMC10377868 DOI: 10.3390/diagnostics13142465] [Citation(s) in RCA: 36] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2023] [Revised: 07/13/2023] [Accepted: 07/21/2023] [Indexed: 07/30/2023] Open
Abstract
Heparin-binding protein (HBP) is a cationic antibacterial protein derived from multinuclear neutrophils and an important biomarker of infectious diseases. The correct identification of HBP is of great significance to the study of infectious diseases. This work provides the first HBP recognition framework based on machine learning to accurately identify HBP. By using four sequence descriptors, HBP and non-HBP samples were represented by discrete numbers. By inputting these features into a support vector machine (SVM) and random forest (RF) algorithm and comparing the prediction performances of these methods on training data and independent test data, it is found that the SVM-based classifier has the greatest potential to identify HBP. The model could produce an auROC of 0.981 ± 0.028 on training data using 10-fold cross-validation and an overall accuracy of 95.0% on independent test data. As the first model for HBP recognition, it will provide some help for infectious diseases and stimulate further research in related fields.
Collapse
Affiliation(s)
- Wen Zhu
- Key Laboratory of Computational Science and Application of Hainan Province, Haikou 571158, China
- Key Laboratory of Data Science and Intelligence Education, Hainan Normal University, Ministry of Education, Haikou 571158, China
- School of Mathematics and Statistics, Hainan Normal University, Haikou 571158, China
| | - Shi-Shi Yuan
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, China
| | - Jian Li
- School of Basic Medical Sciences, Chengdu University, Chengdu 610106, China
| | - Cheng-Bing Huang
- School of Computer Science and Technology, ABa Teachers University, Chengdu 623002, China
| | - Hao Lin
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, China
| | - Bo Liao
- Key Laboratory of Computational Science and Application of Hainan Province, Haikou 571158, China
- Key Laboratory of Data Science and Intelligence Education, Hainan Normal University, Ministry of Education, Haikou 571158, China
- School of Mathematics and Statistics, Hainan Normal University, Haikou 571158, China
| |
Collapse
|
32
|
Wekesa JS, Kimwele M. A review of multi-omics data integration through deep learning approaches for disease diagnosis, prognosis, and treatment. Front Genet 2023; 14:1199087. [PMID: 37547471 PMCID: PMC10398577 DOI: 10.3389/fgene.2023.1199087] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2023] [Accepted: 07/11/2023] [Indexed: 08/08/2023] Open
Abstract
Accurate diagnosis is the key to providing prompt and explicit treatment and disease management. The recognized biological method for the molecular diagnosis of infectious pathogens is polymerase chain reaction (PCR). Recently, deep learning approaches are playing a vital role in accurately identifying disease-related genes for diagnosis, prognosis, and treatment. The models reduce the time and cost used by wet-lab experimental procedures. Consequently, sophisticated computational approaches have been developed to facilitate the detection of cancer, a leading cause of death globally, and other complex diseases. In this review, we systematically evaluate the recent trends in multi-omics data analysis based on deep learning techniques and their application in disease prediction. We highlight the current challenges in the field and discuss how advances in deep learning methods and their optimization for application is vital in overcoming them. Ultimately, this review promotes the development of novel deep-learning methodologies for data integration, which is essential for disease detection and treatment.
Collapse
|
33
|
Szulc NA, Mackiewicz Z, Bujnicki JM, Stefaniak F. Structural interaction fingerprints and machine learning for predicting and explaining binding of small molecule ligands to RNA. Brief Bioinform 2023; 24:bbad187. [PMID: 37204195 DOI: 10.1093/bib/bbad187] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2022] [Revised: 04/07/2023] [Accepted: 04/25/2023] [Indexed: 05/20/2023] Open
Abstract
Ribonucleic acids (RNAs) play crucial roles in living organisms and some of them, such as bacterial ribosomes and precursor messenger RNA, are targets of small molecule drugs, whereas others, e.g. bacterial riboswitches or viral RNA motifs are considered as potential therapeutic targets. Thus, the continuous discovery of new functional RNA increases the demand for developing compounds targeting them and for methods for analyzing RNA-small molecule interactions. We recently developed fingeRNAt-a software for detecting non-covalent bonds formed within complexes of nucleic acids with different types of ligands. The program detects several non-covalent interactions and encodes them as structural interaction fingerprint (SIFt). Here, we present the application of SIFts accompanied by machine learning methods for binding prediction of small molecules to RNA. We show that SIFt-based models outperform the classic, general-purpose scoring functions in virtual screening. We also employed Explainable Artificial Intelligence (XAI)-the SHapley Additive exPlanations, Local Interpretable Model-agnostic Explanations and other methods to help understand the decision-making process behind the predictive models. We conducted a case study in which we applied XAI on a predictive model of ligand binding to human immunodeficiency virus type 1 trans-activation response element RNA to distinguish between residues and interaction types important for binding. We also used XAI to indicate whether an interaction has a positive or negative effect on binding prediction and to quantify its impact. Our results obtained using all XAI methods were consistent with the literature data, demonstrating the utility and importance of XAI in medicinal chemistry and bioinformatics.
Collapse
Affiliation(s)
- Natalia A Szulc
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 4 Ks. Trojdena Str, 02-109 Warsaw, Poland
- Laboratory of Protein Metabolism, International Institute of Molecular and Cell Biology in Warsaw, 4 Ks. Trojdena Str, 02-109 Warsaw, Poland
| | - Zuzanna Mackiewicz
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 4 Ks. Trojdena Str, 02-109 Warsaw, Poland
- Laboratory of RNA Biology - ERA Chairs Group, International Institute of Molecular and Cell Biology in Warsaw, 4 Ks. Trojdena Str, 02-109 Warsaw, Poland
| | - Janusz M Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 4 Ks. Trojdena Str, 02-109 Warsaw, Poland
| | - Filip Stefaniak
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, 4 Ks. Trojdena Str, 02-109 Warsaw, Poland
| |
Collapse
|
34
|
Rego N, Libisch MG, Rovira C, Tosar JP, Robello C. Comparative microRNA profiling of Trypanosoma cruzi infected human cells. Front Cell Infect Microbiol 2023; 13:1187375. [PMID: 37424776 PMCID: PMC10322668 DOI: 10.3389/fcimb.2023.1187375] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Accepted: 06/01/2023] [Indexed: 07/11/2023] Open
Abstract
Introduction Trypanosoma cruzi, the causative agent of Chagas disease, can infect almost any nucleated cell in the mammalian host. Although previous studies have described the transcriptomic changes that occur in host cells during parasite infection, the understanding of the role of post-transcriptional regulation in this process is limited. MicroRNAs, a class of short non-coding RNAs, are key players in regulating gene expression at the post-transcriptional level, and their involvement in the host-T. cruzi interplay is a growing area of research. However, to our knowledge, there are no comparative studies on the microRNA changes that occur in different cell types in response to T. cruzi infection. Methods and results Here we investigated microRNA changes in epithelial cells, cardiomyocytes and macrophages infected with T. cruzi for 24 hours, using small RNA sequencing followed by careful bioinformatics analysis. We show that, although microRNAs are highly cell type-specific, a signature of three microRNAs -miR-146a, miR-708 and miR-1246, emerges as consistently responsive to T. cruzi infection across representative human cell types. T. cruzi lacks canonical microRNA-induced silencing mechanisms and we confirm that it does not produce any small RNA that mimics known host microRNAs. We found that macrophages show a broad response to parasite infection, while microRNA changes in epithelial and cardiomyocytes are modest. Complementary data indicated that cardiomyocyte response may be greater at early time points of infection. Conclusions Our findings emphasize the significance of considering microRNA changes at the cellular level and complement previous studies conducted at higher organizational levels, such as heart samples. While miR-146a has been previously implicated in T. cruzi infection, similarly to its involvement in many other immunological responses, miR-1246 and miR-708 are demonstrated here for the first time. Given their expression in multiple cell types, we anticipate our work as a starting point for future investigations into their role in the post-transcriptional regulation of T. cruzi infected cells and their potential as biomarkers for Chagas disease.
Collapse
Affiliation(s)
- Natalia Rego
- Unidad de Bioinformática, Institut Pasteur de Montevideo, Montevideo, Uruguay
- Laboratorio de Genómica Evolutiva, Facultad de Ciencias, Universidad de la República, Montevideo, Uruguay
| | - María Gabriela Libisch
- Laboratorio de Interacciones Hospedero Patógeno/UBM, Institut Pasteur de Montevideo, Montevideo, Uruguay
| | - Carlos Rovira
- Department of Clinical Sciences Lund, Division of Oncology, Lund University, Lund, Sweden
| | - Juan Pablo Tosar
- Laboratorio de Genómica Funcional, Institut Pasteur de Montevideo, Montevideo, Uruguay
- Unidad de Bioquímica Analítica, Centro de Investigaciones Nucleares, Facultad de Ciencias, Universidad de la República, Montevideo, Uruguay
| | - Carlos Robello
- Laboratorio de Interacciones Hospedero Patógeno/UBM, Institut Pasteur de Montevideo, Montevideo, Uruguay
- Departamento de Bioquímica, Facultad de Medicina, Universidad de la República, Montevideo, Uruguay
| |
Collapse
|
35
|
Hou J, Wei H, Liu B. iPiDA-SWGCN: Identification of piRNA-disease associations based on Supplementarily Weighted Graph Convolutional Network. PLoS Comput Biol 2023; 19:e1011242. [PMID: 37339125 DOI: 10.1371/journal.pcbi.1011242] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2022] [Accepted: 06/05/2023] [Indexed: 06/22/2023] Open
Abstract
Accurately identifying potential piRNA-disease associations is of great importance in uncovering the pathogenesis of diseases. Recently, several machine-learning-based methods have been proposed for piRNA-disease association detection. However, they are suffering from the high sparsity of piRNA-disease association network and the Boolean representation of piRNA-disease associations ignoring the confidence coefficients. In this study, we propose a supplementarily weighted strategy to solve these disadvantages. Combined with Graph Convolutional Networks (GCNs), a novel predictor called iPiDA-SWGCN is proposed for piRNA-disease association prediction. There are three main contributions of iPiDA-SWGCN: (i) Potential piRNA-disease associations are preliminarily supplemented in the sparse piRNA-disease network by integrating various basic predictors to enrich network structure information. (ii) The original Boolean piRNA-disease associations are assigned with different relevance confidence to learn node representations from neighbour nodes in varying degrees. (iii) The experimental results show that iPiDA-SWGCN achieves the best performance compared with the other state-of-the-art methods, and can predict new piRNA-disease associations.
Collapse
Affiliation(s)
- Jialu Hou
- School of Computer Science and Technology, Beijing Institute of Technology, Beijing, China
| | - Hang Wei
- School of Computer Science and Technology, Xidian University, Xi'an, Shaanxi, China
| | - Bin Liu
- School of Computer Science and Technology, Beijing Institute of Technology, Beijing, China
- Advanced Research Institute of Multidisciplinary Science, Beijing Institute of Technology, Beijing, China
| |
Collapse
|
36
|
Fan YX, Andoh V, Chen L. Multi-omics study and ncRNA regulation of anti-BmNPV in silkworms, Bombyx mori: an update. Front Microbiol 2023; 14:1123448. [PMID: 37275131 PMCID: PMC10232802 DOI: 10.3389/fmicb.2023.1123448] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Accepted: 05/02/2023] [Indexed: 06/07/2023] Open
Abstract
Bombyx mori silkworm is an important economic insect which has a significant contribution to the improvement of the economy. Bombyx mori nucleopolyhedrovirus (BmNPV) is a vitally significant purulent virus that impedes the sustainable and stable development of the silkworm industry, resulting in substantial economic losses. In recent years, with the development of biotechnology, transcriptomics, proteomics, metabolomics, and the related techniques have been used to select BmNPV-resistant genes, proteins, and metabolites. The regulatory networks between viruses and hosts have been gradually clarified with the discovery of ncRNAs, such as miRNA, lncRNA, and circRNA in cells. Thus, this paper aims to highlight the results of current multi-omics and ncRNA studies on BmNPV resistance in the silkworm, providing some references for resistant strategies in the silkworm to BmNPV.
Collapse
|
37
|
Aparicio-Puerta E, Hirsch P, Schmartz GP, Kern F, Fehlmann T, Keller A. miEAA 2023: updates, new functional microRNA sets and improved enrichment visualizations. Nucleic Acids Res 2023:7161530. [PMID: 37177999 DOI: 10.1093/nar/gkad392] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2023] [Revised: 04/21/2023] [Accepted: 05/05/2023] [Indexed: 05/15/2023] Open
Abstract
MicroRNAs (miRNAs) are small non-coding RNAs that play a critical role in regulating diverse biological processes. Extracting functional insights from a list of miRNAs is challenging, as each miRNA can potentially interact with hundreds of genes. To address this challenge, we developed miEAA, a flexible and comprehensive miRNA enrichment analysis tool based on direct and indirect miRNA annotation. The latest release of miEAA includes a data warehouse of 19 miRNA repositories, covering 10 different organisms and 139 399 functional categories. We have added information on the cellular context of miRNAs, isomiRs, and high-confidence miRNAs to improve the accuracy of the results. We have also improved the representation of aggregated results, including interactive Upset plots to aid users in understanding the interaction among enriched terms or categories. Finally, we demonstrate the functionality of miEAA in the context of ageing and highlight the importance of carefully considering the miRNA input list. MiEAA is free to use and publicly available at https://www.ccb.uni-saarland.de/mieaa/.
Collapse
Affiliation(s)
| | - Pascal Hirsch
- Clinical Bioinformatics, Saarland University, 66123 Saarbrücken, Germany
| | - Georges P Schmartz
- Clinical Bioinformatics, Saarland University, 66123 Saarbrücken, Germany
| | - Fabian Kern
- Clinical Bioinformatics, Saarland University, 66123 Saarbrücken, Germany
- Helmholtz Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research (HZI), Saarland University Campus, 66123 Saarbrücken, Germany
| | - Tobias Fehlmann
- Clinical Bioinformatics, Saarland University, 66123 Saarbrücken, Germany
| | - Andreas Keller
- Clinical Bioinformatics, Saarland University, 66123 Saarbrücken, Germany
- Helmholtz Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research (HZI), Saarland University Campus, 66123 Saarbrücken, Germany
| |
Collapse
|
38
|
Zhang L, Lu D, Bi X, Zhao K, Yu G, Quan N. Predicting disease genes based on multi-head attention fusion. BMC Bioinformatics 2023; 24:162. [PMID: 37085750 PMCID: PMC10122338 DOI: 10.1186/s12859-023-05285-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2022] [Accepted: 04/12/2023] [Indexed: 04/23/2023] Open
Abstract
BACKGROUND The identification of disease-related genes is of great significance for the diagnosis and treatment of human disease. Most studies have focused on developing efficient and accurate computational methods to predict disease-causing genes. Due to the sparsity and complexity of biomedical data, it is still a challenge to develop an effective multi-feature fusion model to identify disease genes. RESULTS This paper proposes an approach to predict the pathogenic gene based on multi-head attention fusion (MHAGP). Firstly, the heterogeneous biological information networks of disease genes are constructed by integrating multiple biomedical knowledge databases. Secondly, two graph representation learning algorithms are used to capture the feature vectors of gene-disease pairs from the network, and the features are fused by introducing multi-head attention. Finally, multi-layer perceptron model is used to predict the gene-disease association. CONCLUSIONS The MHAGP model outperforms all of other methods in comparative experiments. Case studies also show that MHAGP is able to predict genes potentially associated with diseases. In the future, more biological entity association data, such as gene-drug, disease phenotype-gene ontology and so on, can be added to expand the information in heterogeneous biological networks and achieve more accurate predictions. In addition, MHAGP with strong expansibility can be used for potential tasks such as gene-drug association and drug-disease association prediction.
Collapse
Affiliation(s)
- Linlin Zhang
- College of Software Engineering, Xinjiang University, Urumqi, China.
| | - Dianrong Lu
- College of information Science and Engineering, Xinjiang University, Urumqi, China
| | - Xuehua Bi
- Medical Engineering and Technology College, Xinjiang Medical University, Urumqi, China
| | - Kai Zhao
- College of information Science and Engineering, Xinjiang University, Urumqi, China
| | - Guanglei Yu
- Medical Engineering and Technology College, Xinjiang Medical University, Urumqi, China
| | - Na Quan
- College of information Science and Engineering, Xinjiang University, Urumqi, China
| |
Collapse
|
39
|
Zulfiqar H, Ahmed Z, Kissanga Grace-Mercure B, Hassan F, Zhang ZY, Liu F. Computational prediction of promotors in Agrobacterium tumefaciens strain C58 by using the machine learning technique. Front Microbiol 2023; 14:1170785. [PMID: 37125199 PMCID: PMC10133480 DOI: 10.3389/fmicb.2023.1170785] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Accepted: 03/17/2023] [Indexed: 05/02/2023] Open
Abstract
Promotors are those genomic regions on the upstream of genes, which are bound by RNA polymerase for starting gene transcription. Because it is the most critical element of gene expression, the recognition of promoters is crucial to understand the regulation of gene expression. This study aimed to develop a machine learning-based model to predict promotors in Agrobacterium tumefaciens (A. tumefaciens) strain C58. In the model, promotor sequences were encoded by three different kinds of feature descriptors, namely, accumulated nucleotide frequency, k-mer nucleotide composition, and binary encodings. The obtained features were optimized by using correlation and the mRMR-based algorithm. These optimized features were inputted into a random forest (RF) classifier to discriminate promotor sequences from non-promotor sequences in A. tumefaciens strain C58. The examination of 10-fold cross-validation showed that the proposed model could yield an overall accuracy of 0.837. This model will provide help for the study of promoters in A. tumefaciens C58 strain.
Collapse
Affiliation(s)
- Hasan Zulfiqar
- Yangtze Delta Region Institute (Huzhou), University of Electronic Science and Technology of China, Huzhou, China
- School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Zahoor Ahmed
- Yangtze Delta Region Institute (Huzhou), University of Electronic Science and Technology of China, Huzhou, China
| | - Bakanina Kissanga Grace-Mercure
- School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Farwa Hassan
- School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Zhao-Yue Zhang
- School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Fen Liu
- Department of Radiation Oncology, Peking University Cancer Hospital (Inner Mongolia Campus), Affiliated Cancer Hospital of Inner Mongolia Medical University, Inner Mongolia Cancer Hospital, Hohhot, China
| |
Collapse
|
40
|
Mohammed MA, Abdulkareem KH, Dinar AM, Zapirain BG. Rise of Deep Learning Clinical Applications and Challenges in Omics Data: A Systematic Review. Diagnostics (Basel) 2023; 13:diagnostics13040664. [PMID: 36832152 PMCID: PMC9955380 DOI: 10.3390/diagnostics13040664] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2022] [Revised: 02/05/2023] [Accepted: 02/07/2023] [Indexed: 02/12/2023] Open
Abstract
This research aims to review and evaluate the most relevant scientific studies about deep learning (DL) models in the omics field. It also aims to realize the potential of DL techniques in omics data analysis fully by demonstrating this potential and identifying the key challenges that must be addressed. Numerous elements are essential for comprehending numerous studies by surveying the existing literature. For example, the clinical applications and datasets from the literature are essential elements. The published literature highlights the difficulties encountered by other researchers. In addition to looking for other studies, such as guidelines, comparative studies, and review papers, a systematic approach is used to search all relevant publications on omics and DL using different keyword variants. From 2018 to 2022, the search procedure was conducted on four Internet search engines: IEEE Xplore, Web of Science, ScienceDirect, and PubMed. These indexes were chosen because they offer enough coverage and linkages to numerous papers in the biological field. A total of 65 articles were added to the final list. The inclusion and exclusion criteria were specified. Of the 65 publications, 42 are clinical applications of DL in omics data. Furthermore, 16 out of 65 articles comprised the review publications based on single- and multi-omics data from the proposed taxonomy. Finally, only a small number of articles (7/65) were included in papers focusing on comparative analysis and guidelines. The use of DL in studying omics data presented several obstacles related to DL itself, preprocessing procedures, datasets, model validation, and testbed applications. Numerous relevant investigations were performed to address these issues. Unlike other review papers, our study distinctly reflects different observations on omics with DL model areas. We believe that the result of this study can be a useful guideline for practitioners who look for a comprehensive view of the role of DL in omics data analysis.
Collapse
Affiliation(s)
- Mazin Abed Mohammed
- College of Computer Science and Information Technology, University of Anbar, Anbar 31001, Iraq
- eVIDA Lab, University of Deusto, 48007 Bilbao, Spain
- Correspondence: (M.A.M.); (B.G.Z.)
| | - Karrar Hameed Abdulkareem
- College of Agriculture, Al-Muthanna University, Samawah 66001, Iraq
- College of Engineering, University of Warith Al-Anbiyaa, Karbala 56001, Iraq
| | - Ahmed M. Dinar
- Computer Engineering Department, University of Technology- Iraq, Baghdad 19006, Iraq
| | | |
Collapse
|
41
|
Sheng N, Huang L, Lu Y, Wang H, Yang L, Gao L, Xie X, Fu Y, Wang Y. Data resources and computational methods for lncRNA-disease association prediction. Comput Biol Med 2023; 153:106527. [PMID: 36610216 DOI: 10.1016/j.compbiomed.2022.106527] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Revised: 12/08/2022] [Accepted: 12/31/2022] [Indexed: 01/03/2023]
Abstract
Increasing interest has been attracted in deciphering the potential disease pathogenesis through lncRNA-disease association (LDA) prediction, regarding to the diverse functional roles of lncRNAs in genome regulation. Whilst, computational models and algorithms benefit systematic biology research, even facilitate the classical biological experimental procedures. In this review, we introduce representative diseases associated with lncRNAs, such as cancers, cardiovascular diseases, and neurological diseases. Current publicly available resources related to lncRNAs and diseases have also been included. Furthermore, all of the 64 computational methods for LDA prediction have been divided into 5 groups, including machine learning-based methods, network propagation-based methods, matrix factorization- and completion-based methods, deep learning-based methods, and graph neural network-based methods. The common evaluation methods and metrics in LDA prediction have also been discussed. Finally, the challenges and future trends in LDA prediction have been discussed. Recent advances in LDA prediction approaches have been summarized in the GitHub repository at https://github.com/sheng-n/lncRNA-disease-methods.
Collapse
Affiliation(s)
- Nan Sheng
- Key Laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, Changchun, China
| | - Lan Huang
- Key Laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, Changchun, China.
| | - Yuting Lu
- School of Artificial Intelligence, Jilin University, Changchun, China
| | - Hao Wang
- Department of Hepatopancreatobiliary Surgery, Second Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Lili Yang
- Key Laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, Changchun, China; Department of Obstetrics, The First Hospital of Jilin University, Changchun, China
| | - Ling Gao
- Key Laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, Changchun, China
| | - Xuping Xie
- Key Laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, Changchun, China
| | - Yuan Fu
- Institute of Biological, Environmental and Rural Sciences, Aberystwyth University, Aberystwyth, Ceredigion, United Kingdom
| | - Yan Wang
- Key Laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, Changchun, China; School of Artificial Intelligence, Jilin University, Changchun, China.
| |
Collapse
|
42
|
Lu C, Zhang L, Zeng M, Lan W, Duan G, Wang J. Inferring disease-associated circRNAs by multi-source aggregation based on heterogeneous graph neural network. Brief Bioinform 2023; 24:6960978. [PMID: 36572658 DOI: 10.1093/bib/bbac549] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2022] [Revised: 11/03/2022] [Accepted: 11/11/2022] [Indexed: 12/28/2022] Open
Abstract
Emerging evidence has proved that circular RNAs (circRNAs) are implicated in pathogenic processes. They are regarded as promising biomarkers for diagnosis due to covalently closed loop structures. As opposed to traditional experiments, computational approaches can identify circRNA-disease associations at a lower cost. Aggregating multi-source pathogenesis data helps to alleviate data sparsity and infer potential associations at the system level. The majority of computational approaches construct a homologous network using multi-source data, but they lose the heterogeneity of the data. Effective methods that use the features of multi-source data are considered as a matter of urgency. In this paper, we propose a model (CDHGNN) based on edge-weighted graph attention and heterogeneous graph neural networks for potential circRNA-disease association prediction. The circRNA network, micro RNA network, disease network and heterogeneous network are constructed based on multi-source data. To reflect association probabilities between nodes, an edge-weighted graph attention network model is designed for node features. To assign attention weights to different types of edges and learn contextual meta-path, CDHGNN infers potential circRNA-disease association based on heterogeneous neural networks. CDHGNN outperforms state-of-the-art algorithms in terms of accuracy. Edge-weighted graph attention networks and heterogeneous graph networks have both improved performance significantly. Furthermore, case studies suggest that CDHGNN is capable of identifying specific molecular associations and investigating biomolecular regulatory relationships in pathogenesis. The code of CDHGNN is freely available at https://github.com/BioinformaticsCSU/CDHGNN.
Collapse
Affiliation(s)
- Chengqian Lu
- School of Computer Science and Engineering, Central South University, Changsha, 410083, Hunan, China.,Hunan Provincial Key Lab on Bioinformatics, Central South University, Changsha, 410083, Hunan, China.,School of Computer Science, Xiangtan University, Xiangtan, 411105, Hunan, China
| | - Lishen Zhang
- School of Computer Science and Engineering, Central South University, Changsha, 410083, Hunan, China.,Hunan Provincial Key Lab on Bioinformatics, Central South University, Changsha, 410083, Hunan, China
| | - Min Zeng
- School of Computer Science and Engineering, Central South University, Changsha, 410083, Hunan, China.,Hunan Provincial Key Lab on Bioinformatics, Central South University, Changsha, 410083, Hunan, China
| | - Wei Lan
- School of Computer, Electronic and Information, Guangxi University, Nanning, 530004, Guangxi, China
| | - Guihua Duan
- School of Computer Science and Engineering, Central South University, Changsha, 410083, Hunan, China.,Hunan Provincial Key Lab on Bioinformatics, Central South University, Changsha, 410083, Hunan, China
| | - Jianxin Wang
- School of Computer Science and Engineering, Central South University, Changsha, 410083, Hunan, China.,Hunan Provincial Key Lab on Bioinformatics, Central South University, Changsha, 410083, Hunan, China
| |
Collapse
|
43
|
Zhang Z, Xu J, Wu Y, Liu N, Wang Y, Liang Y. CapsNet-LDA: predicting lncRNA-disease associations using attention mechanism and capsule network based on multi-view data. Brief Bioinform 2023; 24:6889447. [PMID: 36511221 DOI: 10.1093/bib/bbac531] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2022] [Revised: 10/25/2022] [Accepted: 11/07/2022] [Indexed: 12/15/2022] Open
Abstract
Cumulative studies have shown that many long non-coding RNAs (lncRNAs) are crucial in a number of diseases. Predicting potential lncRNA-disease associations (LDAs) can facilitate disease prevention, diagnosis and treatment. Therefore, it is vital to develop practical computational methods for LDA prediction. In this study, we propose a novel predictor named capsule network (CapsNet)-LDA for LDA prediction. CapsNet-LDA first uses a stacked autoencoder for acquiring the informative low-dimensional representations of the lncRNA-disease pairs under multiple views, then the attention mechanism is leveraged to implement an adaptive allocation of importance weights to them, and they are subsequently processed using a CapsNet-based architecture for predicting LDAs. Different from the conventional convolutional neural networks (CNNs) that have some restrictions with the usage of scalar neurons and pooling operations. the CapsNets use vector neurons instead of scalar neurons that have better robustness for the complex combination of features and they use dynamic routing processes for updating parameters. CapsNet-LDA is superior to other five state-of-the-art models on four benchmark datasets, four perturbed datasets and an independent test set in the comparison experiments, demonstrating that CapsNet-LDA has excellent performance and robustness against perturbation, as well as good generalization ability. The ablation studies verify the effectiveness of some modules of CapsNet-LDA. Moreover, the ability of multi-view data to improve performance is proven. Case studies further indicate that CapsNet-LDA can accurately predict novel LDAs for specific diseases.
Collapse
Affiliation(s)
- Zequn Zhang
- College of Computer and Information Engineering, Jiangxi Agricultural University, Nanchang, 310045 Jiangxi, China
| | - Junlin Xu
- College of Information Science and Engineering, Hunan University, Changsha 410082, Hunan, China
| | - Yanan Wu
- College of Computer and Information Engineering, Jiangxi Agricultural University, Nanchang, 310045 Jiangxi, China
| | - Niannian Liu
- College of Computer and Information Engineering, Jiangxi Agricultural University, Nanchang, 310045 Jiangxi, China
| | - Yinglong Wang
- College of Computer and Information Engineering, Jiangxi Agricultural University, Nanchang, 310045 Jiangxi, China
| | - Ying Liang
- College of Computer and Information Engineering, Jiangxi Agricultural University, Nanchang, 310045 Jiangxi, China
| |
Collapse
|
44
|
Zhang P, Sun W, Wei D, Li G, Xu J, You Z, Zhao B, Li L. PDA-PRGCN: identification of Piwi-interacting RNA-disease associations through subgraph projection and residual scaling-based feature augmentation. BMC Bioinformatics 2023; 24:18. [PMID: 36650439 PMCID: PMC9843905 DOI: 10.1186/s12859-022-05073-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Accepted: 05/10/2022] [Indexed: 01/19/2023] Open
Abstract
BACKGROUND Emerging evidences show that Piwi-interacting RNAs (piRNAs) play a pivotal role in numerous complex human diseases. Identifying potential piRNA-disease associations (PDAs) is crucial for understanding disease pathogenesis at molecular level. Compared to the biological wet experiments, the computational methods provide a cost-effective strategy. However, few computational methods have been developed so far. RESULTS Here, we proposed an end-to-end model, referred to as PDA-PRGCN (PDA prediction using subgraph Projection and Residual scaling-based feature augmentation through Graph Convolutional Network). Specifically, starting with the known piRNA-disease associations represented as a graph, we applied subgraph projection to construct piRNA-piRNA and disease-disease subgraphs for the first time, followed by a residual scaling-based feature augmentation algorithm for node initial representation. Then, we adopted graph convolutional network (GCN) to learn and identify potential PDAs as a link prediction task on the constructed heterogeneous graph. Comprehensive experiments, including the performance comparison of individual components in PDA-PRGCN, indicated the significant improvement of integrating subgraph projection, node feature augmentation and dual-loss mechanism into GCN for PDA prediction. Compared with state-of-the-art approaches, PDA-PRGCN gave more accurate and robust predictions. Finally, the case studies further corroborated that PDA-PRGCN can reliably detect PDAs. CONCLUSION PDA-PRGCN provides a powerful method for PDA prediction, which can also serve as a screening tool for studies of complex diseases.
Collapse
Affiliation(s)
- Ping Zhang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, China
| | - Weicheng Sun
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, China
| | - Dengguo Wei
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, China
- Shenzhen Institute of Nutrition and Health, Huazhong Agricultural University, Shenzhen, 518000, China
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518000, China
| | - Guodong Li
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, China
| | - Jinsheng Xu
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, China
| | - Zhuhong You
- School of Computer Science, Northwestern Polytechnical University, Xi'an, 710129, China
| | - Bowei Zhao
- The Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi, 830011, China
| | - Li Li
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, China.
- Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, People's Republic of China.
| |
Collapse
|
45
|
Hajibabaie F, Abedpoor N, Taghian F, Safavi K. A Cocktail of Polyherbal Bioactive Compounds and Regular Mobility Training as Senolytic Approaches in Age-dependent Alzheimer's: the In Silico Analysis, Lifestyle Intervention in Old Age. J Mol Neurosci 2023; 73:171-184. [PMID: 36631703 DOI: 10.1007/s12031-022-02086-8] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Accepted: 11/15/2022] [Indexed: 01/13/2023]
Abstract
Alzheimer's is a principal concern globally. Machine learning is a valuable tool to determine protective and diagnostic approaches for the elderly. We analyzed microarray datasets of Alzheimer's cases based on artificial intelligence by R statistical software. This study provided a screened pool of ncRNAs and coding RNAs related to Alzheimer's development. We designed hub genes as cut points in networks and predicted potential microRNAs and LncRNA to regulate protein networks in aging and Alzheimer's through in silico algorithms. Notably, we collected effective traditional herbal medicines. A list of bioactive compounds prepared including capsaicin, piperine, crocetin, safranal, saffron oil, coumarin, thujone, rosmarinic acid, sabinene, thymoquinone, ascorbic acid, vitamin E, cyanidin, rhaponticin, isovitexin, coumarin, nobiletin, evodiamine, gingerol, curcumin, quercetin, fisetin, and allicin as an effective fusion that potentially modulates hub proteins and molecular signaling pathways based on pharmacophore model screening and chemoinformatics survey. We identified profiles of 21 mRNAs, 272 microRNAs, and eight LncRNA in Alzheimer's based on prediction algorithms. We suggested a fusion of senolytic herbal ligands as an alternative therapy and preventive formulation in dementia. Also, we provided ncRNAs expression status as novel monitoring strategies in Alzheimer's and new cut-point proteins as novel therapeutic approaches. Synchronizing fusion drugs and lifestyle could reverse Alzheimer's hallmarks to amelioration via an offset of the signaling pathways, leading to increased life quality in the elderly.
Collapse
Affiliation(s)
- Fatemeh Hajibabaie
- Department of Biology, Shahrekord Branch, Islamic Azad University, Shahrekord, Iran
- Department of Physiology, Medicinal Plants Research Center, Isfahan (Khorasgan) Branch, Islamic Azad University, Isfahan, Iran
| | - Navid Abedpoor
- Department of Physiology, Medicinal Plants Research Center, Isfahan (Khorasgan) Branch, Islamic Azad University, Isfahan, Iran.
- Department of Sports Physiology, Faculty of Sports Sciences, Isfahan (Khorasgan) Branch, Islamic Azad University, Isfahan, Iran.
| | - Farzaneh Taghian
- Department of Sports Physiology, Faculty of Sports Sciences, Isfahan (Khorasgan) Branch, Islamic Azad University, Isfahan, Iran.
| | - Kamran Safavi
- Department of Plant Biotechnology, Medicinal Plants Research Centre, Isfahan (Khorasgan) Branch, Islamic Azad University, Isfahan, Iran
| |
Collapse
|
46
|
Qu J, Cheng XL. Editorial: Computational approaches for non-coding RNA prediction studies. Front Mol Biosci 2022; 9:1091918. [PMID: 36582204 PMCID: PMC9793085 DOI: 10.3389/fmolb.2022.1091918] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 11/29/2022] [Indexed: 12/14/2022] Open
|
47
|
Garg P, Jamal F, Srivastava P. Deciphering the role of precursor miR-12136 and miR-8485 in the progression of intellectual disability (ID). IBRO Neurosci Rep 2022; 13:393-401. [PMID: 36345471 PMCID: PMC9636553 DOI: 10.1016/j.ibneur.2022.10.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Accepted: 10/15/2022] [Indexed: 11/06/2022] Open
Abstract
The short, non-coding RNAs known as miRNA modulate the expression of human protein-coding genes. About 90 % of genes in humans are controlled by the expression of miRNA. The dysfunction of these miRNA target genes leads to many human diseases, including neurodevelopmental disorders as well. Intellectual disability (ID) is a neurodevelopmental disorder that is characterized by adaptive behavior and intellectual functioning which includes logical reasoning, ability in learning, practical intelligence, and verbal skills. Identification of miRNA involved in ID and their associated target genes can help in the identification of diagnostic biomarkers related to ID at a very early age. The present study is an attempt to identify miRNA and their associated target genes that play an important role in the development of intellectual disability patients through the meta-analysis of available transcriptome data. A total of 6 transcriptomic studies were retrieved from NCBI and were subjected to quality check and trimming before alignment. The normalization and identification of differentially expressed miRNA were carried out using the EdgeR package of R studio. Further, the gene targets of downregulated miRNA were identified using miRDB. The system biology approaches were also applied to the study to identify the hub target genes and the diseases associated with main miRNAs.
Collapse
Affiliation(s)
- Prekshi Garg
- Amity Institute of Biotechnology, Amity University Uttar Pradesh, Lucknow Campus, 226028, India
| | - Farrukh Jamal
- Department of Biochemistry, Dr. Rammanohar Lohia Avadh University, Ayodhya 224001, U.P., India
| | - Prachi Srivastava
- Amity Institute of Biotechnology, Amity University Uttar Pradesh, Lucknow Campus, 226028, India
| |
Collapse
|
48
|
Zhang W, Liu B. iSnoDi-LSGT: identifying snoRNA-disease associations based on local similarity constraints and global topological constraints. RNA (NEW YORK, N.Y.) 2022; 28:1558-1567. [PMID: 36192132 PMCID: PMC9670808 DOI: 10.1261/rna.079325.122] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Accepted: 09/26/2022] [Indexed: 06/16/2023]
Abstract
Growing evidence proves that small nucleolar RNAs (snoRNAs) have important functions in various biological processes, the malfunction of which leads to the emergence and development of complex diseases. However, identifying snoRNA-disease associations is an ongoing challenging task due to the considerable time- and money-consuming biological experiments. Therefore, it is urgent to design efficient and economical methods for the identification of snoRNA-disease associations. In this regard, we propose a computational method named iSnoDi-LSGT, which utilizes snoRNA sequence similarity and disease similarity as local similarity constraints. The iSnoDi-LSGT predictor further employs network embedding technology to extract topological features of snoRNAs and diseases, based on which snoRNA topological similarity and disease topological similarity are calculated as global topological constraints. To the best of our knowledge, the iSnoDi-LSGT is the first computational method for snoRNA-disease association identification. The experimental results indicate that the iSnoDi-LSGT predictor can effectively predict unknown snoRNA-disease associations. The web server of the iSnoDi-LSGT predictor is freely available at http://bliulab.net/iSnoDi-LSGT.
Collapse
Affiliation(s)
- Wenxiang Zhang
- School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China
| | - Bin Liu
- School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China
- Advanced Research Institute of Multidisciplinary Science, Beijing Institute of Technology, Beijing 100081, China
| |
Collapse
|
49
|
Chen Y, Wang J, Wang C, Liu M, Zou Q. Deep learning models for disease-associated circRNA prediction: a review. Brief Bioinform 2022; 23:6696465. [PMID: 36130259 DOI: 10.1093/bib/bbac364] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2022] [Revised: 07/30/2022] [Accepted: 08/03/2022] [Indexed: 12/14/2022] Open
Abstract
Emerging evidence indicates that circular RNAs (circRNAs) can provide new insights and potential therapeutic targets for disease diagnosis and treatment. However, traditional biological experiments are expensive and time-consuming. Recently, deep learning with a more powerful ability for representation learning enables it to be a promising technology for predicting disease-associated circRNAs. In this review, we mainly introduce the most popular databases related to circRNA, and summarize three types of deep learning-based circRNA-disease associations prediction methods: feature-generation-based, type-discrimination and hybrid-based methods. We further evaluate seven representative models on benchmark with ground truth for both balance and imbalance classification tasks. In addition, we discuss the advantages and limitations of each type of method and highlight suggested applications for future research.
Collapse
Affiliation(s)
- Yaojia Chen
- College of Electronics and Information Engineering Guangdong Ocean University, Zhanjiang, China and the Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China
| | - Jiacheng Wang
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China
| | - Chuyu Wang
- Faculty of Computing, Harbin Institute of Technology, Harbin, China
| | - Mingxin Liu
- College of Electronics and Information Engineering, Guangdong Ocean University, Zhanjiang, China
| | - Quan Zou
- University of Electronic Science and Technology of China, China
| |
Collapse
|
50
|
Yang Y, Wang D, Miao YR, Wu X, Luo H, Cao W, Yang W, Yang J, Guo AY, Gong J. lncRNASNP v3: an updated database for functional variants in long non-coding RNAs. Nucleic Acids Res 2022; 51:D192-D198. [PMID: 36350671 PMCID: PMC9825536 DOI: 10.1093/nar/gkac981] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Revised: 10/12/2022] [Accepted: 11/03/2022] [Indexed: 11/11/2022] Open
Abstract
Long non-coding RNAs (lncRNAs) act as versatile regulators of many biological processes and play vital roles in various diseases. lncRNASNP is dedicated to providing a comprehensive repository of single nucleotide polymorphisms (SNPs) and somatic mutations in lncRNAs and their impacts on lncRNA structure and function. Since the last release in 2018, there has been a huge increase in the number of variants and lncRNAs. Thus, we updated the lncRNASNP to version 3 by expanding the species to eight eukaryotic species (human, chimpanzee, pig, mouse, rat, chicken, zebrafish, and fruitfly), updating the data and adding several new features. SNPs in lncRNASNP have increased from 11 181 387 to 67 513 785. The human mutations have increased from 1 174 768 to 2 387 685, including 1 031 639 TCGA mutations and 1 356 046 CosmicNCVs. Compared with the last release, updated and new features in lncRNASNP v3 include (i) SNPs in lncRNAs and their impacts on lncRNAs for eight species, (ii) SNP effects on miRNA-lncRNA interactions for eight species, (iii) lncRNA expression profiles for six species, (iv) disease & GWAS-associated lncRNAs and variants, (v) experimental & predicted lncRNAs and drug target associations and (vi) SNP effects on lncRNA expression (eQTL) across tumor & normal tissues. The lncRNASNP v3 is freely available at http://gong_lab.hzau.edu.cn/lncRNASNP3/.
Collapse
Affiliation(s)
| | | | - Ya-Ru Miao
- Hubei Bioinformatics and Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Xiaohong Wu
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Haohui Luo
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Wen Cao
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Wenqian Yang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Jianye Yang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - An-Yuan Guo
- Correspondence may also be addressed to An-Yuan Guo. Tel: +86 27 8779 3177; Fax: +86 27 8779 3177;
| | - Jing Gong
- To whom correspondence should be addressed. Tel: +86 27 8728 5085; Fax: +86 27 8728 5085;
| |
Collapse
|