1
|
Zhang C, Zhu X, Peterson N, Wang J, Wan S. A Comprehensive Review on RNA Subcellular Localization Prediction. ARXIV 2025:arXiv:2504.17162v1. [PMID: 40313658 PMCID: PMC12045386] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 05/03/2025]
Abstract
The subcellular localization of RNAs, including long non-coding RNAs (lncRNAs), messenger RNAs (mRNAs), microRNAs (miRNAs) and other smaller RNAs, plays a critical role in determining their biological functions. For instance, lncRNAs are predominantly associated with chromatin and act as regulators of gene transcription and chromatin structure, while mRNAs are distributed across the nucleus and cytoplasm, facilitating the transport of genetic information for protein synthesis. Understanding RNA localization sheds light on processes like gene expression regulation with spatial and temporal precision. However, traditional wet lab methods for determining RNA localization, such as in situ hybridization, are often time-consuming, resource-demanding, and costly. To overcome these challenges, computational methods leveraging artificial intelligence (AI) and machine learning (ML) have emerged as powerful alternatives, enabling large-scale prediction of RNA subcellular localization. This paper provides a comprehensive review of the latest advancements in AI-based approaches for RNA subcellular localization prediction, covering various RNA types and focusing on sequence-based, image-based, and hybrid methodologies that combine both data types. We highlight the potential of these methods to accelerate RNA research, uncover molecular pathways, and guide targeted disease treatments. Furthermore, we critically discuss the challenges in AI/ML approaches for RNA subcellular localization, such as data scarcity and lack of benchmarks, and opportunities to address them. This review aims to serve as a valuable resource for researchers seeking to develop innovative solutions in the field of RNA subcellular localization and beyond.
Collapse
Affiliation(s)
- Cece Zhang
- Department of Cell & Systems Biology, University of Toronto, ON, Canada
| | - Xuehuan Zhu
- School of Engineering, University of California, Los Angeles, CA, United States
| | - Nick Peterson
- Department of Genetics, Cell Biology and Anatomy, University of Nebraska Medical Center, Omaha, NE, United States
| | - Jieqiong Wang
- Department of Neurological Sciences, University of Nebraska Medical Center, Omaha, NE, United States
| | - Shibiao Wan
- Department of Genetics, Cell Biology and Anatomy, University of Nebraska Medical Center, Omaha, NE, United States
| |
Collapse
|
2
|
Bereczki Z, Benczik B, Balogh OM, Marton S, Puhl E, Pétervári M, Váczy-Földi M, Papp ZT, Makkos A, Glass K, Locquet F, Euler G, Schulz R, Ferdinandy P, Ágg B. Mitigating off-target effects of small RNAs: conventional approaches, network theory and artificial intelligence. Br J Pharmacol 2025; 182:340-379. [PMID: 39293936 DOI: 10.1111/bph.17302] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Revised: 05/07/2024] [Accepted: 06/17/2024] [Indexed: 09/20/2024] Open
Abstract
Three types of highly promising small RNA therapeutics, namely, small interfering RNAs (siRNAs), microRNAs (miRNAs) and the RNA subtype of antisense oligonucleotides (ASOs), offer advantages over small-molecule drugs. These small RNAs can target any gene product, opening up new avenues of effective and safe therapeutic approaches for a wide range of diseases. In preclinical research, synthetic small RNAs play an essential role in the investigation of physiological and pathological pathways as silencers of specific genes, facilitating discovery and validation of drug targets in different conditions. Off-target effects of small RNAs, however, could make it difficult to interpret experimental results in the preclinical phase and may contribute to adverse events of small RNA therapeutics. Out of the two major types of off-target effects we focused on the hybridization-dependent, especially on the miRNA-like off-target effects. Our main aim was to discuss several approaches, including sequence design, chemical modifications and target prediction, to reduce hybridization-dependent off-target effects that should be considered even at the early development phase of small RNA therapy. Because there is no standard way of predicting hybridization-dependent off-target effects, this review provides an overview of all major state-of-the-art computational methods and proposes new approaches, such as the possible inclusion of network theory and artificial intelligence (AI) in the prediction workflows. Case studies and a concise survey of experimental methods for validating in silico predictions are also presented. These methods could contribute to interpret experimental results, to minimize off-target effects and hopefully to avoid off-target-related adverse events of small RNA therapeutics. LINKED ARTICLES: This article is part of a themed issue Non-coding RNA Therapeutics. To view the other articles in this section visit http://onlinelibrary.wiley.com/doi/10.1111/bph.v182.2/issuetoc.
Collapse
Affiliation(s)
- Zoltán Bereczki
- Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Center for Pharmacology and Drug Research & Development, Semmelweis University, Budapest, Hungary
- HUN-REN-SU System Pharmacology Research Group, Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
| | - Bettina Benczik
- Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Center for Pharmacology and Drug Research & Development, Semmelweis University, Budapest, Hungary
- HUN-REN-SU System Pharmacology Research Group, Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Pharmahungary Group, Szeged, Hungary
| | - Olivér M Balogh
- Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Center for Pharmacology and Drug Research & Development, Semmelweis University, Budapest, Hungary
- HUN-REN-SU System Pharmacology Research Group, Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
| | - Szandra Marton
- Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Center for Pharmacology and Drug Research & Development, Semmelweis University, Budapest, Hungary
| | - Eszter Puhl
- Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Center for Pharmacology and Drug Research & Development, Semmelweis University, Budapest, Hungary
| | - Mátyás Pétervári
- Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Center for Pharmacology and Drug Research & Development, Semmelweis University, Budapest, Hungary
- HUN-REN-SU System Pharmacology Research Group, Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Sanovigado Kft, Budapest, Hungary
| | - Máté Váczy-Földi
- Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Center for Pharmacology and Drug Research & Development, Semmelweis University, Budapest, Hungary
- HUN-REN-SU System Pharmacology Research Group, Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
| | - Zsolt Tamás Papp
- Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Center for Pharmacology and Drug Research & Development, Semmelweis University, Budapest, Hungary
- HUN-REN-SU System Pharmacology Research Group, Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
| | - András Makkos
- Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Center for Pharmacology and Drug Research & Development, Semmelweis University, Budapest, Hungary
- HUN-REN-SU System Pharmacology Research Group, Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Pharmahungary Group, Szeged, Hungary
| | - Kimberly Glass
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, USA
| | - Fabian Locquet
- Physiologisches Institut, Justus-Liebig-Universität Gießen, Giessen, Germany
| | - Gerhild Euler
- Physiologisches Institut, Justus-Liebig-Universität Gießen, Giessen, Germany
| | - Rainer Schulz
- Physiologisches Institut, Justus-Liebig-Universität Gießen, Giessen, Germany
| | - Péter Ferdinandy
- Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Center for Pharmacology and Drug Research & Development, Semmelweis University, Budapest, Hungary
- HUN-REN-SU System Pharmacology Research Group, Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Pharmahungary Group, Szeged, Hungary
| | - Bence Ágg
- Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Center for Pharmacology and Drug Research & Development, Semmelweis University, Budapest, Hungary
- HUN-REN-SU System Pharmacology Research Group, Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Pharmahungary Group, Szeged, Hungary
| |
Collapse
|
3
|
Shelash SI, Shabeeb IA, Ahmad I, Saleem HM, Bansal P, Kumar A, Deorari M, Kareem AH, Al-Ani AM, Abosaoda MK. lncRNAs'p potential roles in the pathogenesis of cancer via interacting with signaling pathways; special focus on lncRNA-mediated signaling dysregulation in lung cancer. Med Oncol 2024; 41:310. [PMID: 39516331 DOI: 10.1007/s12032-024-02536-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2024] [Accepted: 10/07/2024] [Indexed: 11/16/2024]
Abstract
Lung cancer ranks among the most lethal types of cancer globally, with a high occurrence and fatality rate. The spread of cancer to other parts of the body, known as metastasis, is the primary cause of treatment failure and death in lung cancer cases. Current approaches for treating advanced lung cancer typically involve a combination of chemotherapy and targeted therapy. However, the majority of patients ultimately develop resistance to these treatments, leading to a worsened prognosis. In recent years, cancer biology research has predominantly focused on the role of protein-encoding genes in cancer development. Long non-coding RNAs (lncRNAs) are transcripts over 200 nucleotides in length that do not encode proteins but are crucial RNA molecules involved in numerous biological functions. While many functions of lncRNAs remain unknown, some have been linked to human diseases, including cancer. Studies have demonstrated that lncRNAs interact with other large molecules in the cell, such as proteins, DNA, and RNA, influencing various critical aspects of cancer. LncRNAs play a significant role in regulating gene expression and have a crucial function in the transcriptional regulation of cancer cells. They mediate various biological and clinical processes such as invasion, metastasis, apoptosis, and cell proliferation. Dysregulation of lncRNAs has been found to impact the process of carcinogenesis through advanced technologies like RNA sequencing and microarrays. Collectively, these long non-coding RNAs hold promise as potential biomarkers and therapeutic targets for human cancers. In this segment, we provide a comprehensive summary of the literature on the characteristics and formation of lncRNAs, along with an overview of their current known roles in lung cancer.
Collapse
Affiliation(s)
- Sulieman Ibrahim Shelash
- Electronic Marketing and Social Media, Economic and Administrative Sciences Zarqa University, Zarqa, Jordan
- Research Follower, INTI International University, Negeri Sembilan, 71800, Nilai, Malaysia
| | | | - Irfan Ahmad
- Department of Clinical Laboratory Sciences, College of Applied Medical Science, King Khalid University, Abha, Saudi Arabia
| | - Hiba Muwafaq Saleem
- Department of Biology, College of Science, University Of Anbar, Ramadi, Iraq.
| | - Pooja Bansal
- Department of Biotechnology and Genetics, Jain (Deemed-to-Be) University, Bengaluru, 560069, Karnataka, India
- Department of Allied Healthcare and Sciences, Vivekananda Global University, Jaipur, Rajasthan, 303012, India
| | - Abhinav Kumar
- Department of Nuclear and Renewable Energy, Ural Federal University Named After the First President of Russia Boris Yeltsin, Ekaterinburg, 620002, Russia
- Department of Technical Sciences, Western Caspian University, Baku, Azerbaijan
- Department of Mechanical Engineering, Karpagam Academy of Higher Education, Coimbatore, 641021, India
| | - Mahamedha Deorari
- Uttaranchal Institute of Pharmaceutical Sciences, Uttaranchal University, Dehradun, India
| | | | | | - Munther Kadhim Abosaoda
- College of Pharmacy, The Islamic University, Najaf, Iraq
- College of Pharmacy, The Islamic University of Al Diwaniyah, Al Diwaniyah, Iraq
- College of Pharmacy, The Islamic University of Babylon, Al Diwaniyah, Iraq
| |
Collapse
|
4
|
Rajabi A, Saber A, Kluiver J, van den Berg A, Hosseinpourfeizi MA, Safaralizadeh R. NEAT1 and CHROMR lncRNAs: a promising diagnostic tool for diffuse large B-cell lymphoma especially in elderly patients. Biomark Med 2024; 18:685-693. [PMID: 39263799 PMCID: PMC11404575 DOI: 10.1080/17520363.2024.2389036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Accepted: 07/29/2024] [Indexed: 09/13/2024] Open
Abstract
Background: Long non-coding (lnc) RNAs have crucial regulatory roles in molecular pathways, and their dysregulation is associated with the pathogenesis of malignancies such as Diffuse large B-cell lymphoma (DLBCL). Therefore, we aimed to study the NEAT1 and CHROMR expression in DLBCL and explore their association with clinicopathological characteristics.Methods & materials: DLBCL and non-tumor lymph node specimens were obtained to assess the expression levels.Results: NEAT1 and CHROMR expressions were significantly increased in DLBCL, and were linked with the age of DLBCL patients (aged >60). NEAT1 and CHROMR overexpression may serve as moderate-to-good diagnostic biomarkers, with NEAT1 and CHROMR exhibiting area under the curve values of 0.781 and 0.831, respectively.
Collapse
MESH Headings
- Humans
- RNA, Long Noncoding/genetics
- RNA, Long Noncoding/metabolism
- Lymphoma, Large B-Cell, Diffuse/genetics
- Lymphoma, Large B-Cell, Diffuse/diagnosis
- Lymphoma, Large B-Cell, Diffuse/metabolism
- Lymphoma, Large B-Cell, Diffuse/pathology
- Male
- Aged
- Female
- Middle Aged
- Biomarkers, Tumor/genetics
- Biomarkers, Tumor/metabolism
- Adult
- Gene Expression Regulation, Neoplastic
- Aged, 80 and over
- Prognosis
- ROC Curve
Collapse
Affiliation(s)
- Ali Rajabi
- Department of Animal Biology, Faculty of Natural Sciences, University of Tabriz, Tabriz, 5166/15731, Iran
| | - Ali Saber
- Dr. Saber Medical Genetics Laboratory, Almas Complex, Namaz Blvd., Golsar, Rasht, Gilan, 4165685538, Iran
| | - Joost Kluiver
- Department of Pathology and Medical Biology, University of Groningen, University Medical Center Groningen, Groningen, 9700RB, The Netherlands
| | - Anke van den Berg
- Department of Pathology and Medical Biology, University of Groningen, University Medical Center Groningen, Groningen, 9700RB, The Netherlands
| | | | - Reza Safaralizadeh
- Department of Animal Biology, Faculty of Natural Sciences, University of Tabriz, Tabriz, 5166/15731, Iran
| |
Collapse
|
5
|
Saadh MJ, Rasulova I, Almoyad MAA, Kiasari BA, Ali RT, Rasheed T, Faisal A, Hussain F, Jawad MJ, Hani T, Sârbu I, Lakshmaiya N, Ciongradi CI. Recent progress and the emerging role of lncRNAs in cancer drug resistance; focusing on signaling pathways. Pathol Res Pract 2024; 253:154999. [PMID: 38118218 DOI: 10.1016/j.prp.2023.154999] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Revised: 11/23/2023] [Accepted: 11/27/2023] [Indexed: 12/22/2023]
Abstract
It is becoming more and more apparent that many of the genetic alterations associated with cancer are located in areas that do not encode proteins. lncRNAs are a class of RNAs that do not code for proteins but play a crucial role in maintaining cell function and regulating various cellular processes. By doing this, they have recently introduced what may be a brand-new and essential layer of biological control. These have more than 200 nucleotides and are linked to several diseases; as a result, they have become potential tools for therapeutic intervention. Emerging technologies suggest the presence of mutations on genomic loci that give rise to lncRNAs rather than proteins in a disease as complex as cancer. These lncRNAs play essential parts in gene regulation, which impacts several cellular homeostasis processes, including proliferation, survival, migration, and genomic stability. The leading cause of death in the world today is cancer. Delays in diagnosis and a lack of standard and efficient treatments are the leading causes of the high death rate. Clinically, surgery is frequently used successfully to remove cancers that have not spread, but it is less successful in treating metastatic cancer, which has a drastically lower chance of survival. Chemotherapeutic drugs are a typical therapy to treat the cancer that has spread to other organs. Drug resistance to chemotherapy, however, presents a significant challenge to achieving positive outcomes and is frequently the cause of treatment failure. A substantial barrier to progress in medical oncology is cancer drug resistance. Resistance can develop clinically either before or after cancer treatment. According to this study, lncRNAs influence drug resistance through several different methods. LncRNAs often impact drug resistance by controlling the expression of a few intermediary regulatory variables rather than by directly affecting drug resistance. Additionally, lncRNAs have a variety of roles in cancer medication resistance. Most lncRNAs induce drug resistance when overexpressed; however, other lncRNAs have inhibitory effects. This study provides an overview of the current understanding of lncRNAs, relevance to cancer, and potential therapeutic applications.
Collapse
Affiliation(s)
- Mohamed J Saadh
- Faculty of Pharmacy, Middle East University, Amman 11831, Jordan
| | - Irodakhon Rasulova
- School of Humanities, Natural & Social Sciences, New Uzbekistan University, 54 Mustaqillik Ave., Tashkent 100007, Uzbekistan; Department of Public Health, Samarkand State Medical University, Amir Temur Street 18, Samarkand, Uzbekistan
| | - Muhammad Ali Abdullah Almoyad
- Department of Basic Medical Sciences, College of Applied Medical Sciences, King Khalid University, P.O. Box 4536, 47 Abha Mushait, 61412, Saudi Arabia
| | - Bahman Abedi Kiasari
- Microbiology & Immunology Group, Faculty of Veterinary Medicine, University of Tehran, Tehran, Iran
| | - Ronak Taher Ali
- College of Medical Technology, Al-Kitab University, Kirkuk, Iraq
| | - Tariq Rasheed
- College of Science and Humanities, Al-Kharj, Prince Sattam Bin Abdulaziz University, Al-Kharj 11942, Saudi Arabia
| | - Ahmed Faisal
- Department of Pharmacy, Al-Noor University College, Nineveh, Iraq
| | - Farah Hussain
- Medical Technical College, Al-Farahidi University, Iraq
| | | | - Thamer Hani
- Dentistry Department, Al-Turath University College, Baghdad, Iraq
| | - Ioan Sârbu
- 2nd Department of Surgery-Pediatric Surgery and Orthopedics, "Grigore T. Popa" University of Medicine and Pharmacy, 700115 Iași, Romania.
| | - Natrayan Lakshmaiya
- Department of Mechanical Engineering, Saveetha School of Engineering, SIMATS, Chennai, Tamil Nadu, India
| | - Carmen Iulia Ciongradi
- 2nd Department of Surgery-Pediatric Surgery and Orthopedics, "Grigore T. Popa" University of Medicine and Pharmacy, 700115 Iași, Romania.
| |
Collapse
|
6
|
Klapproth C, Zötzsche S, Kühnl F, Fallmann J, Stadler P, Findeiß S. Tailored machine learning models for functional RNA detection in genome-wide screens. NAR Genom Bioinform 2023; 5:lqad072. [PMID: 37608800 PMCID: PMC10440787 DOI: 10.1093/nargab/lqad072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Revised: 06/28/2023] [Accepted: 07/30/2023] [Indexed: 08/24/2023] Open
Abstract
The in silico prediction of non-coding and protein-coding genetic loci has received considerable attention in comparative genomics aiming in particular at the identification of properties of nucleotide sequences that are informative of their biological role in the cell. We present here a software framework for the alignment-based training, evaluation and application of machine learning models with user-defined parameters. Instead of focusing on the one-size-fits-all approach of pervasive in silico annotation pipelines, we offer a framework for the structured generation and evaluation of models based on arbitrary features and input data, focusing on stable and explainable results. Furthermore, we showcase the usage of our software package in a full-genome screen of Drosophila melanogaster and evaluate our results against the well-known but much less flexible program RNAz.
Collapse
Affiliation(s)
- Christopher Klapproth
- Leipzig University, Department of Computer Science and Interdisciplinary Center of Bioinformatics, Bioinformatics Group, Härtelstrasse 16-18, D-04107 Leipzig, Germany
- ScaDS.AI Leipzig (Center for Scalable Data Analytics and Artificial Intelligence), Humboldtstraße 25, D-04105 Leipzig, Germany
| | - Siegfried Zötzsche
- Leipzig University, Department of Computer Science and Interdisciplinary Center of Bioinformatics, Bioinformatics Group, Härtelstrasse 16-18, D-04107 Leipzig, Germany
| | - Felix Kühnl
- Leipzig University, Department of Computer Science and Interdisciplinary Center of Bioinformatics, Bioinformatics Group, Härtelstrasse 16-18, D-04107 Leipzig, Germany
| | - Jörg Fallmann
- Leipzig University, Department of Computer Science and Interdisciplinary Center of Bioinformatics, Bioinformatics Group, Härtelstrasse 16-18, D-04107 Leipzig, Germany
| | - Peter F Stadler
- Leipzig University, Department of Computer Science and Interdisciplinary Center of Bioinformatics, Bioinformatics Group, Härtelstrasse 16-18, D-04107 Leipzig, Germany
- Max Planck Institute for Mathematics in the Science, Inselstraße 22, D-04103 Leipzig, Germany
- University of Vienna, Institute for Theoretical Chemistry, Währingerstraße 17, A-1090 Vienna, Austria
- Santa Fe Institute, 1399 Hyde Park Rd., Santa Fe NM 97501, USA
- Universidad Nacional de Colombia, Facultad de Ciencias, Bogotá, D.C., Colombia
| | - Sven Findeiß
- Leipzig University, Department of Computer Science and Interdisciplinary Center of Bioinformatics, Bioinformatics Group, Härtelstrasse 16-18, D-04107 Leipzig, Germany
| |
Collapse
|
7
|
Zheng K, Wu X, Xue X, Li W, Wang Z, Chen J, Zhang Y, Qiao F, Zhao H, Zhang F, Han S. Transcriptome Screening of Long Noncoding RNAs and Their Target Protein-Coding Genes Unmasks a Dynamic Portrait of Seed Coat Coloration Associated with Anthocyanins in Tibetan Hulless Barley. Int J Mol Sci 2023; 24:10587. [PMID: 37445765 PMCID: PMC10341697 DOI: 10.3390/ijms241310587] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2023] [Revised: 06/21/2023] [Accepted: 06/22/2023] [Indexed: 07/15/2023] Open
Abstract
Many plants have the capability to accumulate anthocyanins for coloration, and anthocyanins are advantageous to human health. In the case of hulless barley (Hordeum vulgare L. var. nudum), investigation into the mechanism of anthocyanin formation is limited to the level of protein-coding genes (PCGs). Here, we conducted a comprehensive bioinformatics analysis to identify a total of 9414 long noncoding RNAs (lncRNAs) in the seed coats of purple and white hulless barley along a developmental gradient. Transcriptome-wide profiles of lncRNAs documented several properties, including GC content fluctuation, uneven length, a diverse range of exon numbers, and a wide variety of transcript classifications. We found that certain lncRNAs in hulless barley possess detectable sequence conservation with Hordeum vulgare and other monocots. Furthermore, both differentially expressed lncRNAs (DElncRNAs) and PCGs (DEPCGs) were concentrated in the later seed development stages. On the one hand, DElncRNAs could potentially cis-regulate DEPCGs associated with multiple metabolic pathways, including flavonoid and anthocyanin biosynthesis in the late milk and soft dough stages. On the other hand, there was an opportunity for trans-regulated lncRNAs in the color-forming module to affect seed coat color by upregulating PCGs in the anthocyanin pathway. In addition, the interweaving of hulless barley lncRNAs and diverse TFs may function in seed coat coloration. Notably, we depicted a dynamic portrait of the anthocyanin synthesis pathway containing hulless barley lncRNAs. Therefore, this work provides valuable gene resources and more insights into the molecular mechanisms underlying anthocyanin accumulation in hulless barley from the perspective of lncRNAs, which facilitate the development of molecular design breeding in crops.
Collapse
Affiliation(s)
- Kaifeng Zheng
- Beijing Key Laboratory of Gene Resources and Molecular Development, College of Life Sciences, Beijing Normal University, Beijing 100875, China; (K.Z.); (X.X.); (W.L.); (H.Z.)
| | - Xiaozhuo Wu
- College of Life Sciences, Qinghai Normal University, Xining 810008, China; (X.W.); (Z.W.); (J.C.); (Y.Z.); (F.Q.)
| | - Xiuhua Xue
- Beijing Key Laboratory of Gene Resources and Molecular Development, College of Life Sciences, Beijing Normal University, Beijing 100875, China; (K.Z.); (X.X.); (W.L.); (H.Z.)
| | - Wanjie Li
- Beijing Key Laboratory of Gene Resources and Molecular Development, College of Life Sciences, Beijing Normal University, Beijing 100875, China; (K.Z.); (X.X.); (W.L.); (H.Z.)
| | - Zitao Wang
- College of Life Sciences, Qinghai Normal University, Xining 810008, China; (X.W.); (Z.W.); (J.C.); (Y.Z.); (F.Q.)
| | - Jinyuan Chen
- College of Life Sciences, Qinghai Normal University, Xining 810008, China; (X.W.); (Z.W.); (J.C.); (Y.Z.); (F.Q.)
| | - Yanfen Zhang
- College of Life Sciences, Qinghai Normal University, Xining 810008, China; (X.W.); (Z.W.); (J.C.); (Y.Z.); (F.Q.)
| | - Feng Qiao
- College of Life Sciences, Qinghai Normal University, Xining 810008, China; (X.W.); (Z.W.); (J.C.); (Y.Z.); (F.Q.)
| | - Heping Zhao
- Beijing Key Laboratory of Gene Resources and Molecular Development, College of Life Sciences, Beijing Normal University, Beijing 100875, China; (K.Z.); (X.X.); (W.L.); (H.Z.)
| | - Fanfan Zhang
- Beijing Key Laboratory of Gene Resources and Molecular Development, College of Life Sciences, Beijing Normal University, Beijing 100875, China; (K.Z.); (X.X.); (W.L.); (H.Z.)
| | - Shengcheng Han
- Beijing Key Laboratory of Gene Resources and Molecular Development, College of Life Sciences, Beijing Normal University, Beijing 100875, China; (K.Z.); (X.X.); (W.L.); (H.Z.)
- Academy of Plateau Science and Sustainability of the People’s Government of Qinghai Province & Beijing Normal University, Qinghai Normal University, Xining 810008, China
| |
Collapse
|
8
|
Distefano R, Ilieva M, Madsen JH, Ishii H, Aikawa M, Rennie S, Uchida S. T2DB: A Web Database for Long Non-Coding RNA Genes in Type II Diabetes. Noncoding RNA 2023; 9:30. [PMID: 37218990 PMCID: PMC10204529 DOI: 10.3390/ncrna9030030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Revised: 05/01/2023] [Accepted: 05/04/2023] [Indexed: 05/24/2023] Open
Abstract
Type II diabetes (T2D) is a growing health problem worldwide due to increased levels of obesity and can lead to other life-threatening diseases, such as cardiovascular and kidney diseases. As the number of individuals diagnosed with T2D rises, there is an urgent need to understand the pathogenesis of the disease in order to prevent further harm to the body caused by elevated blood glucose levels. Recent advances in long non-coding RNA (lncRNA) research may provide insights into the pathogenesis of T2D. Although lncRNAs can be readily detected in RNA sequencing (RNA-seq) data, most published datasets of T2D patients compared to healthy donors focus only on protein-coding genes, leaving lncRNAs to be undiscovered and understudied. To address this knowledge gap, we performed a secondary analysis of published RNA-seq data of T2D patients and of patients with related health complications to systematically analyze the expression changes of lncRNA genes in relation to the protein-coding genes. Since immune cells play important roles in T2D, we conducted loss-of-function experiments to provide functional data on the T2D-related lncRNA USP30-AS1, using an in vitro model of pro-inflammatory macrophage activation. To facilitate lncRNA research in T2D, we developed a web application, T2DB, to provide a one-stop-shop for expression profiling of protein-coding and lncRNA genes in T2D patients compared to healthy donors or subjects without T2D.
Collapse
Affiliation(s)
- Rebecca Distefano
- Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, DK-2200 Copenhagen, Denmark;
| | - Mirolyuba Ilieva
- Center for RNA Medicine, Department of Clinical Medicine, Aalborg University, DK-2450 Copenhagen, Denmark; (M.I.); (J.H.M.)
| | - Jens Hedelund Madsen
- Center for RNA Medicine, Department of Clinical Medicine, Aalborg University, DK-2450 Copenhagen, Denmark; (M.I.); (J.H.M.)
| | - Hideshi Ishii
- Center of Medical Innovation and Translational Research, Department of Medical Data Science, Graduate School of Medicine, Osaka University, Suita 565-0871, Japan;
| | - Masanori Aikawa
- Center for Interdisciplinary Cardiovascular Sciences, Cardiovascular Division, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA 02115, USA;
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA 02115, USA
- Center for Excellence in Vascular Biology, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA 02115, USA
| | - Sarah Rennie
- Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, DK-2200 Copenhagen, Denmark;
| | - Shizuka Uchida
- Center for RNA Medicine, Department of Clinical Medicine, Aalborg University, DK-2450 Copenhagen, Denmark; (M.I.); (J.H.M.)
| |
Collapse
|
9
|
Pradhan UK, Meher PK, Naha S, Rao AR, Gupta A. ASLncR: a novel computational tool for prediction of abiotic stress-responsive long non-coding RNAs in plants. Funct Integr Genomics 2023; 23:113. [PMID: 37000299 DOI: 10.1007/s10142-023-01040-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2022] [Revised: 03/23/2023] [Accepted: 03/24/2023] [Indexed: 04/01/2023]
Abstract
Abiotic stresses are detrimental to plant growth and development and have a major negative impact on crop yields. A growing body of evidence indicates that a large number of long non-coding RNAs (lncRNAs) are key to many abiotic stress responses. Thus, identifying abiotic stress-responsive lncRNAs is essential in crop breeding programs in order to develop crop cultivars resistant to abiotic stresses. In this study, we have developed the first machine learning-based computational model for predicting abiotic stress-responsive lncRNAs. The lncRNA sequences which were responsive and non-responsive to abiotic stresses served as the two classes of the dataset for binary classification using the machine learning algorithms. The training dataset was created using 263 stress-responsive and 263 non-stress-responsive sequences, whereas the independent test set consists of 101 sequences from both classes. As the machine learning model can adopt only the numeric data, the Kmer features ranging from sizes 1 to 6 were utilized to represent lncRNAs in numeric form. To select important features, four different feature selection strategies were utilized. Among the seven learning algorithms, the support vector machine (SVM) achieved the highest cross-validation accuracy with the selected feature sets. The observed 5-fold cross-validation accuracy, AU-ROC, and AU-PRC were found to be 68.84, 72.78, and 75.86%, respectively. Furthermore, the robustness of the developed model (SVM with the selected feature) was evaluated using an independent test dataset, where the overall accuracy, AU-ROC, and AU-PRC were found to be 76.23, 87.71, and 88.49%, respectively. The developed computational approach was also implemented in an online prediction tool ASLncR accessible at https://iasri-sg.icar.gov.in/aslncr/ . The proposed computational model and the developed prediction tool are believed to supplement the existing effort for the identification of abiotic stress-responsive lncRNAs in plants.
Collapse
Affiliation(s)
- Upendra Kumar Pradhan
- Division of Statistical Genetics, ICAR-Indian Agricultural Statistics Research Institute, PUSA, New Delhi, 110012, India
| | - Prabina Kumar Meher
- Division of Statistical Genetics, ICAR-Indian Agricultural Statistics Research Institute, PUSA, New Delhi, 110012, India.
| | - Sanchita Naha
- Division of Computer Applications, ICAR-Indian Agricultural Statistics Research Institute, PUSA, New Delhi, 110012, India
| | | | - Ajit Gupta
- Division of Statistical Genetics, ICAR-Indian Agricultural Statistics Research Institute, PUSA, New Delhi, 110012, India
| |
Collapse
|
10
|
Meng F, Ji Y, Chen X, Wang Y, Hua M. An integrative analysis of an lncRNA-mRNA competing endogenous RNA network to identify functional lncRNAs in uterine leiomyomas with RNA sequencing. Front Genet 2023; 13:1053845. [PMID: 36685910 PMCID: PMC9845257 DOI: 10.3389/fgene.2022.1053845] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Accepted: 11/18/2022] [Indexed: 01/06/2023] Open
Abstract
Objective: To explore the functions of mRNAs and lncRNAs in the occurrence of uterine leiomyomas (ULs) and further clarify the pathogenesis of UL by detecting the differential expression of mRNAs and lncRNAs in 10 cases of UL tissues and surrounding normal myometrial tissues by high-throughput RNA sequencing. Methods: The tissue samples of 10 patients who underwent hysterectomy for UL in Lianyungang Maternal and Child Health Hospital from January 2016 to December 2021 were collected. The differentially expressed mRNAs (DEmRNAs) and lncRNAs (DElncRNAs) were identified and further analyzed by Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis. The protein-protein interaction network (PPI) was constructed in Cytoscape software. Functional annotation of the nearby target cis-DEmRNAs of DElncRNAs was performed with the Database for Annotation, Visualization, and Integrated Discovery (DAVID) (https://david.ncifcrf.gov/). Meanwhile, the co-expression network of DElncRNA-DEmRNA was constructed in Cytoscape software. Results: A total of 553 DElncRNAs (283 upregulated DElncRNAs and 270 downregulated DElncRNAs) and 3,293 DEmRNAs (1,632 upregulated DEmRNAs and 1,661 downregulated DEmRNAs) were obtained. GO pathway enrichment analysis revealed that several important pathways were significantly enriched in UL such as blood vessel development, regulation of ion transport, and external encapsulating structure organization. In addition, cytokine-cytokine receptor interaction, neuroactive ligand-receptor interaction, and complement and coagulation cascades were significantly enriched in KEGG pathway enrichment analysis. A total of 409 DElncRNAs-nearby-targeted DEmRNA pairs were detected, which included 118 DElncRNAs and 136 DEmRNAs. Finally, we found that the top two DElncRNAs with the most nearby DEmRNAs were BISPR and AC012531.1. Conclusion: These results suggested that 3,293 DEmRNAs and 553 DElncRNAs were differentially expressed in UL tissue and normal myometrium tissue, which might be candidate-identified therapeutic and prognostic targets for UL and be considered as offering several possible mechanisms and pathogenesis of UL in the future.
Collapse
|
11
|
Wang K, Luo Q, Zhang Y, Xie X, Cheng W, Yao Q, Chen Y, Ren H, Li J, Pan Z. LINC01296 promotes proliferation of cutaneous malignant melanoma by regulating miR-324-3p/MAPK1 axis. Aging (Albany NY) 2022; 15:2877-2890. [PMID: 36462499 PMCID: PMC10188354 DOI: 10.18632/aging.204413] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Accepted: 11/21/2022] [Indexed: 12/03/2022]
Abstract
OBJECTIVE To investigate the functions and potential molecular mechanism of LINC01296 regarding the progression of cutaneous malignant melanoma (CMM) by the regulation of miR-324-3p/MAPK1 axis. METHODS The candidate differential lncRNAs of CMM were selected from GEPIA database, and quantitative real-time PCR (qRT-PCR) was utilized to assess the expression level of LINC01296 in human CMM tissues and cell lines. Cell proliferation assay, Colony formation assay, Ethynyl-2'-deoxyuridine (EDU) assay in vitro and tumorigenicity assays in nude mice in vivo were performed to examine the functions of LINC01296. Bioinformatics analysis, luciferase reporter assay and rescue experiments were also gained an insight into the underlying mechanisms of LINC01296 in CMM cell lines by miR-324-3p/MAPK1 axis. RESULTS In this study, the up-regulation of LINC01296 was found in CMM tissues and cell lines. Functionally, the over-expression of LINC01296 promoted the proliferation in CMM cell lines. In addition, immunochemistry analysis confirmed that the levels of MAPK1 and Ki-67 in sh-LINC01296-xenografted tumors was weaker than that in sh-NC-xenografted tumors. Then, bioinformatics analysis confirmed that LINC01296 interacted with miR-324-3p. Further investigations showed that MAPK1, which collected from the potential related genes of LINC01296, was the conjugated mRNA of miR-324-3p by luciferase reporter assay. Finally, the rescue experiments suggested the positive regulatory association among LINC01296 and MAPK1, which showed that MAPK1 could reverse the promoting-effect of LINC01296 in CMM cells in vitro. CONCLUSIONS Therefore, our findings provided insight into the mechanisms of LINC01296 via miR-324-3p/MAPK1 axis in CMM, and revealed an alternative target for the diagnosis and treatment of CMM.
Collapse
Affiliation(s)
- Kang Wang
- Department of General Surgery, Gaoyou People’s Hospital, Yangzhou 225600, China
| | - Qing Luo
- Department of Dermatology, The First People’s Hospital of Lianyungang, Lianyungang Clinical Medical College of Nanjing Medical University, Lianyungang 222002, China
| | - Yingfeng Zhang
- Department of General Surgery, Gaoyou People’s Hospital, Yangzhou 225600, China
| | - Xin Xie
- Department of General Surgery, Gaoyou People’s Hospital, Yangzhou 225600, China
| | - Wenhao Cheng
- Department of Dermatology, The First People’s Hospital of Lianyungang, Lianyungang Clinical Medical College of Nanjing Medical University, Lianyungang 222002, China
| | - Qiunan Yao
- Department of Dermatology, The First People’s Hospital of Lianyungang, Lianyungang Clinical Medical College of Nanjing Medical University, Lianyungang 222002, China
| | - Yingying Chen
- Department of General Medicine, The First People’s Hospital of Lianyungang, Lianyungang Clinical Medical College of Nanjing Medical University, Lianyungang 222002, China
| | - Hong Ren
- Department of Dermatology, The First People’s Hospital of Lianyungang, Lianyungang Clinical Medical College of Nanjing Medical University, Lianyungang 222002, China
| | - Jiuping Li
- Department of General Surgery, Gaoyou People’s Hospital, Yangzhou 225600, China
| | - Zuanqin Pan
- Department of General Surgery, Gaoyou People’s Hospital, Yangzhou 225600, China
| |
Collapse
|
12
|
Singh D, Roy J. A large-scale benchmark study of tools for the classification of protein-coding and non-coding RNAs. Nucleic Acids Res 2022; 50:12094-12111. [PMID: 36420898 PMCID: PMC9757047 DOI: 10.1093/nar/gkac1092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Revised: 10/22/2022] [Accepted: 10/28/2022] [Indexed: 11/27/2022] Open
Abstract
Identification of protein-coding and non-coding transcripts is paramount for understanding their biological roles. Computational approaches have been addressing this task for over a decade; however, generalized and high-performance models are still unreliable. This benchmark study assessed the performance of 24 tools producing >55 models on the datasets covering a wide range of species. We have collected 135 small and large transcriptomic datasets from existing studies for comparison and identified the potential bottlenecks hampering the performance of current tools. The key insights of this study include lack of standardized training sets, reliance on homogeneous training data, gradual changes in annotated data, lack of augmentation with homology searches, the presence of false positives and negatives in datasets and the lower performance of end-to-end deep learning models. We also derived a new dataset, RNAChallenge, from the benchmark considering hard instances that may include potential false alarms. The best and least well performing models under- and overfit the dataset, respectively, thereby serving a dual purpose. For computational approaches, it will be valuable to develop accurate and unbiased models. The identification of false alarms will be of interest for genome annotators, and experimental study of hard RNAs will help to untangle the complexity of the RNA world.
Collapse
Affiliation(s)
- Dalwinder Singh
- To whom correspondence should be addressed. Tel: +91 172 5221206;
| | - Joy Roy
- Correspondence may also be addressed to Joy Roy.
| |
Collapse
|
13
|
Glycation-Associated Diabetic Nephropathy and the Role of Long Noncoding RNAs. Biomedicines 2022; 10:biomedicines10102623. [PMID: 36289886 PMCID: PMC9599575 DOI: 10.3390/biomedicines10102623] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Revised: 10/12/2022] [Accepted: 10/14/2022] [Indexed: 11/16/2022] Open
Abstract
The glycation of various biomolecules is the root cause of many pathological conditions associated with diabetic nephropathy and end-stage kidney disease. Glycation imbalances metabolism and increases renal cell injury. Numerous therapeutic measures have narrowed down the adverse effects of endogenous glycation, but efficient and potent measures are miles away. Recent advances in the identification and characterization of noncoding RNAs, especially the long noncoding RNAs (lncRNAs), have opened a mammon of new biology to explore the mitigations for glycation-associated diabetic nephropathy. Furthermore, tissue-specific distribution and condition-specific expression make lncRNA a promising key for second-generation therapeutic interventions. Though the techniques to identify and exemplify noncoding RNAs are rapidly evolving, the lncRNA study encounters multiple methodological constraints. This review will discuss lncRNAs and their possible involvement in glycation and advanced glycation end products (AGEs) signaling pathways. We further highlight the possible approaches for lncRNA-based therapeutics and their working mechanism for perturbing glycation and conclude our review with lncRNAs biology-related future opportunities.
Collapse
|
14
|
Bonidia RP, Avila Santos AP, de Almeida BLS, Stadler PF, Nunes da Rocha U, Sanches DS, de Carvalho ACPLF. Information Theory for Biological Sequence Classification: A Novel Feature Extraction Technique Based on Tsallis Entropy. ENTROPY (BASEL, SWITZERLAND) 2022; 24:1398. [PMID: 37420418 DOI: 10.3390/e24101398] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/11/2022] [Revised: 09/16/2022] [Accepted: 09/24/2022] [Indexed: 07/09/2023]
Abstract
In recent years, there has been an exponential growth in sequencing projects due to accelerated technological advances, leading to a significant increase in the amount of data and resulting in new challenges for biological sequence analysis. Consequently, the use of techniques capable of analyzing large amounts of data has been explored, such as machine learning (ML) algorithms. ML algorithms are being used to analyze and classify biological sequences, despite the intrinsic difficulty in extracting and finding representative biological sequence methods suitable for them. Thereby, extracting numerical features to represent sequences makes it statistically feasible to use universal concepts from Information Theory, such as Tsallis and Shannon entropy. In this study, we propose a novel Tsallis entropy-based feature extractor to provide useful information to classify biological sequences. To assess its relevance, we prepared five case studies: (1) an analysis of the entropic index q; (2) performance testing of the best entropic indices on new datasets; (3) a comparison made with Shannon entropy and (4) generalized entropies; (5) an investigation of the Tsallis entropy in the context of dimensionality reduction. As a result, our proposal proved to be effective, being superior to Shannon entropy and robust in terms of generalization, and also potentially representative for collecting information in fewer dimensions compared with methods such as Singular Value Decomposition and Uniform Manifold Approximation and Projection.
Collapse
Affiliation(s)
- Robson P Bonidia
- Institute of Mathematics and Computer Sciences, University of São Paulo, São Carlos 13566-590, Brazil
| | - Anderson P Avila Santos
- Institute of Mathematics and Computer Sciences, University of São Paulo, São Carlos 13566-590, Brazil
- Department of Environmental Microbiology, Helmholtz Centre for Environmental Research-UFZ GmbH, 04318 Leipzig, Germany
| | - Breno L S de Almeida
- Institute of Mathematics and Computer Sciences, University of São Paulo, São Carlos 13566-590, Brazil
| | - Peter F Stadler
- Department of Computer Science and Interdisciplinary Center of Bioinformatics, University of Leipzig, 04107 Leipzig, Germany
| | - Ulisses Nunes da Rocha
- Department of Environmental Microbiology, Helmholtz Centre for Environmental Research-UFZ GmbH, 04318 Leipzig, Germany
| | - Danilo S Sanches
- Department of Computer Science, Federal University of Technology-Paraná-UTFPR, Cornélio Procópio 86300-000, Brazil
| | - André C P L F de Carvalho
- Institute of Mathematics and Computer Sciences, University of São Paulo, São Carlos 13566-590, Brazil
| |
Collapse
|