1
|
Wang X, Yang L, Wang R. DRpred: A Novel Deep Learning-Based Predictor for Multi-Label mRNA Subcellular Localization Prediction by Incorporating Bayesian Inferred Prior Label Relationships. Biomolecules 2024; 14:1067. [PMID: 39334834 PMCID: PMC11430783 DOI: 10.3390/biom14091067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2024] [Revised: 08/23/2024] [Accepted: 08/26/2024] [Indexed: 09/30/2024] Open
Abstract
The subcellular localization of messenger RNA (mRNA) not only helps us to understand the localization regulation of gene expression but also helps to understand the relationship between RNA localization pattern and human disease mechanism, which has profound biological and medical significance. Several predictors have been proposed for predicting the subcellular localization of mRNA. However, there is still considerable room for improvement in their predictive performance, especially regarding multi-label prediction. This study proposes a novel multi-label predictor, DRpred, for mRNA subcellular localization prediction. This predictor first utilizes Bayesian networks to capture the dependencies among labels. Subsequently, it combines these dependencies with features extracted from mRNA sequences using Word2vec, forming the input for the predictor. Finally, it employs a neural network combining BiLSTM and an attention mechanism to capture the internal relationships of the input features for mRNA subcellular localization. The experimental validation on an independent test set demonstrated that DRpred obtained a competitive predictive performance in multi-label prediction and outperformed state-of-the-art predictors in predicting single subcellular localizations, obtaining accuracies of 82.14%, 93.02%, 80.37%, 94.00%, 90.58%, 84.53%, 82.01%, 79.71%, and 85.67% for the chromatin, cytoplasm, cytosol, exosome, membrane, nucleolus, nucleoplasm, nucleus, and ribosome, respectively. It is anticipated to offer profound insights for biological and medical research.
Collapse
Affiliation(s)
- Xiao Wang
- School of Computer Science and Technology, Zhengzhou University of Light Industry, Zhengzhou 450000, China
- Henan Provincial Key Laboratory of Data Intelligence for Food Safety, Zhengzhou University of Light Industry, Zhengzhou 450000, China
| | - Lixiang Yang
- School of Computer Science and Technology, Zhengzhou University of Light Industry, Zhengzhou 450000, China
| | - Rong Wang
- School of Electronic Information, Zhengzhou University of Light Industry, Zhengzhou 450000, China
| |
Collapse
|
2
|
Chung TH, Zhuravskaya A, Makeyev EV. Regulation potential of transcribed simple repeated sequences in developing neurons. Hum Genet 2024; 143:875-895. [PMID: 38153590 PMCID: PMC11294396 DOI: 10.1007/s00439-023-02626-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Accepted: 11/28/2023] [Indexed: 12/29/2023]
Abstract
Simple repeated sequences (SRSs), defined as tandem iterations of microsatellite- to satellite-sized DNA units, occupy a substantial part of the human genome. Some of these elements are known to be transcribed in the context of repeat expansion disorders. Mounting evidence suggests that the transcription of SRSs may also contribute to normal cellular functions. Here, we used genome-wide bioinformatics approaches to systematically examine SRS transcriptional activity in cells undergoing neuronal differentiation. We identified thousands of long noncoding RNAs containing >200-nucleotide-long SRSs (SRS-lncRNAs), with hundreds of these transcripts significantly upregulated in the neural lineage. We show that SRS-lncRNAs often originate from telomere-proximal regions and that they have a strong potential to form multivalent contacts with a wide range of RNA-binding proteins. Our analyses also uncovered a cluster of neurally upregulated SRS-lncRNAs encoded in a centromere-proximal part of chromosome 9, which underwent an evolutionarily recent segmental duplication. Using a newly established in vitro system for rapid neuronal differentiation of induced pluripotent stem cells, we demonstrate that at least some of the bioinformatically predicted SRS-lncRNAs, including those encoded in the segmentally duplicated part of chromosome 9, indeed increase their expression in developing neurons to readily detectable levels. These and other lines of evidence suggest that many SRSs may be expressed in a cell type and developmental stage-specific manner, providing a valuable resource for further studies focused on the functional consequences of SRS-lncRNAs in the normal development of the human brain, as well as in the context of neurodevelopmental disorders.
Collapse
Affiliation(s)
- Tek Hong Chung
- Centre for Developmental Neurobiology, New Hunt's House, King's College London, London, SE1 1UL, UK
| | - Anna Zhuravskaya
- Centre for Developmental Neurobiology, New Hunt's House, King's College London, London, SE1 1UL, UK
| | - Eugene V Makeyev
- Centre for Developmental Neurobiology, New Hunt's House, King's College London, London, SE1 1UL, UK.
| |
Collapse
|
3
|
Yang S, Kim SH, Yang E, Kang M, Joo JY. Molecular insights into regulatory RNAs in the cellular machinery. Exp Mol Med 2024; 56:1235-1249. [PMID: 38871819 PMCID: PMC11263585 DOI: 10.1038/s12276-024-01239-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Revised: 02/27/2024] [Accepted: 03/05/2024] [Indexed: 06/15/2024] Open
Abstract
It is apparent that various functional units within the cellular machinery are derived from RNAs. The evolution of sequencing techniques has resulted in significant insights into approaches for transcriptome studies. Organisms utilize RNA to govern cellular systems, and a heterogeneous class of RNAs is involved in regulatory functions. In particular, regulatory RNAs are increasingly recognized to participate in intricately functioning machinery across almost all levels of biological systems. These systems include those mediating chromatin arrangement, transcription, suborganelle stabilization, and posttranscriptional modifications. Any class of RNA exhibiting regulatory activity can be termed a class of regulatory RNA and is typically represented by noncoding RNAs, which constitute a substantial portion of the genome. These RNAs function based on the principle of structural changes through cis and/or trans regulation to facilitate mutual RNA‒RNA, RNA‒DNA, and RNA‒protein interactions. It has not been clearly elucidated whether regulatory RNAs identified through deep sequencing actually function in the anticipated mechanisms. This review addresses the dominant properties of regulatory RNAs at various layers of the cellular machinery and covers regulatory activities, structural dynamics, modifications, associated molecules, and further challenges related to therapeutics and deep learning.
Collapse
Affiliation(s)
- Sumin Yang
- Department of Pharmacy, College of Pharmacy, Hanyang University, Ansan, Gyeonggi-do, 15588, Republic of Korea
| | - Sung-Hyun Kim
- Department of Pharmacy, College of Pharmacy, Hanyang University, Ansan, Gyeonggi-do, 15588, Republic of Korea
| | - Eunjeong Yang
- Department of Pharmacy, College of Pharmacy, Hanyang University, Ansan, Gyeonggi-do, 15588, Republic of Korea
| | - Mingon Kang
- Department of Computer Science, University of Nevada, Las Vegas, NV, 89154, USA
| | - Jae-Yeol Joo
- Department of Pharmacy, College of Pharmacy, Hanyang University, Ansan, Gyeonggi-do, 15588, Republic of Korea.
| |
Collapse
|
4
|
Yin Y, Shen X. Noncoding RNA-chromatin association: Functions and mechanisms. FUNDAMENTAL RESEARCH 2023; 3:665-675. [PMID: 38933302 PMCID: PMC11197541 DOI: 10.1016/j.fmre.2023.03.006] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2022] [Revised: 03/12/2023] [Accepted: 03/23/2023] [Indexed: 06/28/2024] Open
Abstract
Pervasive transcription of the mammalian genome produces hundreds of thousands of noncoding RNAs (ncRNAs). Numerous studies have suggested that some of these ncRNAs regulate multiple cellular processes and play important roles in physiological and pathological processes. Notably, a large subset of ncRNAs is enriched on chromatin and participates in regulating gene expression and the dynamics of chromatin structure and status. In this review, we summarize recent advances in the functional study of chromatin-associated ncRNAs and mechanistic insights into how these ncRNAs associate with chromatin. We also discuss the potential future challenges which still need to be overcome in this field.
Collapse
Affiliation(s)
- Yafei Yin
- Department of Cell Biology and Department of Cardiology of the Second Affiliated Hospital, Zhejiang University School of Medicine, Yuhangtang Road, Hangzhou, Zhejiang 310058, China
| | - Xiaohua Shen
- Tsinghua-Peking Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing 100084, China
| |
Collapse
|
5
|
Efovi D, Xiao Q. Noncoding RNAs in Vascular Cell Biology and Restenosis. BIOLOGY 2022; 12:24. [PMID: 36671717 PMCID: PMC9855655 DOI: 10.3390/biology12010024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Revised: 12/17/2022] [Accepted: 12/20/2022] [Indexed: 12/25/2022]
Abstract
In-stent restenosis (ISR), characterised by ≥50% re-narrowing of the target vessel, is a common complication following stent implantation and remains a significant challenge to the long-term success of angioplasty procedures. Considering the global burden of cardiovascular diseases, improving angioplasty patient outcomes remains a key priority. Noncoding RNAs (ncRNAs) including microRNA (miRNA), long noncoding RNA (lncRNA) and circular RNA (circRNA) have been extensively implicated in vascular cell biology and ISR through multiple, both distinct and overlapping, mechanisms. Vascular smooth muscle cells, endothelial cells and macrophages constitute the main cell types involved in the multifactorial pathophysiology of ISR. The identification of critical regulators exemplified by ncRNAs in all these cell types and processes makes them an exciting therapeutic target in the field of restenosis. In this review, we will comprehensively explore the potential functions and underlying molecular mechanisms of ncRNAs in vascular cell biology in the context of restenosis, with an in-depth focus on vascular cell dysfunction during restenosis development and progression. We will also discuss the diagnostic biomarker and therapeutic target potential of ncRNAs in ISR. Finally, we will discuss the current shortcomings, challenges, and perspectives toward the clinical application of ncRNAs.
Collapse
Affiliation(s)
- Denis Efovi
- William Harvey Research Institute, Faculty of Medicine and Dentistry, Queen Mary University of London, London EC1M 6BQ, UK
| | - Qingzhong Xiao
- William Harvey Research Institute, Faculty of Medicine and Dentistry, Queen Mary University of London, London EC1M 6BQ, UK
- Key Laboratory of Cardiovascular Diseases, School of Basic Medical Sciences, Guangzhou Institute of Cardiovascular Disease, The Second Affiliated Hospital, Guangzhou Medical University, Guangzhou 511436, China
| |
Collapse
|
6
|
Castro-Oropeza R, Piña-Sánchez P. Epigenetic and Transcriptomic Regulation Landscape in HPV+ Cancers: Biological and Clinical Implications. Front Genet 2022; 13:886613. [PMID: 35774512 PMCID: PMC9237502 DOI: 10.3389/fgene.2022.886613] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Accepted: 05/24/2022] [Indexed: 11/13/2022] Open
Abstract
Human Papillomavirus (HPV) is an oncogenic virus that causes the highest number of viral-associated cancer cases and deaths worldwide, with more than 690,000 new cases per year and 342,000 deaths only for cervical cancer (CC). Although the incidence and mortality rates for CC are declining in countries where screening and vaccination programs have been implemented, other types of cancer in which HPV is involved, such as oropharyngeal cancer, are increasing, particularly in men. Mutational and transcriptional profiles of various HPV-associated neoplasms have been described, and accumulated evidence has shown the oncogenic capacity of E6, E7, and E5 genes of high-risk HPV. Interestingly, transcriptomic analysis has revealed that although a vast majority of the human genome is transcribed into RNAs, only 2% of transcripts are translated into proteins. The remaining transcripts lacking protein-coding potential are called non-coding RNAs. In addition to the transfer and ribosomal RNAs, there are regulatory non-coding RNAs classified according to size and structure in long non-coding RNAs (lncRNAs), circular RNAs (circRNAs), and small RNAs; such as microRNAs (miRNAs), piwi-associated RNAs (piRNAs), small nucleolar RNAs (snoRNAs) and endogenous short-interfering RNAs. Recent evidence has shown that lncRNAs, miRNAs, and circRNAs are aberrantly expressed under pathological conditions such as cancer. In addition, those transcripts are dysregulated in HPV-related neoplasms, and their expression correlates with tumor progression, metastasis, poor prognosis, and recurrence. Nuclear lncRNAs are epigenetic regulators involved in controlling gene expression at the transcriptional level through chromatin modification and remodeling. Moreover, disruption of the expression profiles of those lncRNAs affects multiple biological processes such as cell proliferation, apoptosis, and migration. This review highlights the epigenetic alterations induced by HPV, from infection to neoplastic transformation. We condense the epigenetic role of non-coding RNA alterations and their potential as biomarkers in transformation's early stages and clinical applications. We also summarize the molecular mechanisms of action of nuclear lncRNAs to understand better their role in the epigenetic control of gene expression and how they can drive the malignant phenotype of HPV-related neoplasia. Finally, we review several chemical and epigenetic therapy options to prevent and treat HPV-associated neoplasms.
Collapse
Affiliation(s)
| | - Patricia Piña-Sánchez
- Molecular Oncology Laboratory, Oncology Research Unit, Oncology Hospital, IMSS National Medical Center, Mexico City, Mexico
| |
Collapse
|
7
|
Matthews BJ, Melia T, Waxman DJ. Harnessing natural variation to identify cis regulators of sex-biased gene expression in a multi-strain mouse liver model. PLoS Genet 2021; 17:e1009588. [PMID: 34752452 PMCID: PMC8664386 DOI: 10.1371/journal.pgen.1009588] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2021] [Revised: 12/10/2021] [Accepted: 10/27/2021] [Indexed: 12/13/2022] Open
Abstract
Sex differences in gene expression are widespread in the liver, where many autosomal factors act in tandem with growth hormone signaling to regulate individual variability of sex differences in liver metabolism and disease. Here, we compare hepatic transcriptomic and epigenetic profiles of mouse strains C57BL/6J and CAST/EiJ, representing two subspecies separated by 0.5-1 million years of evolution, to elucidate the actions of genetic factors regulating liver sex differences. We identify 144 protein coding genes and 78 lncRNAs showing strain-conserved sex bias; many have gene ontologies relevant to liver function, are more highly liver-specific and show greater sex bias, and are more proximally regulated than genes whose sex bias is strain-dependent. The strain-conserved genes include key growth hormone-dependent transcriptional regulators of liver sex bias; however, three other transcription factors, Trim24, Tox, and Zfp809, lose their sex-biased expression in CAST/EiJ mouse liver. To elucidate the observed strain specificities in expression, we characterized the strain-dependence of sex-biased chromatin opening and enhancer marks at cis regulatory elements (CREs) within expression quantitative trait loci (eQTL) regulating liver sex-biased genes. Strikingly, 208 of 286 eQTLs with strain-specific, sex-differential effects on expression were associated with a complete gain, loss, or reversal of the sex differences in expression between strains. Moreover, 166 of the 286 eQTLs were linked to the strain-dependent gain or loss of localized sex-biased CREs. Remarkably, a subset of these CREs apparently lacked strain-specific genetic variants yet showed coordinated, strain-dependent sex-biased epigenetic regulation. Thus, we directly link hundreds of strain-specific genetic variants to the high variability in CRE activity and expression of sex-biased genes and uncover underlying genetically-determined epigenetic states controlling liver sex bias in genetically diverse mouse populations.
Collapse
Affiliation(s)
- Bryan J. Matthews
- Department of Biology, Boston University, Boston, Massachusetts, United States of America
| | - Tisha Melia
- Department of Biology, Boston University, Boston, Massachusetts, United States of America
- Bioinformatics Program, Boston University, Boston, Massachusetts, United States of America
| | - David J. Waxman
- Department of Biology, Boston University, Boston, Massachusetts, United States of America
- Bioinformatics Program, Boston University, Boston, Massachusetts, United States of America
| |
Collapse
|