1
|
Asim MN, Ibrahim MA, Asif T, Dengel A. RNA sequence analysis landscape: A comprehensive review of task types, databases, datasets, word embedding methods, and language models. Heliyon 2025; 11:e41488. [PMID: 39897847 PMCID: PMC11783440 DOI: 10.1016/j.heliyon.2024.e41488] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2024] [Revised: 12/23/2024] [Accepted: 12/24/2024] [Indexed: 02/04/2025] Open
Abstract
Deciphering information of RNA sequences reveals their diverse roles in living organisms, including gene regulation and protein synthesis. Aberrations in RNA sequence such as dysregulation and mutations can drive a diverse spectrum of diseases including cancers, genetic disorders, and neurodegenerative conditions. Furthermore, researchers are harnessing RNA's therapeutic potential for transforming traditional treatment paradigms into personalized therapies through the development of RNA-based drugs and gene therapies. To gain insights of biological functions and to detect diseases at early stages and develop potent therapeutics, researchers are performing diverse types RNA sequence analysis tasks. RNA sequence analysis through conventional wet-lab methods is expensive, time-consuming and error prone. To enable large-scale RNA sequence analysis, empowerment of wet-lab experimental methods with Artificial Intelligence (AI) applications necessitates scientists to have a comprehensive knowledge of both DNA and AI fields. While molecular biologists encounter challenges in understanding AI methods, computer scientists often lack basic foundations of RNA sequence analysis tasks. Considering the absence of a comprehensive literature that bridges this research gap and promotes the development of AI-driven RNA sequence analysis applications, the contributions of this manuscript are manifold: It equips AI researchers with biological foundations of 47 distinct RNA sequence analysis tasks. It sets a stage for development of benchmark datasets related to 47 distinct RNA sequence analysis tasks by facilitating cruxes of 64 different biological databases. It presents word embeddings and language models applications across 47 distinct RNA sequence analysis tasks. It streamlines the development of new predictors by providing a comprehensive survey of 58 word embeddings and 70 language models based predictive pipelines performance values as well as top performing traditional sequence encoding based predictors and their performances across 47 RNA sequence analysis tasks.
Collapse
Affiliation(s)
- Muhammad Nabeel Asim
- German Research Center for Artificial Intelligence GmbH, Kaiserslautern, 67663, Germany
| | - Muhammad Ali Ibrahim
- German Research Center for Artificial Intelligence GmbH, Kaiserslautern, 67663, Germany
- Department of Computer Science, Rhineland-Palatinate Technical University of Kaiserslautern-Landau, Kaiserslautern, 67663, Germany
| | - Tayyaba Asif
- Department of Computer Science, Rhineland-Palatinate Technical University of Kaiserslautern-Landau, Kaiserslautern, 67663, Germany
| | - Andreas Dengel
- German Research Center for Artificial Intelligence GmbH, Kaiserslautern, 67663, Germany
- Department of Computer Science, Rhineland-Palatinate Technical University of Kaiserslautern-Landau, Kaiserslautern, 67663, Germany
| |
Collapse
|
2
|
Fang G, Shen Y, Gao X, Yang L, Zhu A, Liao D. Consistent RNA expression and RNA modification patterns in cardiotoxicity induced by Matrine and Evodiamine. Front Pharmacol 2025; 15:1485007. [PMID: 39850555 PMCID: PMC11755041 DOI: 10.3389/fphar.2024.1485007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2024] [Accepted: 12/16/2024] [Indexed: 01/25/2025] Open
Abstract
Recent research has demonstrated the efficacy of traditional Chinese medicine (TCM) and its active compounds in combating cancer, leading to an increasing utilization of TCM as adjunctive therapy in clinical oncology. However, the optimal dosage of TCM remains unclear, and excessive use may result in cardiotoxicity, which poses a significant health concern for patients undergoing systemic treatment. Therefore, elucidating the underlying mechanisms of cytotoxicity induced by TCM can provide valuable insights for clinical management. In this study, we employed a comprehensive bioinformatics analysis to present sequencing data obtained from AC16 myocardial cells treated with two bioactive derived from botanical drugs: Matrine and Evodiamine. We aim to investigate the dysregulated signaling pathways associated with cardiotoxicity induced by these compounds. Based on our sequencing results, we observed consistent patterns of gene expression and epitranscriptome regulation (m6A and A-to-I modifications) across various drugs-treated AC16 cells when analyzed using KEGG pathway enrichment and gene ontology analyses. Furthermore, m6A writers VIRMA and A-to-I writers ADARB1 is consistent target of Evodiamine and Matrine. In general, our findings suggest that different Chinese botanical drugs induced cardiotoxicity may share common therapeutic strategies.
Collapse
Affiliation(s)
- Guanhua Fang
- Department of Cardiovascular Surgery, Fujian Medical University Union Hospital, Fuzhou, Fujian, China
- Heart Center of Fujian Medical University, Fuzhou, Fujian, China
| | - Yanming Shen
- Department of Cardiovascular Surgery, Fujian Medical University Union Hospital, Fuzhou, Fujian, China
| | - Xinyue Gao
- Key Laboratory of Ministry of Education for Gastrointestinal Cancer, School of Basic Medical Sciences, Fujian Medical University, Fuzhou, China
| | - Lele Yang
- Key Laboratory of Ministry of Education for Gastrointestinal Cancer, School of Basic Medical Sciences, Fujian Medical University, Fuzhou, China
| | - An Zhu
- Key Laboratory of Ministry of Education for Gastrointestinal Cancer, School of Basic Medical Sciences, Fujian Medical University, Fuzhou, China
| | - Dongshan Liao
- Department of Cardiovascular Surgery, Fujian Medical University Union Hospital, Fuzhou, Fujian, China
- Heart Center of Fujian Medical University, Fuzhou, Fujian, China
| |
Collapse
|
3
|
Wang H, Wang Y, Zhou J, Song B, Tu G, Nguyen A, Su J, Coenen F, Wei Z, Rigden DJ, Meng J. Statistical modeling of single-cell epitranscriptomics enabled trajectory and regulatory inference of RNA methylation. CELL GENOMICS 2025; 5:100702. [PMID: 39642887 PMCID: PMC11770222 DOI: 10.1016/j.xgen.2024.100702] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/15/2024] [Revised: 10/07/2024] [Accepted: 11/06/2024] [Indexed: 12/09/2024]
Abstract
As a fundamental mechanism for gene expression regulation, post-transcriptional RNA methylation plays versatile roles in various biological processes and disease mechanisms. Recent advances in single-cell technology have enabled simultaneous profiling of transcriptome-wide RNA methylation in thousands of cells, holding the promise to provide deeper insights into the dynamics, functions, and regulation of RNA methylation. However, it remains a major challenge to determine how to best analyze single-cell epitranscriptomics data. In this study, we developed SigRM, a computational framework for effectively mining single-cell epitranscriptomics datasets with a large cell number, such as those produced by the scDART-seq technique from the SMART-seq2 platform. SigRM not only outperforms state-of-the-art models in RNA methylation site detection on both simulated and real datasets but also provides rigorous quantification metrics of RNA methylation levels. This facilitates various downstream analyses, including trajectory inference and regulatory network reconstruction concerning the dynamics of RNA methylation.
Collapse
Affiliation(s)
- Haozhe Wang
- Department of Biosciences and Bioinformatics, Center for Intelligent RNA Therapeutics, Suzhou Key Laboratory of Cancer Biology and Chronic Disease, School of Science, XJTLU Entrepreneur College, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China; Department of Computer Science, University of Liverpool, L7 8TX Liverpool, UK
| | - Yue Wang
- School of Pharmacy, Nanjing University of Chinese Medicine, Nanjing, Jiangsu 210023, China.
| | - Jingxian Zhou
- School of AI and Advanced Computing, XJTLU Entrepreneur College, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China; Department of Computer Science, University of Liverpool, L7 8TX Liverpool, UK; Sino-French Hoffmann Institute, School of Basic Medical Sciences, Guangzhou Medical University, Guangzhou, Guangdong 511436, China
| | - Bowen Song
- Department of Public Health, School of Medicine, Nanjing University of Chinese Medicine, Nanjing, Jiangsu 210023, China; Synthetic and Functional Biomolecules Center, Beijing National Laboratory for Molecular Sciences, Key Laboratory of Bioorganic Chemistry and Molecular Engineering of Ministry of Education, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
| | - Gang Tu
- Department of Biosciences and Bioinformatics, Center for Intelligent RNA Therapeutics, Suzhou Key Laboratory of Cancer Biology and Chronic Disease, School of Science, XJTLU Entrepreneur College, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
| | - Anh Nguyen
- Department of Computer Science, University of Liverpool, L7 8TX Liverpool, UK
| | - Jionglong Su
- School of AI and Advanced Computing, XJTLU Entrepreneur College, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
| | - Frans Coenen
- Department of Computer Science, University of Liverpool, L7 8TX Liverpool, UK
| | - Zhi Wei
- Department of Computer Science, New Jersey Institute of Technology, Newark, NJ 07102, USA
| | - Daniel J Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L7 8TX Liverpool, UK
| | - Jia Meng
- Department of Biosciences and Bioinformatics, Center for Intelligent RNA Therapeutics, Suzhou Key Laboratory of Cancer Biology and Chronic Disease, School of Science, XJTLU Entrepreneur College, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China; Institute of Biomedical Research, Regulatory Mechanism and Targeted Therapy for Liver Cancer Shiyan Key Laboratory, Hubei Provincial Clinical Research Center for Precise Diagnosis and Treatment of Liver Cancer, Taihe Hospital, Hubei University of Medicine, Shiyan, Hubei 442000, China; Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L7 8TX Liverpool, UK.
| |
Collapse
|
4
|
Boileau E, Wilhelmi H, Busch A, Cappannini A, Hildebrand A, Bujnicki J, Dieterich C. Sci-ModoM: a quantitative database of transcriptome-wide high-throughput RNA modification sites. Nucleic Acids Res 2025; 53:D310-D317. [PMID: 39498498 PMCID: PMC11701610 DOI: 10.1093/nar/gkae972] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2024] [Revised: 09/28/2024] [Accepted: 10/13/2024] [Indexed: 01/18/2025] Open
Abstract
We present Sci-ModoM, the first next-generation RNome database offering a holistic view of the epitranscriptomic landscape. Sci-ModoM has a simple yet powerful interface, underpinned by FAIR data principles, a standardized nomenclature, and interoperable formats, fostering the use of common standards within the epitranscriptomics community. Sci-ModoM provides quantitative measurements per site and dataset, enabling users to assess confidence levels based on score, coverage, and stoichiometry. Data in Sci-ModoM is directly traceable to its sources. Users can Search and Compare over six million modifications across 156 datasets, Browse or download datasets, and retrieve metadata. A comparison tool offers a novel and unique opportunity to compare modifications site-wise across datasets, with the ability to securely upload and compare user data against latest published research. Sci-ModoM empowers researchers, including non-experts, to access a broad spectrum of recent quantitative RNA modification data, thereby enhancing the utility and impact of latest discoveries, and opening new avenues in biological and medical research.
Collapse
Affiliation(s)
- Etienne Boileau
- Klaus Tschira Institute for Integrative Computational Cardiology, Im Neuenheimer Feld 669, 69120 Heidelberg, Germany
- Department of Internal Medicine III, University Hospital Heidelberg, Im Neuenheimer Feld 410, 69120 Heidelberg, Germany
- German Center for Cardiovascular Research - Partner site Heidelberg/Mannheim, Im Neuenheimer Feld 669, 69120 Heidelberg, Germany
| | - Harald Wilhelmi
- Klaus Tschira Institute for Integrative Computational Cardiology, Im Neuenheimer Feld 669, 69120 Heidelberg, Germany
- Department of Internal Medicine III, University Hospital Heidelberg, Im Neuenheimer Feld 410, 69120 Heidelberg, Germany
| | - Anne Busch
- Institute of Computer Science, Johannes Gutenberg-University Mainz, Staudingerweg 9, 55128 Mainz, Germany
| | - Andrea Cappannini
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | - Andreas Hildebrand
- Institute of Computer Science, Johannes Gutenberg-University Mainz, Staudingerweg 9, 55128 Mainz, Germany
| | - Janusz M Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | - Christoph Dieterich
- Klaus Tschira Institute for Integrative Computational Cardiology, Im Neuenheimer Feld 669, 69120 Heidelberg, Germany
- Department of Internal Medicine III, University Hospital Heidelberg, Im Neuenheimer Feld 410, 69120 Heidelberg, Germany
- German Center for Cardiovascular Research - Partner site Heidelberg/Mannheim, Im Neuenheimer Feld 669, 69120 Heidelberg, Germany
| |
Collapse
|
5
|
Fan R, Cui C, Kang B, Chang Z, Wang G, Cui Q. A combined deep learning framework for mammalian m6A site prediction. CELL GENOMICS 2024; 4:100697. [PMID: 39571573 DOI: 10.1016/j.xgen.2024.100697] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/29/2024] [Revised: 09/17/2024] [Accepted: 10/28/2024] [Indexed: 12/14/2024]
Abstract
N6-methyladenosine (m6A) is the most prevalent chemical modification in eukaryotic mRNAs and plays key roles in diverse cellular processes. Precise localization of m6A sites is thus critical for characterizing the functional roles of m6A in various conditions and dissecting the mechanisms governing its deposition. Here, we design a combined framework of Transformer architecture and recurrent neural network, deepSRAMP, to identify m6A sites using sequence-based and genome-derived features. As a result, deepSRAMP achieves a notably enhanced performance compared to its predecessor, SRAMP, the most-used predictor in this field. Moreover, based on multiple benchmark datasets, deepSRAMP greatly outperforms other state-of-the-art m6A predictors, including WHISTLE and DeepPromise, with an average 16.1% and 18.3% increase in AUROC and a 43.9% and 46.4% increase in AUPRC. Finally, deepSRAMP can be successfully exploited on mammalian m6A epitranscriptome mapping under diverse cellular conditions and can potentially reveal differential m6A sites among transcript isoforms of individual genes.
Collapse
Affiliation(s)
- Rui Fan
- Department of Biomedical Informatics, State Key Laboratory of Vascular Homeostasis and Remodeling, School of Basic Medical Sciences, Peking University, 38 Xueyuan Road, Beijing 100191, China
| | - Chunmei Cui
- Department of Biomedical Informatics, State Key Laboratory of Vascular Homeostasis and Remodeling, School of Basic Medical Sciences, Peking University, 38 Xueyuan Road, Beijing 100191, China.
| | - Boming Kang
- Department of Biomedical Informatics, State Key Laboratory of Vascular Homeostasis and Remodeling, School of Basic Medical Sciences, Peking University, 38 Xueyuan Road, Beijing 100191, China
| | - Zecheng Chang
- State Key Laboratory for Diagnosis and Treatment of Severe Zoonotic Infectious Diseases, College of Basic Medicine, Jilin University, Changchun 130021, China
| | - Guoqing Wang
- State Key Laboratory for Diagnosis and Treatment of Severe Zoonotic Infectious Diseases, College of Basic Medicine, Jilin University, Changchun 130021, China.
| | - Qinghua Cui
- School of Sports Medicine, Wuhan Institute of Physical Education, No. 461 Luoyu Road, Wuchang District, Wuhan 430079, Hubei Province, China; Department of Biomedical Informatics, State Key Laboratory of Vascular Homeostasis and Remodeling, School of Basic Medical Sciences, Peking University, 38 Xueyuan Road, Beijing 100191, China; Department of Cardiology and Institute of Vascular Medicine, Peking University Third Hospital, 49 Huayuanbei Road, Beijing 100191, China.
| |
Collapse
|
6
|
Xia R, Yin X, Huang J, Chen K, Ma J, Wei Z, Su J, Blake N, Rigden DJ, Meng J, Song B. Interpretable deep cross networks unveiled common signatures of dysregulated epitranscriptomes across 12 cancer types. MOLECULAR THERAPY. NUCLEIC ACIDS 2024; 35:102376. [PMID: 39618823 PMCID: PMC11605186 DOI: 10.1016/j.omtn.2024.102376] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/31/2024] [Accepted: 10/25/2024] [Indexed: 01/12/2025]
Abstract
Cancer is a complex and multifaceted group of diseases characterized by uncontrolled cell growth that leads to the formation of malignant tumors. Recent studies suggest that N6-methyladenosine (m6A) RNA methylation plays pivotal roles in cancer pathology by influencing various cellular processes. However, the degree to which these mechanisms are shared across different cancer types remains unclear. In this study, we analyze an expansive array of 167 m6A epitranscriptome profiles covering 12 distinct cancer types and their originating normal tissues. We trained 12 distinct, cancer type-specific interpretable deep cross network models, which successfully distinguish between specific pairs of normal and cancer m6A contexts using integrated information from both the sequences and curated genomic knowledge. Interestingly, cross-cancer type testing indicated the existence of shared genomic patterns across various cancers at the epitranscriptome level. A pan-cancer model was subsequently developed to identify these shared patterns that could not be observed in a single cancer type. Our analysis uncovered, for the first time, a common epitranscriptome signature shared across multiple cancer types, particularly associated with RNA hybridization process and aberrant splicing. This highlights the importance of a comprehensive understanding of the pan-cancer epitranscriptome and holding potential implications in the development of RNA methylation-based therapeutics for various cancers.
Collapse
Affiliation(s)
- Rong Xia
- Department of Public Health, School of Medicine, Nanjing University of Chinese Medicine, Nanjing 210023, China
- Department of Biological Sciences, School of Science, Suzhou Key Laboratory of Cancer Biology and Chronic Disease, Xi’an Jiaotong-Liverpool University, Suzhou 215123, China
- School of AI and Advanced Computing, XJTLU Entrepreneur College (Taicang), Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
| | - Xiangyu Yin
- Department of Biological Sciences, School of Science, Suzhou Key Laboratory of Cancer Biology and Chronic Disease, Xi’an Jiaotong-Liverpool University, Suzhou 215123, China
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L7 8TX Liverpool, UK
| | - Jiaming Huang
- Department of Biological Sciences, School of Science, Suzhou Key Laboratory of Cancer Biology and Chronic Disease, Xi’an Jiaotong-Liverpool University, Suzhou 215123, China
| | - Kunqi Chen
- Key Laboratory of Ministry of Education for Gastrointestinal Cancer, School of Basic Medical Sciences, Fujian Medical University, Fuzhou 350004, China
| | - Jiongming Ma
- Department of Biological Sciences, School of Science, Suzhou Key Laboratory of Cancer Biology and Chronic Disease, Xi’an Jiaotong-Liverpool University, Suzhou 215123, China
| | - Zhen Wei
- Department of Biological Sciences, School of Science, Suzhou Key Laboratory of Cancer Biology and Chronic Disease, Xi’an Jiaotong-Liverpool University, Suzhou 215123, China
- Institute of Infection, Veterinary & Ecological Sciences, University of Liverpool, L7 8TX Liverpool, UK
| | - Jionglong Su
- School of AI and Advanced Computing, XJTLU Entrepreneur College (Taicang), Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
| | - Neil Blake
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L7 8TX Liverpool, UK
| | - Daniel J. Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L7 8TX Liverpool, UK
| | - Jia Meng
- Institute of Biomedical Research, Regulatory Mechanism and Targeted Therapy for Liver Cancer Shiyan Key Laboratory, Hubei Provincial Clinical Research Center for Precise Diagnosis and Treatment of Liver Cancer, Taihe Hospital, Hubei University of Medicine, Shiyan, Hubei 442000, China
- Department of Biological Sciences, School of Science, Suzhou Key Laboratory of Cancer Biology and Chronic Disease, Xi’an Jiaotong-Liverpool University, Suzhou 215123, China
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L7 8TX Liverpool, UK
| | - Bowen Song
- Department of Public Health, School of Medicine, Nanjing University of Chinese Medicine, Nanjing 210023, China
| |
Collapse
|
7
|
Wu Q, Chen M, Lin Y, Zhang J, Gao X, Wu Y, Wu C, Wen J, Li J, Li C, Bao W, Zhang D, Zheng M, Zhu A. Multiomics profiling uncovers curdione-induced reproductive toxicity in HTR-8/SVneo cells. Heliyon 2024; 10:e38650. [PMID: 39524727 PMCID: PMC11550733 DOI: 10.1016/j.heliyon.2024.e38650] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2024] [Revised: 09/08/2024] [Accepted: 09/26/2024] [Indexed: 11/16/2024] Open
Abstract
The assessment of medication toxicity and safety is pivotal during pregnancy. Curdione, a sesquiterpene compound extracted from Curcumae Radix, displays beneficial properties in terms of anti-inflammatory, tumor growth suppression, and anti-coagulative effects. However, its reproductive toxicity and precise mechanism remain unclear. This study aims to explore the mechanism of curdione-induced toxicity damage in HTR-8/SVneo cells through the epigenetics, proteomics, and metabolomics, and experimental verification. The results showed that curdione elicited alterations in m6A modification, gene expression, protein levels, and cellular metabolism of HTR-8/SVneo cells. Additionally, curdione induces oxidative stress, mitochondrial and DNA damage, while also downregulating the expression of Wnt6, β-catenin, ZO-1, and CLDN1 proteins. Curdione has the potential to modulate oxidative stress, mitochondrial dysfunction, and disruption of tight junctions via the Wnt/β-catenin signaling pathway, which contributes to cellular damage in HTR-8/SVneo cells.
Collapse
Affiliation(s)
- Qibin Wu
- Fujian Maternity and Child Health Hospital, College of Clinical Medicine for Obstetrics & Gynecology and Pediatrics, Fujian Medical University, Fuzhou 350001, China
| | - Mengting Chen
- Key Laboratory of Gastrointestinal Cancer (Fujian Medical University), Ministry of Education, Fuzhou, 350108, China
| | - Yifan Lin
- Key Laboratory of Gastrointestinal Cancer (Fujian Medical University), Ministry of Education, Fuzhou, 350108, China
| | - Jian Zhang
- Key Laboratory of Gastrointestinal Cancer (Fujian Medical University), Ministry of Education, Fuzhou, 350108, China
| | - Xinyue Gao
- Key Laboratory of Gastrointestinal Cancer (Fujian Medical University), Ministry of Education, Fuzhou, 350108, China
| | - Yajiao Wu
- Key Laboratory of Gastrointestinal Cancer (Fujian Medical University), Ministry of Education, Fuzhou, 350108, China
| | - Caijin Wu
- Fujian Maternity and Child Health Hospital, College of Clinical Medicine for Obstetrics & Gynecology and Pediatrics, Fujian Medical University, Fuzhou 350001, China
| | - Jiaxin Wen
- Key Laboratory of Gastrointestinal Cancer (Fujian Medical University), Ministry of Education, Fuzhou, 350108, China
| | - Jiaqi Li
- Key Laboratory of Gastrointestinal Cancer (Fujian Medical University), Ministry of Education, Fuzhou, 350108, China
| | - Chutao Li
- Key Laboratory of Gastrointestinal Cancer (Fujian Medical University), Ministry of Education, Fuzhou, 350108, China
| | - Wenqiang Bao
- Key Laboratory of Gastrointestinal Cancer (Fujian Medical University), Ministry of Education, Fuzhou, 350108, China
| | - Dongcheng Zhang
- Key Laboratory of Gastrointestinal Cancer (Fujian Medical University), Ministry of Education, Fuzhou, 350108, China
| | - Meijin Zheng
- Fujian Maternity and Child Health Hospital, College of Clinical Medicine for Obstetrics & Gynecology and Pediatrics, Fujian Medical University, Fuzhou 350001, China
| | - An Zhu
- Fujian Maternity and Child Health Hospital, College of Clinical Medicine for Obstetrics & Gynecology and Pediatrics, Fujian Medical University, Fuzhou 350001, China
- Key Laboratory of Gastrointestinal Cancer (Fujian Medical University), Ministry of Education, Fuzhou, 350108, China
| |
Collapse
|
8
|
Lin B, Zhang J, Chen M, Gao X, Wen J, Tian K, Wu Y, Chen Z, Yang Q, Zhu A, Du C. Comprehensive Profiling of Transcriptome and m6A Epitranscriptome Uncovers the Neurotoxic Effects of Yunaconitine on HT22 Cells. Evol Bioinform Online 2024; 20:11769343241290461. [PMID: 39483791 PMCID: PMC11526304 DOI: 10.1177/11769343241290461] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2024] [Accepted: 09/24/2024] [Indexed: 11/03/2024] Open
Abstract
Objective To explore different mRNA transcriptome patterns and RNA N6-methyladenosine (m6A) alteration in yunaconitine (YA)-treated HT22 mouse hippocampal neuron, and uncover the role of abnormal mRNA expression and RNA m6A modification in YA-induced neurotoxicity. Methods HT22 cells were treated with 0, 5, 10, and 50 μM of YA for 72 h to evaluate their viability and GSH content. Subsequently, mRNA-seq and MeRIP-seq analyses were performed on HT22 cells treated with 0 and 10 μM YA for 72 h, and molecular docking was used to simulate interactions between YA and differentially expressed m6A regulators. The mitochondrial membrane potential was examined using the JC-10 probe, and RT-qPCR was conducted to verify the expression levels of differentially expressed m6A regulatory factors, as well as to assess alterations in the mRNA expression levels of antioxidant genes. Results YA treatment significantly reduced the viability of HT22 cells and decreased GSH content. The mRNA-seq analysis obtained 1018 differentially expressed genes, KEGG and GO enrichment results of differentially expressed genes mainly comprise the nervous system development, cholinergic synapse, response to oxidative stress, and mitochondrial inner membrane. A total of 7 differentially expressed m6A regulators were identified by MeRIP-seq. Notably, molecular docking results suggested a stable interaction between YA and most of the differentially expressed m6A regulators. Conclusion This study showed that YA-induced HT22 cell damage was associated with the increased methylation modification level of target gene m6A and abnormal expression of m6A regulators.
Collapse
Affiliation(s)
- Beian Lin
- The Second Affiliated Hospital of Fujian Medical University, Quanzhou, China
| | - Jian Zhang
- Key Laboratory of Gastrointestinal Cancer (Fujian Medical University), Ministry of Education, Fuzhou, China
- Department of Preventive Medicine, School of Public Health, Fujian Medical University Fuzhou, China
| | - Mengting Chen
- Key Laboratory of Gastrointestinal Cancer (Fujian Medical University), Ministry of Education, Fuzhou, China
| | - Xinyue Gao
- Key Laboratory of Gastrointestinal Cancer (Fujian Medical University), Ministry of Education, Fuzhou, China
| | - Jiaxin Wen
- Key Laboratory of Gastrointestinal Cancer (Fujian Medical University), Ministry of Education, Fuzhou, China
| | - Kun Tian
- Key Laboratory of Gastrointestinal Cancer (Fujian Medical University), Ministry of Education, Fuzhou, China
| | - Yajiao Wu
- Key Laboratory of Gastrointestinal Cancer (Fujian Medical University), Ministry of Education, Fuzhou, China
| | - Zekai Chen
- Department of Clinical Medicine, School of Basic Medicine, Fujian Medical University, Fuzhou, China
| | - Qiaomei Yang
- Department of Gynecology, Fujian Maternity and Child Health Hospital (Fujian Obstetrics and Gynecology Hospital), Fuzhou, China
| | - An Zhu
- Key Laboratory of Gastrointestinal Cancer (Fujian Medical University), Ministry of Education, Fuzhou, China
- Department of Preventive Medicine, School of Public Health, Fujian Medical University Fuzhou, China
| | - Chunhong Du
- The Second Affiliated Hospital of Fujian Medical University, Quanzhou, China
| |
Collapse
|
9
|
Yao Z, Chen L, Liu Y, Feng B, Liu C, Chen Y, He S. Exploration of N6-methyladenosine modification in ascorbic acid 2-glucoside constructed stem cell sheets. J Mol Histol 2024; 55:909-925. [PMID: 39133390 DOI: 10.1007/s10735-024-10240-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2024] [Accepted: 07/30/2024] [Indexed: 08/13/2024]
Abstract
The aim of this study was to explore the mechanism of bone marrow stem cells (BMSCs) sheets constructed with different doses of Ascorbic acid 2-glucoside (AA-2G) in conjunction with N6-methyladenosine (m6A)-associated epigenetic genes analysing transcriptome sequencing data. Experimental groups of BMSCs induced by different AA-2G concentrations were set up, and the tissue structures were observed by histological staining of cell slices and scanning electron microscopy. Expression patterns of DEGs were analysed using short-time sequence expression mining software, and DEGs associated with m6A were selected for gene ontology analysis and pathway analysis. The protein-protein interaction (PPI) network of DEGs was analysed and gene functions were predicted using the search tool of the Retrieve Interacting Genes database. There were 464 up-regulated DEGs and 303 down-regulated DEGs between the control and high-dose AA-2G treatment groups, and 175 up-regulated DEGs and 37 down-regulated DEGs between the low and high-dose AA-2G treatment groups. The profile 7 exhibited a gradual increase in gene expression levels over AA-2G concentration. In contrast, profile 0 exhibited a gradual decrease in gene expression levels over AA-2G concentration. In the PPI network of m6A-related DEGs in profile 7, the cluster of metallopeptidase inhibitor 1 (Timp1), intercellular adhesion molecule 1 (Icam1), insulin-like growth factor 1 (Igf1), matrix metallopeptidase 2 (Mmp2), serpin family E member 1 (Serpine1), C-X-C motif chemokine ligand 2 (Cxcl2), galectin 3 (Lgals3) and angiopoietin-1 (Angpt1) was the top hub gene cluster. The expression of all hub genes was significantly increased after AA-2G intervention (P < 0.05), and the expression of Igf1 and Timp1 increased with increasing intervention concentration. The m6A epigenetic modifications were involved in the AA-2G-induced formation of BMSCs. Igf1, Serpine1 and Cxcl2 in DEGs were enriched for tissue repair, promotion of endothelial and epithelial proliferation and regulation of apoptosis.
Collapse
Affiliation(s)
- Zhiye Yao
- Department of Neonatal Intensive Care Unit, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, No. 106 of Zhongshan Er Road, Yuexiu District, Guangzhou, 510080, China
| | - Liang Chen
- Department of Neonatal Intensive Care Unit, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, No. 106 of Zhongshan Er Road, Yuexiu District, Guangzhou, 510080, China
| | - Yumei Liu
- Department of Neonatal Intensive Care Unit, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, No. 106 of Zhongshan Er Road, Yuexiu District, Guangzhou, 510080, China
| | - Bowen Feng
- Department of Child Health Care, Guangzhou Women and Children's Medical Center, Guangzhou Medical University, Guangzhou, China
| | - Caisheng Liu
- Department of Neonatology, Guangzhou Women and Children's Medical Center, Guangzhou Medical University, Guangzhou, China
| | - Yanling Chen
- Department of Neonatal Intensive Care Unit, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, No. 106 of Zhongshan Er Road, Yuexiu District, Guangzhou, 510080, China
| | - Shaoru He
- Department of Neonatal Intensive Care Unit, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, No. 106 of Zhongshan Er Road, Yuexiu District, Guangzhou, 510080, China.
| |
Collapse
|
10
|
Tang X, Guo M, Zhang Y, Lv J, Gu C, Yang Y. Examining the evidence for mutual modulation between m6A modification and circular RNAs: current knowledge and future prospects. J Exp Clin Cancer Res 2024; 43:216. [PMID: 39095902 PMCID: PMC11297759 DOI: 10.1186/s13046-024-03136-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2024] [Accepted: 07/22/2024] [Indexed: 08/04/2024] Open
Abstract
The resistance of cancer cells to treatment significantly impedes the success of therapy, leading to the recurrence of various types of cancers. Understanding the specific mechanisms of therapy resistance may offer novel approaches for alleviating drug resistance in cancer. Recent research has shown a reciprocal relationship between circular RNAs (circRNAs) and N6-methyladenosine (m6A) modification, and their interaction can affect the resistance and sensitivity of cancer therapy. This review aims to summarize the latest developments in the m6A modification of circRNAs and their importance in regulating therapy resistance in cancer. Furthermore, we explore their mutual interaction and exact mechanisms and provide insights into potential future approaches for reversing cancer resistance.
Collapse
Affiliation(s)
- Xiaozhu Tang
- Nanjing Hospital of Chinese Medicine Affiliated to Nanjing University of Chinese Medicine, Nanjing, China
- School of Medicine, Nanjing University of Chinese Medicine, Nanjing, China
| | - Mengjie Guo
- School of Medicine, Nanjing University of Chinese Medicine, Nanjing, China
| | - Yuanjiao Zhang
- School of Medicine, Nanjing University of Chinese Medicine, Nanjing, China
| | - Junxian Lv
- School of Medicine, Nanjing University of Chinese Medicine, Nanjing, China
| | - Chunyan Gu
- Nanjing Hospital of Chinese Medicine Affiliated to Nanjing University of Chinese Medicine, Nanjing, China.
- School of Medicine, Nanjing University of Chinese Medicine, Nanjing, China.
| | - Ye Yang
- School of Medicine, Nanjing University of Chinese Medicine, Nanjing, China.
| |
Collapse
|
11
|
Esmaeili N, Bakheet A, Tse W, Liu S, Han X. Interaction of the intestinal cytokines-JAKs-STAT3 and 5 axes with RNA N6-methyladenosine to promote chronic inflammation-induced colorectal cancer. Front Oncol 2024; 14:1352845. [PMID: 39136000 PMCID: PMC11317299 DOI: 10.3389/fonc.2024.1352845] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2023] [Accepted: 06/25/2024] [Indexed: 08/15/2024] Open
Abstract
Colorectal cancer (CRC) is one of the most common cancers, with a high mortality rate worldwide. Mounting evidence indicates that mRNA modifications are crucial in RNA metabolism, transcription, processing, splicing, degradation, and translation. Studies show that N6-methyladenosine (m6A) is mammalians' most common epi-transcriptomic modification. It has been demonstrated that m6A is involved in cancer formation, progression, invasion, and metastasis, suggesting it could be a potential biomarker for CRC diagnosis and developing therapeutics. Cytokines, growth factors, and hormones function in JAK/STAT3/5 signaling pathway, and they could regulate the intestinal response to infection, inflammation, and tumorigenesis. Reports show that the JAK/STAT3/5 pathway is involved in CRC development. However, the underlying mechanism is still unclear. Signal Transducer and Activator of Transcription 3/5 (STAT3, STAT5) can act as oncogenes or tumor suppressors in the context of tissue types. Also, epigenetic modifications and mutations could alter the balance between pro-oncogenic and tumor suppressor activities of the STAT3/5 signaling pathway. Thus, exploring the interaction of cytokines-JAKs-STAT3 and/or STAT5 with mRNA m6A is of great interest. This review provides a comprehensive overview of the characteristics and functions of m6A and JAKs-STAT3/5 and their relationship with gastrointestinal (GI) cancers.
Collapse
Affiliation(s)
- Nardana Esmaeili
- Division of Hematology and Oncology, Department of Medicine, MetroHealth Medical Center (MHMC), Case Western Reserve University (CWRU) School of Medicine, Cleveland, OH, United States
- Division of Cancer Biology, Department of Medicine, MetroHealth Medical Center (MHMC), Case Western Reserve University (CWRU) School of Medicine, Cleveland, OH, United States
| | - Ahmed Bakheet
- Division of Hematology and Oncology, Department of Medicine, MetroHealth Medical Center (MHMC), Case Western Reserve University (CWRU) School of Medicine, Cleveland, OH, United States
- Division of Cancer Biology, Department of Medicine, MetroHealth Medical Center (MHMC), Case Western Reserve University (CWRU) School of Medicine, Cleveland, OH, United States
| | - William Tse
- Division of Hematology and Oncology, Department of Medicine, MetroHealth Medical Center (MHMC), Case Western Reserve University (CWRU) School of Medicine, Cleveland, OH, United States
| | - Shujun Liu
- Division of Hematology and Oncology, Department of Medicine, MetroHealth Medical Center (MHMC), Case Western Reserve University (CWRU) School of Medicine, Cleveland, OH, United States
| | - Xiaonan Han
- Division of Hematology and Oncology, Department of Medicine, MetroHealth Medical Center (MHMC), Case Western Reserve University (CWRU) School of Medicine, Cleveland, OH, United States
- Division of Cancer Biology, Department of Medicine, MetroHealth Medical Center (MHMC), Case Western Reserve University (CWRU) School of Medicine, Cleveland, OH, United States
- Cancer Genomics and Epigenomics Program, Case Comprehensive Cancer Center, Case Western Reserve University (CWRU), Cleveland, OH, United States
| |
Collapse
|
12
|
Song R, J Sutton G, Li F, Liu Q, Wong JJL. Variable calling of m6A and associated features in databases: a guide for end-users. Brief Bioinform 2024; 25:bbae434. [PMID: 39258883 PMCID: PMC11388104 DOI: 10.1093/bib/bbae434] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Revised: 07/01/2024] [Accepted: 08/19/2024] [Indexed: 09/12/2024] Open
Abstract
N6-methyladenosine (m$^{6}$A) is a widely-studied methylation to messenger RNAs, which has been linked to diverse cellular processes and human diseases. Numerous databases that collate m$^{6}$A profiles of distinct cell types have been created to facilitate quick and easy mining of m$^{6}$A signatures associated with cell-specific phenotypes. However, these databases contain inherent complexities that have not been explicitly reported, which may lead to inaccurate identification and interpretation of m$^{6}$A-associated biology by end-users who are unaware of them. Here, we review various m$^{6}$A-related databases, and highlight several critical matters. In particular, differences in peak-calling pipelines across databases drive substantial variability in both peak number and coordinates with only moderate reproducibility, and the inclusion of peak calls from early m$^{6}$A sequencing protocols may lead to the reporting of false positives or negatives. The awareness of these matters will help end-users avoid the inclusion of potentially unreliable data in their studies and better utilize m$^{6}$A databases to derive biologically meaningful results.
Collapse
Affiliation(s)
- Renhua Song
- Epigenetics and RNA Biology Laboratory, School of Medical Sciences, The University of Sydney, Camperdown, NSW 2050, Australia
- Faculty of Medicine and Health, The University of Sydney, Camperdown, NSW 2050, Australia
| | - Gavin J Sutton
- Epigenetics and RNA Biology Laboratory, School of Medical Sciences, The University of Sydney, Camperdown, NSW 2050, Australia
- Faculty of Medicine and Health, The University of Sydney, Camperdown, NSW 2050, Australia
| | - Fuyi Li
- College of Information Engineering, Northwest A&F University, Yangling 712100, Shaanxi, China
- South Australian immunoGENomics Cancer Institute (SAiGENCI), The University of Adelaide, Adelaide, South Australia 5005, Australia
| | - Qian Liu
- Nevada Institute of Personalized Medicine, University of Nevada, Las Vegas, Maryland Pkwy, NV 89154, United States
- School of Life Sciences, College of Sciences, University of Nevada, Las Vegas, Maryland Pkwy, NV 89154, United States
| | - Justin J-L Wong
- Epigenetics and RNA Biology Laboratory, School of Medical Sciences, The University of Sydney, Camperdown, NSW 2050, Australia
- Faculty of Medicine and Health, The University of Sydney, Camperdown, NSW 2050, Australia
| |
Collapse
|
13
|
Min YH, Shao WX, Hu QS, Xie NB, Zhang S, Feng YQ, Xing XW, Yuan BF. Simultaneous Detection of Adenosine-to-Inosine Editing and N6-Methyladenosine at Identical RNA Sites through Deamination-Assisted Reverse Transcription Stalling. Anal Chem 2024; 96:8730-8739. [PMID: 38743814 DOI: 10.1021/acs.analchem.4c01022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
Adenosine-to-inosine (A-to-I) editing and N6-methyladenosine (m6A) modifications are pivotal RNA modifications with widespread functional significance in physiological and pathological processes. Although significant effort has been dedicated to developing methodologies for identifying and quantifying these modifications, traditional approaches have often focused on each modification independently, neglecting the potential co-occurrence of A-to-I editing and m6A modifications at the same adenosine residues. This limitation has constrained our understanding of the intricate regulatory mechanisms governing RNA function and the interplay between different types of RNA modifications. To address this gap, we introduced an innovative technique called deamination-assisted reverse transcription stalling (DARTS), specifically designed for the simultaneous quantification of A-to-I editing and m6A at the same RNA sites. DARTS leverages the selective deamination activity of the engineered TadA-TadA8e protein, which converts adenosine residues to inosine, in combination with the unique property of Bst 2.0 DNA polymerase, which stalls when encountering inosine during reverse transcription. This approach enables the accurate quantification of A-to-I editing, m6A, and unmodified adenosine at identical RNA sites. The DARTS method is remarkable for its ability to directly quantify two distinct types of RNA modifications simultaneously, a capability that has remained largely unexplored in the field of RNA biology. By facilitating a comprehensive analysis of the co-occurrence and interaction between A-to-I editing and m6A modifications, DARTS opens new avenues for exploring the complex regulatory networks modulated by different RNA modifications.
Collapse
Affiliation(s)
- Yi-Hao Min
- College of Chemistry and Molecular Sciences, Department of Radiation and Medical Oncology, Zhongnan Hospital of Wuhan University, Wuhan University, Wuhan 430071, China
| | - Wen-Xuan Shao
- College of Chemistry and Molecular Sciences, Department of Radiation and Medical Oncology, Zhongnan Hospital of Wuhan University, Wuhan University, Wuhan 430071, China
| | - Qiu-Shuang Hu
- Department of Occupational and Environmental Health, School of Public Health, Wuhan University, Wuhan 430071, China
| | - Neng-Bin Xie
- Department of Occupational and Environmental Health, School of Public Health, Wuhan University, Wuhan 430071, China
- Research Center of Public Health, Renmin Hospital of Wuhan University, Wuhan University, Wuhan 430060, China
- Cancer Precision Diagnosis and Treatment and Translational Medicine Hubei Engineering Research Center, Zhongnan Hospital of Wuhan University, Wuhan Research Center for Infectious Diseases and Cancer, Chinese Academy of Medical Sciences, Wuhan 430071, China
| | - Shan Zhang
- College of Chemistry and Molecular Sciences, Department of Radiation and Medical Oncology, Zhongnan Hospital of Wuhan University, Wuhan University, Wuhan 430071, China
| | - Yu-Qi Feng
- College of Chemistry and Molecular Sciences, Department of Radiation and Medical Oncology, Zhongnan Hospital of Wuhan University, Wuhan University, Wuhan 430071, China
| | - Xi-Wen Xing
- Department of Biotechnology, College of Life Science and Technology, Jinan University, Guangzhou 510632, China
| | - Bi-Feng Yuan
- College of Chemistry and Molecular Sciences, Department of Radiation and Medical Oncology, Zhongnan Hospital of Wuhan University, Wuhan University, Wuhan 430071, China
- Department of Occupational and Environmental Health, School of Public Health, Wuhan University, Wuhan 430071, China
- Research Center of Public Health, Renmin Hospital of Wuhan University, Wuhan University, Wuhan 430060, China
- Cancer Precision Diagnosis and Treatment and Translational Medicine Hubei Engineering Research Center, Zhongnan Hospital of Wuhan University, Wuhan Research Center for Infectious Diseases and Cancer, Chinese Academy of Medical Sciences, Wuhan 430071, China
| |
Collapse
|
14
|
Ye H, Li T, Rigden DJ, Wei Z. m6ACali: machine learning-powered calibration for accurate m6A detection in MeRIP-Seq. Nucleic Acids Res 2024; 52:4830-4842. [PMID: 38634812 PMCID: PMC11109940 DOI: 10.1093/nar/gkae280] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2023] [Revised: 03/18/2024] [Accepted: 04/04/2024] [Indexed: 04/19/2024] Open
Abstract
We present m6ACali, a novel machine-learning framework aimed at enhancing the accuracy of N6-methyladenosine (m6A) epitranscriptome profiling by reducing the impact of non-specific antibody enrichment in MeRIP-Seq. The calibration model serves as a genomic feature-based classifier that refines the identification of m6A sites, distinguishing those genuinely present from those that can be detected in in-vitro transcribed (IVT) control experiments. We find that m6ACali effectively identifies non-specific binding peaks reported by exomePeak2 and MACS2 in novel MeRIP-Seq datasets without the need for paired IVT controls. The model interpretation revealed that off-target antibody binding sites commonly occur at short exons and short mRNAs, originating from high read coverage regions that share the motif sequence with true m6A sites. We also reveal that the ML strategy can efficiently adjust differentially methylated peaks and other antibody-dependent, base-resolution m6A detection techniques. As a result, m6ACali offers a promising method for the universal enhancement of m6A profiles generated by MeRIP-Seq experiments, elevating the benchmark for omics-level m6A data integration.
Collapse
Affiliation(s)
- Haokai Ye
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L7 8TX Liverpool, UK
| | - Tenglong Li
- Wisdom Lake Academy of Pharmacy, Xi’an Jiaotong-Liverpool University, Suzhou 215123, China
- Department of Biostatistics, School of Public Health, Boston University, Boston, MA, USA
| | - Daniel J Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, L7 8TX Liverpool, UK
| | - Zhen Wei
- Department of Biological Sciences, Xi’an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
- Institute of Life Course and Medical Sciences, University of Liverpool, L7 8TX Liverpool, UK
| |
Collapse
|
15
|
Rennie S. Deep Learning for Elucidating Modifications to RNA-Status and Challenges Ahead. Genes (Basel) 2024; 15:629. [PMID: 38790258 PMCID: PMC11121098 DOI: 10.3390/genes15050629] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2024] [Revised: 05/11/2024] [Accepted: 05/11/2024] [Indexed: 05/26/2024] Open
Abstract
RNA-binding proteins and chemical modifications to RNA play vital roles in the co- and post-transcriptional regulation of genes. In order to fully decipher their biological roles, it is an essential task to catalogue their precise target locations along with their preferred contexts and sequence-based determinants. Recently, deep learning approaches have significantly advanced in this field. These methods can predict the presence or absence of modification at specific genomic regions based on diverse features, particularly sequence and secondary structure, allowing us to decipher the highly non-linear sequence patterns and structures that underlie site preferences. This article provides an overview of how deep learning is being applied to this area, with a particular focus on the problem of mRNA-RBP binding, while also considering other types of chemical modification to RNA. It discusses how different types of model can handle sequence-based and/or secondary-structure-based inputs, the process of model training, including choice of negative regions and separating sets for testing and training, and offers recommendations for developing biologically relevant models. Finally, it highlights four key areas that are crucial for advancing the field.
Collapse
Affiliation(s)
- Sarah Rennie
- Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, 2200 Copenhagen, Denmark
| |
Collapse
|
16
|
Tu G, Wang X, Xia R, Song B. m6A-TCPred: a web server to predict tissue-conserved human m 6A sites using machine learning approach. BMC Bioinformatics 2024; 25:127. [PMID: 38528499 PMCID: PMC10962094 DOI: 10.1186/s12859-024-05738-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Accepted: 03/11/2024] [Indexed: 03/27/2024] Open
Abstract
BACKGROUND N6-methyladenosine (m6A) is the most prevalent post-transcriptional modification in eukaryotic cells that plays a crucial role in regulating various biological processes, and dysregulation of m6A status is involved in multiple human diseases including cancer contexts. A number of prediction frameworks have been proposed for high-accuracy identification of putative m6A sites, however, none have targeted for direct prediction of tissue-conserved m6A modified residues from non-conserved ones at base-resolution level. RESULTS We report here m6A-TCPred, a computational tool for predicting tissue-conserved m6A residues using m6A profiling data from 23 human tissues. By taking advantage of the traditional sequence-based characteristics and additional genome-derived information, m6A-TCPred successfully captured distinct patterns between potentially tissue-conserved m6A modifications and non-conserved ones, with an average AUROC of 0.871 and 0.879 tested on cross-validation and independent datasets, respectively. CONCLUSION Our results have been integrated into an online platform: a database holding 268,115 high confidence m6A sites with their conserved information across 23 human tissues; and a web server to predict the conserved status of user-provided m6A collections. The web interface of m6A-TCPred is freely accessible at: www.rnamd.org/m6ATCPred .
Collapse
Affiliation(s)
- Gang Tu
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, 215123, China
| | - Xuan Wang
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, 215123, China.
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, L7 8TX, UK.
| | - Rong Xia
- Department of Financial and Actuarial Mathematics, Xi'an Jiaotong-Liverpool University, Suzhou, 215123, China
| | - Bowen Song
- Department of Public Health, School of Medicine, Nanjing University of Chinese Medicine, Nanjing, 210023, China
| |
Collapse
|
17
|
Kovaka S, Hook PW, Jenike KM, Shivakumar V, Morina LB, Razaghi R, Timp W, Schatz MC. Uncalled4 improves nanopore DNA and RNA modification detection via fast and accurate signal alignment. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.05.583511. [PMID: 38496646 PMCID: PMC10942365 DOI: 10.1101/2024.03.05.583511] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/19/2024]
Abstract
Nanopore signal analysis enables detection of nucleotide modifications from native DNA and RNA sequencing, providing both accurate genetic/transcriptomic and epigenetic information without additional library preparation. Presently, only a limited set of modifications can be directly basecalled (e.g. 5-methylcytosine), while most others require exploratory methods that often begin with alignment of nanopore signal to a nucleotide reference. We present Uncalled4, a toolkit for nanopore signal alignment, analysis, and visualization. Uncalled4 features an efficient banded signal alignment algorithm, BAM signal alignment file format, statistics for comparing signal alignment methods, and a reproducible de novo training method for k-mer-based pore models, revealing potential errors in ONT's state-of-the-art DNA model. We apply Uncalled4 to RNA 6-methyladenine (m6A) detection in seven human cell lines, identifying 26% more modifications than Nanopolish using m6Anet, including in several genes where m6A has known implications in cancer. Uncalled4 is available open-source at github.com/skovaka/uncalled4.
Collapse
|
18
|
Rigden DJ, Fernández XM. The 2024 Nucleic Acids Research database issue and the online molecular biology database collection. Nucleic Acids Res 2024; 52:D1-D9. [PMID: 38035367 PMCID: PMC10767945 DOI: 10.1093/nar/gkad1173] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Accepted: 11/23/2023] [Indexed: 12/02/2023] Open
Abstract
The 2024 Nucleic Acids Research database issue contains 180 papers from across biology and neighbouring disciplines. There are 90 papers reporting on new databases and 83 updates from resources previously published in the Issue. Updates from databases most recently published elsewhere account for a further seven. Nucleic acid databases include the new NAKB for structural information and updates from Genbank, ENA, GEO, Tarbase and JASPAR. The Issue's Breakthrough Article concerns NMPFamsDB for novel prokaryotic protein families and the AlphaFold Protein Structure Database has an important update. Metabolism is covered by updates from Reactome, Wikipathways and Metabolights. Microbes are covered by RefSeq, UNITE, SPIRE and P10K; viruses by ViralZone and PhageScope. Medically-oriented databases include the familiar COSMIC, Drugbank and TTD. Genomics-related resources include Ensembl, UCSC Genome Browser and Monarch. New arrivals cover plant imaging (OPIA and PlantPAD) and crop plants (SoyMD, TCOD and CropGS-Hub). The entire Database Issue is freely available online on the Nucleic Acids Research website (https://academic.oup.com/nar). Over the last year the NAR online Molecular Biology Database Collection has been updated, reviewing 1060 entries, adding 97 new resources and eliminating 388 discontinued URLs bringing the current total to 1959 databases. It is available at http://www.oxfordjournals.org/nar/database/c/.
Collapse
Affiliation(s)
- Daniel J Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Crown Street, Liverpool L69 7ZB, UK
| | | |
Collapse
|
19
|
Lang X, Yu C, Shen M, Gu L, Qian Q, Zhou D, Tan J, Li Y, Peng X, Diao S, Deng Z, Ruan Z, Xu Z, Xing J, Li C, Wang R, Ding C, Cao Y, Liu Q. PRMD: an integrated database for plant RNA modifications. Nucleic Acids Res 2024; 52:D1597-D1613. [PMID: 37831097 PMCID: PMC10768107 DOI: 10.1093/nar/gkad851] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2023] [Revised: 08/23/2023] [Accepted: 09/23/2023] [Indexed: 10/14/2023] Open
Abstract
The scope and function of RNA modifications in model plant systems have been extensively studied, resulting in the identification of an increasing number of novel RNA modifications in recent years. Researchers have gradually revealed that RNA modifications, especially N6-methyladenosine (m6A), which is one of the most abundant and commonly studied RNA modifications in plants, have important roles in physiological and pathological processes. These modifications alter the structure of RNA, which affects its molecular complementarity and binding to specific proteins, thereby resulting in various of physiological effects. The increasing interest in plant RNA modifications has necessitated research into RNA modifications and associated datasets. However, there is a lack of a convenient and integrated database with comprehensive annotations and intuitive visualization of plant RNA modifications. Here, we developed the Plant RNA Modification Database (PRMD; http://bioinformatics.sc.cn/PRMD and http://rnainformatics.org.cn/PRMD) to facilitate RNA modification research. This database contains information regarding 20 plant species and provides an intuitive interface for displaying information. Moreover, PRMD offers multiple tools, including RMlevelDiff, RMplantVar, RNAmodNet and Blast (for functional analyses), and mRNAbrowse, RNAlollipop, JBrowse and Integrative Genomics Viewer (for displaying data). Furthermore, PRMD is freely available, making it useful for the rapid development and promotion of research on plant RNA modifications.
Collapse
Affiliation(s)
- Xiaoqiang Lang
- State Key Laboratory of Tree Genetics and Breeding, Key Laboratory of Tree Breeding and Cultivation of State Forestry Administration, Research Institute of Forestry, Chinese Academy of Forestry, Beijing 100091, China
- Microbiology and Metabolic Engineering Key Laboratory of Sichuan Province, College of Life Science, Sichuan University, Chengdu, Sichuan, 610041, China
| | - Chunyan Yu
- Frontiers Science Center for Disease-related Molecular Network, Laboratory of Omics Technology and Bioinformatics, West China Hospital, Sichuan University, Chengdu, Sichuan, 610041, China
| | - Mengyuan Shen
- Rice Research Institute, Guangdong Academy of Agricultural Sciences, Key Laboratory of Genetics and Breeding of High Quality Rice in Southern China (Co-construction by Ministry and Province), Ministry of Agriculture and Rural Affairs, Guangdong Key Laboratory of New Technology in Rice Breeding, Guangdong Rice Engineering Laboratory, Guangzhou, 510640, China
| | - Lei Gu
- Epigenetics Laboratory, Max Planck Institute for Heart and Lung Research & Cardiopulmonary Institute (CPI). Parkstr.1 61231 Bad Nauheim Germany
| | - Qian Qian
- Rice Research Institute, Guangdong Academy of Agricultural Sciences, Key Laboratory of Genetics and Breeding of High Quality Rice in Southern China (Co-construction by Ministry and Province), Ministry of Agriculture and Rural Affairs, Guangdong Key Laboratory of New Technology in Rice Breeding, Guangdong Rice Engineering Laboratory, Guangzhou, 510640, China
| | - Degui Zhou
- Rice Research Institute, Guangdong Academy of Agricultural Sciences, Key Laboratory of Genetics and Breeding of High Quality Rice in Southern China (Co-construction by Ministry and Province), Ministry of Agriculture and Rural Affairs, Guangdong Key Laboratory of New Technology in Rice Breeding, Guangdong Rice Engineering Laboratory, Guangzhou, 510640, China
| | - Jiantao Tan
- Rice Research Institute, Guangdong Academy of Agricultural Sciences, Key Laboratory of Genetics and Breeding of High Quality Rice in Southern China (Co-construction by Ministry and Province), Ministry of Agriculture and Rural Affairs, Guangdong Key Laboratory of New Technology in Rice Breeding, Guangdong Rice Engineering Laboratory, Guangzhou, 510640, China
| | - Yiliang Li
- Guangdong Provincial Key Laboratory of Silviculture, Protection and Utilization/Guangdong Academy of Forestry, Guangzhou, Guangdong 510520, China
| | - Xin Peng
- Rice Research Institute, Guangdong Academy of Agricultural Sciences, Key Laboratory of Genetics and Breeding of High Quality Rice in Southern China (Co-construction by Ministry and Province), Ministry of Agriculture and Rural Affairs, Guangdong Key Laboratory of New Technology in Rice Breeding, Guangdong Rice Engineering Laboratory, Guangzhou, 510640, China
| | - Shu Diao
- Research Institute of Subtropical Forestry, Chinese Academy of Forestry, Hangzhou, China
| | - Zhujun Deng
- Precision Medicine Center, Precision Medicine Key Laboratory of Sichuan Province, West China Hospital, Sichuan University, Chengdu, Sichuan, 610041, China
| | - Zhaohui Ruan
- Sun Yat-sen University Cancer Center, State Key Laboratory Oncology in South China, Collaborative Innovation Center of Cancer Medicine, 510060, Guangzhou, China
| | - Zhi Xu
- Guangxi Key Laboratory of Images and Graphics Intelligent Processing, Guilin University of Electronics Technology, Guilin, 541004, China
| | - Junlian Xing
- Rice Research Institute, Guangdong Academy of Agricultural Sciences, Key Laboratory of Genetics and Breeding of High Quality Rice in Southern China (Co-construction by Ministry and Province), Ministry of Agriculture and Rural Affairs, Guangdong Key Laboratory of New Technology in Rice Breeding, Guangdong Rice Engineering Laboratory, Guangzhou, 510640, China
| | - Chen Li
- Rice Research Institute, Guangdong Academy of Agricultural Sciences, Key Laboratory of Genetics and Breeding of High Quality Rice in Southern China (Co-construction by Ministry and Province), Ministry of Agriculture and Rural Affairs, Guangdong Key Laboratory of New Technology in Rice Breeding, Guangdong Rice Engineering Laboratory, Guangzhou, 510640, China
| | - Runfeng Wang
- Guangdong Provincial Key Laboratory of Crop Genetic Improvement, Crops Research Institute, Guangdong Academy of Agricultural Sciences, Guangzhou, China
| | - Changjun Ding
- State Key Laboratory of Tree Genetics and Breeding, Key Laboratory of Tree Breeding and Cultivation of State Forestry Administration, Research Institute of Forestry, Chinese Academy of Forestry, Beijing 100091, China
| | - Yi Cao
- Microbiology and Metabolic Engineering Key Laboratory of Sichuan Province, College of Life Science, Sichuan University, Chengdu, Sichuan, 610041, China
| | - Qi Liu
- Rice Research Institute, Guangdong Academy of Agricultural Sciences, Key Laboratory of Genetics and Breeding of High Quality Rice in Southern China (Co-construction by Ministry and Province), Ministry of Agriculture and Rural Affairs, Guangdong Key Laboratory of New Technology in Rice Breeding, Guangdong Rice Engineering Laboratory, Guangzhou, 510640, China
| |
Collapse
|
20
|
Schievelbein MJ, Resende C, Glennon MM, Kerosky M, Brown JA. Global RNA modifications to the MALAT1 triple helix differentially affect thermostability and weaken binding to METTL16. J Biol Chem 2024; 300:105548. [PMID: 38092148 PMCID: PMC10805700 DOI: 10.1016/j.jbc.2023.105548] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 11/27/2023] [Accepted: 12/01/2023] [Indexed: 12/28/2023] Open
Abstract
Therapeutic mRNAs are generated using modified nucleotides, namely N1-methylpseudouridine (m1Ψ) triphosphate, so that the mRNA evades detection by the immune system. RNA modifications, even at a single-nucleotide position, perturb RNA structure, although it is not well understood how structure and function is impacted by globally modified RNAs. Therefore, we examined the metastasis-associated lung adenocarcinoma transcript 1 triple helix, a highly structured stability element that includes single-, double-, and triple-stranded RNA, globally modified with N6-methyladenosine (m6A), pseudouridine (Ψ), or m1Ψ. UV thermal denaturation assays showed that m6A destabilizes both the Hoogsteen and Watson-Crick faces of the RNA by ∼20 °C, Ψ stabilizes the Hoogsteen and Watson-Crick faces of the RNA by ∼12 °C, and m1Ψ has minimal effect on the stability of the Hoogsteen face of the RNA but increases the stability of the Watson-Crick face by ∼9 °C. Native gel-shift assays revealed that binding of the methyltransferase-like protein 16 to the metastasis-associated lung adenocarcinoma transcript 1 triple helix was weakened by at least 8-, 99-, and 23-fold, respectively, when RNA is globally modified with m6A, Ψ, or m1Ψ. These results demonstrate that a more thermostable RNA structure does not lead to tighter RNA-protein interactions, thereby highlighting the regulatory power of RNA modifications by multiple means.
Collapse
Affiliation(s)
- Mika J Schievelbein
- Department of Chemistry and Biochemistry, University of Notre Dame, Notre Dame, Indiana, USA
| | - Carlos Resende
- Department of Chemistry and Biochemistry, University of Notre Dame, Notre Dame, Indiana, USA
| | - Madeline M Glennon
- Department of Chemistry and Biochemistry, University of Notre Dame, Notre Dame, Indiana, USA
| | - Matthew Kerosky
- Department of Chemistry and Biochemistry, University of Notre Dame, Notre Dame, Indiana, USA
| | - Jessica A Brown
- Department of Chemistry and Biochemistry, University of Notre Dame, Notre Dame, Indiana, USA.
| |
Collapse
|
21
|
Xia H, Xu X, Guo Y, Deng X, Wang Y, Fu S. Molecular Characterization and Establishment of a Prognostic Model Based on Primary Immunodeficiency Features in Association with RNA Modifications in Triple-Negative Breast Cancer. Genes (Basel) 2023; 14:2172. [PMID: 38136994 PMCID: PMC10743198 DOI: 10.3390/genes14122172] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Revised: 11/23/2023] [Accepted: 11/26/2023] [Indexed: 12/24/2023] Open
Abstract
Triple-negative breast cancer (TNBC) is the most aggressive subtype of breast cancer. Although immunotherapy is effective for some patients, most find it difficult to benefit from it. This study aims to explore the impact of specific immune pathways and their regulated molecular mechanisms in TNBC. The gene expression data of breast cancer patients were obtained from the TCGA and METABRIC databases. Gene set variation analysis (GSVA) revealed specific upregulation or abnormal expression of immunodeficiency pathways in TNBC patients. Multi-omics data showed significant differential expression of Primary Immunodeficiency Genes (PIDGs) in TNBC patients, who are prone to genomic-level variations. Consensus clustering was used in two datasets to classify patients into two distinct molecular subtypes based on PIDGs expression patterns, with each displaying different biological features and immune landscapes. To further explore the prognostic characteristics of PIDGs-regulated molecules, we constructed a four-gene prognostic PIDG score model and a nomogram using least absolute shrinkage and selection operator (LASSO) regression analysis in combination with clinicopathological parameters. The PIDG score was closely associated with the immune therapy and drug sensitivity of TNBC patients, providing potential guidance for clinical treatment. Particularly noteworthy is the close association of this scoring with RNA modifications; patients with different scores also exhibited different mutation landscapes. This study offers new insights for the clinical treatment of TNBC and for identifying novel prognostic markers and therapeutic targets in TNBC.
Collapse
Affiliation(s)
- Hongzhuo Xia
- The Key Laboratory of Model Animals and Stem Cell Biology in Hunan Province, Departments of Pathology and Pathophysiology, Hunan Normal University School of Medicine, Changsha 410013, China; (H.X.); (X.X.); (Y.G.); (X.D.)
- The Key Laboratory of Translational Cancer Stem Cell Research, Department of Pathophysiology, Hunan Normal University, Changsha 410012, China
| | - Xi Xu
- The Key Laboratory of Model Animals and Stem Cell Biology in Hunan Province, Departments of Pathology and Pathophysiology, Hunan Normal University School of Medicine, Changsha 410013, China; (H.X.); (X.X.); (Y.G.); (X.D.)
- The Key Laboratory of Translational Cancer Stem Cell Research, Department of Pathophysiology, Hunan Normal University, Changsha 410012, China
| | - Yuxuan Guo
- The Key Laboratory of Model Animals and Stem Cell Biology in Hunan Province, Departments of Pathology and Pathophysiology, Hunan Normal University School of Medicine, Changsha 410013, China; (H.X.); (X.X.); (Y.G.); (X.D.)
- The Key Laboratory of Translational Cancer Stem Cell Research, Department of Pathophysiology, Hunan Normal University, Changsha 410012, China
| | - Xiyun Deng
- The Key Laboratory of Model Animals and Stem Cell Biology in Hunan Province, Departments of Pathology and Pathophysiology, Hunan Normal University School of Medicine, Changsha 410013, China; (H.X.); (X.X.); (Y.G.); (X.D.)
- The Key Laboratory of Translational Cancer Stem Cell Research, Department of Pathophysiology, Hunan Normal University, Changsha 410012, China
| | - Yian Wang
- The Key Laboratory of Model Animals and Stem Cell Biology in Hunan Province, Departments of Pathology and Pathophysiology, Hunan Normal University School of Medicine, Changsha 410013, China; (H.X.); (X.X.); (Y.G.); (X.D.)
- The Engineering Research Center of Reproduction and Translational Medicine of Hunan Province, Changsha 410013, China
| | - Shujun Fu
- The Key Laboratory of Model Animals and Stem Cell Biology in Hunan Province, Departments of Pathology and Pathophysiology, Hunan Normal University School of Medicine, Changsha 410013, China; (H.X.); (X.X.); (Y.G.); (X.D.)
- The Key Laboratory of Translational Cancer Stem Cell Research, Department of Pathophysiology, Hunan Normal University, Changsha 410012, China
| |
Collapse
|
22
|
Xu Z, Wang X, Meng J, Zhang L, Song B. m5U-GEPred: prediction of RNA 5-methyluridine sites based on sequence-derived and graph embedding features. Front Microbiol 2023; 14:1277099. [PMID: 37937221 PMCID: PMC10627201 DOI: 10.3389/fmicb.2023.1277099] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Accepted: 10/02/2023] [Indexed: 11/09/2023] Open
Abstract
5-Methyluridine (m5U) is one of the most common post-transcriptional RNA modifications, which is involved in a variety of important biological processes and disease development. The precise identification of the m5U sites allows for a better understanding of the biological processes of RNA and contributes to the discovery of new RNA functional and therapeutic targets. Here, we present m5U-GEPred, a prediction framework, to combine sequence characteristics and graph embedding-based information for m5U identification. The graph embedding approach was introduced to extract the global information of training data that complemented the local information represented by conventional sequence features, thereby enhancing the prediction performance of m5U identification. m5U-GEPred outperformed the state-of-the-art m5U predictors built on two independent species, with an average AUROC of 0.984 and 0.985 tested on human and yeast transcriptomes, respectively. To further validate the performance of our newly proposed framework, the experimentally validated m5U sites identified from Oxford Nanopore Technology (ONT) were collected as independent testing data, and in this project, m5U-GEPred achieved reasonable prediction performance with ACC of 91.84%. We hope that m5U-GEPred should make a useful computational alternative for m5U identification.
Collapse
Affiliation(s)
- Zhongxing Xu
- Department of Public Health, School of Medicine and Holistic Integrative Medicine, Nanjing University of Chinese Medicine, Nanjing, China
- School of AI and Advanced Computing, Xi'an Jiaotong-Liverpool University, Suzhou, China
| | - Xuan Wang
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, China
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, United Kingdom
| | - Jia Meng
- Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, China
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, United Kingdom
- AI University Research Centre, Xi'an Jiaotong-Liverpool University, Suzhou, China
| | - Lin Zhang
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, China
| | - Bowen Song
- Department of Public Health, School of Medicine and Holistic Integrative Medicine, Nanjing University of Chinese Medicine, Nanjing, China
| |
Collapse
|