1
|
Lasantha D, Vidanagamachchi S, Nallaperuma S. CRIECNN: Ensemble convolutional neural network and advanced feature extraction methods for the precise forecasting of circRNA-RBP binding sites. Comput Biol Med 2024; 174:108466. [PMID: 38615462 DOI: 10.1016/j.compbiomed.2024.108466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Revised: 03/29/2024] [Accepted: 04/08/2024] [Indexed: 04/16/2024]
Abstract
Circular RNAs (circRNAs) have surfaced as important non-coding RNA molecules in biology. Understanding interactions between circRNAs and RNA-binding proteins (RBPs) is crucial in circRNA research. Existing prediction models suffer from limited availability and accuracy, necessitating advanced approaches. In this study, we propose CRIECNN (Circular RNA-RBP Interaction predictor using an Ensemble Convolutional Neural Network), a novel ensemble deep learning model that enhances circRNA-RBP binding site prediction accuracy. CRIECNN employs advanced feature extraction methods and evaluates four distinct sequence datasets and encoding techniques (BERT, Doc2Vec, KNF, EIIP). The model consists of an ensemble convolutional neural network, a BiLSTM, and a self-attention mechanism for feature refinement. Our results demonstrate that CRIECNN outperforms state-of-the-art methods in accuracy and performance, effectively predicting circRNA-RBP interactions from both full-length sequences and fragments. This novel strategy makes an enormous advancement in the prediction of circRNA-RBP interactions, improving our understanding of circRNAs and their regulatory roles.
Collapse
Affiliation(s)
- Dilan Lasantha
- Department of Computer Science, University of Ruhuna, Sri Lanka.
| | | | - Sam Nallaperuma
- Department of Engineering, University of Cambridge, United Kingdom.
| |
Collapse
|
2
|
Li X, Qu W, Yan J, Tan J. RPI-EDLCN: An Ensemble Deep Learning Framework Based on Capsule Network for ncRNA-Protein Interaction Prediction. J Chem Inf Model 2024; 64:2221-2235. [PMID: 37158609 DOI: 10.1021/acs.jcim.3c00377] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
Noncoding RNAs (ncRNAs) play crucial roles in many cellular life activities by interacting with proteins. Identification of ncRNA-protein interactions (ncRPIs) is key to understanding the function of ncRNAs. Although a number of computational methods for predicting ncRPIs have been developed, the problem of predicting ncRPIs remains challenging. It has always been the focus of ncRPIs research to select suitable feature extraction methods and develop a deep learning architecture with better recognition performance. In this work, we proposed an ensemble deep learning framework, RPI-EDLCN, based on a capsule network (CapsuleNet) to predict ncRPIs. In terms of feature input, we extracted the sequence features, secondary structure sequence features, motif information, and physicochemical properties of ncRNA/protein. The sequence and secondary structure sequence features of ncRNA/protein are encoded by the conjoint k-mer method and then input into an ensemble deep learning model based on CapsuleNet by combining the motif information and physicochemical properties. In this model, the encoding features are processed by convolution neural network (CNN), deep neural network (DNN), and stacked autoencoder (SAE). Then the advanced features obtained from the processing are input into the CapsuleNet for further feature learning. Compared with other state-of-the-art methods under 5-fold cross-validation, the performance of RPI-EDLCN is the best, and the accuracy of RPI-EDLCN on RPI1807, RPI2241, and NPInter v2.0 data sets was 93.8%, 88.2%, and 91.9%, respectively. The results of the independent test indicated that RPI-EDLCN can effectively predict potential ncRPIs in different organisms. In addition, RPI-EDLCN successfully predicted hub ncRNAs and proteins in Mus musculus ncRNA-protein networks. Overall, our model can be used as an effective tool to predict ncRPIs and provides some useful guidance for future biological studies.
Collapse
Affiliation(s)
- Xiaoyi Li
- Department of Biomedical Engineering, Faculty of Environment and Life, Beijing University of Technology, Beijing International Science and Technology Cooperation Base for Intelligent Physiological Measurement and Clinical Transformation, Beijing 100124, China
| | - Wenyan Qu
- Department of Biomedical Engineering, Faculty of Environment and Life, Beijing University of Technology, Beijing International Science and Technology Cooperation Base for Intelligent Physiological Measurement and Clinical Transformation, Beijing 100124, China
| | - Jing Yan
- Department of Biomedical Engineering, Faculty of Environment and Life, Beijing University of Technology, Beijing International Science and Technology Cooperation Base for Intelligent Physiological Measurement and Clinical Transformation, Beijing 100124, China
| | - Jianjun Tan
- Department of Biomedical Engineering, Faculty of Environment and Life, Beijing University of Technology, Beijing International Science and Technology Cooperation Base for Intelligent Physiological Measurement and Clinical Transformation, Beijing 100124, China
| |
Collapse
|
3
|
Liu L, Wei Y, Tan Z, Zhang Q, Sun J, Zhao Q. Predicting circRNA-RBP Binding Sites Using a Hybrid Deep Neural Network. Interdiscip Sci 2024:10.1007/s12539-024-00616-z. [PMID: 38381315 DOI: 10.1007/s12539-024-00616-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Revised: 01/26/2024] [Accepted: 01/29/2024] [Indexed: 02/22/2024]
Abstract
Circular RNAs (circRNAs) are non-coding RNAs generated by reverse splicing. They are involved in biological process and human diseases by interacting with specific RNA-binding proteins (RBPs). Due to traditional biological experiments being costly, computational methods have been proposed to predict the circRNA-RBP interaction. However, these methods have problems of single feature extraction. Therefore, we propose a novel model called circ-FHN, which utilizes only circRNA sequences to predict circRNA-RBP interactions. The circ-FHN approach involves feature coding and a hybrid deep learning model. Feature coding takes into account the physicochemical properties of circRNA sequences and employs four coding methods to extract sequence features. The hybrid deep structure comprises a convolutional neural network (CNN) and a bidirectional gated recurrent unit (BiGRU). The CNN learns high-level abstract features, while the BiGRU captures long-term dependencies in the sequence. To assess the effectiveness of circ-FHN, we compared it to other computational methods on 16 datasets and conducted ablation experiments. Additionally, we conducted motif analysis. The results demonstrate that circ-FHN exhibits exceptional performance and surpasses other methods. circ-FHN is freely available at https://github.com/zhaoqi106/circ-FHN .
Collapse
Affiliation(s)
- Liwei Liu
- College of Science, Dalian Jiaotong University, Dalian, 116028, China
- Key Laboratory of Computational Science and Application of Hainan Province, Hainan Normal University, Haikou, 571158, China
| | - Yixin Wei
- College of Science, Dalian Jiaotong University, Dalian, 116028, China
| | - Zhebin Tan
- College of Software, Dalian Jiaotong University, Dalian, 116028, China
| | - Qi Zhang
- College of Science, Dalian Jiaotong University, Dalian, 116028, China
| | - Jianqiang Sun
- School of Information Science and Engineering, Linyi University, Linyi, 276000, China.
| | - Qi Zhao
- School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan, 114051, China.
| |
Collapse
|
4
|
Shen Z, Liu W, Zhao S, Zhang Q, Wang S, Yuan L. Nucleotide-level prediction of CircRNA-protein binding based on fully convolutional neural network. Front Genet 2023; 14:1283404. [PMID: 37867600 PMCID: PMC10587422 DOI: 10.3389/fgene.2023.1283404] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2023] [Accepted: 09/21/2023] [Indexed: 10/24/2023] Open
Abstract
Introduction: CircRNA-protein binding plays a critical role in complex biological activity and disease. Various deep learning-based algorithms have been proposed to identify CircRNA-protein binding sites. These methods predict whether the CircRNA sequence includes protein binding sites from the sequence level, and primarily concentrate on analysing the sequence specificity of CircRNA-protein binding. For model performance, these methods are unsatisfactory in accurately predicting motif sites that have special functions in gene expression. Methods: In this study, based on the deep learning models that implement pixel-level binary classification prediction in computer vision, we viewed the CircRNA-protein binding sites prediction as a nucleotide-level binary classification task, and use a fully convolutional neural networks to identify CircRNA-protein binding motif sites (CPBFCN). Results: CPBFCN provides a new path to predict CircRNA motifs. Based on the MEME tool, the existing CircRNA-related and protein-related database, we analysed the motif functions discovered by CPBFCN. We also investigated the correlation between CircRNA sponge and motif distribution. Furthermore, by comparing the motif distribution with different input sequence lengths, we found that some motifs in the flanking sequences of CircRNA-protein binding region may contribute to CircRNA-protein binding. Conclusion: This study contributes to identify circRNA-protein binding and provides help in understanding the role of circRNA-protein binding in gene expression regulation.
Collapse
Affiliation(s)
- Zhen Shen
- School of Computer and Software, Nanyang Institute of Technology, Nanyang, Henan, China
| | - Wei Liu
- School of Computer and Software, Nanyang Institute of Technology, Nanyang, Henan, China
| | - ShuJun Zhao
- School of Computer and Software, Nanyang Institute of Technology, Nanyang, Henan, China
| | - QinHu Zhang
- EIT Institute for Advanced Study, Ningbo, Zhejiang, China
| | - SiGuo Wang
- EIT Institute for Advanced Study, Ningbo, Zhejiang, China
| | - Lin Yuan
- Key Laboratory of Computing Power Network and Information Security, Ministry of Education, Shandong Computer Science Center, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China
- Shandong Engineering Research Center of Big Data Applied Technology, Faculty of Computer Science and Technology, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China
- Shandong Provincial Key Laboratory of Computer Networks, Shandong Fundamental Research Center for Computer Science, Jinan, China
| |
Collapse
|
5
|
Liu N, Zhang Z, Wu Y, Wang Y, Liang Y. CRBSP:Prediction of CircRNA-RBP Binding Sites Based on Multimodal Intermediate Fusion. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:2898-2906. [PMID: 37130249 DOI: 10.1109/tcbb.2023.3272400] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Circular RNA (CircRNA) is widely expressed and has physiological and pathological significance, regulating post-transcriptional processes via its protein-binding activity. However, whereas much work has been done on linear RNA and RNA binding protein (RBP), little is known about the binding sites of CircRNA. The current report is on the development of a medium-term multimodal data fusion strategy, CRBSP, to predict CircRNA-RBP binding sites. CRBSP represents the CircRNA trinucleotide semantic, location, composition and frequency information as the corresponding coding methods of Word to vector (Word2vec), Position-specific trinucleotide propensity (PSTNP), Pseudo trinucleotide composition (PseTNC) and Trinucleotide nucleotide composition (TNC), respectively. CNN (Convolution Neural Networks) was used to extract global information and BiLSTM (bidirectional Long- and Short-Term Memory network) encoder and LSTM (Long- and Short-Term Memory network) decoder for local sequence information. Enhancement of the contributions of key features by the self-attention mechanism was followed by mid-term fusion of the four enhanced features. Logistic Regression (LR) classifier showed that CRBSP gives a mean AUC value of 0.9362 through 5-fold Cross Validation of all 37 datasets, a performance which is superior to five current state-of-the-art models. Similar evaluation of linear RNA-RBP binding sites gave an AUC value of 0.7615 which is also higher than other prediction methods, demonstrating the robustness of CRBSP.
Collapse
|
6
|
Mursalim MKN, Mengko TLER, Hertadi R, Purwarianti A, Susanty M. BiCaps-DBP: Predicting DNA-binding proteins from protein sequences using Bi-LSTM and a 1D-capsule network. Comput Biol Med 2023; 163:107241. [PMID: 37437362 DOI: 10.1016/j.compbiomed.2023.107241] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Revised: 06/23/2023] [Accepted: 07/07/2023] [Indexed: 07/14/2023]
Abstract
Predicting DNA-binding proteins (DBPs) based solely on primary sequences is one of the most challenging problems in genome annotation. DBPs play a crucial role in various biological processes, including DNA replication, transcription, repair, and splicing. Some DBPs are essential in pharmaceutical research on various human cancers and autoimmune diseases. Existing experimental methods for identifying DBPs are time-consuming and costly. Therefore, developing a rapid and accurate computational technique is necessary to address the issue. This study introduces BiCaps-DBP, a deep learning-based method that improves DBP prediction performance by combining bidirectional long short-term memory with a 1D-capsule network. This study uses three training and independent datasets to evaluate the proposed model's generalizability and robustness. Based on three independent datasets, BiCaps-DBP achieved 1.05%, 5.79% and 0.40% higher accuracies than an existing predictor for PDB2272, PDB186 and PDB20000, respectively. These outcomes indicate that the proposed method is a promising DBP predictor.
Collapse
Affiliation(s)
- Muhammad K N Mursalim
- School of Electrical Engineering and Informatics, Bandung Institute of Technology, Bandung, 40132, Indonesia; Department of Informatics Engineering, Universal University, Batam, Indonesia
| | - Tati L E R Mengko
- School of Electrical Engineering and Informatics, Bandung Institute of Technology, Bandung, 40132, Indonesia.
| | - Rukman Hertadi
- Faculty of Mathematics and Natural Sciences, Bandung Institute of Technology, Bandung, 40132, Indonesia
| | - Ayu Purwarianti
- School of Electrical Engineering and Informatics, Bandung Institute of Technology, Bandung, 40132, Indonesia; Center for Artificial Intelligence (U-CoE AI-VLB), Bandung Institute of Technology, Bandung, 40132, Indonesia
| | - Meredita Susanty
- School of Electrical Engineering and Informatics, Bandung Institute of Technology, Bandung, 40132, Indonesia; Department of Computer Science, Pertamina University, Jakarta, Indonesia
| |
Collapse
|
7
|
Xu QR, Liu JL, Zhu RR, Huang WX, Huang H, Liu JC, Xu XP, Zhou XL. NSD2 promotes pressure overload-induced cardiac hypertrophy via activating circCmiss1/TfR1/ferroptosis signaling. Life Sci 2023:121873. [PMID: 37352916 DOI: 10.1016/j.lfs.2023.121873] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2023] [Revised: 06/14/2023] [Accepted: 06/16/2023] [Indexed: 06/25/2023]
Abstract
Heart failure typically occurs early in the clinical course of sustained cardiac hypertrophy that is accompanied by maladaptive remodeling of the heart. It is critical to discover new mechanisms and effective therapeutic targets to prevent and cure pathological cardiac hypertrophy. The objective of the study was to evaluate the effects of circRNAs on NSD2-induced ventricular remodeling. We screened the dysregulated circRNAs in normal or NSD2-/- C57BL/6 mice with or without transverse aortic constriction (TAC), and found that circCmss1 significantly increased in normal TAC mice, but decreased in NSD2-/- TAC mice. Angiotensin II(Ang II)induced neonatal cardiomyocyte hypertrophy in vitro and the pressure overload-induced cardiac hypertrophy in vivo can be reduced by Knocking down circCmss1. We further investigated the downstream signaling of circCmss1 in the progression of NSD2-promoted ventricular remodeling and discovered that circCmss1 could interact with a transcription factor EIF4A3 and induce the expression of transferrin receptor 1 (TfR1), thus activating the ferroptosis in cardiomyocytes. This study highlights the significance of NSD2 activation of circCmss1/EIF4A3/TfR1 as therapeutic targets for treating pathological myocardial hypertrophy.
Collapse
Affiliation(s)
- Qi-Rong Xu
- Department of Cardiovascular Surgery, The First Affiliated Hospital, Nanchang University, Nanchang, China
| | - Jin-Long Liu
- Institute of Translational Medicine, Shanghai University, Shanghai, China
| | - Rong-Rong Zhu
- Department of Cardiology, Jiangxi Hospital of Traditional Chinese Medicine, Jiangxi University of Chinese Medicine, China
| | - Wen-Xiong Huang
- Department of Cardiovascular Surgery, The First Affiliated Hospital, Nanchang University, Nanchang, China
| | - Huang Huang
- Department of Cardiovascular Surgery, The First Affiliated Hospital, Nanchang University, Nanchang, China
| | - Ji-Chun Liu
- Department of Cardiovascular Surgery, The First Affiliated Hospital, Nanchang University, Nanchang, China
| | - Xin-Ping Xu
- Jiangxi Institute of Respiratory Disease, The First Affiliated Hospital, Nanchang University, China.
| | - Xue-Liang Zhou
- Department of Cardiovascular Surgery, The First Affiliated Hospital, Nanchang University, Nanchang, China.
| |
Collapse
|
8
|
Jin J, Cong J, Lei S, Zhang Q, Zhong X, Su Y, Lu M, Ma Y, Li Z, Wang L, Zhu N, Yang J. Cracking the code: Deciphering the role of the tumor microenvironment in osteosarcoma metastasis. Int Immunopharmacol 2023; 121:110422. [PMID: 37302370 DOI: 10.1016/j.intimp.2023.110422] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Revised: 05/19/2023] [Accepted: 05/30/2023] [Indexed: 06/13/2023]
Abstract
Osteosarcoma (OS) is the most common malignant bone tumor in children and adolescents. It is characterized by a rapid progression, poor prognosis, and early pulmonary metastasis. Over the past 30 years, approximately 85% of patients with osteosarcoma have experienced metastasis. The five-year survival of patients with lung metastasis during the early stages of treatment is less than 20%. The tumor microenvironment (TME) not only provides conditions for tumor cell growth but also releases a variety of substances that can promote the metastasis of tumor cells to other tissues and organs. Currently, there is limited research on the role of the TME in osteosarcoma metastasis. Therefore, to explore methods for regulating osteosarcoma metastasis, further investigations must be conducted from the perspective of the TME. This will help to identify new potential biomarkers for predicting osteosarcoma metastasis and assist in the discovery of new drugs that target regulatory mechanisms for clinical diagnosis and treatment. This paper reviews the research progress on the mechanism of osteosarcoma metastasis based on TME theory, which will provide guidance for the clinical treatment of osteosarcoma.
Collapse
Affiliation(s)
- Jiamin Jin
- Department of Gastroenterology, Affiliated Hospital of Guilin Medical University, Guangxi, Guilin 541001, China; Department of Immunology, Guilin Medical University, Guilin 541199, China; Key Laboratory of Tumor Immunology and Microenvironmental Regulation, Guilin Medical University, Guilin 541199, China
| | - Jiacheng Cong
- Department of Immunology, Guilin Medical University, Guilin 541199, China; Key Laboratory of Tumor Immunology and Microenvironmental Regulation, Guilin Medical University, Guilin 541199, China
| | - Shangbo Lei
- Department of Immunology, Guilin Medical University, Guilin 541199, China; Key Laboratory of Tumor Immunology and Microenvironmental Regulation, Guilin Medical University, Guilin 541199, China
| | - Qiujin Zhang
- Department of Immunology, Guilin Medical University, Guilin 541199, China
| | - Xinyi Zhong
- Department of Immunology, Guilin Medical University, Guilin 541199, China; Key Laboratory of Tumor Immunology and Microenvironmental Regulation, Guilin Medical University, Guilin 541199, China
| | - Yingying Su
- Department of Immunology, Guilin Medical University, Guilin 541199, China; Key Laboratory of Tumor Immunology and Microenvironmental Regulation, Guilin Medical University, Guilin 541199, China
| | - Mingchuan Lu
- Department of Immunology, Guilin Medical University, Guilin 541199, China; Key Laboratory of Tumor Immunology and Microenvironmental Regulation, Guilin Medical University, Guilin 541199, China
| | - Yifen Ma
- Department of Immunology, Guilin Medical University, Guilin 541199, China; Key Laboratory of Tumor Immunology and Microenvironmental Regulation, Guilin Medical University, Guilin 541199, China
| | - Zihe Li
- Department of Immunology, Guilin Medical University, Guilin 541199, China; Key Laboratory of Tumor Immunology and Microenvironmental Regulation, Guilin Medical University, Guilin 541199, China
| | - Liyan Wang
- Department of Gastroenterology, Affiliated Hospital of Guilin Medical University, Guangxi, Guilin 541001, China
| | - Ningxia Zhu
- Key Laboratory of Tumor Immunology and Microenvironmental Regulation, Guilin Medical University, Guilin 541199, China.
| | - Jinfeng Yang
- Department of Gastroenterology, Affiliated Hospital of Guilin Medical University, Guangxi, Guilin 541001, China; Department of Immunology, Guilin Medical University, Guilin 541199, China; Key Laboratory of Tumor Immunology and Microenvironmental Regulation, Guilin Medical University, Guilin 541199, China.
| |
Collapse
|
9
|
Cao C, Yang S, Li M, Li C. CircSSNN: circRNA-binding site prediction via sequence self-attention neural networks with pre-normalization. BMC Bioinformatics 2023; 24:220. [PMID: 37254080 DOI: 10.1186/s12859-023-05352-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2023] [Accepted: 05/25/2023] [Indexed: 06/01/2023] Open
Abstract
BACKGROUND Circular RNAs (circRNAs) play a significant role in some diseases by acting as transcription templates. Therefore, analyzing the interaction mechanism between circRNA and RNA-binding proteins (RBPs) has far-reaching implications for the prevention and treatment of diseases. Existing models for circRNA-RBP identification usually adopt convolution neural network (CNN), recurrent neural network (RNN), or their variants as feature extractors. Most of them have drawbacks such as poor parallelism, insufficient stability, and inability to capture long-term dependencies. METHODS In this paper, we propose a new method completely using the self-attention mechanism to capture deep semantic features of RNA sequences. On this basis, we construct a CircSSNN model for the cirRNA-RBP identification. The proposed model constructs a feature scheme by fusing circRNA sequence representations with statistical distributions, static local contexts, and dynamic global contexts. With a stable and efficient network architecture, the distance between any two positions in a sequence is reduced to a constant, so CircSSNN can quickly capture the long-term dependencies and extract the deep semantic features. RESULTS Experiments on 37 circRNA datasets show that the proposed model has overall advantages in stability, parallelism, and prediction performance. Keeping the network structure and hyperparameters unchanged, we directly apply the CircSSNN to linRNA datasets. The favorable results show that CircSSNN can be transformed simply and efficiently without task-oriented tuning. CONCLUSIONS In conclusion, CircSSNN can serve as an appealing circRNA-RBP identification tool with good identification performance, excellent scalability, and wide application scope without the need for task-oriented fine-tuning of parameters, which is expected to reduce the professional threshold required for hyperparameter tuning in bioinformatics analysis.
Collapse
Affiliation(s)
- Chao Cao
- School of Computer Science and Technology, Guangxi University of Science and Technology, Liuzhou, China
| | - Shuhong Yang
- Key Laboratory of Guangxi Universities on Intelligent Computing and Distributed Information Processing, Guangxi University of Science and Technology, Liuzhou, China.
| | - Mengli Li
- School of Technology, Guilin University, Guilin, China
| | - Chungui Li
- School of Computer Science and Technology, Guangxi University of Science and Technology, Liuzhou, China.
| |
Collapse
|
10
|
Zhang Z, Xu J, Wu Y, Liu N, Wang Y, Liang Y. CapsNet-LDA: predicting lncRNA-disease associations using attention mechanism and capsule network based on multi-view data. Brief Bioinform 2023; 24:6889447. [PMID: 36511221 DOI: 10.1093/bib/bbac531] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2022] [Revised: 10/25/2022] [Accepted: 11/07/2022] [Indexed: 12/15/2022] Open
Abstract
Cumulative studies have shown that many long non-coding RNAs (lncRNAs) are crucial in a number of diseases. Predicting potential lncRNA-disease associations (LDAs) can facilitate disease prevention, diagnosis and treatment. Therefore, it is vital to develop practical computational methods for LDA prediction. In this study, we propose a novel predictor named capsule network (CapsNet)-LDA for LDA prediction. CapsNet-LDA first uses a stacked autoencoder for acquiring the informative low-dimensional representations of the lncRNA-disease pairs under multiple views, then the attention mechanism is leveraged to implement an adaptive allocation of importance weights to them, and they are subsequently processed using a CapsNet-based architecture for predicting LDAs. Different from the conventional convolutional neural networks (CNNs) that have some restrictions with the usage of scalar neurons and pooling operations. the CapsNets use vector neurons instead of scalar neurons that have better robustness for the complex combination of features and they use dynamic routing processes for updating parameters. CapsNet-LDA is superior to other five state-of-the-art models on four benchmark datasets, four perturbed datasets and an independent test set in the comparison experiments, demonstrating that CapsNet-LDA has excellent performance and robustness against perturbation, as well as good generalization ability. The ablation studies verify the effectiveness of some modules of CapsNet-LDA. Moreover, the ability of multi-view data to improve performance is proven. Case studies further indicate that CapsNet-LDA can accurately predict novel LDAs for specific diseases.
Collapse
Affiliation(s)
- Zequn Zhang
- College of Computer and Information Engineering, Jiangxi Agricultural University, Nanchang, 310045 Jiangxi, China
| | - Junlin Xu
- College of Information Science and Engineering, Hunan University, Changsha 410082, Hunan, China
| | - Yanan Wu
- College of Computer and Information Engineering, Jiangxi Agricultural University, Nanchang, 310045 Jiangxi, China
| | - Niannian Liu
- College of Computer and Information Engineering, Jiangxi Agricultural University, Nanchang, 310045 Jiangxi, China
| | - Yinglong Wang
- College of Computer and Information Engineering, Jiangxi Agricultural University, Nanchang, 310045 Jiangxi, China
| | - Ying Liang
- College of Computer and Information Engineering, Jiangxi Agricultural University, Nanchang, 310045 Jiangxi, China
| |
Collapse
|
11
|
Wu H, Zheng S, He Q, Li Y. Recent Advances of Circular RNAs as Biomarkers for Osteosarcoma. Int J Gen Med 2023; 16:173-183. [PMID: 36687163 PMCID: PMC9850833 DOI: 10.2147/ijgm.s380834] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2022] [Accepted: 11/30/2022] [Indexed: 01/15/2023] Open
Abstract
Osteosarcoma is the most common primary malignant bone tumor in young adult, which is prone to early metastasis and poor prognosis. The current treatment methods need to be improved. Circular RNA is a covalently blocked circular, non-coding RNA that plays an essential role in the occurrence, development, clinical diagnosis, and treatment of various diseases. Recently, an increasing number of circRNAs have been identified in osteosarcoma. Understanding its role in osteosarcoma is conducive to the early detection, diagnosis, and treatment of osteosarcoma. In this paper, we reviewed the mechanism of action of circular RNA in the occurrence and development of osteosarcoma and its clinical application in recent years.
Collapse
Affiliation(s)
- Hongliang Wu
- Department of Orthopedics, Fuzhou Second Hospital of Xiamen University, School of Medicine, Xiamen University, Xiamen, People’s Republic of China,Department of Orthopedics, Fuzhou Second Hospital, Fuzhou, People’s Republic of China,Department of Orthopedics, Shengjing Hospital of China Medical University, Shenyang, People’s Republic of China
| | - Sihang Zheng
- Department of Neurology, Shengjing Hospital of China Medical University, Shenyang, People’s Republic of China
| | - Qun He
- Department of Bioinformatics, School of Life Sciences, China Medical University, Shenyang, People’s Republic of China
| | - Yan Li
- Department of Orthopedics, Shengjing Hospital of China Medical University, Shenyang, People’s Republic of China,Correspondence: Yan Li; Qun He, Email ;
| |
Collapse
|
12
|
A pseudo-Siamese framework for circRNA-RBP binding sites prediction integrating BiLSTM and soft attention mechanism. Methods 2022; 207:57-64. [PMID: 36113743 DOI: 10.1016/j.ymeth.2022.09.003] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2022] [Revised: 08/24/2022] [Accepted: 09/09/2022] [Indexed: 11/20/2022] Open
Abstract
Circular RNAs (circRNAs) are widely expressed in tissues and play a key role in diseases through interacting with RNA binding proteins (RBPs). Since the high cost of traditional technology, computational methods are developed to identify the binding sites between circRNAs and RBPs. Unfortunately, these methods suffer from the insufficient learning of features and the single classification of output. To address these limitations, we propose a novel method named circ-pSBLA which constructs a pseudo-Siamese framework integrating Bi-directional long short-term memory (BiLSTM) network and soft attention mechanism for circRNA-RBP binding sites prediction. Softmax function and CatBoost are adopted to classify, respectively, and then a pseudo-Siamese framework is constructed. circ-pSBLA combines them to get final output. To validate the effectiveness of circ-pSBLA, we compare it with other state-of-the-art methods and carry out an ablation experiment on 17 sub-datasets. Moreover, we do motif analysis on 3 sub-datasets. The results show that circ-pSBLA achieves superior performance and outperforms other methods. All supporting source codes can be downloaded from https://github.com/gyj9811/circ-pSBLA.
Collapse
|
13
|
Wang Z, Lei X. A web server for identifying circRNA-RBP variable-length binding sites based on stacked generalization ensemble deep learning network. Methods 2022; 205:179-190. [PMID: 35810958 DOI: 10.1016/j.ymeth.2022.06.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2022] [Revised: 06/23/2022] [Accepted: 06/28/2022] [Indexed: 11/28/2022] Open
Abstract
Circular RNA (circRNA) can exert biological functions by interacting with RNA-binding protein (RBP), and some deep learning-based methods have been developed to predict RBP binding sites on circRNA. However, most of these methods identify circRNA-RBP binding sites are only based on single data resource and cannot provide exact binding sites, only providing the probability value of a sequence fragment. To solve these problems, we propose a binding sites localization algorithm that fuses binding sites from multiple databases, and further design a stacked generalization ensemble deep learning model named CirRBP to identify RBP binding sites on circRNA. The CirRBP is trained by combining the binding sites from multiple databases and makes predictions by weighted aggregating the predictions of each sub-model. The results show that the CirRBP outperforms any sub-model and existing online prediction model. For better access to our research results, we develop an open-source web application called CRWS (CircRNA-RBP Web Server). Its back-end learning model of the CRWS is a stacked generalization ensemble learning model CirRBP based on different deep learning frameworks. Given a full-length circRNA or fragment sequence and a target RBP, the CRWS can analyze and provide the exact potential binding sites of the target RBP on the given sequence through the binding sites localization algorithm, and visualize it. In addition, the CRWS can discover the most widely distributed motif in each RBP dataset. Up to now, CRWS is the first significant online tool that uses multi-source data to train models and predict exact binding sites. CRWS is now publicly and freely available without login requirement at: http://www.bioinformatics.team.
Collapse
Affiliation(s)
- Zhengfeng Wang
- School of Computer Science, Shaanxi Normal University, Xi'an 710119, China; College of Information Science and Engineering, Guilin University of Technology, Guilin 541004, China; Guangxi Key Laboratory of Embedded Technology and Intelligent System, Guilin University of Technology, Guilin 541004, China
| | - Xiujuan Lei
- School of Computer Science, Shaanxi Normal University, Xi'an 710119, China.
| |
Collapse
|
14
|
Yang Y, Hou Z, Wang Y, Ma H, Sun P, Ma Z, Wong KC, Li X. HCRNet: high-throughput circRNA-binding event identification from CLIP-seq data using deep temporal convolutional network. Brief Bioinform 2022; 23:6533504. [PMID: 35189638 DOI: 10.1093/bib/bbac027] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2021] [Revised: 01/03/2022] [Accepted: 01/17/2022] [Indexed: 01/11/2023] Open
Abstract
Identifying genome-wide binding events between circular RNAs (circRNAs) and RNA-binding proteins (RBPs) can greatly facilitate our understanding of functional mechanisms within circRNAs. Thanks to the development of cross-linked immunoprecipitation sequencing technology, large amounts of genome-wide circRNA binding event data have accumulated, providing opportunities for designing high-performance computational models to discriminate RBP interaction sites and thus to interpret the biological significance of circRNAs. Unfortunately, there are still no computational models sufficiently flexible to accommodate circRNAs from different data scales and with various degrees of feature representation. Here, we present HCRNet, a novel end-to-end framework for identification of circRNA-RBP binding events. To capture the hierarchical relationships, the multi-source biological information is fused to represent circRNAs, including various natural language sequence features. Furthermore, a deep temporal convolutional network incorporating global expectation pooling was developed to exploit the latent nucleotide dependencies in an exhaustive manner. We benchmarked HCRNet on 37 circRNA datasets and 31 linear RNA datasets to demonstrate the effectiveness of our proposed method. To evaluate further the model's robustness, we performed HCRNet on a full-length dataset containing 740 circRNAs. Results indicate that HCRNet generally outperforms existing methods. In addition, motif analyses were conducted to exhibit the interpretability of HCRNet on circRNAs. All supporting source code and data can be downloaded from https://github.com/yangyn533/HCRNet and https://doi.org/10.6084/m9.figshare.16943722.v1. And the web server of HCRNet is publicly accessible at http://39.104.118.143:5001/.
Collapse
Affiliation(s)
- Yuning Yang
- School of Information Science and Technology, Northeast Normal University, Changchun, Jilin, China
| | - Zilong Hou
- School of Artificial Intelligence, Jilin University, Changchun, Jilin, China
| | - Yansong Wang
- School of Artificial Intelligence, Jilin University, Changchun, Jilin, China
| | - Hongli Ma
- School of Mathematics, Shandong University, Jinan, Shandong, China
| | - Pingping Sun
- School of Information Science and Technology, Northeast Normal University, Changchun, Jilin, China
| | - Zhiqiang Ma
- School of Information Science and Technology, Northeast Normal University, Changchun, Jilin, China
| | - Ka-Chun Wong
- School of Computer Science, City University of Hong Kong, Hong Kong SAR
| | - Xiangtao Li
- School of Artificial Intelligence, Jilin University, Changchun, Jilin, China
| |
Collapse
|
15
|
Wang Z, Lei X. Prediction of RBP binding sites on circRNAs using an LSTM-based deep sequence learning architecture. Brief Bioinform 2021; 22:6355419. [PMID: 34415289 DOI: 10.1093/bib/bbab342] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2021] [Revised: 07/14/2021] [Accepted: 08/02/2021] [Indexed: 01/22/2023] Open
Abstract
Circular RNAs (circRNAs) are widely expressed in highly diverged eukaryotes. Although circRNAs have been known for many years, their function remains unclear. Interaction with RNA-binding protein (RBP) to influence post-transcriptional regulation is considered to be an important pathway for circRNA function, such as acting as an oncogenic RBP sponge to inhibit cancer. In this study, we design a deep learning framework, CRPBsites, to predict the binding sites of RBPs on circRNAs. In this model, the sequences of variable-length binding sites are transformed into embedding vectors by word2vec model. Bidirectional LSTM is used to encode the embedding vectors of binding sites, and then they are fed into another LSTM decoder for decoding and classification tasks. To train and test the model, we construct four datasets that contain sequences of variable-length binding sites on circRNAs, and each set corresponds to an RBP, which is overexpressed in bladder cancer tissues. Experimental results on four datasets and comparison with other existing models show that CRPBsites has superior performance. Afterwards, we found that there were highly similar binding motifs in the four binding site datasets. Finally, we applied well-trained CRPBsites to identify the binding sites of IGF2BP1 on circCDYL, and the results proved the effectiveness of this method. In conclusion, CRPBsites is an effective prediction model for circRNA-RBP interaction site identification. We hope that CRPBsites can provide valuable guidance for experimental studies on the influence of circRNA on post-transcriptional regulation.
Collapse
Affiliation(s)
- Zhengfeng Wang
- School of Computer Science, Shaanxi Normal University, Xi'an, China.,College of Information Science and Engineering, Guilin University of Technology, Guilin, China
| | - Xiujuan Lei
- School of Computer Science, Shaanxi Normal University, Xi'an, China
| |
Collapse
|
16
|
Sun W, Zhou H, Han X, Hou L, Xue X. Circular RNA: A novel type of biomarker for glioma (Review). Mol Med Rep 2021; 24:602. [PMID: 34165178 PMCID: PMC8240176 DOI: 10.3892/mmr.2021.12240] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2020] [Accepted: 05/12/2021] [Indexed: 12/18/2022] Open
Abstract
With the rapid development of sequencing technologies, the characteristics and functions of circular RNAs (circRNAs) in different tissues, and their underlying pathophysiological mechanisms, have been identified. circRNAs are significantly enriched in the brain and are continually expressed from the embryonic stage to the adult stage in rats. Previous studies have reported that certain circRNAs are differentially expressed in glioma and regulate a number of biological processes, such as cell proliferation, metastasis and oncogenesis of glioma. Furthermore, certain circRNAs have been associated with tumor size, World Health Organization tumor grade and poor prognosis in patients with glioma. It has been hypothesized that circRNAs may be involved in the onset and progression of glioma through transcriptional regulation, protein translation and binding to microRNAs. These properties and functions suggest the potential of circRNAs as prognostic biomarkers and therapeutic targets for glioma. For the present review, published studies were examined from PubMed, Embase, Cochrane Central and the reference lists of the retrieved articles. The aim of the present review was to summarize the progress of circRNA research in glioma, discuss the potential diagnostic and prognostic values, and the roles of circRNAs in glioma, and provide a novel theoretical basis and research concepts for the prediction, diagnosis and treatment of glioma.
Collapse
Affiliation(s)
- Wei Sun
- Department of Radiotherapy, The Second Hospital of Hebei Medical University, Shijiazhuang, Hebei 050000, P.R. China
| | - Huandi Zhou
- Department of Radiotherapy, The Second Hospital of Hebei Medical University, Shijiazhuang, Hebei 050000, P.R. China
| | - Xuetao Han
- Department of Radiotherapy, The Second Hospital of Hebei Medical University, Shijiazhuang, Hebei 050000, P.R. China
| | - Liubing Hou
- Department of Radiotherapy, The Second Hospital of Hebei Medical University, Shijiazhuang, Hebei 050000, P.R. China
| | - Xiaoying Xue
- Department of Radiotherapy, The Second Hospital of Hebei Medical University, Shijiazhuang, Hebei 050000, P.R. China
| |
Collapse
|