1
|
Saha R, Vázquez-Salazar A, Nandy A, Chen IA. Fitness Landscapes and Evolution of Catalytic RNA. Annu Rev Biophys 2024; 53:109-125. [PMID: 39013026 DOI: 10.1146/annurev-biophys-030822-025038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/18/2024]
Abstract
The relationship between genotype and phenotype, or the fitness landscape, is the foundation of genetic engineering and evolution. However, mapping fitness landscapes poses a major technical challenge due to the amount of quantifiable data that is required. Catalytic RNA is a special topic in the study of fitness landscapes due to its relatively small sequence space combined with its importance in synthetic biology. The combination of in vitro selection and high-throughput sequencing has recently provided empirical maps of both complete and local RNA fitness landscapes, but the astronomical size of sequence space limits purely experimental investigations. Next steps are likely to involve data-driven interpolation and extrapolation over sequence space using various machine learning techniques. We discuss recent progress in understanding RNA fitness landscapes, particularly with respect to protocells and machine representations of RNA. The confluence of technical advances may significantly impact synthetic biology in the near future.
Collapse
Affiliation(s)
- Ranajay Saha
- Department of Chemical and Biomolecular Engineering, University of California, Los Angeles, California, USA; ,
| | - Alberto Vázquez-Salazar
- Department of Chemical and Biomolecular Engineering, University of California, Los Angeles, California, USA; ,
| | - Aditya Nandy
- Department of Chemical and Biomolecular Engineering, University of California, Los Angeles, California, USA; ,
- Department of Chemistry, The University of Chicago, Chicago, Illinois, USA
- The James Franck Institute, The University of Chicago, Chicago, Illinois, USA
| | - Irene A Chen
- Department of Chemical and Biomolecular Engineering, University of California, Los Angeles, California, USA; ,
- Department of Chemistry and Biochemistry, University of California, Los Angeles, California, USA
| |
Collapse
|
2
|
Yang S, Kim SH, Yang E, Kang M, Joo JY. Molecular insights into regulatory RNAs in the cellular machinery. Exp Mol Med 2024; 56:1235-1249. [PMID: 38871819 PMCID: PMC11263585 DOI: 10.1038/s12276-024-01239-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Revised: 02/27/2024] [Accepted: 03/05/2024] [Indexed: 06/15/2024] Open
Abstract
It is apparent that various functional units within the cellular machinery are derived from RNAs. The evolution of sequencing techniques has resulted in significant insights into approaches for transcriptome studies. Organisms utilize RNA to govern cellular systems, and a heterogeneous class of RNAs is involved in regulatory functions. In particular, regulatory RNAs are increasingly recognized to participate in intricately functioning machinery across almost all levels of biological systems. These systems include those mediating chromatin arrangement, transcription, suborganelle stabilization, and posttranscriptional modifications. Any class of RNA exhibiting regulatory activity can be termed a class of regulatory RNA and is typically represented by noncoding RNAs, which constitute a substantial portion of the genome. These RNAs function based on the principle of structural changes through cis and/or trans regulation to facilitate mutual RNA‒RNA, RNA‒DNA, and RNA‒protein interactions. It has not been clearly elucidated whether regulatory RNAs identified through deep sequencing actually function in the anticipated mechanisms. This review addresses the dominant properties of regulatory RNAs at various layers of the cellular machinery and covers regulatory activities, structural dynamics, modifications, associated molecules, and further challenges related to therapeutics and deep learning.
Collapse
Affiliation(s)
- Sumin Yang
- Department of Pharmacy, College of Pharmacy, Hanyang University, Ansan, Gyeonggi-do, 15588, Republic of Korea
| | - Sung-Hyun Kim
- Department of Pharmacy, College of Pharmacy, Hanyang University, Ansan, Gyeonggi-do, 15588, Republic of Korea
| | - Eunjeong Yang
- Department of Pharmacy, College of Pharmacy, Hanyang University, Ansan, Gyeonggi-do, 15588, Republic of Korea
| | - Mingon Kang
- Department of Computer Science, University of Nevada, Las Vegas, NV, 89154, USA
| | - Jae-Yeol Joo
- Department of Pharmacy, College of Pharmacy, Hanyang University, Ansan, Gyeonggi-do, 15588, Republic of Korea.
| |
Collapse
|
3
|
Nie Z, Gao M, Jin X, Rao Y, Zhang X. MFPINC: prediction of plant ncRNAs based on multi-source feature fusion. BMC Genomics 2024; 25:531. [PMID: 38816689 PMCID: PMC11137975 DOI: 10.1186/s12864-024-10439-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2023] [Accepted: 05/21/2024] [Indexed: 06/01/2024] Open
Abstract
Non-coding RNAs (ncRNAs) are recognized as pivotal players in the regulation of essential physiological processes such as nutrient homeostasis, development, and stress responses in plants. Common methods for predicting ncRNAs are susceptible to significant effects of experimental conditions and computational methods, resulting in the need for significant investment of time and resources. Therefore, we constructed an ncRNA predictor(MFPINC), to predict potential ncRNA in plants which is based on the PINC tool proposed by our previous studies. Specifically, sequence features were carefully refined using variance thresholding and F-test methods, while deep features were extracted and feature fusion were performed by applying the GRU model. The comprehensive evaluation of multiple standard datasets shows that MFPINC not only achieves more comprehensive and accurate identification of gene sequences, but also significantly improves the expressive and generalization performance of the model, and MFPINC significantly outperforms the existing competing methods in ncRNA identification. In addition, it is worth mentioning that our tool can also be found on Github ( https://github.com/Zhenj-Nie/MFPINC ) the data and source code can also be downloaded for free.
Collapse
Affiliation(s)
- Zhenjun Nie
- School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, 230036, China
| | - Mengqing Gao
- School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, 230036, China
| | - Xiu Jin
- School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, 230036, China
- Key Laboratory of Agricultural Sensors, Ministry of Agriculture and Rural Affairs, Hefei, 230036, China
| | - Yuan Rao
- School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, 230036, China
- Key Laboratory of Agricultural Sensors, Ministry of Agriculture and Rural Affairs, Hefei, 230036, China
| | - Xiaodan Zhang
- School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, 230036, China.
- Key Laboratory of Agricultural Sensors, Ministry of Agriculture and Rural Affairs, Hefei, 230036, China.
| |
Collapse
|
4
|
Avila Santos AP, de Almeida BLS, Bonidia RP, Stadler PF, Stefanic P, Mandic-Mulec I, Rocha U, Sanches DS, de Carvalho ACPLF. BioDeepfuse: a hybrid deep learning approach with integrated feature extraction techniques for enhanced non-coding RNA classification. RNA Biol 2024; 21:1-12. [PMID: 38528797 DOI: 10.1080/15476286.2024.2329451] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/23/2024] [Indexed: 03/27/2024] Open
Abstract
The accurate classification of non-coding RNA (ncRNA) sequences is pivotal for advanced non-coding genome annotation and analysis, a fundamental aspect of genomics that facilitates understanding of ncRNA functions and regulatory mechanisms in various biological processes. While traditional machine learning approaches have been employed for distinguishing ncRNA, these often necessitate extensive feature engineering. Recently, deep learning algorithms have provided advancements in ncRNA classification. This study presents BioDeepFuse, a hybrid deep learning framework integrating convolutional neural networks (CNN) or bidirectional long short-term memory (BiLSTM) networks with handcrafted features for enhanced accuracy. This framework employs a combination of k-mer one-hot, k-mer dictionary, and feature extraction techniques for input representation. Extracted features, when embedded into the deep network, enable optimal utilization of spatial and sequential nuances of ncRNA sequences. Using benchmark datasets and real-world RNA samples from bacterial organisms, we evaluated the performance of BioDeepFuse. Results exhibited high accuracy in ncRNA classification, underscoring the robustness of our tool in addressing complex ncRNA sequence data challenges. The effective melding of CNN or BiLSTM with external features heralds promising directions for future research, particularly in refining ncRNA classifiers and deepening insights into ncRNAs in cellular processes and disease manifestations. In addition to its original application in the context of bacterial organisms, the methodologies and techniques integrated into our framework can potentially render BioDeepFuse effective in various and broader domains.
Collapse
Affiliation(s)
- Anderson P Avila Santos
- Institute of Mathematics and Computer Sciences, University of São Paulo, São Carlos, Brazil
- Department of Applied Microbial Ecology, Helmholtz Centre for Environmental Research - UFZ GmbH, Leipzig, Saxony, Germany
| | - Breno L S de Almeida
- Institute of Mathematics and Computer Sciences, University of São Paulo, São Carlos, Brazil
| | - Robson P Bonidia
- Institute of Mathematics and Computer Sciences, University of São Paulo, São Carlos, Brazil
- Department of Computer Science, Federal University of Technology - Paraná, UTFPR, Cornélio Procópio, Brazil
| | - Peter F Stadler
- Department of Computer Science and Interdisciplinary Center of Bioinformatics, University of Leipzig, Leipzig, Saxony, Germany
| | - Polonca Stefanic
- Department of Food Science and Technology, Biotechnical Faculty, University of Ljubljana, Ljubljana, Slovenia
| | - Ines Mandic-Mulec
- Department of Food Science and Technology, Biotechnical Faculty, University of Ljubljana, Ljubljana, Slovenia
| | - Ulisses Rocha
- Department of Applied Microbial Ecology, Helmholtz Centre for Environmental Research - UFZ GmbH, Leipzig, Saxony, Germany
| | - Danilo S Sanches
- Department of Computer Science, Federal University of Technology - Paraná, UTFPR, Cornélio Procópio, Brazil
| | | |
Collapse
|
5
|
Rocca R, Grillone K, Citriniti EL, Gualtieri G, Artese A, Tagliaferri P, Tassone P, Alcaro S. Targeting non-coding RNAs: Perspectives and challenges of in-silico approaches. Eur J Med Chem 2023; 261:115850. [PMID: 37839343 DOI: 10.1016/j.ejmech.2023.115850] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2023] [Revised: 09/08/2023] [Accepted: 09/29/2023] [Indexed: 10/17/2023]
Abstract
The growing information currently available on the central role of non-coding RNAs (ncRNAs) including microRNAs (miRNAS) and long non-coding RNAs (lncRNAs) for chronic and degenerative human diseases makes them attractive therapeutic targets. RNAs carry out different functional roles in human biology and are deeply deregulated in several diseases. So far, different attempts to therapeutically target the 3D RNA structures with small molecules have been reported. In this scenario, the development of computational tools suitable for describing RNA structures and their potential interactions with small molecules is gaining more and more interest. Here, we describe the most suitable strategies to study ncRNAs through computational tools. We focus on methods capable of predicting 2D and 3D ncRNA structures. Furthermore, we describe computational tools to identify, design and optimize small molecule ncRNA binders. This review aims to outline the state of the art and perspectives of computational methods for ncRNAs over the past decade.
Collapse
Affiliation(s)
- Roberta Rocca
- Department of Health Science, Magna Graecia University, Catanzaro, Italy; Net4Science srl, Academic Spinoff, Magna Græcia University, Catanzaro, Italy
| | - Katia Grillone
- Department of Experimental and Clinical Medicine, Magna Græcia University, Catanzaro, Italy
| | | | | | - Anna Artese
- Department of Health Science, Magna Graecia University, Catanzaro, Italy; Net4Science srl, Academic Spinoff, Magna Græcia University, Catanzaro, Italy.
| | | | - Pierfrancesco Tassone
- Department of Experimental and Clinical Medicine, Magna Græcia University, Catanzaro, Italy
| | - Stefano Alcaro
- Department of Health Science, Magna Graecia University, Catanzaro, Italy; Net4Science srl, Academic Spinoff, Magna Græcia University, Catanzaro, Italy
| |
Collapse
|
6
|
Chen K, Zhu X, Wang J, Zhao Z, Hao L, Guo X, Liu Y. MFPred: prediction of ncRNA families based on multi-feature fusion. Brief Bioinform 2023; 24:bbad303. [PMID: 37615358 DOI: 10.1093/bib/bbad303] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Revised: 07/30/2023] [Accepted: 07/31/2023] [Indexed: 08/25/2023] Open
Abstract
Non-coding RNA (ncRNA) plays a critical role in biology. ncRNAs from the same family usually have similar functions, as a result, it is essential to predict ncRNA families before identifying their functions. There are two primary methods for predicting ncRNA families, namely, traditional biological methods and computational methods. In traditional biological methods, a lot of manpower and resources are required to predict ncRNA families. Therefore, this paper proposed a new ncRNA family prediction method called MFPred based on computational methods. MFPred identified ncRNA families by extracting sequence features of ncRNAs, and it possessed three primary modules, including (1) four ncRNA sequences encoding and feature extraction module, which encoded ncRNA sequences and extracted four different features of ncRNA sequences, (2) dynamic Bi_GRU and feature fusion module, which extracted contextual information features of the ncRNA sequence and (3) ResNet_SE module that extracted local information features of the ncRNA sequence. In this study, MFPred was compared with the previously proposed ncRNA family prediction methods using two frequently used public ncRNA datasets, NCY and nRC. The results showed that MFPred outperformed other prediction methods in the two datasets.
Collapse
Affiliation(s)
- Kai Chen
- College of Software, jilin University, Changchun, 130012, China
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, jilin University, Changchun, 130012, China
| | - Xiaodong Zhu
- College of Software, jilin University, Changchun, 130012, China
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, jilin University, Changchun, 130012, China
- College of Computer Science and Technology, jilin University, Changchun, 130012, China
| | - Jiahao Wang
- College of Software, jilin University, Changchun, 130012, China
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, jilin University, Changchun, 130012, China
| | - Ziqi Zhao
- College of Software, jilin University, Changchun, 130012, China
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, jilin University, Changchun, 130012, China
| | - Lei Hao
- College of Software, jilin University, Changchun, 130012, China
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, jilin University, Changchun, 130012, China
| | - Xinsheng Guo
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, jilin University, Changchun, 130012, China
- College of Computer Science and Technology, jilin University, Changchun, 130012, China
| | - Yuanning Liu
- College of Software, jilin University, Changchun, 130012, China
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, jilin University, Changchun, 130012, China
- College of Computer Science and Technology, jilin University, Changchun, 130012, China
| |
Collapse
|
7
|
Sutanto K, Turcotte M. Assessing Global-Local Secondary Structure Fingerprints to Classify RNA Sequences With Deep Learning. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:2736-2747. [PMID: 34633933 DOI: 10.1109/tcbb.2021.3118358] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
RNA elements that are transcribed but not translated into proteins are called non-coding RNAs (ncRNAs). They play wide-ranging roles in biological processes and disorders. Just like proteins, their structure is often intimately linked to their function. Many examples have been documented where structure is conserved across taxa despite sequence divergence. Thus, structure is often used to identify function. Specifically, the secondary structure is predicted and ncRNAs with similar structures are assumed to have same or similar functions. However, a strand of RNA can fold into multiple possible structures, and some strands even fold differently in vivo and in vitro. Furthermore, ncRNAs often function as RNA-protein complexes, which can affect structure. Because of these, we hypothesized using one structure per sequence may discard information, possibly resulting in poorer classification accuracy. Therefore, we propose using secondary structure fingerprints, comprising two categories: a higher-level representation derived from RNA-As-Graphs (RAG), and free energy fingerprints based on a curated repertoire of small structural motifs. The fingerprints take into account the difference between global and local structural matches. We also evaluated our deep learning architecture with k-mers. By combining our global-local fingerprints with 6-mer, we achieved an accuracy, precision, and recall of 91.04%, 91.10%, and 91.00%.
Collapse
|
8
|
Dupont MJ, Major F. D-ORB: A Web Server to Extract Structural Features of Related But Unaligned RNA Sequences. J Mol Biol 2023; 435:168181. [PMID: 37468182 DOI: 10.1016/j.jmb.2023.168181] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2022] [Revised: 06/02/2023] [Accepted: 06/06/2023] [Indexed: 07/21/2023]
Abstract
Identifying the common structural elements of functionally related RNA sequences (family) is usually based on an alignment of the sequences, which is often subject to human bias and may not be accurate. The resulting covariance model (CM) provides probabilities for each base to covary with another, which allows to support evolutionarily the formation of double helical regions and possibly pseudoknots. The coexistence of alternative folds in RNA, resulting from its dynamic nature, may lead to the potential omission of motifs by CM. To overcome this limitation, we present D-ORB, a system of algorithms that identifies overrepresented motifs in the secondary conformational landscapes of a family when compared to those of unrelated sequences. The algorithms are bundled into an easy-to-use website allowing users to submit a family, and optionally provide unrelated sequences. D-ORB produces a non-pseudoknotted secondary structure based on the overrepresented motifs, a deep neural network classifier and two decision trees. When used to model an Rfam family, D-ORB fits overrepresented motifs in the corresponding Rfam structure; more than a hundred Rfam families have been modeled. The statistical approach behind D-ORB derives the structural composition of an RNA family, making it a valuable tool for analyzing and modeling it. Its easy-to-use interface and advanced algorithms make it an essential resource for researchers studying RNA structure. D-ORB is available at https://d-orb.major.iric.ca/.
Collapse
Affiliation(s)
- Mathieu J Dupont
- Department of Computer Science and Operations Research, and the Institute for Research in Immunology and Cancer, Université de Montréal, Montreal, Quebec H3C 3J7, Canada
| | - François Major
- Department of Computer Science and Operations Research, and the Institute for Research in Immunology and Cancer, Université de Montréal, Montreal, Quebec H3C 3J7, Canada. https://twitter.com/francois_major
| |
Collapse
|
9
|
Chen K, Zhu X, Wang J, Hao L, Liu Z, Liu Y. ncDENSE: a novel computational method based on a deep learning framework for non-coding RNAs family prediction. BMC Bioinformatics 2023; 24:68. [PMID: 36849908 PMCID: PMC9972773 DOI: 10.1186/s12859-023-05191-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Accepted: 02/16/2023] [Indexed: 03/01/2023] Open
Abstract
BACKGROUND Although research on non-coding RNAs (ncRNAs) is a hot topic in life sciences, the functions of numerous ncRNAs remain unclear. In recent years, researchers have found that ncRNAs of the same family have similar functions, therefore, it is important to accurately predict ncRNAs families to identify their functions. There are several methods available to solve the prediction problem of ncRNAs family, whose main ideas can be divided into two categories, including prediction based on the secondary structure features of ncRNAs, and prediction according to sequence features of ncRNAs. The first type of prediction method requires a complicated process and has a low accuracy in obtaining the secondary structure of ncRNAs, while the second type of method has a simple prediction process and a high accuracy, but there is still room for improvement. The existing methods for ncRNAs family prediction are associated with problems such as complicated prediction processes and low accuracy, in this regard, it is necessary to propose a new method to predict the ncRNAs family more perfectly. RESULTS A deep learning model-based method, ncDENSE, was proposed in this study, which predicted ncRNAs families by extracting ncRNAs sequence features. The bases in ncRNAs sequences were encoded by one-hot coding and later fed into an ensemble deep learning model, which contained the dynamic bi-directional gated recurrent unit (Bi-GRU), the dense convolutional network (DenseNet), and the Attention Mechanism (AM). To be specific, dynamic Bi-GRU was used to extract contextual feature information and capture long-term dependencies of ncRNAs sequences. AM was employed to assign different weights to features extracted by Bi-GRU and focused the attention on information with greater weights. Whereas DenseNet was adopted to extract local feature information of ncRNAs sequences and classify them by the full connection layer. According to our results, the ncDENSE method improved the Accuracy, Sensitivity, Precision, F-score, and MCC by 2.08[Formula: see text], 2.33[Formula: see text], 2.14[Formula: see text], 2.16[Formula: see text], and 2.39[Formula: see text], respectively, compared with the suboptimal method. CONCLUSIONS Overall, the ncDENSE method proposed in this paper extracts sequence features of ncRNAs by dynamic Bi-GRU and DenseNet and improves the accuracy in predicting ncRNAs family and other data.
Collapse
Affiliation(s)
- Kai Chen
- grid.64924.3d0000 0004 1760 5735College of Software, Jilin University, Changchun, 130012 China ,grid.64924.3d0000 0004 1760 5735Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, 130012 China
| | - Xiaodong Zhu
- grid.64924.3d0000 0004 1760 5735College of Software, Jilin University, Changchun, 130012 China ,grid.64924.3d0000 0004 1760 5735Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, 130012 China ,grid.64924.3d0000 0004 1760 5735College of Computer Science and Technology, Jilin University, Changchun, 130012 China
| | - Jiahao Wang
- grid.64924.3d0000 0004 1760 5735College of Software, Jilin University, Changchun, 130012 China ,grid.64924.3d0000 0004 1760 5735Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, 130012 China
| | - Lei Hao
- grid.64924.3d0000 0004 1760 5735College of Software, Jilin University, Changchun, 130012 China ,grid.64924.3d0000 0004 1760 5735Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, 130012 China
| | - Zhen Liu
- grid.64924.3d0000 0004 1760 5735College of Computer Science and Technology, Jilin University, Changchun, 130012 China ,grid.444367.60000 0000 9853 5396Graduate School of Engineering, Nagasaki Institute of Applied Science, 536 Aba-machi, Nagasaki 851-0193 Japan
| | - Yuanning Liu
- College of Software, Jilin University, Changchun, 130012, China. .,Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, 130012, China. .,College of Computer Science and Technology, Jilin University, Changchun, 130012, China.
| |
Collapse
|
10
|
Lima DDS, Amichi LJA, Fernandez MA, Constantino AA, Seixas FAV. NCYPred: A Bidirectional LSTM Network With Attention for Y RNA and Short Non-Coding RNA Classification. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:557-565. [PMID: 34826297 DOI: 10.1109/tcbb.2021.3131136] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Short non-coding RNAs (sncRNAs) are involved in multiple cellular processes and can be divided into dozens of classes. Among such classes, Y RNAs have been gaining attention, being essential factors for the initiation of DNA replication on vertebrates, as well as potential tumor biomarkers. Homologs have also been described in nematodes and insects, as well as related sequences in bacteria. Methods capable of accurately predicting Y RNA transcripts are lacking. In this work, we developed an attention-based LSTM network and built a classification model able to classify sncRNAs (including Y RNA) directly from nucleotide sequences. A dataset consisting of 45,447 sncRNA sequences, from a wide range of organisms, obtained from Rfam 14.3 was built. Performance evaluation demonstrated that our proposed method, NCYPred (Non-Coding/Y RNA Prediction), can accurately predict Y RNA sequences and their homologs, as well as 11 additional classes, achieving results comparable with state-of-the-art methods. We also demonstrate that applying t-SNE on learned sequence representations could be useful for sequence analysis. Our model is freely available as a web-server (https://www.gpea.uem.br/ncypred/).
Collapse
|
11
|
Wang Y, Li Q, Wang S, Wang BJ, Jin Y, Hu H, Fu QS, Wang JW, Wu Q, Qian L, Cao TT, Xia YB, Huang XX, Xu L. The role of noncoding RNAs in cancer lipid metabolism. Front Oncol 2022; 12:1026257. [PMID: 36452489 PMCID: PMC9704363 DOI: 10.3389/fonc.2022.1026257] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Accepted: 10/20/2022] [Indexed: 12/03/2023] Open
Abstract
Research on noncoding ribonucleic acids (ncRNAs) is mostly and broadly focused on microRNAs (miRNAs), cyclic RNAs (circRNAs), and long ncRNAs (lncRNAs), which have been confirmed to play important roles in tumor cell proliferation, invasion, and migration. Specifically, recent studies have shown that ncRNAs contribute to tumorigenesis and tumor development by mediating changes in enzymes related to lipid metabolism. The purpose of this review is to discuss the characterized ncRNAs involved in the lipid metabolism of tumors to highlight ncRNA-mediated lipid metabolism-related enzyme expression in malignant tumors and its importance to tumor development. In this review, we describe the types of ncRNA and the mechanism of tumor lipid metabolism and analyze the important role of ncRNA in tumor lipid metabolism and its future prospects from the perspectives of ncRNA biological function and lipid metabolic enzyme classification. However, several critical issues still need to be resolved. Because ncRNAs can affect tumor processes by regulating lipid metabolism enzymes, in the future, we can study the unique role of ncRNAs from four aspects: disease prevention, detection, diagnosis, and treatment. Therefore, in the future, the development of ncRNA-targeted therapy will become a hot direction and shoulder a major task in the medical field.
Collapse
Affiliation(s)
- Ye Wang
- Department of Gastrointestinal Surgery, The First Affiliated Hospital of Wannan Medical College, Yijishan Hospital of Wannan Medical College, Wuhu, Anhui, China
- Key Laboratory of Non-coding RNA Transformation Research of Anhui Higher Education Institution, Wuhu, Anhui, China
- Non-coding RNA Research Center of Wannan Medical College, Yijishan Hospital, Wuhu, Anhui, China
| | - Qian Li
- Department of Gastrointestinal Surgery, The First Affiliated Hospital of Wannan Medical College, Yijishan Hospital of Wannan Medical College, Wuhu, Anhui, China
- Key Laboratory of Non-coding RNA Transformation Research of Anhui Higher Education Institution, Wuhu, Anhui, China
- Non-coding RNA Research Center of Wannan Medical College, Yijishan Hospital, Wuhu, Anhui, China
| | - Song Wang
- Department of Gastrointestinal Surgery, The First Affiliated Hospital of Wannan Medical College, Yijishan Hospital of Wannan Medical College, Wuhu, Anhui, China
- Key Laboratory of Non-coding RNA Transformation Research of Anhui Higher Education Institution, Wuhu, Anhui, China
- Non-coding RNA Research Center of Wannan Medical College, Yijishan Hospital, Wuhu, Anhui, China
| | - Bi-jun Wang
- Department of Clinical Medicine, Clinical College of Anhui Medical University, Hefei, Anhui, China
| | - Yan Jin
- Department of Gastrointestinal Surgery, The First Affiliated Hospital of Wannan Medical College, Yijishan Hospital of Wannan Medical College, Wuhu, Anhui, China
- Key Laboratory of Non-coding RNA Transformation Research of Anhui Higher Education Institution, Wuhu, Anhui, China
- Non-coding RNA Research Center of Wannan Medical College, Yijishan Hospital, Wuhu, Anhui, China
| | - Hao Hu
- Department of Gastrointestinal Surgery, The First Affiliated Hospital of Wannan Medical College, Yijishan Hospital of Wannan Medical College, Wuhu, Anhui, China
- Key Laboratory of Non-coding RNA Transformation Research of Anhui Higher Education Institution, Wuhu, Anhui, China
- Non-coding RNA Research Center of Wannan Medical College, Yijishan Hospital, Wuhu, Anhui, China
| | - Qing-sheng Fu
- Department of Gastrointestinal Surgery, The First Affiliated Hospital of Wannan Medical College, Yijishan Hospital of Wannan Medical College, Wuhu, Anhui, China
- Key Laboratory of Non-coding RNA Transformation Research of Anhui Higher Education Institution, Wuhu, Anhui, China
- Non-coding RNA Research Center of Wannan Medical College, Yijishan Hospital, Wuhu, Anhui, China
| | - Jia-wei Wang
- Department of Gastrointestinal Surgery, The First Affiliated Hospital of Wannan Medical College, Yijishan Hospital of Wannan Medical College, Wuhu, Anhui, China
- Key Laboratory of Non-coding RNA Transformation Research of Anhui Higher Education Institution, Wuhu, Anhui, China
- Non-coding RNA Research Center of Wannan Medical College, Yijishan Hospital, Wuhu, Anhui, China
| | - Qing Wu
- Department of Gastrointestinal Surgery, The First Affiliated Hospital of Wannan Medical College, Yijishan Hospital of Wannan Medical College, Wuhu, Anhui, China
- Key Laboratory of Non-coding RNA Transformation Research of Anhui Higher Education Institution, Wuhu, Anhui, China
- Non-coding RNA Research Center of Wannan Medical College, Yijishan Hospital, Wuhu, Anhui, China
| | - Long Qian
- Department of Gastrointestinal Surgery, The First Affiliated Hospital of Wannan Medical College, Yijishan Hospital of Wannan Medical College, Wuhu, Anhui, China
- Key Laboratory of Non-coding RNA Transformation Research of Anhui Higher Education Institution, Wuhu, Anhui, China
- Non-coding RNA Research Center of Wannan Medical College, Yijishan Hospital, Wuhu, Anhui, China
| | - Ting-ting Cao
- Department of Gastrointestinal Surgery, The First Affiliated Hospital of Wannan Medical College, Yijishan Hospital of Wannan Medical College, Wuhu, Anhui, China
- Key Laboratory of Non-coding RNA Transformation Research of Anhui Higher Education Institution, Wuhu, Anhui, China
- Non-coding RNA Research Center of Wannan Medical College, Yijishan Hospital, Wuhu, Anhui, China
| | - Ya-bin Xia
- Department of Gastrointestinal Surgery, The First Affiliated Hospital of Wannan Medical College, Yijishan Hospital of Wannan Medical College, Wuhu, Anhui, China
- Key Laboratory of Non-coding RNA Transformation Research of Anhui Higher Education Institution, Wuhu, Anhui, China
- Non-coding RNA Research Center of Wannan Medical College, Yijishan Hospital, Wuhu, Anhui, China
| | - Xiao-xu Huang
- Department of Gastrointestinal Surgery, The First Affiliated Hospital of Wannan Medical College, Yijishan Hospital of Wannan Medical College, Wuhu, Anhui, China
- Key Laboratory of Non-coding RNA Transformation Research of Anhui Higher Education Institution, Wuhu, Anhui, China
- Non-coding RNA Research Center of Wannan Medical College, Yijishan Hospital, Wuhu, Anhui, China
| | - Li Xu
- Department of Gastrointestinal Surgery, The First Affiliated Hospital of Wannan Medical College, Yijishan Hospital of Wannan Medical College, Wuhu, Anhui, China
- Key Laboratory of Non-coding RNA Transformation Research of Anhui Higher Education Institution, Wuhu, Anhui, China
- Non-coding RNA Research Center of Wannan Medical College, Yijishan Hospital, Wuhu, Anhui, China
| |
Collapse
|
12
|
Lodde V, Floris M, Muroni MR, Cucca F, Idda ML. Non-coding RNAs in malaria infection. WILEY INTERDISCIPLINARY REVIEWS. RNA 2022; 13:e1697. [PMID: 34651456 PMCID: PMC9286032 DOI: 10.1002/wrna.1697] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/07/2021] [Revised: 09/02/2021] [Accepted: 09/21/2021] [Indexed: 12/31/2022]
Abstract
Malaria is one of the most severe infectious diseases affecting humans and it is caused by protozoan pathogens of the species Plasmodium (spp.). The malaria parasite Plasmodium is characterized by a complex, multistage life cycle that requires tight gene regulation which allows for host invasion and defense against host immune responses. Unfortunately, the mechanisms regulating gene expression during Plasmodium infection remain largely elusive, though several lines of evidence implicate a major involvement of non-coding RNAs (ncRNAs). The ncRNAs have been found to play a key role in regulating transcriptional and post-transcriptional events in a broad range of organisms including Plasmodium. In Plasmodium ncRNAs have been shown to regulate key events in the multistage life cycle and virulence ability. Here we review recent progress involving ncRNAs (microRNAs, long non-coding RNAs, and circular RNAs) and their role as regulators of gene expression during Plasmodium infection in human hosts with focus on the possibility of using these molecules as biomarkers for monitoring disease status. We also discuss the surprising function of ncRNAs in mediating the complex interplay between parasite and human host and future perspectives of the field. This article is categorized under: RNA in Disease and Development > RNA in Disease.
Collapse
Affiliation(s)
- Valeria Lodde
- Department of Biomedical SciencesUniversity of SassariSassariItaly
| | - Matteo Floris
- Department of Biomedical SciencesUniversity of SassariSassariItaly
| | - Maria Rosaria Muroni
- Department of Medical, Surgical, and Experimental SciencesUniversity of SassariSassariItaly
| | - Francesco Cucca
- Department of Biomedical SciencesUniversity of SassariSassariItaly
| | - Maria Laura Idda
- Institute for Genetic and Biomedical Research (IRGB), National Research Council (CNR)SassariItaly
| |
Collapse
|
13
|
Micheel J, Safrastyan A, Wollny D. Advances in Non-Coding RNA Sequencing. Noncoding RNA 2021; 7:70. [PMID: 34842804 PMCID: PMC8628893 DOI: 10.3390/ncrna7040070] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Revised: 10/22/2021] [Accepted: 10/26/2021] [Indexed: 12/11/2022] Open
Abstract
Non-coding RNAs (ncRNAs) comprise a set of abundant and functionally diverse RNA molecules. Since the discovery of the first ncRNA in the 1960s, ncRNAs have been shown to be involved in nearly all steps of the central dogma of molecular biology. In recent years, the pace of discovery of novel ncRNAs and their cellular roles has been greatly accelerated by high-throughput sequencing. Advances in sequencing technology, library preparation protocols as well as computational biology helped to greatly expand our knowledge of which ncRNAs exist throughout the kingdoms of life. Moreover, RNA sequencing revealed crucial roles of many ncRNAs in human health and disease. In this review, we discuss the most recent methodological advancements in the rapidly evolving field of high-throughput sequencing and how it has greatly expanded our understanding of ncRNA biology across a large number of different organisms.
Collapse
Affiliation(s)
| | | | - Damian Wollny
- RNA Bioinformatics/High Throughput Analysis, Faculty of Mathematics and Computer Science, Friedrich Schiller University, 07743 Jena, Germany; (J.M.); (A.S.)
| |
Collapse
|
14
|
Burgos M, Hurtado A, Jiménez R, Barrionuevo FJ. Non-Coding RNAs: lncRNAs, miRNAs, and piRNAs in Sexual Development. Sex Dev 2021; 15:335-350. [PMID: 34614501 DOI: 10.1159/000519237] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2021] [Accepted: 08/09/2021] [Indexed: 11/19/2022] Open
Abstract
Non-coding RNAs (ncRNAs) are a group of RNAs that do not encode functional proteins, including long non-coding RNAs (lncRNAs), microRNAs (miRNAs), PIWI-interacting RNAs (piRNAs), and short interfering RNAs (siRNAs). In the last 2 decades an effort has been made to uncover the role of ncRNAs during development and disease, and nowadays it is clear that these molecules have a regulatory function in many of the developmental and physiological processes where they have been studied. In this review, we provide an overview of the role of ncRNAs during gonad determination and development, focusing mainly on mammals, although we also provide information from other species, in particular when there is not much information on the function of particular types of ncRNAs during mammalian sexual development.
Collapse
Affiliation(s)
- Miguel Burgos
- Departamento de Genética e Instituto de Biotecnología, Lab. 127, Centro de Investigación Biomédica, Universidad de Granada, Granada, Spain
| | - Alicia Hurtado
- Epigenetics and Sex Development Group, Berlin Institute for Medical Systems Biology, Max-Delbrück Center for Molecular Medicine, Berlin, Germany
| | - Rafael Jiménez
- Departamento de Genética e Instituto de Biotecnología, Lab. 127, Centro de Investigación Biomédica, Universidad de Granada, Granada, Spain
| | - Francisco J Barrionuevo
- Departamento de Genética e Instituto de Biotecnología, Lab. 127, Centro de Investigación Biomédica, Universidad de Granada, Granada, Spain
| |
Collapse
|
15
|
Asim MN, Ibrahim MA, Imran Malik M, Dengel A, Ahmed S. Advances in Computational Methodologies for Classification and Sub-Cellular Locality Prediction of Non-Coding RNAs. Int J Mol Sci 2021; 22:8719. [PMID: 34445436 PMCID: PMC8395733 DOI: 10.3390/ijms22168719] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2021] [Revised: 08/02/2021] [Accepted: 08/03/2021] [Indexed: 02/06/2023] Open
Abstract
Apart from protein-coding Ribonucleic acids (RNAs), there exists a variety of non-coding RNAs (ncRNAs) which regulate complex cellular and molecular processes. High-throughput sequencing technologies and bioinformatics approaches have largely promoted the exploration of ncRNAs which revealed their crucial roles in gene regulation, miRNA binding, protein interactions, and splicing. Furthermore, ncRNAs are involved in the development of complicated diseases like cancer. Categorization of ncRNAs is essential to understand the mechanisms of diseases and to develop effective treatments. Sub-cellular localization information of ncRNAs demystifies diverse functionalities of ncRNAs. To date, several computational methodologies have been proposed to precisely identify the class as well as sub-cellular localization patterns of RNAs). This paper discusses different types of ncRNAs, reviews computational approaches proposed in the last 10 years to distinguish coding-RNA from ncRNA, to identify sub-types of ncRNAs such as piwi-associated RNA, micro RNA, long ncRNA, and circular RNA, and to determine sub-cellular localization of distinct ncRNAs and RNAs. Furthermore, it summarizes diverse ncRNA classification and sub-cellular localization determination datasets along with benchmark performance to aid the development and evaluation of novel computational methodologies. It identifies research gaps, heterogeneity, and challenges in the development of computational approaches for RNA sequence analysis. We consider that our expert analysis will assist Artificial Intelligence researchers with knowing state-of-the-art performance, model selection for various tasks on one platform, dominantly used sequence descriptors, neural architectures, and interpreting inter-species and intra-species performance deviation.
Collapse
Affiliation(s)
- Muhammad Nabeel Asim
- German Research Center for Artificial Intelligence (DFKI), 67663 Kaiserslautern, Germany; (M.A.I.); (A.D.); (S.A.)
- Department of Computer Science, Technical University of Kaiserslautern, 67663 Kaiserslautern, Germany
| | - Muhammad Ali Ibrahim
- German Research Center for Artificial Intelligence (DFKI), 67663 Kaiserslautern, Germany; (M.A.I.); (A.D.); (S.A.)
- Department of Computer Science, Technical University of Kaiserslautern, 67663 Kaiserslautern, Germany
| | - Muhammad Imran Malik
- National Center for Artificial Intelligence (NCAI), National University of Sciences and Technology, Islamabad 44000, Pakistan;
- School of Electrical Engineering & Computer Science, National University of Sciences and Technology, Islamabad 44000, Pakistan
| | - Andreas Dengel
- German Research Center for Artificial Intelligence (DFKI), 67663 Kaiserslautern, Germany; (M.A.I.); (A.D.); (S.A.)
- Department of Computer Science, Technical University of Kaiserslautern, 67663 Kaiserslautern, Germany
| | - Sheraz Ahmed
- German Research Center for Artificial Intelligence (DFKI), 67663 Kaiserslautern, Germany; (M.A.I.); (A.D.); (S.A.)
- DeepReader GmbH, Trippstadter Str. 122, 67663 Kaiserslautern, Germany
| |
Collapse
|
16
|
Chantsalnyam T, Siraj A, Tayara H, Chong KT. ncRDense: A novel computational approach for classification of non-coding RNA family by deep learning. Genomics 2021; 113:3030-3038. [PMID: 34242708 DOI: 10.1016/j.ygeno.2021.07.004] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2020] [Revised: 06/29/2021] [Accepted: 07/03/2021] [Indexed: 12/14/2022]
Abstract
With the rapidly growing importance of biological research, non-coding RNAs (ncRNA) attract more attention in biology and bioinformatics. They play vital roles in biological processes such as transcription and translation. Classification of ncRNAs is essential to our understanding of disease mechanisms and treatment design. Many approaches to ncRNA classification have been developed, several of which use machine learning and deep learning. In this paper, we construct a novel deep learning-based architecture, ncRDense, to effectively classify and distinguish ncRNA families. In a comparative study, our model produces comparable results with existing state-of-the-art methods. Finally, we built a freely accessible web server for the ncRDense tool, which is available at http://nsclbio.jbnu.ac.kr/tools/ncRDense/.
Collapse
Affiliation(s)
- Tuvshinbayar Chantsalnyam
- Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju 54896, South Korea
| | - Arslan Siraj
- Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju 54896, South Korea
| | - Hilal Tayara
- School of International Engineering and Science, Jeonbuk National University, Jeonju 54896, South Korea.
| | - Kil To Chong
- Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju 54896, South Korea; Advanced Electronics and Information Research Center, Jeonbuk National University, Jeonju 54896, South Korea.
| |
Collapse
|