1
|
Wang Z, Dai Q, Song J, Duan X, Yang H, Yang Z. Predicting RBP Binding Sites of RNA With High-Order Encoding Features and CNN-BLSTM Hybrid Model. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:2409-2419. [PMID: 34038367 DOI: 10.1109/tcbb.2021.3083930] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
RNA binding protein (RBP) is extensively involved in various cellular regulatory processes through the interaction with RNAs. Capturing the RBP binding preferences is fundamental for revealing the pathogenesis of complex diseases. Many experimental detection techniques are still time-consuming and labor-intensive, therefore, it is indispensable to develop a computational method with convincing accuracy. In this study, we proposed a CNN-BLSTM hybrid deep learning framework, named DeepDW, for predicting the RBP binding sites on RNAs with high-order encoding features of RNA sequence and secondary structure. The high-order encoding strategy was used to characterize the dependencies among adjacency nucleotides. For CNN-BLSTM hybrid model, DeepDW first employed two 1-D convolutional neural networks (CNNs) for learning the local features from high-order encoded matrices of RNA sequence and structure separately, and then applied two bidirectional long short-term memory networks (BLSTMs) to capture the global information in a higher level. Moreover, a series of experiments were carried out on 31 public datasets to evaluate our proposed framework, and DeepDW achieved superior performance than the state-of-the-art methods. The results indicated that the combination of high-order encoding method and CNN-BLSTM hybrid model had advantages in identifying RBP-RNA binding sites.
Collapse
|
2
|
He F, Wei R, Zhou Z, Huang L, Wang Y, Tang J, Zou Y, Shi L, Gu X, Davis MJ, Su Z. Integrative Analysis of Somatic Mutations in Non-coding Regions Altering RNA Secondary Structures in Cancer Genomes. Sci Rep 2019; 9:8205. [PMID: 31160636 PMCID: PMC6546760 DOI: 10.1038/s41598-019-44489-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2018] [Accepted: 05/17/2019] [Indexed: 01/01/2023] Open
Abstract
RNA secondary structure may influence many cellular processes, including RNA processing, stability, localization, and translation. Single-nucleotide variations (SNVs) that alter RNA secondary structure, referred to as riboSNitches, are potentially causative of human diseases, especially in untranslated regions (UTRs) and noncoding RNAs (ncRNAs). The functions of somatic mutations that act as riboSNitches in cancer development remain poorly understood. In this study, we developed a computational pipeline called SNIPER (riboSNitch-enriched or depleted elements in cancer genomes), which employs MeanDiff and EucDiff to detect riboSNitches and then identifies riboSNitch-enriched or riboSNitch-depleted non-coding elements across tumors. SNIPER is available at github: https://github.com/suzhixi/SNIPER/. We found that riboSNitches were more likely to be pathogenic. Moreover, we predicted several UTRs and lncRNAs (long non-coding RNA) that significantly enriched or depleted riboSNitches in cancer genomes, indicative of potential cancer driver or essential noncoding elements. Our study highlights the possibly neglected importance of RNA secondary structure in cancer genomes and provides a new strategy to identify new cancer-associated genes.
Collapse
Affiliation(s)
- Funan He
- Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200433, China
| | - Ran Wei
- Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200433, China
| | - Zhan Zhou
- Institute of Drug Metabolism and Pharmaceutical Analysis and Zhejiang Provincial Key Laboratory of Anti-Cancer Drug Research, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Leihuan Huang
- Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200433, China
| | - Yinan Wang
- Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200433, China
| | - Jie Tang
- Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200433, China
| | - Yangyun Zou
- Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200433, China
| | - Leming Shi
- Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200433, China.,Shanghai Cancer Center and Cancer Institute, Fudan University, Shanghai, 200032, China
| | - Xun Gu
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, Iowa, 50011, USA
| | - Melissa J Davis
- Bioinformatics Division, Walter and Eliza Hall Institute of Medical Research, 1G Royal Parade, Parkville, VIC, 3052, Australia
| | - Zhixi Su
- Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200433, China. .,Singlera Genomics Inc, Shanghai, China.
| |
Collapse
|
3
|
Abstract
Small-molecule and protein/antibody drugs mainly act on genome-derived proteins to exert pharmacological effects. RNA based therapies hold the promise to expand the range of druggable targets from proteins to RNAs and the genome, as evidenced by several RNA drugs approved for clinical practice and many others under active trials. While chemo-engineered RNA mimics have found their success in marketed drugs and continue dominating basic research and drug development, these molecules are usually conjugated with extensive and various modifications. This makes them completely different from cellular RNAs transcribed from the genome that usually consist of unmodified ribonucleotides or just contain a few posttranscriptional modifications. The use of synthetic RNA mimics for RNA research and drug development is also in contrast with the ultimate success of protein research and therapy utilizing biologic or recombinant proteins produced and folded in living cells instead of polypeptides or proteins synthesized in vitro. Indeed, efforts have been made recently to develop RNA bioengineering technologies for cost-effective and large-scale production of biologic RNA molecules that may better capture the structures, functions, and safety profiles of natural RNAs. In this article, we provide an overview on RNA therapeutics for the treatment of human diseases via RNA interference mechanisms. By illustrating the structural differences between natural RNAs and chemo-engineered RNA mimics, we focus on discussion of a novel class of bioengineered/biologic RNA agents produced through fermentation and their potential applications to RNA research and drug development.
Collapse
Affiliation(s)
- Ai-Ming Yu
- Department of Biochemistry & Molecular Medicine, UC Davis School of Medicine, Sacramento, CA 95817, USA.
| | - Chao Jian
- Department of Biochemistry & Molecular Medicine, UC Davis School of Medicine, Sacramento, CA 95817, USA
| | - Allan H Yu
- Department of Biochemistry & Molecular Medicine, UC Davis School of Medicine, Sacramento, CA 95817, USA
| | - Mei-Juan Tu
- Department of Biochemistry & Molecular Medicine, UC Davis School of Medicine, Sacramento, CA 95817, USA
| |
Collapse
|
4
|
Gong J, Ju Y, Shao D, Zhang QC. Advances and challenges towards the study of RNA-RNA interactions in a transcriptome-wide scale. QUANTITATIVE BIOLOGY 2018. [DOI: 10.1007/s40484-018-0146-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/09/2022]
|
5
|
Chen Q, Lan C, Chen B, Wang L, Li J, Zhang C. Exploring Consensus RNA Substructural Patterns Using Subgraph Mining. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2017; 14:1134-1146. [PMID: 28026781 DOI: 10.1109/tcbb.2016.2645202] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Frequently recurring RNA structural motifs play important roles in RNA folding process and interaction with other molecules. Traditional index-based and shape-based schemas are useful in modeling RNA secondary structures but ignore the structural discrepancy of individual RNA family member. Further, the in-depth analysis of underlying substructure pattern is insufficient due to varied and unnormalized substructure data. This prevents us from understanding RNAs functions and their inherent synergistic regulation networks. This article thus proposes a novel labeled graph-based algorithm RnaGraph to uncover frequently RNA substructure patterns. Attribute data and graph data are combined to characterize diverse substructures and their correlations, respectively. Further, a top-k graph pattern mining algorithm is developed to extract interesting substructure motifs by integrating frequency and similarity. The experimental results show that our methods assist in not only modelling complex RNA secondary structures but also identifying hidden but interesting RNA substructure patterns.
Collapse
|
6
|
Dilley KA, Voorhies AA, Luthra P, Puri V, Stockwell TB, Lorenzi H, Basler CF, Shabman RS. The Ebola virus VP35 protein binds viral immunostimulatory and host RNAs identified through deep sequencing. PLoS One 2017. [PMID: 28636653 PMCID: PMC5479518 DOI: 10.1371/journal.pone.0178717] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Ebola virus and Marburg virus are members of the Filovirdae family and causative agents of hemorrhagic fever with high fatality rates in humans. Filovirus virulence is partially attributed to the VP35 protein, a well-characterized inhibitor of the RIG-I-like receptor pathway that triggers the antiviral interferon (IFN) response. Prior work demonstrates the ability of VP35 to block potent RIG-I activators, such as Sendai virus (SeV), and this IFN-antagonist activity is directly correlated with its ability to bind RNA. Several structural studies demonstrate that VP35 binds short synthetic dsRNAs; yet, there are no data that identify viral immunostimulatory RNAs (isRNA) or host RNAs bound to VP35 in cells. Utilizing a SeV infection model, we demonstrate that both viral isRNA and host RNAs are bound to Ebola and Marburg VP35s in cells. By deep sequencing the purified VP35-bound RNA, we identified the SeV copy-back defective interfering (DI) RNA, previously identified as a robust RIG-I activator, as the isRNA bound by multiple filovirus VP35 proteins, including the VP35 protein from the West African outbreak strain (Makona EBOV). Moreover, RNAs isolated from a VP35 RNA-binding mutant were not immunostimulatory and did not include the SeV DI RNA. Strikingly, an analysis of host RNAs bound by wild-type, but not mutant, VP35 revealed that select host RNAs are preferentially bound by VP35 in cell culture. Taken together, these data support a model in which VP35 sequesters isRNA in virus-infected cells to avert RIG-I like receptor (RLR) activation.
Collapse
Affiliation(s)
- Kari A. Dilley
- Virology Group, J. Craig Venter Institute, Rockville, Maryland, United States of America
- * E-mail: (RSS); (KAD)
| | - Alexander A. Voorhies
- Infectious Disease Group, J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Priya Luthra
- Center for Microbial Pathogenesis, Georgia State University, Atlanta, Georgia, United States of America
| | - Vinita Puri
- Virology Group, J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Timothy B. Stockwell
- Virology Group, J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Hernan Lorenzi
- Infectious Disease Group, J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Christopher F. Basler
- Center for Microbial Pathogenesis, Georgia State University, Atlanta, Georgia, United States of America
| | - Reed S. Shabman
- Virology Group, J. Craig Venter Institute, Rockville, Maryland, United States of America
- * E-mail: (RSS); (KAD)
| |
Collapse
|
7
|
Choudhary K, Deng F, Aviran S. Comparative and integrative analysis of RNA structural profiling data: current practices and emerging questions. QUANTITATIVE BIOLOGY 2017; 5:3-24. [PMID: 28717530 PMCID: PMC5510538 DOI: 10.1007/s40484-017-0093-6] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2016] [Revised: 12/08/2016] [Accepted: 12/15/2016] [Indexed: 12/30/2022]
Abstract
BACKGROUND Structure profiling experiments provide single-nucleotide information on RNA structure. Recent advances in chemistry combined with application of high-throughput sequencing have enabled structure profiling at transcriptome scale and in living cells, creating unprecedented opportunities for RNA biology. Propelled by these experimental advances, massive data with ever-increasing diversity and complexity have been generated, which give rise to new challenges in interpreting and analyzing these data. RESULTS We review current practices in analysis of structure profiling data with emphasis on comparative and integrative analysis as well as highlight emerging questions. Comparative analysis has revealed structural patterns across transcriptomes and has become an integral component of recent profiling studies. Additionally, profiling data can be integrated into traditional structure prediction algorithms to improve prediction accuracy. CONCLUSIONS To keep pace with experimental developments, methods to facilitate, enhance and refine such analyses are needed. Parallel advances in analysis methodology will complement profiling technologies and help them reach their full potential.
Collapse
Affiliation(s)
| | | | - Sharon Aviran
- Department of Biomedical Engineering and Genome Center, University of California at Davis, Davis, CA 95616, USA
| |
Collapse
|
8
|
Weldon C, Eperon IC, Dominguez C. Do we know whether potential G-quadruplexes actually form in long functional RNA molecules? Biochem Soc Trans 2016; 44:1761-1768. [PMID: 27913687 PMCID: PMC5135001 DOI: 10.1042/bst20160109] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2016] [Revised: 06/30/2016] [Accepted: 07/04/2016] [Indexed: 01/25/2023]
Abstract
The roles of deoxyribonucleic acid (DNA) G-quadruplex structures in gene expression and telomere maintenance have been well characterized. Recent results suggest that such structures could also play pivotal roles in ribonucleic acid (RNA) biology, such as splicing or translation regulation. However, it has been difficult to show that RNA G-quadruplexes (G4s) exist in specific long RNA sequences, such as precursor messenger RNA, in a functional or cellular context. Most current methods for identifying G4s involve the use of short, purified RNA sequences in vitro, in the absence of competition with secondary structures or protein binding. Therefore, novel methods need to be developed to allow the characterization of G4s in long functional RNAs and in a cellular context. This need has in part been met by our recent development of a method based on a comparison of RNA and 7-deaza-RNA that provides a test for identifying RNA G4s in such conditions.
Collapse
Affiliation(s)
- Carika Weldon
- Leicester Institute of Structural and Chemical Biology and Department of Molecular and Cellular Biology, University of Leicester, Lancaster Road, Leicester LE1 9HN, U.K
| | - Ian C Eperon
- Leicester Institute of Structural and Chemical Biology and Department of Molecular and Cellular Biology, University of Leicester, Lancaster Road, Leicester LE1 9HN, U.K
| | - Cyril Dominguez
- Leicester Institute of Structural and Chemical Biology and Department of Molecular and Cellular Biology, University of Leicester, Lancaster Road, Leicester LE1 9HN, U.K
| |
Collapse
|
9
|
Transcriptome-wide interrogation of RNA secondary structure in living cells with icSHAPE. Nat Protoc 2016; 11:273-90. [PMID: 26766114 DOI: 10.1038/nprot.2016.011] [Citation(s) in RCA: 106] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
icSHAPE (in vivo click selective 2-hydroxyl acylation and profiling experiment) captures RNA secondary structure at a transcriptome-wide level by measuring nucleotide flexibility at base resolution. Living cells are treated with the icSHAPE chemical NAI-N3 followed by selective chemical enrichment of NAI-N3-modified RNA, which provides an improved signal-to-noise ratio compared with similar methods leveraging deep sequencing. Purified RNA is then reverse-transcribed to produce cDNA, with SHAPE-modified bases leading to truncated cDNA. After deep sequencing of cDNA, computational analysis yields flexibility scores for every base across the starting RNA population. The entire experimental procedure can be completed in ∼5 d, and the sequencing and bioinformatics data analysis take an additional 4-5 d with no extensive computational skills required. Comparing in vivo and in vitro icSHAPE measurements can reveal in vivo RNA-binding protein imprints or facilitate the dissection of RNA post-transcriptional modifications. icSHAPE reactivities can additionally be used to constrain and improve RNA secondary structure prediction models.
Collapse
|