1
|
Aviran S, Incarnato D. Computational approaches for RNA structure ensemble deconvolution from structure probing data. J Mol Biol 2022; 434:167635. [PMID: 35595163 DOI: 10.1016/j.jmb.2022.167635] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Revised: 04/29/2022] [Accepted: 05/05/2022] [Indexed: 12/15/2022]
Abstract
RNA structure probing experiments have emerged over the last decade as a straightforward way to determine the structure of RNA molecules in a number of different contexts. Although powerful, the ability of RNA to dynamically interconvert between, and to simultaneously populate, alternative structural configurations, poses a nontrivial challenge to the interpretation of data derived from these experiments. Recent efforts aimed at developing computational methods for the reconstruction of coexisting alternative RNA conformations from structure probing data are paving the way to the study of RNA structure ensembles, even in the context of living cells. In this review, we critically discuss these methods, their limitations and possible future improvements.
Collapse
Affiliation(s)
- Sharon Aviran
- Biomedical Engineering Department and Genome Center, University of California, Davis, CA, USA.
| | - Danny Incarnato
- Department of Molecular Genetics, Groningen Biomolecular Sciences and Biotechnology Institute (GBB), University of Groningen, Groningen, the Netherlands.
| |
Collapse
|
2
|
Advances in the Bioinformatics Knowledge of mRNA Polyadenylation in Baculovirus Genes. Viruses 2020; 12:v12121395. [PMID: 33291215 PMCID: PMC7762203 DOI: 10.3390/v12121395] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2020] [Revised: 11/19/2020] [Accepted: 11/30/2020] [Indexed: 11/17/2022] Open
Abstract
Baculoviruses are a group of insect viruses with large circular dsDNA genomes exploited in numerous biotechnological applications, such as the biological control of agricultural pests, the expression of recombinant proteins or the gene delivery of therapeutic sequences in mammals, among others. Their genomes encode between 80 and 200 proteins, of which 38 are shared by all reported species. Thanks to multi-omic studies, there is remarkable information about the baculoviral proteome and the temporality in the virus gene expression. This allows some functional elements of the genome to be very well described, such as promoters and open reading frames. However, less information is available about the transcription termination signals and, consequently, there are still imprecisions about what are the limits of the transcriptional units present in the baculovirus genomes and how is the processing of the 3′ end of viral mRNA. Regarding to this, in this review we provide an update about the characteristics of DNA signals involved in this process and we contribute to their correct prediction through an exhaustive analysis that involves bibliography information, data mining, RNA structure and a comprehensive study of the core gene 3′ ends from 180 baculovirus genomes.
Collapse
|
3
|
Abstract
RNA performs and regulates a diverse range of cellular processes, with new functional roles being uncovered at a rapid pace. Interest is growing in how these functions are linked to RNA structures that form in the complex cellular environment. A growing suite of technologies that use advances in RNA structural probes, high-throughput sequencing and new computational approaches to interrogate RNA structure at unprecedented throughput are beginning to provide insights into RNA structures at new spatial, temporal and cellular scales.
Collapse
Affiliation(s)
- Eric J Strobel
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, USA
| | - Angela M Yu
- Tri-Institutional Training Program in Computational Biology and Medicine, Weill Cornell Medicine, New York, NY, USA
| | - Julius B Lucks
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, USA.
| |
Collapse
|
4
|
Qian X, Zhao J, Yeung PY, Zhang QC, Kwok CK. Revealing lncRNA Structures and Interactions by Sequencing-Based Approaches. Trends Biochem Sci 2018; 44:33-52. [PMID: 30459069 DOI: 10.1016/j.tibs.2018.09.012] [Citation(s) in RCA: 297] [Impact Index Per Article: 49.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2018] [Revised: 09/11/2018] [Accepted: 09/19/2018] [Indexed: 11/28/2022]
Abstract
Long noncoding RNAs (lncRNAs) have emerged as significant players in almost every level of gene function and regulation. Thus, characterizing the structures and interactions of lncRNAs is essential for understanding their mechanistic roles in cells. Through a combination of (bio)chemical approaches and automated capillary and high-throughput sequencing (HTS), the complexity and diversity of RNA structures and interactions has been revealed in the transcriptomes of multiple species. These methods have uncovered important biological insights into the mechanistic and functional roles of lncRNA in gene expression and RNA metabolism, as well as in development and disease. In this review, we summarize the latest sequencing strategies to reveal RNA structure, RNA-RNA, RNA-DNA, and RNA-protein interactions, and highlight the recent applications of these approaches to map functional lncRNAs. We discuss the advantages and limitations of these strategies, and provide recommendations to further advance methodologies capable of mapping RNA structure and interactions in order to discover new biology of lncRNAs and decipher their molecular mechanisms and implication in diseases.
Collapse
Affiliation(s)
- Xingyang Qian
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, Tsinghua-Peking Joint Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China; These authors contributed equally to this work
| | - Jieyu Zhao
- Department of Chemistry, City University of Hong Kong, Kowloon Tong, Hong Kong SAR, China; These authors contributed equally to this work
| | - Pui Yan Yeung
- Department of Chemistry, City University of Hong Kong, Kowloon Tong, Hong Kong SAR, China; These authors contributed equally to this work
| | - Qiangfeng Cliff Zhang
- MOE Key Laboratory of Bioinformatics, Center for Synthetic and Systems Biology, Tsinghua-Peking Joint Center for Life Sciences, School of Life Sciences, Tsinghua University, Beijing 100084, China.
| | - Chun Kit Kwok
- Department of Chemistry, City University of Hong Kong, Kowloon Tong, Hong Kong SAR, China.
| |
Collapse
|
5
|
Choudhary K, Ruan L, Deng F, Shih N, Aviran S. SEQualyzer: interactive tool for quality control and exploratory analysis of high-throughput RNA structural profiling data. Bioinformatics 2018; 33:441-443. [PMID: 28172632 DOI: 10.1093/bioinformatics/btw627] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2016] [Revised: 09/25/2016] [Accepted: 09/26/2016] [Indexed: 11/14/2022] Open
Abstract
Summary To serve numerous functional roles, RNA must fold into specific structures. Determining these structures is thus of paramount importance. The recent advent of high-throughput sequencing-based structure profiling experiments has provided important insights into RNA structure and widened the scope of RNA studies. However, as a broad range of approaches continues to emerge, a universal framework is needed to quantitatively ensure consistent and high-quality data. We present SEQualyzer, a visual and interactive application that makes it easy and efficient to gauge data quality, screen for transcripts with high-quality information and identify discordant replicates in structure profiling experiments. Our methods rely on features common to a wide range of protocols and can serve as standards for quality control and analyses. Availability and Implementation SEQualyzer is written in R, is platform-independent, and is freely available at http://bme.ucdavis.edu/aviranlab/SEQualyzer. Contact saviran@ucdavis.edu Supplementary Informantion Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Krishna Choudhary
- Department of Biomedical Engineering and Genome Center, University of California at Davis, Davis, CA, USA
| | - Luyao Ruan
- Department of Biomedical Engineering and Genome Center, University of California at Davis, Davis, CA, USA
| | - Fei Deng
- Department of Biomedical Engineering and Genome Center, University of California at Davis, Davis, CA, USA
| | - Nathan Shih
- Department of Biomedical Engineering and Genome Center, University of California at Davis, Davis, CA, USA
| | - Sharon Aviran
- Department of Biomedical Engineering and Genome Center, University of California at Davis, Davis, CA, USA
| |
Collapse
|
6
|
Lee YJ, Wang Q, Rio DC. Coordinate regulation of alternative pre-mRNA splicing events by the human RNA chaperone proteins hnRNPA1 and DDX5. Genes Dev 2018; 32:1060-1074. [PMID: 30042133 PMCID: PMC6075143 DOI: 10.1101/gad.316034.118] [Citation(s) in RCA: 39] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2018] [Accepted: 05/29/2018] [Indexed: 01/12/2023]
Abstract
Alternative premessenger RNA (pre-mRNA) splicing is a post-transcriptional mechanism for controlling gene expression. Splicing patterns are determined by both RNA-binding proteins and nuclear pre-mRNA structure. Here, we analyzed pre-mRNA splicing patterns, RNA-binding sites, and RNA structures near these binding sites coordinately controlled by two splicing factors: the heterogeneous nuclear ribonucleoprotein hnRNPA1 and the RNA helicase DDX5. We identified thousands of alternative pre-mRNA splicing events controlled by these factors by RNA sequencing (RNA-seq) following RNAi. Enhanced cross-linking and immunoprecipitation (eCLIP) on nuclear extracts was used to identify protein-RNA-binding sites for both proteins in the nuclear transcriptome. We found a significant overlap between hnRNPA1 and DDX5 splicing targets and that they share many closely linked binding sites as determined by eCLIP analysis. In vivo SHAPE (selective 2'-hydroxyl acylation analyzed by primer extension) chemical RNA structure probing data were used to model RNA structures near several exons controlled and bound by both proteins. Both sequence motifs and in vivo UV cross-linking sites for hnRNPA1 and DDX5 were used to map binding sites in their RNA targets, and often these sites flanked regions of higher chemical reactivity, suggesting an organized nature of nuclear pre-mRNPs. This work provides a first glimpse into the possible RNA structures surrounding pre-mRNA splicing factor-binding sites.
Collapse
Affiliation(s)
- Yeon J Lee
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, California 94720, USA
- Center for RNA Systems Biology, University of California at Berkeley, Berkeley, California 94720, USA
- California Institute for Quantitative Biosciences (QB3), University of California at Berkeley, Berkeley, California 94720, USA
| | - Qingqing Wang
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, California 94720, USA
- Center for RNA Systems Biology, University of California at Berkeley, Berkeley, California 94720, USA
- California Institute for Quantitative Biosciences (QB3), University of California at Berkeley, Berkeley, California 94720, USA
| | - Donald C Rio
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, California 94720, USA
- Center for RNA Systems Biology, University of California at Berkeley, Berkeley, California 94720, USA
- California Institute for Quantitative Biosciences (QB3), University of California at Berkeley, Berkeley, California 94720, USA
| |
Collapse
|
7
|
Kwok CK, Marsico G, Balasubramanian S. Detecting RNA G-Quadruplexes (rG4s) in the Transcriptome. Cold Spring Harb Perspect Biol 2018; 10:10/7/a032284. [PMID: 29967010 DOI: 10.1101/cshperspect.a032284] [Citation(s) in RCA: 80] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
RNA G-quadruplex (rG4) secondary structures are proposed to play key roles in fundamental biological processes that include the modulation of transcriptional, co-transcriptional, and posttranscriptional events. Recent methodological developments that include predictive algorithms and structure-based sequencing have enabled the detection and mapping of rG4 structures on a transcriptome-wide scale at high sensitivity and resolution. The data generated by these studies provide valuable insights into the potentially diverse roles of rG4s in biology and open up a number of mechanistic hypotheses. Herein we highlight these methodologies and discuss the associated findings in relation to rG4-related biological mechanisms.
Collapse
Affiliation(s)
- Chun Kit Kwok
- Department of Chemistry, City University of Hong Kong, Hong Kong SAR, China
| | - Giovanni Marsico
- Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, United Kingdom.,Cancer Research UK Cambridge Institute, University of Cambridge, Cambridge CB2 0RE, United Kingdom
| | - Shankar Balasubramanian
- Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, United Kingdom.,Cancer Research UK Cambridge Institute, University of Cambridge, Cambridge CB2 0RE, United Kingdom
| |
Collapse
|
8
|
Watters KE, Choudhary K, Aviran S, Lucks JB, Perry KL, Thompson JR. Probing of RNA structures in a positive sense RNA virus reveals selection pressures for structural elements. Nucleic Acids Res 2018; 46:2573-2584. [PMID: 29294088 PMCID: PMC5861449 DOI: 10.1093/nar/gkx1273] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2017] [Revised: 12/07/2017] [Accepted: 12/18/2017] [Indexed: 12/20/2022] Open
Abstract
In single stranded (+)-sense RNA viruses, RNA structural elements (SEs) play essential roles in the infection process from replication to encapsidation. Using selective 2'-hydroxyl acylation analyzed by primer extension sequencing (SHAPE-Seq) and covariation analysis, we explore the structural features of the third genome segment of cucumber mosaic virus (CMV), RNA3 (2216 nt), both in vitro and in plant cell lysates. Comparing SHAPE-Seq and covariation analysis results revealed multiple SEs in the coat protein open reading frame and 3' untranslated region. Four of these SEs were mutated and serially passaged in Nicotiana tabacum plants to identify biologically selected changes to the original mutated sequences. After passaging, loop mutants showed partial reversion to their wild-type sequence and SEs that were structurally disrupted by mutations were restored to wild-type-like structures via synonymous mutations in planta. These results support the existence and selection of virus open reading frame SEs in the host organism and provide a framework for further studies on the role of RNA structure in viral infection. Additionally, this work demonstrates the applicability of high-throughput chemical probing in plant cell lysates and presents a new method for calculating SHAPE reactivities from overlapping reverse transcriptase priming sites.
Collapse
Affiliation(s)
- Kyle E Watters
- Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA
| | - Krishna Choudhary
- Department of Biomedical Engineering and Genome Center, University of California Davis, Davis, CA, USA
| | - Sharon Aviran
- Department of Biomedical Engineering and Genome Center, University of California Davis, Davis, CA, USA
| | - Julius B Lucks
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL 60201, USA
| | - Keith L Perry
- Plant Pathology and Plant-Microbe Biology Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, USA
| | - Jeremy R Thompson
- Plant Pathology and Plant-Microbe Biology Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, USA
| |
Collapse
|
9
|
Statistical modeling of RNA structure profiling experiments enables parsimonious reconstruction of structure landscapes. Nat Commun 2018; 9:606. [PMID: 29426922 PMCID: PMC5807309 DOI: 10.1038/s41467-018-02923-8] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2017] [Accepted: 01/09/2018] [Indexed: 11/23/2022] Open
Abstract
RNA plays key regulatory roles in diverse cellular processes, where its functionality often derives from folding into and converting between structures. Many RNAs further rely on co-existence of alternative structures, which govern their response to cellular signals. However, characterizing heterogeneous landscapes is difficult, both experimentally and computationally. Recently, structure profiling experiments have emerged as powerful and affordable structure characterization methods, which improve computational structure prediction. To date, efforts have centered on predicting one optimal structure, with much less progress made on multiple-structure prediction. Here, we report a probabilistic modeling approach that predicts a parsimonious set of co-existing structures and estimates their abundances from structure profiling data. We demonstrate robust landscape reconstruction and quantitative insights into structural dynamics by analyzing numerous data sets. This work establishes a framework for data-directed characterization of structure landscapes to aid experimentalists in performing structure-function studies. Different experimental and computational approaches can be used to study RNA structures. Here, the authors present a computational method for data-directed reconstruction of complex RNA structure landscapes, which predicts a parsimonious set of co-existing structures and estimates their abundances from structure profiling data.
Collapse
|
10
|
Jonkhout N, Tran J, Smith MA, Schonrock N, Mattick JS, Novoa EM. The RNA modification landscape in human disease. RNA (NEW YORK, N.Y.) 2017; 23:1754-1769. [PMID: 28855326 PMCID: PMC5688997 DOI: 10.1261/rna.063503.117] [Citation(s) in RCA: 377] [Impact Index Per Article: 53.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
RNA modifications have been historically considered as fine-tuning chemo-structural features of infrastructural RNAs, such as rRNAs, tRNAs, and snoRNAs. This view has changed dramatically in recent years, to a large extent as a result of systematic efforts to map and quantify various RNA modifications in a transcriptome-wide manner, revealing that RNA modifications are reversible, dynamically regulated, far more widespread than originally thought, and involved in major biological processes, including cell differentiation, sex determination, and stress responses. Here we summarize the state of knowledge and provide a catalog of RNA modifications and their links to neurological disorders, cancers, and other diseases. With the advent of direct RNA-sequencing technologies, we expect that this catalog will help prioritize those RNA modifications for transcriptome-wide maps.
Collapse
Affiliation(s)
- Nicky Jonkhout
- Garvan Institute of Medical Research, Darlinghurst, 2010 NSW, Australia
- St. Vincent's Clinical School, Faculty of Medicine, University of New South Wales, Kensington NSW 2052, Australia
| | - Julia Tran
- Garvan Institute of Medical Research, Darlinghurst, 2010 NSW, Australia
| | - Martin A Smith
- Garvan Institute of Medical Research, Darlinghurst, 2010 NSW, Australia
- St. Vincent's Clinical School, Faculty of Medicine, University of New South Wales, Kensington NSW 2052, Australia
| | - Nicole Schonrock
- Garvan Institute of Medical Research, Darlinghurst, 2010 NSW, Australia
- Genome.One, Darlinghurst, 2010 NSW, Australia
| | - John S Mattick
- Garvan Institute of Medical Research, Darlinghurst, 2010 NSW, Australia
- St. Vincent's Clinical School, Faculty of Medicine, University of New South Wales, Kensington NSW 2052, Australia
| | - Eva Maria Novoa
- Garvan Institute of Medical Research, Darlinghurst, 2010 NSW, Australia
- St. Vincent's Clinical School, Faculty of Medicine, University of New South Wales, Kensington NSW 2052, Australia
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02139, USA
| |
Collapse
|
11
|
Dawn of the in vivo RNA structurome and interactome. Biochem Soc Trans 2017; 44:1395-1410. [PMID: 27911722 DOI: 10.1042/bst20160075] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2016] [Revised: 06/19/2016] [Accepted: 07/04/2016] [Indexed: 12/11/2022]
Abstract
RNA is one of the most fascinating biomolecules in living systems given its structural versatility to fold into elaborate architectures for important biological functions such as gene regulation, catalysis, and information storage. Knowledge of RNA structures and interactions can provide deep insights into their functional roles in vivo For decades, RNA structural studies have been conducted on a transcript-by-transcript basis. The advent of next-generation sequencing (NGS) has enabled the development of transcriptome-wide structural probing methods to profile the global landscape of RNA structures and interactions, also known as the RNA structurome and interactome, which transformed our understanding of the RNA structure-function relationship on a transcriptomic scale. In this review, molecular tools and NGS methods used for RNA structure probing are presented, novel insights uncovered by RNA structurome and interactome studies are highlighted, and perspectives on current challenges and potential future directions are discussed. A more complete understanding of the RNA structures and interactions in vivo will help illuminate the novel roles of RNA in gene regulation, development, and diseases.
Collapse
|
12
|
Li B, Tambe A, Aviran S, Pachter L. PROBer Provides a General Toolkit for Analyzing Sequencing-Based Toeprinting Assays. Cell Syst 2017; 4:568-574.e7. [PMID: 28501650 DOI: 10.1016/j.cels.2017.04.007] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2016] [Revised: 12/19/2016] [Accepted: 04/13/2017] [Indexed: 11/19/2022]
Abstract
A number of sequencing-based transcriptase drop-off assays have recently been developed to probe post-transcriptional dynamics of RNA-protein interaction, RNA structure, and RNA modification. Although these assays survey a diverse set of epitranscriptomic marks, we use the term toeprinting assays since they share methodological similarities. Their interpretation is predicated on addressing a similar computational challenge: how to learn isoform-specific chemical modification profiles in the face of complex read multi-mapping. We introduce PROBer, a statistical model and associated software, that addresses this challenge for the analysis of toeprinting assays. PROBer takes sequencing data as input and outputs estimated transcript abundances and isoform-specific modification profiles. Results on both simulated and biological data demonstrate that PROBer significantly outperforms individual methods tailored for specific toeprinting assays. Since the space of toeprinting assays is ever expanding and these assays are likely to be performed and analyzed together, we believe PROBer's unified data analysis solution will be valuable to the RNA community.
Collapse
Affiliation(s)
- Bo Li
- Center for RNA Systems Biology, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Akshay Tambe
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Sharon Aviran
- Department of Biomedical Engineering and Genome Center, University of California, Davis, Davis, CA 95616, USA
| | - Lior Pachter
- Departments of Biology and Computing & Mathematical Sciences, California Institute of Technology, Pasadena, CA 91125, USA.
| |
Collapse
|
13
|
Choudhary K, Deng F, Aviran S. Comparative and integrative analysis of RNA structural profiling data: current practices and emerging questions. QUANTITATIVE BIOLOGY 2017; 5:3-24. [PMID: 28717530 PMCID: PMC5510538 DOI: 10.1007/s40484-017-0093-6] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2016] [Revised: 12/08/2016] [Accepted: 12/15/2016] [Indexed: 12/30/2022]
Abstract
BACKGROUND Structure profiling experiments provide single-nucleotide information on RNA structure. Recent advances in chemistry combined with application of high-throughput sequencing have enabled structure profiling at transcriptome scale and in living cells, creating unprecedented opportunities for RNA biology. Propelled by these experimental advances, massive data with ever-increasing diversity and complexity have been generated, which give rise to new challenges in interpreting and analyzing these data. RESULTS We review current practices in analysis of structure profiling data with emphasis on comparative and integrative analysis as well as highlight emerging questions. Comparative analysis has revealed structural patterns across transcriptomes and has become an integral component of recent profiling studies. Additionally, profiling data can be integrated into traditional structure prediction algorithms to improve prediction accuracy. CONCLUSIONS To keep pace with experimental developments, methods to facilitate, enhance and refine such analyses are needed. Parallel advances in analysis methodology will complement profiling technologies and help them reach their full potential.
Collapse
Affiliation(s)
| | | | - Sharon Aviran
- Department of Biomedical Engineering and Genome Center, University of California at Davis, Davis, CA 95616, USA
| |
Collapse
|
14
|
DMS-MaPseq for genome-wide or targeted RNA structure probing in vivo. Nat Methods 2016; 14:75-82. [PMID: 27819661 DOI: 10.1038/nmeth.4057] [Citation(s) in RCA: 268] [Impact Index Per Article: 33.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2016] [Accepted: 09/29/2016] [Indexed: 12/25/2022]
Abstract
Coupling of structure-specific in vivo chemical modification to next-generation sequencing is transforming RNA secondary structure studies in living cells. The dominant strategy for detecting in vivo chemical modifications uses reverse transcriptase truncation products, which introduce biases and necessitate population-average assessments of RNA structure. Here we present dimethyl sulfate (DMS) mutational profiling with sequencing (DMS-MaPseq), which encodes DMS modifications as mismatches using a thermostable group II intron reverse transcriptase. DMS-MaPseq yields a high signal-to-noise ratio, can report multiple structural features per molecule, and allows both genome-wide studies and focused in vivo investigations of even low-abundance RNAs. We apply DMS-MaPseq for the first analysis of RNA structure within an animal tissue and to identify a functional structure involved in noncanonical translation initiation. Additionally, we use DMS-MaPseq to compare the in vivo structure of pre-mRNAs with their mature isoforms. These applications illustrate DMS-MaPseq's capacity to dramatically expand in vivo analysis of RNA structure.
Collapse
|
15
|
Choudhary K, Shih NP, Deng F, Ledda M, Li B, Aviran S. Metrics for rapid quality control in RNA structure probing experiments. Bioinformatics 2016; 32:3575-3583. [PMID: 27497441 DOI: 10.1093/bioinformatics/btw501] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2015] [Revised: 07/02/2016] [Accepted: 07/26/2016] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION The diverse functionalities of RNA can be attributed to its capacity to form complex and varied structures. The recent proliferation of new structure probing techniques coupled with high-throughput sequencing has helped RNA studies expand in both scope and depth. Despite differences in techniques, most experiments face similar challenges in reproducibility due to the stochastic nature of chemical probing and sequencing. As these protocols expand to transcriptome-wide studies, quality control becomes a more daunting task. General and efficient methodologies are needed to quantify variability and quality in the wide range of current and emerging structure probing experiments. RESULTS We develop metrics to rapidly and quantitatively evaluate data quality from structure probing experiments, demonstrating their efficacy on both small synthetic libraries and transcriptome-wide datasets. We use a signal-to-noise ratio concept to evaluate replicate agreement, which has the capacity to identify high-quality data. We also consider and compare two methods to assess variability inherent in probing experiments, which we then utilize to evaluate the coverage adjustments needed to meet desired quality. The developed metrics and tools will be useful in summarizing large-scale datasets and will help standardize quality control in the field. AVAILABILITY AND IMPLEMENTATION The data and methods used in this article are freely available at: http://bme.ucdavis.edu/aviranlab/SPEQC_software CONTACT: saviran@ucdavis.eduSupplementary information: Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Krishna Choudhary
- Department of Biomedical Engineering and Genome Center, University of California at Davis, Davis, CA, USA
| | - Nathan P Shih
- Department of Biomedical Engineering and Genome Center, University of California at Davis, Davis, CA, USA
| | - Fei Deng
- Department of Biomedical Engineering and Genome Center, University of California at Davis, Davis, CA, USA
| | - Mirko Ledda
- Department of Biomedical Engineering and Genome Center, University of California at Davis, Davis, CA, USA
| | - Bo Li
- Center for RNA Systems Biology, University of California at Berkeley, Berkeley, CA, USA
| | - Sharon Aviran
- Department of Biomedical Engineering and Genome Center, University of California at Davis, Davis, CA, USA
| |
Collapse
|
16
|
Deng F, Ledda M, Vaziri S, Aviran S. Data-directed RNA secondary structure prediction using probabilistic modeling. RNA (NEW YORK, N.Y.) 2016; 22:1109-1119. [PMID: 27251549 PMCID: PMC4931104 DOI: 10.1261/rna.055756.115] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/21/2015] [Accepted: 04/26/2016] [Indexed: 06/05/2023]
Abstract
Structure dictates the function of many RNAs, but secondary RNA structure analysis is either labor intensive and costly or relies on computational predictions that are often inaccurate. These limitations are alleviated by integration of structure probing data into prediction algorithms. However, existing algorithms are optimized for a specific type of probing data. Recently, new chemistries combined with advances in sequencing have facilitated structure probing at unprecedented scale and sensitivity. These novel technologies and anticipated wealth of data highlight a need for algorithms that readily accommodate more complex and diverse input sources. We implemented and investigated a recently outlined probabilistic framework for RNA secondary structure prediction and extended it to accommodate further refinement of structural information. This framework utilizes direct likelihood-based calculations of pseudo-energy terms per considered structural context and can readily accommodate diverse data types and complex data dependencies. We use real data in conjunction with simulations to evaluate performances of several implementations and to show that proper integration of structural contexts can lead to improvements. Our tests also reveal discrepancies between real data and simulations, which we show can be alleviated by refined modeling. We then propose statistical preprocessing approaches to standardize data interpretation and integration into such a generic framework. We further systematically quantify the information content of data subsets, demonstrating that high reactivities are major drivers of SHAPE-directed predictions and that better understanding of less informative reactivities is key to further improvements. Finally, we provide evidence for the adaptive capability of our framework using mock probe simulations.
Collapse
Affiliation(s)
- Fei Deng
- Department of Biomedical Engineering and Genome Center, University of California at Davis, Davis, California 95616, USA
| | - Mirko Ledda
- Department of Biomedical Engineering and Genome Center, University of California at Davis, Davis, California 95616, USA
| | - Sana Vaziri
- Department of Biomedical Engineering and Genome Center, University of California at Davis, Davis, California 95616, USA
| | - Sharon Aviran
- Department of Biomedical Engineering and Genome Center, University of California at Davis, Davis, California 95616, USA
| |
Collapse
|
17
|
Kutchko KM, Laederach A. Transcending the prediction paradigm: novel applications of SHAPE to RNA function and evolution. WILEY INTERDISCIPLINARY REVIEWS-RNA 2016; 8. [PMID: 27396578 PMCID: PMC5179297 DOI: 10.1002/wrna.1374] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/04/2016] [Revised: 04/29/2016] [Accepted: 05/23/2016] [Indexed: 12/31/2022]
Abstract
Selective 2′‐hydroxyl acylation analyzed by primer extension (SHAPE) provides information on RNA structure at single‐nucleotide resolution. It is most often used in conjunction with RNA secondary structure prediction algorithms as a probabilistic or thermodynamic restraint. With the recent advent of ultra‐high‐throughput approaches for collecting SHAPE data, the applications of this technology are extending beyond structure prediction. In this review, we discuss recent applications of SHAPE data in the transcriptomic context and how this new experimental paradigm is changing our understanding of these experiments and RNA folding in general. SHAPE experiments probe both the secondary and tertiary structure of an RNA, suggesting that model‐free approaches for within and comparative RNA structure analysis can provide significant structural insight without the need for a full structural model. New methods incorporating SHAPE at different nucleotide resolutions are required to parse these transcriptomic data sets to transcend secondary structure modeling with global structural metrics. These ‘multiscale’ approaches provide deeper insights into RNA global structure, evolution, and function in the cell. WIREs RNA 2017, 8:e1374. doi: 10.1002/wrna.1374 For further resources related to this article, please visit the WIREs website.
Collapse
Affiliation(s)
- Katrina M Kutchko
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.,Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Alain Laederach
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.,Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| |
Collapse
|
18
|
Watters KE, Yu AM, Strobel EJ, Settle AH, Lucks JB. Characterizing RNA structures in vitro and in vivo with selective 2'-hydroxyl acylation analyzed by primer extension sequencing (SHAPE-Seq). Methods 2016; 103:34-48. [PMID: 27064082 DOI: 10.1016/j.ymeth.2016.04.002] [Citation(s) in RCA: 61] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2015] [Revised: 04/01/2016] [Accepted: 04/04/2016] [Indexed: 01/08/2023] Open
Abstract
RNA molecules adopt a wide variety of structures that perform many cellular functions, including, among others, catalysis, small molecule sensing, and cellular defense. Our ability to characterize, predict, and design RNA structures are key factors for understanding and controlling the biological roles of RNAs. Fortunately, there has been rapid progress in this area, especially with respect to experimental methods that can characterize RNA structures in a high throughput fashion using chemical probing and next-generation sequencing. Here, we describe one such method, selective 2'-hydroxyl acylation analyzed by primer extension sequencing (SHAPE-Seq), which measures nucleotide resolution flexibility information for RNAs in vitro and in vivo. We outline the process of designing and performing a SHAPE-Seq experiment and describe methods for using experimental SHAPE-Seq data to restrain computational folding algorithms to generate more accurate predictions of RNA secondary structure. We also provide a number of examples of SHAPE-Seq reactivity spectra obtained in vitro and in vivo and discuss important considerations for performing SHAPE-Seq experiments, both in terms of collecting and analyzing data. Finally, we discuss improvements and extensions of these experimental and computational techniques that promise to deepen our knowledge of RNA folding and function.
Collapse
Affiliation(s)
- Kyle E Watters
- School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, NY 14853, United States
| | - Angela M Yu
- School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, NY 14853, United States; Tri-Institutional Program in Computational Biology and Medicine, Cornell University, Ithaca, New York, Weill Cornell Medical College, New York, New York, Memorial Sloan-Kettering Cancer Center, New York, New York, United States; Computational Biology Program, Memorial Sloan-Kettering Cancer Center, New York, NY 10065, United States
| | - Eric J Strobel
- School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, NY 14853, United States
| | - Alex H Settle
- School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, NY 14853, United States
| | - Julius B Lucks
- School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, NY 14853, United States.
| |
Collapse
|
19
|
Silverman IM, Berkowitz ND, Gosai SJ, Gregory BD. Genome-Wide Approaches for RNA Structure Probing. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2016; 907:29-59. [PMID: 27256381 DOI: 10.1007/978-3-319-29073-7_2] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
RNA molecules of all types fold into complex secondary and tertiary structures that are important for their function and regulation. Structural and catalytic RNAs such as ribosomal RNA (rRNA) and transfer RNA (tRNA) are central players in protein synthesis, and only function through their proper folding into intricate three-dimensional structures. Studies of messenger RNA (mRNA) regulation have also revealed that structural elements embedded within these RNA species are important for the proper regulation of their total level in the transcriptome. More recently, the discovery of microRNAs (miRNAs) and long non-coding RNAs (lncRNAs) has shed light on the importance of RNA structure to genome, transcriptome, and proteome regulation. Due to the relatively small number, high conservation, and importance of structural and catalytic RNAs to all life, much early work in RNA structure analysis mapped out a detailed view of these molecules. Computational and physical methods were used in concert with enzymatic and chemical structure probing to create high-resolution models of these fundamental biological molecules. However, the recent expansion in our knowledge of the importance of RNA structure to coding and regulatory RNAs has left the field in need of faster and scalable methods for high-throughput structural analysis. To address this, nuclease and chemical RNA structure probing methodologies have been adapted for genome-wide analysis. These methods have been deployed to globally characterize thousands of RNA structures in a single experiment. Here, we review these experimental methodologies for high-throughput RNA structure determination and discuss the insights gained from each approach.
Collapse
Affiliation(s)
- Ian M Silverman
- Department of Biology, University of Pennsylvania, Philadelphia, PA, 19104, USA.,Cell and Molecular Biology Graduate Group, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Nathan D Berkowitz
- Department of Biology, University of Pennsylvania, Philadelphia, PA, 19104, USA.,Genomics and Computational Biology Graduate Group, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Sager J Gosai
- Department of Biology, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Brian D Gregory
- Department of Biology, University of Pennsylvania, Philadelphia, PA, 19104, USA. .,Cell and Molecular Biology Graduate Group, University of Pennsylvania, Philadelphia, PA, 19104, USA. .,Genomics and Computational Biology Graduate Group, University of Pennsylvania, Philadelphia, PA, 19104, USA.
| |
Collapse
|
20
|
Ding Y, Kwok CK, Tang Y, Bevilacqua PC, Assmann SM. Genome-wide profiling of in vivo RNA structure at single-nucleotide resolution using structure-seq. Nat Protoc 2015; 10:1050-66. [DOI: 10.1038/nprot.2015.064] [Citation(s) in RCA: 75] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
21
|
Tang Y, Bouvier E, Kwok CK, Ding Y, Nekrutenko A, Bevilacqua PC, Assmann SM. StructureFold: genome-wide RNA secondary structure mapping and reconstruction in vivo. Bioinformatics 2015; 31:2668-75. [PMID: 25886980 DOI: 10.1093/bioinformatics/btv213] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2014] [Accepted: 04/12/2015] [Indexed: 01/06/2023] Open
Abstract
MOTIVATION RNAs fold into complex structures that are integral to the diverse mechanisms underlying RNA regulation of gene expression. Recent development of transcriptome-wide RNA structure profiling through the application of structure-probing enzymes or chemicals combined with high-throughput sequencing has opened a new field that greatly expands the amount of in vitro and in vivo RNA structural information available. The resultant datasets provide the opportunity to investigate RNA structural information on a global scale. However, the analysis of high-throughput RNA structure profiling data requires considerable computational effort and expertise. RESULTS We present a new platform, StructureFold, that provides an integrated computational solution designed specifically for large-scale RNA structure mapping and reconstruction across any transcriptome. StructureFold automates the processing and analysis of raw high-throughput RNA structure profiling data, allowing the seamless incorporation of wet-bench structural information from chemical probes and/or ribonucleases to restrain RNA secondary structure prediction via the RNAstructure and ViennaRNA package algorithms. StructureFold performs reads mapping and alignment, normalization and reactivity derivation, and RNA structure prediction in a single user-friendly web interface or via local installation. The variation in transcript abundance and length that prevails in living cells and consequently causes variation in the counts of structure-probing events between transcripts is accounted for. Accordingly, StructureFold is applicable to RNA structural profiling data obtained in vivo as well as to in vitro or in silico datasets. StructureFold is deployed via the Galaxy platform. AVAILABILITY AND IMPLEMENTATION StructureFold is freely available as a component of Galaxy available at: https://usegalaxy.org/. CONTACT yxt148@psu.edu or sma3@psu.edu SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Yin Tang
- Department of Biology, Center for RNA Molecular Biology, Bioinformatics and Genomics Graduate Program
| | - Emil Bouvier
- Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA 16802, USA, Galaxyproject.org, University Park, PA 16802, USA and Baltimore, MD 21218, USA
| | - Chun Kit Kwok
- Center for RNA Molecular Biology, Department of Chemistry and
| | - Yiliang Ding
- Department of Biology, Center for RNA Molecular Biology, Department of Chemistry and
| | - Anton Nekrutenko
- Bioinformatics and Genomics Graduate Program, Department of Biochemistry and Molecular Biology, Penn State University, University Park, PA 16802, USA, Galaxyproject.org, University Park, PA 16802, USA and Baltimore, MD 21218, USA
| | - Philip C Bevilacqua
- Center for RNA Molecular Biology, Department of Chemistry and Plant Biology Graduate Program, Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Sarah M Assmann
- Department of Biology, Center for RNA Molecular Biology, Bioinformatics and Genomics Graduate Program, Plant Biology Graduate Program, Pennsylvania State University, University Park, Pennsylvania 16802, USA
| |
Collapse
|
22
|
The RNA structurome: transcriptome-wide structure probing with next-generation sequencing. Trends Biochem Sci 2015; 40:221-32. [DOI: 10.1016/j.tibs.2015.02.005] [Citation(s) in RCA: 122] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2014] [Revised: 02/16/2015] [Accepted: 02/17/2015] [Indexed: 01/16/2023]
|
23
|
Abstract
The diverse roles of RNAs depend on their ability to fold so as to form biologically functional structures. Thus, understanding the function of a given RNA molecule often requires experimental analysis of its secondary structure by in vitro RNA probing, which is more accurate than using prediction programs only. This chapter presents in vitro RNA probing protocols that we routinely use, from RNA transcript production and purification to RNA structure determination using enzymatic (RNases T1, T2, and V1) and chemical (DMS, CMCT, kethoxal, and Pb(2+)) probing performed on both unlabeled and end-labeled RNAs.
Collapse
Affiliation(s)
- Jean-Vincent Philippe
- CNRS UMR 7365 IMoPA, Université de Lorraine, Biopôle, 9 avenue de la Forêt de Haye, Vandoeuvre-lès-Nancy, 54506, France
| | | | | | | |
Collapse
|