1
|
Monroy-Eklund A, Taylor C, Weidmann CA, Burch C, Laederach A. Structural analysis of MALAT1 long noncoding RNA in cells and in evolution. RNA (NEW YORK, N.Y.) 2023; 29:691-704. [PMID: 36792358 PMCID: PMC10159000 DOI: 10.1261/rna.079388.122] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Accepted: 02/02/2023] [Indexed: 05/06/2023]
Abstract
Although not canonically polyadenylated, the long noncoding RNA MALAT1 (metastasis-associated lung adenocarcinoma transcript 1) is stabilized by a highly conserved 76-nt triple helix structure on its 3' end. The entire MALAT1 transcript is over 8000 nt long in humans. The strongest structural conservation signal in MALAT1 (as measured by covariation of base pairs) is in the triple helix structure. Primary sequence analysis of covariation alone does not reveal the degree of structural conservation of the entire full-length transcript, however. Furthermore, RNA structure is often context dependent; RNA binding proteins that are differentially expressed in different cell types may alter structure. We investigate here the in-cell and cell-free structures of the full-length human and green monkey (Chlorocebus sabaeus) MALAT1 transcripts in multiple tissue-derived cell lines using SHAPE chemical probing. Our data reveal levels of uniform structural conservation in different cell lines, in cells and cell-free, and even between species, despite significant differences in primary sequence. The uniformity of the structural conservation across the entire transcript suggests that, despite seeing covariation signals only in the triple helix junction of the lncRNA, the rest of the transcript's structure is remarkably conserved, at least in primates and across multiple cell types and conditions.
Collapse
Affiliation(s)
- Anais Monroy-Eklund
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, USA
| | - Colin Taylor
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, USA
| | - Chase A Weidmann
- Department of Biological Chemistry, University of Michigan Medical School, Center for RNA Biomedicine, Rogel Cancer Center, Ann Arbor, Michigan 48109, USA
| | - Christina Burch
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, USA
| | - Alain Laederach
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, USA
| |
Collapse
|
2
|
RNA Secondary Structure Alteration Caused by Single Nucleotide Variants. Methods Mol Biol 2023; 2586:107-120. [PMID: 36705901 DOI: 10.1007/978-1-0716-2768-6_7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
A point mutation in coding RNA can cause not only an amino acid substitution but also a dynamic change of RNA secondary structure, leading to a dysfunctional RNA. Although in silico structure prediction has been used to detect structure-disrupting point mutations known as riboSNitches, exhaustive simulation of long RNAs is needed to detect a significant enrichment or depletion of riboSNitches in functional RNAs. Here, we have developed a novel algorithm Radiam (RNA secondary structure Analysis with Deletion, Insertion, And substitution Mutations) for a comprehensive riboSNitch analysis of long RNAs. Radiam is based on the ParasoR framework, which efficiently computes local RNA secondary structures for long RNAs. ParasoR can compute a variety of structure scores over globally consistent structures with maximal span constraints for the base pair distance. Using the reusable structure database made by ParasoR, Radiam performs an efficient recomputation of the secondary structures for mutated sequences. An exhaustive simulation of Radiam is expected to find reliable riboSNitch candidates on long RNAs by evaluating their statistical significance in terms of the change of local structure stability.
Collapse
|
3
|
Ferrero-Serrano Á, Sylvia MM, Forstmeier PC, Olson AJ, Ware D, Bevilacqua PC, Assmann SM. Experimental demonstration and pan-structurome prediction of climate-associated riboSNitches in Arabidopsis. Genome Biol 2022; 23:101. [PMID: 35440059 PMCID: PMC9017077 DOI: 10.1186/s13059-022-02656-4] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2021] [Accepted: 03/20/2022] [Indexed: 11/23/2022] Open
Abstract
Background Genome-wide association studies (GWAS) aim to correlate phenotypic changes with genotypic variation. Upon transcription, single nucleotide variants (SNVs) may alter mRNA structure, with potential impacts on transcript stability, macromolecular interactions, and translation. However, plant genomes have not been assessed for the presence of these structure-altering polymorphisms or “riboSNitches.” Results We experimentally demonstrate the presence of riboSNitches in transcripts of two Arabidopsis genes, ZINC RIBBON 3 (ZR3) and COTTON GOLGI-RELATED 3 (CGR3), which are associated with continentality and temperature variation in the natural environment. These riboSNitches are also associated with differences in the abundance of their respective transcripts, implying a role in regulating the gene's expression in adaptation to local climate conditions. We then computationally predict riboSNitches transcriptome-wide in mRNAs of 879 naturally inbred Arabidopsis accessions. We characterize correlations between SNPs/riboSNitches in these accessions and 434 climate descriptors of their local environments, suggesting a role of these variants in local adaptation. We integrate this information in CLIMtools V2.0 and provide a new web resource, T-CLIM, that reveals associations between transcript abundance variation and local environmental variation. Conclusion We functionally validate two plant riboSNitches and, for the first time, demonstrate riboSNitch conditionality dependent on temperature, coining the term “conditional riboSNitch.” We provide the first pan-genome-wide prediction of riboSNitches in plants. We expand our previous CLIMtools web resource with riboSNitch information and with 1868 additional Arabidopsis genomes and 269 additional climate conditions, which will greatly facilitate in silico studies of natural genetic variation, its phenotypic consequences, and its role in local adaptation. Supplementary Information The online version contains supplementary material available at 10.1186/s13059-022-02656-4.
Collapse
Affiliation(s)
- Ángel Ferrero-Serrano
- Department of Biology, Pennsylvania State University, University Park, State College, PA, 16802, USA.
| | - Megan M Sylvia
- Department of Biology, Pennsylvania State University, University Park, State College, PA, 16802, USA
| | - Peter C Forstmeier
- Department of Biochemistry, Microbiology, and Molecular Biology, Pennsylvania State University, University Park, State College, PA, 16802, USA
| | - Andrew J Olson
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
| | - Doreen Ware
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA.,USDA ARS NAA Robert W. Holley Center for Agriculture and Health, Ithaca, NY, 14853, USA
| | - Philip C Bevilacqua
- Department of Biochemistry, Microbiology, and Molecular Biology, Pennsylvania State University, University Park, State College, PA, 16802, USA.,Department of Chemistry, Pennsylvania State University, University Park, State College, PA, 16802, USA.,Center for RNA Molecular Biology, Pennsylvania State University, University Park, State College, PA, 16802, USA
| | - Sarah M Assmann
- Department of Biology, Pennsylvania State University, University Park, State College, PA, 16802, USA. .,Center for RNA Molecular Biology, Pennsylvania State University, University Park, State College, PA, 16802, USA.
| |
Collapse
|
4
|
Martin SE, Gan H, Toomer G, Sridhar N, Sztuba-Solinska J. The m 6A landscape of polyadenylated nuclear (PAN) RNA and its related methylome in the context of KSHV replication. RNA (NEW YORK, N.Y.) 2021; 27:1102-1125. [PMID: 34187903 PMCID: PMC8370742 DOI: 10.1261/rna.078777.121] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Accepted: 06/25/2021] [Indexed: 05/10/2023]
Abstract
Polyadenylated nuclear (PAN) RNA is a long noncoding transcript involved in Kaposi's sarcoma-associated herpesvirus (KSHV) lytic reactivation and regulation of cellular and viral gene expression. We have previously shown that PAN RNA has dynamic secondary structure and protein binding profiles that can be influenced by epitranscriptomic modifications. N6-methyladenosine (m6A) is one of the most abundant chemical signatures found in viral RNA genomes and virus-encoded RNAs. Here, we combined antibody-independent next-generation mapping with direct RNA sequencing to address the epitranscriptomic status of PAN RNA in KSHV infected cells. We showed that PAN m6A status is dynamic, reaching the highest number of modifications at the late lytic stages of KSHV infection. Using a newly developed method, termed selenium-modified deoxythymidine triphosphate (SedTTP)-reverse transcription (RT) and ligation assisted PCR analysis of m6A (SLAP), we gained insight into the fraction of modification at identified sites. By applying comprehensive proteomic approaches, we identified writers and erasers that regulate the m6A status of PAN, and readers that can convey PAN m6A phenotypic effects. We verified the temporal and spatial subcellular availability of the methylome components for PAN modification by performing confocal microscopy analysis. Additionally, the RNA biochemical probing (SHAPE-MaP) outlined local and global structural alterations invoked by m6A in the context of full-length PAN RNA. This work represents the first comprehensive overview of the dynamic interplay that takes place between the cellular epitranscriptomic machinery and a specific viral RNA in the context of KSHV infected cells.
Collapse
MESH Headings
- Adenosine/analogs & derivatives
- Adenosine/genetics
- Adenosine/metabolism
- Alpha-Ketoglutarate-Dependent Dioxygenase FTO/genetics
- Alpha-Ketoglutarate-Dependent Dioxygenase FTO/metabolism
- Base Pairing
- Base Sequence
- Cell Line, Tumor
- Endonucleases/genetics
- Endonucleases/metabolism
- Epigenesis, Genetic
- Herpesvirus 8, Human/genetics
- Herpesvirus 8, Human/metabolism
- Heterogeneous-Nuclear Ribonucleoprotein Group C/genetics
- Heterogeneous-Nuclear Ribonucleoprotein Group C/metabolism
- Host-Pathogen Interactions/genetics
- Humans
- Lymphocytes/metabolism
- Lymphocytes/virology
- Methylation
- Methyltransferases/genetics
- Methyltransferases/metabolism
- Nucleic Acid Conformation
- RNA, Long Noncoding/genetics
- RNA, Long Noncoding/metabolism
- RNA, Messenger/genetics
- RNA, Messenger/metabolism
- RNA, Nuclear/genetics
- RNA, Nuclear/metabolism
- RNA-Binding Proteins/genetics
- RNA-Binding Proteins/metabolism
- Reverse Transcription
- Sequence Analysis, RNA
- Transcriptome
Collapse
Affiliation(s)
| | - Huachen Gan
- Department of Biological Sciences, Auburn University, Auburn, Alabama 36849, USA
| | - Gabriela Toomer
- Department of Biological Sciences, Auburn University, Auburn, Alabama 36849, USA
| | - Nikitha Sridhar
- Department of Biological Sciences, Auburn University, Auburn, Alabama 36849, USA
| | | |
Collapse
|
5
|
Marangio P, Law KYT, Sanguinetti G, Granneman S. diffBUM-HMM: a robust statistical modeling approach for detecting RNA flexibility changes in high-throughput structure probing data. Genome Biol 2021; 22:165. [PMID: 34044851 PMCID: PMC8157727 DOI: 10.1186/s13059-021-02379-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2020] [Accepted: 05/10/2021] [Indexed: 11/21/2022] Open
Abstract
Advancing RNA structural probing techniques with next-generation sequencing has generated demands for complementary computational tools to robustly extract RNA structural information amidst sampling noise and variability. We present diffBUM-HMM, a noise-aware model that enables accurate detection of RNA flexibility and conformational changes from high-throughput RNA structure-probing data. diffBUM-HMM is widely compatible, accounting for sampling variation and sequence coverage biases, and displays higher sensitivity than existing methods while robust against false positives. Our analyses of datasets generated with a variety of RNA probing chemistries demonstrate the value of diffBUM-HMM for quantitatively detecting RNA structural changes and RNA-binding protein binding sites.
Collapse
Affiliation(s)
- Paolo Marangio
- School of Informatics, The University of Edinburgh, Edinburgh, UK
- SISSA Data Science Excellence Department Initiative, Trieste, Italy
| | - Ka Ying Toby Law
- Centre for Synthetic and Systems Biology, The University of Edinburgh, Edinburgh, UK
| | - Guido Sanguinetti
- Centre for Synthetic and Systems Biology, The University of Edinburgh, Edinburgh, UK.
- School of Informatics, The University of Edinburgh, Edinburgh, UK.
- SISSA Data Science Excellence Department Initiative, Trieste, Italy.
| | - Sander Granneman
- Centre for Synthetic and Systems Biology, The University of Edinburgh, Edinburgh, UK.
| |
Collapse
|
6
|
Lin J, Chen Y, Zhang Y, Ouyang Z. Identification and analysis of RNA structural disruptions induced by single nucleotide variants using Riprap and RiboSNitchDB. NAR Genom Bioinform 2020; 2:lqaa057. [PMID: 33575608 PMCID: PMC7671322 DOI: 10.1093/nargab/lqaa057] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2019] [Revised: 06/22/2020] [Accepted: 08/11/2020] [Indexed: 11/27/2022] Open
Abstract
RNA conformational alteration has significant impacts on cellular processes and phenotypic variations. An emerging genetic factor of RNA conformational alteration is a new class of single nucleotide variant (SNV) named riboSNitch. RiboSNitches have been demonstrated to be involved in many genetic diseases. However, identifying riboSNitches is notably difficult as the signals of RNA structural disruption are often subtle. Here, we introduce a novel computational framework–RIboSNitch Predictor based on Robust Analysis of Pairing probabilities (Riprap). Riprap identifies structurally disrupted regions around any given SNVs based on robust analysis of local structural configurations between wild-type and mutant RNA sequences. Compared to previous approaches, Riprap shows higher accuracy when assessed on hundreds of known riboSNitches captured by various experimental RNA structure probing methods including the parallel analysis of RNA structure (PARS) and the selective 2′-hydroxyl acylation analyzed by primer extension (SHAPE). Further, Riprap detects the experimentally validated riboSNitch that regulates human catechol-O-methyltransferase haplotypes and outputs structurally disrupted regions precisely at base resolution. Riprap provides a new approach to interpreting disease-related genetic variants. In addition, we construct a database (RiboSNitchDB) that includes the annotation and visualization of all presented riboSNitches in this study as well as 24 629 predicted riboSNitches from human expression quantitative trait loci.
Collapse
Affiliation(s)
- Jianan Lin
- Department of Biostatistics and Epidemiology, School of Public Health and Health Sciences, University of Massachusetts, Amherst, MA 01003, USA
| | - Yang Chen
- Department of Biostatistics and Epidemiology, School of Public Health and Health Sciences, University of Massachusetts, Amherst, MA 01003, USA
| | - Yuping Zhang
- Department of Statistics, University of Connecticut, Storrs, CT 06269, USA
| | - Zhengqing Ouyang
- Department of Biostatistics and Epidemiology, School of Public Health and Health Sciences, University of Massachusetts, Amherst, MA 01003, USA
| |
Collapse
|
7
|
Xue AY, Yu AM, Lucks JB, Bagheri N. DUETT quantitatively identifies known and novel events in nascent RNA structural dynamics from chemical probing data. Bioinformatics 2019; 35:5103-5112. [PMID: 31389563 PMCID: PMC6954663 DOI: 10.1093/bioinformatics/btz449] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2018] [Revised: 04/29/2019] [Accepted: 07/19/2019] [Indexed: 01/23/2023] Open
Abstract
MOTIVATION RNA molecules can undergo complex structural dynamics, especially during transcription, which influence their biological functions. Recently developed high-throughput chemical probing experiments that study RNA cotranscriptional folding generate nucleotide-resolution 'reactivities' for each length of a growing nascent RNA that reflect structural dynamics. However, the manual annotation and qualitative interpretation of reactivity across these large datasets can be nuanced, laborious, and difficult for new practitioners. We developed a quantitative and systematic approach to automatically detect RNA folding events from these datasets to reduce human bias/error, standardize event discovery and generate hypotheses about RNA folding trajectories for further analysis and experimental validation. RESULTS Detection of Unknown Events with Tunable Thresholds (DUETT) identifies RNA structural transitions in cotranscriptional RNA chemical probing datasets. DUETT employs a feedback control-inspired method and a linear regression approach and relies on interpretable and independently tunable parameter thresholds to match qualitative user expectations with quantitatively identified folding events. We validate the approach by identifying known RNA structural transitions within the cotranscriptional folding pathways of the Escherichia coli signal recognition particle RNA and the Bacillus cereus crcB fluoride riboswitch. We identify previously overlooked features of these datasets such as heightened reactivity patterns in the signal recognition particle RNA about 12 nt lengths before base-pair rearrangement. We then apply a sensitivity analysis to identify tradeoffs when choosing parameter thresholds. Finally, we show that DUETT is tunable across a wide range of contexts, enabling flexible application to study broad classes of RNA folding mechanisms. AVAILABILITY AND IMPLEMENTATION https://github.com/BagheriLab/DUETT. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Albert Y Xue
- Department of Chemical & Biological Engineering, Northwestern University, Evanston, IL, USA
- Center for Synthetic Biology, Northwestern University, Evanston IL, USA
- Northwestern Institute on Complex Systems, Northwestern University, Evanston, IL, USA
| | - Angela M Yu
- Center for Synthetic Biology, Northwestern University, Evanston IL, USA
- Tri-Institutional Training Program in Computational Biology and Medicine, Weill Cornell Medicine, New York, NY, USA
| | - Julius B Lucks
- Department of Chemical & Biological Engineering, Northwestern University, Evanston, IL, USA
- Center for Synthetic Biology, Northwestern University, Evanston IL, USA
- Chemistry of Life Processes Institute, Northwestern University, Evanston, IL, USA
| | - Neda Bagheri
- Department of Chemical & Biological Engineering, Northwestern University, Evanston, IL, USA
- Center for Synthetic Biology, Northwestern University, Evanston IL, USA
- Northwestern Institute on Complex Systems, Northwestern University, Evanston, IL, USA
- Chemistry of Life Processes Institute, Northwestern University, Evanston, IL, USA
| |
Collapse
|
8
|
Freidhoff P, Bruist MF. In silico survey of the central conserved regions in viroids of the Pospiviroidae family for conserved asymmetric loop structures. RNA (NEW YORK, N.Y.) 2019; 25:985-1003. [PMID: 31123078 PMCID: PMC6633198 DOI: 10.1261/rna.070409.119] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/17/2019] [Accepted: 05/14/2019] [Indexed: 06/01/2023]
Abstract
Viroids are the smallest replicative pathogens, consisting of RNA circles (∼300 nucleotides) that require host machinery to replicate. Structural RNA elements recruit these host factors. Currently, many of these structural elements and the nature of their interactions are unknown. All Pospiviroidae have homology in the central conserved region (CCR). The CCR of potato spindle tuber viroid (PSTVd) contains a sarcin/ricin domain (SRD), the only viroid structural element with an unequivocal replication role. We assumed that every member of this family uses this region to recruit host factors, and that each CCR has an SRD-like asymmetric loop within it. Potential SRD or SRD-like motifs were sought in the CCR of each Pospiviroidae member as follows. Motif location in each CCR was predicted with MUSCLE alignment and Vienna RNAfold. Viroid-specific models of SRD-like motifs were built by superimposing noncanonical base pairs and nucleotides on a model of an SRD. The RNA geometry search engine FR3D was then used to find nucleotide groups close to the geometry suggested by this superimposition. Atomic resolution structures were assembled using the molecular visualization program Chimera, and the stability of each motif was assessed with molecular dynamics (MD). Some models required a protonated cytosine. To be stable within a cell, the pKa of that cytosine must be shifted up. Constant pH-replica exchange MD analysis showed such a shift in the proposed structures. These data show that every Pospiviroidae member could form a motif that resembles an SRD in its CCR, and imply there could be undiscovered mimics of other RNA domains.
Collapse
Affiliation(s)
- Paul Freidhoff
- Department of Chemistry and Biochemistry, University of the Sciences, Philadelphia, Pennsylvania 19104, USA
| | - Michael F Bruist
- Department of Chemistry and Biochemistry, University of the Sciences, Philadelphia, Pennsylvania 19104, USA
| |
Collapse
|
9
|
He F, Wei R, Zhou Z, Huang L, Wang Y, Tang J, Zou Y, Shi L, Gu X, Davis MJ, Su Z. Integrative Analysis of Somatic Mutations in Non-coding Regions Altering RNA Secondary Structures in Cancer Genomes. Sci Rep 2019; 9:8205. [PMID: 31160636 PMCID: PMC6546760 DOI: 10.1038/s41598-019-44489-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2018] [Accepted: 05/17/2019] [Indexed: 01/01/2023] Open
Abstract
RNA secondary structure may influence many cellular processes, including RNA processing, stability, localization, and translation. Single-nucleotide variations (SNVs) that alter RNA secondary structure, referred to as riboSNitches, are potentially causative of human diseases, especially in untranslated regions (UTRs) and noncoding RNAs (ncRNAs). The functions of somatic mutations that act as riboSNitches in cancer development remain poorly understood. In this study, we developed a computational pipeline called SNIPER (riboSNitch-enriched or depleted elements in cancer genomes), which employs MeanDiff and EucDiff to detect riboSNitches and then identifies riboSNitch-enriched or riboSNitch-depleted non-coding elements across tumors. SNIPER is available at github: https://github.com/suzhixi/SNIPER/. We found that riboSNitches were more likely to be pathogenic. Moreover, we predicted several UTRs and lncRNAs (long non-coding RNA) that significantly enriched or depleted riboSNitches in cancer genomes, indicative of potential cancer driver or essential noncoding elements. Our study highlights the possibly neglected importance of RNA secondary structure in cancer genomes and provides a new strategy to identify new cancer-associated genes.
Collapse
Affiliation(s)
- Funan He
- Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200433, China
| | - Ran Wei
- Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200433, China
| | - Zhan Zhou
- Institute of Drug Metabolism and Pharmaceutical Analysis and Zhejiang Provincial Key Laboratory of Anti-Cancer Drug Research, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Leihuan Huang
- Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200433, China
| | - Yinan Wang
- Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200433, China
| | - Jie Tang
- Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200433, China
| | - Yangyun Zou
- Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200433, China
| | - Leming Shi
- Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200433, China.,Shanghai Cancer Center and Cancer Institute, Fudan University, Shanghai, 200032, China
| | - Xun Gu
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, Iowa, 50011, USA
| | - Melissa J Davis
- Bioinformatics Division, Walter and Eliza Hall Institute of Medical Research, 1G Royal Parade, Parkville, VIC, 3052, Australia
| | - Zhixi Su
- Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200433, China. .,Singlera Genomics Inc, Shanghai, China.
| |
Collapse
|
10
|
Choudhary K, Lai YH, Tran EJ, Aviran S. dStruct: identifying differentially reactive regions from RNA structurome profiling data. Genome Biol 2019; 20:40. [PMID: 30791935 PMCID: PMC6385470 DOI: 10.1186/s13059-019-1641-3] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2018] [Accepted: 01/24/2019] [Indexed: 12/16/2022] Open
Abstract
RNA biology is revolutionized by recent developments of diverse high-throughput technologies for transcriptome-wide profiling of molecular RNA structures. RNA structurome profiling data can be used to identify differentially structured regions between groups of samples. Existing methods are limited in scope to specific technologies and/or do not account for biological variation. Here, we present dStruct which is the first broadly applicable method for differential analysis accounting for biological variation in structurome profiling data. dStruct is compatible with diverse profiling technologies, is validated with experimental data and simulations, and outperforms existing methods.
Collapse
Affiliation(s)
- Krishna Choudhary
- Department of Biomedical Engineering and Genome Center, University of California, Davis, One Shields Avenue, Davis, 95616 CA USA
| | - Yu-Hsuan Lai
- Department of Biochemistry, Purdue University, BCHM 305, 175 S. University Street, West Lafayette, 47907-2063 IN USA
| | - Elizabeth J. Tran
- Department of Biochemistry, Purdue University, BCHM 305, 175 S. University Street, West Lafayette, 47907-2063 IN USA
- Purdue University Center for Cancer Research, Purdue University, Hansen Life Sciences Research Building, Room 141, 201 S. University Street, West Lafayette, 47907-2064 IN USA
| | - Sharon Aviran
- Department of Biomedical Engineering and Genome Center, University of California, Davis, One Shields Avenue, Davis, 95616 CA USA
| |
Collapse
|
11
|
Hurst T, Xu X, Zhao P, Chen SJ. Quantitative Understanding of SHAPE Mechanism from RNA Structure and Dynamics Analysis. J Phys Chem B 2018; 122:4771-4783. [PMID: 29659274 DOI: 10.1021/acs.jpcb.8b00575] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
The selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) method probes RNA local structural and dynamic information at single nucleotide resolution. To gain quantitative insights into the relationship between nucleotide flexibility, RNA 3D structure, and SHAPE reactivity, we develop a 3D Structure-SHAPE Relationship model (3DSSR) to rebuild SHAPE profiles from 3D structures. The model starts from RNA structures and combines nucleotide interaction strength and conformational propensity, ligand (SHAPE reagent) accessibility, and base-pairing pattern through a composite function to quantify the correlation between SHAPE reactivity and nucleotide conformational stability. The 3DSSR model shows the relationship between SHAPE reactivity and RNA structure and energetics. Comparisons between the 3DSSR-predicted SHAPE profile and the experimental SHAPE data show correlation, suggesting that the extracted analytical function may have captured the key factors that determine the SHAPE reactivity profile. Furthermore, the theory offers an effective method to sieve RNA 3D models and exclude models that are incompatible with experimental SHAPE data.
Collapse
Affiliation(s)
- Travis Hurst
- Department of Physics, Department of Biochemistry , and University of Missouri Informatics Institute , University of Missouri , Columbia , Missouri 65211 , United States
| | - Xiaojun Xu
- Department of Physics, Department of Biochemistry , and University of Missouri Informatics Institute , University of Missouri , Columbia , Missouri 65211 , United States
| | - Peinan Zhao
- Department of Physics, Department of Biochemistry , and University of Missouri Informatics Institute , University of Missouri , Columbia , Missouri 65211 , United States
| | - Shi-Jie Chen
- Department of Physics, Department of Biochemistry , and University of Missouri Informatics Institute , University of Missouri , Columbia , Missouri 65211 , United States
| |
Collapse
|
12
|
Lackey L, Coria A, Woods C, McArthur E, Laederach A. Allele-specific SHAPE-MaP assessment of the effects of somatic variation and protein binding on mRNA structure. RNA (NEW YORK, N.Y.) 2018; 24:513-528. [PMID: 29317542 PMCID: PMC5855952 DOI: 10.1261/rna.064469.117] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/06/2017] [Accepted: 01/04/2018] [Indexed: 05/22/2023]
Abstract
The impact of inherited and somatic mutations on messenger RNA (mRNA) structure remains poorly understood. Recent technological advances that leverage next-generation sequencing to obtain experimental structure data, such as SHAPE-MaP, can reveal structural effects of mutations, especially when these data are incorporated into structure modeling. Here, we analyze the ability of SHAPE-MaP to detect the relatively subtle structural changes caused by single-nucleotide mutations. We find that allele-specific sorting greatly improved our detection ability. Thus, we used SHAPE-MaP with a novel combination of clone-free robotic mutagenesis and allele-specific sorting to perform a rapid, comprehensive survey of noncoding somatic and inherited riboSNitches in two cancer-associated mRNAs, TPT1 and LCP1 Using rigorous thermodynamic modeling of the Boltzmann suboptimal ensemble, we identified a subset of mutations that change TPT1 and LCP1 RNA structure, with approximately 14% of all variants identified as riboSNitches. To confirm that these in vitro structures were biologically relevant, we tested how dependent TPT1 and LCP1 mRNA structures were on their environments. We performed SHAPE-MaP on TPT1 and LCP1 mRNAs in the presence or absence of cellular proteins and found that both mRNAs have similar overall folds in all conditions. RiboSNitches identified within these mRNAs in vitro likely exist under biological conditions. Overall, these data reveal a robust mRNA structural landscape where differences in environmental conditions and most sequence variants do not significantly alter RNA structural ensembles. Finally, predicting riboSNitches in mRNAs from sequence alone remains particularly challenging; these data will provide the community with benchmarks for further algorithmic development.
Collapse
Affiliation(s)
- Lela Lackey
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| | - Aaztli Coria
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| | - Chanin Woods
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| | - Evonne McArthur
- School of Medicine, Vanderbilt University, Nashville, Tennessee 37232, USA
| | - Alain Laederach
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| |
Collapse
|
13
|
Lin L, McKerrow WH, Richards B, Phonsom C, Lawrence CE. Characterization and visualization of RNA secondary structure Boltzmann ensemble via information theory. BMC Bioinformatics 2018; 19:82. [PMID: 29506466 PMCID: PMC5836418 DOI: 10.1186/s12859-018-2078-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2017] [Accepted: 02/20/2018] [Indexed: 12/26/2022] Open
Abstract
Background The nearest neighbor model and associated dynamic programming algorithms allow for the efficient estimation of the RNA secondary structure Boltzmann ensemble. However because a given RNA secondary structure only contains a fraction of the possible helices that could form from a given sequence, the Boltzmann ensemble is multimodal. Several methods exist for clustering structures and finding those modes. However less focus is given to exploring the underlying reasons for this multimodality: the presence of conflicting basepairs. Information theory, or more specifically mutual information, provides a method to identify those basepairs that are key to the secondary structure. Results To this end we find most informative basepairs and visualize the effect of these basepairs on the secondary structure. Knowing whether a most informative basepair is present tells us not only the status of the particular pair but also provides a large amount of information about which other pairs are present or not present. We find that a few basepairs account for a large amount of the structural uncertainty. The identification of these pairs indicates small changes to sequence or stability that will have a large effect on structure. Conclusion We provide a novel algorithm that uses mutual information to identify the key basepairs that lead to a multimodal Boltzmann distribution. We then visualize the effect of these pairs on the overall Boltzmann ensemble.
Collapse
Affiliation(s)
- Luan Lin
- Center for Devices and Radiological Health, U.S. Food and Drug Administration, Silver Spring, 20993, MD, USA
| | - Wilson H McKerrow
- Division of Applied Mathematics, Brown University, Providence, 02912, RI, USA
| | | | - Chukiat Phonsom
- Department of Mathematics, University of Southern California, Los Angeles, 90089, CA, USA
| | - Charles E Lawrence
- Division of Applied Mathematics, Brown University, Providence, 02912, RI, USA.
| |
Collapse
|