1
|
Kang J, Wei S, Jia Z, Ma Y, Chen H, Sun C, Xu J, Tao J, Dong Y, Lv W, Tian H, Guo X, Bi S, Zhang C, Jiang Y, Lv H, Zhang M. Effects of genetic variation on the structure of RNA and protein. Proteomics 2024; 24:e2300235. [PMID: 38197532 DOI: 10.1002/pmic.202300235] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Revised: 12/15/2023] [Accepted: 12/19/2023] [Indexed: 01/11/2024]
Abstract
Changes in the structure of RNA and protein, have an important impact on biological functions and are even important determinants of disease pathogenesis and treatment. Some genetic variations, including copy number variation, single nucleotide variation, and so on, can lead to changes in biological function and increased susceptibility to certain diseases by changing the structure of RNA or protein. With the development of structural biology and sequencing technology, a large amount of RNA and protein structure data and genetic variation data resources has emerged to be used to explain biological processes. Here, we reviewed the effects of genetic variation on the structure of RNAs and proteins, and investigated their impact on several diseases. An online resource (http://www.onethird-lab.com/gems/) to support convenient retrieval of common tools is also built. Finally, the challenges and future development of the effects of genetic variation on RNA and protein were discussed.
Collapse
Affiliation(s)
- Jingxuan Kang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
- The Epigenome-Wide Association Study Project, Harbin, China
| | - Siyu Wei
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
- The Epigenome-Wide Association Study Project, Harbin, China
| | - Zhe Jia
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
- The Epigenome-Wide Association Study Project, Harbin, China
| | - Yingnan Ma
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
- The Epigenome-Wide Association Study Project, Harbin, China
| | - Haiyan Chen
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
- The Epigenome-Wide Association Study Project, Harbin, China
| | - Chen Sun
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
- The Epigenome-Wide Association Study Project, Harbin, China
| | - Jing Xu
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
- The Epigenome-Wide Association Study Project, Harbin, China
| | - Junxian Tao
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
- The Epigenome-Wide Association Study Project, Harbin, China
| | - Yu Dong
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
- The Epigenome-Wide Association Study Project, Harbin, China
| | - Wenhua Lv
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Hongsheng Tian
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Xuying Guo
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Shuo Bi
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Chen Zhang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Yongshuai Jiang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
- The Epigenome-Wide Association Study Project, Harbin, China
| | - Hongchao Lv
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
- The Epigenome-Wide Association Study Project, Harbin, China
| | - Mingming Zhang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
- The Epigenome-Wide Association Study Project, Harbin, China
| |
Collapse
|
2
|
Xu Q, Bao X, Lin Z, Tang L, He LN, Ren J, Zuo Z, Hu K. AStruct: detection of allele-specific RNA secondary structure in structuromic probing data. BMC Bioinformatics 2024; 25:91. [PMID: 38429654 PMCID: PMC11264973 DOI: 10.1186/s12859-024-05704-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2023] [Accepted: 02/14/2024] [Indexed: 03/03/2024] Open
Abstract
BACKGROUND Uncovering functional genetic variants from an allele-specific perspective is of paramount importance in advancing our understanding of gene regulation and genetic diseases. Recently, various allele-specific events, such as allele-specific gene expression, allele-specific methylation, and allele-specific binding, have been explored on a genome-wide scale due to the development of high-throughput sequencing methods. RNA secondary structure, which plays a crucial role in multiple RNA-associated processes like RNA modification, translation and splicing, has emerged as an essential focus of relevant research. However, tools to identify genetic variants associated with allele-specific RNA secondary structures are still lacking. RESULTS Here, we develop a computational tool called 'AStruct' that enables us to detect allele-specific RNA secondary structure (ASRS) from RT-stop based structuromic probing data. AStruct shows robust performance in both simulated datasets and public icSHAPE datasets. We reveal that single nucleotide polymorphisms (SNPs) with higher AStruct scores are enriched in coding regions and tend to be functional. These SNPs are highly conservative, have the potential to disrupt sites involved in m6A modification or protein binding, and are frequently associated with disease. CONCLUSIONS AStruct is a tool dedicated to invoke allele-specific RNA secondary structure events at heterozygous SNPs in RT-stop based structuromic probing data. It utilizes allelic variants, base pairing and RT-stop information under different cell conditions to detect dynamic and functional ASRS. Compared to sequence-based tools, AStruct considers dynamic cell conditions and outperforms in detecting functional variants. AStruct is implemented in JAVA and is freely accessible at: https://github.com/canceromics/AStruct .
Collapse
Affiliation(s)
- Qingru Xu
- State Key Laboratory of Oncology in South China, Cancer Center, Collaborative Innovation Center for Cancer Medicine, School of Life Sciences, Sun Yat-sen University, Guangzhou, 510060, China
- Department of Data Science, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Xiaoqiong Bao
- State Key Laboratory of Oncology in South China, Cancer Center, Collaborative Innovation Center for Cancer Medicine, School of Life Sciences, Sun Yat-sen University, Guangzhou, 510060, China
| | - Zhuobin Lin
- Guangdong Key Laboratory of Liver Disease Research, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, China
| | - Lin Tang
- State Key Laboratory of Oncology in South China, Cancer Center, Collaborative Innovation Center for Cancer Medicine, School of Life Sciences, Sun Yat-sen University, Guangzhou, 510060, China
| | - Li-Na He
- State Key Laboratory of Oncology in South China, Cancer Center, Collaborative Innovation Center for Cancer Medicine, School of Life Sciences, Sun Yat-sen University, Guangzhou, 510060, China
| | - Jian Ren
- State Key Laboratory of Oncology in South China, Cancer Center, Collaborative Innovation Center for Cancer Medicine, School of Life Sciences, Sun Yat-sen University, Guangzhou, 510060, China
| | - Zhixiang Zuo
- State Key Laboratory of Oncology in South China, Cancer Center, Collaborative Innovation Center for Cancer Medicine, School of Life Sciences, Sun Yat-sen University, Guangzhou, 510060, China.
| | - Kunhua Hu
- Guangdong Key Laboratory of Liver Disease Research, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, China.
| |
Collapse
|
3
|
Waldern JM, Kumar J, Laederach A. Disease-associated human genetic variation through the lens of precursor and mature RNA structure. Hum Genet 2022; 141:1659-1672. [PMID: 34741198 PMCID: PMC9072596 DOI: 10.1007/s00439-021-02395-9] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Accepted: 10/26/2021] [Indexed: 12/14/2022]
Abstract
Disease-associated variants (DAVs) are commonly considered either through a genomic lens that describes variant function at the DNA level, or at the protein function level if the variant is translated. Although the genomic and proteomic effects of variation are well-characterized, genetic variants disrupting post-transcriptional regulation is another mechanism of disease that remains understudied. Specific RNA sequence motifs mediate post-transcriptional regulation both in the nucleus and cytoplasm of eukaryotic cells, often by binding to RNA-binding proteins or other RNAs. However, many DAVs map far from these motifs, which suggests deeper layers of post-transcriptional mechanistic control. Here, we consider a transcriptomic framework to outline the importance of post-transcriptional regulation as a mechanism of disease-causing single-nucleotide variation in the human genome. We first describe the composition of the human transcriptome and the importance of abundant yet overlooked components such as introns and untranslated regions (UTRs) of messenger RNAs (mRNAs). We present an analysis of Human Gene Mutation Database variants mapping to mRNAs and examine the distribution of causative disease-associated variation across the transcriptome. Although our analysis confirms the importance of post-transcriptional regulatory motifs, a majority of DAVs do not directly map to known regulatory motifs. Therefore, we review evidence that regions outside these well-characterized motifs can regulate function by RNA structure-mediated mechanisms in all four elements of an mRNA: exons, introns, 5' and 3' UTRs. To this end, we review published examples of riboSNitches, which are single-nucleotide variants that result in a change in RNA structure that is causative of the disease phenotype. In this review, we present the current state of knowledge of how DAVs act at the transcriptome level, both through altering post-transcriptional regulatory motifs and by the effects of RNA structure.
Collapse
Affiliation(s)
- Justin M Waldern
- Department of Biology, University of North Carolina, Chapel Hill, NC, 27599, USA
| | - Jayashree Kumar
- Department of Biology, University of North Carolina, Chapel Hill, NC, 27599, USA
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Alain Laederach
- Department of Biology, University of North Carolina, Chapel Hill, NC, 27599, USA.
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA.
| |
Collapse
|
4
|
G R, Mitra A, Pk V. Predicting functional riboSNitches in the context of alternative splicing. Gene X 2022; 837:146694. [PMID: 35738445 DOI: 10.1016/j.gene.2022.146694] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Revised: 05/11/2022] [Accepted: 06/17/2022] [Indexed: 11/19/2022] Open
Abstract
RNAs are the major regulators of gene expression, and their secondary structures play crucial roles at different levels. RiboSNitches are disease-associated SNPs that cause changes in the pre-mRNA secondary structural ensemble. Several riboSNitches have been detected in the 5' and 3' untranslated regions and lncRNA. Although cases of secondary structural elements playing a regulatory role in alternative splicing are known, regions specific to splicing events, such as splice junctions have not received much attention. We tested splice-site mutations for their efficiency in disrupting the secondary structure and hypothesized that these could play a crucial role in alternative splicing. Multiple riboSNitch prediction methods were applied to obtain overlapping results that are potentially more reliable. Putative riboSNitches were identified from aberrant 5' and 3' splice site mutations, cancer-causing somatic mutations, and genes that harbor the regulatory RNA secondary structural elements. Our workflow for predicting riboSNitches associated with alternative splicing is novel and paves the way for subsequent experimental validation.
Collapse
Affiliation(s)
- Ramya G
- Center for Computational Natural Sciences and Bioinformatics, International Institute of Information Technology, Gachibowli, Hyderabad, Telangana 500032, India.
| | - Abhijit Mitra
- Center for Computational Natural Sciences and Bioinformatics, International Institute of Information Technology, Gachibowli, Hyderabad, Telangana 500032, India.
| | - Vinod Pk
- Center for Computational Natural Sciences and Bioinformatics, International Institute of Information Technology, Gachibowli, Hyderabad, Telangana 500032, India.
| |
Collapse
|
5
|
Ferrero-Serrano Á, Sylvia MM, Forstmeier PC, Olson AJ, Ware D, Bevilacqua PC, Assmann SM. Experimental demonstration and pan-structurome prediction of climate-associated riboSNitches in Arabidopsis. Genome Biol 2022; 23:101. [PMID: 35440059 PMCID: PMC9017077 DOI: 10.1186/s13059-022-02656-4] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2021] [Accepted: 03/20/2022] [Indexed: 11/23/2022] Open
Abstract
Background Genome-wide association studies (GWAS) aim to correlate phenotypic changes with genotypic variation. Upon transcription, single nucleotide variants (SNVs) may alter mRNA structure, with potential impacts on transcript stability, macromolecular interactions, and translation. However, plant genomes have not been assessed for the presence of these structure-altering polymorphisms or “riboSNitches.” Results We experimentally demonstrate the presence of riboSNitches in transcripts of two Arabidopsis genes, ZINC RIBBON 3 (ZR3) and COTTON GOLGI-RELATED 3 (CGR3), which are associated with continentality and temperature variation in the natural environment. These riboSNitches are also associated with differences in the abundance of their respective transcripts, implying a role in regulating the gene's expression in adaptation to local climate conditions. We then computationally predict riboSNitches transcriptome-wide in mRNAs of 879 naturally inbred Arabidopsis accessions. We characterize correlations between SNPs/riboSNitches in these accessions and 434 climate descriptors of their local environments, suggesting a role of these variants in local adaptation. We integrate this information in CLIMtools V2.0 and provide a new web resource, T-CLIM, that reveals associations between transcript abundance variation and local environmental variation. Conclusion We functionally validate two plant riboSNitches and, for the first time, demonstrate riboSNitch conditionality dependent on temperature, coining the term “conditional riboSNitch.” We provide the first pan-genome-wide prediction of riboSNitches in plants. We expand our previous CLIMtools web resource with riboSNitch information and with 1868 additional Arabidopsis genomes and 269 additional climate conditions, which will greatly facilitate in silico studies of natural genetic variation, its phenotypic consequences, and its role in local adaptation. Supplementary Information The online version contains supplementary material available at 10.1186/s13059-022-02656-4.
Collapse
Affiliation(s)
- Ángel Ferrero-Serrano
- Department of Biology, Pennsylvania State University, University Park, State College, PA, 16802, USA.
| | - Megan M Sylvia
- Department of Biology, Pennsylvania State University, University Park, State College, PA, 16802, USA
| | - Peter C Forstmeier
- Department of Biochemistry, Microbiology, and Molecular Biology, Pennsylvania State University, University Park, State College, PA, 16802, USA
| | - Andrew J Olson
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
| | - Doreen Ware
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA.,USDA ARS NAA Robert W. Holley Center for Agriculture and Health, Ithaca, NY, 14853, USA
| | - Philip C Bevilacqua
- Department of Biochemistry, Microbiology, and Molecular Biology, Pennsylvania State University, University Park, State College, PA, 16802, USA.,Department of Chemistry, Pennsylvania State University, University Park, State College, PA, 16802, USA.,Center for RNA Molecular Biology, Pennsylvania State University, University Park, State College, PA, 16802, USA
| | - Sarah M Assmann
- Department of Biology, Pennsylvania State University, University Park, State College, PA, 16802, USA. .,Center for RNA Molecular Biology, Pennsylvania State University, University Park, State College, PA, 16802, USA.
| |
Collapse
|
6
|
Martens L, Rühle F, Witten A, Meder B, Katus HA, Arbustini E, Hasenfuß G, Sinner MF, Kääb S, Pankuweit S, Angermann C, Bornberg-Bauer E, Stoll M. A genetic variant alters the secondary structure of the lncRNA H19 and is associated with dilated cardiomyopathy. RNA Biol 2021; 18:409-415. [PMID: 34313541 DOI: 10.1080/15476286.2021.1952756] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022] Open
Abstract
lncRNAs are at the core of many regulatory processes and have also been recognized to be involved in various complex diseases. They affect gene regulation through direct interactions with RNA, DNA or proteins. Accordingly, lncRNA structure is likely to be essential for their regulatory function. Point mutations, which manifest as SNPs (single nucleotide polymorphisms) in genome screens, can substantially alter their function and, subsequently, the expression of their downstream regulated genes. To test the effect of SNPs on structure, we investigated lncRNAs associated with dilated cardiomyopathy. Among 322 human candidate lncRNAs, we demonstrate first the significant association of an SNP located in lncRNA H19 using data from 1084 diseased and 751 control patients. H19 is generally highly expressed in the heart, with a complex expression pattern during heart development. Next, we used MFE (minimum free energy) folding to demonstrate a significant refolding in the secondary structure of this 861 nt long lncRNA. Since MFE folding may overlook the importance of sub-optimal structures, we showed that this refolding also manifests in the overall Boltzmann structure ensemble. There, the composition of structures is tremendously affected in their thermodynamic probabilities through the genetic variant. Finally, we confirmed these results experimentally, using SHAPE-Seq, corroborating that SNPs affecting such structures may explain hidden genetic variance not accounted for through genome wide association studies. Our results suggest that structural changes in lncRNAs, and lncRNA H19 in particular, affect regulatory processes and represent optimal targets for further in-depth studies probing their molecular interactions.
Collapse
Affiliation(s)
- Leonie Martens
- Department of Genetic Epidemiology, Institute of Human Genetics, University of Münster, Münster, Germany
| | - Frank Rühle
- Department of Genetic Epidemiology, Institute of Human Genetics, University of Münster, Münster, Germany
| | - Anika Witten
- Department of Genetic Epidemiology, Institute of Human Genetics, University of Münster, Münster, Germany
| | - Benjamin Meder
- German Centre for Cardiovascular Research (DZHK), Partner Site Heidelberg/Mannheim, Heidelberg, Germany.,Department of Cardiology, Heidelberg University, Heidelberg, Germany.,Genome Technology Center Stanford, Department of Genetics, Stanford University, Stanford, United States
| | - Hugo A Katus
- German Centre for Cardiovascular Research (DZHK), Partner Site Heidelberg/Mannheim, Heidelberg, Germany.,Department of Cardiology, Heidelberg University, Heidelberg, Germany
| | - Eloisa Arbustini
- Centre for Inherited Cardiovascular Diseases, IRCCS Foundation, University Hospital Policlinico San Matteo, Pavia, Italy
| | - Gerd Hasenfuß
- Department of Cardiology and Pneumology, University Medical Center Göttingen, Göttingen, Germany.,German Center for Cardiovascular Research (DZHK), Partner Site, Göttingen, Germany
| | - Moritz F Sinner
- Department of Cardiology, University Hospital, LMU Munich, Munich, Germany.,German Centre for Cardiovascular Research (DZHK), Partner Site: Munich Heart Alliance, Munich, Germany
| | - Stefan Kääb
- Department of Cardiology, University Hospital, LMU Munich, Munich, Germany.,German Centre for Cardiovascular Research (DZHK), Partner Site: Munich Heart Alliance, Munich, Germany
| | - Sabine Pankuweit
- Department of Cardiology, University Hospital Giessen and Marburg, Marburg, Germany
| | - Christiane Angermann
- Comprehensive Heart Failure Center, University Hospital and University of Würzburg, Würzburg, Germany
| | - Erich Bornberg-Bauer
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | - Monika Stoll
- Department of Genetic Epidemiology, Institute of Human Genetics, University of Münster, Münster, Germany.,Department of Biochemistry, Genetic Epidemiology and Statistical Genetics, CARIM School for Cardiovascular Diseases, Maastricht Center for Systems Biology (MaCSBio), Maastricht University, Maastricht, The Netherlands
| |
Collapse
|
7
|
Dynamic Molecular Epidemiology Reveals Lineage-Associated Single-Nucleotide Variants That Alter RNA Structure in Chikungunya Virus. Genes (Basel) 2021; 12:genes12020239. [PMID: 33567556 PMCID: PMC7914560 DOI: 10.3390/genes12020239] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2021] [Revised: 01/29/2021] [Accepted: 02/04/2021] [Indexed: 01/21/2023] Open
Abstract
Chikungunya virus (CHIKV) is an emerging Alphavirus which causes millions of human infections every year. Outbreaks have been reported in Africa and Asia since the early 1950s, from three CHIKV lineages: West African, East Central South African, and Asian Urban. As new outbreaks occurred in the Americas, individual strains from the known lineages have evolved, creating new monophyletic groups that generated novel geographic-based lineages. Building on a recently updated phylogeny of CHIKV, we report here the availability of an interactive CHIKV phylodynamics dataset, which is based on more than 900 publicly available CHIKV genomes. We provide an interactive view of CHIKV molecular epidemiology built on Nextstrain, a web-based visualization framework for real-time tracking of pathogen evolution. CHIKV molecular epidemiology reveals single nucleotide variants that change the stability and fold of locally stable RNA structures. We propose alternative RNA structure formation in different CHIKV lineages by predicting more than a dozen RNA elements that are subject to perturbation of the structure ensemble upon variation of a single nucleotide.
Collapse
|
8
|
Lin J, Chen Y, Zhang Y, Ouyang Z. Identification and analysis of RNA structural disruptions induced by single nucleotide variants using Riprap and RiboSNitchDB. NAR Genom Bioinform 2020; 2:lqaa057. [PMID: 33575608 PMCID: PMC7671322 DOI: 10.1093/nargab/lqaa057] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2019] [Revised: 06/22/2020] [Accepted: 08/11/2020] [Indexed: 11/27/2022] Open
Abstract
RNA conformational alteration has significant impacts on cellular processes and phenotypic variations. An emerging genetic factor of RNA conformational alteration is a new class of single nucleotide variant (SNV) named riboSNitch. RiboSNitches have been demonstrated to be involved in many genetic diseases. However, identifying riboSNitches is notably difficult as the signals of RNA structural disruption are often subtle. Here, we introduce a novel computational framework–RIboSNitch Predictor based on Robust Analysis of Pairing probabilities (Riprap). Riprap identifies structurally disrupted regions around any given SNVs based on robust analysis of local structural configurations between wild-type and mutant RNA sequences. Compared to previous approaches, Riprap shows higher accuracy when assessed on hundreds of known riboSNitches captured by various experimental RNA structure probing methods including the parallel analysis of RNA structure (PARS) and the selective 2′-hydroxyl acylation analyzed by primer extension (SHAPE). Further, Riprap detects the experimentally validated riboSNitch that regulates human catechol-O-methyltransferase haplotypes and outputs structurally disrupted regions precisely at base resolution. Riprap provides a new approach to interpreting disease-related genetic variants. In addition, we construct a database (RiboSNitchDB) that includes the annotation and visualization of all presented riboSNitches in this study as well as 24 629 predicted riboSNitches from human expression quantitative trait loci.
Collapse
Affiliation(s)
- Jianan Lin
- Department of Biostatistics and Epidemiology, School of Public Health and Health Sciences, University of Massachusetts, Amherst, MA 01003, USA
| | - Yang Chen
- Department of Biostatistics and Epidemiology, School of Public Health and Health Sciences, University of Massachusetts, Amherst, MA 01003, USA
| | - Yuping Zhang
- Department of Statistics, University of Connecticut, Storrs, CT 06269, USA
| | - Zhengqing Ouyang
- Department of Biostatistics and Epidemiology, School of Public Health and Health Sciences, University of Massachusetts, Amherst, MA 01003, USA
| |
Collapse
|
9
|
Lu X, Ding Y, Bai Y, Li J, Zhang G, Wang S, Gao W, Xu L, Wang H. Detection of Allosteric Effects of lncRNA Secondary Structures Altered by SNPs in Human Diseases. Front Cell Dev Biol 2020; 8:242. [PMID: 32322582 PMCID: PMC7156602 DOI: 10.3389/fcell.2020.00242] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2020] [Accepted: 03/23/2020] [Indexed: 12/19/2022] Open
Abstract
Recent studies have shown that structuralized long non-coding RNAs (lncRNAs) play important roles in genetic and epigenetic processes. The spatial structures of most lncRNAs can be altered by distinct in vivo and in vitro cellular environments, as well as by DNA structural variations, such as single-nucleotide polymorphisms (SNPs) and variants (SNVs). In the present study, we extended candidate SNPs that had linkage disequilibria with those significantly associated with lung diseases in genome-wide association studies in order to investigate potential disease mechanisms originating from SNP structural changes of host lncRNAs. Following accurate alignments, we recognized 115 ternary-relationship pairs among 41 SNPs, 10 lncRNA transcripts, and 1 type of lung disease (adenocarcinoma of the lung). Then, we evaluated the structural heterogeneity induced by SNP alleles by developing a local-RNA-structure alignment algorithm and employing randomized strategies to determine the significance of structural variation. We identified four ternary-relationship pairs that were significantly associated with SNP-induced lncRNA allosteric effects. Moreover, these conformational changes disrupted the interactive regions and binding affinities of lncRNA-HCG23 and TF-E2F6, suggesting that these may represent regulatory mechanisms in lung diseases. Taken together, our findings support that SNP-induced changes in lncRNA conformations regulate many biological processes, providing novel insight into the role of the lncRNA “structurome” in human diseases.
Collapse
Affiliation(s)
- Xiaoyan Lu
- School of Ophthalmology and Optometry and Eye Hospital, School of Biomedical Engineering, Wenzhou Medical University, Wenzhou, China.,College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Yu Ding
- School of Ophthalmology and Optometry and Eye Hospital, School of Biomedical Engineering, Wenzhou Medical University, Wenzhou, China.,College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Yu Bai
- School of Ophthalmology and Optometry and Eye Hospital, School of Biomedical Engineering, Wenzhou Medical University, Wenzhou, China.,College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Jing Li
- School of Ophthalmology and Optometry and Eye Hospital, School of Biomedical Engineering, Wenzhou Medical University, Wenzhou, China.,College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Guosi Zhang
- School of Ophthalmology and Optometry and Eye Hospital, School of Biomedical Engineering, Wenzhou Medical University, Wenzhou, China
| | - Siyu Wang
- School of Ophthalmology and Optometry and Eye Hospital, School of Biomedical Engineering, Wenzhou Medical University, Wenzhou, China
| | - Wenyan Gao
- School of Ophthalmology and Optometry and Eye Hospital, School of Biomedical Engineering, Wenzhou Medical University, Wenzhou, China.,College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Liangde Xu
- School of Ophthalmology and Optometry and Eye Hospital, School of Biomedical Engineering, Wenzhou Medical University, Wenzhou, China.,College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Hong Wang
- School of Ophthalmology and Optometry and Eye Hospital, School of Biomedical Engineering, Wenzhou Medical University, Wenzhou, China.,College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| |
Collapse
|
10
|
Pulos-Holmes MC, Srole DN, Juarez MG, Lee ASY, McSwiggen DT, Ingolia NT, Cate JH. Repression of ferritin light chain translation by human eIF3. eLife 2019; 8:48193. [PMID: 31414986 PMCID: PMC6721798 DOI: 10.7554/elife.48193] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2019] [Accepted: 08/14/2019] [Indexed: 11/13/2022] Open
Abstract
A central problem in human biology remains the discovery of causal molecular links between mutations identified in genome-wide association studies (GWAS) and their corresponding disease traits. This challenge is magnified for variants residing in non-coding regions of the genome. Single-nucleotide polymorphisms (SNPs) in the 5ʹ untranslated region (5ʹ-UTR) of the ferritin light chain (FTL) gene that cause hyperferritinemia are reported to disrupt translation repression by altering iron regulatory protein (IRP) interactions with the FTL mRNA 5ʹ-UTR. Here, we show that human eukaryotic translation initiation factor 3 (eIF3) acts as a distinct repressor of FTL mRNA translation, and eIF3-mediated FTL repression is disrupted by a subset of SNPs in FTL that cause hyperferritinemia. These results identify a direct role for eIF3-mediated translational control in a specific human disease.
Collapse
Affiliation(s)
- Mia C Pulos-Holmes
- Department of Molecular & Cell Biology, University of California, Berkeley, Berkeley, United States
| | - Daniel N Srole
- Department of Molecular & Cell Biology, University of California, Berkeley, Berkeley, United States
| | - Maria G Juarez
- Department of Molecular & Cell Biology, University of California, Berkeley, Berkeley, United States
| | - Amy S-Y Lee
- Biology Department, Rosenstiel Basic Medical Science Research Center, Brandeis University, Waltham, United States
| | - David T McSwiggen
- Department of Molecular & Cell Biology, University of California, Berkeley, Berkeley, United States
| | - Nicholas T Ingolia
- Department of Molecular & Cell Biology, University of California, Berkeley, Berkeley, United States.,California Institute for Quantitative Biosciences, University of California, Berkeley, Berkeley, United States
| | - Jamie H Cate
- Department of Molecular & Cell Biology, University of California, Berkeley, Berkeley, United States.,California Institute for Quantitative Biosciences, University of California, Berkeley, Berkeley, United States.,Department of Chemistry, University of California, Berkeley, Berkeley, United States.,Molecular Biophysics & Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, United States
| |
Collapse
|
11
|
He F, Wei R, Zhou Z, Huang L, Wang Y, Tang J, Zou Y, Shi L, Gu X, Davis MJ, Su Z. Integrative Analysis of Somatic Mutations in Non-coding Regions Altering RNA Secondary Structures in Cancer Genomes. Sci Rep 2019; 9:8205. [PMID: 31160636 PMCID: PMC6546760 DOI: 10.1038/s41598-019-44489-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2018] [Accepted: 05/17/2019] [Indexed: 01/01/2023] Open
Abstract
RNA secondary structure may influence many cellular processes, including RNA processing, stability, localization, and translation. Single-nucleotide variations (SNVs) that alter RNA secondary structure, referred to as riboSNitches, are potentially causative of human diseases, especially in untranslated regions (UTRs) and noncoding RNAs (ncRNAs). The functions of somatic mutations that act as riboSNitches in cancer development remain poorly understood. In this study, we developed a computational pipeline called SNIPER (riboSNitch-enriched or depleted elements in cancer genomes), which employs MeanDiff and EucDiff to detect riboSNitches and then identifies riboSNitch-enriched or riboSNitch-depleted non-coding elements across tumors. SNIPER is available at github: https://github.com/suzhixi/SNIPER/. We found that riboSNitches were more likely to be pathogenic. Moreover, we predicted several UTRs and lncRNAs (long non-coding RNA) that significantly enriched or depleted riboSNitches in cancer genomes, indicative of potential cancer driver or essential noncoding elements. Our study highlights the possibly neglected importance of RNA secondary structure in cancer genomes and provides a new strategy to identify new cancer-associated genes.
Collapse
Affiliation(s)
- Funan He
- Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200433, China
| | - Ran Wei
- Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200433, China
| | - Zhan Zhou
- Institute of Drug Metabolism and Pharmaceutical Analysis and Zhejiang Provincial Key Laboratory of Anti-Cancer Drug Research, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Leihuan Huang
- Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200433, China
| | - Yinan Wang
- Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200433, China
| | - Jie Tang
- Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200433, China
| | - Yangyun Zou
- Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200433, China
| | - Leming Shi
- Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200433, China.,Shanghai Cancer Center and Cancer Institute, Fudan University, Shanghai, 200032, China
| | - Xun Gu
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, Iowa, 50011, USA
| | - Melissa J Davis
- Bioinformatics Division, Walter and Eliza Hall Institute of Medical Research, 1G Royal Parade, Parkville, VIC, 3052, Australia
| | - Zhixi Su
- Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200433, China. .,Singlera Genomics Inc, Shanghai, China.
| |
Collapse
|
12
|
Choudhary K, Lai YH, Tran EJ, Aviran S. dStruct: identifying differentially reactive regions from RNA structurome profiling data. Genome Biol 2019; 20:40. [PMID: 30791935 PMCID: PMC6385470 DOI: 10.1186/s13059-019-1641-3] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2018] [Accepted: 01/24/2019] [Indexed: 12/16/2022] Open
Abstract
RNA biology is revolutionized by recent developments of diverse high-throughput technologies for transcriptome-wide profiling of molecular RNA structures. RNA structurome profiling data can be used to identify differentially structured regions between groups of samples. Existing methods are limited in scope to specific technologies and/or do not account for biological variation. Here, we present dStruct which is the first broadly applicable method for differential analysis accounting for biological variation in structurome profiling data. dStruct is compatible with diverse profiling technologies, is validated with experimental data and simulations, and outperforms existing methods.
Collapse
Affiliation(s)
- Krishna Choudhary
- Department of Biomedical Engineering and Genome Center, University of California, Davis, One Shields Avenue, Davis, 95616 CA USA
| | - Yu-Hsuan Lai
- Department of Biochemistry, Purdue University, BCHM 305, 175 S. University Street, West Lafayette, 47907-2063 IN USA
| | - Elizabeth J. Tran
- Department of Biochemistry, Purdue University, BCHM 305, 175 S. University Street, West Lafayette, 47907-2063 IN USA
- Purdue University Center for Cancer Research, Purdue University, Hansen Life Sciences Research Building, Room 141, 201 S. University Street, West Lafayette, 47907-2064 IN USA
| | - Sharon Aviran
- Department of Biomedical Engineering and Genome Center, University of California, Davis, One Shields Avenue, Davis, 95616 CA USA
| |
Collapse
|
13
|
Lackey L, Coria A, Woods C, McArthur E, Laederach A. Allele-specific SHAPE-MaP assessment of the effects of somatic variation and protein binding on mRNA structure. RNA (NEW YORK, N.Y.) 2018; 24:513-528. [PMID: 29317542 PMCID: PMC5855952 DOI: 10.1261/rna.064469.117] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/06/2017] [Accepted: 01/04/2018] [Indexed: 05/22/2023]
Abstract
The impact of inherited and somatic mutations on messenger RNA (mRNA) structure remains poorly understood. Recent technological advances that leverage next-generation sequencing to obtain experimental structure data, such as SHAPE-MaP, can reveal structural effects of mutations, especially when these data are incorporated into structure modeling. Here, we analyze the ability of SHAPE-MaP to detect the relatively subtle structural changes caused by single-nucleotide mutations. We find that allele-specific sorting greatly improved our detection ability. Thus, we used SHAPE-MaP with a novel combination of clone-free robotic mutagenesis and allele-specific sorting to perform a rapid, comprehensive survey of noncoding somatic and inherited riboSNitches in two cancer-associated mRNAs, TPT1 and LCP1 Using rigorous thermodynamic modeling of the Boltzmann suboptimal ensemble, we identified a subset of mutations that change TPT1 and LCP1 RNA structure, with approximately 14% of all variants identified as riboSNitches. To confirm that these in vitro structures were biologically relevant, we tested how dependent TPT1 and LCP1 mRNA structures were on their environments. We performed SHAPE-MaP on TPT1 and LCP1 mRNAs in the presence or absence of cellular proteins and found that both mRNAs have similar overall folds in all conditions. RiboSNitches identified within these mRNAs in vitro likely exist under biological conditions. Overall, these data reveal a robust mRNA structural landscape where differences in environmental conditions and most sequence variants do not significantly alter RNA structural ensembles. Finally, predicting riboSNitches in mRNAs from sequence alone remains particularly challenging; these data will provide the community with benchmarks for further algorithmic development.
Collapse
Affiliation(s)
- Lela Lackey
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| | - Aaztli Coria
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| | - Chanin Woods
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| | - Evonne McArthur
- School of Medicine, Vanderbilt University, Nashville, Tennessee 37232, USA
| | - Alain Laederach
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| |
Collapse
|
14
|
Steri M, Idda ML, Whalen MB, Orrù V. Genetic variants in mRNA untranslated regions. WILEY INTERDISCIPLINARY REVIEWS-RNA 2018; 9:e1474. [PMID: 29582564 DOI: 10.1002/wrna.1474] [Citation(s) in RCA: 98] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/10/2017] [Revised: 02/05/2018] [Accepted: 02/11/2018] [Indexed: 12/24/2022]
Abstract
Genome Wide Association Studies (GWAS) have mapped thousands of genetic variants associated with complex disease risk and regulating quantitative traits, thus exploiting an unprecedented high-resolution genetic characterization of the human genome. A small fraction (3.7%) of the identified associations is located in untranslated regions (UTRs), and the molecular mechanism has been elucidated for few of them. Genetic variations at UTRs may modify regulatory elements affecting the interaction of the UTRs with proteins and microRNAs. The overall functional consequences include modulation of messenger RNA (mRNA) transcription, secondary structure, stability, localization, translation, and access to regulators like microRNAs (miRNAs) and RNA-binding proteins (RBPs). Alterations of these regulatory mechanisms are known to modify molecular pathways and cellular processes, potentially leading to disease processes. Here, we analyze some examples of genetic risk variants mapping in the UTR regulatory elements. We describe a recently identified genetic variant localized in the 3'UTR of the TNFSF13B gene, associated with autoimmunity risk and responsible of an increased stability and translation of TNFSF13B mRNA. We discuss how the correct use and interpretation of public GWAS repositories could lead to a better understanding of etiopathogenetic mechanisms and the generation of robust biological hypothesis as starting point for further functional studies. This article is categorized under: RNA Structure and Dynamics > RNA Structure, Dynamics and Chemistry RNA Evolution and Genomics > Computational Analyses of RNA RNA in Disease and Development > RNA in Disease.
Collapse
Affiliation(s)
- Maristella Steri
- Istituto di Ricerca Genetica e Biomedica, Consiglio Nazionale delle Ricerche (CNR), Monserrato, Cagliari, Italy
| | - M Laura Idda
- Laboratory of Genetics and Genomics, National Institute on Aging, National Institute of Health, Baltimore, Maryland
| | - Michael B Whalen
- Istituto di Biofisica, Consiglio Nazionale delle Ricerche (CNR), Trento, Italy
| | - Valeria Orrù
- Istituto di Ricerca Genetica e Biomedica, Consiglio Nazionale delle Ricerche (CNR), Monserrato, Cagliari, Italy
| |
Collapse
|
15
|
Woods CT, Laederach A. Classification of RNA structure change by 'gazing' at experimental data. Bioinformatics 2018; 33:1647-1655. [PMID: 28130241 PMCID: PMC5447233 DOI: 10.1093/bioinformatics/btx041] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2016] [Accepted: 01/20/2017] [Indexed: 11/12/2022] Open
Abstract
Motivation Mutations (or Single Nucleotide Variants) in folded RiboNucleic Acid structures that cause local or global conformational change are riboSNitches. Predicting riboSNitches is challenging, as it requires making two, albeit related, structure predictions. The data most often used to experimentally validate riboSNitch predictions is Selective 2' Hydroxyl Acylation by Primer Extension, or SHAPE. Experimentally establishing a riboSNitch requires the quantitative comparison of two SHAPE traces: wild-type (WT) and mutant. Historically, SHAPE data was collected on electropherograms and change in structure was evaluated by 'gel gazing.' SHAPE data is now routinely collected with next generation sequencing and/or capillary sequencers. We aim to establish a classifier capable of simulating human 'gazing' by identifying features of the SHAPE profile that human experts agree 'looks' like a riboSNitch. Results We find strong quantitative agreement between experts when RNA scientists 'gaze' at SHAPE data and identify riboSNitches. We identify dynamic time warping and seven other features predictive of the human consensus. The classSNitch classifier reported here accurately reproduces human consensus for 167 mutant/WT comparisons with an Area Under the Curve (AUC) above 0.8. When we analyze 2019 mutant traces for 17 different RNAs, we find that features of the WT SHAPE reactivity allow us to improve thermodynamic structure predictions of riboSNitches. This is significant, as accurate RNA structural analysis and prediction is likely to become an important aspect of precision medicine. Availability and Implementation The classSNitch R package is freely available at http://classsnitch.r-forge.r-project.org . Contact alain@email.unc.edu. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Chanin Tolson Woods
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.,Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Alain Laederach
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.,Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| |
Collapse
|
16
|
Froebel BR, Trujillo AJ, Sullivan JM. Effects of Pathogenic Variations in the Human Rhodopsin Gene (hRHO) on the Predicted Accessibility for a Lead Candidate Ribozyme. Invest Ophthalmol Vis Sci 2017; 58:3576-3591. [PMID: 28715844 PMCID: PMC5516567 DOI: 10.1167/iovs.16-20877] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
Purpose The mutation-independent strategy for hammerhead ribozyme (hhRz) or RNA interference (RNAi)-based gene therapeutics to treat autosomal dominant diseases is predicated on the hypothesis that a single therapeutic would equivalently suppress all/most of the diverse mutant mRNAs in patients with the disease phenotype. However, the hypothesis has not been formally tested. We address this through a comprehensive bioinformatics study of how mutations affect target mRNA structure accessibility for a single lead hhRz therapeutic (725GUC↓), designed against human rod rhodopsin mRNA (hRHO), for patients with hRHO mutations that cause autosomal dominant retinitis pigmentosa. Methods A total of 199 in silico coding region mutations (missense, nonsense, insert, deletion, indel) were made in hRHO mRNA based on Human Gene Mutation Database and Database of Single Nucleotide Polymorphisms. Each mRNA was folded with MFold, SFold, and OligoWalk algorithms and subjected to a bioinformatics model called multiparameter prediction of RNA accessibility. Predicted accessibility of each mutant over both a broad local region and the explicit lead ribozyme annealing site were compared quantitatively to wild-type hRHO mRNA. Results Accessibility of the 725GUC↓ site is sensitive to some mutations. For single nucleotide missense mutations, proximity of the mutation to the hhRz annealing site increases the impact on predicted accessibility, but some distant mutations also influence accessibility. Conclusions A mutation-independent strategy appears viable in this specific context but certain mutations could significantly influence ribozyme or RNAi efficacy through impact on accessibility at the target annealing site/region. This possibility must be considered in applications of this gene therapy strategy.
Collapse
Affiliation(s)
- Beau R Froebel
- Research Service, VA Western New York Healthcare System, Buffalo, New York, United States 2Department of Ophthalmology, State University of New York, University at Buffalo, Buffalo, New York, United States 3The Ross Eye Institute of University at Buffalo, Buffalo, New York, United States
| | - Alexandria J Trujillo
- Research Service, VA Western New York Healthcare System, Buffalo, New York, United States 2Department of Ophthalmology, State University of New York, University at Buffalo, Buffalo, New York, United States 4Department of Pharmacology/Toxicology, State University of New York, University at Buffalo, Buffalo, New York, United States
| | - Jack M Sullivan
- Research Service, VA Western New York Healthcare System, Buffalo, New York, United States 2Department of Ophthalmology, State University of New York, University at Buffalo, Buffalo, New York, United States 3The Ross Eye Institute of University at Buffalo, Buffalo, New York, United States 4Department of Pharmacology/Toxicology, State University of New York, University at Buffalo, Buffalo, New York, United States 5Department of Physiology/Biophysics, State University of New York, University at Buffalo, Buffalo, New York, United States 6Neuroscience Program, State University of New York, University at Buffalo, Buffalo, New York, United States 7The RNA Institute, University at Albany-State University of New York, Albany, New York, United States
| |
Collapse
|
17
|
Woods CT, Lackey L, Williams B, Dokholyan NV, Gotz D, Laederach A. Comparative Visualization of the RNA Suboptimal Conformational Ensemble In Vivo. Biophys J 2017. [PMID: 28625696 PMCID: PMC5529173 DOI: 10.1016/j.bpj.2017.05.031] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
When a ribonucleic acid (RNA) molecule folds, it often does not adopt a single, well-defined conformation. The folding energy landscape of an RNA is highly dependent on its nucleotide sequence and molecular environment. Cellular molecules sometimes alter the energy landscape, thereby changing the ensemble of likely low-energy conformations. The effects of these energy landscape changes on the conformational ensemble are particularly challenging to visualize for large RNAs. We have created a robust approach for visualizing the conformational ensemble of RNAs that is well suited for in vitro versus in vivo comparisons. Our method creates a stable map of conformational space for a given RNA sequence. We first identify single point mutations in the RNA that maximally sample suboptimal conformational space based on the ensemble’s partition function. Then, we cluster these diverse ensembles to identify the most diverse partition functions for Boltzmann stochastic sampling. By using, to our knowledge, a novel nestedness distance metric, we iteratively add mutant suboptimal ensembles to converge on a stable 2D map of conformational space. We then compute the selective 2′ hydroxyl acylation by primer extension (SHAPE)-directed ensemble for the RNA folding under different conditions, and we project these ensembles on the map to visualize. To validate our approach, we established a conformational map of the Vibrio vulnificus add adenine riboswitch that reveals five classes of structures. In the presence of adenine, projection of the SHAPE-directed sampling correctly identified the on-conformation; without the ligand, only off-conformations were visualized. We also collected the whole-transcript in vitro and in vivo SHAPE-MaP for human β-actin messenger RNA that revealed similar global folds in both conditions. Nonetheless, a comparison of in vitro and in vivo data revealed that specific regions exhibited significantly different SHAPE-MaP profiles indicative of structural rearrangements, including rearrangement consistent with binding of the zipcode protein in a region distal to the stop codon.
Collapse
Affiliation(s)
- Chanin T Woods
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina; Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina
| | - Lela Lackey
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina
| | - Benfeard Williams
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina
| | - Nikolay V Dokholyan
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina
| | - David Gotz
- Carolina Health Informatics Program, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina; School of Information and Library Science, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina
| | - Alain Laederach
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina; Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina.
| |
Collapse
|
18
|
The roles of RNA processing in translating genotype to phenotype. NATURE REVIEWS. MOLECULAR CELL BIOLOGY 2016. [PMID: 27847391 DOI: 10.1038/nrm.2016.139.] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
A goal of human genetics studies is to determine the mechanisms by which genetic variation produces phenotypic differences that affect human health. Efforts in this respect have previously focused on genetic variants that affect mRNA levels by altering epigenetic and transcriptional regulation. Recent studies show that genetic variants that affect RNA processing are at least equally as common as, and are largely independent from, those variants that affect transcription. We highlight the impact of genetic variation on pre-mRNA splicing and polyadenylation, and on the stability, translation and structure of mRNAs as mechanisms that produce phenotypic traits. These results emphasize the importance of including RNA processing signals in analyses to identify functional variants.
Collapse
|
19
|
Manning KS, Cooper TA. The roles of RNA processing in translating genotype to phenotype. Nat Rev Mol Cell Biol 2016; 18:102-114. [PMID: 27847391 DOI: 10.1038/nrm.2016.139] [Citation(s) in RCA: 141] [Impact Index Per Article: 15.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
A goal of human genetics studies is to determine the mechanisms by which genetic variation produces phenotypic differences that affect human health. Efforts in this respect have previously focused on genetic variants that affect mRNA levels by altering epigenetic and transcriptional regulation. Recent studies show that genetic variants that affect RNA processing are at least equally as common as, and are largely independent from, those variants that affect transcription. We highlight the impact of genetic variation on pre-mRNA splicing and polyadenylation, and on the stability, translation and structure of mRNAs as mechanisms that produce phenotypic traits. These results emphasize the importance of including RNA processing signals in analyses to identify functional variants.
Collapse
Affiliation(s)
- Kassie S Manning
- Department of Pathology and Immunology, Baylor College of Medicine, Houston, Texas 77030, USA.,Integrative Molecular and Biomedical Sciences Program, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Thomas A Cooper
- Department of Pathology and Immunology, Baylor College of Medicine, Houston, Texas 77030, USA.,Department of Molecular and Cellular Biology, Baylor College of Medicine, Houston, Texas 77030, USA.,Department of Molecular Physiology and Biophysics, Baylor College of Medicine, Houston, Texas 77030, USA.,Integrative Molecular and Biomedical Sciences Program, Baylor College of Medicine, Houston, Texas 77030, USA
| |
Collapse
|
20
|
Abstract
Single-stranded RNA molecules fold into extraordinarily complicated secondary and tertiary structures as a result of intramolecular base pairing. In vivo, these RNA structures are not static. Instead, they are remodeled in response to changes in the prevailing physicochemical environment of the cell and as a result of intermolecular base pairing and interactions with RNA-binding proteins. Remarkable technical advances now allow us to probe RNA secondary structure at single-nucleotide resolution and genome-wide, both in vitro and in vivo. These data sets provide new glimpses into the RNA universe. Analyses of RNA structuromes in HIV, yeast, Arabidopsis, and mammalian cells and tissues have revealed regulatory effects of RNA structure on messenger RNA (mRNA) polyadenylation, splicing, translation, and turnover. Application of new methods for genome-wide identification of mRNA modifications, particularly methylation and pseudouridylation, has shown that the RNA "epitranscriptome" both influences and is influenced by RNA structure. In this review, we describe newly developed genome-wide RNA structure-probing methods and synthesize the information emerging from their application.
Collapse
Affiliation(s)
- Philip C Bevilacqua
- Department of Chemistry.,Department of Biochemistry and Molecular Biology.,Center for RNA Molecular Biology
| | | | - Zhao Su
- Department of Biology, Pennsylvania State University, University Park, Pennsylvania 16802;
| | - Sarah M Assmann
- Department of Biology, Pennsylvania State University, University Park, Pennsylvania 16802;
| |
Collapse
|
21
|
Bhartiya D, Scaria V. Genomic variations in non-coding RNAs: Structure, function and regulation. Genomics 2016; 107:59-68. [DOI: 10.1016/j.ygeno.2016.01.005] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2015] [Revised: 01/05/2016] [Accepted: 01/08/2016] [Indexed: 01/05/2023]
|
22
|
Kutchko KM, Sanders W, Ziehr B, Phillips G, Solem A, Halvorsen M, Weeks KM, Moorman N, Laederach A. Multiple conformations are a conserved and regulatory feature of the RB1 5' UTR. RNA (NEW YORK, N.Y.) 2015; 21:1274-85. [PMID: 25999316 PMCID: PMC4478346 DOI: 10.1261/rna.049221.114] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2014] [Accepted: 03/27/2015] [Indexed: 05/22/2023]
Abstract
Folding to a well-defined conformation is essential for the function of structured ribonucleic acids (RNAs) like the ribosome and tRNA. Structured elements in the untranslated regions (UTRs) of specific messenger RNAs (mRNAs) are known to control expression. The importance of unstructured regions adopting multiple conformations, however, is still poorly understood. High-resolution SHAPE-directed Boltzmann suboptimal sampling of the Homo sapiens Retinoblastoma 1 (RB1) 5' UTR yields three distinct conformations compatible with the experimental data. Private single nucleotide variants (SNVs) identified in two patients with retinoblastoma each collapse the structural ensemble to a single but distinct well-defined conformation. The RB1 5' UTRs from Bos taurus (cow) and Trichechus manatus latirostris (manatee) are divergent in sequence from H. sapiens (human) yet maintain structural compatibility with high-probability base pairs. SHAPE chemical probing of the cow and manatee RB1 5' UTRs reveals that they also adopt multiple conformations. Luciferase reporter assays reveal that 5' UTR mutations alter RB1 expression. In a traditional model of disease, causative SNVs disrupt a key structural element in the RNA. For the subset of patients with heritable retinoblastoma-associated SNVs in the RB1 5' UTR, the absence of multiple structures is likely causative of the cancer. Our data therefore suggest that selective pressure will favor multiple conformations in eukaryotic UTRs to regulate expression.
Collapse
Affiliation(s)
- Katrina M Kutchko
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina 27599-3290, USA Curriculum in Bioinformatics and Computational Biology, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| | - Wes Sanders
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina 27599-3290, USA
| | - Ben Ziehr
- Department of Microbiology and Immunology, University of North Carolina, Chapel Hill, North Carolina 27599, USA Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| | - Gabriela Phillips
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina 27599-3290, USA
| | - Amanda Solem
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina 27599-3290, USA
| | - Matthew Halvorsen
- Institute for Genomic Medicine, Columbia University Medical Center, New York, New York 10032, USA
| | - Kevin M Weeks
- Department of Chemistry, University of North Carolina, Chapel Hill, North Carolina 27599-3290, USA
| | - Nathaniel Moorman
- Department of Microbiology and Immunology, University of North Carolina, Chapel Hill, North Carolina 27599, USA Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| | - Alain Laederach
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina 27599-3290, USA
| |
Collapse
|
23
|
Solem AC, Halvorsen M, Ramos SBV, Laederach A. The potential of the riboSNitch in personalized medicine. WILEY INTERDISCIPLINARY REVIEWS-RNA 2015; 6:517-32. [PMID: 26115028 PMCID: PMC4543445 DOI: 10.1002/wrna.1291] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/29/2014] [Revised: 03/25/2015] [Accepted: 05/13/2015] [Indexed: 01/28/2023]
Abstract
RNA conformation plays a significant role in stability, ligand binding, transcription, and translation. Single nucleotide variants (SNVs) have the potential to disrupt specific structural elements because RNA folds in a sequence-specific manner. A riboSNitch is an element of RNA structure with a specific function that is disrupted by an SNV or a single nucleotide polymorphism (SNP; or polymorphism; SNVs occur with low frequency in the population, <1%). The riboSNitch is analogous to a riboswitch, where binding of a small molecule rather than mutation alters the structure of the RNA to control gene regulation. RiboSNitches are particularly relevant to interpreting the results of genome-wide association studies (GWAS). Often GWAS identify SNPs associated with a phenotype mapping to noncoding regions of the genome. Because a majority of the human genome is transcribed, significant subsets of GWAS SNPs are putative riboSNitches. The extent to which the transcriptome is tolerant of SNP-induced structure change is still poorly understood. Recent advances in ultra high-throughput structure probing begin to reveal the structural complexities of mutation-induced structure change. This review summarizes our current understanding of SNV and SNP-induced structure change in the human transcriptome and discusses the importance of riboSNitch discovery in interpreting GWAS results and massive sequencing projects.
Collapse
Affiliation(s)
- Amanda C Solem
- Department of Biology, University of North Carolina, Chapel Hill, NC, USA
| | - Matthew Halvorsen
- Department of Biology, University of North Carolina, Chapel Hill, NC, USA.,Institute for Genomic Medicine, Columbia University, New York, NY, USA
| | - Silvia B V Ramos
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC, USA
| | - Alain Laederach
- Department of Biology, University of North Carolina, Chapel Hill, NC, USA.,Bioinformatics and Computational Biology Program, University of North Carolina, Chapel Hill, NC, USA
| |
Collapse
|
24
|
Corley M, Solem A, Qu K, Chang HY, Laederach A. Detecting riboSNitches with RNA folding algorithms: a genome-wide benchmark. Nucleic Acids Res 2015; 43:1859-68. [PMID: 25618847 PMCID: PMC4330374 DOI: 10.1093/nar/gkv010] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Ribonucleic acid (RNA) secondary structure prediction continues to be a significant challenge, in particular when attempting to model sequences with less rigidly defined structures, such as messenger and non-coding RNAs. Crucial to interpreting RNA structures as they pertain to individual phenotypes is the ability to detect RNAs with large structural disparities caused by a single nucleotide variant (SNV) or riboSNitches. A recently published human genome-wide parallel analysis of RNA structure (PARS) study identified a large number of riboSNitches as well as non-riboSNitches, providing an unprecedented set of RNA sequences against which to benchmark structure prediction algorithms. Here we evaluate 11 different RNA folding algorithms’ riboSNitch prediction performance on these data. We find that recent algorithms designed specifically to predict the effects of SNVs on RNA structure, in particular remuRNA, RNAsnp and SNPfold, perform best on the most rigorously validated subsets of the benchmark data. In addition, our benchmark indicates that general structure prediction algorithms (e.g. RNAfold and RNAstructure) have overall better performance if base pairing probabilities are considered rather than minimum free energy calculations. Although overall aggregate algorithmic performance on the full set of riboSNitches is relatively low, significant improvement is possible if the highest confidence predictions are evaluated independently.
Collapse
Affiliation(s)
- Meredith Corley
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC 37599, USA Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Amanda Solem
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC 37599, USA
| | - Kun Qu
- Program in Epithelial Biology, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Howard Y Chang
- Program in Epithelial Biology, Stanford University School of Medicine, Stanford, CA 94305, USA Howard Hughes Medical Institute, Stanford University, Stanford, CA 94305, USA
| | - Alain Laederach
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC 37599, USA Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| |
Collapse
|
25
|
Li MJ, Wang J. Current trend of annotating single nucleotide variation in humans--A case study on SNVrap. Methods 2014; 79-80:32-40. [PMID: 25308971 DOI: 10.1016/j.ymeth.2014.10.003] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2014] [Revised: 09/25/2014] [Accepted: 10/02/2014] [Indexed: 12/16/2022] Open
Abstract
As high throughput methods, such as whole genome genotyping arrays, whole exome sequencing (WES) and whole genome sequencing (WGS), have detected huge amounts of genetic variants associated with human diseases, function annotation of these variants is an indispensable step in understanding disease etiology. Large-scale functional genomics projects, such as The ENCODE Project and Roadmap Epigenomics Project, provide genome-wide profiling of functional elements across different human cell types and tissues. With the urgent demands for identification of disease-causal variants, comprehensive and easy-to-use annotation tool is highly in demand. Here we review and discuss current progress and trend of the variant annotation field. Furthermore, we introduce a comprehensive web portal for annotating human genetic variants. We use gene-based features and the latest functional genomics datasets to annotate single nucleotide variation (SNVs) in human, at whole genome scale. We further apply several function prediction algorithms to annotate SNVs that might affect different biological processes, including transcriptional gene regulation, alternative splicing, post-transcriptional regulation, translation and post-translational modifications. The SNVrap web portal is freely available at http://jjwanglab.org/snvrap.
Collapse
Affiliation(s)
- Mulin Jun Li
- Centre for Genomic Sciences, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region, China; Department of Biochemistry, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region, China; Shenzhen Institute of Research and Innovation, The University of Hong Kong, Shenzhen, China
| | - Junwen Wang
- Centre for Genomic Sciences, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region, China; Department of Biochemistry, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region, China; Shenzhen Institute of Research and Innovation, The University of Hong Kong, Shenzhen, China.
| |
Collapse
|
26
|
Mirza AH, Kaur S, Brorsson CA, Pociot F. Effects of GWAS-associated genetic variants on lncRNAs within IBD and T1D candidate loci. PLoS One 2014; 9:e105723. [PMID: 25144376 PMCID: PMC4140826 DOI: 10.1371/journal.pone.0105723] [Citation(s) in RCA: 61] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2014] [Accepted: 07/23/2014] [Indexed: 11/19/2022] Open
Abstract
Long non-coding RNAs are a new class of non-coding RNAs that are at the crosshairs in many human diseases such as cancers, cardiovascular disorders, inflammatory and autoimmune disease like Inflammatory Bowel Disease (IBD) and Type 1 Diabetes (T1D). Nearly 90% of the phenotype-associated single-nucleotide polymorphisms (SNPs) identified by genome-wide association studies (GWAS) lie outside of the protein coding regions, and map to the non-coding intervals. However, the relationship between phenotype-associated loci and the non-coding regions including the long non-coding RNAs (lncRNAs) is poorly understood. Here, we systemically identified all annotated IBD and T1D loci-associated lncRNAs, and mapped nominally significant GWAS/ImmunoChip SNPs for IBD and T1D within these lncRNAs. Additionally, we identified tissue-specific cis-eQTLs, and strong linkage disequilibrium (LD) signals associated with these SNPs. We explored sequence and structure based attributes of these lncRNAs, and also predicted the structural effects of mapped SNPs within them. We also identified lncRNAs in IBD and T1D that are under recent positive selection. Our analysis identified putative lncRNA secondary structure-disruptive SNPs within and in close proximity (+/-5 kb flanking regions) of IBD and T1D loci-associated candidate genes, suggesting that these RNA conformation-altering polymorphisms might be associated with diseased-phenotype. Disruption of lncRNA secondary structure due to presence of GWAS SNPs provides valuable information that could be potentially useful for future structure-function studies on lncRNAs.
Collapse
Affiliation(s)
- Aashiq H. Mirza
- Copenhagen Diabetes Research Center (CPH-DIRECT), Department of Pediatrics E, Herlev University Hospital, Herlev, Denmark
- Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Copenhagen, Denmark
| | - Simranjeet Kaur
- Copenhagen Diabetes Research Center (CPH-DIRECT), Department of Pediatrics E, Herlev University Hospital, Herlev, Denmark
- Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Caroline A. Brorsson
- Copenhagen Diabetes Research Center (CPH-DIRECT), Department of Pediatrics E, Herlev University Hospital, Herlev, Denmark
| | - Flemming Pociot
- Copenhagen Diabetes Research Center (CPH-DIRECT), Department of Pediatrics E, Herlev University Hospital, Herlev, Denmark
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Copenhagen, Denmark
- * E-mail:
| |
Collapse
|
27
|
Li MJ, Yan B, Sham PC, Wang J. Exploring the function of genetic variants in the non-coding genomic regions: approaches for identifying human regulatory variants affecting gene expression. Brief Bioinform 2014; 16:393-412. [PMID: 24916300 DOI: 10.1093/bib/bbu018] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2014] [Accepted: 04/23/2014] [Indexed: 12/13/2022] Open
Abstract
Understanding the genetic basis of human traits/diseases and the underlying mechanisms of how these traits/diseases are affected by genetic variations is critical for public health. Current genome-wide functional genomics data uncovered a large number of functional elements in the noncoding regions of human genome, providing new opportunities to study regulatory variants (RVs). RVs play important roles in transcription factor bindings, chromatin states and epigenetic modifications. Here, we systematically review an array of methods currently used to map RVs as well as the computational approaches in annotating and interpreting their regulatory effects, with emphasis on regulatory single-nucleotide polymorphism. We also briefly introduce experimental methods to validate these functional RVs.
Collapse
|
28
|
|
29
|
Ramos SBV, Laederach A. Molecular biology: A second layer of information in RNA. Nature 2014; 505:621-2. [PMID: 24476882 DOI: 10.1038/505621a] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Affiliation(s)
- Silvia B V Ramos
- Obstetrics and Gynecology Department and the Biology Department, University of North Carolina, Chapel Hill, North Carolina 27599-3280, USA
| | - Alain Laederach
- Obstetrics and Gynecology Department and the Biology Department, University of North Carolina, Chapel Hill, North Carolina 27599-3280, USA
| |
Collapse
|
30
|
Sabarinathan R, Wenzel A, Novotny P, Tang X, Kalari KR, Gorodkin J. Transcriptome-wide analysis of UTRs in non-small cell lung cancer reveals cancer-related genes with SNV-induced changes on RNA secondary structure and miRNA target sites. PLoS One 2014; 9:e82699. [PMID: 24416147 PMCID: PMC3885406 DOI: 10.1371/journal.pone.0082699] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2013] [Accepted: 10/26/2013] [Indexed: 01/08/2023] Open
Abstract
Traditional mutation assessment methods generally focus on predicting disruptive changes in protein-coding regions rather than non-coding regulatory regions like untranslated regions (UTRs) of mRNAs. The UTRs, however, are known to have many sequence and structural motifs that can regulate translational and transcriptional efficiency and stability of mRNAs through interaction with RNA-binding proteins and other non-coding RNAs like microRNAs (miRNAs). In a recent study, transcriptomes of tumor cells harboring mutant and wild-type KRAS (V-Ki-ras2 Kirsten rat sarcoma viral oncogene homolog) genes in patients with non-small cell lung cancer (NSCLC) have been sequenced to identify single nucleotide variations (SNVs). About 40% of the total SNVs (73,717) identified were mapped to UTRs, but omitted in the previous analysis. To meet this obvious demand for analysis of the UTRs, we designed a comprehensive pipeline to predict the effect of SNVs on two major regulatory elements, secondary structure and miRNA target sites. Out of 29,290 SNVs in 6462 genes, we predict 472 SNVs (in 408 genes) affecting local RNA secondary structure, 490 SNVs (in 447 genes) affecting miRNA target sites and 48 that do both. Together these disruptive SNVs were present in 803 different genes, out of which 188 (23.4%) were previously known to be cancer-associated. Notably, this ratio is significantly higher (one-sided Fisher's exact test p-value = 0.032) than the ratio (20.8%) of known cancer-associated genes (n = 1347) in our initial data set (n = 6462). Network analysis shows that the genes harboring disruptive SNVs were involved in molecular mechanisms of cancer, and the signaling pathways of LPS-stimulated MAPK, IL-6, iNOS, EIF2 and mTOR. In conclusion, we have found hundreds of SNVs which are highly disruptive with respect to changes in the secondary structure and miRNA target sites within UTRs. These changes hold the potential to alter the expression of known cancer genes or genes linked to cancer-associated pathways.
Collapse
Affiliation(s)
- Radhakrishnan Sabarinathan
- Center for non-coding RNA in Technology and Health, Section for Animal Genetics, Bioinformatics and Breeding, IKVH, University of Copenhagen, Frederiksberg, Denmark
| | - Anne Wenzel
- Center for non-coding RNA in Technology and Health, Section for Animal Genetics, Bioinformatics and Breeding, IKVH, University of Copenhagen, Frederiksberg, Denmark
| | - Peter Novotny
- Bioinformatics Centre, Department of Biology and Biotech Research and Innovation Centre, University of Copenhagen, Copenhagen, Denmark
| | - Xiaojia Tang
- Division of Biostatistics and Bioinformatics, Department of Health Sciences Research, Mayo Clinic, Rochester, Minnesota, United States of America
| | - Krishna R. Kalari
- Division of Biostatistics and Bioinformatics, Department of Health Sciences Research, Mayo Clinic, Rochester, Minnesota, United States of America
- Department of Cancer Biology, Mayo Clinic Comprehensive Cancer Center, Jacksonville, Florida, United States of America
| | - Jan Gorodkin
- Center for non-coding RNA in Technology and Health, Section for Animal Genetics, Bioinformatics and Breeding, IKVH, University of Copenhagen, Frederiksberg, Denmark
- * E-mail:
| |
Collapse
|
31
|
Day L, Abdelhadi Ep Souki O, Albrecht AA, Steinhöfel K. Accessibility of microRNA binding sites in metastable RNA secondary structures in the presence of SNPs. ACTA ACUST UNITED AC 2013; 30:343-52. [PMID: 24292936 DOI: 10.1093/bioinformatics/btt695] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
MOTIVATION We study microRNA (miRNA) bindings to metastable RNA secondary structures close to minimum free energy conformations in the context of single nucleotide polymorphisms (SNPs) and messenger RNA (mRNA) concentration levels, i.e. whether features of miRNA bindings to metastable conformations could provide additional information supporting the differences in expression levels of the two sequences defined by a SNP. In our study, the instances [mRNA/3'UTR; SNP; miRNA] were selected based on strong expression level analyses, SNP locations within binding regions and the computationally feasible identification of metastable conformations. RESULTS We identified 14 basic cases [mRNA; SNP; miRNA] of 3' UTR-lengths ranging from 124 up to 1078 nt reported in recent literature, and we analyzed the number, structure and miRNA binding to metastable conformations within an energy offset above mfe conformations. For each of the 14 instances, the miRNA binding characteristics are determined by the corresponding STarMir output. Among the different parameters we introduced and analyzed, we found that three of them, related to the average depth and average opening energy of metastable conformations, may provide supporting information for a stronger separation between miRNA bindings to the two alleles defined by a given SNP. AVAILABILITY AND IMPLEMENTATION At http://kks.inf.kcl.ac.uk/MSbind.html the MSbind tool is available for calculating features of metastable conformations determined by putative miRNA binding sites.
Collapse
Affiliation(s)
- Luke Day
- Department of Informatics, King's College London, London WC2R 2LS and Middlesex University London, School of Science and Technology, London NW4 4BT, UK
| | | | | | | |
Collapse
|
32
|
Sabarinathan R, Tafer H, Seemann SE, Hofacker IL, Stadler PF, Gorodkin J. RNAsnp: efficient detection of local RNA secondary structure changes induced by SNPs. Hum Mutat 2013; 34:546-56. [PMID: 23315997 PMCID: PMC3708107 DOI: 10.1002/humu.22273] [Citation(s) in RCA: 101] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2012] [Accepted: 12/18/2012] [Indexed: 02/05/2023]
Abstract
Structural characteristics are essential for the functioning of many noncoding RNAs and cis-regulatory elements of mRNAs. SNPs may disrupt these structures, interfere with their molecular function, and hence cause a phenotypic effect. RNA folding algorithms can provide detailed insights into structural effects of SNPs. The global measures employed so far suffer from limited accuracy of folding programs on large RNAs and are computationally too demanding for genome-wide applications. Here, we present a strategy that focuses on the local regions of maximal structural change between mutant and wild-type. These local regions are approximated in a “screening mode” that is intended for genome-wide applications. Furthermore, localized regions are identified as those with maximal discrepancy. The mutation effects are quantified in terms of empirical P values. To this end, the RNAsnp software uses extensive precomputed tables of the distribution of SNP effects as function of length and GC content. RNAsnp thus achieves both a noise reduction and speed-up of several orders of magnitude over shuffling-based approaches. On a data set comprising 501 SNPs associated with human-inherited diseases, we predict 54 to have significant local structural effect in the untranslated region of mRNAs. RNAsnp is available at http://rth.dk/resources/rnasnp.
Collapse
|
33
|
Rogler LE, Kosmyna B, Moskowitz D, Bebawee R, Rahimzadeh J, Kutchko K, Laederach A, Notarangelo LD, Giliani S, Bouhassira E, Frenette P, Roy-Chowdhury J, Rogler CE. Small RNAs derived from lncRNA RNase MRP have gene-silencing activity relevant to human cartilage-hair hypoplasia. Hum Mol Genet 2013; 23:368-82. [PMID: 24009312 DOI: 10.1093/hmg/ddt427] [Citation(s) in RCA: 79] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Post-transcriptional processing of some long non-coding RNAs (lncRNAs) reveals that they are a source of miRNAs. We show that the 268-nt non-coding RNA component of mitochondrial RNA processing endoribonuclease, (RNase MRP), is the source of at least two short (∼20 nt) RNAs designated RMRP-S1 and RMRP-S2, which function as miRNAs. Point mutations in RNase MRP cause human cartilage-hair hypoplasia (CHH), and several disease-causing mutations map to RMRP-S1 and -S2. SHAPE chemical probing identified two alternative secondary structures altered by disease mutations. RMRP-S1 and -S2 are significantly reduced in two fibroblast cell lines and a B-cell line derived from CHH patients. Tests of gene regulatory activity of RMRP-S1 and -S2 identified over 900 genes that were significantly regulated, of which over 75% were down-regulated, and 90% contained target sites with seed complements of RMRP-S1 and -S2 predominantly in their 3' UTRs. Pathway analysis identified regulated genes that function in skeletal development, hair development and hematopoietic cell differentiation including PTCH2 and SOX4 among others, linked to major CHH phenotypes. Also, genes associated with alternative RNA splicing, cell proliferation and differentiation were highly targeted. Therefore, alterations RMRP-S1 and -S2, caused by point mutations in RMRP, are strongly implicated in the molecular mechanism of CHH.
Collapse
|
34
|
Evolutionary evidence for alternative structure in RNA sequence co-variation. PLoS Comput Biol 2013; 9:e1003152. [PMID: 23935473 PMCID: PMC3723493 DOI: 10.1371/journal.pcbi.1003152] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2013] [Accepted: 06/05/2013] [Indexed: 02/06/2023] Open
Abstract
Sequence conservation and co-variation of base pairs are hallmarks of structured RNAs. For certain RNAs (e.g. riboswitches), a single sequence must adopt at least two alternative secondary structures to effectively regulate the message. If alternative secondary structures are important to the function of an RNA, we expect to observe evolutionary co-variation supporting multiple conformations. We set out to characterize the evolutionary co-variation supporting alternative conformations in riboswitches to determine the extent to which alternative secondary structures are conserved. We found strong co-variation support for the terminator, P1, and anti-terminator stems in the purine riboswitch by extending alignments to include terminator sequences. When we performed Boltzmann suboptimal sampling on purine riboswitch sequences with terminators we found that these sequences appear to have evolved to favor specific alternative conformations. We extended our analysis of co-variation to classic alignments of group I/II introns, tRNA, and other classes of riboswitches. In a majority of these RNAs, we found evolutionary evidence for alternative conformations that are compatible with the Boltzmann suboptimal ensemble. Our analyses suggest that alternative conformations are selected for and thus likely play functional roles in even the most structured of RNAs. RNA (Ribonucleic Acid) is a messenger of genetic information, master regulator, and catalyst in the cell. To carry out its function, RNA can fold into complex three-dimensional structures. Certain classes of RNAs, called riboswitches, adopt at least two alternative structures to act as a switch. We set out to detect the evolutionary signal for alternative structures in riboswitches as we hypothesize that these RNA sequences must have evolved to allow both conformations. We find that indeed such signals exist when we compare the sequences of riboswitches from multiple species. When we extend this analysis to other RNA regulators in the cell that are not thought of as switches, we detect equivalent evolutionary support for alternative structures. Viewed through the lens of evolutionary structure conservation RNA sequences appear to have adapted to adopt multiple conformations.
Collapse
|
35
|
Sabarinathan R, Tafer H, Seemann SE, Hofacker IL, Stadler PF, Gorodkin J. The RNAsnp web server: predicting SNP effects on local RNA secondary structure. Nucleic Acids Res 2013; 41:W475-9. [PMID: 23630321 PMCID: PMC3977658 DOI: 10.1093/nar/gkt291] [Citation(s) in RCA: 109] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The function of many non-coding RNA genes and cis-regulatory elements of messenger RNA largely depends on the structure, which is in turn determined by their sequence. Single nucleotide polymorphisms (SNPs) and other mutations may disrupt the RNA structure, interfere with the molecular function and hence cause a phenotypic effect. RNAsnp is an efficient method to predict the effect of SNPs on local RNA secondary structure based on the RNA folding algorithms implemented in the Vienna RNA package. The SNP effects are quantified in terms of empirical P-values, which, for computational efficiency, are derived from extensive pre-computed tables of distributions of substitution effects as a function of gene length and GC content. Here, we present a web service that not only provides an interface for RNAsnp but also features a graphical output representation. In addition, the web server is connected to a local mirror of the UCSC genome browser database that enables the users to select the genomic sequences for analysis and visualize the results directly in the UCSC genome browser. The RNAsnp web server is freely available at: http://rth.dk/resources/rnasnp/.
Collapse
Affiliation(s)
- Radhakrishnan Sabarinathan
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark
| | | | | | | | | | | |
Collapse
|
36
|
Salari R, Kimchi-Sarfaty C, Gottesman MM, Przytycka TM. Sensitive measurement of single-nucleotide polymorphism-induced changes of RNA conformation: application to disease studies. Nucleic Acids Res 2012; 41:44-53. [PMID: 23125360 PMCID: PMC3592397 DOI: 10.1093/nar/gks1009] [Citation(s) in RCA: 69] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
Single-nucleotide polymorphisms (SNPs) are often linked to critical phenotypes such as diseases or responses to vaccines, medications and environmental factors. However, the specific molecular mechanisms by which a causal SNP acts is usually not obvious. Changes in RNA secondary structure emerge as a possible explanation necessitating the development of methods to measure the impact of single-nucleotide variation on RNA structure. Despite the recognition of the importance of considering the changes in Boltzmann ensemble of RNA conformers in this context, a formal method to perform directly such comparison was lacking. Here, we solved this problem and designed an efficient method to compute the relative entropy between the Boltzmann ensembles of the native and a mutant structure. On the basis of this theoretical progress, we developed a software tool, remuRNA, and investigated examples of its application. Comparing the impact of common SNPs naturally occurring in populations with the impact of random point mutations, we found that structural changes introduced by common SNPs are smaller than those introduced by random point mutations. This suggests a natural selection against mutations that significantly change RNA structure and demonstrates, surprisingly, that randomly inserted point mutations provide inadequate estimation of random mutations effects. Subsequently, we applied remuRNA to determine which of the disease-associated non-coding SNPs are potentially related to RNA structural changes.
Collapse
Affiliation(s)
- Raheleh Salari
- National Center for Biotechnology Information, National Library of Medicine, NIH, Bethesda, MD 20894, USA
| | | | | | | |
Collapse
|
37
|
Basu M, Das T, Ghosh A, Majumder S, Maji AK, Kanjilal SD, Mukhopadhyay I, Roychowdhury S, Banerjee S, Sengupta S. Gene-gene interaction and functional impact of polymorphisms on innate immune genes in controlling Plasmodium falciparum blood infection level. PLoS One 2012; 7:e46441. [PMID: 23071570 PMCID: PMC3470565 DOI: 10.1371/journal.pone.0046441] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2012] [Accepted: 08/30/2012] [Indexed: 12/19/2022] Open
Abstract
Genetic variations in toll-like receptors and cytokine genes of the innate immune pathways have been implicated in controlling parasite growth and the pathogenesis of Plasmodium falciparum mediated malaria. We previously published genetic association of TLR4 non-synonymous and TNF-α promoter polymorphisms with P.falciparum blood infection level and here we extend the study considerably by (i) investigating genetic dependence of parasite-load on interleukin-12B polymorphisms, (ii) reconstructing gene-gene interactions among candidate TLRs and cytokine loci, (iii) exploring genetic and functional impact of epistatic models and (iv) providing mechanistic insights into functionality of disease-associated regulatory polymorphisms. Our data revealed that carriage of AA (P = 0.0001) and AC (P = 0.01) genotypes of IL12B 3′UTR polymorphism was associated with a significant increase of mean log-parasitemia relative to rare homozygous genotype CC. Presence of IL12B+1188 polymorphism in five of six multifactor models reinforced its strong genetic impact on malaria phenotype. Elevation of genetic risk in two-component models compared to the corresponding single locus and reduction of IL12B (2.2 fold) and lymphotoxin-α (1.7 fold) expressions in patients'peripheral-blood-mononuclear-cells under TLR4Thr399Ile risk genotype background substantiated the role of Multifactor Dimensionality Reduction derived models. Marked reduction of promoter activity of TNF-α risk haplotype (C-C-G-G) compared to wild-type haplotype (T-C-G-G) with (84%) and without (78%) LPS stimulation and the loss of binding of transcription factors detected in-silico supported a causal role of TNF-1031. Significantly lower expression of IL12B+1188 AA (5 fold) and AC (9 fold) genotypes compared to CC and under-representation (P = 0.0048) of allele A in transcripts of patients' PBMCs suggested an Allele-Expression-Imbalance. Allele (A+1188C) dependent differential stability (2 fold) of IL12B-transcripts upon actinomycin-D treatment and observed structural modulation (P = 0.013) of RNA-ensemble were the plausible explanations for AEI. In conclusion, our data provides functional support to the hypothesis that de-regulated receptor-cytokine axis of innate immune pathway influences blood infection level in P. falciparum malaria.
Collapse
Affiliation(s)
- Madhumita Basu
- Department of Biochemistry, University of Calcutta, Kolkata, West Bengal, India
| | - Tania Das
- Cancer & Cell Biology Division, Indian Institute of Chemical Biology, Kolkata, West Bengal, India
| | - Alip Ghosh
- Centre for Liver Research, The Institute of Post-Graduate Medical Education & Research, Kolkata, West Bengal, India
| | - Subhadipa Majumder
- Department of Biochemistry, University of Calcutta, Kolkata, West Bengal, India
| | - Ardhendu Kumar Maji
- Department of Protozoology, The Calcutta School of Tropical Medicine, Kolkata, West Bengal, India
| | - Sumana Datta Kanjilal
- Department of Pediatric Medicine, Calcutta National Medical College, Kolkata, West Bengal, India
| | | | - Susanta Roychowdhury
- Cancer & Cell Biology Division, Indian Institute of Chemical Biology, Kolkata, West Bengal, India
| | - Soma Banerjee
- Centre for Liver Research, The Institute of Post-Graduate Medical Education & Research, Kolkata, West Bengal, India
| | - Sanghamitra Sengupta
- Department of Biochemistry, University of Calcutta, Kolkata, West Bengal, India
- * E-mail:
| |
Collapse
|