1
Bose R, Saleem I, Mustoe AM. Causes, functions, and therapeutic possibilities of RNA secondary structure ensembles and alternative states. Cell Chem Biol 2024; 31:17-35. [PMID: 38199037 PMCID: PMC10842484 DOI: 10.1016/j.chembiol.2023.12.010] [Received: 09/11/2023] [Revised: 11/21/2023] [Accepted: 12/12/2023] [Indexed: 01/12/2024]
Abstract
RNA secondary structure plays essential roles in encoding RNA regulatory fate and function. Most RNAs populate ensembles of alternatively paired states and are continually unfolded and refolded by cellular processes. Measuring these structural ensembles and their contributions to cellular function has traditionally posed major challenges, but new methods and conceptual frameworks are beginning to fill this void. In this review, we provide a mechanism- and function-centric compendium of the roles of RNA secondary structural ensembles and minority states in regulating the RNA life cycle, from transcription to degradation. We further explore how dysregulation of RNA structural ensembles contributes to human disease and discuss the potential of drugging alternative RNA states to therapeutically modulate RNA activity. The emerging paradigm of RNA structural ensembles as central to RNA function provides a foundation for a deeper understanding of RNA biology and new therapeutic possibilities.
Affiliation(s)
- Ritwika Bose
  - Therapeutic Innovation Center (THINC), Department of Biochemistry and Molecular Pharmacology, Baylor College of Medicine, Houston, TX, USA
- Irfana Saleem
  - Therapeutic Innovation Center (THINC), Department of Biochemistry and Molecular Pharmacology, Baylor College of Medicine, Houston, TX, USA
- Anthony M Mustoe
  - Therapeutic Innovation Center (THINC), Department of Biochemistry and Molecular Pharmacology, Baylor College of Medicine, Houston, TX, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
2
Szyjka CE, Strobel EJ. Observation of coordinated RNA folding events by systematic cotranscriptional RNA structure probing. Nat Commun 2023; 14:7839. [PMID: 38030633 PMCID: PMC10687018 DOI: 10.1038/s41467-023-43395-9] [Received: 03/20/2023] [Accepted: 11/08/2023] [Indexed: 12/01/2023]
Abstract
RNA begins to fold as it is transcribed by an RNA polymerase. Consequently, RNA folding is constrained by the direction and rate of transcription. Understanding how RNA folds into secondary and tertiary structures therefore requires methods for determining the structure of cotranscriptional folding intermediates. Cotranscriptional RNA chemical probing methods accomplish this by systematically probing the structure of nascent RNA that is displayed from an RNA polymerase. Here, we describe a concise, high-resolution cotranscriptional RNA chemical probing procedure called variable length Transcription Elongation Complex RNA structure probing (TECprobe-VL). We demonstrate the accuracy and resolution of TECprobe-VL by replicating and extending previous analyses of ZTP and fluoride riboswitch folding and mapping the folding pathway of a ppGpp-sensing riboswitch. In each system, we show that TECprobe-VL identifies coordinated cotranscriptional folding events that mediate transcription antitermination. Our findings establish TECprobe-VL as an accessible method for mapping cotranscriptional RNA folding pathways.
Affiliation(s)
- Courtney E Szyjka
  - Department of Biological Sciences, The University at Buffalo, Buffalo, NY, 14260, USA
- Eric J Strobel
  - Department of Biological Sciences, The University at Buffalo, Buffalo, NY, 14260, USA
3
Henderson AN, McDonnell RT, Elcock AH. Modeling the 3D structure and conformational dynamics of very large RNAs using coarse-grained molecular simulations. bioRxiv 2023:2023.06.06.543892. [PMID: 37333149 PMCID: PMC10274748 DOI: 10.1101/2023.06.06.543892] [Indexed: 06/20/2023]
Abstract
We describe a computational approach to building and simulating realistic 3D models of very large RNA molecules (>1000 nucleotides) at a resolution of one "bead" per nucleotide. The method starts with a predicted secondary structure and uses several stages of energy minimization and Brownian dynamics (BD) simulation to build 3D models. A key step in the protocol is the temporary addition of a 4th spatial dimension that allows all predicted helical elements to become disentangled from each other in an effectively automated way. We then use the resulting 3D models as input to Brownian dynamics simulations that include hydrodynamic interactions (HIs), which allow the diffusive properties of the RNA to be modelled and its conformational dynamics to be simulated. To validate the dynamics part of the method, we first show that when applied to small RNAs with known 3D structures the BD-HI simulation models accurately reproduce their experimental hydrodynamic radii (Rh). We then apply the modelling and simulation protocol to a variety of RNAs for which experimental Rh values have been reported, ranging in size from 85 to 3569 nucleotides. We show that the 3D models, when used in BD-HI simulations, produce hydrodynamic radii that are usually in good agreement with experimental estimates for RNAs that do not contain tertiary contacts that persist even under very low salt conditions. Finally, we show that sampling of the conformational dynamics of large RNAs on timescales of 100 µs is computationally feasible with BD-HI simulations.
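As a point of reference for the hydrodynamic radii discussed in this abstract, the sketch below converts a translational diffusion coefficient into a hydrodynamic radius using the standard Stokes-Einstein relation, Rh = kB·T / (6·π·η·D). The abstract does not say how the authors extract Rh from their BD-HI trajectories, so this is an illustration only; the function names, the default temperature, and the water viscosity value are assumptions made for the example.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant in J/K (exact SI value)


def hydrodynamic_radius(diffusion_m2_s, temperature_k=298.15, viscosity_pa_s=8.9e-4):
    """Stokes-Einstein radius in metres from a diffusion coefficient in m^2/s.

    The defaults (25 degrees C, approximate viscosity of water) are
    assumptions for illustration, not values taken from the abstract.
    """
    if diffusion_m2_s <= 0:
        raise ValueError("diffusion coefficient must be positive")
    return K_B * temperature_k / (6.0 * math.pi * viscosity_pa_s * diffusion_m2_s)


def diffusion_coefficient(radius_m, temperature_k=298.15, viscosity_pa_s=8.9e-4):
    """Inverse relation: diffusion coefficient in m^2/s from a radius in metres."""
    if radius_m <= 0:
        raise ValueError("radius must be positive")
    return K_B * temperature_k / (6.0 * math.pi * viscosity_pa_s * radius_m)
```

The two functions are inverses of each other, so converting a radius to a diffusion coefficient and back recovers the original radius under the same temperature and viscosity.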
4
Dingle K, Novev JK, Ahnert SE, Louis AA. Predicting phenotype transition probabilities via conditional algorithmic probability approximations. J R Soc Interface 2022; 19:20220694. [PMID: 36514888 PMCID: PMC9748496 DOI: 10.1098/rsif.2022.0694] [Received: 09/21/2022] [Accepted: 11/18/2022] [Indexed: 12/15/2022]
Abstract
Unravelling the structure of genotype-phenotype (GP) maps is an important problem in biology. Recently, arguments inspired by algorithmic information theory (AIT) and Kolmogorov complexity have been invoked to uncover simplicity bias in GP maps, an exponentially decaying upper bound on phenotype probability with increasing phenotype descriptional complexity. This means that phenotypes with many genotypes assigned via the GP map must be simple, while complex phenotypes must have few genotypes assigned. Here, we use similar arguments to bound the probability P(x → y) that phenotype x, upon random genetic mutation, transitions to phenotype y. The bound is [Formula: see text], where [Formula: see text] is the estimated conditional complexity of y given x, quantifying how much extra information is required to make y given access to x. This upper bound is related to the conditional form of algorithmic probability from AIT. We demonstrate the practical applicability of our derived bound by predicting phenotype transition probabilities (and other related quantities) in simulations of RNA and protein secondary structures. Our work contributes to a general mathematical understanding of GP maps and may facilitate the prediction of transition probabilities directly from examining the phenotypes themselves, without utilizing detailed knowledge of the GP map.
Affiliation(s)
- Kamaludin Dingle
  - Department of Chemical Engineering and Biotechnology, Cambridge University, Cambridge CB2 1TN, UK
  - Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA 91125, USA
  - Department of Mathematics and Natural Sciences, Centre for Applied Mathematics and Bioinformatics (CAMB), Gulf University for Science and Technology, 32093, Kuwait
- Javor K. Novev
  - Department of Chemical Engineering and Biotechnology, Cambridge University, Cambridge CB2 1TN, UK
- Sebastian E. Ahnert
  - Department of Chemical Engineering and Biotechnology, Cambridge University, Cambridge CB2 1TN, UK
- Ard A. Louis
  - Department of Physics, Rudolf Peierls Centre for Theoretical Physics, Oxford University, Oxford OX1 2JD, UK
5
Ross CJ, Ulitsky I. Discovering functional motifs in long noncoding RNAs. Wiley Interdiscip Rev RNA 2022; 13:e1708. [PMID: 34981665 DOI: 10.1002/wrna.1708] [Received: 09/01/2021] [Revised: 11/19/2021] [Accepted: 12/04/2021] [Indexed: 12/27/2022]
Abstract
Long noncoding RNAs (lncRNAs) are products of pervasive transcription that closely resemble messenger RNAs on the molecular level, yet function through largely unknown modes of action. The current model is that the function of lncRNAs often relies on specific, typically short, conserved elements, connected by linkers in which specific sequences and/or structures are less important. This notion has fueled the development of both computational and experimental methods focused on the discovery of functional elements within lncRNA genes, based on diverse signals such as evolutionary conservation, predicted structural elements, or the ability to rescue loss-of-function phenotypes. In this review, we outline the main challenges that the different methods need to overcome, describe the recently developed approaches, and discuss their respective limitations. This article is categorized under: RNA Evolution and Genomics > Computational Analyses of RNA; RNA Interactions with Proteins and Other Molecules > Protein-RNA Interactions: Functional Implications; Regulatory RNAs/RNAi/Riboswitches > Regulatory RNAs.
Affiliation(s)
- Caroline Jane Ross
  - Biological Regulation and Molecular Neuroscience, Weizmann Institute of Science, Rehovot, Israel
- Igor Ulitsky
  - Biological Regulation and Molecular Neuroscience, Weizmann Institute of Science, Rehovot, Israel
6
Abstract
Recent events have pushed RNA research into the spotlight. Continued discoveries of RNA with unexpected diverse functions in healthy and diseased cells, such as the role of RNA as both the source and countermeasure to a severe acute respiratory syndrome coronavirus 2 infection, are igniting a new passion for understanding this functionally and structurally versatile molecule. Although RNA structure is key to function, many foundational characteristics of RNA structure are misunderstood, and the default state of RNA is often thought of and depicted as a single floppy strand. The purpose of this perspective is to help adjust mental models, equipping the community to better use the fundamental aspects of RNA structural information in new mechanistic models, enhance experimental design to test these models, and refine data interpretation. We discuss six core observations focused on the inherent nature of RNA structure and how to incorporate these characteristics to better understand RNA structure. We also offer some ideas for future efforts to make validated RNA structural information available and readily used by all researchers.
Affiliation(s)
- Quentin Vicens
  - Department of Biochemistry and Molecular Genetics, University of Colorado Anschutz Medical Campus, School of Medicine, Aurora, CO 80045
  - RNA BioScience Initiative, University of Colorado Denver School of Medicine, Aurora, CO 80045
- Jeffrey S. Kieft
  - Department of Biochemistry and Molecular Genetics, University of Colorado Anschutz Medical Campus, School of Medicine, Aurora, CO 80045
  - RNA BioScience Initiative, University of Colorado Denver School of Medicine, Aurora, CO 80045
7
Rodgers ML, Woodson SA. A roadmap for rRNA folding and assembly during transcription. Trends Biochem Sci 2021; 46:889-901. [PMID: 34176739 PMCID: PMC8526401 DOI: 10.1016/j.tibs.2021.05.009] [Received: 03/04/2021] [Revised: 05/14/2021] [Accepted: 05/27/2021] [Indexed: 01/11/2023]
Abstract
Ribonucleoprotein (RNP) assembly typically begins during transcription when folding of the newly synthesized RNA is coupled with the recruitment of RNA-binding proteins (RBPs). Upon binding, the proteins induce structural rearrangements in the RNA that are crucial for the next steps of assembly. Focusing primarily on bacterial ribosome assembly, we discuss recent work showing that early RNA-protein interactions are more dynamic than previously supposed, and remain so, until sufficient proteins are recruited to each transcript to consolidate an entire domain of the RNP. We also review studies showing that stable assembly of an RNP competes against modification and processing of the RNA. Finally, we discuss how transcription sets the timeline for competing and cooperative RNA-RBP interactions that determine the fate of the nascent RNA. How this dance is coordinated is the focus of this review.
Affiliation(s)
- Margaret L Rodgers
  - T.C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, MD, 21218, USA
- Sarah A Woodson
  - T.C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, MD, 21218, USA
8
Manrubia S, Cuesta JA, Aguirre J, Ahnert SE, Altenberg L, Cano AV, Catalán P, Diaz-Uriarte R, Elena SF, García-Martín JA, Hogeweg P, Khatri BS, Krug J, Louis AA, Martin NS, Payne JL, Tarnowski MJ, Weiß M. From genotypes to organisms: State-of-the-art and perspectives of a cornerstone in evolutionary dynamics. Phys Life Rev 2021; 38:55-106. [PMID: 34088608 DOI: 10.1016/j.plrev.2021.03.004] [Received: 11/24/2020] [Accepted: 03/01/2021] [Indexed: 12/21/2022]
Abstract
Understanding how genotypes map onto phenotypes, fitness, and eventually organisms is arguably the next major missing piece in a fully predictive theory of evolution. We refer to this generally as the problem of the genotype-phenotype map. Though we are still far from achieving a complete picture of these relationships, our current understanding of simpler questions, such as the structure induced in the space of genotypes by sequences mapped to molecular structures, has revealed important facts that deeply affect the dynamical description of evolutionary processes. Empirical evidence supporting the fundamental relevance of features such as phenotypic bias is mounting as well, while the synthesis of conceptual and experimental progress leads to questioning current assumptions on the nature of evolutionary dynamics, with cancer progression models and synthetic biology approaches being notable examples. This work delves with a critical and constructive attitude into our current knowledge of how genotypes map onto molecular phenotypes and organismal functions, and discusses theoretical and empirical avenues to broaden and improve this comprehension. As a final goal, this community should aim at deriving an updated picture of evolutionary processes soundly relying on the structural properties of genotype spaces, as revealed by modern techniques of molecular and functional analysis.
Affiliation(s)
- Susanna Manrubia
  - Department of Systems Biology, Centro Nacional de Biotecnología (CSIC), Madrid, Spain; Grupo Interdisciplinar de Sistemas Complejos (GISC), Madrid, Spain
- José A Cuesta
  - Grupo Interdisciplinar de Sistemas Complejos (GISC), Madrid, Spain; Departamento de Matemáticas, Universidad Carlos III de Madrid, Leganés, Spain; Instituto de Biocomputación y Física de Sistemas Complejos (BiFi), Universidad de Zaragoza, Spain; UC3M-Santander Big Data Institute (IBiDat), Getafe, Madrid, Spain
- Jacobo Aguirre
  - Grupo Interdisciplinar de Sistemas Complejos (GISC), Madrid, Spain; Centro de Astrobiología, CSIC-INTA, ctra. de Ajalvir km 4, 28850 Torrejón de Ardoz, Madrid, Spain
- Sebastian E Ahnert
  - Department of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge CB3 0AS, UK; The Alan Turing Institute, British Library, 96 Euston Road, London NW1 2DB, UK
- Alejandro V Cano
  - Institute of Integrative Biology, ETH Zurich, Zurich, Switzerland; Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Pablo Catalán
  - Grupo Interdisciplinar de Sistemas Complejos (GISC), Madrid, Spain; Departamento de Matemáticas, Universidad Carlos III de Madrid, Leganés, Spain
- Ramon Diaz-Uriarte
  - Department of Biochemistry, Universidad Autónoma de Madrid, Madrid, Spain; Instituto de Investigaciones Biomédicas "Alberto Sols" (UAM-CSIC), Madrid, Spain
- Santiago F Elena
  - Instituto de Biología Integrativa de Sistemas, I(2)SysBio (CSIC-UV), València, Spain; The Santa Fe Institute, Santa Fe, NM, USA
- Paulien Hogeweg
  - Theoretical Biology and Bioinformatics Group, Utrecht University, the Netherlands
- Bhavin S Khatri
  - The Francis Crick Institute, London, UK; Department of Life Sciences, Imperial College London, London, UK
- Joachim Krug
  - Institute for Biological Physics, University of Cologne, Köln, Germany
- Ard A Louis
  - Rudolf Peierls Centre for Theoretical Physics, University of Oxford, Oxford, UK
- Nora S Martin
  - Theory of Condensed Matter Group, Cavendish Laboratory, University of Cambridge, Cambridge, UK; Sainsbury Laboratory, University of Cambridge, Cambridge, UK
- Joshua L Payne
  - Institute of Integrative Biology, ETH Zurich, Zurich, Switzerland; Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Marcel Weiß
  - Theory of Condensed Matter Group, Cavendish Laboratory, University of Cambridge, Cambridge, UK; Sainsbury Laboratory, University of Cambridge, Cambridge, UK
9
Chizzolini F, Passalacqua LFM, Oumais M, Dingilian AI, Szostak JW, Lupták A. Large Phenotypic Enhancement of Structured Random RNA Pools. J Am Chem Soc 2020; 142:1941-1951. [PMID: 31887027 DOI: 10.1021/jacs.9b11396] [Indexed: 12/16/2022]
Abstract
Laboratory evolution of functional RNAs has applications in many areas of chemical and synthetic biology. In vitro selections critically depend on the presence of functional molecules, such as aptamers and ribozymes, in the starting sequence pools. For selection of novel functions the pools are typically transcribed from random-sequence DNA templates, yielding a highly diverse set of RNAs that contain a multitude of folds and biochemical activities. The phenotypic potential, the frequency of functional RNAs, is very low, requiring large complexity of starting pools, surpassing 10^15 different sequences, to identify highly active isolates. Furthermore, the majority of random sequences is not structured and has a high propensity for aggregation; the in vitro selection process thus involves not just enrichment of functional RNAs, but also their purification from aggregation-prone "free-riders". We reasoned that purification of the nonaggregating, monomeric subpopulation of a random-sequence RNA pool will yield pools of folded, functional RNAs. We performed six rounds of selection for monomeric sequences and show that the enriched population is compactly folded. In vitro selections originating from various mixtures of the compact pool and a fully random pool showed that sequences from the compact pool always dominate the population once a biochemical activity is detectable. A head-to-head competition of the two pools starting from a low (5 × 10^12) sequence diversity revealed that the phenotypic potential of the compact pool is about 1000 times higher than that of the fully random pool. A selection for folded and monomeric RNA pools thus greatly increases the frequency of functional RNAs from that seen in random-sequence pools, providing a facile experimental approach to isolation of highly active functional RNAs from low-diversity populations.
Affiliation(s)
- Fabio Chizzolini
  - Department of Pharmaceutical Sciences, University of California at Irvine, Irvine, California 92697, United States
- Luiz F M Passalacqua
  - Department of Pharmaceutical Sciences, University of California at Irvine, Irvine, California 92697, United States
- Mona Oumais
  - Department of Chemistry, University of California at Irvine, Irvine, California 92697, United States
- Armine I Dingilian
  - Department of Pharmaceutical Sciences, University of California at Irvine, Irvine, California 92697, United States
- Jack W Szostak
  - Howard Hughes Medical Institute, Department of Molecular Biology, and Center for Computational and Integrative Biology, Massachusetts General Hospital, Boston, Massachusetts 02114, United States
  - Department of Chemistry and Chemical Biology, Harvard University, 12 Oxford Street, Cambridge, Massachusetts 02138, United States
- Andrej Lupták
  - Department of Pharmaceutical Sciences, University of California at Irvine, Irvine, California 92697, United States
  - Department of Chemistry, University of California at Irvine, Irvine, California 92697, United States
  - Department of Molecular Biology and Biochemistry, University of California at Irvine, Irvine, California 92697, United States
10
Oliver CG, Reinharz V, Waldispühl J. On the emergence of structural complexity in RNA replicators. RNA 2019; 25:1579-1591. [PMID: 31467146 PMCID: PMC6859851 DOI: 10.1261/rna.070391.119] [Received: 01/15/2019] [Accepted: 08/19/2019] [Indexed: 06/10/2023]
Abstract
The RNA world hypothesis relies on the ability of ribonucleic acids to spontaneously acquire complex structures capable of supporting essential biological functions. Multiple sophisticated evolutionary models have been proposed for their emergence, but they often assume specific conditions. In this work, we explore a simple and parsimonious scenario describing the emergence of complex molecular structures at the early stages of life. We show that at specific GC content regimes, an undirected replication model is sufficient to explain the appearance of multibranched RNA secondary structures, a structural signature of many essential ribozymes. We ran a large-scale computational study to map energetically stable structures on complete mutational networks of 50-nt-long RNA sequences. Our results reveal that the sequence landscape with stable structures is enriched with multibranched structures at a length scale coinciding with the appearance of complex structures in RNA databases. A random replication mechanism preserving a 50% GC content may suffice to explain a natural enrichment of stable complex structures in populations of functional RNAs. In contrast, an evolutionary mechanism eliciting the most stable folds at each generation appears to help reach multibranched structures at the highest GC content.
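The "complete mutational networks" mentioned in this abstract connect sequences that differ by a single point mutation. A minimal sketch of the two basic quantities involved, the GC content of a sequence and its one-mutation neighbourhood, is given here. The function names and the example sequence are hypothetical, and the authors' actual pipeline (including structure prediction and stability filtering) is not reproduced.

```python
ALPHABET = "ACGU"  # RNA nucleotides


def gc_content(seq):
    """Fraction of G and C nucleotides in a non-empty RNA sequence."""
    if not seq:
        raise ValueError("sequence must be non-empty")
    return sum(base in "GC" for base in seq) / len(seq)


def point_mutants(seq):
    """All sequences that differ from seq at exactly one position."""
    return [
        seq[:i] + new_base + seq[i + 1:]
        for i, old_base in enumerate(seq)
        for new_base in ALPHABET
        if new_base != old_base
    ]


# Hypothetical 50-nt sequence, used only to exercise the functions.
example = ("GCAU" * 13)[:50]
neighbours = point_mutants(example)
```

A sequence of length L over a four-letter alphabet has 3L one-mutation neighbours, so each 50-nt sequence is connected to 150 others in such a network.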
Affiliation(s)
- Carlos G Oliver
  - School of Computer Science, McGill University, Montreal, QC H3A 2B3, Canada
- Vladimir Reinharz
  - Center for Soft and Living Matter, Institute for Basic Science, Ulsan 34126, South Korea
- Jérôme Waldispühl
  - School of Computer Science, McGill University, Montreal, QC H3A 2B3, Canada
11
Kirsch R, Seemann SE, Ruzzo WL, Cohen SM, Stadler PF, Gorodkin J. Identification and characterization of novel conserved RNA structures in Drosophila. BMC Genomics 2018; 19:899. [PMID: 30537930 PMCID: PMC6288889 DOI: 10.1186/s12864-018-5234-4] [Received: 03/22/2018] [Accepted: 11/08/2018] [Indexed: 12/20/2022]
Abstract
BACKGROUND Comparative genomics approaches have facilitated the discovery of many novel non-coding and structured RNAs (ncRNAs). The increasing availability of related genomes now makes it possible to systematically search for compensatory base changes, and thus for conserved secondary structures, even in genomic regions that are poorly alignable in the primary sequence. The wealth of available transcriptome data can add valuable insight into expression and possible function for new ncRNA candidates. Earlier work identifying ncRNAs in Drosophila melanogaster made use of sequence-based alignments and employed a sliding window approach, inevitably biasing identification toward RNAs encoded in the more conserved parts of the genome.
RESULTS To search for conserved RNA structures (CRSs) that may not be highly conserved in sequence and to assess the expression of CRSs, we conducted a genome-wide structural alignment screen of 27 insect genomes including D. melanogaster and integrated this with an extensive set of tiling array data. The structural alignment screen revealed ∼30,000 novel candidate CRSs at an estimated false discovery rate of less than 10%. With more than one quarter of all individual CRS motifs showing sequence identities below 60%, the predicted CRSs largely complement the findings of sliding window approaches applied previously. While a sixth of the CRSs were ubiquitously expressed, we found that most were expressed in specific developmental stages or cell lines. Notably, the most statistically significant enrichment of CRSs was observed in pupae, mainly in exons of untranslated regions, promoters, enhancers, and long ncRNAs. Interestingly, cell lines were found to express a different set of CRSs than were found in vivo. Only a small fraction of intergenic CRSs were co-expressed with the adjacent protein coding genes, which suggests that most intergenic CRSs are independent genetic units.
CONCLUSIONS This study provides a more comprehensive view of the ncRNA transcriptome in fly as well as evidence for differential expression of CRSs during development and in cell lines.
Affiliation(s)
- Rebecca Kirsch
  - Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, Frederiksberg C, DK-1870 Denmark
  - Department of Veterinary and Animal Science, University of Copenhagen, Grønnegårdsvej 3, Frederiksberg C, DK-1870 Denmark
  - Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, Universität Leipzig, Härtelstraße 16–18, Leipzig, D-04107 Germany
- Stefan E. Seemann
  - Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, Frederiksberg C, DK-1870 Denmark
  - Department of Veterinary and Animal Science, University of Copenhagen, Grønnegårdsvej 3, Frederiksberg C, DK-1870 Denmark
- Walter L. Ruzzo
  - Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, Frederiksberg C, DK-1870 Denmark
  - School of Computer Science and Engineering, University of Washington, Box 352350, Seattle, 98195-2350 WA USA
  - Department of Genome Sciences, University of Washington, Box 355065, Seattle, 98195-5065 WA USA
  - Fred Hutchinson Cancer Research Center, 1100 Fairview Ave. N., Seattle, 98109-1024 WA USA
- Stephen M. Cohen
  - Department of Cellular and Molecular Medicine, University of Copenhagen, Blegdamsvej 3, Copenhagen N, DK-2200 Denmark
- Peter F. Stadler
  - Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, Frederiksberg C, DK-1870 Denmark
  - Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, Universität Leipzig, Härtelstraße 16–18, Leipzig, D-04107 Germany
  - Max Planck Institute for Mathematics in the Sciences, Inselstraße 22, Leipzig, D-04103 Germany
  - Facultad de Ciencias, Universidad Nacional de Colombia, Sede Bogotá, Ciudad Universitaria, Bogotá, COL-111321 D.C. Colombia
  - Department of Theoretical Chemistry, University of Vienna, Währinger Straße 17, Vienna, A-1090 Austria
  - Santa Fe Institute, 1399 Hyde Park Rd., Santa Fe, NM 87501 USA
- Jan Gorodkin
  - Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, Frederiksberg C, DK-1870 Denmark
  - Department of Veterinary and Animal Science, University of Copenhagen, Grønnegårdsvej 3, Frederiksberg C, DK-1870 Denmark
12
Abstract
3'-untranslated regions (3'-UTRs) are the noncoding parts of mRNAs. Compared to yeast, in humans, median 3'-UTR length has expanded approximately tenfold alongside an increased generation of alternative 3'-UTR isoforms. In contrast, the number of coding genes, as well as coding region length, has remained similar. This suggests an important role for 3'-UTRs in the biology of higher organisms. 3'-UTRs are best known to regulate diverse fates of mRNAs, including degradation, translation, and localization, but they can also function like long noncoding or small RNAs, as has been shown for whole 3'-UTRs as well as for cleaved fragments. Furthermore, 3'-UTRs determine the fate of proteins through the regulation of protein-protein interactions. They facilitate cotranslational protein complex formation, which establishes a role for 3'-UTRs as evolved eukaryotic operons. Whereas bacterial operons promote the interaction of two subunits, 3'-UTRs enable the formation of protein complexes with diverse compositions. All of these 3'-UTR functions are accomplished by effector proteins that are recruited by RNA-binding proteins that bind to 3'-UTR cis-elements. In summary, 3'-UTRs seem to be major players in gene regulation that enable local functions, compartmentalization, and cooperativity, which makes them important tools for the regulation of phenotypic diversity of higher organisms.
Affiliation(s)
- Christine Mayr
  - Department of Cancer Biology and Genetics, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA
13
Ochieng PO, White NA, Feig M, Hoogstraten CG. Intrinsic Base-Pair Rearrangement in the Hairpin Ribozyme Directs RNA Conformational Sampling and Tertiary Interface Formation. J Phys Chem B 2016; 120:10885-10898. [PMID: 27701852 DOI: 10.1021/acs.jpcb.6b05606] [Indexed: 01/02/2023]
Abstract
Dynamic fluctuations in RNA structure enable conformational changes that are required for catalysis and recognition. In the hairpin ribozyme, the catalytically active structure is formed as an intricate tertiary interface between two RNA internal loops. Substantial alterations in the structure of each loop are observed upon interface formation, or docking. The very slow on-rate for this relatively tight interaction has led us to hypothesize a double conformational capture mechanism for RNA-RNA recognition. We used extensive molecular dynamics simulations to assess conformational sampling in the undocked form of the loop domain containing the scissile phosphate (loop A). We observed several major accessible conformations with distinctive patterns of hydrogen bonding and base stacking interactions in the active-site internal loop. Several important conformational features characteristic of the docked state were observed in well-populated substates, consistent with the kinetic sampling of docking-competent states by isolated loop A. Our observations suggest a hybrid or multistage binding mechanism, in which initial conformational selection of a docking-competent state is followed by induced-fit adjustment to an in-line, chemically reactive state only after formation of the initial complex with loop B.
Affiliation(s)
- Patrick O Ochieng
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan 48824, United States
- Neil A White
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan 48824, United States
- Michael Feig
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan 48824, United States
- Charles G Hoogstraten
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan 48824, United States
14
Bouckenheimer J, Assou S, Riquier S, Hou C, Philippe N, Sansac C, Lavabre-Bertrand T, Commes T, Lemaître JM, Boureux A, De Vos J. Long non-coding RNAs in human early embryonic development and their potential in ART. Hum Reprod Update 2016; 23:19-40. [PMID: 27655590 DOI: 10.1093/humupd/dmw035] [Citation(s) in RCA: 77] [Impact Index Per Article: 9.6] [Received: 04/30/2016] [Revised: 07/20/2016] [Accepted: 08/23/2016] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND: Human long non-coding RNAs (lncRNAs) are an emerging category of transcripts with increasingly documented functional roles during development. LncRNAs and their roles during human early embryo development have recently begun to be unravelled. OBJECTIVE AND RATIONALE: This review summarizes the most recent knowledge on lncRNAs and focuses on their expression patterns and role during early human embryo development and in pluripotent stem cells (PSCs). Public mRNA sequencing (mRNA-seq) data were used to illustrate these expression signatures. SEARCH METHODS: The PubMed and EMBASE databases were first interrogated using specific terms, such as 'lncRNAs', to get an extensive overview on lncRNAs up to February 2016, and then using 'human lncRNAs' and 'embryo', 'development', or 'PSCs' to focus on lncRNAs involved in human embryo development or in PSC. Recently published RNA-seq data from human oocytes and pre-implantation embryos (including single-cell data), PSC and a panel of normal and malignant adult tissues were used to describe the specific expression patterns of some lncRNAs in early human embryos. OUTCOMES: The existence and the crucial role of lncRNAs in many important biological phenomena in each branch of the life tree are now well documented. The number of identified lncRNAs is rapidly increasing and has already outnumbered that of protein-coding genes. Unlike small non-coding RNAs, a variety of mechanisms of action have been proposed for lncRNAs. The functional role of lncRNAs has been demonstrated in many biological and developmental processes, including cell pluripotency induction, X-inactivation or gene imprinting. Analysis of RNA-seq data highlights that lncRNA abundance changes significantly during human early embryonic development. This suggests that lncRNAs could represent candidate biomarkers for developing non-invasive tests for oocyte or embryo quality. Finally, some of these lncRNAs are also expressed in human cancer tissues, suggesting that reactivation of an embryonic lncRNA program may contribute to human malignancies. WIDER IMPLICATIONS: LncRNAs are emerging potential key players in gene expression regulation. Analysis of RNA-seq data from human pre-implantation embryos identified lncRNA signatures that are specific to this critical step. We anticipate that further studies will show that these new transcripts are major regulators of embryo development. These findings might also be used to develop new tests/treatments for improving the pregnancy success rate in IVF procedures or for regenerative medicine applications involving PSC.
Affiliation(s)
- Julien Bouckenheimer
- Institute for Regenerative Medicine and Biotherapy, CHU Montpellier, Saint-Eloi Hospital, Montpellier F 34000, France; INSERM, U1183, Montpellier F 34000, France; Université de Montpellier, Montpellier F 34000, France
- Said Assou
- Institute for Regenerative Medicine and Biotherapy, CHU Montpellier, Saint-Eloi Hospital, Montpellier F 34000, France; INSERM, U1183, Montpellier F 34000, France; Université de Montpellier, Montpellier F 34000, France
- Sébastien Riquier
- Institute for Regenerative Medicine and Biotherapy, CHU Montpellier, Saint-Eloi Hospital, Montpellier F 34000, France; INSERM, U1183, Montpellier F 34000, France; Université de Montpellier, Montpellier F 34000, France
- Cyrielle Hou
- Institute for Regenerative Medicine and Biotherapy, CHU Montpellier, Saint-Eloi Hospital, Montpellier F 34000, France; INSERM, U1183, Montpellier F 34000, France; Université de Montpellier, Montpellier F 34000, France
- Nicolas Philippe
- Institute for Regenerative Medicine and Biotherapy, CHU Montpellier, Saint-Eloi Hospital, Montpellier F 34000, France; INSERM, U1183, Montpellier F 34000, France; Université de Montpellier, Montpellier F 34000, France; Coretec, Montpellier, France
- Caroline Sansac
- Institute for Regenerative Medicine and Biotherapy, CHU Montpellier, Saint-Eloi Hospital, Montpellier F 34000, France; INSERM, U1183, Montpellier F 34000, France; Université de Montpellier, Montpellier F 34000, France
- Thérèse Commes
- Institute for Regenerative Medicine and Biotherapy, CHU Montpellier, Saint-Eloi Hospital, Montpellier F 34000, France; INSERM, U1183, Montpellier F 34000, France; Université de Montpellier, Montpellier F 34000, France; Institut de Biologie Computationnelle, Montpellier F 34000, France
- Jean-Marc Lemaître
- Institute for Regenerative Medicine and Biotherapy, CHU Montpellier, Saint-Eloi Hospital, Montpellier F 34000, France; INSERM, U1183, Montpellier F 34000, France; Stem Cell Core Facility SAFE-iPSC, INGESTEM, Saint-Eloi Hospital, Montpellier F 34000, France
- Anthony Boureux
- Institute for Regenerative Medicine and Biotherapy, CHU Montpellier, Saint-Eloi Hospital, Montpellier F 34000, France; INSERM, U1183, Montpellier F 34000, France; Université de Montpellier, Montpellier F 34000, France
- John De Vos
- Institute for Regenerative Medicine and Biotherapy, CHU Montpellier, Saint-Eloi Hospital, Montpellier F 34000, France; INSERM, U1183, Montpellier F 34000, France; Université de Montpellier, Montpellier F 34000, France; Institut de Biologie Computationnelle, Montpellier F 34000, France; Stem Cell Core Facility SAFE-iPSC, INGESTEM, Saint-Eloi Hospital, Montpellier F 34000, France; Department of Cell and Tissue Engineering, CHU Montpellier, Saint-Eloi Hospital, Montpellier F 34000, France
15
Nitsche A, Stadler PF. Evolutionary clues in lncRNAs. Wiley Interdiscip Rev RNA 2016; 8. [PMID: 27436689 DOI: 10.1002/wrna.1376] [Citation(s) in RCA: 43] [Impact Index Per Article: 5.4] [Received: 02/24/2016] [Revised: 06/06/2016] [Accepted: 06/09/2016] [Indexed: 12/13/2022]
Abstract
The diversity of long non-coding RNAs (lncRNAs) in the human transcriptome is in stark contrast to the sparse exploration of their functions concomitant with their conservation and evolution. The pervasive transcription of the largely non-coding human genome makes the evolutionary age and conservation patterns of lncRNAs a topic of interest. Yet it is a fairly unexplored field, and these properties are not as easy to determine as for protein-coding genes. Although there are a few experimentally studied cases, which are conserved at the sequence level, most lncRNAs exhibit weak or untraceable primary sequence conservation. Recent studies shed light on the interspecies conservation of secondary structures among lncRNA homologs by using diverse computational methods. This highlights the importance of structure on functionality of lncRNAs as opposed to the poor impact of primary sequence changes. Further clues in the evolution of lncRNAs are given by selective constraints on non-coding gene structures (e.g., promoters or splice sites) as well as the conservation of prevalent spatio-temporal expression patterns. However, a rapid evolutionary turnover is observable throughout the heterogeneous group of lncRNAs. This still gives rise to questions about its functional meaning. WIREs RNA 2017, 8:e1376. doi: 10.1002/wrna.1376
Affiliation(s)
- Anne Nitsche
- Bioinformatics Group, Department of Computer Science, University Leipzig, Leipzig, Germany; Institute de Biologie Moléculaire et Cellulaire, Université de Strasbourg, Cedex, France
- Peter F Stadler
- Bioinformatics Group, Department of Computer Science, University Leipzig, Leipzig, Germany; Interdisciplinary Center for Bioinformatics, University Leipzig, Leipzig, Germany; Max Planck Institute for Mathematics in the Sciences, Leipzig, Germany; Department of Diagnostics, Fraunhofer Institute for Cell Therapy and Immunology - IZI, Leipzig, Germany; Center for Non-Coding RNA in Technology and Health, University of Copenhagen, Frederiksberg, Denmark; Department of Theoretical Chemistry, University of Vienna, Wien, Austria; Santa Fe Institute, Santa Fe, NM, USA
16
Dingle K, Schaper S, Louis AA. The structure of the genotype-phenotype map strongly constrains the evolution of non-coding RNA. Interface Focus 2015; 5:20150053. [PMID: 26640651 DOI: 10.1098/rsfs.2015.0053] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.8] [Indexed: 12/20/2022] Open
Abstract
The prevalence of neutral mutations implies that biological systems typically have many more genotypes than phenotypes. But, can the way that genotypes are distributed over phenotypes determine evolutionary outcomes? Answering such questions is difficult, in part because the number of genotypes can be hyper-astronomically large. By solving the genotype-phenotype (GP) map for RNA secondary structure (SS) for systems up to length L = 126 nucleotides (where the set of all possible RNA strands would weigh more than the mass of the visible universe), we show that the GP map strongly constrains the evolution of non-coding RNA (ncRNA). Simple random sampling over genotypes predicts the distribution of properties such as the mutational robustness or the number of stems per SS found in naturally occurring ncRNA with surprising accuracy. Because we ignore natural selection, this strikingly close correspondence with the mapping suggests that structures allowing for functionality are easily discovered, despite the enormous size of the genetic spaces. The mapping is extremely biased: the majority of genotypes map to an exponentially small portion of the morphospace of all biophysically possible structures. Such strong constraints provide a non-adaptive explanation for the convergent evolution of structures such as the hammerhead ribozyme. These results present a particularly clear example of bias in the arrival of variation strongly shaping evolutionary outcomes and may be relevant to Mayr's distinction between proximate and ultimate causes in evolutionary biology.
Affiliation(s)
- Kamaludin Dingle
- Rudolf Peierls Centre for Theoretical Physics, University of Oxford, Oxford OX1 3NP, UK; Systems Biology DTC, University of Oxford, Oxford, UK; Department of Mathematics and Natural Sciences, Gulf University for Science and Technology, Block 5, West Mishref, Kuwait
- Steffen Schaper
- Rudolf Peierls Centre for Theoretical Physics, University of Oxford, Oxford OX1 3NP, UK
- Ard A Louis
- Rudolf Peierls Centre for Theoretical Physics, University of Oxford, Oxford OX1 3NP, UK
17
Cordero P, Das R. Rich RNA Structure Landscapes Revealed by Mutate-and-Map Analysis. PLoS Comput Biol 2015; 11:e1004473. [PMID: 26566145 PMCID: PMC4643908 DOI: 10.1371/journal.pcbi.1004473] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.8] [Received: 05/02/2015] [Accepted: 07/20/2015] [Indexed: 11/19/2022] Open
Abstract
Landscapes exhibiting multiple secondary structures arise in natural RNA molecules that modulate gene expression, protein synthesis, and viral infection [corrected]. We report herein that high-throughput chemical experiments can isolate an RNA's multiple alternative secondary structures as they are stabilized by systematic mutagenesis (mutate-and-map, M2) and that a computational algorithm, REEFFIT, enables unbiased reconstruction of these states' structures and populations. In an in silico benchmark on non-coding RNAs with complex landscapes, M2-REEFFIT recovers 95% of RNA helices present with at least 25% population while maintaining a low false discovery rate (10%) and conservative error estimates. In experimental benchmarks, M2-REEFFIT recovers the structure landscapes of a 35-nt MedLoop hairpin, a 110-nt 16S rRNA four-way junction with an excited state, a 25-nt bistable hairpin, and a 112-nt three-state adenine riboswitch with its expression platform, molecules whose characterization previously required expert mutational analysis and specialized NMR or chemical mapping experiments. With this validation, M2-REEFFIT enabled tests of whether artificial RNA sequences might exhibit complex landscapes in the absence of explicit design. An artificial flavin mononucleotide riboswitch and a randomly generated RNA sequence are found to interconvert between three or more states, including structures for which there was no design, but that could be stabilized through mutations. These results highlight the likely pervasiveness of rich landscapes with multiple secondary structures in both natural and artificial RNAs and demonstrate an automated chemical/computational route for their empirical characterization.
Affiliation(s)
- Pablo Cordero
- Biomedical Informatics Program, Stanford University, Stanford, California, United States of America
- Biochemistry Department, Stanford University, Stanford, California, United States of America
- Rhiju Das
- Biomedical Informatics Program, Stanford University, Stanford, California, United States of America
- Biochemistry Department, Stanford University, Stanford, California, United States of America
- Physics Department, Stanford University, Stanford, California, United States of America
18
Wang H, Niu QW, Wu HW, Liu J, Ye J, Yu N, Chua NH. Analysis of non-coding transcriptome in rice and maize uncovers roles of conserved lncRNAs associated with agriculture traits. Plant J 2015; 84:404-16. [PMID: 26387578 DOI: 10.1111/tpj.13018] [Citation(s) in RCA: 110] [Impact Index Per Article: 12.2] [Received: 04/21/2015] [Accepted: 08/26/2015] [Indexed: 05/07/2023]
Abstract
Long non-coding RNAs (lncRNAs) have recently been found to widely exist in eukaryotes and play important roles in key biological processes. To extend our knowledge of lncRNAs in crop plants we performed both non-directional and strand-specific RNA-sequencing experiments to profile non-coding transcriptomes of various rice and maize organs at different developmental stages. Analysis of more than 3 billion reads identified 22 334 long intergenic non-coding RNAs (lincRNAs) and 6673 pairs of sense and natural antisense transcript (NAT). Many lincRNA genes were associated with epigenetic marks. Expression of rice lincRNA genes was significantly correlated with that of nearby protein-coding genes. A set of NAT genes also showed expression correlation with their sense genes. More than 200 rice lincRNA genes had homologous non-coding sequences in the maize genome. Many more lincRNA and NAT genes were derived from conserved genomic regions between the two cereals presenting positional conservation. Protein-coding genes flanking or having a sense-antisense relationship to these conserved lncRNA genes were mainly involved in development and stress responses, suggesting that the associated lncRNAs might have similar functions. Integrating previous genome-wide association studies (GWAS), we found that hundreds of lincRNAs contain trait-associated single nucleotide polymorphisms (SNPs), suggesting their putative contributions to developmental and agriculture traits.
Affiliation(s)
- Huan Wang
- Laboratory of Plant Molecular Biology, The Rockefeller University, 1230 York Avenue, New York, NY, 10065, USA
- Qi-Wen Niu
- Laboratory of Plant Molecular Biology, The Rockefeller University, 1230 York Avenue, New York, NY, 10065, USA
- Hui-Wen Wu
- Laboratory of Plant Molecular Biology, The Rockefeller University, 1230 York Avenue, New York, NY, 10065, USA
- Jun Liu
- Laboratory of Plant Molecular Biology, The Rockefeller University, 1230 York Avenue, New York, NY, 10065, USA
- Jian Ye
- Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, Singapore City, 117604, Singapore
- Niu Yu
- Laboratory of Plant Molecular Biology, The Rockefeller University, 1230 York Avenue, New York, NY, 10065, USA
- Nam-Hai Chua
- Laboratory of Plant Molecular Biology, The Rockefeller University, 1230 York Avenue, New York, NY, 10065, USA
19
Gopal A, Egecioglu DE, Yoffe AM, Ben-Shaul A, Rao ALN, Knobler CM, Gelbart WM. Viral RNAs are unusually compact. PLoS One 2014; 9:e105875. [PMID: 25188030 PMCID: PMC4154850 DOI: 10.1371/journal.pone.0105875] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Received: 05/25/2014] [Accepted: 07/21/2014] [Indexed: 01/28/2023] Open
Abstract
A majority of viruses are composed of long single-stranded genomic RNA molecules encapsulated by protein shells with diameters of just a few tens of nanometers. We examine the extent to which these viral RNAs have evolved to be physically compact molecules to facilitate encapsulation. Measurements of equal-length viral, non-viral, coding and non-coding RNAs show viral RNAs to have among the smallest sizes in solution, i.e., the highest gel-electrophoretic mobilities and the smallest hydrodynamic radii. Using graph-theoretical analyses we demonstrate that their sizes correlate with the compactness of branching patterns in predicted secondary structure ensembles. The density of branching is determined by the number and relative positions of 3-helix junctions, and is highly sensitive to the presence of rare higher-order junctions with 4 or more helices. Compact branching arises from a preponderance of base pairing between nucleotides close to each other in the primary sequence. The density of branching represents a degree of freedom optimized by viral RNA genomes in response to the evolutionary pressure to be packaged reliably. Several families of viruses are analyzed to delineate the effects of capsid geometry, size and charge stabilization on the selective pressure for RNA compactness. Compact branching has important implications for RNA folding and viral assembly.
Affiliation(s)
- Ajaykumar Gopal
- Department of Chemistry & Biochemistry, University of California Los Angeles, Los Angeles, California, United States of America
- Defne E. Egecioglu
- Department of Chemistry & Biochemistry, University of California Los Angeles, Los Angeles, California, United States of America
- Aron M. Yoffe
- Department of Chemistry & Biochemistry, University of California Los Angeles, Los Angeles, California, United States of America
- Avinoam Ben-Shaul
- Institute of Chemistry & The Fritz Haber Research Center, The Hebrew University of Jerusalem, Givat Ram, Jerusalem, Israel
- Ayala L. N. Rao
- Department of Plant Pathology, University of California Riverside, Riverside, California, United States of America
- Charles M. Knobler
- Department of Chemistry & Biochemistry, University of California Los Angeles, Los Angeles, California, United States of America
- William M. Gelbart
- Department of Chemistry & Biochemistry, University of California Los Angeles, Los Angeles, California, United States of America
20
Evidence of pervasive biologically functional secondary structures within the genomes of eukaryotic single-stranded DNA viruses. J Virol 2013; 88:1972-89. [PMID: 24284329 DOI: 10.1128/jvi.03031-13] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Indexed: 12/17/2022] Open
Abstract
Single-stranded DNA (ssDNA) viruses have genomes that are potentially capable of forming complex secondary structures through Watson-Crick base pairing between their constituent nucleotides. A few of the structural elements formed by such base pairings are, in fact, known to have important functions during the replication of many ssDNA viruses. Unknown, however, are (i) whether numerous additional ssDNA virus genomic structural elements predicted to exist by computational DNA folding methods actually exist and (ii) whether those structures that do exist have any biological relevance. We therefore computationally inferred lists of the most evolutionarily conserved structures within a diverse selection of animal- and plant-infecting ssDNA viruses drawn from the families Circoviridae, Anelloviridae, Parvoviridae, Nanoviridae, and Geminiviridae and analyzed these for evidence of natural selection favoring the maintenance of these structures. While we find evidence that is consistent with purifying selection being stronger at nucleotide sites that are predicted to be base paired than at sites predicted to be unpaired, we also find strong associations between sites that are predicted to pair with one another and site pairs that are apparently coevolving in a complementary fashion. Collectively, these results indicate that natural selection actively preserves much of the pervasive secondary structure that is evident within eukaryote-infecting ssDNA virus genomes and, therefore, that much of this structure is biologically functional. Lastly, we provide examples of various highly conserved but completely uncharacterized structural elements that likely have important functions within some of the ssDNA virus genomes analyzed here.
21
Faber M, Klumpp S. Kinetic Monte Carlo approach to RNA folding dynamics using structure-based models. Phys Rev E Stat Nonlin Soft Matter Phys 2013; 88:052701. [PMID: 24329290 DOI: 10.1103/physreve.88.052701] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Received: 05/03/2013] [Revised: 09/03/2013] [Indexed: 06/03/2023]
Abstract
RNA molecules form three-dimensional structures via base pairing that determine the function and biochemical activity of the molecule. Here we introduce a structure-based method for studying the folding dynamics of RNA secondary structures. The approach focuses on native contacts that are parametrized with standard empirical free energies. Kinetic Monte Carlo simulations for free folding of simple hairpins and complex structures such as a tRNA as well as for folding in the presence of an external force show good agreement with experimental data. A systematic comparison of simulated and experimental folding rates for various structures shows a strong correlation, indicating that the approach can predict folding rates within about an order of magnitude.
Affiliation(s)
- Michael Faber
- Max Planck Institute of Colloids and Interfaces, Science Park Golm, 14424 Potsdam, Germany
- Stefan Klumpp
- Max Planck Institute of Colloids and Interfaces, Science Park Golm, 14424 Potsdam, Germany
22
Abstract
Long intervening noncoding RNAs (lincRNAs) are transcribed from thousands of loci in mammalian genomes and might play widespread roles in gene regulation and other cellular processes. This Review outlines the emerging understanding of lincRNAs in vertebrate animals, with emphases on how they are being identified and current conclusions and questions regarding their genomics, evolution and mechanisms of action.
Affiliation(s)
- Igor Ulitsky
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
23
Prediction of hammerhead ribozyme intracellular activity with the catalytic core fingerprint. Biochem J 2013; 451:439-51. [DOI: 10.1042/bj20121761] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Indexed: 11/17/2022]
Abstract
Hammerhead ribozyme is a versatile tool for down-regulation of gene expression in vivo. Owing to its small size and high activity, it is used as a model for RNA structure–function relationship studies. In the present paper we describe a new extended hammerhead ribozyme HH-2 with a tertiary stabilizing motif constructed on the basis of the tetraloop receptor sequence. This ribozyme is very active in living cells, but shows low activity in vitro. To understand it, we analysed tertiary structure models of substrate–ribozyme complexes. We calculated six unique catalytic core geometry parameters as distances and angles between particular atoms that we call the ribozyme fingerprint. A flanking sequence and tertiary motif change the geometry of the general base, general acid, nucleophile and leaving group. We found almost complete correlation between these parameters and the decrease of target gene expression in the cells. The tertiary structure model calculations allow us to predict ribozyme intracellular activity. Our approach could be widely adapted to characterize catalytic properties of other RNAs.
24
Structure and function of long noncoding RNAs in epigenetic regulation. Nat Struct Mol Biol 2013; 20:300-7. [DOI: 10.1038/nsmb.2480] [Citation(s) in RCA: 1087] [Impact Index Per Article: 98.8] [Received: 09/17/2012] [Accepted: 11/20/2012] [Indexed: 12/21/2022]
25
Villarreal LP, Witzany G. The DNA Habitat and its RNA Inhabitants: At the Dawn of RNA Sociology. Genomics Insights 2013; 6:1-12. [PMID: 26217106 PMCID: PMC4510605 DOI: 10.4137/gei.s11490] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Indexed: 02/06/2023]
Abstract
Most molecular biological concepts derive from physical chemical assumptions about the genetic code that are basically more than 40 years old. Additionally, systems biology, another quantitative approach, investigates the sum of interrelations to obtain a more holistic picture of nucleotide sequence order. Recent empirical data on genetic code compositions and rearrangements by mobile genetic elements and noncoding RNAs, together with results of virus research and their role in evolution, do not really fit into these concepts and compel a reexamination. In this review, we try to find an alternate hypothesis. It seems plausible now that if we look at the abundance of regulatory RNAs and persistent viruses in host genomes, we will find more and more evidence that the key players that edit the genetic codes of host genomes are consortia of RNA agents and viruses that drive evolutionary novelty and regulation of cellular processes in all steps of development. This agent-based approach may lead to a qualitative RNA sociology that investigates and identifies relevant behavioral motifs of cooperative RNA consortia. In addition to molecular biological perspectives, this may lead to a better understanding of genetic code evolution and dynamics.
Affiliation(s)
- Luis P Villarreal
- Department of Molecular Biology and Biochemistry, University of California, Irvine, CA, USA
26
Kraft JJ, Treder K, Peterson MS, Miller WA. Cation-dependent folding of 3' cap-independent translation elements facilitates interaction of a 17-nucleotide conserved sequence with eIF4G. Nucleic Acids Res 2013; 41:3398-413. [PMID: 23361463 PMCID: PMC3597692 DOI: 10.1093/nar/gkt026] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.9] [Indexed: 01/13/2023] Open
Abstract
The 3′-untranslated regions of many plant viral RNAs contain cap-independent translation elements (CITEs) that drive translation initiation at the 5′-end of the mRNA. The barley yellow dwarf virus-like CITE (BTE) stimulates translation by binding the eIF4G subunit of translation initiation factor eIF4F with high affinity. To understand this interaction, we characterized the dynamic structural properties of the BTE, mapped the eIF4G-binding sites on the BTE and identified a region of eIF4G that is crucial for BTE binding. BTE folding involves cooperative uptake of magnesium ions and is driven primarily by charge neutralization. Footprinting experiments revealed that functional eIF4G fragments protect the highly conserved stem–loop I and a downstream bulge. The BTE forms a functional structure in the absence of protein, and the loop that base pairs the 5′-untranslated region (5′-UTR) remains solvent-accessible at high eIF4G concentrations. The region in eIF4G between the eIF4E-binding site and the MIF4G region is required for BTE binding and translation. The data support the model in which the eIF4F complex binds directly to the BTE which base pairs simultaneously to the 5′-UTR, allowing eIF4F to recruit the 40S ribosomal subunit to the 5′-end.
Affiliation(s)
- Jelena J Kraft
- Department of Plant Pathology and Microbiology, Iowa State University, Ames, IA 50011, USA
27
Ferrada E, Wagner A. A comparison of genotype-phenotype maps for RNA and proteins. Biophys J 2012; 102:1916-25. [PMID: 22768948 DOI: 10.1016/j.bpj.2012.01.047] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Received: 09/07/2011] [Revised: 01/19/2012] [Accepted: 01/27/2012] [Indexed: 02/04/2023] Open
Abstract
The relationship between the genotype (sequence) and the phenotype (structure) of macromolecules affects their ability to evolve new structures and functions. We here compare the genotype space organization of proteins and RNA molecules to identify differences that may affect this ability. To this end, we computationally study the genotype-phenotype relationship for short RNA and lattice proteins of a reduced monomer alphabet size, to make exhaustive analysis and direct comparison of their genotype spaces feasible. We find that many fewer protein molecules than RNA molecules fold, but they fold into many more structures than RNA. In consequence, protein phenotypes have smaller genotype networks whose member genotypes tend to be more similar than for RNA phenotypes. Neighborhoods in sequence space of a given radius around an RNA molecule contain more novel structures than for protein molecules. We compare this property to evidence from natural RNA and protein molecules, and conclude that RNA genotype space may be more conducive to the evolution of new structure phenotypes.
Collapse
Affiliation(s)
- Evandro Ferrada
- Institute of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland.
28
Wan Y, Qu K, Ouyang Z, Kertesz M, Li J, Tibshirani R, Makino DL, Nutter RC, Segal E, Chang HY. Genome-wide measurement of RNA folding energies. Mol Cell 2012; 48:169-81. [PMID: 22981864 DOI: 10.1016/j.molcel.2012.08.008] [Citation(s) in RCA: 174] [Impact Index Per Article: 14.5] [Received: 04/06/2012] [Revised: 06/19/2012] [Accepted: 08/02/2012] [Indexed: 12/31/2022]
Abstract
RNA structural transitions are important in the function and regulation of RNAs. Here, we reveal a layer of transcriptome organization in the form of RNA folding energies. By probing yeast RNA structures at different temperatures, we obtained relative melting temperatures (Tm) for RNA structures in over 4000 transcripts. Specific signatures of RNA Tm demarcated the polarity of mRNA open reading frames and highlighted numerous candidate regulatory RNA motifs in 3' untranslated regions. RNA Tm distinguished noncoding versus coding RNAs and identified mRNAs with distinct cellular functions. We identified thousands of putative RNA thermometers, and their presence is predictive of the pattern of RNA decay in vivo during heat shock. The exosome complex recognizes unpaired bases during heat shock to degrade these RNAs, coupling intrinsic structural stabilities to gene regulation. Thus, genome-wide structural dynamics of RNA can parse functional elements of the transcriptome and reveal diverse biological insights.
Affiliation(s)
- Yue Wan
- Howard Hughes Medical Institute and Program in Epithelial Biology, Stanford University, Stanford, CA 94305, USA
29
Adilakshmi T, Sudol I, Tapinos N. Combinatorial action of miRNAs regulates transcriptional and post-transcriptional gene silencing following in vivo PNS injury. PLoS One 2012; 7:e39674. [PMID: 22792185 PMCID: PMC3391190 DOI: 10.1371/journal.pone.0039674] [Citation(s) in RCA: 51] [Impact Index Per Article: 4.3] [Received: 01/30/2012] [Accepted: 05/25/2012] [Indexed: 11/18/2022] Open
Abstract
Injury response in the peripheral nervous system (PNS) is characterized by rapid alterations in the genetic program of Schwann cells. However, the epigenetic mechanisms modulating these changes remain elusive. Here we show that sciatic nerve injury in mice induces a cohort of 22 miRNAs, which coordinate Schwann cell differentiation and dedifferentiation through a combinatorial modulation of their positive and negative gene regulators. These miRNAs and their targeted mRNAs form functional complexes with the Argonaute-2 protein to mediate post-transcriptional gene silencing. MiR-138 and miR-709 show the highest affinity amongst the cohort, for binding and regulation of Egr2, Sox-2 and c-Jun expression following injury. Moreover, miR-709 participates in the formation of epigenetic silencing complexes with H3K27me3 and Argonaute-1 to induce transcriptional gene silencing of the Egr2 promoter. Collectively, we identified a discrete cohort of miRNAs as the central epigenetic regulators of the transition between differentiation and dedifferentiation during the acute phase of PNS injury.
Affiliation(s)
- Tadepalli Adilakshmi
- Molecular Neuroscience Laboratory, Weis Center for Research, Geisinger Clinic, Danville, Pennsylvania, United States of America
- Ida Sudol
- Molecular Neuroscience Laboratory, Weis Center for Research, Geisinger Clinic, Danville, Pennsylvania, United States of America
- Nikos Tapinos
- Molecular Neuroscience Laboratory, Weis Center for Research, Geisinger Clinic, Danville, Pennsylvania, United States of America
30
Behrouzi R, Roh JH, Kilburn D, Briber RM, Woodson SA. Cooperative tertiary interaction network guides RNA folding. Cell 2012; 149:348-57. [PMID: 22500801 DOI: 10.1016/j.cell.2012.01.057] [Citation(s) in RCA: 77] [Impact Index Per Article: 6.4] [Received: 08/24/2011] [Revised: 11/02/2011] [Accepted: 01/26/2012] [Indexed: 01/06/2023]
Abstract
Noncoding RNAs form unique 3D structures, which perform many regulatory functions. To understand how RNAs fold uniquely despite a small number of tertiary interaction motifs, we mutated the major tertiary interactions in a group I ribozyme by single-base substitutions. The resulting perturbations to the folding energy landscape were measured using SAXS, ribozyme activity, hydroxyl radical footprinting, and native PAGE. Double- and triple-mutant cycles show that most tertiary interactions have a small effect on the stability of the native state. Instead, the formation of core and peripheral structural motifs is cooperatively linked in near-native folding intermediates, and this cooperativity depends on the native helix orientation. The emergence of a cooperative interaction network at an early stage of folding suppresses nonnative structures and guides the search for the native state. We suggest that cooperativity in noncoding RNAs arose from natural selection of architectures conducive to forming a unique, stable fold.
Affiliation(s)
- Reza Behrouzi
- T.C. Jenkins Department of Biophysics, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD 21218, USA
31
De Lucrezia D, Slanzi D, Poli I, Polticelli F, Minervini G. Do natural proteins differ from random sequences polypeptides? Natural vs. random proteins classification using an evolutionary neural network. PLoS One 2012; 7:e36634. [PMID: 22615786 PMCID: PMC3353917 DOI: 10.1371/journal.pone.0036634] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Received: 12/19/2011] [Accepted: 04/04/2012] [Indexed: 11/19/2022] Open
Abstract
Are extant proteins the exquisite result of natural selection or are they random sequences slightly edited by evolution? This question has puzzled biochemists for long time and several groups have addressed this issue comparing natural protein sequences to completely random ones coming to contradicting conclusions. Previous works in literature focused on the analysis of primary structure in an attempt to identify possible signature of evolutionary editing. Conversely, in this work we compare a set of 762 natural proteins with an average length of 70 amino acids and an equal number of completely random ones of comparable length on the basis of their structural features. We use an ad hoc Evolutionary Neural Network Algorithm (ENNA) in order to assess whether and to what extent natural proteins are edited from random polypeptides employing 11 different structure-related variables (i.e. net charge, volume, surface area, coil, alpha helix, beta sheet, percentage of coil, percentage of alpha helix, percentage of beta sheet, percentage of secondary structure and surface hydrophobicity). The ENNA algorithm is capable to correctly distinguish natural proteins from random ones with an accuracy of 94.36%. Furthermore, we study the structural features of 32 random polypeptides misclassified as natural ones to unveil any structural similarity to natural proteins. Results show that random proteins misclassified by the ENNA algorithm exhibit a significant fold similarity to portions or subdomains of extant proteins at atomic resolution. Altogether, our results suggest that natural proteins are significantly edited from random polypeptides and evolutionary editing can be readily detected analyzing structural features. Furthermore, we also show that the ENNA, employing simple structural descriptors, can predict whether a protein chain is natural or random.
Affiliation(s)
- Davide De Lucrezia
- European Centre for Living Technology, University Ca’ Foscari Venice. Venice, Italy
- Debora Slanzi
- Dept. of Environmental Sciences, Informatics and Statistics, University Ca’ Foscari Venice, Venice, Italy
- Irene Poli
- European Centre for Living Technology, University Ca’ Foscari Venice. Venice, Italy
- Dept. of Environmental Sciences, Informatics and Statistics, University Ca’ Foscari Venice, Venice, Italy
- Fabio Polticelli
- Dept. of Biology, University of Roma Tre. Rome, Italy
- National Institute for Nuclear Physics, Roma Tre Section. Rome, Italy
- Giovanni Minervini
- European Centre for Living Technology, University Ca’ Foscari Venice. Venice, Italy
32
Harish A, Caetano-Anollés G. Ribosomal history reveals origins of modern protein synthesis. PLoS One 2012; 7:e32776. [PMID: 22427882 PMCID: PMC3299690 DOI: 10.1371/journal.pone.0032776] [Citation(s) in RCA: 104] [Impact Index Per Article: 8.7] [Received: 11/20/2011] [Accepted: 01/30/2012] [Indexed: 02/06/2023] Open
Abstract
The origin and evolution of the ribosome is central to our understanding of the cellular world. Most hypotheses posit that the ribosome originated in the peptidyl transferase center of the large ribosomal subunit. However, these proposals do not link protein synthesis to RNA recognition and do not use a phylogenetic comparative framework to study ribosomal evolution. Here we infer evolution of the structural components of the ribosome. Phylogenetic methods widely used in morphometrics are applied directly to RNA structures of thousands of molecules and to a census of protein structures in hundreds of genomes. We find that components of the small subunit involved in ribosomal processivity evolved earlier than the catalytic peptidyl transferase center responsible for protein synthesis. Remarkably, subunit RNA and proteins coevolved, starting with interactions between the oldest proteins (S12 and S17) and the oldest substructure (the ribosomal ratchet) in the small subunit and ending with the rise of a modern multi-subunit ribosome. Ancestral ribonucleoprotein components show similarities to in vitro evolved RNA replicase ribozymes and protein structures in extant replication machinery. Our study therefore provides important clues about the chicken-or-egg dilemma associated with the central dogma of molecular biology by showing that ribosomal history is driven by the gradual structural accretion of protein and RNA structures. Most importantly, results suggest that functionally important and conserved regions of the ribosome were recruited and could be relics of an ancient ribonucleoprotein world.
Affiliation(s)
- Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana-Champaign, Illinois, United States of America
33
Abstract
Changes to the conformation of coding and non-coding RNAs form the basis of elements of genetic regulation and provide an important source of complexity, which drives many of the fundamental processes of life. Although the structure of RNA is highly flexible, the underlying dynamics of RNA are robust and are limited to transitions between the few conformations that preserve favourable base-pairing and stacking interactions. The mechanisms by which cellular processes harness the intrinsic dynamic behaviour of RNA and use it within functionally productive pathways are complex. The versatile functions and ease by which it is integrated into a wide variety of genetic circuits and biochemical pathways suggests there is a general and fundamental role for RNA dynamics in cellular processes.
34
Labean TH, Butt TR, Kauffman SA, Schultes EA. Protein folding absent selection. Genes (Basel) 2011; 2:608-26. [PMID: 24710212 PMCID: PMC3927614 DOI: 10.3390/genes2030608] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Received: 07/10/2011] [Revised: 08/05/2011] [Accepted: 08/11/2011] [Indexed: 11/16/2022] Open
Abstract
Biological proteins are known to fold into specific 3D conformations. However, the fundamental question has remained: Do they fold because they are biological, and evolution has selected sequences which fold? Or is folding a common trait, widespread throughout sequence space? To address this question arbitrary, unevolved, random-sequence proteins were examined for structural features found in folded, biological proteins. Libraries of long (71 residue), random-sequence polypeptides, with ensemble amino acid composition near the mean for natural globular proteins, were expressed as cleavable fusions with ubiquitin. The structural properties of both the purified pools and individual isolates were then probed using circular dichroism, fluorescence emission, and fluorescence quenching techniques. Despite this necessarily sparse "sampling" of sequence space, structural properties that define globular biological proteins, namely collapsed conformations, secondary structure, and cooperative unfolding, were found to be prevalent among unevolved sequences. Thus, for polypeptides the size of small proteins, natural selection is not necessary to account for the compact and cooperative folded states observed in nature.
Affiliation(s)
- Thomas H Labean
- Sequenomics LLC, 1428 Chanterelle Lane, Hillsborough, NC 27278, USA.
- Tauseef R Butt
- LifeSensors Inc., 271 Great Valley Parkway, Suite 100, Malvern, PA 19355, USA.
- Stuart A Kauffman
- Complex Systems Center University of Vermont, 200C Farrell Hall, 210 Colchester Ave., Burlington, VT 05405, USA.
- Erik A Schultes
- Sequenomics LLC, 1428 Chanterelle Lane, Hillsborough, NC 27278, USA.
35
Guo Z, Gibson M, Sitha S, Chu S, Mohanty U. Role of large thermal fluctuations and magnesium ions in t-RNA selectivity of the ribosome. Proc Natl Acad Sci U S A 2011; 108:3947-51. [PMID: 21368154 PMCID: PMC3054037 DOI: 10.1073/pnas.1100671108] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Indexed: 11/18/2022] Open
Abstract
The fidelity of translation selection begins with the base pairing of codon-anticodon complex between the m-RNA and tRNAs. Binding of cognate and near-cognate tRNAs induces 30S subunit of the ribosome to wrap around the ternary complex, EF-Tu(GTP)aa-tRNA. We have proposed that large thermal fluctuations play a crucial role in the selection process. To test this conjecture, we have developed a theoretical technique to determine the probability that the ternary complex, as a result of large thermal fluctuations, forms contacts leading to stabilization of the GTPase activated state. We argue that the configurational searches for such processes are in the tail end of the probability distribution and show that the probability for this event is localized around the most likely configuration. Small variations in the repositioning of cognate relative to near-cognate complexes lead to rate enhancement of the cognate complex. The binding energies of over a dozen unique site-bound magnesium structural motifs are investigated and provide insights into the nature of interaction of divalent metal ions with the ribosome.
Affiliation(s)
- Zuojun Guo
- Department of Chemistry, Boston College, Chestnut Hill, MA 02467
- Meghan Gibson
- Department of Chemistry, Boston College, Chestnut Hill, MA 02467
- Sanyasi Sitha
- Department of Chemistry, Boston College, Chestnut Hill, MA 02467
- Steven Chu
- Departments of Physics, Molecular, and Cell Biology, University of California, Berkeley, CA 94720
- Udayan Mohanty
- Department of Chemistry, Boston College, Chestnut Hill, MA 02467
36
Zhang J, Lau MW, Ferré-D'Amaré AR. Ribozymes and riboswitches: modulation of RNA function by small molecules. Biochemistry 2010; 49:9123-31. [PMID: 20931966 DOI: 10.1021/bi1012645] [Citation(s) in RCA: 127] [Impact Index Per Article: 9.1] [Indexed: 11/28/2022]
Abstract
Diverse small molecules interact with catalytic RNAs (ribozymes) as substrates and cofactors, and their intracellular concentrations are sensed by gene-regulatory mRNA domains (riboswitches) that modulate transcription, splicing, translation, or RNA stability. Although recognition mechanisms vary from RNA to RNA, structural analyses reveal recurring strategies that arise from the intrinsic properties of RNA such as base pairing and stacking with conjugated heterocycles, and cation-dependent recognition of anionic functional groups. These studies also suggest that, to a first approximation, the magnitude of ligand-induced reorganization of an RNA is inversely proportional to the complexity of the riboswitch or ribozyme. How these small molecule binding-induced changes in RNA lead to alteration in gene expression is less well understood. While different riboswitches have been proposed to be under either kinetic or thermodynamic control, the biochemical and structural mechanisms that give rise to regulatory consequences downstream of small molecule recognition by RNAs mostly remain to be elucidated.
Affiliation(s)
- Jinwei Zhang
- Howard Hughes Medical Institute, Seattle, Washington 98109-1024, USA
37
Werner A. Predicting translational diffusion of evolutionary conserved RNA structures by the nucleotide number. Nucleic Acids Res 2010; 39:e17. [PMID: 21068070 PMCID: PMC3035447 DOI: 10.1093/nar/gkq808] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.1] [Indexed: 01/13/2023] Open
Abstract
Ribonucleic acids are highly conserved essential parts of cellular life. RNA function is determined to a large extent by its hydrodynamic behaviour. The presented study proposes a strategy to predict the hydrodynamic behaviour of RNA single strands on the basis of the polymer size. By atom-level shell-modelling of high-resolution structures, hydrodynamic radius and diffusion coefficient of evolutionary conserved RNA single strands (ssRNA) were calculated. The diffusion coefficients $D$ of 17–174 nucleotides (nt) containing ssRNA depended on the number of nucleotides $N$ with $D = 4.56 \times 10^{-10}\, N^{-0.39}\ \mathrm{m^2\,s^{-1}}$. The hydrodynamic radius $R_H$ depended on $N$ with $R_H = 5.00 \times 10^{-10}\, N^{0.38}\ \mathrm{m}$. An average ratio of the radius of gyration and the hydrodynamic radius of 0.98 ± 0.08 was calculated in solution. The empirical law was tested by in solution measured hydrodynamic radii and radii of gyration and was found to be highly consistent with experimental data of evolutionary conserved ssRNA. Furthermore, the hydrodynamic behaviour of several evolutionary unevolved ribonucleic acids could be predicted. Based on atom-level shell-modelling of high-resolution structures and experimental hydrodynamic data, empirical models are proposed, which enable to predict the translational diffusion coefficient and molecular size of short RNA single strands solely on the basis of the polymer size.
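The two empirical power laws quoted in this abstract are simple enough to evaluate directly. The following is a minimal sketch, not the authors' code: the function names, the `FITTED_RANGE` constant, and the choice to raise an error outside the 17–174 nt range are illustrative assumptions; only the coefficients and exponents come from the abstract.

```python
# Coefficients and exponents are taken from the abstract; everything else
# (names, range check) is an illustrative assumption.
FITTED_RANGE = (17, 174)  # nt range over which the abstract reports the fit


def _check_range(n_nt: int) -> None:
    """Reject lengths outside the range the power laws were derived from."""
    lo, hi = FITTED_RANGE
    if not lo <= n_nt <= hi:
        raise ValueError(f"N={n_nt} is outside the fitted range {lo}-{hi} nt")


def diffusion_coefficient(n_nt: int) -> float:
    """Translational diffusion coefficient D in m^2/s: D = 4.56e-10 * N^-0.39."""
    _check_range(n_nt)
    return 4.56e-10 * n_nt ** -0.39


def hydrodynamic_radius(n_nt: int) -> float:
    """Hydrodynamic radius R_H in m: R_H = 5.00e-10 * N^0.38."""
    _check_range(n_nt)
    return 5.00e-10 * n_nt ** 0.38


if __name__ == "__main__":
    for n in (17, 76, 174):
        print(n, diffusion_coefficient(n), hydrodynamic_radius(n))
```

Refusing to extrapolate is a conservative design choice here; the abstract itself suggests the law may also describe some RNAs outside the fitted set, so a caller could relax the check where that is justified.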
Affiliation(s)
- Arne Werner
- Experimental Biomolecular Physics, Applied Physics, Royal Institute of Technology, Stockholm, SE-10691, Sweden.
38
Abstract
Many non-coding RNAs fold into complex three-dimensional structures, yet the self-assembly of RNA structure is hampered by mispairing, weak tertiary interactions, electrostatic barriers, and the frequent requirement that the 5' and 3' ends of the transcript interact. This rugged free energy landscape for RNA folding means that some RNA molecules in a population rapidly form their native structure, while many others become kinetically trapped in misfolded conformations. Transient binding of RNA chaperone proteins destabilize misfolded intermediates and lower the transition states between conformations, producing a smoother landscape that increases the rate of folding and the probability that a molecule will find the native structure. DEAD-box proteins couple the chemical potential of ATP hydrolysis with repetitive cycles of RNA binding and release, expanding the range of conditions under which they can refold RNA structures.
Affiliation(s)
- Sarah A Woodson
- T. C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, MD, USA.
39
Sun FJ, Caetano-Anollés G. The ancient history of the structure of ribonuclease P and the early origins of Archaea. BMC Bioinformatics 2010; 11:153. [PMID: 20334683 PMCID: PMC2858038 DOI: 10.1186/1471-2105-11-153] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.9] [Received: 02/26/2010] [Accepted: 03/24/2010] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND Ribonuclease P is an ancient endonuclease that cleaves precursor tRNA and generally consists of a catalytic RNA subunit (RPR) and one or more proteins (RPPs). It represents an important macromolecular complex and model system that is universally distributed in life. Its putative origins have inspired fundamental hypotheses, including the proposal of an ancient RNA world. RESULTS To study the evolution of this complex, we constructed rooted phylogenetic trees of RPR molecules and substructures and estimated RPP age using a cladistic method that embeds structure directly into phylogenetic analysis. The general approach was used previously to study the evolution of tRNA, SINE RNA and 5S rRNA, the origins of metabolism, and the evolution and complexity of the protein world, and revealed here remarkable evolutionary patterns. Trees of molecules uncovered the tripartite nature of life and the early origin of archaeal RPRs. Trees of substructures showed molecules originated in stem P12 and were accessorized with a catalytic P1-P4 core structure before the first substructure was lost in Archaea. This core currently interacts with RPPs and ancient segments of the tRNA molecule. Finally, a census of protein domain structure in hundreds of genomes established RPPs appeared after the rise of metabolic enzymes at the onset of the protein world. CONCLUSIONS The study provides a detailed account of the history and early diversification of a fundamental ribonucleoprotein and offers further evidence in support of the existence of a tripartite organismal world that originated by the segregation of archaeal lineages from an ancient community of primordial organisms.
Affiliation(s)
- Feng-Jie Sun
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
- Laboratory of Molecular Epigenetics of the Ministry of Education, School of Life Sciences, Northeast Normal University, Changchun 130024, Jilin Province, PR China
- W.M. Keck Center for Comparative and Functional Genomics, Roy J. Carver Biotechnology Center, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
- Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
40
Abstract
Ribozymes (catalytic RNAs) were the center of a presumed RNA world in the early origin of life. In this issue, Lau and Unrau show evidence that an RNA world could have used a similar evolutionary pathway as most proteins do.
Affiliation(s)
- Ulrich F Müller
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, 92093, USA.
41
Briones C, Stich M, Manrubia SC. The dawn of the RNA World: toward functional complexity through ligation of random RNA oligomers. RNA (New York, N.Y.) 2009; 15:743-9. [PMID: 19318464 PMCID: PMC2673073 DOI: 10.1261/rna.1488609] [Citation(s) in RCA: 74] [Impact Index Per Article: 4.9] [Received: 11/27/2008] [Accepted: 01/31/2009] [Indexed: 05/23/2023]
Abstract
A main unsolved problem in the RNA World scenario for the origin of life is how a template-dependent RNA polymerase ribozyme emerged from short RNA oligomers obtained by random polymerization on mineral surfaces. A number of computational studies have shown that the structural repertoire yielded by that process is dominated by topologically simple structures, notably hairpin-like ones. A fraction of these could display RNA ligase activity and catalyze the assembly of larger, eventually functional RNA molecules retaining their previous modular structure: molecular complexity increases but template replication is absent. This allows us to build up a stepwise model of ligation-based, modular evolution that could pave the way to the emergence of a ribozyme with RNA replicase activity, step at which information-driven Darwinian evolution would be triggered.
Affiliation(s)
- Carlos Briones
- Centro de Astrobiología (CSIC-INTA), 28850 Torrejón de Ardoz, Madrid, Spain.
42
Ditzler MA, Rueda D, Mo J, Håkansson K, Walter NG. A rugged free energy landscape separates multiple functional RNA folds throughout denaturation. Nucleic Acids Res 2008; 36:7088-99. [PMID: 18988629 PMCID: PMC2602785 DOI: 10.1093/nar/gkn871] [Citation(s) in RCA: 62] [Impact Index Per Article: 3.9] [Indexed: 11/13/2022] Open
Abstract
The dynamic mechanisms by which RNAs acquire biologically functional structures are of increasing importance to the rapidly expanding fields of RNA therapeutics and biotechnology. Large energy barriers separating misfolded and functional states arising from alternate base pairing are a well-appreciated characteristic of RNA. In contrast, it is typically assumed that functionally folded RNA occupies a single native basin of attraction that is free of deeply dividing energy barriers (ergodic hypothesis). This assumption is widely used as an implicit basis to interpret experimental ensemble-averaged data. Here, we develop an experimental approach to isolate persistent sub-populations of a small RNA enzyme and show by single molecule fluorescence resonance energy transfer (smFRET), biochemical probing and high-resolution mass spectrometry that commitment to one of several catalytically active folds occurs unexpectedly high on the RNA folding energy landscape, resulting in partially irreversible folding. Our experiments reveal the retention of molecular heterogeneity following the complete loss of all native secondary and tertiary structure. Our results demonstrate a surprising longevity of molecular heterogeneity and advance our current understanding beyond that of non-functional misfolds of RNA kinetically trapped on a rugged folding-free energy landscape.
Affiliation(s)
- Mark A Ditzler
- Department of Chemistry, University of Michigan, Ann Arbor, MI 48109, USA
43
Abstract
We present a theory of the dependence on sequence of the three-dimensional size of large single-stranded (ss) RNA molecules. The work is motivated by the fact that the genomes of many viruses are large ssRNA molecules, often several thousand nucleotides long, and that these RNAs are spontaneously packaged into small rigid protein shells. We argue that there has been evolutionary pressure for the genome to have overall spatial properties (including an appropriate radius of gyration, $R_g$) that facilitate this assembly process. For an arbitrary RNA sequence, we introduce the (thermal) average maximum ladder distance (MLD) and use it as a measure of the "extendedness" of the RNA secondary structure. The MLD values of viral ssRNAs that package into capsids of fixed size are shown to be consistently smaller than those for randomly permuted sequences of the same length and base composition, and also smaller than those of natural ssRNAs that are not under evolutionary pressure to have a compact native form. By mapping these secondary structures onto a linear polymer model and by using MLD as a measure of effective contour length, we predict the $R_g$ values of viral ssRNAs are smaller than those of nonviral sequences. More generally, we predict the average MLD values of large nonviral ssRNAs scale as $N^{0.67 \pm 0.01}$, where $N$ is the number of nucleotides, and that their $R_g$ values vary as $\mathrm{MLD}^{0.5}$ in an ideal solvent, and hence as $N^{0.34}$. An alternative analysis, which explicitly includes all branches, is introduced and shown to yield consistent results.
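The exponent quoted for the radius of gyration follows from combining the two scaling relations stated in this abstract. As a brief worked step (proportionality constants are omitted because the abstract does not give them):

```latex
\langle \mathrm{MLD} \rangle \propto N^{0.67 \pm 0.01}, \qquad
R_g \propto \langle \mathrm{MLD} \rangle^{0.5}
\;\Longrightarrow\;
R_g \propto \left( N^{0.67} \right)^{0.5} = N^{0.335} \approx N^{0.34}
```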
44
Sun FJ, Caetano-Anollés G. Evolutionary patterns in the sequence and structure of transfer RNA: a window into early translation and the genetic code. PLoS One 2008; 3:e2799. [PMID: 18665254 PMCID: PMC2474678 DOI: 10.1371/journal.pone.0002799] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.6] [Received: 02/11/2008] [Accepted: 07/02/2008] [Indexed: 01/06/2023] Open
Abstract
Transfer RNA (tRNA) molecules play vital roles during protein synthesis. Their acceptor arms are aminoacylated with specific amino acid residues while their anticodons delimit codon specificity. The history of these two functions has been generally linked in evolutionary studies of the genetic code. However, these functions could have been differentially recruited as evolutionary signatures were left embedded in tRNA molecules. Here we built phylogenies derived from the sequence and structure of tRNA, we forced taxa into monophyletic groups using constraint analyses, tested competing evolutionary hypotheses, and generated timelines of amino acid charging and codon discovery. Charging of Sec, Tyr, Ser and Leu appeared ancient, while specificities related to Asn, Met, and Arg were derived. The timelines also uncovered an early role of the second and then first codon bases, identified codons for Ala and Pro as the most ancient, and revealed important evolutionary take-overs related to the loss of the long variable arm in tRNA. The lack of correlation between ancestries of amino acid charging and encoding indicated that the separate discoveries of these functions reflected independent histories of recruitment. These histories were probably curbed by co-options and important take-overs during early diversification of the living world.
Affiliation(s)
- Feng-Jie Sun
- Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
- Gustavo Caetano-Anollés
- Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
45
Dawson WK, Fujiwara K, Kawai G. Prediction of RNA pseudoknots using heuristic modeling with mapping and sequential folding. PLoS One 2007; 2:e905. [PMID: 17878940 PMCID: PMC1975678 DOI: 10.1371/journal.pone.0000905] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.7] [Received: 06/26/2007] [Accepted: 08/08/2007] [Indexed: 12/01/2022] Open
Abstract
Predicting RNA secondary structure is often the first step to determining the structure of RNA. Prediction approaches have historically avoided searching for pseudoknots because of the extreme combinatorial and time complexity of the problem. Yet neglecting pseudoknots limits the utility of such approaches. Here, an algorithm utilizing structure mapping and thermodynamics is introduced for RNA pseudoknot prediction that finds the minimum free energy and identifies information about the flexibility of the RNA. The heuristic approach takes advantage of the 5′ to 3′ folding direction of many biological RNA molecules and is consistent with the hierarchical folding hypothesis and the contact order model. Mapping methods are used to build and analyze the folded structure for pseudoknots and to add important 3D structural considerations. The program can predict some well known pseudoknot structures correctly. The results of this study suggest that many functional RNA sequences are optimized for proper folding. They also suggest directions we can proceed in the future to achieve even better results.
Affiliation(s)
- Wayne K Dawson
- Department of Life and Environmental Sciences, Chiba Institute of Technology, Narashino-shi, Chiba, Japan.
|
46
|
Grimson A, Farh KKH, Johnston WK, Garrett-Engele P, Lim LP, Bartel DP. MicroRNA targeting specificity in mammals: determinants beyond seed pairing. Mol Cell 2007; 27:91-105. [PMID: 17612493 PMCID: PMC3800283 DOI: 10.1016/j.molcel.2007.06.017] [Citation(s) in RCA: 2905] [Impact Index Per Article: 170.9] [Received: 11/14/2006] [Revised: 05/30/2007] [Accepted: 06/18/2007] [Indexed: 02/08/2023]
Abstract
Mammalian microRNAs (miRNAs) pair to 3'UTRs of mRNAs to direct their posttranscriptional repression. Important for target recognition are approximately 7 nt sites that match the seed region of the miRNA. However, these seed matches are not always sufficient for repression, indicating that other characteristics help specify targeting. By combining computational and experimental approaches, we uncovered five general features of site context that boost site efficacy: AU-rich nucleotide composition near the site, proximity to sites for coexpressed miRNAs (which leads to cooperative action), proximity to residues pairing to miRNA nucleotides 13-16, positioning within the 3'UTR at least 15 nt from the stop codon, and positioning away from the center of long UTRs. A model combining these context determinants quantitatively predicts site performance both for exogenously added miRNAs and for endogenous miRNA-message interactions. Because it predicts site efficacy without recourse to evolutionary conservation, the model also identifies effective nonconserved sites and siRNA off-targets.
Affiliation(s)
- Andrew Grimson
- Howard Hughes Medical Institute, Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Whitehead Institute for Biomedical Research, 9 Cambridge Center, Cambridge, MA 02142, USA
- Kyle Kai-How Farh
- Howard Hughes Medical Institute, Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Whitehead Institute for Biomedical Research, 9 Cambridge Center, Cambridge, MA 02142, USA
- Division of Health Sciences and Technology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Wendy K. Johnston
- Howard Hughes Medical Institute, Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Whitehead Institute for Biomedical Research, 9 Cambridge Center, Cambridge, MA 02142, USA
- Philip Garrett-Engele
- Rosetta Inpharmatics (wholly owned subsidiary of Merck and Co.), 401 Terry Avenue N, Seattle, WA 98109, USA
- Lee P. Lim
- Rosetta Inpharmatics (wholly owned subsidiary of Merck and Co.), 401 Terry Avenue N, Seattle, WA 98109, USA
- Contact: (L.P.L.), (D.P.B.)
- David P. Bartel
- Howard Hughes Medical Institute, Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Whitehead Institute for Biomedical Research, 9 Cambridge Center, Cambridge, MA 02142, USA
- Contact: (L.P.L.), (D.P.B.)
|
47
|
Burbano HA, Andrade E. Analysis of tRNA abstract shapes of precursor/derivative amino acids in Archaea. Gene 2007; 396:75-83. [PMID: 17433860 DOI: 10.1016/j.gene.2007.02.024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/14/2006] [Revised: 02/27/2007] [Accepted: 02/27/2007] [Indexed: 11/18/2022]
Abstract
Wong's theory of the genetic code's origin states that because of historical constraints, codon assignment depends on the relation between precursor and derivative amino acids, a result of the coevolutionary process between amino acids' biosynthetic pathways and tRNAs. Based on arguments supporting the assumption that natural selection favors more stable and thus functionally constrained structures, we tested whether precursor and derivative tRNAs are equally evolved by measuring their structural parameters, thermostability and molecular plasticity. We also estimated the extent to which precursor and derivative tRNAs differ within Archaea. We used Archaea sequences of both precursor and derivative tRNAs in order to examine the plastic repertoires or sets of suboptimal structures at a defined free energy interval. We grouped secondary structures according to their helix nesting and adjacency using abstract shapes analysis. This clustering enabled us to infer a consensus sequence for all shapes that fit the clover leaf secondary structure [Giegerich, R., et al., Nucleic Acids Res 2004; 32 (16): 4843-51.]. This consensus sequence was then folded in order to retrieve a set of suboptimal structures. For each pair of precursor and derivative tRNAs, we compared these plastic repertoires based on the number of secondary structures, the thermostability of the minimum free energy structure and two structural parameters (base pair propensity (P) and mean length of helical stem structures (S)), which were measured for every representative secondary structure [Schultes, E.A., et al., J Mol Evol 1999; 49 (1): 76-83.]. We found that derivative tRNAs have fewer numbers of shapes, higher thermostability and more stable parameters than precursor tRNAs, a fact in full agreement with Wong's coevolution theory of the genetic code.
Affiliation(s)
- Hernán A Burbano
- Grupo de Biología Molecular Teórica y Evolutiva, Universidad Nacional de Colombia, Bogotá, D.C., Colombia
|
48
|
Brown PH, Balbo A, Schuck P. Using prior knowledge in the determination of macromolecular size-distributions by analytical ultracentrifugation. Biomacromolecules 2007; 8:2011-24. [PMID: 17521163 PMCID: PMC1994561 DOI: 10.1021/bm070193j] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.2] [Indexed: 11/27/2022]
Abstract
Analytical ultracentrifugation has reemerged as a widely used tool for the study of ensembles of biological macromolecules to understand, for example, their size-distribution and interactions in free solution. Such information can be obtained from the mathematical analysis of the concentration and signal gradients across the solution column and their evolution in time generated as a result of the gravitational force. In sedimentation velocity analytical ultracentrifugation, this analysis is frequently conducted using high resolution, diffusion-deconvoluted sedimentation coefficient distributions. They are based on Fredholm integral equations, which are ill-posed unless stabilized by regularization. In many fields, maximum entropy and Tikhonov-Phillips regularization are well-established and powerful approaches that calculate the most parsimonious distribution consistent with the data and prior knowledge, in accordance with Occam's razor. In the implementations available in analytical ultracentrifugation, to date, the basic assumption implied is that all sedimentation coefficients are equally likely and that the information retrieved should be condensed to the least amount possible. Frequently, however, more detailed distributions would be warranted by specific detailed prior knowledge on the macromolecular ensemble under study, such as the expectation of the sample to be monodisperse or paucidisperse or the expectation for the migration to establish a bimodal sedimentation pattern based on Gilbert-Jenkins' theory for the migration of chemically reacting systems. So far, such prior knowledge has remained largely unused in the calculation of the sedimentation coefficient or molecular weight distributions or was only applied as constraints. 
In the present paper, we examine how prior expectations can be built directly into the computational data analysis, conservatively in a way that honors the complete information of the experimental data, whether or not consistent with the prior expectation. Consistent with analogous results in other fields, we find that the use of available prior knowledge can have a dramatic effect on the resulting molecular weight, sedimentation coefficient, and size-and-shape distributions and can significantly increase both their sensitivity and their resolution. Further, the use of multiple alternative prior information allows us to probe the range of possible interpretations consistent with the data.
Affiliation(s)
- Patrick H. Brown
- Protein Biophysics Resource, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, Bethesda, MD 20892
- Andrea Balbo
- Protein Biophysics Resource, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, Bethesda, MD 20892
- Peter Schuck
- Protein Biophysics Resource, National Institute of Biomedical Imaging and Bioengineering, National Institutes of Health, Bethesda, MD 20892
|
49
|
Koculi E, Hyeon C, Thirumalai D, Woodson SA. Charge density of divalent metal cations determines RNA stability. J Am Chem Soc 2007; 129:2676-82. [PMID: 17295487 PMCID: PMC2523262 DOI: 10.1021/ja068027r] [Citation(s) in RCA: 127] [Impact Index Per Article: 7.5] [Indexed: 11/28/2022]
Abstract
RNA molecules are exquisitely sensitive to the properties of counterions. The folding equilibrium of the Tetrahymena ribozyme is measured by nondenaturing gel electrophoresis in the presence of divalent group IIA metal cations. The stability of the folded ribozyme increases with the charge density (zeta) of the cation. Similar scaling is found when the free energy of the RNA folded in small and large metal cations is measured by urea denaturation. Brownian dynamics simulations of a polyelectrolyte show that the experimental observations can be explained by nonspecific ion-RNA interactions in the absence of site-specific metal chelation. The experimental and simulation results establish that RNA stability is largely determined by a combination of counterion charge and the packing efficiency of condensed cations that depends on the excluded volume of the cations.
Affiliation(s)
- Eda Koculi
- T. C. Jenkins Department of Biophysics, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD 21218, USA
|
50
|
Jones C, Spencer AC, Hsu JL, Spremulli L, Martinis SA, DeRider M, Agris PF. A counterintuitive Mg2+-dependent and modification-assisted functional folding of mitochondrial tRNAs. J Mol Biol 2006; 362:771-86. [PMID: 16949614 PMCID: PMC1781928 DOI: 10.1016/j.jmb.2006.07.036] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Received: 04/19/2006] [Revised: 07/05/2006] [Accepted: 07/19/2006] [Indexed: 10/24/2022]
Abstract
Mitochondrial tRNAs (mtRNAs) often lack domains and posttranscriptional modifications that are found in cytoplasmic tRNAs. These structural and chemical elements normally stabilize the folding of cytoplasmic tRNAs into canonical structures that are competent for aminoacylation and translation. For example, the dihydrouridine (D) stem and loop domain is involved in the tertiary structure of cytoplasmic tRNAs through hydrogen bonds and a Mg2+ bridge to the ribothymidine (T) stem and loop domain. These interactions are often absent in mtRNA because the D-domain is truncated or missing. Using gel mobility shift analyses, UV, circular dichroism and NMR spectroscopies and aminoacylation assays, we have investigated the functional folding interactions of chemically synthesized and site-specifically modified mitochondrial and cytoplasmic tRNAs. We found that Mg2+ is critical for folding of the truncated D-domain of bovine mtRNAMet with the tRNA's T-domain. Contrary to the expectation that Mg2+ stabilizes RNA folding, the mtRNAMet D-domain structure was unfolded and relaxed, rather than stabilized in the presence of Mg2+. Because the D-domain is transcribed prior to the T-domain, we conclude that Mg2+ prevents misfolding of the 5'-half of bovine mtRNAMet facilitating its correct interaction with the T-domain. The interaction of the mtRNAMet D-domain with the T-domain was enhanced by a pseudouridine located in either the D or T-domains compared to that of the unmodified RNAs (Kd=25.3, 24.6 and 44.4 microM, respectively). Mg2+ also affected the folding interaction of a yeast mtRNALeu1, but had minimal effect on the folding of an Escherichia coli cytoplasmic tRNALeu. The D-domain modification, dihydrouridine, facilitated mtRNALeu folding. These data indicate that conserved modifications assist and stabilize the formation of the functional mtRNA tertiary structure.
Affiliation(s)
- Christopher Jones
- Department of Structural and Molecular Biology, 128 Polk Hall, Campus Box 7622, North Carolina State University, Raleigh, NC 27695-7622
- Angela C. Spencer
- Department of Chemistry, Campus Box 3290, Venable and Kenan Laboratories, University of North Carolina-Chapel Hill, Chapel Hill, NC 27599-3290
- Jennifer L. Hsu
- Department of Biochemistry, 419 Roger Adams Laboratory, Box B-4, 600 S. Mathews Ave., University of Illinois at Urbana-Champaign, Urbana, IL 61801
- Linda Spremulli
- Department of Chemistry, Campus Box 3290, Venable and Kenan Laboratories, University of North Carolina-Chapel Hill, Chapel Hill, NC 27599-3290
- Susan A. Martinis
- Department of Biochemistry, 419 Roger Adams Laboratory, Box B-4, 600 S. Mathews Ave., University of Illinois at Urbana-Champaign, Urbana, IL 61801
- Michele DeRider
- Department of Structural and Molecular Biology, 128 Polk Hall, Campus Box 7622, North Carolina State University, Raleigh, NC 27695-7622
- Paul F. Agris
- Department of Structural and Molecular Biology, 128 Polk Hall, Campus Box 7622, North Carolina State University, Raleigh, NC 27695-7622
- Corresponding author; E-mail address of corresponding author:
|