1
|
Ward AJ, Partridge BE. Beyond DAD: proposing a one-letter code for nucleobase-mediated molecular recognition. J Mater Chem B 2025; 13:485-495. [PMID: 39569673 DOI: 10.1039/d4tb01999g] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2024]
Abstract
Nucleobase binding is a fundamental molecular recognition event central to modern biological and bioinspired supramolecular research. Underpinning this recognition is a deceptively simple hydrogen-bonding code, primarily based on the canonical nucleobases in DNA and RNA. Inspired by these biotic structures, chemists and biologists have designed abiotic hydrogen-bonding motifs that can interact with, augment, and reshape native molecular recognition, for applications ranging from genetic code expansion and nucleic acid recognition to supramolecular materials utilizing mono- and bifacial nucleobases. However, as the number of nucleobase-inspired motifs expands, the absence of a standard vocabulary to describe hydrogen bond (HB) patterns has led to a haphazard mixture of shorthand descriptors that are confusing and inconsistent. Alternative notations that specify individual HB sites (such as DAD for donor-acceptor-donor) are cumbersome for biological and supramolecular constructs that contain many such patterns. This situation creates a barrier to sharing and interpreting nucleobase-related research across sub-disciplines, hindering collaboration and innovation. In this perspective, we aim to initiate discourse on this issue by considering what would be needed to formulate a concise one-letter code for the HB patterns associated with synthetic nucleobases. We first summarize some of the issues caused by the current absence of a consistent naming scheme. Subsequently, we discuss some key considerations in designing a coherent naming system. Finally, we leverage chemical rationale and pedagogical mnemonic considerations to propose a succinct and intuitive one-letter code for supramolecular two- and three-HB motifs. We hope that this discussion will spark conversations within our interdisciplinary community, thereby facilitating collaboration and easing communication among researchers engaged in synthetic nucleobase design.
Collapse
Affiliation(s)
- Aiden J Ward
- Department of Chemistry, University of Rochester, Rochester, NY 14627-0216, USA.
| | - Benjamin E Partridge
- Department of Chemistry, University of Rochester, Rochester, NY 14627-0216, USA.
| |
Collapse
|
2
|
Zirbel CL, Auffinger P. Lone Pair…π Contacts and Structure Signatures of r(UNCG) Tetraloops, Z-Turns, and Z-Steps: A WebFR3D Survey. Molecules 2022; 27:molecules27144365. [PMID: 35889236 PMCID: PMC9323530 DOI: 10.3390/molecules27144365] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2022] [Revised: 06/29/2022] [Accepted: 07/04/2022] [Indexed: 02/04/2023] Open
Abstract
Z-DNA and Z-RNA have long appeared as oddities to nucleic acid scientists. However, their Z-step constituents are recurrently observed in all types of nucleic acid systems including ribosomes. Z-steps are NpN steps that are isostructural to Z-DNA CpG steps. Among their structural features, Z-steps are characterized by the presence of a lone pair…π contact that involves the stacking of the ribose O4′ atom of the first nucleotide with the 3′-face of the second nucleotide. Recently, it has been documented that the CpG step of the ubiquitous r(UNCG) tetraloops is a Z-step. Accordingly, such r(UNCG) conformations were called Z-turns. It has also been recognized that an r(GAAA) tetraloop in appropriate conditions can shapeshift to an unusual Z-turn conformation embedding an ApA Z-step. In this report, we explore the multiplicity of RNA motifs based on Z-steps by using the WebFR3D tool to which we added functionalities to be able to retrieve motifs containing lone pair…π contacts. Many examples that underscore the diversity and universality of these motifs are provided as well as tutorial guidance on using WebFR3D. In addition, this study provides an extensive survey of crystallographic, cryo-EM, NMR, and molecular dynamics studies on r(UNCG) tetraloops with a critical view on how to conduct database searches and exploit their results.
Collapse
Affiliation(s)
- Craig L. Zirbel
- Department of Mathematics and Statistics, Bowling Green State University, Bowling Green, OH 43403, USA;
| | - Pascal Auffinger
- Architecture et Réactivité de l’ARN, UPR 9002, Institut de Biologie Moléculaire et Cellulaire du CNRS, Université de Strasbourg, 67084 Strasbourg, France
- Correspondence: ; Tel.: +33-3-8841-7049; Fax: +33-3-8860-2218
| |
Collapse
|
3
|
Mráziková K, Mlýnský V, Kührová P, Pokorná P, Kruse H, Krepl M, Otyepka M, Banáš P, Šponer J. UUCG RNA Tetraloop as a Formidable Force-Field Challenge for MD Simulations. J Chem Theory Comput 2020; 16:7601-7617. [PMID: 33215915 DOI: 10.1021/acs.jctc.0c00801] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
Explicit solvent atomistic molecular dynamics (MD) simulations represent an established technique to study structural dynamics of RNA molecules and an important complement for diverse experimental methods. However, performance of molecular mechanical (MM) force fields (ff's) remains far from satisfactory even after decades of development, as apparent from a problematic structural description of some important RNA motifs. Actually, some of the smallest RNA molecules belong to the most challenging systems for MD simulations and, among them, the UUCG tetraloop is saliently difficult. We report a detailed analysis of UUCG MD simulations, depicting the sequence of events leading to the loss of the UUCG native state during MD simulations. The total amount of MD simulation data analyzed in this work is close to 1.3 ms. We identify molecular interactions, backbone conformations, and substates that are involved in the process. Then, we unravel specific ff deficiencies using diverse quantum mechanical/molecular mechanical (QM/MM) and QM calculations. Comparison between the MM and QM methods shows discrepancies in the description of the 5'-flanking phosphate moiety and both signature sugar-base interactions. Our work indicates that poor behavior of the UUCG tetraloop in simulations is a complex issue that cannot be attributed to one dominant and straightforwardly correctable factor. Instead, there is a concerted effect of multiple ff inaccuracies that are coupled and amplifying each other. We attempted to improve the simulation behavior by some carefully tailored interventions, but the results were still far from satisfactory, underlying the difficulties in development of accurate nucleic acid ff's.
Collapse
Affiliation(s)
- Klaudia Mráziková
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic.,National Centre for Biomolecular Research, Faculty of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech Republic
| | - Vojtěch Mlýnský
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic
| | - Petra Kührová
- Regional Centre of Advanced Technologies and Materials, Faculty of Science, Palacký University, Šlechtitelů 27, 783 71 Olomouc, Czech Republic
| | - Pavlína Pokorná
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic.,National Centre for Biomolecular Research, Faculty of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech Republic
| | - Holger Kruse
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic
| | - Miroslav Krepl
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic.,Regional Centre of Advanced Technologies and Materials, Faculty of Science, Palacký University, Šlechtitelů 27, 783 71 Olomouc, Czech Republic
| | - Michal Otyepka
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic.,Regional Centre of Advanced Technologies and Materials, Faculty of Science, Palacký University, Šlechtitelů 27, 783 71 Olomouc, Czech Republic
| | - Pavel Banáš
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic.,Regional Centre of Advanced Technologies and Materials, Faculty of Science, Palacký University, Šlechtitelů 27, 783 71 Olomouc, Czech Republic
| | - Jiří Šponer
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic
| |
Collapse
|
4
|
Bevilacqua PC, Harris ME, Piccirilli JA, Gaines C, Ganguly A, Kostenbader K, Ekesan Ş, York DM. An Ontology for Facilitating Discussion of Catalytic Strategies of RNA-Cleaving Enzymes. ACS Chem Biol 2019; 14:1068-1076. [PMID: 31095369 PMCID: PMC6661149 DOI: 10.1021/acschembio.9b00202] [Citation(s) in RCA: 46] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
A predictive understanding of the mechanisms of RNA cleavage is important for the design of emerging technology built from biological and synthetic molecules that have promise for new biochemical and medicinal applications. Over the past 15 years, RNA cleavage reactions involving 2'-O-transphosphorylation have been discussed using a simplified framework introduced by Breaker that consists of four fundamental catalytic strategies (designated α, β, γ, and δ) that contribute to rate enhancement. As more detailed mechanistic data emerge, there is need for the framework to evolve and keep pace. We develop an ontology for discussion of strategies of enzymes that catalyze RNA cleavage via 2'-O-transphosphorylation that stratifies Breaker's framework into primary (1°), secondary (2°), and tertiary (3°) contributions to enable more precise interpretation of mechanism in the context of structure and bonding. Further, we point out instances where atomic-level changes give rise to changes in more than one catalytic contribution, a phenomenon we refer to as "functional blurring". We hope that this ontology will help clarify our conversations and pave the path forward toward a consensus view of these fundamental and fascinating mechanisms. The insight gained will deepen our understanding of RNA cleavage reactions catalyzed by natural protein and RNA enzymes, as well as aid in the design of new engineered DNA and synthetic enzymes.
Collapse
Affiliation(s)
- Philip C. Bevilacqua
- Department of Chemistry, Center for RNA Molecular Biology, and Department of Biochemistry, Microbiology, and Molecular Biology, The Pennsylvania State University, University Park, PA 16802, USA
| | - Michael E. Harris
- Department of Chemistry, University of Florida, Gainesville, Florida 32611, USA
| | - Joseph A. Piccirilli
- Department of Chemistry, and Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL 60637, USA
| | - Colin Gaines
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine, Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854-8087, USA
| | - Abir Ganguly
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine, Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854-8087, USA
| | - Ken Kostenbader
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine, Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854-8087, USA
| | - Şölen Ekesan
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine, Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854-8087, USA
| | - Darrin M. York
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine, Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854-8087, USA
| |
Collapse
|
5
|
Abstract
Purpose
The purpose of this paper is to investigate indexers’ evaluation on the usability of ontology vs thesaurus in representation of concepts and semantic relations. To do so, “searching” category of ASIS&T thesaurus was selected and ASIS&TOnto was built based on it.
Design/methodology/approach
The usability examination method is used in order to compare the two semantic tools. Nine indexers were recruited as participants, who were proficient in English language, had experience in using the thesaurus and all had successfully passed the course of “information representation.” They were asked to think aloud while working with the tools and to answer a semi-structured interview. The data gathering was continued until it reached its saturation point.
Findings
The results of this study revealed that the definitions and scope notes represented in indexing tools such as thesauri and ontologies have an important role in improvement of indexers’ understanding. On comparing the hierarchical relations, results show that converting the structure of hierarchical relationships of ASIS&T thesaurus can enhance the indexers understanding of them, and also enriching the associative relations of ASIS&T thesaurus can cause indexers to have a better understanding and evaluation of the presented concepts and relations.
Originality/value
This study shares our findings on the usability of ASIS&T thesaurus as a core set of vocabulary for building a “searching” domain as a prototype ontology in the area of library and information science and provides the indexers viewpoints of the two semantic tools in this area.
Collapse
|
6
|
Schudoma C. It's a loop world - single strands in RNA as structural and functional elements. Biomol Concepts 2015; 2:171-81. [PMID: 25962027 DOI: 10.1515/bmc.2011.016] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2011] [Accepted: 03/25/2011] [Indexed: 01/31/2023] Open
Abstract
Unpaired regions in RNA molecules - loops - are centrally involved in defining the characteristic three-dimensional (3D) architecture of RNAs and are of high interest in RNA engineering and design. Loops adopt diverse, but specific conformations stabilised by complex tertiary structural interactions that provide structural flexibility to RNA structures that would otherwise not be possible if they only consisted of the rigid A-helical shapes usually formed by canonical base pairing. By participating in sequence-non-local contacts, they furthermore contribute to stabilising the overall fold of RNA molecules. Interactions between RNAs and other nucleic acids, proteins, or small molecules are also generally mediated by RNA loop structures. Therefore, the function of an RNA molecule is generally dependent on its loops. Examples include intermolecular interactions between RNAs as part of the microRNA processing pathways, ribozymatic activity, or riboswitch-ligand interactions. Bioinformatics approaches have been successfully applied to the identification of novel RNA structural motifs including loops, local and global RNA 3D structure prediction, and structural and conformational analysis of RNAs and have contributed to a better understanding of the sequence-structure-function relationships in RNA loops.
Collapse
|
7
|
The RNA structurome: transcriptome-wide structure probing with next-generation sequencing. Trends Biochem Sci 2015; 40:221-32. [DOI: 10.1016/j.tibs.2015.02.005] [Citation(s) in RCA: 122] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2014] [Revised: 02/16/2015] [Accepted: 02/17/2015] [Indexed: 01/16/2023]
|
8
|
Abstract
Secondary structure diagrams are essential, in RNA biology, to communicate functional hypotheses and summarize structural data, and communicate them visually as drafts or finalized publication-ready figures. While many tools are currently available to automate the production of such diagrams, their capacities are usually partial, making it hard for a user to decide which to use in a given context. In this chapter, we guide the reader through the steps involved in the production of expressive publication-quality illustrations featuring the RNA secondary structure. We present major existing representations and layouts, and give precise instructions to produce them using available free software, including jViz.RNA, the PseudoViewer, RILogo, R-chie, RNAplot, R2R, and VARNA. We describe the file formats and structural descriptions accepted by popular RNA visualization tools. We also provide command lines and Python scripts to ease the user's access to advanced features. Finally, we discuss and illustrate alternative approaches to visualize the secondary structure in the presence of probing data, pseudoknots, RNA-RNA interactions, and comparative data.
Collapse
|
9
|
Coimbatore Narayanan B, Westbrook J, Ghosh S, Petrov AI, Sweeney B, Zirbel CL, Leontis NB, Berman HM. The Nucleic Acid Database: new features and capabilities. Nucleic Acids Res 2013; 42:D114-22. [PMID: 24185695 PMCID: PMC3964972 DOI: 10.1093/nar/gkt980] [Citation(s) in RCA: 145] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
The Nucleic Acid Database (NDB) (http://ndbserver.rutgers.edu) is a web portal providing access to information about 3D nucleic acid structures and their complexes. In addition to primary data, the NDB contains derived geometric data, classifications of structures and motifs, standards for describing nucleic acid features, as well as tools and software for the analysis of nucleic acids. A variety of search capabilities are available, as are many different types of reports. This article describes the recent redesign of the NDB Web site with special emphasis on new RNA-derived data and annotations and their implementation and integration into the search capabilities.
Collapse
Affiliation(s)
- Buvaneswari Coimbatore Narayanan
- Department of Chemistry and Chemical Biology, Center for Integrative Proteomics Research, Rutgers, the State University of New Jersey, 174 Frelinghuysen Road, Piscataway, NJ 08854-8076, USA, Department of Chemistry and Center for Biomolecular Sciences, Bowling Green State University, Bowling Green, OH 43403, USA and Department of Mathematics and Statistics, Bowling Green State University, Bowling Green, OH 43403, USA
| | | | | | | | | | | | | | | |
Collapse
|
10
|
A New Toolkit for Modeling RNA from a Pseudo-Torsional Space. J Mol Biol 2012; 421:1-5. [DOI: 10.1016/j.jmb.2012.05.027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
11
|
de Bono B, Hoehndorf R, Wimalaratne S, Gkoutos G, Grenon P. The RICORDO approach to semantic interoperability for biomedical data and models: strategy, standards and solutions. BMC Res Notes 2011; 4:313. [PMID: 21878109 PMCID: PMC3192696 DOI: 10.1186/1756-0500-4-313] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2011] [Accepted: 08/30/2011] [Indexed: 11/25/2022] Open
Abstract
Background The practice and research of medicine generates considerable quantities of data and model resources (DMRs). Although in principle biomedical resources are re-usable, in practice few can currently be shared. In particular, the clinical communities in physiology and pharmacology research, as well as medical education, (i.e. PPME communities) are facing considerable operational and technical obstacles in sharing data and models. Findings We outline the efforts of the PPME communities to achieve automated semantic interoperability for clinical resource documentation in collaboration with the RICORDO project. Current community practices in resource documentation and knowledge management are overviewed. Furthermore, requirements and improvements sought by the PPME communities to current documentation practices are discussed. The RICORDO plan and effort in creating a representational framework and associated open software toolkit for the automated management of PPME metadata resources is also described. Conclusions RICORDO is providing the PPME community with tools to effect, share and reason over clinical resource annotations. This work is contributing to the semantic interoperability of DMRs through ontology-based annotation by (i) supporting more effective navigation and re-use of clinical DMRs, as well as (ii) sustaining interoperability operations based on the criterion of biological similarity. Operations facilitated by RICORDO will range from automated dataset matching to model merging and managing complex simulation workflows. In effect, RICORDO is contributing to community standards for resource sharing and interoperability.
Collapse
Affiliation(s)
- Bernard de Bono
- European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK.
| | | | | | | | | |
Collapse
|
12
|
Rocca-Serra P, Bellaousov S, Birmingham A, Chen C, Cordero P, Das R, Davis-Neulander L, Duncan CD, Halvorsen M, Knight R, Leontis NB, Mathews DH, Ritz J, Stombaugh J, Weeks KM, Zirbel CL, Laederach A. Sharing and archiving nucleic acid structure mapping data. RNA (NEW YORK, N.Y.) 2011; 17:1204-12. [PMID: 21610212 PMCID: PMC3138558 DOI: 10.1261/rna.2753211] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Nucleic acids are particularly amenable to structural characterization using chemical and enzymatic probes. Each individual structure mapping experiment reveals specific information about the structure and/or dynamics of the nucleic acid. Currently, there is no simple approach for making these data publically available in a standardized format. We therefore developed a standard for reporting the results of single nucleotide resolution nucleic acid structure mapping experiments, or SNRNASMs. We propose a schema for sharing nucleic acid chemical probing data that uses generic public servers for storing, retrieving, and searching the data. We have also developed a consistent nomenclature (ontology) within the Ontology of Biomedical Investigations (OBI), which provides unique identifiers (termed persistent URLs, or PURLs) for classifying the data. Links to standardized data sets shared using our proposed format along with a tutorial and links to templates can be found at http://snrnasm.bio.unc.edu.
Collapse
Affiliation(s)
| | - Stanislav Bellaousov
- Department of Biochemistry and Biophysics and Center for RNA Biology, University of Rochester, Rochester, New York 14642, USA
| | | | - Chunxia Chen
- Biology Department, University of North Carolina, Chapel Hill, North Carolina 27599-3290, USA
| | - Pablo Cordero
- Biochemistry Department, Stanford University, Stanford, California 94305, USA
| | - Rhiju Das
- Biochemistry Department, Stanford University, Stanford, California 94305, USA
| | - Lauren Davis-Neulander
- Biology Department, University of North Carolina, Chapel Hill, North Carolina 27599-3290, USA
| | - Caia D.S. Duncan
- Department of Chemistry, University of North Carolina, Chapel Hill, North Carolina 27599-3290, USA
| | - Matthew Halvorsen
- Biology Department, University of North Carolina, Chapel Hill, North Carolina 27599-3290, USA
| | - Rob Knight
- Department of Chemistry and Biochemistry, University of Colorado, Boulder, Colorado 80309, USA
- Howard Hughes Medical Institute, Boulder, Colorado 80309, USA
| | - Neocles B. Leontis
- Department of Chemistry, Bowling Green State University, Bowling Green, Ohio 43403, USA
| | - David H. Mathews
- Department of Biochemistry and Biophysics and Center for RNA Biology, University of Rochester, Rochester, New York 14642, USA
| | - Justin Ritz
- Biology Department, University of North Carolina, Chapel Hill, North Carolina 27599-3290, USA
| | - Jesse Stombaugh
- Department of Chemistry and Biochemistry, University of Colorado, Boulder, Colorado 80309, USA
| | - Kevin M. Weeks
- Department of Chemistry, University of North Carolina, Chapel Hill, North Carolina 27599-3290, USA
| | - Craig L. Zirbel
- Department of Mathematics and Statistics, Bowling Green State University, Bowling Green, Ohio 43403, USA
| | - Alain Laederach
- Biology Department, University of North Carolina, Chapel Hill, North Carolina 27599-3290, USA
- Corresponding author.E-mail .
| |
Collapse
|
13
|
Davis AR, Kirkpatrick CC, Znosko BM. Structural characterization of naturally occurring RNA single mismatches. Nucleic Acids Res 2010; 39:1081-94. [PMID: 20876693 PMCID: PMC3035445 DOI: 10.1093/nar/gkq793] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
RNA is known to be involved in several cellular processes; however, it is only active when it is folded into its correct 3D conformation. The folding, bending and twisting of an RNA molecule is dependent upon the multitude of canonical and non-canonical secondary structure motifs. These motifs contribute to the structural complexity of RNA but also serve important integral biological functions, such as serving as recognition and binding sites for other biomolecules or small ligands. One of the most prevalent types of RNA secondary structure motifs are single mismatches, which occur when two canonical pairs are separated by a single non-canonical pair. To determine sequence–structure relationships and to identify structural patterns, we have systematically located, annotated and compared all available occurrences of the 30 most frequently occurring single mismatch-nearest neighbor sequence combinations found in experimentally determined 3D structures of RNA-containing molecules deposited into the Protein Data Bank. Hydrogen bonding, stacking and interaction of nucleotide edges for the mismatched and nearest neighbor base pairs are described and compared, allowing for the identification of several structural patterns. Such a database and comparison will allow researchers to gain insight into the structural features of unstudied sequences and to quickly look-up studied sequences.
Collapse
Affiliation(s)
- Amber R Davis
- Department of Chemistry, Saint Louis University, St Louis, MO 63103, USA
| | | | | |
Collapse
|
14
|
Cruz-Toledo J, Dumontier M, Parisien M, Major F. RKB: a Semantic Web knowledge base for RNA. J Biomed Semantics 2010; 1 Suppl 1:S2. [PMID: 20626922 PMCID: PMC2903721 DOI: 10.1186/2041-1480-1-s1-s2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Increasingly sophisticated knowledge about RNA structure and function requires an inclusive knowledge representation that facilitates the integration of independently -generated information arising from such efforts as genome sequencing projects, microarray analyses, structure determination and RNA SELEX experiments. While RNAML, an XML-based representation, has been proposed as an exchange format for a select subset of information, it lacks domain-specific semantics that are essential for answering questions that require expert knowledge. Here, we describe an RNA knowledge base (RKB) for structure-based knowledge using RDF/OWL Semantic Web technologies. RKB extends a number of ontologies and contains basic terminology for nucleic acid composition along with context/model-specific structural features such as sugar conformations, base pairings and base stackings. RKB (available at http://semanticscience.org/projects/rkb) is populated with PDB entries and MC-Annotate structural annotation. We show queries to the RKB using description logic reasoning, thus opening the door to question answering over independently-published RNA knowledge using Semantic Web technologies.
Collapse
Affiliation(s)
- Jose Cruz-Toledo
- Department of Biology, Carleton University 1125 Colonel By Drive, K1S5B6, Ottawa, Canada .
| | | | | | | |
Collapse
|
15
|
Mungall CJ, Batchelor C, Eilbeck K. Evolution of the Sequence Ontology terms and relationships. J Biomed Inform 2010; 44:87-93. [PMID: 20226267 DOI: 10.1016/j.jbi.2010.03.002] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2009] [Revised: 02/11/2010] [Accepted: 03/05/2010] [Indexed: 10/19/2022]
Abstract
The Sequence Ontology is an established ontology, with a large user community, for the purpose of genomic annotation. We are reforming the ontology to provide better terms and relationships to describe the features of biological sequence, for both genomic and derived sequence. The SO is working within the guidelines of the OBO Foundry to provide interoperability between SO and the other related OBO ontologies. Here, we report changes and improvements made to SO including new relationships to better define the mereological, spatial and temporal aspects of biological sequence.
Collapse
|
16
|
Alexander RW, Eargle J, Luthey-Schulten Z. Experimental and computational determination of tRNA dynamics. FEBS Lett 2009; 584:376-86. [PMID: 19932098 DOI: 10.1016/j.febslet.2009.11.061] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2009] [Revised: 11/14/2009] [Accepted: 11/16/2009] [Indexed: 10/20/2022]
Abstract
As the molecular representation of the genetic code, tRNA plays a central role in the translational machinery where it interacts with several proteins and other RNAs during the course of protein synthesis. These interactions exploit the dynamic flexibility of tRNA. In this minireview, we discuss the effects of modified bases, ions, and proteins on tRNA structure and dynamics and the challenges of observing its motions over the cycle of translation.
Collapse
Affiliation(s)
- Rebecca W Alexander
- Department of Chemistry, Wake Forest University, Winston-Salem, NC 27109-7486, United States.
| | | | | |
Collapse
|
17
|
Brown JW, Birmingham A, Griffiths PE, Jossinet F, Kachouri-Lafond R, Knight R, Lang BF, Leontis N, Steger G, Stombaugh J, Westhof E. The RNA structure alignment ontology. RNA (NEW YORK, N.Y.) 2009; 15:1623-31. [PMID: 19622678 PMCID: PMC2743057 DOI: 10.1261/rna.1601409] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2009] [Accepted: 05/26/2009] [Indexed: 05/19/2023]
Abstract
Multiple sequence alignments are powerful tools for understanding the structures, functions, and evolutionary histories of linear biological macromolecules (DNA, RNA, and proteins), and for finding homologs in sequence databases. We address several ontological issues related to RNA sequence alignments that are informed by structure. Multiple sequence alignments are usually shown as two-dimensional (2D) matrices, with rows representing individual sequences, and columns identifying nucleotides from different sequences that correspond structurally, functionally, and/or evolutionarily. However, the requirement that sequences and structures correspond nucleotide-by-nucleotide is unrealistic and hinders representation of important biological relationships. High-throughput sequencing efforts are also rapidly making 2D alignments unmanageable because of vertical and horizontal expansion as more sequences are added. Solving the shortcomings of traditional RNA sequence alignments requires explicit annotation of the meaning of each relationship within the alignment. We introduce the notion of "correspondence," which is an equivalence relation between RNA elements in sets of sequences as the basis of an RNA alignment ontology. The purpose of this ontology is twofold: first, to enable the development of new representations of RNA data and of software tools that resolve the expansion problems with current RNA sequence alignments, and second, to facilitate the integration of sequence data with secondary and three-dimensional structural information, as well as other experimental information, to create simultaneously more accurate and more exploitable RNA alignments.
Collapse
|
18
|
Prosdocimi F, Chisham B, Pontelli E, Thompson JD, Stoltzfus A. Initial implementation of a comparative data analysis ontology. Evol Bioinform Online 2009; 5:47-66. [PMID: 19812726 PMCID: PMC2747124 DOI: 10.4137/ebo.s2320] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Comparative analysis is used throughout biology. When entities under comparison (e.g. proteins, genomes, species) are related by descent, evolutionary theory provides a framework that, in principle, allows N-ary comparisons of entities, while controlling for non-independence due to relatedness. Powerful software tools exist for specialized applications of this approach, yet it remains under-utilized in the absence of a unifying informatics infrastructure. A key step in developing such an infrastructure is the definition of a formal ontology. The analysis of use cases and existing formalisms suggests that a significant component of evolutionary analysis involves a core problem of inferring a character history, relying on key concepts: “Operational Taxonomic Units” (OTUs), representing the entities to be compared; “character-state data” representing the observations compared among OTUs; “phylogenetic tree”, representing the historical path of evolution among the entities; and “transitions”, the inferred evolutionary changes in states of characters that account for observations. Using the Web Ontology Language (OWL), we have defined these and other fundamental concepts in a Comparative Data Analysis Ontology (CDAO). CDAO has been evaluated for its ability to represent token data sets and to support simple forms of reasoning. With further development, CDAO will provide a basis for tools (for semantic transformation, data retrieval, validation, integration, etc.) that make it easier for software developers and biomedical researchers to apply evolutionary methods of inference to diverse types of data, so as to integrate this powerful framework for reasoning into their research.
Collapse
Affiliation(s)
- Francisco Prosdocimi
- Department of Structural Biology and Genomics, Institut de Génétique et de Biologie Moléculaire et Cellulaire (IGBMC), F-67400 Illkirch, France
| | | | | | | | | |
Collapse
|
19
|
Analysis of four-way junctions in RNA structures. J Mol Biol 2009; 390:547-59. [PMID: 19445952 DOI: 10.1016/j.jmb.2009.04.084] [Citation(s) in RCA: 83] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2009] [Revised: 04/21/2009] [Accepted: 04/30/2009] [Indexed: 11/24/2022]
Abstract
RNA secondary structures can be divided into helical regions composed of canonical Watson-Crick and related base pairs, as well as single-stranded regions such as hairpin loops, internal loops, and junctions. These elements function as building blocks in the design of diverse RNA molecules with various fundamental functions in the cell. To better understand the intricate architecture of three-dimensional (3D) RNAs, we analyze existing RNA four-way junctions in terms of base-pair interactions and 3D configurations. Specifically, we identify nine broad junction families according to coaxial stacking patterns and helical configurations. We find that helices within junctions tend to arrange in roughly parallel and perpendicular patterns and stabilize their conformations using common tertiary motifs such as coaxial stacking, loop-helix interaction, and helix packing interaction. Our analysis also reveals a number of highly conserved base-pair interaction patterns and novel tertiary motifs such as A-minor-coaxial stacking combinations and sarcin/ricin motif variants. Such analyses of RNA building blocks can ultimately help in the difficult task of RNA 3D structure prediction.
Collapse
|
20
|
Fast Structural Alignment of Biomolecules Using a Hash Table, N-Grams and String Descriptors. ALGORITHMS 2009. [DOI: 10.3390/a2020692] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
|
21
|
Stombaugh J, Zirbel CL, Westhof E, Leontis NB. Frequency and isostericity of RNA base pairs. Nucleic Acids Res 2009; 37:2294-312. [PMID: 19240142 PMCID: PMC2673412 DOI: 10.1093/nar/gkp011] [Citation(s) in RCA: 147] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Most of the hairpin, internal and junction loops that appear single-stranded in standard RNA secondary structures form recurrent 3D motifs, where non-Watson-Crick base pairs play a central role. Non-Watson-Crick base pairs also play crucial roles in tertiary contacts in structured RNA molecules. We previously classified RNA base pairs geometrically so as to group together those base pairs that are structurally similar (isosteric) and therefore able to substitute for each other by mutation without disrupting the 3D structure. Here, we introduce a quantitative measure of base pair isostericity, the IsoDiscrepancy Index (IDI), to more accurately determine which base pair substitutions can potentially occur in conserved motifs. We extract and classify base pairs from a reduced-redundancy set of RNA 3D structures from the Protein Data Bank (PDB) and calculate centroids (exemplars) for each base combination and geometric base pair type (family). We use the exemplars and IDI values to update our online Basepair Catalog and the Isostericity Matrices (IM) for each base pair family. From the database of base pairs observed in 3D structures we derive base pair occurrence frequencies for each of the 12 geometric base pair families. In order to improve the statistics from the 3D structures, we also derive base pair occurrence frequencies from rRNA sequence alignments.
Collapse
Affiliation(s)
- Jesse Stombaugh
- Department of Biological Sciences, Bowling Green State University, Bowling Green, OH 43403, USA
| | | | | | | |
Collapse
|
22
|
Prosdocimi F, Chisham B, Pontelli E, Stoltzfus A, Thompson JD. Knowledge Standardization in Evolutionary Biology: The Comparative Data Analysis Ontology. Evol Biol 2009. [DOI: 10.1007/978-3-642-00952-5_12] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
23
|
Laederach A, Das R, Vicens Q, Pearlman SM, Brenowitz M, Herschlag D, Altman RB. Semiautomated and rapid quantification of nucleic acid footprinting and structure mapping experiments. Nat Protoc 2008; 3:1395-401. [PMID: 18772866 DOI: 10.1038/nprot.2008.134] [Citation(s) in RCA: 62] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
We have developed protocols for rapidly quantifying the band intensities from nucleic acid chemical mapping gels at single-nucleotide resolution. These protocols are implemented in the software SAFA (semi-automated footprinting analysis) that can be downloaded without charge from http://safa.stanford.edu. The protocols implemented in SAFA have five steps: (i) lane identification, (ii) gel rectification, (iii) band assignment, (iv) model fitting and (v) band-intensity normalization. SAFA enables the rapid quantitation of gel images containing thousands of discrete bands, thereby eliminating a bottleneck to the analysis of chemical mapping experiments. An experienced user of the software can quantify a gel image in approximately 20 min. Although SAFA was developed to analyze hydroxyl radical (*OH) footprints, it effectively quantifies the gel images obtained with other types of chemical mapping probes. We also present a series of tutorial movies that illustrate the best practices and different steps in the SAFA analysis as a supplement to this protocol.
Collapse
Affiliation(s)
- Alain Laederach
- Department of Developmental Genetics and Bioinformatics, Wadsworth Center, Albany, New York 12208, USA.
| | | | | | | | | | | | | |
Collapse
|
24
|
Nasalean L, Stombaugh J, Zirbel CL, Leontis NB. RNA 3D Structural Motifs: Definition, Identification, Annotation, and Database Searching. NON-PROTEIN CODING RNAS 2008. [DOI: 10.1007/978-3-540-70840-7_1] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
|
25
|
Czerwoniec A, Dunin-Horkawicz S, Purta E, Kaminska KH, Kasprzak JM, Bujnicki JM, Grosjean H, Rother K. MODOMICS: a database of RNA modification pathways. 2008 update. Nucleic Acids Res 2008; 37:D118-21. [PMID: 18854352 PMCID: PMC2686465 DOI: 10.1093/nar/gkn710] [Citation(s) in RCA: 175] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
MODOMICS, a database devoted to the systems biology of RNA modification, has been subjected to substantial improvements. It provides comprehensive information on the chemical structure of modified nucleosides, pathways of their biosynthesis, sequences of RNAs containing these modifications and RNA-modifying enzymes. MODOMICS also provides cross-references to other databases and to literature. In addition to the previously available manually curated tRNA sequences from a few model organisms, we have now included additional tRNAs and rRNAs, and all RNAs with 3D structures in the Nucleic Acid Database, in which modified nucleosides are present. In total, 3460 modified bases in RNA sequences of different organisms have been annotated. New RNA-modifying enzymes have been also added. The current collection of enzymes includes mainly proteins for the model organisms Escherichia coli and Saccharomyces cerevisiae, and is currently being expanded to include proteins from other organisms, in particular Archaea and Homo sapiens. For enzymes with known structures, links are provided to the corresponding Protein Data Bank entries, while for many others homology models have been created. Many new options for database searching and querying have been included. MODOMICS can be accessed at http://genesilico.pl/modomics.
Collapse
Affiliation(s)
- Anna Czerwoniec
- Bioinformatics Laboratory, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, Umultowska 89, PL-61-614 Poznan, Poland, Max Planck Institute for Developmental Biology, Department 1, Protein Evolution Spemannstr. 35, 72076 Tuebingen, Germany, Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Ks. Trojdena 4, PL-02-190 Warsaw, Poland, Institute of Biochemistry and Biophysics PAS, Pawinskiego 5a, 02-106 Warsaw and IGM, Univ Paris-Sud, UMR 8621, Orsay, F 91405, France
| | - Stanislaw Dunin-Horkawicz
- Bioinformatics Laboratory, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, Umultowska 89, PL-61-614 Poznan, Poland, Max Planck Institute for Developmental Biology, Department 1, Protein Evolution Spemannstr. 35, 72076 Tuebingen, Germany, Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Ks. Trojdena 4, PL-02-190 Warsaw, Poland, Institute of Biochemistry and Biophysics PAS, Pawinskiego 5a, 02-106 Warsaw and IGM, Univ Paris-Sud, UMR 8621, Orsay, F 91405, France
| | - Elzbieta Purta
- Bioinformatics Laboratory, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, Umultowska 89, PL-61-614 Poznan, Poland, Max Planck Institute for Developmental Biology, Department 1, Protein Evolution Spemannstr. 35, 72076 Tuebingen, Germany, Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Ks. Trojdena 4, PL-02-190 Warsaw, Poland, Institute of Biochemistry and Biophysics PAS, Pawinskiego 5a, 02-106 Warsaw and IGM, Univ Paris-Sud, UMR 8621, Orsay, F 91405, France
| | - Katarzyna H. Kaminska
- Bioinformatics Laboratory, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, Umultowska 89, PL-61-614 Poznan, Poland, Max Planck Institute for Developmental Biology, Department 1, Protein Evolution Spemannstr. 35, 72076 Tuebingen, Germany, Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Ks. Trojdena 4, PL-02-190 Warsaw, Poland, Institute of Biochemistry and Biophysics PAS, Pawinskiego 5a, 02-106 Warsaw and IGM, Univ Paris-Sud, UMR 8621, Orsay, F 91405, France
| | - Joanna M. Kasprzak
- Bioinformatics Laboratory, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, Umultowska 89, PL-61-614 Poznan, Poland, Max Planck Institute for Developmental Biology, Department 1, Protein Evolution Spemannstr. 35, 72076 Tuebingen, Germany, Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Ks. Trojdena 4, PL-02-190 Warsaw, Poland, Institute of Biochemistry and Biophysics PAS, Pawinskiego 5a, 02-106 Warsaw and IGM, Univ Paris-Sud, UMR 8621, Orsay, F 91405, France
| | - Janusz M. Bujnicki
- Bioinformatics Laboratory, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, Umultowska 89, PL-61-614 Poznan, Poland, Max Planck Institute for Developmental Biology, Department 1, Protein Evolution Spemannstr. 35, 72076 Tuebingen, Germany, Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Ks. Trojdena 4, PL-02-190 Warsaw, Poland, Institute of Biochemistry and Biophysics PAS, Pawinskiego 5a, 02-106 Warsaw and IGM, Univ Paris-Sud, UMR 8621, Orsay, F 91405, France
| | - Henri Grosjean
- Bioinformatics Laboratory, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, Umultowska 89, PL-61-614 Poznan, Poland, Max Planck Institute for Developmental Biology, Department 1, Protein Evolution Spemannstr. 35, 72076 Tuebingen, Germany, Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Ks. Trojdena 4, PL-02-190 Warsaw, Poland, Institute of Biochemistry and Biophysics PAS, Pawinskiego 5a, 02-106 Warsaw and IGM, Univ Paris-Sud, UMR 8621, Orsay, F 91405, France
| | - Kristian Rother
- Bioinformatics Laboratory, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, Umultowska 89, PL-61-614 Poznan, Poland, Max Planck Institute for Developmental Biology, Department 1, Protein Evolution Spemannstr. 35, 72076 Tuebingen, Germany, Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Ks. Trojdena 4, PL-02-190 Warsaw, Poland, Institute of Biochemistry and Biophysics PAS, Pawinskiego 5a, 02-106 Warsaw and IGM, Univ Paris-Sud, UMR 8621, Orsay, F 91405, France
- *To whom correspondence should be addressed. Tel: +48-22 597 0752; Fax: +48 22 597 0715;
| |
Collapse
|
26
|
Vicens Q, Paukstelis PJ, Westhof E, Lambowitz AM, Cech TR. Toward predicting self-splicing and protein-facilitated splicing of group I introns. RNA (NEW YORK, N.Y.) 2008; 14:2013-2029. [PMID: 18768647 PMCID: PMC2553746 DOI: 10.1261/rna.1027208] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/02/2008] [Accepted: 07/08/2008] [Indexed: 05/26/2023]
Abstract
In the current era of massive discoveries of noncoding RNAs within genomes, being able to infer a function from a nucleotide sequence is of paramount interest. Although studies of individual group I introns have identified self-splicing and nonself-splicing examples, there is no overall understanding of the prevalence of self-splicing or the factors that determine it among the >2300 group I introns sequenced to date. Here, the self-splicing activities of 12 group I introns from various organisms were assayed under six reaction conditions that had been shown previously to promote RNA catalysis for different RNAs. Besides revealing that assessing self-splicing under only one condition can be misleading, this survey emphasizes that in vitro self-splicing efficiency is correlated with the GC content of the intron (>35% GC was generally conductive to self-splicing), and with the ability of the introns to form particular tertiary interactions. Addition of the Neurospora crassa CYT-18 protein activated splicing of two nonself-splicing introns, but inhibited the second step of self-splicing for two others. Together, correlations between sequence, predicted structure and splicing begin to establish rules that should facilitate our ability to predict the self-splicing activity of any group I intron from its sequence.
Collapse
Affiliation(s)
- Quentin Vicens
- Howard Hughes Medical Institute, University of Colorado, Department of Chemistry and Biochemistry, Boulder, Colorado 80309-0215, USA.
| | | | | | | | | |
Collapse
|
27
|
Abstract
Riboswitches are RNAs capable of binding cellular metabolites using a diverse array of secondary and tertiary structures to modulate gene expression. The recent determination of the three-dimensional structures of parts of six different riboswitches illuminates common features that allow riboswitches to be grouped into one of two types. Type I riboswitches, as exemplified by the purine riboswitch, are characterized by a single, localized binding pocket supported by a largely pre-established global fold. This arrangement limits ligand-induced conformational changes in the RNA to a small region. In contrast, Type II riboswitches, such as the thiamine pyrophosphate riboswitch, contain binding pockets split into at least two spatially distinct sites. As a result, binding induces both local changes to the binding pocket and global architecture. Similar organizational themes are found in other noncoding RNAs, making it possible to begin to build a hierarchical classification of RNA structure based on the spatial organization of their active sites and associated secondary structural elements.
Collapse
Affiliation(s)
- Rebecca K Montange
- Department of Chemistry and Biochemistry, University of Colorado, Boulder, Colorado 80309, USA
| | | |
Collapse
|
28
|
Schuster P. Modeling in biological chemistry. From biochemical kinetics to systems biology. MONATSHEFTE FUR CHEMIE 2008. [DOI: 10.1007/s00706-008-0892-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
29
|
Richardson JS, Schneider B, Murray LW, Kapral GJ, Immormino RM, Headd JJ, Richardson DC, Ham D, Hershkovits E, Williams LD, Keating KS, Pyle AM, Micallef D, Westbrook J, Berman HM. RNA backbone: consensus all-angle conformers and modular string nomenclature (an RNA Ontology Consortium contribution). RNA (NEW YORK, N.Y.) 2008; 14:465-81. [PMID: 18192612 PMCID: PMC2248255 DOI: 10.1261/rna.657708] [Citation(s) in RCA: 190] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/24/2007] [Accepted: 10/29/2007] [Indexed: 05/03/2023]
Abstract
A consensus classification and nomenclature are defined for RNA backbone structure using all of the backbone torsion angles. By a consensus of several independent analysis methods, 46 discrete conformers are identified as suitably clustered in a quality-filtered, multidimensional dihedral angle distribution. Most of these conformers represent identifiable features or roles within RNA structures. The conformers are given two-character names that reflect the seven-angle delta epsilon zeta alpha beta gamma delta combinations empirically found favorable for the sugar-to-sugar "suite" unit within which the angle correlations are strongest (e.g., 1a for A-form, 5z for the start of S-motifs). Since the half-nucleotides are specified by a number for delta epsilon zeta and a lowercase letter for alpha beta gamma delta, this modular system can also be parsed to describe traditional nucleotide units (e.g., a1) or the dinucleotides (e.g., a1a1) that are especially useful at the level of crystallographic map fitting. This nomenclature can also be written as a string with two-character suite names between the uppercase letters of the base sequence (N1aG1gN1aR1aA1cN1a for a GNRA tetraloop), facilitating bioinformatic comparisons. Cluster means, standard deviations, coordinates, and examples are made available, as well as the Suitename software that assigns suite conformer names and conformer match quality (suiteness) from atomic coordinates. The RNA Ontology Consortium will combine this new backbone system with others that define base pairs, base-stacking, and hydrogen-bond relationships to provide a full description of RNA structural motifs.
Collapse
Affiliation(s)
- Jane S Richardson
- Department of Biochemistry, Duke University Medical Center, Durham, North Carolina, 27710-3711, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
30
|
Smit S, Rother K, Heringa J, Knight R. From knotted to nested RNA structures: a variety of computational methods for pseudoknot removal. RNA (NEW YORK, N.Y.) 2008; 14:410-6. [PMID: 18230758 PMCID: PMC2248259 DOI: 10.1261/rna.881308] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/07/2023]
Abstract
Pseudoknots are abundant in RNA structures. Many computational analyses require pseudoknot-free structures, which means that some of the base pairs in the knotted structure must be disregarded to obtain a nested structure. There is a surprising diversity of methods to perform this pseudoknot removal task, but these methods are often poorly described and studies can therefore be difficult to reproduce (in part, because different procedures may be intuitively obvious to different investigators). Here we provide a variety of algorithms for pseudoknot removal, some of which can incorporate sequence or alignment information in the removal process. We demonstrate that different methods lead to different results, which might affect structure-based analyses. This work thus provides a starting point for discussion of the extent to which these different methods recapture the underlying biological reality. We provide access to reference implementations through a web interface (at http://www.ibi.vu.nl/programs/k2nwww), and the source code is available in the PyCogent project.
Collapse
Affiliation(s)
- Sandra Smit
- Centre for Integrative Bioinformatics VU (IBIVU), Vrije Universiteit Amsterdam, 1081 HV Amsterdam, The Netherlands
| | | | | | | |
Collapse
|
31
|
Smith B, Ashburner M, Rosse C, Bard J, Bug W, Ceusters W, Goldberg LJ, Eilbeck K, Ireland A, Mungall CJ, Leontis N, Rocca-Serra P, Ruttenberg A, Sansone SA, Scheuermann RH, Shah N, Whetzel PL, Lewis S. The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration. Nat Biotechnol 2008; 25:1251-5. [PMID: 17989687 DOI: 10.1038/nbt1346] [Citation(s) in RCA: 1163] [Impact Index Per Article: 68.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The value of any kind of data is greatly enhanced when it exists in a form that allows it to be integrated with other data. One approach to integration is through the annotation of multiple bodies of data using common controlled vocabularies or 'ontologies'. Unfortunately, the very success of this approach has led to a proliferation of ontologies, which itself creates obstacles to integration. The Open Biomedical Ontologies (OBO) consortium is pursuing a strategy to overcome this problem. Existing OBO ontologies, including the Gene Ontology, are undergoing coordinated reform, and new ontologies are being created on the basis of an evolving set of shared principles governing ontology development. The result is an expanding family of ontologies designed to be interoperable and logically well formed and to incorporate accurate representations of biological reality. We describe this OBO Foundry initiative and provide guidelines for those who might wish to become involved.
Collapse
Affiliation(s)
- Barry Smith
- Department of Philosophy and New York State Center of Excellence in Bioinformatics and Life Sciences, University at Buffalo, Buffalo, New York 14203, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
32
|
Kim DH, Shreenivasaiah PK, Hong S, Kim T, Song HK. Current research trends in systems biology. Anim Cells Syst (Seoul) 2008. [DOI: 10.1080/19768354.2008.9647172] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
|
33
|
Lisi V, Major F. A comparative analysis of the triloops in all high-resolution RNA structures reveals sequence structure relationships. RNA (NEW YORK, N.Y.) 2007; 13:1537-45. [PMID: 17652406 PMCID: PMC1950765 DOI: 10.1261/rna.597507] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
Despite an increasing number of experimentally determined RNA structures, the gap between the number of structures and that of RNA families is still growing. To overcome this limitation, efficient and reliable RNA modeling methodologies must be developed. In order to reach this goal, here, we show how triloop sequence-structure relationships have been inferred through a systematic analysis of all triloops found in available high-resolution structures. The structural annotation of all triloops allowed us to define discrete states of the triloop's conformational space, and therefore an explicit sequence-to-structure relation. The sequence-structure relationships inferred from this explicit relation are presented in a convenient modeling table that provides a limited set of possible three-dimensional structures given any triloop sequence. The table is indexed by the two nucleotides that form the triloop's flanking base pair, since they are shown to provide the most information about the triloop three-dimensional structures. We also report the observations in the X-ray crystallographic structures of important conformational variations, which we believe might be the result of RNA dynamic.
Collapse
Affiliation(s)
- Véronique Lisi
- Institute for Research in Immunology and Cancer, Department of Computer Science and Operations Research, Université de Montréal, Québec, Canada
| | | |
Collapse
|
34
|
Sarver M, Zirbel CL, Stombaugh J, Mokdad A, Leontis NB. FR3D: finding local and composite recurrent structural motifs in RNA 3D structures. J Math Biol 2007; 56:215-52. [PMID: 17694311 PMCID: PMC2837920 DOI: 10.1007/s00285-007-0110-x] [Citation(s) in RCA: 192] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2006] [Revised: 06/30/2006] [Indexed: 11/26/2022]
Abstract
New methods are described for finding recurrent three-dimensional (3D) motifs in RNA atomic-resolution structures. Recurrent RNA 3D motifs are sets of RNA nucleotides with similar spatial arrangements. They can be local or composite. Local motifs comprise nucleotides that occur in the same hairpin or internal loop. Composite motifs comprise nucleotides belonging to three or more different RNA strand segments or molecules. We use a base-centered approach to construct efficient, yet exhaustive search procedures using geometric, symbolic, or mixed representations of RNA structure that we implement in a suite of MATLAB programs, "Find RNA 3D" (FR3D). The first modules of FR3D preprocess structure files to classify base-pair and -stacking interactions. Each base is represented geometrically by the position of its glycosidic nitrogen in 3D space and by the rotation matrix that describes its orientation with respect to a common frame. Base-pairing and base-stacking interactions are calculated from the base geometries and are represented symbolically according to the Leontis/Westhof basepairing classification, extended to include base-stacking. These data are stored and used to organize motif searches. For geometric searches, the user supplies the 3D structure of a query motif which FR3D uses to find and score geometrically similar candidate motifs, without regard to the sequential position of their nucleotides in the RNA chain or the identity of their bases. To score and rank candidate motifs, FR3D calculates a geometric discrepancy by rigidly rotating candidates to align optimally with the query motif and then comparing the relative orientations of the corresponding bases in the query and candidate motifs. Given the growing size of the RNA structure database, it is impossible to explicitly compute the discrepancy for all conceivable candidate motifs, even for motifs with less than ten nucleotides. The screening algorithm that we describe finds all candidate motifs whose geometric discrepancy with respect to the query motif falls below a user-specified cutoff discrepancy. This technique can be applied to RMSD searches. Candidate motifs identified geometrically may be further screened symbolically to identify those that contain particular basepair types or base-stacking arrangements or that conform to sequence continuity or nucleotide identity constraints. Purely symbolic searches for motifs containing user-defined sequence, continuity and interaction constraints have also been implemented. We demonstrate that FR3D finds all occurrences, both local and composite and with nucleotide substitutions, of sarcin/ricin and kink-turn motifs in the 23S and 5S ribosomal RNA 3D structures of the H. marismortui 50S ribosomal subunit and assigns the lowest discrepancy scores to bona fide examples of these motifs. The search algorithms have been optimized for speed to allow users to search the non-redundant RNA 3D structure database on a personal computer in a matter of minutes.
Collapse
Affiliation(s)
- Michael Sarver
- Department of Mathematics and Statistics, Bowling Green State University, Bowling Green, OH 43403, USA
| | - Craig L. Zirbel
- Department of Mathematics and Statistics, Bowling Green State University, Bowling Green, OH 43403, USA
| | - Jesse Stombaugh
- Department of Chemistry, Bowling Green State University, Bowling Green, OH 43403, USA
| | - Ali Mokdad
- Department of Chemistry, Bowling Green State University, Bowling Green, OH 43403, USA
| | - Neocles B. Leontis
- Department of Chemistry, Bowling Green State University, Bowling Green, OH 43403, USA
| |
Collapse
|
35
|
Affiliation(s)
- Philip C Bevilacqua
- Department of Chemistry, The Pennsylvania State University, University Park, Pennsylvania 16802, USA.
| | | |
Collapse
|
36
|
Abstract
RNA research has made great progress in recent years. A variety of unforeseen complexities have been identified, many with relevance to human brain disease. For example, neurologic illnesses may arise because of perturbations in distinct but interrelated tiers of RNA-based genetic regulation: pre-mRNA splicing; nonsplicing RNA modifications; and mRNA translational regulation. Furthermore, there is poor correlation between mRNA levels and protein levels in mammalian cells, due partly to complicated post-transcriptional regulation by hitherto unknown noncoding RNAs. Some noncoding RNAs have been shown to be involved in human brain diseases. Diseases potentially mediated by alterations in RNA processes include tauopathies, myotonic dystrophy, Alzheimer disease, brain cancer, and many others. Here we present an overview of new research highlighting functions for RNA that far surpass the "messenger in the middle" role and that identify RNA molecules as important agents in the human brain in health and in disease states.
Collapse
Affiliation(s)
- Peter T Nelson
- Department of Pathology and Division of Neuropathology, University of Kentucky, Sanders-Brown Center on Aging, Lexington, Kentucky 40536-0230, USA.
| | | |
Collapse
|
37
|
Abstract
The world of regulatory RNAs is fast expanding into mainstream molecular biology as both a subject of intense mechanistic study and as a tool for functional characterization. The RNA world is one of complex structures that carry out catalysis, sense metabolites and synthesize proteins. The dynamic and structural nature of RNAs presents a whole new set of informatics challenges to the computational community. The ability to relate structure and dynamics to function will be key to understanding this complex world. I review several important classes of structured RNAs that present our community with a series of biologically novel informatics challenges. I also review available informatics tools that have been recently developed in the field.
Collapse
Affiliation(s)
- Alain Laederach
- Department of Genetics, 300 Pasteur Drive, Stanford University, Stanford, CA 94305, USA.
| |
Collapse
|
38
|
Abstract
The ribosome is responsible for protein synthesis, the translation of the genetic code, in all living organisms. Ribosomes are composed of RNA (ribosomal RNA) and protein (ribosomal protein). Soluble protein factors bind to the ribosome and facilitate different phases of translation. Genetic approaches have proved useful for the identification and characterization of the structural and functional roles of specific nucleotides in ribosomal RNA and of specific amino acids in ribosomal proteins and in ribosomal factors. This chapter summarizes examples of mutations identified in ribosomal RNA, ribosomal proteins, and ribosomal factors.
Collapse
MESH Headings
- Animals
- Base Sequence
- DNA Mutational Analysis
- Humans
- Mutation
- Nucleic Acid Conformation
- Peptide Elongation Factors/genetics
- Peptide Initiation Factors/genetics
- Peptide Termination Factors/genetics
- Protein Subunits/genetics
- RNA, Ribosomal, 16S/analysis
- RNA, Ribosomal, 16S/chemistry
- RNA, Ribosomal, 16S/physiology
- RNA, Ribosomal, 23S/analysis
- RNA, Ribosomal, 23S/chemistry
- RNA, Ribosomal, 23S/physiology
- Ribosomal Proteins/genetics
- Ribosomes/genetics
- Sequence Analysis, RNA
Collapse
Affiliation(s)
- Kathleen L Triman
- Department of Biology, Franklin and Marshall College, Lancaster, PA 17604, USA
| |
Collapse
|
39
|
Jossinet F, Ludwig TE, Westhof E. RNA structure: bioinformatic analysis. Curr Opin Microbiol 2007; 10:279-85. [PMID: 17548241 DOI: 10.1016/j.mib.2007.05.010] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2007] [Accepted: 05/23/2007] [Indexed: 01/30/2023]
Abstract
The range of functions ascribed to RNA molecules has grown considerably during recent years. Consequently, the analysis and comparison of RNA sequences have become recurrent tasks in molecular biology. Because the biological function of an RNA is expressed more by its folded architecture than by its sequence, original computational tools adapted to the multifaceted RNA functions have to be developed. Such tools, recently published, enable a user to solve classical problems related to RNA research: constructing 'structural' multiple alignments, inferring complete structures and structural motifs from RNA alignments, or searching structural homology in genomic databases.
Collapse
Affiliation(s)
- Fabrice Jossinet
- Architecture et Réactivité de l'ARN, Université Louis Pasteur, Institut de Biologie Moléculaire et Cellulaire, CNRS, F-67084 Strasbourg, France
| | | | | |
Collapse
|
40
|
Laederach A, Chan JM, Schwartzman A, Willgohs E, Altman RB. Coplanar and coaxial orientations of RNA bases and helices. RNA (NEW YORK, N.Y.) 2007; 13:643-50. [PMID: 17339576 PMCID: PMC1852812 DOI: 10.1261/rna.381407] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/14/2023]
Abstract
Electrostatic interactions, base-pairing, and especially base-stacking dominate RNA three-dimensional structures. In an A-form RNA helix, base-stacking results in nearly perfect parallel orientations of all bases in the helix. Interestingly, when an RNA structure containing multiple helices is visualized at the atomic level, it is often possible to find an orientation such that only the edges of most bases are visible. This suggests that a general aspect of higher level RNA structure is a coplanar arrangement of base-normal vectors. We have analyzed all solved RNA crystal structures to determine the degree to which RNA base-normal vectors are globally coplanar. Using a statistical test based on the Watson-Girdle distribution, we determined that 330 out of 331 known RNA structures show statistically significant (p < 0.05; false discovery rate [FDR] = 0.05) coplanar normal vector orientations. Not surprisingly, 94% of the helices in RNA show bipolar arrangements of their base-normal vectors (p < 0.05). This allows us to compute a mean axis for each helix and compare their orientations within an RNA structure. This analysis revealed that 62% (208/331) of the RNA structures exhibit statistically significant coaxial packing of helices (p < 0.05, FDR = 0.08). Further analysis reveals that the bases in hairpin loops and junctions are also generally planar. This work demonstrates coplanar base orientation and coaxial helix packing as an emergent behavior of RNA structure and may be useful as a structural modeling constraint.
Collapse
Affiliation(s)
- Alain Laederach
- Department of Genetics, Stanford University, Stanford, California 94305, USA
| | | | | | | | | |
Collapse
|
41
|
Davis IW, Leaver-Fay A, Chen VB, Block JN, Kapral GJ, Wang X, Murray LW, Arendall WB, Snoeyink J, Richardson JS, Richardson DC. MolProbity: all-atom contacts and structure validation for proteins and nucleic acids. Nucleic Acids Res 2007; 35:W375-83. [PMID: 17452350 PMCID: PMC1933162 DOI: 10.1093/nar/gkm216] [Citation(s) in RCA: 3240] [Impact Index Per Article: 180.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
MolProbity is a general-purpose web server offering quality validation for 3D structures of proteins, nucleic acids and complexes. It provides detailed all-atom contact analysis of any steric problems within the molecules as well as updated dihedral-angle diagnostics, and it can calculate and display the H-bond and van der Waals contacts in the interfaces between components. An integral step in the process is the addition and full optimization of all hydrogen atoms, both polar and nonpolar. New analysis functions have been added for RNA, for interfaces, and for NMR ensembles. Additionally, both the web site and major component programs have been rewritten to improve speed, convenience, clarity and integration with other resources. MolProbity results are reported in multiple forms: as overall numeric scores, as lists or charts of local problems, as downloadable PDB and graphics files, and most notably as informative, manipulable 3D kinemage graphics shown online in the KiNG viewer. This service is available free to all users at http://molprobity.biochem.duke.edu.
Collapse
Affiliation(s)
- Ian W. Davis
- Department of Biochemistry, Duke University, Durham, NC, USA and Department of Computer Science, UNC Chapel Hill, Chapel Hill, NC, USA
| | - Andrew Leaver-Fay
- Department of Biochemistry, Duke University, Durham, NC, USA and Department of Computer Science, UNC Chapel Hill, Chapel Hill, NC, USA
| | - Vincent B. Chen
- Department of Biochemistry, Duke University, Durham, NC, USA and Department of Computer Science, UNC Chapel Hill, Chapel Hill, NC, USA
| | - Jeremy N. Block
- Department of Biochemistry, Duke University, Durham, NC, USA and Department of Computer Science, UNC Chapel Hill, Chapel Hill, NC, USA
| | - Gary J. Kapral
- Department of Biochemistry, Duke University, Durham, NC, USA and Department of Computer Science, UNC Chapel Hill, Chapel Hill, NC, USA
| | - Xueyi Wang
- Department of Biochemistry, Duke University, Durham, NC, USA and Department of Computer Science, UNC Chapel Hill, Chapel Hill, NC, USA
| | - Laura W. Murray
- Department of Biochemistry, Duke University, Durham, NC, USA and Department of Computer Science, UNC Chapel Hill, Chapel Hill, NC, USA
| | - W. Bryan Arendall
- Department of Biochemistry, Duke University, Durham, NC, USA and Department of Computer Science, UNC Chapel Hill, Chapel Hill, NC, USA
| | - Jack Snoeyink
- Department of Biochemistry, Duke University, Durham, NC, USA and Department of Computer Science, UNC Chapel Hill, Chapel Hill, NC, USA
| | - Jane S. Richardson
- Department of Biochemistry, Duke University, Durham, NC, USA and Department of Computer Science, UNC Chapel Hill, Chapel Hill, NC, USA
| | - David C. Richardson
- Department of Biochemistry, Duke University, Durham, NC, USA and Department of Computer Science, UNC Chapel Hill, Chapel Hill, NC, USA
- *To whom correspondence should be addressed. +1-919-684-6010
| |
Collapse
|
42
|
Wang X, Kapral G, Murray L, Richardson D, Richardson J, Snoeyink J. RNABC: forward kinematics to reduce all-atom steric clashes in RNA backbone. J Math Biol 2007; 56:253-78. [PMID: 17401565 PMCID: PMC2153530 DOI: 10.1007/s00285-007-0082-x] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2006] [Revised: 01/23/2007] [Indexed: 10/23/2022]
Abstract
Although accurate details in RNA structure are of great importance for understanding RNA function, the backbone conformation is difficult to determine, and most existing RNA structures show serious steric clashes (>or= 0.4 A overlap) when hydrogen atoms are taken into account. We have developed a program called RNABC (RNA Backbone Correction) that performs local perturbations to search for alternative conformations that avoid those steric clashes or other local geometry problems. Its input is an all-atom coordinate file for an RNA crystal structure (usually from the MolProbity web service), with problem areas specified. RNABC rebuilds a suite (the unit from sugar to sugar) by anchoring the phosphorus and base positions, which are clearest in crystallographic electron density, and reconstructing the other atoms using forward kinematics. Geometric parameters are constrained within user-specified tolerance of canonical or original values, and torsion angles are constrained to ranges defined through empirical database analyses. Several optimizations reduce the time required to search the many possible conformations. The output results are clustered and presented to the user, who can choose whether to accept one of the alternative conformations. Two test evaluations show the effectiveness of RNABC, first on the S-motifs from 42 RNA structures, and second on the worst problem suites (clusters of bad clashes, or serious sugar pucker outliers) in 25 unrelated RNA structures. Among the 101 S-motifs, 88 had diagnosed problems, and RNABC produced clash-free conformations with acceptable geometry for 71 of those (about 80%). For the 154 worst problem suites, RNABC proposed alternative conformations for 72. All but 8 of those were judged acceptable after examining electron density (where available) and local conformation. Thus, even for these worst cases, nearly half the time RNABC suggested corrections suitable to initiate further crystallographic refinement. The program is available from http://kinemage.biochem.duke.edu .
Collapse
Affiliation(s)
- Xueyi Wang
- Department of Computer Science, UNC Chapel Hill, Chapel Hill, NC, 27599-3175, USA
| | - Gary Kapral
- Department of Biochemistry, Duke University, Durham, NC, 27710-3711, USA
| | - Laura Murray
- Department of Biochemistry, Duke University, Durham, NC, 27710-3711, USA
| | - David Richardson
- Department of Biochemistry, Duke University, Durham, NC, 27710-3711, USA
| | - Jane Richardson
- Department of Biochemistry, Duke University, Durham, NC, 27710-3711, USA
| | - Jack Snoeyink
- Department of Computer Science, UNC Chapel Hill, Chapel Hill, NC, 27599-3175, USA
| |
Collapse
|
43
|
Abstract
All pairwise interactions occurring between bases which could be detected in three-dimensional structures of crystallized RNA molecules are annotated on new planar diagrams. The diagrams attempt to map the underlying complex networks of base–base interactions and, especially, they aim at conveying key relationships between helical domains: co-axial stacking, bending and all Watson–Crick as well as non-Watson–Crick base pairs. Although such wiring diagrams cannot replace full stereographic images for correct spatial understanding and representation, they reveal structural similarities as well as the conserved patterns and distances between motifs which are present within the interaction networks of folded RNAs of similar or unrelated functions. Finally, the diagrams could help devising methods for meaningfully transforming RNA structures into graphs amenable to network analysis.
Collapse
Affiliation(s)
| | - E. Westhof
- To whom correspondence should be addressed. Tel/Fax: +33 388 41 70 46; Email :
| |
Collapse
|
44
|
Leontis NB, Lescoute A, Westhof E. The building blocks and motifs of RNA architecture. Curr Opin Struct Biol 2006; 16:279-87. [PMID: 16713707 PMCID: PMC4857889 DOI: 10.1016/j.sbi.2006.05.009] [Citation(s) in RCA: 258] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2006] [Revised: 04/12/2006] [Accepted: 05/10/2006] [Indexed: 10/24/2022]
Abstract
RNA motifs can be defined broadly as recurrent structural elements containing multiple intramolecular RNA-RNA interactions, as observed in atomic-resolution RNA structures. They constitute the modular building blocks of RNA architecture, which is organized hierarchically. Recent work has focused on analyzing RNA backbone conformations to identify, define and search for new instances of recurrent motifs in X-ray structures. One current view asserts that recurrent RNA strand segments with characteristic backbone configurations qualify as independent motifs. Other considerations indicate that, to characterize modular motifs, one must take into account the larger structural context of such strand segments. This follows the biologically relevant motivation, which is to identify RNA structural characteristics that are subject to sequence constraints and that thus relate RNA architectures to sequences.
Collapse
Affiliation(s)
- Neocles B Leontis
- Department of Chemistry and Center for Biomolecular Sciences, Bowling Green State University, Bowling Green, OH 43402, USA
| | | | | |
Collapse
|