1
|
Emrizal R, Hamdani HY, Firdaus-Raih M. Graph Theoretical Methods and Workflows for Searching and Annotation of RNA Tertiary Base Motifs and Substructures. Int J Mol Sci 2021; 22:ijms22168553. [PMID: 34445259 PMCID: PMC8395288 DOI: 10.3390/ijms22168553] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2021] [Revised: 08/01/2021] [Accepted: 08/06/2021] [Indexed: 12/12/2022] Open
Abstract
The increasing number and complexity of structures containing RNA chains in the Protein Data Bank (PDB) have led to the need for automated structure annotation methods to replace or complement expert visual curation. This is especially true when searching for tertiary base motifs and substructures. Such base arrangements and motifs have diverse roles that range from contributions to structural stability to more direct involvement in the molecule's functions, such as the sites for ligand binding and catalytic activity. We review the utility of computational approaches in annotating RNA tertiary base motifs in a dataset of PDB structures, particularly the use of graph theoretical algorithms that can search for such base motifs and annotate them or find and annotate clusters of hydrogen-bond-connected bases. We also demonstrate how such graph theoretical algorithms can be integrated into a workflow that allows for functional analysis and comparisons of base arrangements and sub-structures, such as those involved in ligand binding. The capacity to carry out such automatic curations has led to the discovery of novel motifs and can give new context to known motifs as well as enable the rapid compilation of RNA 3D motifs into a database.
Collapse
Affiliation(s)
- Reeki Emrizal
- Department of Applied Physics, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, UKM Bangi, Bangi 43600, Selangor, Malaysia;
- Institute of Systems Biology, Universiti Kebangsaan Malaysia, UKM Bangi, Bangi 43600, Selangor, Malaysia
| | - Hazrina Yusof Hamdani
- Advanced Medical and Dental Institute, Universiti Sains Malaysia, Bertam, Kepala Batas 13200, Pulau Pinang, Malaysia
- Correspondence: (H.Y.H.); (M.F.-R.)
| | - Mohd Firdaus-Raih
- Department of Applied Physics, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, UKM Bangi, Bangi 43600, Selangor, Malaysia;
- Institute of Systems Biology, Universiti Kebangsaan Malaysia, UKM Bangi, Bangi 43600, Selangor, Malaysia
- Correspondence: (H.Y.H.); (M.F.-R.)
| |
Collapse
|
2
|
Soulé A, Reinharz V, Sarrazin-Gendron R, Denise A, Waldispühl J. Finding recurrent RNA structural networks with fast maximal common subgraphs of edge-colored graphs. PLoS Comput Biol 2021; 17:e1008990. [PMID: 34048427 PMCID: PMC8191989 DOI: 10.1371/journal.pcbi.1008990] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Revised: 06/10/2021] [Accepted: 04/22/2021] [Indexed: 11/25/2022] Open
Abstract
RNA tertiary structure is crucial to its many non-coding molecular functions. RNA architecture is shaped by its secondary structure composed of stems, stacked canonical base pairs, enclosing loops. While stems are precisely captured by free-energy models, loops composed of non-canonical base pairs are not. Nor are distant interactions linking together those secondary structure elements (SSEs). Databases of conserved 3D geometries (a.k.a. modules) not captured by energetic models are leveraged for structure prediction and design, but the computational complexity has limited their study to local elements, loops. Representing the RNA structure as a graph has recently allowed to expend this work to pairs of SSEs, uncovering a hierarchical organization of these 3D modules, at great computational cost. Systematically capturing recurrent patterns on a large scale is a main challenge in the study of RNA structures. In this paper, we present an efficient algorithm to compute maximal isomorphisms in edge colored graphs. We extend this algorithm to a framework well suited to identify RNA modules, and fast enough to considerably generalize previous approaches. To exhibit the versatility of our framework, we first reproduce results identifying all common modules spanning more than 2 SSEs, in a few hours instead of weeks. The efficiency of our new algorithm is demonstrated by computing the maximal modules between any pair of entire RNA in the non-redundant corpus of known RNA 3D structures. We observe that the biggest modules our method uncovers compose large shared sub-structure spanning hundreds of nucleotides and base pairs between the ribosomes of Thermus thermophilus, Escherichia Coli, and Pseudomonas aeruginosa. Ribonucleic Acids (RNAs) are performing a broad range of essential molecular functions in cells, many of which rely on intricate folding properties of the molecule. Watson-Crick and Wobble base pairs form early, stack onto each other to create stems connected by loops, which are themselves stabilized by more sophisticated base interaction patterns. These networks are essential to shape RNA 3D structures but unfortunately still poorly understood. Here, we undertake the task to build a catalog of base interaction networks occurring in multiple structures. However, a pairwise comparison of all RNA structures is computationally heavy. Therefore, we devise an algorithm leveraging intrinsic properties of RNA base interaction networks that enables us to quickly mine full databases of 3D structures. Compared to previous methods, our techniques bring the total running time of the analysis from months to hours while performing more general searches. The data collected though this work will benefit molecular evolution studies and serve in structure prediction tools.
Collapse
Affiliation(s)
- Antoine Soulé
- School of Computer Science, McGill University, Montréal, Canada
- LiX, École Polytechnique, Paris, France
| | - Vladimir Reinharz
- Department of Computer Science, Université du Québec à Montréal, Montréal, Canada
| | | | - Alain Denise
- Laboratoire de recherche en informatique, Université Paris-Saclay - CNRS, Orsay, France
- Institute for Integrative Biology of the Cell (I2BC), Université Paris-Saclay - CEA - CNRS, Gif-sur-Yvette, France
| | - Jérôme Waldispühl
- School of Computer Science, McGill University, Montréal, Canada
- * E-mail:
| |
Collapse
|
3
|
Richardson KE, Adams MS, Kirkpatrick CC, Gohara DW, Znosko BM. Identification and Characterization of New RNA Tetraloop Sequence Families. Biochemistry 2019; 58:4809-4820. [PMID: 31714066 DOI: 10.1021/acs.biochem.9b00535] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
There is an abundance of RNA sequence information available due to the efforts of sequencing projects. However, current techniques implemented to solve the tertiary structures of RNA, such as NMR and X-ray crystallography, are difficult and time-consuming. Therefore, biophysical techniques are not able to keep pace with the abundance of sequence information available. Because of this, there is a need to develop quick and efficient ways to predict RNA tertiary structure from sequence. One promising approach is to identify structural patterns within previously solved 3D structures and apply these patterns to new sequences. RNA tetraloops are one of the most common naturally occurring secondary structure motifs. Here, we use RNA Characterization of Secondary Structure Motifs (CoSSMos), Dissecting the Spatial Structure of RNA (DSSR), and a bioinformatic approach to search for and characterize tertiary structure patterns among tetraloops. Not surprising, we identified the well-known GNRA and UNCG tetraloops, as well as the previously identified RNYA tetraloop. However, some previously identified characteristics of these families were not observed in this data set, and some new characteristics were identified. In addition, we also identified and characterized three new tetraloop sequence families: YGAR, UGGU, and RMSA. This new structural information sheds light on the tertiary structure of tetraloops and contributes to the efforts of RNA tertiary structure prediction from sequence.
Collapse
Affiliation(s)
- Katherine E Richardson
- Department of Chemistry , Saint Louis University , Saint Louis , Missouri 63103 , United States
| | - Miranda S Adams
- Department of Chemistry , Saint Louis University , Saint Louis , Missouri 63103 , United States
| | - Charles C Kirkpatrick
- Department of Chemistry , Saint Louis University , Saint Louis , Missouri 63103 , United States
| | - David W Gohara
- Department of Biochemistry and Molecular Biology , Saint Louis University , Saint Louis , Missouri 63103 , United States
| | - Brent M Znosko
- Department of Chemistry , Saint Louis University , Saint Louis , Missouri 63103 , United States
| |
Collapse
|
4
|
Reinharz V, Soulé A, Westhof E, Waldispühl J, Denise A. Mining for recurrent long-range interactions in RNA structures reveals embedded hierarchies in network families. Nucleic Acids Res 2019; 46:3841-3851. [PMID: 29608773 PMCID: PMC5934684 DOI: 10.1093/nar/gky197] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2017] [Accepted: 03/22/2018] [Indexed: 11/14/2022] Open
Abstract
The wealth of the combinatorics of nucleotide base pairs enables RNA molecules to assemble into sophisticated interaction networks, which are used to create complex 3D substructures. These interaction networks are essential to shape the 3D architecture of the molecule, and also to provide the key elements to carry molecular functions such as protein or ligand binding. They are made of organised sets of long-range tertiary interactions which connect distinct secondary structure elements in 3D structures. Here, we present a de novo data-driven approach to extract automatically from large data sets of full RNA 3D structures the recurrent interaction networks (RINs). Our methodology enables us for the first time to detect the interaction networks connecting distinct components of the RNA structure, highlighting their diversity and conservation through non-related functional RNAs. We use a graphical model to perform pairwise comparisons of all RNA structures available and to extract RINs and modules. Our analysis yields a complete catalog of RNA 3D structures available in the Protein Data Bank and reveals the intricate hierarchical organization of the RNA interaction networks and modules. We assembled our results in an online database (http://carnaval.lri.fr) which will be regularly updated. Within the site, a tool allows users with a novel RNA structure to detect automatically whether the novel structure contains previously observed RINs.
Collapse
Affiliation(s)
- Vladimir Reinharz
- Department of Computer Science, Ben-Gurion University of the Negev, P.O.B. 653 Beer-Sheva, 84105, Israel.,School of Computer Science, McGill University, 3480 University, Montreal, Quebec H3A 0E9, Canada
| | - Antoine Soulé
- School of Computer Science, McGill University, 3480 University, Montreal, Quebec H3A 0E9, Canada.,LIX, École Polytechnique, CNRS, Inria, Palaiseau 91120, France
| | - Eric Westhof
- ARN, Université de Strasbourg, IBMC-CNRS, 15 rue René Descartes, Strasbourg Cedex 67084, France
| | - Jérôme Waldispühl
- School of Computer Science, McGill University, 3480 University, Montreal, Quebec H3A 0E9, Canada
| | - Alain Denise
- LRI, Université Paris-Sud, CNRS, Université Paris-Saclay, Bâtiment 650, Orsay cedex 91405, France.,I2BC, Université Paris-Sud, CNRS, CEA, Université Paris-Saclay, Bâtiment 400, Orsay cedex 91405, France
| |
Collapse
|
5
|
Ge P, Islam S, Zhong C, Zhang S. De novo discovery of structural motifs in RNA 3D structures through clustering. Nucleic Acids Res 2018; 46:4783-4793. [PMID: 29534235 PMCID: PMC5961109 DOI: 10.1093/nar/gky139] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2017] [Revised: 02/09/2018] [Accepted: 02/16/2018] [Indexed: 11/16/2022] Open
Abstract
As functional components in three-dimensional (3D) conformation of an RNA, the RNA structural motifs provide an easy way to associate the molecular architectures with their biological mechanisms. In the past years, many computational tools have been developed to search motif instances by using the existing knowledge of well-studied families. Recently, with the rapidly increasing number of resolved RNA 3D structures, there is an urgent need to discover novel motifs with the newly presented information. In this work, we classify all the loops in non-redundant RNA 3D structures to detect plausible RNA structural motif families by using a clustering pipeline. Compared with other clustering approaches, our method has two benefits: first, the underlying alignment algorithm is tolerant to the variations in 3D structures. Second, sophisticated downstream analysis has been performed to ensure the clusters are valid and easily applied to further research. The final clustering results contain many interesting new variants of known motif families, such as GNAA tetraloop, kink-turn, sarcin-ricin and T-loop. We have also discovered potential novel functional motifs conserved in ribosomal RNA, sgRNA, SRP RNA, riboswitch and ribozyme.
Collapse
Affiliation(s)
- Ping Ge
- Department of Computer Science, University of Central Florida, Orlando, FL 32816, USA
| | - Shahidul Islam
- Department of Computer Science, University of Central Florida, Orlando, FL 32816, USA
| | - Cuncong Zhong
- Department of Electrical Engineering and Computer Science, University of Kansas, Lawrence, KS 66045, USA
| | - Shaojie Zhang
- Department of Computer Science, University of Central Florida, Orlando, FL 32816, USA
| |
Collapse
|
6
|
Šponer J, Bussi G, Krepl M, Banáš P, Bottaro S, Cunha RA, Gil-Ley A, Pinamonti G, Poblete S, Jurečka P, Walter NG, Otyepka M. RNA Structural Dynamics As Captured by Molecular Simulations: A Comprehensive Overview. Chem Rev 2018; 118:4177-4338. [PMID: 29297679 PMCID: PMC5920944 DOI: 10.1021/acs.chemrev.7b00427] [Citation(s) in RCA: 336] [Impact Index Per Article: 56.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2017] [Indexed: 12/14/2022]
Abstract
With both catalytic and genetic functions, ribonucleic acid (RNA) is perhaps the most pluripotent chemical species in molecular biology, and its functions are intimately linked to its structure and dynamics. Computer simulations, and in particular atomistic molecular dynamics (MD), allow structural dynamics of biomolecular systems to be investigated with unprecedented temporal and spatial resolution. We here provide a comprehensive overview of the fast-developing field of MD simulations of RNA molecules. We begin with an in-depth, evaluatory coverage of the most fundamental methodological challenges that set the basis for the future development of the field, in particular, the current developments and inherent physical limitations of the atomistic force fields and the recent advances in a broad spectrum of enhanced sampling methods. We also survey the closely related field of coarse-grained modeling of RNA systems. After dealing with the methodological aspects, we provide an exhaustive overview of the available RNA simulation literature, ranging from studies of the smallest RNA oligonucleotides to investigations of the entire ribosome. Our review encompasses tetranucleotides, tetraloops, a number of small RNA motifs, A-helix RNA, kissing-loop complexes, the TAR RNA element, the decoding center and other important regions of the ribosome, as well as assorted others systems. Extended sections are devoted to RNA-ion interactions, ribozymes, riboswitches, and protein/RNA complexes. Our overview is written for as broad of an audience as possible, aiming to provide a much-needed interdisciplinary bridge between computation and experiment, together with a perspective on the future of the field.
Collapse
Affiliation(s)
- Jiří Šponer
- Institute of Biophysics of the Czech Academy of Sciences , Kralovopolska 135 , Brno 612 65 , Czech Republic
| | - Giovanni Bussi
- Scuola Internazionale Superiore di Studi Avanzati , Via Bonomea 265 , Trieste 34136 , Italy
| | - Miroslav Krepl
- Institute of Biophysics of the Czech Academy of Sciences , Kralovopolska 135 , Brno 612 65 , Czech Republic
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science , Palacky University Olomouc , 17. listopadu 12 , Olomouc 771 46 , Czech Republic
| | - Pavel Banáš
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science , Palacky University Olomouc , 17. listopadu 12 , Olomouc 771 46 , Czech Republic
| | - Sandro Bottaro
- Structural Biology and NMR Laboratory, Department of Biology , University of Copenhagen , Copenhagen 2200 , Denmark
| | - Richard A Cunha
- Scuola Internazionale Superiore di Studi Avanzati , Via Bonomea 265 , Trieste 34136 , Italy
| | - Alejandro Gil-Ley
- Scuola Internazionale Superiore di Studi Avanzati , Via Bonomea 265 , Trieste 34136 , Italy
| | - Giovanni Pinamonti
- Scuola Internazionale Superiore di Studi Avanzati , Via Bonomea 265 , Trieste 34136 , Italy
| | - Simón Poblete
- Scuola Internazionale Superiore di Studi Avanzati , Via Bonomea 265 , Trieste 34136 , Italy
| | - Petr Jurečka
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science , Palacky University Olomouc , 17. listopadu 12 , Olomouc 771 46 , Czech Republic
| | - Nils G Walter
- Single Molecule Analysis Group and Center for RNA Biomedicine, Department of Chemistry , University of Michigan , Ann Arbor , Michigan 48109 , United States
| | - Michal Otyepka
- Regional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science , Palacky University Olomouc , 17. listopadu 12 , Olomouc 771 46 , Czech Republic
| |
Collapse
|
7
|
RNA structure prediction: from 2D to 3D. Emerg Top Life Sci 2017; 1:275-285. [DOI: 10.1042/etls20160027] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2017] [Revised: 07/27/2017] [Accepted: 08/10/2017] [Indexed: 11/17/2022]
Abstract
We summarize different levels of RNA structure prediction, from classical 2D structure to extended secondary structure and motif-based research toward 3D structure prediction of RNA. We outline the importance of classical secondary structure during all those levels of structure prediction.
Collapse
|
8
|
DasGupta S, Suslov NB, Piccirilli JA. Structural Basis for Substrate Helix Remodeling and Cleavage Loop Activation in the Varkud Satellite Ribozyme. J Am Chem Soc 2017; 139:9591-9597. [PMID: 28625058 PMCID: PMC5929484 DOI: 10.1021/jacs.7b03655] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
The Varkud satellite (VS) ribozyme catalyzes site-specific RNA cleavage and ligation reactions. Recognition of the substrate involves a kissing loop interaction between the substrate and the catalytic domain of the ribozyme, resulting in a rearrangement of the substrate helix register into a so-called "shifted" conformation that is critical for substrate binding and activation. We report a 3.3 Å crystal structure of the complete ribozyme that reveals the active, shifted conformation of the substrate, docked into the catalytic domain of the ribozyme. Comparison to previous NMR structures of isolated, inactive substrates provides a physical description of substrate remodeling, and implicates roles for tertiary interactions in catalytic activation of the cleavage loop. Similarities to the hairpin ribozyme cleavage loop activation suggest general strategies to enhance fidelity in RNA folding and ribozyme cleavage.
Collapse
Affiliation(s)
- Saurja DasGupta
- Department of Chemistry, The University of Chicago, Chicago, IL, 60637, USA
| | - Nikolai B. Suslov
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, 60637, USA
| | - Joseph A. Piccirilli
- Department of Chemistry, The University of Chicago, Chicago, IL, 60637, USA
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, 60637, USA
| |
Collapse
|
9
|
Mapping the Universe of RNA Tetraloop Folds. Biophys J 2017; 113:257-267. [PMID: 28673616 DOI: 10.1016/j.bpj.2017.06.011] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2017] [Revised: 06/08/2017] [Accepted: 06/08/2017] [Indexed: 11/22/2022] Open
Abstract
We report a map of RNA tetraloop conformations constructed by calculating pairwise distances among all experimentally determined four-nucleotide hairpin loops. Tetraloops with similar structures are clustered together and, as expected, the two largest clusters are the canonical GNRA and UNCG folds. We identify clusters corresponding to known tetraloop folds such as GGUG, RNYA, AGNN, and CUUG. These clusters are represented in a simple two-dimensional projection that recapitulates the relationship among the different folds. The cluster analysis also identifies 20 novel tetraloop folds that are peculiar to specific positions in ribosomal RNAs and that are stabilized by tertiary interactions. In our RNA tetraloop database we find a significant number of non-GNRA and non-UNCG sequences adopting the canonical GNRA and UNCG folds. Conversely, we find a significant number of GNRA and UNCG sequences adopting non-GNRA and non-UNCG folds. Our analysis demonstrates that there is not a simple one-to-one, but rather a many-to-many mapping between tetraloop sequence and tetraloop fold.
Collapse
|
10
|
Abstract
Ligand-responsive RNA mechanical switches represent a new class of simple switching modules that adopt well-defined ligand-free and bound conformational states, distinguishing them from metabolite-sensing riboswitches. Initially discovered in the internal ribosome entry site (IRES) of hepatitis C virus (HCV), these RNA switch motifs were found in the genome of diverse other viruses. Although large variations are seen in sequence and local secondary structure of the switches, their function in viral translation initiation that requires selective ligand recognition is conserved. We recently determined the crystal structure of an RNA switch from Seneca Valley virus (SVV) which is able to functionally replace the switch of HCV. The switches from both viruses recognize identical cognate ligands despite their sequence dissimilarity. Here, we describe the discovery of 7 new switches in addition to the previously established 5 examples. We highlight structural and functional features unique to this class of ligand-responsive RNA mechanical switches and discuss implications for therapeutic development and the construction of RNA nanostructures.
Collapse
Key Words
- AEV, avian encephalomyelitis virus
- BDV, border disease virus
- BVDV, bovine viral diarrhea virus
- CSFV, classical swine fever virus
- DHV, Duck hepatitis virus
- DPV, duck picornavirus
- GBV, GB virus
- GPV, giraffe pestivirus
- HCV, hepatitis C virus
- IRES
- IRES, internal ribosome entry site
- IVT, in vitro translation
- NPHV, non-primate hepacivirus
- RNA switch
- SPV, simian picornavirus
- SVV, Seneca Valley virus
- conformational switch
- hepatitis C virus
- riboswitch
Collapse
Affiliation(s)
- Mark A Boerneke
- a Department of Chemistry and Biochemistry ; University of California, San Diego ; La Jolla , CA USA
| | | |
Collapse
|
11
|
Gelinas AD, Davies DR, Janjic N. Embracing proteins: structural themes in aptamer-protein complexes. Curr Opin Struct Biol 2016; 36:122-32. [PMID: 26919170 DOI: 10.1016/j.sbi.2016.01.009] [Citation(s) in RCA: 128] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2015] [Revised: 01/13/2016] [Accepted: 01/15/2016] [Indexed: 01/17/2023]
Abstract
Understanding the structural rules that govern specific, high-affinity binding characteristic of aptamer-protein interactions is important in view of the increasing use of aptamers across many applications. From the modest number of 16 aptamer-protein structures currently available, trends are emerging. The flexible phosphodiester backbone allows folding into precise three-dimensional structures using known nucleic acid motifs as scaffolds that orient specific functional groups for target recognition. Still, completely novel motifs essential for structure and function are found in modified aptamers with diversity-enhancing side chains. Aptamers and antibodies, two classes of macromolecules used as affinity reagents with entirely different backbones and composition, recognize protein epitopes of similar size and with comparably high shape complementarity.
Collapse
Affiliation(s)
- Amy D Gelinas
- SomaLogic, Inc., 2945 Wilderness Place, Boulder, CO 80301, United States
| | - Douglas R Davies
- Beryllium, 7869 NE Day Road West, Bainbridge Island, WA 98110, United States
| | - Nebojsa Janjic
- SomaLogic, Inc., 2945 Wilderness Place, Boulder, CO 80301, United States.
| |
Collapse
|
12
|
Schudoma C. It's a loop world - single strands in RNA as structural and functional elements. Biomol Concepts 2015; 2:171-81. [PMID: 25962027 DOI: 10.1515/bmc.2011.016] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2011] [Accepted: 03/25/2011] [Indexed: 01/31/2023] Open
Abstract
Unpaired regions in RNA molecules - loops - are centrally involved in defining the characteristic three-dimensional (3D) architecture of RNAs and are of high interest in RNA engineering and design. Loops adopt diverse, but specific conformations stabilised by complex tertiary structural interactions that provide structural flexibility to RNA structures that would otherwise not be possible if they only consisted of the rigid A-helical shapes usually formed by canonical base pairing. By participating in sequence-non-local contacts, they furthermore contribute to stabilising the overall fold of RNA molecules. Interactions between RNAs and other nucleic acids, proteins, or small molecules are also generally mediated by RNA loop structures. Therefore, the function of an RNA molecule is generally dependent on its loops. Examples include intermolecular interactions between RNAs as part of the microRNA processing pathways, ribozymatic activity, or riboswitch-ligand interactions. Bioinformatics approaches have been successfully applied to the identification of novel RNA structural motifs including loops, local and global RNA 3D structure prediction, and structural and conformational analysis of RNAs and have contributed to a better understanding of the sequence-structure-function relationships in RNA loops.
Collapse
|
13
|
Abstract
Varkud Satellite (VS) ribozyme mediates rolling circle replication of a plasmid found in the Neurospora mitochondria. We report crystal structures of this ribozyme at 3.1Å resolution, revealing an intertwined dimer formed by an exchange of substrate helices. Within each protomer, an arrangement of three-way helical junctions organizes seven helices into a global fold that creates a docking site for the substrate helix of the other protomer, resulting in the formation of two active sites in trans. This mode of RNA-RNA association resembles the process of domain swapping in proteins and has implications for RNA regulation and evolution. Within each active site, adenine and guanine nucleobases abut the scissile phosphate, poised to serve direct roles in catalysis. Similarities to the active sites of the hairpin and hammerhead ribozymes highlight the functional significance of active site features, underscore the ability of RNA to access functional architectures from distant regions of sequence space, and suggest convergent evolution.
Collapse
|
14
|
Zhong C, Zhang S. RNAMotifScanX: a graph alignment approach for RNA structural motif identification. RNA (NEW YORK, N.Y.) 2015; 21:333-346. [PMID: 25595715 PMCID: PMC4338331 DOI: 10.1261/rna.044891.114] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/16/2014] [Accepted: 11/28/2014] [Indexed: 06/04/2023]
Abstract
RNA structural motifs are recurrent three-dimensional (3D) components found in the RNA architecture. These RNA structural motifs play important structural or functional roles and usually exhibit highly conserved 3D geometries and base-interaction patterns. Analysis of the RNA 3D structures and elucidation of their molecular functions heavily rely on efficient and accurate identification of these motifs. However, efficient RNA structural motif search tools are lacking due to the high complexity of these motifs. In this work, we present RNAMotifScanX, a motif search tool based on a base-interaction graph alignment algorithm. This novel algorithm enables automatic identification of both partially and fully matched motif instances. RNAMotifScanX considers noncanonical base-pairing interactions, base-stacking interactions, and sequence conservation of the motifs, which leads to significantly improved sensitivity and specificity as compared with other state-of-the-art search tools. RNAMotifScanX also adopts a carefully designed branch-and-bound technique, which enables ultra-fast search of large kink-turn motifs against a 23S rRNA. The software package RNAMotifScanX is implemented using GNU C++, and is freely available from http://genome.ucf.edu/RNAMotifScanX.
Collapse
Affiliation(s)
- Cuncong Zhong
- Department of Electrical Engineering and Computer Science, University of Central Florida, Orlando, Florida 32816, USA
| | - Shaojie Zhang
- Department of Electrical Engineering and Computer Science, University of Central Florida, Orlando, Florida 32816, USA
| |
Collapse
|
15
|
Structural insights into recognition of c-di-AMP by the ydaO riboswitch. Nat Chem Biol 2014; 10:787-92. [PMID: 25086507 DOI: 10.1038/nchembio.1607] [Citation(s) in RCA: 82] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2014] [Accepted: 06/19/2014] [Indexed: 01/28/2023]
Abstract
Bacterial second messenger cyclic di-AMP (c-di-AMP) is implicated in signaling DNA damage and cell wall stress through interactions with several protein receptors and a widespread ydaO-type riboswitch. We report the crystal structures of c-di-AMP riboswitches from Thermoanaerobacter pseudethanolicus and Thermovirga lienii determined at ∼3.0-Å resolution. In both species, the RNA adopts an unforeseen 'square'-shaped pseudosymmetrical architecture that features two three-way junctions, a turn and a pseudoknot, positioned in the square corners. Uncharacteristically for riboswitches, the structure is stapled by two ligand molecules that span the interior of the structure and employ similar noncanonical interactions for RNA recognition. Mutations in either ligand-binding site negatively affect c-di-AMP binding, suggesting that the riboswitch-triggered genetic response requires contribution of both ligands. Our data provide what are to our knowledge the first insights into specific sensing of c-di-AMP and a molecular mechanism underlying the common c-di-AMP-dependent control of essential cellular processes in bacteria.
Collapse
|
16
|
Huang H, Suslov NB, Li NS, Shelke SA, Evans ME, Koldobskaya Y, Rice PA, Piccirilli JA. A G-quadruplex-containing RNA activates fluorescence in a GFP-like fluorophore. Nat Chem Biol 2014; 10:686-91. [PMID: 24952597 PMCID: PMC4104137 DOI: 10.1038/nchembio.1561] [Citation(s) in RCA: 245] [Impact Index Per Article: 24.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2013] [Accepted: 05/21/2014] [Indexed: 01/30/2023]
Abstract
Spinach is an in vitro-selected RNA aptamer that binds a GFP-like ligand and activates its green fluorescence. Spinach is thus an RNA analog of GFP and has potentially widespread applications for in vivo labeling and imaging. We used antibody-assisted crystallography to determine the structures of Spinach both with and without bound fluorophore at 2.2-Å and 2.4-Å resolution, respectively. Spinach RNA has an elongated structure containing two helical domains separated by an internal bulge that folds into a G-quadruplex motif of unusual topology. The G-quadruplex motif and adjacent nucleotides comprise a partially preformed binding site for the fluorophore. The fluorophore binds in a planar conformation and makes extensive aromatic stacking and hydrogen bond interactions with the RNA. Our findings provide a foundation for structure-based engineering of new fluorophore-binding RNA aptamers.
Collapse
Affiliation(s)
- Hao Huang
- Department of Chemistry, University of Chicago, Chicago, IL, 60637, USA
| | - Nikolai B. Suslov
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, IL, 60637, USA
| | - Nan-Sheng Li
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, IL, 60637, USA
| | - Sandip A. Shelke
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, IL, 60637, USA
| | - Molly E. Evans
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, IL, 60637, USA
| | | | - Phoebe A. Rice
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, IL, 60637, USA
| | - Joseph A. Piccirilli
- Department of Chemistry, University of Chicago, Chicago, IL, 60637, USA
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, IL, 60637, USA
| |
Collapse
|
17
|
Panecka J, Havrila M, Réblová K, Šponer J, Trylska J. Role of S-turn2 in the structure, dynamics, and function of mitochondrial ribosomal A-site. A bioinformatics and molecular dynamics simulation study. J Phys Chem B 2014; 118:6687-701. [PMID: 24845793 DOI: 10.1021/jp5030685] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
The mRNA decoding site (A-site) in the small ribosomal subunit controls fidelity of the translation process. Here, using molecular dynamics simulations and bioinformatic analyses, we investigated the structural dynamics of the human mitochondrial A-site (native and A1490G mutant) and compared it with the dynamics of the bacterial A-site. We detected and characterized a specific RNA backbone configuration, S-turn2, which occurs in the human mitochondrial but not in the bacterial A-site. Mitochondrial and bacterial A-sites show different propensities to form S-turn2 that may be caused by different base-pairing patterns of the flanking nucleotides. Also, the S-turn2 structural stability observed in the simulations supports higher accuracy and lower speed of mRNA decoding in mitochondria in comparison with bacteria. In the mitochondrial A-site, we observed collective movement of stacked nucleotides A1408·C1409·C1410, which may explain the known differences in aminoglycoside antibiotic binding affinities toward the studied A-site variants.
Collapse
Affiliation(s)
- Joanna Panecka
- Department of Biophysics, Institute of Experimental Physics and ∥Centre of New Technologies, University of Warsaw , Żwirki i Wigury 93, 02-089 Warsaw, Poland
| | | | | | | | | |
Collapse
|
18
|
He G, Steppi A, Laborde J, Srivastava A, Zhao P, Zhang J. RASS: a web server for RNA alignment in the joint sequence-structure space. Nucleic Acids Res 2014; 42:W377-81. [PMID: 24831547 PMCID: PMC4086137 DOI: 10.1093/nar/gku429] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Comparison of ribonucleic acid (RNA) molecules is important for revealing their
evolutionary relationships, predicting their functions and predicting their
structures. Many methods have been developed for comparing RNAs using either
sequence or three-dimensional (3D) structure (backbone geometry) information.
Sequences and 3D structures contain non-overlapping sets of information that
both determine RNA functions. When comparing RNA 3D structures, both types of
information need to be taken into account. However, few methods compare RNA
structures using both sequence and 3D structure information. Recently, we have
developed a new method based on elastic shape analysis (ESA) that compares RNA
molecules by combining both sequence and 3D structure information. ESA treats
RNA structures as 3D curves with sequence information encoded on additional
coordinates so that the alignment can be performed in the joint
sequence-structure space. The similarity between two RNA molecules is quantified
by a formal distance, geodesic distance. In this study, we implement a web
server for the method, called RASS, to make it publicly available to research
community. The web server is located at http://cloud.stat.fsu.edu/RASS/.
Collapse
Affiliation(s)
- Gewen He
- Department of Computer Science, Florida State University, Tallahassee, FL 32306, USA
| | - Albert Steppi
- Department of Statistics, Florida State University, Tallahassee, FL 32306, USA
| | - Jose Laborde
- Department of Statistics, Florida State University, Tallahassee, FL 32306, USA
| | - Anuj Srivastava
- Department of Statistics, Florida State University, Tallahassee, FL 32306, USA
| | - Peixiang Zhao
- Department of Computer Science, Florida State University, Tallahassee, FL 32306, USA
| | - Jinfeng Zhang
- Department of Statistics, Florida State University, Tallahassee, FL 32306, USA
| |
Collapse
|
19
|
Chojnowski G, Walen T, Bujnicki JM. RNA Bricks--a database of RNA 3D motifs and their interactions. Nucleic Acids Res 2013; 42:D123-31. [PMID: 24220091 PMCID: PMC3965019 DOI: 10.1093/nar/gkt1084] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
The RNA Bricks database (http://iimcb.genesilico.pl/rnabricks), stores information about recurrent RNA 3D motifs and their interactions, found in experimentally determined RNA structures and in RNA–protein complexes. In contrast to other similar tools (RNA 3D Motif Atlas, RNA Frabase, Rloom) RNA motifs, i.e. ‘RNA bricks’ are presented in the molecular environment, in which they were determined, including RNA, protein, metal ions, water molecules and ligands. All nucleotide residues in RNA bricks are annotated with structural quality scores that describe real-space correlation coefficients with the electron density data (if available), backbone geometry and possible steric conflicts, which can be used to identify poorly modeled residues. The database is also equipped with an algorithm for 3D motif search and comparison. The algorithm compares spatial positions of backbone atoms of the user-provided query structure and of stored RNA motifs, without relying on sequence or secondary structure information. This enables the identification of local structural similarities among evolutionarily related and unrelated RNA molecules. Besides, the search utility enables searching ‘RNA bricks’ according to sequence similarity, and makes it possible to identify motifs with modified ribonucleotide residues at specific positions.
Collapse
Affiliation(s)
- Grzegorz Chojnowski
- International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw, Poland, Faculty of Mathematics, Informatics, and Mechanics, University of Warsaw, Banacha 2, 02-097 Warsaw, Poland and Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, Umultowska 89, 61-614 Poznan, Poland
| | | | | |
Collapse
|
20
|
Abstract
RNA molecules are highly modular components that can be used in a variety of contexts for building new metabolic, regulatory and genetic circuits in cells. The majority of synthetic RNA systems to date predominately rely on two-dimensional modularity. However, a better understanding and integration of three-dimensional RNA modularity at structural and functional levels is critical to the development of more complex, functional bio-systems and molecular machines for synthetic biology applications.
Collapse
Affiliation(s)
- Wade Grabow
- Department of Chemistry and Biochemistry, Seattle Pacific University3307 Third Avenue West, Seattle, WA 98119USA
| | - Luc Jaeger
- Department of Chemistry and Biochemistry, Bio-Molecular Science and Engineering Program, University of CaliforniaSanta Barbara, CA 93106-9510USA
| |
Collapse
|
21
|
Sheth P, Cervantes-Cervantes M, Nagula A, Laing C, Wang JTL. Novel features for identifying A-minors in three-dimensional RNA molecules. Comput Biol Chem 2013; 47:240-5. [PMID: 24211672 DOI: 10.1016/j.compbiolchem.2013.10.004] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2013] [Revised: 10/15/2013] [Accepted: 10/16/2013] [Indexed: 01/08/2023]
Abstract
RNA tertiary interactions or tertiary motifs are conserved structural patterns formed by pairwise interactions between nucleotides. They include base-pairing, base-stacking, and base-phosphate interactions. A-minor motifs are the most common tertiary interactions in the large ribosomal subunit. The A-minor motif is a nucleotide triple in which minor groove edges of an adenine base are inserted into the minor groove of neighboring helices, leading to interaction with a stabilizing base pair. We propose here novel features for identifying and predicting A-minor motifs in a given three-dimensional RNA molecule. By utilizing the features together with machine learning algorithms including random forests and support vector machines, we show experimentally that our approach is capable of predicting A-minor motifs in the given RNA molecule effectively, demonstrating the usefulness of the proposed approach. The techniques developed from this work will be useful for molecular biologists and biochemists to analyze RNA tertiary motifs, specifically A-minor interactions.
Collapse
Affiliation(s)
- Palak Sheth
- Bioinformatics Program, New Jersey Institute of Technology, Newark, NJ 07102, USA
| | | | | | | | | |
Collapse
|
22
|
Petrov AI, Zirbel CL, Leontis NB. Automated classification of RNA 3D motifs and the RNA 3D Motif Atlas. RNA (NEW YORK, N.Y.) 2013; 19:1327-40. [PMID: 23970545 PMCID: PMC3854523 DOI: 10.1261/rna.039438.113] [Citation(s) in RCA: 108] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/17/2013] [Accepted: 06/12/2013] [Indexed: 05/21/2023]
Abstract
The analysis of atomic-resolution RNA three-dimensional (3D) structures reveals that many internal and hairpin loops are modular, recurrent, and structured by conserved non-Watson-Crick base pairs. Structurally similar loops define RNA 3D motifs that are conserved in homologous RNA molecules, but can also occur at nonhomologous sites in diverse RNAs, and which often vary in sequence. To further our understanding of RNA motif structure and sequence variability and to provide a useful resource for structure modeling and prediction, we present a new method for automated classification of internal and hairpin loop RNA 3D motifs and a new online database called the RNA 3D Motif Atlas. To classify the motif instances, a representative set of internal and hairpin loops is automatically extracted from a nonredundant list of RNA-containing PDB files. Their structures are compared geometrically, all-against-all, using the FR3D program suite. The loops are clustered into motif groups, taking into account geometric similarity and structural annotations and making allowance for a variable number of bulged bases. The automated procedure that we have implemented identifies all hairpin and internal loop motifs previously described in the literature. All motif instances and motif groups are assigned unique and stable identifiers and are made available in the RNA 3D Motif Atlas (http://rna.bgsu.edu/motifs), which is automatically updated every four weeks. The RNA 3D Motif Atlas provides an interactive user interface for exploring motif diversity and tools for programmatic data access.
Collapse
Affiliation(s)
- Anton I. Petrov
- Department of Chemistry, Bowling Green State University, Bowling Green, Ohio 43403, USA
| | - Craig L. Zirbel
- Department of Mathematics and Statistics, Bowling Green State University, Bowling Green, Ohio 43403, USA
| | - Neocles B. Leontis
- Department of Chemistry, Bowling Green State University, Bowling Green, Ohio 43403, USA
- Corresponding authorE-mail
| |
Collapse
|
23
|
Shen Y, Wong HS, Zhang S, Zhang L. RNA structural motif recognition based on least-squares distance. RNA (NEW YORK, N.Y.) 2013; 19:1183-1191. [PMID: 23887146 PMCID: PMC3753925 DOI: 10.1261/rna.037648.112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/09/2012] [Accepted: 06/13/2013] [Indexed: 06/02/2023]
Abstract
RNA structural motifs are recurrent structural elements occurring in RNA molecules. RNA structural motif recognition aims to find RNA substructures that are similar to a query motif, and it is important for RNA structure analysis and RNA function prediction. In view of this, we propose a new method known as RNA Structural Motif Recognition based on Least-Squares distance (LS-RSMR) to effectively recognize RNA structural motifs. A test set consisting of five types of RNA structural motifs occurring in Escherichia coli ribosomal RNA is compiled by us. Experiments are conducted for recognizing these five types of motifs. The experimental results fully reveal the superiority of the proposed LS-RSMR compared with four other state-of-the-art methods.
Collapse
Affiliation(s)
- Ying Shen
- School of Software Engineering, Tongji University, Shanghai 200092, China
| | - Hau-San Wong
- Department of Computer Science, City University of Hong Kong, Kowloon, Hong Kong
| | - Shaohong Zhang
- Department of Computer Science, Guangzhou University, Guangzhou 510006, China
| | - Lin Zhang
- School of Software Engineering, Tongji University, Shanghai 200092, China
| |
Collapse
|
24
|
Laborde J, Robinson D, Srivastava A, Klassen E, Zhang J. RNA global alignment in the joint sequence-structure space using elastic shape analysis. Nucleic Acids Res 2013; 41:e114. [PMID: 23585278 PMCID: PMC3675459 DOI: 10.1093/nar/gkt187] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2012] [Revised: 02/26/2013] [Accepted: 02/27/2013] [Indexed: 01/22/2023] Open
Abstract
The functions of RNAs, like proteins, are determined by their structures, which, in turn, are determined by their sequences. Comparison/alignment of RNA molecules provides an effective means to predict their functions and understand their evolutionary relationships. For RNA sequence alignment, most methods developed for protein and DNA sequence alignment can be directly applied. RNA 3-dimensional structure alignment, on the other hand, tends to be more difficult than protein structure alignment due to the lack of regular secondary structures as observed in proteins. Most of the existing RNA 3D structure alignment methods use only the backbone geometry and ignore the sequence information. Using both the sequence and backbone geometry information in RNA alignment may not only produce more accurate classification, but also deepen our understanding of the sequence-structure-function relationship of RNA molecules. In this study, we developed a new RNA alignment method based on elastic shape analysis (ESA). ESA treats RNA structures as three dimensional curves with sequence information encoded on additional dimensions so that the alignment can be performed in the joint sequence-structure space. The similarity between two RNA molecules is quantified by a formal distance, geodesic distance. Based on ESA, a rigorous mathematical framework can be built for RNA structure comparison. Means and covariances of full structures can be defined and computed, and probability distributions on spaces of such structures can be constructed for a group of RNAs. Our method was further applied to predict functions of RNA molecules and showed superior performance compared with previous methods when tested on benchmark datasets. The programs are available at http://stat.fsu.edu/ ∼jinfeng/ESA.html.
Collapse
Affiliation(s)
- Jose Laborde
- Department of Statistics, Florida State University, FL, USA and Department of Mathematics, Florida State University, FL, USA
| | - Daniel Robinson
- Department of Statistics, Florida State University, FL, USA and Department of Mathematics, Florida State University, FL, USA
| | - Anuj Srivastava
- Department of Statistics, Florida State University, FL, USA and Department of Mathematics, Florida State University, FL, USA
| | - Eric Klassen
- Department of Statistics, Florida State University, FL, USA and Department of Mathematics, Florida State University, FL, USA
| | - Jinfeng Zhang
- Department of Statistics, Florida State University, FL, USA and Department of Mathematics, Florida State University, FL, USA
| |
Collapse
|
25
|
Grabow WW, Zhuang Z, Shea JE, Jaeger L. The GA-minor submotif as a case study of RNA modularity, prediction, and design. WILEY INTERDISCIPLINARY REVIEWS-RNA 2013; 4:181-203. [PMID: 23378290 DOI: 10.1002/wrna.1153] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Complex natural RNAs such as the ribosome, group I and group II introns, and RNase P exemplify the fact that three-dimensional (3D) RNA structures are highly modular and hierarchical in nature. Tertiary RNA folding typically takes advantage of a rather limited set of recurrent structural motifs that are responsible for controlling bends or stacks between adjacent helices. Herein, the GA minor and related structural motifs are presented as a case study to highlight several structural and folding principles, to gain further insight into the structural evolution of naturally occurring RNAs, as well as to assist the rational design of artificial RNAs.
Collapse
Affiliation(s)
- Wade W Grabow
- Department of Chemistry and Biochemistry, University of California, Santa Barbara, CA, USA
| | | | | | | |
Collapse
|
26
|
Abstract
The recent discoveries of regulatory non-coding RNAs changed our view of RNA as a simple information transfer molecule. Understanding the architecture and function of active RNA molecules requires methods for comparing and analyzing their 3D structures. While structural alignment of short RNAs is achievable in a reasonable amount of time, large structures represent much bigger challenge. Here, we present the SETTER web server for the RNA structure pairwise comparison utilizing the SETTER (SEcondary sTructure-based TERtiary Structure Similarity Algorithm) algorithm. The SETTER method divides an RNA structure into the set of non-overlapping structural elements called generalized secondary structure units (GSSUs). The SETTER algorithm scales as O(n2) with the size of a GSSUs and as O(n) with the number of GSSUs in the structure. This scaling gives SETTER its high speed as the average size of the GSSU remains constant irrespective of the size of the structure. However, the favorable speed of the algorithm does not compromise its accuracy. The SETTER web server together with the stand-alone implementation of the SETTER algorithm are freely accessible at http://siret.cz/setter.
Collapse
Affiliation(s)
- Petr Cech
- Laboratory of Informatics and Chemistry, Faculty of Chemical Technology, Prague, Czech Republic
| | | | | |
Collapse
|
27
|
Humphris-Narayanan E, Pyle AM. Discrete RNA libraries from pseudo-torsional space. J Mol Biol 2012; 421:6-26. [PMID: 22425640 DOI: 10.1016/j.jmb.2012.03.002] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2011] [Revised: 02/28/2012] [Accepted: 03/06/2012] [Indexed: 11/27/2022]
Abstract
The discovery that RNA molecules can fold into complex structures and carry out diverse cellular roles has led to interest in developing tools for modeling RNA tertiary structure. While significant progress has been made in establishing that the RNA backbone is rotameric, few libraries of discrete conformations specifically for use in RNA modeling have been validated. Here, we present six libraries of discrete RNA conformations based on a simplified pseudo-torsional notation of the RNA backbone, comparable to phi and psi in the protein backbone. We evaluate the ability of each library to represent single nucleotide backbone conformations, and we show how individual library fragments can be assembled into dinucleotides that are consistent with established RNA backbone descriptors spanning from sugar to sugar. We then use each library to build all-atom models of 20 test folds, and we show how the composition of a fragment library can limit model quality. Despite the limitations inherent in using discretized libraries, we find that several hundred discrete fragments can rebuild RNA folds up to 174 nucleotides in length with atomic-level accuracy (<1.5 Å RMSD). We anticipate that the libraries presented here could easily be incorporated into RNA structural modeling, analysis, or refinement tools.
Collapse
|
28
|
Butcher SE, Pyle AM. The molecular interactions that stabilize RNA tertiary structure: RNA motifs, patterns, and networks. Acc Chem Res 2011; 44:1302-11. [PMID: 21899297 DOI: 10.1021/ar200098t] [Citation(s) in RCA: 232] [Impact Index Per Article: 17.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
RNA molecules adopt specific three-dimensional structures critical to their function. Many essential metabolic processes, including protein synthesis and RNA splicing, are carried out by RNA molecules with elaborate tertiary structures (e.g. 3QIQ, right). Indeed, the ribosome and self-splicing introns are complex RNA machines. But even the coding regions in messenger RNAs and viral RNAs are flanked by highly structured untranslated regions, which provide regulatory information necessary for gene expression. RNA tertiary structure is defined as the three-dimensional arrangement of RNA building blocks, which include helical duplexes, triple-stranded structures, and other components that are held together through connections collectively termed RNA tertiary interactions. The structural diversity of these interactions is now a subject of intense investigation, involving the techniques of NMR, X-ray crystallography, chemical genetics, and phylogenetic analysis. At the same time, many investigators are using biophysical techniques to elucidate the driving forces for tertiary structure formation and the mechanisms for its stabilization. RNA tertiary folding is promoted by maximization of base stacking, much like the hydrophobic effect that drives protein folding. RNA folding also requires electrostatic stabilization, both through charge screening and site binding of metals, and it is enhanced by desolvation of the phosphate backbone. In this Account, we provide an overview of the features that specify and stabilize RNA tertiary structure. A major determinant for overall tertiary RNA architecture is local conformation in secondary-structure junctions, which are regions from which two or more duplexes project. At junctions and other structures, such as pseudoknots and kissing loops, adjacent helices stack on one another, and these coaxial stacks play a major role in dictating the overall architectural form of an RNA molecule. In addition to RNA junction topology, a second determinant for RNA tertiary structure is the formation of sequence-specific interactions. Networks of triple helices, tetraloop-receptor interactions, and other sequence-specific contacts establish the framework for the overall tertiary fold. The third determinant of tertiary structure is the formation of stabilizing stacking and backbone interactions, and many are not sequence specific. For example, ribose zippers allow 2'-hydroxyl groups on different RNA strands to form networks of interdigitated hydrogen bonds, serving to seal strands together and thereby stabilize adjacent substructures. These motifs often require monovalent and divalent cations, which can interact diffusely or through chelation to specific RNA functional groups. As we learn more about the components of RNA tertiary structure, we will be able to predict the structures of RNA molecules from their sequences, thereby obtaining key information about biological function. Understanding and predicting RNA structure is particularly important given the recent discovery that although most of our genome is transcribed into RNA molecules, few of them have a known function. The prevalence of RNA viruses and pathogens with RNA genomes makes RNA drug discovery an active area of research. Finally, knowledge of RNA structure will facilitate the engineering of supramolecular RNA structures, which can be used as nanomechanical components for new materials. But all of this promise depends on a better understanding of the RNA parts list, and how the pieces fit together.
Collapse
Affiliation(s)
- Samuel E. Butcher
- Department of Biochemistry, University of Wisconsin—Madison, 433 Babcock
Drive, Madison, Wisconsin 53706-1544, United States
| | - Anna Marie Pyle
- Department of Molecular, Cellular
and Developmental Biology and Department of Chemistry, Yale University, New Haven, Connecticut, United States
- Howard Hughes Medical Institute
| |
Collapse
|
29
|
Zhong C, Zhang S. Clustering RNA structural motifs in ribosomal RNAs using secondary structural alignment. Nucleic Acids Res 2011; 40:1307-17. [PMID: 21976732 PMCID: PMC3273805 DOI: 10.1093/nar/gkr804] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
RNA structural motifs are the building blocks of the complex RNA architecture. Identification of non-coding RNA structural motifs is a critical step towards understanding of their structures and functionalities. In this article, we present a clustering approach for de novo RNA structural motif identification. We applied our approach on a data set containing 5S, 16S and 23S rRNAs and rediscovered many known motifs including GNRA tetraloop, kink-turn, C-loop, sarcin–ricin, reverse kink-turn, hook-turn, E-loop and tandem-sheared motifs, with higher accuracy than the state-of-the-art clustering method. We also identified a number of potential novel instances of GNRA tetraloop, kink-turn, sarcin–ricin and tandem-sheared motifs. More importantly, several novel structural motif families have been revealed by our clustering analysis. We identified a highly asymmetric bulge loop motif that resembles the rope sling. We also found an internal loop motif that can significantly increase the twist of the helix. Finally, we discovered a subfamily of hexaloop motif, which has significantly different geometry comparing to the currently known hexaloop motif. Our discoveries presented in this article have largely increased current knowledge of RNA structural motifs.
Collapse
Affiliation(s)
- Cuncong Zhong
- Department of Electrical Engineering and Computer Science, University of Central Florida, Orlando, FL 32816, USA
| | | |
Collapse
|
30
|
Shen Y, Wong HS, Zhang S, Yu Z. Feature-based 3D motif filtering for ribosomal RNA. ACTA ACUST UNITED AC 2011; 27:2828-35. [PMID: 21873638 DOI: 10.1093/bioinformatics/btr495] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION RNA 3D motifs are recurrent substructures in an RNA subunit and are building blocks of the RNA architecture. They play an important role in binding proteins and consolidating RNA tertiary structures. RNA 3D motif searching consists of two steps: candidate generation and candidate filtering. We proposed a novel method, known as Feature-based RNA Motif Filtering (FRMF), for identifying motifs based on a set of moment invariants and the Earth Mover's Distance in the second step. RESULTS A positive set of RNA motifs belonging to six characteristic types, with eight subtypes occurring in HM 50S, is compiled by us. The proposed method is validated on this representative set. FRMF successfully finds most of the positive fragments. Besides the proposed new method and the compiled positive set, we also recognize some new motifs, in particular a π-turn and some non-standard A-minor motifs are found. These newly discovered motifs provide more information about RNA structure conformation. AVAILABILITY Matlab code can be downloaded from www.cs.cityu.edu.hk/~yingshen/FRMF.html CONTACT cshswong@cityu.edu.hk SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Ying Shen
- Department of Computer Science, City University of Hong Kong, Kowloon, Hong Kong
| | | | | | | |
Collapse
|
31
|
Pietropaolo A, Parrinello M. A quantitative measure of chirality inside nucleic acid databank. Chirality 2011; 23:534-42. [PMID: 21618614 DOI: 10.1002/chir.20961] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2010] [Accepted: 02/24/2011] [Indexed: 11/11/2022]
Abstract
We show the capability of a chirality index (Pietropaolo et al., Proteins 2008;70:667-677) to investigate nucleic acid structures because of its high sensitivity to helical conformations. By analyzing selected structures of DNA and RNA, we have found that sequences rich in cytosine and guanine have a tendency to left-handed chirality, in contrast to regions rich in adenine or thymine which show strong negative, right-handed, chirality values. We also analyze RNA structures, where specific loops and hairpin motifs are characterized by a well-defined chirality value. We find that in nucleosome the chirality is exalted, whereas in ribosome it is reduced. Our results illustrate the sensitivity of this descriptor for nucleic acid conformations.
Collapse
Affiliation(s)
- Adriana Pietropaolo
- Computational Science, Department of Chemistry and Applied Biosciences, ETH Zürich, USI Campus, Lugano, Switzerland.
| | | |
Collapse
|
32
|
Abstract
Unlike proteins, the RNA backbone has numerous degrees of freedom (eight, if one counts the sugar pucker), making RNA modeling, structure building and prediction a multidimensional problem of exceptionally high complexity. And yet RNA tertiary structures are not infinite in their structural morphology; rather, they are built from a limited set of discrete units. In order to reduce the dimensionality of the RNA backbone in a physically reasonable way, a shorthand notation was created that reduced the RNA backbone torsion angles to two (η and θ, analogous to φ and ψ in proteins). When these torsion angles are calculated for nucleotides in a crystallographic database and plotted against one another, one obtains a plot analogous to a Ramachandran plot (the η/θ plot), with highly populated and unpopulated regions. Nucleotides that occupy proximal positions on the plot have identical structures and are found in the same units of tertiary structure. In this review, we describe the statistical validation of the η/θ formalism and the exploration of features within the η/θ plot. We also describe the application of the η/θ formalism in RNA motif discovery, structural comparison, RNA structure building and tertiary structure prediction. More than a tool, however, the η/θ formalism has provided new insights into RNA structure itself, revealing its fundamental components and the factors underlying RNA architectural form.
Collapse
|
33
|
Zhong C, Tang H, Zhang S. RNAMotifScan: automatic identification of RNA structural motifs using secondary structural alignment. Nucleic Acids Res 2010; 38:e176. [PMID: 20696653 PMCID: PMC2952876 DOI: 10.1093/nar/gkq672] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Recent studies have shown that RNA structural motifs play essential roles in RNA folding and interaction with other molecules. Computational identification and analysis of RNA structural motifs remains a challenging task. Existing motif identification methods based on 3D structure may not properly compare motifs with high structural variations. Other structural motif identification methods consider only nested canonical base-pairing structures and cannot be used to identify complex RNA structural motifs that often consist of various non-canonical base pairs due to uncommon hydrogen bond interactions. In this article, we present a novel RNA structural alignment method for RNA structural motif identification, RNAMotifScan, which takes into consideration the isosteric (both canonical and non-canonical) base pairs and multi-pairings in RNA structural motifs. The utility and accuracy of RNAMotifScan is demonstrated by searching for kink-turn, C-loop, sarcin-ricin, reverse kink-turn and E-loop motifs against a 23S rRNA (PDBid: 1S72), which is well characterized for the occurrences of these motifs. Finally, we search these motifs against the RNA structures in the entire Protein Data Bank and the abundances of them are estimated. RNAMotifScan is freely available at our supplementary website (http://genome.ucf.edu/RNAMotifScan).
Collapse
Affiliation(s)
- Cuncong Zhong
- School of Electrical Engineering and Computer Science, University of Central Florida, Orlando, FL 32816, USA
| | | | | |
Collapse
|
34
|
Kirillova S, Tosatto SCE, Carugo O. FRASS: the web-server for RNA structural comparison. BMC Bioinformatics 2010; 11:327. [PMID: 20553602 PMCID: PMC2902451 DOI: 10.1186/1471-2105-11-327] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2010] [Accepted: 06/16/2010] [Indexed: 11/24/2022] Open
Abstract
Background The impressive increase of novel RNA structures, during the past few years, demands automated methods for structure comparison. While many algorithms handle only small motifs, few techniques, developed in recent years, (ARTS, DIAL, SARA, SARSA, and LaJolla) are available for the structural comparison of large and intact RNA molecules. Results The FRASS web-server represents a RNA chain with its Gauss integrals and allows one to compare structures of RNA chains and to find similar entries in a database derived from the Protein Data Bank. We observed that FRASS scores correlate well with the ARTS and LaJolla similarity scores. Moreover, the-web server can also reproduce satisfactorily the DARTS classification of RNA 3D structures and the classification of the SCOR functions that was obtained by the SARA method. Conclusions The FRASS web-server can be easily used to detect relationships among RNA molecules and to scan efficiently the rapidly enlarging structural databases.
Collapse
Affiliation(s)
- Svetlana Kirillova
- Department of Structural and Computational Biology, Max F Perutz Laboratories, Vienna University, Campus Vienna Biocenter 5, A-1030 Vienna, Austria.
| | | | | |
Collapse
|
35
|
Abstract
Structural 3D motifs in RNA play an important role in the RNA stability and function. Previous studies have focused on the characterization and discovery of 3D motifs in RNA secondary and tertiary structures. However, statistical analyses of the distribution of 3D motifs along the RNA appear to be lacking. Herein, we present a novel strategy for evaluating the distribution of 3D motifs along the RNA chain and those motifs whose distributions are significantly non-random are identified. By applying it to the X-ray structure of the large ribosomal subunit from Haloarcula marismortui, helical motifs were found to cluster together along the chain and in the 3D structure, whereas the known tetraloops tend to be sequentially and spatially dispersed. That the distribution of key structural motifs such as tetraloops differ significantly from a random one suggests that our method could also be used to detect novel 3D motifs of any size in sufficiently long/large RNA structures. The motif distribution type can help in the prediction and design of 3D structures of large RNA molecules.
Collapse
Affiliation(s)
- Karen Sargsyan
- Institute of Biomedical Sciences, Academia Sinica, Taipei 115, Taiwan
| | | |
Collapse
|
36
|
Apostolico A, Ciriello G, Guerra C, Heitsch CE, Hsiao C, Williams LD. Finding 3D motifs in ribosomal RNA structures. Nucleic Acids Res 2009; 37:e29. [PMID: 19158187 PMCID: PMC2651809 DOI: 10.1093/nar/gkn1044] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
The identification of small structural motifs and their organization into larger subassemblies is of fundamental interest in the analysis, prediction and design of 3D structures of large RNAs. This problem has been studied only sparsely, as most of the existing work is limited to the characterization and discovery of motifs in RNA secondary structures. We present a novel geometric method for the characterization and identification of structural motifs in 3D rRNA molecules. This method enables the efficient recognition of known 3D motifs, such as tetraloops, E-loops, kink-turns and others. Furthermore, it provides a new way of characterizing complex 3D motifs, notably junctions, that have been defined and identified in the secondary structure but have not been analyzed and classified in three dimensions. We demonstrate the relevance and utility of our approach by applying it to the Haloarcula marismortui large ribosomal unit. Pending the implementation of a dedicated web server, the code accompanying this article, written in JAVA, is available upon request from the contact author.
Collapse
Affiliation(s)
- Alberto Apostolico
- College of Computing, Georgia Institute of Technology, Atlanta, GA 30332-0280, USA
| | | | | | | | | | | |
Collapse
|
37
|
Djelloul M, Denise A. Automated motif extraction and classification in RNA tertiary structures. RNA (NEW YORK, N.Y.) 2008; 14:2489-2497. [PMID: 18957493 PMCID: PMC2590963 DOI: 10.1261/rna.1061108] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/05/2008] [Accepted: 08/15/2008] [Indexed: 05/27/2023]
Abstract
We used a novel graph-based approach to extract RNA tertiary motifs. We cataloged them all and clustered them using an innovative graph similarity measure. We applied our method to three widely studied structures: Haloarcula marismortui 50S (H.m 50S), Escherichia coli 50S (E. coli 50S), and Thermus thermophilus 16S (T.th 16S) RNAs. We identified 10 known motifs without any prior knowledge of their shapes or positions. We additionally identified four putative new motifs.
Collapse
Affiliation(s)
- Mahassine Djelloul
- Laboratoire de Recherche en Informatique, Université Paris-Sud 11 and CNRS, 91405 Orsay Cedex, France
| | | |
Collapse
|
38
|
Abraham M, Dror O, Nussinov R, Wolfson HJ. Analysis and classification of RNA tertiary structures. RNA (NEW YORK, N.Y.) 2008; 14:2274-89. [PMID: 18824509 PMCID: PMC2578864 DOI: 10.1261/rna.853208] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/10/2007] [Accepted: 07/05/2008] [Indexed: 05/19/2023]
Abstract
There is a fast growing interest in noncoding RNA transcripts. These transcripts are not translated into proteins, but play essential roles in many cellular and pathological processes. Recent efforts toward comprehension of their function has led to a substantial increase in both the number and the size of solved RNA structures. With the aim of addressing questions relating to RNA structural diversity, we examined RNA conservation at three structural levels: primary, secondary, and tertiary structure. Additionally, we developed an automated method for classifying RNA structures based on spatial (three-dimensional [3D]) similarity. Applying the method to all solved RNA structures resulted in a classified database of RNA tertiary structures (DARTS). DARTS embodies 1333 solved RNA structures classified into 94 clusters. The classification is hierarchical, reflecting the structural relationship between and within clusters. We also developed an application for searching DARTS with a new structure. The search is fast and its performance was successfully tested on all solved RNA structures since the creation of DARTS. A user-friendly interface for both the database and the search application is available online. We show intracluster and intercluster similarities in DARTS and demonstrate the usefulness of the search application. The analysis reveals the current structural repertoire of RNA and exposes common global folds and local tertiary motifs. Further study of these conserved substructures may suggest possible RNA domains and building blocks. This should be beneficial for structure prediction and for gaining insights into structure-function relationships.
Collapse
Affiliation(s)
- Mira Abraham
- School of Computer Science, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel Aviv 69978, Israel
| | | | | | | |
Collapse
|
39
|
Abstract
MOTIVATION The recent discovery of tiny RNA molecules such as microRNAs and small interfering RNA are transforming the view of RNA as a simple information transfer molecule. Similar to proteins, the native three-dimensional structure of RNA determines its biological activity. Therefore, classifying the current structural space is paramount for functionally annotating RNA molecules. The increasing numbers of RNA structures deposited in the PDB requires more accurate, automatic and benchmarked methods for RNA structure comparison. In this article, we introduce a new algorithm for RNA structure alignment based on a unit-vector approach. The algorithm has been implemented in the SARA program, which results in RNA structure pairwise alignments and their statistical significance. RESULTS The SARA program has been implemented to be of general applicability even when no secondary structure can be calculated from the RNA structures. A benchmark against the ARTS program using a set of 1275 non-redundant pairwise structure alignments results in inverted approximately 6% extra alignments with at least 50% structurally superposed nucleotides and base pairs. A first attempt to perform RNA automatic functional annotation based on structure alignments indicates that SARA can correctly assign the deepest SCOR classification to >60% of the query structures. AVAILABILITY The SARA program is freely available through a World Wide Web server http://sgu.bioinfo.cipf.es/services/SARA/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Emidio Capriotti
- Bioinformatics and Genomics Department, Structural Genomics Unit, Centro de Investigación Príncipe Felipe, Valencia, Spain
| | | |
Collapse
|
40
|
Richardson JS, Schneider B, Murray LW, Kapral GJ, Immormino RM, Headd JJ, Richardson DC, Ham D, Hershkovits E, Williams LD, Keating KS, Pyle AM, Micallef D, Westbrook J, Berman HM. RNA backbone: consensus all-angle conformers and modular string nomenclature (an RNA Ontology Consortium contribution). RNA (NEW YORK, N.Y.) 2008; 14:465-81. [PMID: 18192612 PMCID: PMC2248255 DOI: 10.1261/rna.657708] [Citation(s) in RCA: 190] [Impact Index Per Article: 11.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/24/2007] [Accepted: 10/29/2007] [Indexed: 05/03/2023]
Abstract
A consensus classification and nomenclature are defined for RNA backbone structure using all of the backbone torsion angles. By a consensus of several independent analysis methods, 46 discrete conformers are identified as suitably clustered in a quality-filtered, multidimensional dihedral angle distribution. Most of these conformers represent identifiable features or roles within RNA structures. The conformers are given two-character names that reflect the seven-angle delta epsilon zeta alpha beta gamma delta combinations empirically found favorable for the sugar-to-sugar "suite" unit within which the angle correlations are strongest (e.g., 1a for A-form, 5z for the start of S-motifs). Since the half-nucleotides are specified by a number for delta epsilon zeta and a lowercase letter for alpha beta gamma delta, this modular system can also be parsed to describe traditional nucleotide units (e.g., a1) or the dinucleotides (e.g., a1a1) that are especially useful at the level of crystallographic map fitting. This nomenclature can also be written as a string with two-character suite names between the uppercase letters of the base sequence (N1aG1gN1aR1aA1cN1a for a GNRA tetraloop), facilitating bioinformatic comparisons. Cluster means, standard deviations, coordinates, and examples are made available, as well as the Suitename software that assigns suite conformer names and conformer match quality (suiteness) from atomic coordinates. The RNA Ontology Consortium will combine this new backbone system with others that define base pairs, base-stacking, and hydrogen-bond relationships to provide a full description of RNA structural motifs.
Collapse
Affiliation(s)
- Jane S Richardson
- Department of Biochemistry, Duke University Medical Center, Durham, North Carolina, 27710-3711, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
41
|
Sarver M, Zirbel CL, Stombaugh J, Mokdad A, Leontis NB. FR3D: finding local and composite recurrent structural motifs in RNA 3D structures. J Math Biol 2007; 56:215-52. [PMID: 17694311 PMCID: PMC2837920 DOI: 10.1007/s00285-007-0110-x] [Citation(s) in RCA: 192] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2006] [Revised: 06/30/2006] [Indexed: 11/26/2022]
Abstract
New methods are described for finding recurrent three-dimensional (3D) motifs in RNA atomic-resolution structures. Recurrent RNA 3D motifs are sets of RNA nucleotides with similar spatial arrangements. They can be local or composite. Local motifs comprise nucleotides that occur in the same hairpin or internal loop. Composite motifs comprise nucleotides belonging to three or more different RNA strand segments or molecules. We use a base-centered approach to construct efficient, yet exhaustive search procedures using geometric, symbolic, or mixed representations of RNA structure that we implement in a suite of MATLAB programs, "Find RNA 3D" (FR3D). The first modules of FR3D preprocess structure files to classify base-pair and -stacking interactions. Each base is represented geometrically by the position of its glycosidic nitrogen in 3D space and by the rotation matrix that describes its orientation with respect to a common frame. Base-pairing and base-stacking interactions are calculated from the base geometries and are represented symbolically according to the Leontis/Westhof basepairing classification, extended to include base-stacking. These data are stored and used to organize motif searches. For geometric searches, the user supplies the 3D structure of a query motif which FR3D uses to find and score geometrically similar candidate motifs, without regard to the sequential position of their nucleotides in the RNA chain or the identity of their bases. To score and rank candidate motifs, FR3D calculates a geometric discrepancy by rigidly rotating candidates to align optimally with the query motif and then comparing the relative orientations of the corresponding bases in the query and candidate motifs. Given the growing size of the RNA structure database, it is impossible to explicitly compute the discrepancy for all conceivable candidate motifs, even for motifs with less than ten nucleotides. The screening algorithm that we describe finds all candidate motifs whose geometric discrepancy with respect to the query motif falls below a user-specified cutoff discrepancy. This technique can be applied to RMSD searches. Candidate motifs identified geometrically may be further screened symbolically to identify those that contain particular basepair types or base-stacking arrangements or that conform to sequence continuity or nucleotide identity constraints. Purely symbolic searches for motifs containing user-defined sequence, continuity and interaction constraints have also been implemented. We demonstrate that FR3D finds all occurrences, both local and composite and with nucleotide substitutions, of sarcin/ricin and kink-turn motifs in the 23S and 5S ribosomal RNA 3D structures of the H. marismortui 50S ribosomal subunit and assigns the lowest discrepancy scores to bona fide examples of these motifs. The search algorithms have been optimized for speed to allow users to search the non-redundant RNA 3D structure database on a personal computer in a matter of minutes.
Collapse
Affiliation(s)
- Michael Sarver
- Department of Mathematics and Statistics, Bowling Green State University, Bowling Green, OH 43403, USA
| | - Craig L. Zirbel
- Department of Mathematics and Statistics, Bowling Green State University, Bowling Green, OH 43403, USA
| | - Jesse Stombaugh
- Department of Chemistry, Bowling Green State University, Bowling Green, OH 43403, USA
| | - Ali Mokdad
- Department of Chemistry, Bowling Green State University, Bowling Green, OH 43403, USA
| | - Neocles B. Leontis
- Department of Chemistry, Bowling Green State University, Bowling Green, OH 43403, USA
| |
Collapse
|
42
|
Abstract
The world of regulatory RNAs is fast expanding into mainstream molecular biology as both a subject of intense mechanistic study and as a tool for functional characterization. The RNA world is one of complex structures that carry out catalysis, sense metabolites and synthesize proteins. The dynamic and structural nature of RNAs presents a whole new set of informatics challenges to the computational community. The ability to relate structure and dynamics to function will be key to understanding this complex world. I review several important classes of structured RNAs that present our community with a series of biologically novel informatics challenges. I also review available informatics tools that have been recently developed in the field.
Collapse
Affiliation(s)
- Alain Laederach
- Department of Genetics, 300 Pasteur Drive, Stanford University, Stanford, CA 94305, USA.
| |
Collapse
|
43
|
Wang X, Huan J, Snoeyink JS, Wang W. Mining RNA Tertiary Motifs with Structure Graphs. ACTA ACUST UNITED AC 2007. [DOI: 10.1109/ssdbm.2007.38] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
44
|
Wadley LM, Keating KS, Duarte CM, Pyle AM. Evaluating and learning from RNA pseudotorsional space: quantitative validation of a reduced representation for RNA structure. J Mol Biol 2007; 372:942-957. [PMID: 17707400 PMCID: PMC2720064 DOI: 10.1016/j.jmb.2007.06.058] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2007] [Revised: 06/18/2007] [Accepted: 06/18/2007] [Indexed: 10/23/2022]
Abstract
Quantitatively describing RNA structure and conformational elements remains a formidable problem. Seven standard torsion angles and the sugar pucker are necessary to characterize the conformation of an RNA nucleotide completely. Progress has been made toward understanding the discrete nature of RNA structure, but classifying simple and ubiquitous structural elements such as helices and motifs remains a difficult task. One approach for describing RNA structure in a simple, mathematically consistent, and computationally accessible manner involves the invocation of two pseudotorsions, eta (C4'(n-1), P(n), C4'(n), P(n+1)) and theta (P(n), C4'(n), P(n+1), C4'(n+1)), which can be used to describe RNA conformation in much the same way that varphi and psi are used to describe backbone configuration of proteins. Here, we conduct an exploration and statistical evaluation of pseudotorsional space and of the Ramachandran-like eta-theta plot. We show that, through the rigorous quantitative analysis of the eta-theta plot, the pseudotorsional descriptors eta and theta, together with sugar pucker, are sufficient to describe RNA backbone conformation fully in most cases. These descriptors are also shown to contain considerable information about nucleotide base conformation, revealing a previously uncharacterized interplay between backbone and base orientation. A window function analysis is used to discern statistically relevant regions of density in the eta-theta scatter plot and then nucleotides in colocalized clusters in the eta-theta plane are shown to have similar 3-D structures through RMSD analysis of the RNA structural constituents. We find that major clusters in the eta-theta plot are few, underscoring the discrete nature of RNA backbone conformation. Like the Ramachandran plot, the eta-theta plot is a valuable system for conceptualizing biomolecular conformation, it is a useful tool for analyzing RNA tertiary structures, and it is a vital component of new approaches for solving the 3-D structures of large RNA molecules and RNA assemblies.
Collapse
Affiliation(s)
- Leven M Wadley
- Department of Physics, Columbia University, New York, NY 10027, USA
| | - Kevin S Keating
- Interdepartmental Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
| | - Carlos M Duarte
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA
| | - Anna Marie Pyle
- Department of Molecular Biophysics and Biochemistry, Yale University, 266 Whitney Avenue, New Haven, CT 06520, USA; Howard Hughes Medical Institute, Yale University, 266 Whitney Avenue, New Haven, CT 06520, USA.
| |
Collapse
|
45
|
Ferrè F, Ponty Y, Lorenz WA, Clote P. DIAL: a web server for the pairwise alignment of two RNA three-dimensional structures using nucleotide, dihedral angle and base-pairing similarities. Nucleic Acids Res 2007; 35:W659-68. [PMID: 17567620 PMCID: PMC1933154 DOI: 10.1093/nar/gkm334] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open
Abstract
DIAL (dihedral alignment) is a web server that provides public access to a new dynamic programming algorithm for pairwise 3D structural alignment of RNA. DIAL achieves quadratic time by performing an alignment that accounts for (i) pseudo-dihedral and/or dihedral angle similarity, (ii) nucleotide sequence similarity and (iii) nucleotide base-pairing similarity. DIAL provides access to three alignment algorithms: global (Needleman–Wunsch), local (Smith–Waterman) and semiglobal (modified to yield motif search). Suboptimal alignments are optionally returned, and also Boltzmann pair probabilities Pr(ai,bj) for aligned positions ai , bj from the optimal alignment. If a non-zero suboptimal alignment score ratio is entered, then the semiglobal alignment algorithm may be used to detect structurally similar occurrences of a user-specified 3D motif. The query motif may be contiguous in the linear chain or fragmented in a number of noncontiguous regions. The DIAL web server provides graphical output which allows the user to view, rotate and enlarge the 3D superposition for the optimal (and suboptimal) alignment of query to target. Although graphical output is available for all three algorithms, the semiglobal motif search may be of most interest in attempts to identify RNA motifs. DIAL is available at http://bioinformatics.bc.edu/clotelab/DIAL.
Collapse
Affiliation(s)
- F. Ferrè
- Harvard Medical School, Children's Hospital, Hematology/Oncology Department, Boston, MA 02115 and Department of Biology, Boston College, Chestnut Hill, MA 02467, USA
| | - Y. Ponty
- Harvard Medical School, Children's Hospital, Hematology/Oncology Department, Boston, MA 02115 and Department of Biology, Boston College, Chestnut Hill, MA 02467, USA
| | - W. A. Lorenz
- Harvard Medical School, Children's Hospital, Hematology/Oncology Department, Boston, MA 02115 and Department of Biology, Boston College, Chestnut Hill, MA 02467, USA
| | - Peter Clote
- Harvard Medical School, Children's Hospital, Hematology/Oncology Department, Boston, MA 02115 and Department of Biology, Boston College, Chestnut Hill, MA 02467, USA
- *To whom correspondence should be addressed. +1 617 552 1332+1 617 552 2011
| |
Collapse
|
46
|
Corral E, Kooijman H, Spek AL, Reedijk J. Nucleic acids in two dimensions: layers of base pairs linked by carboxylate. NEW J CHEM 2007. [DOI: 10.1039/b613845d] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
47
|
Tang CL, Alexov E, Pyle AM, Honig B. Calculation of pKas in RNA: on the structural origins and functional roles of protonated nucleotides. J Mol Biol 2006; 366:1475-96. [PMID: 17223134 DOI: 10.1016/j.jmb.2006.12.001] [Citation(s) in RCA: 118] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2006] [Revised: 11/29/2006] [Accepted: 12/01/2006] [Indexed: 12/01/2022]
Abstract
pK(a) calculations based on the Poisson-Boltzmann equation have been widely used to study proteins and, more recently, DNA. However, much less attention has been paid to the calculation of pK(a) shifts in RNA. There is accumulating evidence that protonated nucleotides can stabilize RNA structure and participate in enzyme catalysis within ribozymes. Here, we calculate the pK(a) shifts of nucleotides in RNA structures using numerical solutions to the Poisson-Boltzmann equation. We find that significant shifts are predicted for several nucleotides in two catalytic RNAs, the hairpin ribozyme and the hepatitis delta virus ribozyme, and that the shifts are likely to be related to their functions. We explore how different structural environments shift the pK(a)s of nucleotides from their solution values. RNA structures appear to use two basic strategies to shift pK(a)s: (a) the formation of compact structural motifs with structurally-conserved, electrostatic interactions; and (b) the arrangement of the phosphodiester backbone to focus negative electrostatic potential in specific regions.
Collapse
Affiliation(s)
- Christopher L Tang
- Howard Hughes Medical Institute, Center for Computational Biology and Bioinformatics, Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
| | | | | | | |
Collapse
|
48
|
Dror O, Nussinov R, Wolfson HJ. The ARTS web server for aligning RNA tertiary structures. Nucleic Acids Res 2006; 34:W412-5. [PMID: 16845038 PMCID: PMC1538835 DOI: 10.1093/nar/gkl312] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2006] [Revised: 03/06/2006] [Accepted: 04/11/2006] [Indexed: 11/12/2022] Open
Abstract
RNA molecules with common structural features may share similar functional properties. Structural comparison of RNAs and detection of common substructures is, thus, a highly important task. Nevertheless, the current available tools in the RNA community provide only a partial solution, since they either work at the 2D level or are suitable for detecting predefined or local contiguous tertiary motifs only. Here, we describe a web server built around ARTS, a method for aligning tertiary structures of nucleic acids (both RNA and DNA). ARTS receives a pair of 3D nucleic acid structures and searches for a priori unknown common substructures. The search is truly 3D and irrespective of the order of the nucleotides on the chain. The identified common substructures can be large global folds with hundreds and even thousands of nucleotides as well as small local motifs with at least two successive base pairs. The method is highly efficient and has been used to conduct an all-against-all comparison of all the RNA structures in the Protein Data Bank. The web server together with a software package for download are freely accessible at http://bioinfo3d.cs.tau.ac.il/ARTS.
Collapse
Affiliation(s)
- Oranit Dror
- School of Computer Science, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel Aviv 69978, Israel.
| | | | | |
Collapse
|
49
|
Leontis NB, Lescoute A, Westhof E. The building blocks and motifs of RNA architecture. Curr Opin Struct Biol 2006; 16:279-87. [PMID: 16713707 PMCID: PMC4857889 DOI: 10.1016/j.sbi.2006.05.009] [Citation(s) in RCA: 251] [Impact Index Per Article: 13.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2006] [Revised: 04/12/2006] [Accepted: 05/10/2006] [Indexed: 10/24/2022]
Abstract
RNA motifs can be defined broadly as recurrent structural elements containing multiple intramolecular RNA-RNA interactions, as observed in atomic-resolution RNA structures. They constitute the modular building blocks of RNA architecture, which is organized hierarchically. Recent work has focused on analyzing RNA backbone conformations to identify, define and search for new instances of recurrent motifs in X-ray structures. One current view asserts that recurrent RNA strand segments with characteristic backbone configurations qualify as independent motifs. Other considerations indicate that, to characterize modular motifs, one must take into account the larger structural context of such strand segments. This follows the biologically relevant motivation, which is to identify RNA structural characteristics that are subject to sequence constraints and that thus relate RNA architectures to sequences.
Collapse
Affiliation(s)
- Neocles B Leontis
- Department of Chemistry and Center for Biomolecular Sciences, Bowling Green State University, Bowling Green, OH 43402, USA
| | | | | |
Collapse
|
50
|
Leontis NB, Altman RB, Berman HM, Brenner SE, Brown JW, Engelke DR, Harvey SC, Holbrook SR, Jossinet F, Lewis SE, Major F, Mathews DH, Richardson JS, Williamson JR, Westhof E. The RNA Ontology Consortium: an open invitation to the RNA community. RNA (NEW YORK, N.Y.) 2006; 12:533-41. [PMID: 16484377 PMCID: PMC1421088 DOI: 10.1261/rna.2343206] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]
Abstract
The aim of the RNA Ontology Consortium (ROC) is to create an integrated conceptual framework-an RNA Ontology (RO)-with a common, dynamic, controlled, and structured vocabulary to describe and characterize RNA sequences, secondary structures, three-dimensional structures, and dynamics pertaining to RNA function. The RO should produce tools for clear communication about RNA structure and function for multiple uses, including the integration of RNA electronic resources into the Semantic Web. These tools should allow the accurate description in computer-interpretable form of the coupling between RNA architecture, function, and evolution. The purposes for creating the RO are, therefore, (1) to integrate sequence and structural databases; (2) to allow different computational tools to interoperate; (3) to create powerful software tools that bring advanced computational methods to the bench scientist; and (4) to facilitate precise searches for all relevant information pertaining to RNA. For example, one initial objective of the ROC is to define, identify, and classify RNA structural motifs described in the literature or appearing in databases and to agree on a computer-interpretable definition for each of these motifs. To achieve these aims, the ROC will foster communication and promote collaboration among RNA scientists by coordinating frequent face-to-face workshops to discuss, debate, and resolve difficult conceptual issues. These meeting opportunities will create new directions at various levels of RNA research. The ROC will work closely with the PDB/NDB structural databases and the Gene, Sequence, and Open Biomedical Ontology Consortia to integrate the RO with existing biological ontologies to extend existing content while maintaining interoperability.
Collapse
|