1
|
Bohdan DR, Voronina VV, Bujnicki JM, Baulin EF. A comprehensive survey of long-range tertiary interactions and motifs in non-coding RNA structures. Nucleic Acids Res 2023; 51:8367-8382. [PMID: 37471030 PMCID: PMC10484739 DOI: 10.1093/nar/gkad605] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Accepted: 07/07/2023] [Indexed: 07/21/2023] Open
Abstract
Understanding the 3D structure of RNA is key to understanding RNA function. RNA 3D structure is modular and can be seen as a composition of building blocks of various sizes called tertiary motifs. Currently, long-range motifs formed between distant loops and helical regions are largely less studied than the local motifs determined by the RNA secondary structure. We surveyed long-range tertiary interactions and motifs in a non-redundant set of non-coding RNA 3D structures. A new dataset of annotated LOng-RAnge RNA 3D modules (LORA) was built using an approach that does not rely on the automatic annotations of non-canonical interactions. An original algorithm, ARTEM, was developed for annotation-, sequence- and topology-independent superposition of two arbitrary RNA 3D modules. The proposed methods allowed us to identify and describe the most common long-range RNA tertiary motifs. Along with the prevalent canonical A-minor interactions, a large number of previously undescribed staple interactions were observed. The most frequent long-range motifs were found to belong to three main motif families: planar staples, tilted staples, and helical packing motifs.
Collapse
Affiliation(s)
- Davyd R Bohdan
- Department of Innovation and High Technology, Moscow Institute of Physics and Technology, Dolgoprudny 141701, Russia
| | - Valeria V Voronina
- Department of Information Systems, Ulyanovsk State Technical University, Ulyanovsk 432027, Russia
| | - Janusz M Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw 02-109, Poland
| | - Eugene F Baulin
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw 02-109, Poland
| |
Collapse
|
2
|
Abstract
RNA molecules are highly modular components that can be used in a variety of contexts for building new metabolic, regulatory and genetic circuits in cells. The majority of synthetic RNA systems to date predominately rely on two-dimensional modularity. However, a better understanding and integration of three-dimensional RNA modularity at structural and functional levels is critical to the development of more complex, functional bio-systems and molecular machines for synthetic biology applications.
Collapse
Affiliation(s)
- Wade Grabow
- Department of Chemistry and Biochemistry, Seattle Pacific University3307 Third Avenue West, Seattle, WA 98119USA
| | - Luc Jaeger
- Department of Chemistry and Biochemistry, Bio-Molecular Science and Engineering Program, University of CaliforniaSanta Barbara, CA 93106-9510USA
| |
Collapse
|
3
|
Shen Y, Wong HS, Zhang S, Zhang L. RNA structural motif recognition based on least-squares distance. RNA (NEW YORK, N.Y.) 2013; 19:1183-1191. [PMID: 23887146 PMCID: PMC3753925 DOI: 10.1261/rna.037648.112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/09/2012] [Accepted: 06/13/2013] [Indexed: 06/02/2023]
Abstract
RNA structural motifs are recurrent structural elements occurring in RNA molecules. RNA structural motif recognition aims to find RNA substructures that are similar to a query motif, and it is important for RNA structure analysis and RNA function prediction. In view of this, we propose a new method known as RNA Structural Motif Recognition based on Least-Squares distance (LS-RSMR) to effectively recognize RNA structural motifs. A test set consisting of five types of RNA structural motifs occurring in Escherichia coli ribosomal RNA is compiled by us. Experiments are conducted for recognizing these five types of motifs. The experimental results fully reveal the superiority of the proposed LS-RSMR compared with four other state-of-the-art methods.
Collapse
Affiliation(s)
- Ying Shen
- School of Software Engineering, Tongji University, Shanghai 200092, China
| | - Hau-San Wong
- Department of Computer Science, City University of Hong Kong, Kowloon, Hong Kong
| | - Shaohong Zhang
- Department of Computer Science, Guangzhou University, Guangzhou 510006, China
| | - Lin Zhang
- School of Software Engineering, Tongji University, Shanghai 200092, China
| |
Collapse
|
4
|
Grabow WW, Zhuang Z, Shea JE, Jaeger L. The GA-minor submotif as a case study of RNA modularity, prediction, and design. WILEY INTERDISCIPLINARY REVIEWS-RNA 2013; 4:181-203. [PMID: 23378290 DOI: 10.1002/wrna.1153] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Complex natural RNAs such as the ribosome, group I and group II introns, and RNase P exemplify the fact that three-dimensional (3D) RNA structures are highly modular and hierarchical in nature. Tertiary RNA folding typically takes advantage of a rather limited set of recurrent structural motifs that are responsible for controlling bends or stacks between adjacent helices. Herein, the GA minor and related structural motifs are presented as a case study to highlight several structural and folding principles, to gain further insight into the structural evolution of naturally occurring RNAs, as well as to assist the rational design of artificial RNAs.
Collapse
Affiliation(s)
- Wade W Grabow
- Department of Chemistry and Biochemistry, University of California, Santa Barbara, CA, USA
| | | | | | | |
Collapse
|
5
|
Boutorine YI, Steinberg SV. Twist-joints and double twist-joints in RNA structure. RNA (NEW YORK, N.Y.) 2012; 18:2287-98. [PMID: 23060425 PMCID: PMC3504679 DOI: 10.1261/rna.030940.111] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
Analysis of available RNA crystal structures has allowed us to identify a new family of RNA arrangements that we call double twist-joints, or DTJs. Each DTJ is composed of a double helix that contains two bulges incorporated into different strands and separated from each other by 2 or 3 bp. At each bulge, the double helix is over-twisted, while the unpaired nucleotides of both bulges form a complex network of stacking and hydrogen-bonding with nucleotides of helical regions. In total, we identified 14 DTJ cases, which can be combined in three groups based on common structural characteristics. One DTJ is found in a functional center of the ribosome, another DTJ mediates binding of the pre-tRNA to the RNase P, and two more DTJs form the sensing domains in the glycine riboswitch.
Collapse
Affiliation(s)
- Yury I. Boutorine
- Département de Biochimie, Université de Montréal, Montréal, Quebec H3C 3J7, Canada
| | - Sergey V. Steinberg
- Département de Biochimie, Université de Montréal, Montréal, Quebec H3C 3J7, Canada
- Corresponding authorE-mail
| |
Collapse
|
6
|
Grabow WW, Zhuang Z, Swank ZN, Shea JE, Jaeger L. The right angle (RA) motif: a prevalent ribosomal RNA structural pattern found in group I introns. J Mol Biol 2012; 424:54-67. [PMID: 22999957 DOI: 10.1016/j.jmb.2012.09.012] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2012] [Revised: 09/11/2012] [Accepted: 09/12/2012] [Indexed: 12/16/2022]
Abstract
The right angle (RA) motif, previously identified in the ribosome and used as a structural module for nano-construction, is a recurrent structural motif of 13 nucleotides that establishes a 90° bend between two adjacent helices. Comparative sequence analysis was used to explore the sequence space of the RA motif within ribosomal RNAs in order to define its canonical sequence space signature. We investigated the sequence constraints associated with the RA signature using several artificial self-assembly systems. Thermodynamic and topological investigations of sequence variants associated with the RA motif in both minimal and expanded structural contexts reveal that the presence of a helix at the 3' end of the RA motif increases the thermodynamic stability and rigidity of the resulting three-helix junction domain. A search for the RA in naturally occurring RNAs as well as its experimental characterization led to the identification of the RA in groups IC1 and ID intron ribozymes, where it is suggested to play an integral role in stabilizing peripheral structural domains. The present study exemplifies the need of empirical analysis of RNA structural motifs for facilitating the rational design and structure prediction of RNAs.
Collapse
Affiliation(s)
- Wade W Grabow
- Department of Chemistry and Biochemistry, University of California, Santa Barbara, CA 93106-9510, USA
| | | | | | | | | |
Collapse
|
7
|
Proux F, Dreyfus M, Iost I. Identification of the sites of action of SrmB, a DEAD-box RNA helicase involved in Escherichia coli ribosome assembly. Mol Microbiol 2011; 82:300-11. [PMID: 21859437 DOI: 10.1111/j.1365-2958.2011.07779.x] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
DEAD-box RNA-dependent ATPases are ubiquitous enzymes that participate in nearly all processes involving RNA, but their detailed molecular functions remain generally unknown. SrmB, one of the five Escherichia coli DEAD-box proteins, participates in the assembly of the large ribosomal subunit notably by facilitating the incorporation of L13, one of the ribosomal proteins that bind 23S rRNA earliest. Previously, we showed that SrmB is tethered to nascent ribosome through interactions with L4, L24 and the region from domain I of 23S rRNA that binds them. To identify the sites of action of SrmB, we have characterized rRNA mutations that bypass SrmB requirement. Five of them affect the same position from two repeated heptanucleotides in domain II of 23S rRNA, whereas two others affect a complementary hexanucleotide in 5S rRNA. Thus the sites of action of SrmB differ from its tethering site. In the mature ribosome, one of the heptanucleotides participates in a highly compact structure that contacts L13, the '1024 G-ribo wrench'. In addition, we have observed that the assembly defect of ΔsrmB cells worsens as rRNA synthesis increases. Based on these results, we propose two non-exclusive scenarios for the role of SrmB in ribosome assembly.
Collapse
Affiliation(s)
- Florence Proux
- Institut de Biologie de l'Ecole Normale Supérieure, CNRS UMR 8197, Génomique Fonctionnelle, 46 Rue d'Ulm 75230 Paris Cedex 05, France
| | | | | |
Collapse
|
8
|
Ishikawa J, Fujita Y, Maeda Y, Furuta H, Ikawa Y. GNRA/receptor interacting modules: Versatile modular units for natural and artificial RNA architectures. Methods 2011; 54:226-38. [DOI: 10.1016/j.ymeth.2010.12.011] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2010] [Revised: 12/08/2010] [Accepted: 12/08/2010] [Indexed: 12/25/2022] Open
|
9
|
|
10
|
|
11
|
Masquida B, Beckert B, Jossinet F. Exploring RNA structure by integrative molecular modelling. N Biotechnol 2010; 27:170-83. [PMID: 20206310 DOI: 10.1016/j.nbt.2010.02.022] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
RNA molecular modelling is adequate to rapidly tackle the structure of RNA molecules. With new structured RNAs constituting a central class of cellular regulators discovered every year, the need for swift and reliable modelling methods is more crucial than ever. The pragmatic method based on interactive all-atom molecular modelling relies on the observation that specific structural motifs are recurrently found in RNA sequences. Once identified by a combination of comparative sequence analysis and biochemical data, the motifs composing the secondary structure of a given RNA can be extruded in three dimensions (3D) and used as building blocks assembled manually during a bioinformatic interactive process. Comparing the models to the corresponding crystal structures has validated the method as being powerful to predict the RNA topology and architecture while being less accurate regarding the prediction of base-base interactions. These aspects as well as the necessary steps towards automation will be discussed.
Collapse
Affiliation(s)
- Benoît Masquida
- Architecture et Réactivité de l'ARN, Université de Strasbourg, IBMC, CNRS, 15 rue René Descartes, Strasbourg, France.
| | | | | |
Collapse
|
12
|
Ulyanov NB, James TL. RNA structural motifs that entail hydrogen bonds involving sugar-phosphate backbone atoms of RNA. NEW J CHEM 2010; 34:910-917. [PMID: 20689681 DOI: 10.1039/b9nj00754g] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
The growing number of high-resolution crystal structures of large RNA molecules provides much information for understanding the principles of structural organization of these complex molecules. Several in-depth analyses of nucleobase-centered RNA structural motifs and backbone conformations have been published based on this information, including a systematic classification of base pairs by Leontis and Westhof. However, hydrogen bonds involving sugar-phosphate backbone atoms of RNA have not been analyzed systematically until recently, although such hydrogen bonds appear to be common both in local and tertiary interactions. Here we review some backbone structural motifs discussed in the literature and analyze a set of eight high-resolution multi-domain RNA structures. The analyzed RNAs are highly structured: among 5372 nucleotides in this set, 89% are involved in at least one "long-range" RNA-RNA hydrogen bond, i.e., hydrogen bonds between atoms in the same residue or sequential residues are ignored. These long-range hydrogen bonds frequently use backbone atoms as hydrogen bond acceptors, i.e., OP1, OP2, O2', O3', O4', or O5', or as a donor (2'OH). A surprisingly large number of such hydrogen bonds are found, considering that neither single-stranded nor double-stranded regions will contain such hydrogen bonds unless additional interactions with other residues exist. Among 8327 long-range hydrogen bonds found in this set of structures, 2811, or about one-third, are hydrogen bonds entailing RNA backbone atoms; they involve 39% of all nucleotides in the structures. The majority of them (2111) are hydrogen bonds entailing ribose hydroxyl groups, which can be used either as a donor or an acceptor; they constitute 25% of all hydrogen bonds and involve 31% of all nucleotides. The phosphate oxygens OP1 or OP2 are used as hydrogen bond acceptors in 12% of all nucleotides, and the ribose ring oxygen O4' and phosphodiester oxygens O3' and O5' are used in 4%, 4%, and 1% of all nucleotides, respectively. Distributions of geometric parameters and some examples of such hydrogen bonds are presented in this report. A novel motif involving backbone hydrogen bonds, the ribose-phosphate zipper, is also identified.
Collapse
Affiliation(s)
- Nikolai B Ulyanov
- Department of Pharmaceutical Chemistry, University of California, San Francisco, CA 94158-2517, USA
| | | |
Collapse
|
13
|
Laing C, Jung S, Iqbal A, Schlick T. Tertiary motifs revealed in analyses of higher-order RNA junctions. J Mol Biol 2009; 393:67-82. [PMID: 19660472 DOI: 10.1016/j.jmb.2009.07.089] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2009] [Revised: 07/29/2009] [Accepted: 07/29/2009] [Indexed: 12/22/2022]
Abstract
RNA junctions are secondary-structure elements formed when three or more helices come together. They are present in diverse RNA molecules with various fundamental functions in the cell. To better understand the intricate architecture of three-dimensional (3D) RNAs, we analyze currently solved 3D RNA junctions in terms of base-pair interactions and 3D configurations. First, we study base-pair interaction diagrams for solved RNA junctions with 5 to 10 helices and discuss common features. Second, we compare these higher-order junctions to those containing 3 or 4 helices and identify global motif patterns such as coaxial stacking and parallel and perpendicular helical configurations. These analyses show that higher-order junctions organize their helical components in parallel and helical configurations similar to lower-order junctions. Their sub-junctions also resemble local helical configurations found in three- and four-way junctions and are stabilized by similar long-range interaction preferences such as A-minor interactions. Furthermore, loop regions within junctions are high in adenine but low in cytosine, and in agreement with previous studies, we suggest that coaxial stacking between helices likely forms when the common single-stranded loop is small in size; however, other factors such as stacking interactions involving noncanonical base pairs and proteins can greatly determine or disrupt coaxial stacking. Finally, we introduce the ribo-base interactions: when combined with the along-groove packing motif, these ribo-base interactions form novel motifs involved in perpendicular helix-helix interactions. Overall, these analyses suggest recurrent tertiary motifs that stabilize junction architecture, pack helices, and help form helical configurations that occur as sub-elements of larger junction networks. The frequent occurrence of similar helical motifs suggest nature's finite and perhaps limited repertoire of RNA helical conformation preferences. More generally, studies of RNA junctions and tertiary building blocks can ultimately help in the difficult task of RNA 3D structure prediction.
Collapse
Affiliation(s)
- Christian Laing
- Department of Chemistry, New York University, 251 Mercer Street, New York, NY 10012, USA
| | | | | | | |
Collapse
|
14
|
Jaeger L, Verzemnieks EJ, Geary C. The UA_handle: a versatile submotif in stable RNA architectures. Nucleic Acids Res 2008; 37:215-30. [PMID: 19036788 PMCID: PMC2615604 DOI: 10.1093/nar/gkn911] [Citation(s) in RCA: 74] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Stable RNAs are modular and hierarchical 3D architectures taking advantage of recurrent structural motifs to form extensive non-covalent tertiary interactions. Sequence and atomic structure analysis has revealed a novel submotif involving a minimal set of five nucleotides, termed the UA_handle motif (5'XU/AN(n)X3'). It consists of a U:A Watson-Crick: Hoogsteen trans base pair stacked over a classic Watson-Crick base pair, and a bulge of one or more nucleotides that can act as a handle for making different types of long-range interactions. This motif is one of the most versatile building blocks identified in stable RNAs. It enters into the composition of numerous recurrent motifs of greater structural complexity such as the T-loop, the 11-nt receptor, the UAA/GAN and the G-ribo motifs. Several structural principles pertaining to RNA motifs are derived from our analysis. A limited set of basic submotifs can account for the formation of most structural motifs uncovered in ribosomal and stable RNAs. Structural motifs can act as structural scaffoldings and be functionally and topologically equivalent despite sequence and structural differences. The sequence network resulting from the structural relationships shared by these RNA motifs can be used as a proto-language for assisting prediction and rational design of RNA tertiary structures.
Collapse
Affiliation(s)
- Luc Jaeger
- Chemistry and Biochemistry Department, University of California, Santa Barbara, CA 93106-9510, USA.
| | | | | |
Collapse
|
15
|
Abstract
Since the year 2000 a number of large RNA three-dimensional structures have been determined by X-ray crystallography. Structures composed of more than 100 nucleotide residues include the signal recognition particle RNA, group I intron, the GlmS ribozyme, RNAseP RNA, and ribosomal RNAs from Haloarcula morismortui, Escherichia coli, Thermus thermophilus, and Deinococcus radiodurans. These large RNAs are constructed from the same secondary and tertiary structural motifs identified in smaller RNAs but appear to have a larger organizational architecture. They are dominated by long continuous interhelical base stacking, tend to segregate into domains, and are planar in overall shape as opposed to their globular protein counterparts. These findings have consequences in RNA folding, intermolecular interaction, and packing, in addition to studies of design and engineering and structure prediction.
Collapse
Affiliation(s)
- Stephen R Holbrook
- Structural Biology Department, Physical Biosciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA.
| |
Collapse
|
16
|
Geary C, Baudrey S, Jaeger L. Comprehensive features of natural and in vitro selected GNRA tetraloop-binding receptors. Nucleic Acids Res 2007; 36:1138-52. [PMID: 18158305 PMCID: PMC2275092 DOI: 10.1093/nar/gkm1048] [Citation(s) in RCA: 86] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Specific recognitions of GNRA tetraloops by small helical receptors are among the most widespread long-range packing interactions in large ribozymes. However, in contrast to GYRA and GAAA tetraloops, very few GNRA/receptor interactions have yet been identified to involve GGAA tetraloops in nature. A novel in vitro selection scheme based on a rigid self-assembling tectoRNA scaffold designed for isolation of intermolecular interactions with A-minor motifs has yielded new GGAA tetraloop-binding receptors with affinity in the nanomolar range. One of the selected receptors is a novel 12 nt RNA motif, (CCUGUG … AUCUGG), that recognizes GGAA tetraloop hairpin with a remarkable specificity and affinity. Its physical and chemical characteristics are comparable to those of the well-studied ‘11nt’ GAAA tetraloop receptor motif. A second less specific motif (CCCAGCCC … GAUAGGG) binds GGRA tetraloops and appears to be related to group IC3 tetraloop receptors. Mutational, thermodynamic and comparative structural analysis suggests that natural and in vitro selected GNRA receptors can essentially be grouped in two major classes of GNRA binders. New insights about the evolution, recognition and structural modularity of GNRA and A-minor RNA–RNA interactions are proposed.
Collapse
Affiliation(s)
- Cody Geary
- Department of Chemistry and Biochemistry, Biomolecular Science and Engineering Program, University of California at Santa Barbara, Santa Barbara, CA 93106-9510, USA
| | | | | |
Collapse
|
17
|
Steinberg SV, Boutorine YI. G-ribo motif favors the formation of pseudoknots in ribosomal RNA. RNA (NEW YORK, N.Y.) 2007; 13:1036-42. [PMID: 17507660 PMCID: PMC1894920 DOI: 10.1261/rna.495207] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Analysis of the pseudoknots existing in the ribosomal RNA showed that four of them are formed with the help of G-ribo, a recently identified RNA recurrent motif. The analysis of these pseudoknots revealed two major aspects in the G-ribo motif structure, which together provide the structural context favoring the formation of two different types of pseudoknots. The first aspect pertains to a particular side-by-side juxtaposition of two double helices that facilitates switches of the polynucleotide chain between different strands. The second aspect deals with the presence of an adenosine at a specific place where it can stabilize a particular arrangement of two quasicoaxial helices required for the pseudoknot formation. Additional analysis shows that the latter aspect is also present in other pseudoknots not related to the G-ribo motif or the ribosome, and thus represents a general structural element favoring the formation of pseudoknots.
Collapse
|