1
|
Riveros II, Yildirim I. Prediction of 3D RNA Structures from Sequence Using Energy Landscapes of RNA Dimers: Application to RNA Tetraloops. J Chem Theory Comput 2024; 20:4363-4376. [PMID: 38728627 DOI: 10.1021/acs.jctc.4c00189] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/12/2024]
Abstract
Access to the three-dimensional structure of RNA enables an ability to gain a more profound understanding of its biological mechanisms, as well as the ability to design RNA-targeting drugs, which can take advantage of the unique chemical environment imposed by a folded RNA structure. Due to the dynamic and structurally complex properties of RNA, both experimental and traditional computational methods have difficulty in determining RNA's 3D structure. Herein, we introduce TAPERSS (Theoretical Analyses, Prediction, and Evaluation of RNA Structures from Sequence), a physics-based fragment assembly method for predicting 3D RNA structures from sequence. Using a fragment library created using discrete path sampling calculations of RNA dinucleoside monophosphates, TAPERSS can sample the physics-based energy landscapes of any RNA sequence with relatively low computational complexity. We have benchmarked TAPERSS on 21 RNA tetraloops, using a combinatorial algorithm as a proof-of-concept. We show that TAPERSS was successfully able to predict the apo-state structures of all 21 RNA hairpins, with 16 of those structures also having low predicted energies as well. We demonstrate that TAPERSS performs most accurately on GNRA-like tetraloops with mostly stacked loop-nucleotides, while having limited success with more dynamic UNCG and CUYG tetraloops, most likely due to the influence of the RNA force field used to create the fragment library. Moreover, we show that TAPERSS can successfully predict the majority of the experimental non-apo states, highlighting its potential in anticipating biologically significant yet unobserved states. This holds great promise for future applications in drug design and related studies. With discussed improvements and implementation of more efficient sampling algorithms, we believe TAPERSS may serve as a useful tool for a physics-based conformational sampling of large RNA structures.
Collapse
Affiliation(s)
- Ivan Isaac Riveros
- Department of Chemistry and Biochemistry, Florida Atlantic University, Jupiter, Florida 33458, United States
| | - Ilyas Yildirim
- Department of Chemistry and Biochemistry, Florida Atlantic University, Jupiter, Florida 33458, United States
| |
Collapse
|
2
|
Thapar R, Denmon AP, Nikonowicz EP. Recognition modes of RNA tetraloops and tetraloop-like motifs by RNA-binding proteins. WILEY INTERDISCIPLINARY REVIEWS-RNA 2013; 5:49-67. [PMID: 24124096 DOI: 10.1002/wrna.1196] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Received: 02/05/2013] [Revised: 08/13/2013] [Accepted: 08/18/2013] [Indexed: 12/19/2022]
Abstract
RNA hairpins are the most commonly occurring secondary structural elements in RNAs and serve as nucleation sites for RNA folding, RNA-RNA, and RNA-protein interactions. RNA hairpins are frequently capped by tetraloops, and based on sequence similarity, three broad classes of RNA tetraloops have been defined: GNRA, UNCG, and CUYG. Other classes such as the UYUN tetraloop in histone mRNAs, the UGAA in 16S rRNA, the AUUA tetraloop from the MS2 bacteriophage, and the AGNN tetraloop that binds RNase III have also been characterized. The tetraloop structure is compact and is usually characterized by a paired interaction between the first and fourth nucleotides. The two unpaired nucleotides in the loop are usually involved in base-stacking or base-phosphate hydrogen bonding interactions. Several structures of RNA tetraloops, free and complexed to other RNAs or proteins, are now available and these studies have increased our understanding of the diverse mechanisms by which this motif is recognized. RNA tetraloops can mediate RNA-RNA contacts via the tetraloop-receptor motif, kissing hairpin loops, A-minor interactions, and pseudoknots. While these RNA-RNA interactions are fairly well understood, how RNA-binding proteins recognize RNA tetraloops and tetraloop-like motifs remains unclear. In this review, we summarize the structures of RNA tetraloop-protein complexes and the general themes that have emerged on sequence- and structure-specific recognition of RNA tetraloops. We highlight how proteins achieve molecular recognition of this nucleic acid motif, the structural adaptations observed in the tetraloop to accommodate the protein-binding partner, and the role of dynamics in recognition.
Collapse
Affiliation(s)
- Roopa Thapar
- Department of Structural Biology, Hauptman-Woodward Medical Research Institute, Buffalo, NY, USA; Department of Structural Biology, SUNY at Buffalo, Buffalo, NY, USA; Department of Biochemistry and Cell Biology, Rice University, Houston, TX, USA
| | | | | |
Collapse
|
3
|
Barik A, C N, P M, Bahadur RP. A protein-RNA docking benchmark (I): nonredundant cases. Proteins 2012; 80:1866-71. [PMID: 22488669 DOI: 10.1002/prot.24083] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2011] [Accepted: 03/23/2012] [Indexed: 11/07/2022]
Abstract
We have developed a nonredundant protein-RNA docking benchmark dataset, which is derived from the available bound and unbound structures in the Protein Data Bank involving polypeptide and nucleic acid chains. It consists of nine unbound-unbound cases where both the protein and the RNA are available in the free form. The other 36 cases are of unbound-bound type where only the protein is available in the free form. The conformational change upon complex formation is calculated by a distance matrix alignment method, and based on that, complexes are classified into rigid, semi-flexible, and full flexible. Although in the rigid body category, no significant conformational change accompanies complex formation, the fully flexible test cases show large domain movements, RNA base flips, etc. The benchmark covers four major groups of RNA, namely, t-RNA, ribosomal RNA, duplex RNA, and single-stranded RNA. We find that RNA is generally more flexible than the protein in the complexes, and the interface region is as flexible as the molecule as a whole. The structural diversity of the complexes in the benchmark set should provide a common ground for the development and comparison of the protein-RNA docking methods. The benchmark can be freely downloaded from the internet.
Collapse
Affiliation(s)
- Amita Barik
- Department of Biotechnology, Indian Institute of Technology, Kharagpur 721302, India
| | | | | | | |
Collapse
|
4
|
Chung J, Ulyanov NB, Guilbert C, Mujeeb A, James TL. Binding characteristics of small molecules that mimic nucleocapsid protein-induced maturation of stem-loop 1 of HIV-1 RNA. Biochemistry 2010; 49:6341-51. [PMID: 20565056 PMCID: PMC2921804 DOI: 10.1021/bi100660r] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
As a retrovirus, the human immunodeficiency virus (HIV-1) packages two copies of the RNA genome as a dimer in the infectious virion. Dimerization is initiated at the dimer initiation site (DIS) which encompasses stem-loop 1 (SL1) in the 5'-UTR of the genome. Study of genomic dimerization has been facilitated by the discovery that short RNA fragments containing SL1 can dimerize spontaneously without any protein factors. On the basis of the palindromic nature of SL1, a kissing loop model has been proposed. First, a metastable kissing dimer is formed via standard Watson-Crick base pairs and then converted into a more stable extended dimer by the viral nucleocapsid protein (NCp7). This dimer maturation in vitro is believed to mimic initial steps in the RNA maturation in vivo, which is correlated with viral infectivity. We previously discovered a small molecule activator, Lys-Ala-7-amido-4-methylcoumarin (KA-AMC), which facilitates dimer maturation in vitro, and determined aspects of its structure-activity relationship. In this report, we present measurements of the binding affinity of the activators and characterization of their interactions with the SL1 RNA. Guanidinium groups and increasing positive charge on the side chain enhance affinity and activity, but features in the aromatic ring at least partially decouple affinity from activity. Although KA-AMC can bind to multiple structural motifs, the NMR study showed KA-AMC preferentially binds to unique structural motifs, such as the palindromic loop and the G-rich internal loop in the SL1 RNA. NCp7 binds to SL1 only 1 order of magnitude more tightly than the best small molecule ligand tested. This study provides guidelines for the design of superior small molecules that bind to the SL1 RNA that have the potential of being developed as an antiviral by interfering with SL1-NCp7 interaction at the packaging and/or maturation stages.
Collapse
Affiliation(s)
| | - Nikolai B. Ulyanov
- Department of Pharmaceutical Chemistry, University of California, San Francisco, 600 16 Street, San Francisco, California 94158-2517
| | - Christophe Guilbert
- Department of Pharmaceutical Chemistry, University of California, San Francisco, 600 16 Street, San Francisco, California 94158-2517
| | | | - Thomas L. James
- Department of Pharmaceutical Chemistry, University of California, San Francisco, 600 16 Street, San Francisco, California 94158-2517
| |
Collapse
|
5
|
Zhou J, Bean RL, Vogt VM, Summers MF. Solution structure of the Rous sarcoma virus nucleocapsid protein: muPsi RNA packaging signal complex. J Mol Biol 2007; 365:453-67. [PMID: 17070546 PMCID: PMC1764217 DOI: 10.1016/j.jmb.2006.10.013] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2006] [Revised: 10/02/2006] [Accepted: 10/05/2006] [Indexed: 11/30/2022]
Abstract
The 5'-untranslated region (5'-UTR) of retroviral genomes contains elements required for genome packaging during virus assembly. For many retroviruses, the packaging elements reside in non-contiguous segments that span most or all of the 5'-UTR. The Rous sarcoma virus (RSV) is an exception, in that its genome can be packaged efficiently by a relatively short, 82 nt segment of the 5'-UTR called muPsi. The RSV 5'-UTR also contains three translational start codons (AUG-1, AUG-2 and AUG-3) that have been controvertibly implicated in translation initiation and genome packaging, one of which (AUG-3) resides within the muPsi sequence. We demonstrated recently that muPsi is capable of binding to the cognate RSV nucleocapsid protein (NC) with high affinity (dissociation constant K(d) approximately 2 nM), and that residues of AUG-3 are essential for tight binding. We now report the solution structure of the NC:muPsi complex, determined using NMR data obtained for samples containing ((13)C,(15)N)-labeled NC and (2)H-enriched, nucleotide-specifically protonated RNAs. Upon NC binding, muPsi adopts a stable secondary structure that consists of three stem loops (SL-A, SL-B and SL-C) and an 8 bp stem (O3). Binding is mediated by the two zinc knuckle domains of NC. The N-terminal knuckle interacts with a conserved U(217)GCG tetraloop (a member of the UNCG family; N=A,U,G or C), and the C-terminal zinc knuckle binds to residues that flank SL-A, including residues of AUG-3. Mutations of critical nucleotides in these sequences compromise or abolish viral infectivity. Our studies reveal novel structural features important for NC:RNA binding, and support the hypothesis that AUG-3 is conserved for genome packaging rather than translational control.
Collapse
Key Words
- rous sarcoma virus
- ribonucleic acid (rna)
- psi-site (μψ)
- nucleocapsid (nc) protein
- uncg tetraloop
- nuclear magnetic resonance (nmr)
- a, adenosine
- c, cytidine
- g, guanosine
- gst, glutathione-s-transferase
- hiv-1, human immunodeficiency virus type-1
- hmqc, heteronuclear multiple quantum coherence
- hsqc, heteronuclear single quantumn coherence
- itc, isothermal titration calorimetry
- mlv, moloney murine leukaemia virus
- nc, nucleocapsid protein
- noe, nuclear overhauser effect
- noesy, noe spectroscopy
- orf, open reading frame
- pbs, primer binding site
- rmsd, root-mean-square deviation
- roesy, rotating frame overhauser effect spectroscopy
- rsv, rous sarcoma virus
- sd, splice-donor site
- u, uridine
- utr, unstranslated region
Collapse
Affiliation(s)
- Jing Zhou
- Howard Hughes Medical Institute and Department of Chemistry and Biochemistry, University of Maryland Baltimore County, 1000 Hilltop Circle, Baltimore, MD 21250
| | - Rebecca L. Bean
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853
| | - Volker M. Vogt
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853
| | - Michael F. Summers
- Howard Hughes Medical Institute and Department of Chemistry and Biochemistry, University of Maryland Baltimore County, 1000 Hilltop Circle, Baltimore, MD 21250
| |
Collapse
|
6
|
Denisov AY, Hannoush RN, Gehring K, Damha MJ. A novel RNA motif based on the structure of unusually stable 2',5'-linked r(UUCG) loops. J Am Chem Soc 2003; 125:11525-31. [PMID: 13129354 DOI: 10.1021/ja036207k] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
We have recently shown that hairpins containing 2',5'-linked RNA loops exhibit superior thermodynamic stability compared to native hairpins comprised of 3',5'-RNA loops [Hannoush, R. N.; Damha, M. J. J. Am. Chem. Soc. 2001, 123, 12368-12374]. A remarkable feature of the 2',5'-r(UUCG) tetraloop is that, unlike the corresponding 3',5'-linked tetraloop, its stability is virtually independent of the hairpin stem composition. Here, we determine the solution structure of unusually stable hairpins of the sequence 5'-G(1)G(2)A(3)C(4)-(U(5)U(6)C(7)G(8))-G(9)(U/T(10))C(11)C(12)-3' containing a 2',5'-linked RNA (UUCG) loop and either an RNA or a DNA stem. The 2',5'-linked RNA loop adopts a new fold that is completely different from that previously observed for the native 3',5'-linked RNA loop. The 2',5'-RNA loop is stabilized by (a). U5.G8 wobble base pairing, with both nucleotide residues in the anti-conformation, (b). extensive base stacking, and (c). sugar-base and sugar-sugar contacts, all of which contribute to the extra stability of this hairpin structure. The U5:G8 base pair stacks on top of the C4:G9 loop-closing base pair and thus appears as a continuation of the stem. The loop uracil U6 base stacks above U5 base, while the cytosine C7 base protrudes out into the solvent and does not participate in any of the stabilizing interactions. The different sugar pucker and intrinsic bonding interactions within the 2',5'-linked ribonucleotides help explain the unusual stability and conformational properties displayed by 2',5'-RNA tetraloops. These findings are relevant for the design of more effective RNA-based aptamers, ribozymes, and antisense agents and identify the 2',5'-RNA loop as a novel structural motif.
Collapse
Affiliation(s)
- Alexei Yu Denisov
- Department of Biochemistry and Montreal Joint Centre for Structural Biology, McGill University, Montreal, QC, H3G 1Y6, Canada
| | | | | | | |
Collapse
|
7
|
Giedroc DP, Cornish PV, Hennig M. Detection of scalar couplings involving 2'-hydroxyl protons across hydrogen bonds in a frameshifting mRNA pseudoknot. J Am Chem Soc 2003; 125:4676-7. [PMID: 12696863 DOI: 10.1021/ja029286t] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The -1 frameshift-stimulating mRNA pseudoknot from pea enation mosaic virus-1 (PEMV-1) is composed nearly entirely of RNA triple helix.(4) The 2'-OH hydroxyl protons of riboses C15 and C16 are hydrogen bond donors to the N1 atoms of adenosines A27 and A25, respectively, positioned in the minor groove of pseudoknot stem S1. In this paper, a nonrefocused (1)H,(15)N CPMG HSQC of uniformly (13)C,(15)N-labeled 33-mer PEMV-1 RNA has been tailored to reveal a correlation of the 2'-OH hydroxyl proton of C15 to the N1 nitrogen resonance of A27 mediated by a cross hydrogen bond scalar coupling. The (1h)J(2'OH,N) cross hydrogen bond scalar coupling constant determined from a quantitative 1D (15N) spin-echo difference experiment for the C15/A27 interaction is 1.7 +/- 0.1 Hz, while that for the C16/A25 interaction appears larger, 3.5 +/- 0.3 Hz, despite the fact that the corresponding direct correlation between the 2'-OH hydroxyl proton of C16 and the N1 of A25 is missing due to unfavorable solvent exchange properties. These findings reveal a detailed picture of critical noncanonical O-H.N hydrogen-bonding loop-stem interactions in an RNA triple helical structure.
Collapse
Affiliation(s)
- David P Giedroc
- Department of Biochemistry, Center for Advanced Biomolecular Research, Texas A&M University, College Station, Texas 77843-2128, USA.
| | | | | |
Collapse
|