1
|
Mukherjee S, Moafinejad SN, Badepally NG, Merdas K, Bujnicki JM. Advances in the field of RNA 3D structure prediction and modeling, with purely theoretical approaches, and with the use of experimental data. Structure 2024; 32:1860-1876. [PMID: 39321802 DOI: 10.1016/j.str.2024.08.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2024] [Revised: 08/08/2024] [Accepted: 08/22/2024] [Indexed: 09/27/2024]
Abstract
Recent advancements in RNA three-dimensional (3D) structure prediction have provided significant insights into RNA biology, highlighting the essential role of RNA in cellular functions and its therapeutic potential. This review summarizes the latest developments in computational methods, particularly the incorporation of artificial intelligence and machine learning, which have improved the efficiency and accuracy of RNA structure predictions. We also discuss the integration of new experimental data types, including cryoelectron microscopy (cryo-EM) techniques and high-throughput sequencing, which have transformed RNA structure modeling. The combination of experimental advances with computational methods represents a significant leap in RNA structure determination. We review the outcomes of RNA-Puzzles and critical assessment of structure prediction (CASP) challenges, which assess the state of the field and limitations of existing methods. Future perspectives are discussed, focusing on the impact of RNA 3D structure prediction on understanding RNA mechanisms and its implications for drug discovery and RNA-targeted therapies, opening new avenues in molecular biology.
Collapse
Affiliation(s)
- Sunandan Mukherjee
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | - S Naeim Moafinejad
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | - Nagendar Goud Badepally
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | - Katarzyna Merdas
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland
| | - Janusz M Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, PL-02-109 Warsaw, Poland.
| |
Collapse
|
2
|
Ye R, Zhao H, Wang X, Xue Y. Technological advancements in deciphering RNA-RNA interactions. Mol Cell 2024; 84:3722-3736. [PMID: 39047724 DOI: 10.1016/j.molcel.2024.06.036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2024] [Revised: 06/11/2024] [Accepted: 06/28/2024] [Indexed: 07/27/2024]
Abstract
RNA-RNA interactions (RRIs) can dictate RNA molecules to form intricate higher-order structures and bind their RNA substrates in diverse biological processes. To elucidate the function, binding specificity, and regulatory mechanisms of various RNA molecules, especially the vast repertoire of non-coding RNAs, advanced technologies and methods that globally map RRIs are extremely valuable. In the past decades, many state-of-the-art technologies have been developed for this purpose. This review focuses on those high-throughput technologies for the global mapping of RRIs. We summarize the key concepts and the pros and cons of different technologies. In addition, we highlight the novel biological insights uncovered by these RRI mapping methods and discuss the future challenges for appreciating the crucial roles of RRIs in gene regulation across bacteria, viruses, archaea, and mammals.
Collapse
Affiliation(s)
- Rong Ye
- Key Laboratory of Epigenetic Regulation and Intervention, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Hailian Zhao
- Key Laboratory of Epigenetic Regulation and Intervention, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Xi Wang
- State Key Laboratory of Female Fertility Promotion, Clinical Stem Cell Research Center, Peking University Third Hospital, Beijing 100191, China
| | - Yuanchao Xue
- Key Laboratory of Epigenetic Regulation and Intervention, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China.
| |
Collapse
|
3
|
Hlaing ST, Srimanote P, Tongtawe P, Khantisitthiporn O, Glab-Ampai K, Chulanetra M, Thanongsaksrikul J. Isolation and Characterization of scFv Antibody against Internal Ribosomal Entry Site of Enterovirus A71. Int J Mol Sci 2023; 24:9865. [PMID: 37373012 DOI: 10.3390/ijms24129865] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2023] [Revised: 06/02/2023] [Accepted: 06/06/2023] [Indexed: 06/29/2023] Open
Abstract
Enterovirus A71 (EV-A71) is one of the causative agents of hand-foot-mouth disease, which can be associated with neurocomplications of the central nervous system. A limited understanding of the virus's biology and pathogenesis has led to the unavailability of effective anti-viral treatments. The EV-A71 RNA genome carries type I internal ribosomal entry site (IRES) at 5' UTR that plays an essential role in the viral genomic translation. However, the detailed mechanism of IRES-mediated translation has not been elucidated. In this study, sequence analysis revealed that the domains IV, V, and VI of EV-A71 IRES contained the structurally conserved regions. The selected region was transcribed in vitro and labeled with biotin to use as an antigen for selecting the single-chain variable fragment (scFv) antibody from the naïve phage display library. The so-obtained scFv, namely, scFv #16-3, binds specifically to EV-A71 IRES. The molecular docking showed that the interaction between scFv #16-3 and EV-A71 IRES was mediated by the preferences of amino acid residues, including serine, tyrosine, glycine, lysine, and arginine on the antigen-binding sites contacted the nucleotides on the IRES domains IV and V. The so-produced scFv has the potential to develop as a structural biology tool to study the biology of the EV-A71 RNA genome.
Collapse
Affiliation(s)
- Su Thandar Hlaing
- Graduate Program in Biomedical Sciences, Faculty of Allied Health Sciences, Thammasat University, Pathumtani 12120, Thailand
| | - Potjanee Srimanote
- Graduate Program in Biomedical Sciences, Faculty of Allied Health Sciences, Thammasat University, Pathumtani 12120, Thailand
- Thammasat University Research Unit in Molecular Pathogenesis and Immunology of Infectious Diseases, Thammasat University, Pathumthani 12120, Thailand
| | - Pongsri Tongtawe
- Graduate Program in Biomedical Sciences, Faculty of Allied Health Sciences, Thammasat University, Pathumtani 12120, Thailand
| | - Onruedee Khantisitthiporn
- Thammasat University Research Unit in Molecular Pathogenesis and Immunology of Infectious Diseases, Thammasat University, Pathumthani 12120, Thailand
- Department of Medical Technology, Faculty of Allied Health Sciences, Thammasat University, Pathumthani 12120, Thailand
| | - Kittirat Glab-Ampai
- Center of Research Excellence in Therapeutic Proteins and Antibody Engineering, Department of Parasitology, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok 10700, Thailand
| | - Monrat Chulanetra
- Center of Research Excellence in Therapeutic Proteins and Antibody Engineering, Department of Parasitology, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok 10700, Thailand
| | - Jeeraphong Thanongsaksrikul
- Graduate Program in Biomedical Sciences, Faculty of Allied Health Sciences, Thammasat University, Pathumtani 12120, Thailand
- Thammasat University Research Unit in Molecular Pathogenesis and Immunology of Infectious Diseases, Thammasat University, Pathumthani 12120, Thailand
| |
Collapse
|
4
|
Bheemireddy S, Sandhya S, Srinivasan N, Sowdhamini R. Computational tools to study RNA-protein complexes. Front Mol Biosci 2022; 9:954926. [PMID: 36275618 PMCID: PMC9585174 DOI: 10.3389/fmolb.2022.954926] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2022] [Accepted: 09/20/2022] [Indexed: 11/19/2022] Open
Abstract
RNA is the key player in many cellular processes such as signal transduction, replication, transport, cell division, transcription, and translation. These diverse functions are accomplished through interactions of RNA with proteins. However, protein–RNA interactions are still poorly derstood in contrast to protein–protein and protein–DNA interactions. This knowledge gap can be attributed to the limited availability of protein-RNA structures along with the experimental difficulties in studying these complexes. Recent progress in computational resources has expanded the number of tools available for studying protein-RNA interactions at various molecular levels. These include tools for predicting interacting residues from primary sequences, modelling of protein-RNA complexes, predicting hotspots in these complexes and insights into derstanding in the dynamics of their interactions. Each of these tools has its strengths and limitations, which makes it significant to select an optimal approach for the question of interest. Here we present a mini review of computational tools to study different aspects of protein-RNA interactions, with focus on overall application, development of the field and the future perspectives.
Collapse
Affiliation(s)
- Sneha Bheemireddy
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India
| | - Sankaran Sandhya
- Department of Biotechnology, Faculty of Life and Allied Health Sciences, M.S. Ramaiah University of Applied Sciences, Bengaluru, India
- *Correspondence: Sankaran Sandhya, ; Ramanathan Sowdhamini,
| | | | - Ramanathan Sowdhamini
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India
- National Centre for Biological Sciences, TIFR, GKVK Campus, Bangalore, India
- Institute of Bioinformatics and Applied Biotechnology, Bangalore, India
- *Correspondence: Sankaran Sandhya, ; Ramanathan Sowdhamini,
| |
Collapse
|
5
|
Developing an Updated Strategy for Estimating the Free-Energy Parameters in RNA Duplexes. Int J Mol Sci 2021; 22:ijms22189708. [PMID: 34575896 PMCID: PMC8467000 DOI: 10.3390/ijms22189708] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2021] [Revised: 09/01/2021] [Accepted: 09/02/2021] [Indexed: 12/12/2022] Open
Abstract
For the last 20 years, it has been common lore that the free energy of RNA duplexes formed from canonical Watson-Crick base pairs (bps) can be largely approximated with dinucleotide bp parameters and a few simple corrective constants that are duplex independent. Additionally, the standard benchmark set of duplexes used to generate the parameters were GC-rich in the shorter duplexes and AU-rich in the longer duplexes, and the length of the majority of the duplexes ranged between 6 and 8 bps. We were curious if other models would generate similar results and whether adding longer duplexes of 17 bps would affect the conclusions. We developed a gradient-descent fitting program for obtaining free-energy parameters-the changes in Gibbs free energy (ΔG), enthalpy (ΔH), and entropy (ΔS), and the melting temperature (Tm)-directly from the experimental melting curves. Using gradient descent and a genetic algorithm, the duplex melting results were combined with the standard benchmark data to obtain bp parameters. Both the standard (Turner) model and a new model that includes length-dependent terms were tested. Both models could fit the standard benchmark data; however, the new model could handle longer sequences better. We developed an updated strategy for fitting the duplex melting data.
Collapse
|
6
|
Predicting RNA Scaffolds with a Hybrid Method of Vfold3D and VfoldLA. Methods Mol Biol 2021. [PMID: 34086269 DOI: 10.1007/978-1-0716-1499-0_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/30/2023]
Abstract
The ever-increasing discoveries of noncoding RNA functions draw a strong demand for RNA structure determination from the sequence. In recently years, computational studies for RNA structures, at both the two-dimensional and the three-dimensional levels, led to several highly promising new developments. In this chapter, we describe a hybrid method, which combines the motif template-based Vfold3D model and the loop template-based VfoldLA model, to predict RNA 3D structures. The main emphasis is placed on the definition of motifs and loops, the treatment of no-template motifs, and the 3D structure assembly from templates of motifs and loops. For illustration, we use the ZIKV xrRNA1 as an example to show the template-based prediction of RNA 3D structures from the 2D structure. The web server for the hybrid model is freely accessible at http://rna.physics.missouri.edu/vfold3D2 .
Collapse
|
7
|
Li B, Cao Y, Westhof E, Miao Z. Advances in RNA 3D Structure Modeling Using Experimental Data. Front Genet 2020; 11:574485. [PMID: 33193680 PMCID: PMC7649352 DOI: 10.3389/fgene.2020.574485] [Citation(s) in RCA: 45] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Accepted: 09/02/2020] [Indexed: 12/26/2022] Open
Abstract
RNA is a unique bio-macromolecule that can both record genetic information and perform biological functions in a variety of molecular processes, including transcription, splicing, translation, and even regulating protein function. RNAs adopt specific three-dimensional conformations to enable their functions. Experimental determination of high-resolution RNA structures using x-ray crystallography is both laborious and demands expertise, thus, hindering our comprehension of RNA structural biology. The computational modeling of RNA structure was a milestone in the birth of bioinformatics. Although computational modeling has been greatly improved over the last decade showing many successful cases, the accuracy of such computational modeling is not only length-dependent but also varies according to the complexity of the structure. To increase credibility, various experimental data were integrated into computational modeling. In this review, we summarize the experiments that can be integrated into RNA structure modeling as well as the computational methods based on these experimental data. We also demonstrate how computational modeling can help the experimental determination of RNA structure. We highlight the recent advances in computational modeling which can offer reliable structure models using high-throughput experimental data.
Collapse
Affiliation(s)
- Bing Li
- Center of Growth, Metabolism and Aging, Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, China
| | - Yang Cao
- Center of Growth, Metabolism and Aging, Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, China
| | - Eric Westhof
- Architecture et Réactivité de l’ARN, Institut de Biologie Moléculaire et Cellulaire du CNRS, Université de Strasbourg, Strasbourg, France
| | - Zhichao Miao
- Translational Research Institute of Brain and Brain-Like Intelligence, Department of Anesthesiology, Shanghai Fourth People’s Hospital Affiliated to Tongji University School of Medicine, Shanghai, China
- Newcastle Fibrosis Research Group, Institute of Cellular Medicine, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, United Kingdom
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, United Kingdom
| |
Collapse
|
8
|
Zhang Z, Xiong P, Zhang T, Wang J, Zhan J, Zhou Y. Accurate inference of the full base-pairing structure of RNA by deep mutational scanning and covariation-induced deviation of activity. Nucleic Acids Res 2020; 48:1451-1465. [PMID: 31872260 PMCID: PMC7026644 DOI: 10.1093/nar/gkz1192] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2019] [Revised: 12/10/2019] [Accepted: 12/11/2019] [Indexed: 11/12/2022] Open
Abstract
Despite the large number of noncoding RNAs in human genome and their roles in many diseases include cancer, we know very little about them due to lack of structural clues. The centerpiece of the structural clues is the full RNA base-pairing structure of secondary and tertiary contacts that can be precisely obtained only from costly and time-consuming 3D structure determination. Here, we performed deep mutational scanning of self-cleaving CPEB3 ribozyme by error-prone PCR and showed that a library of <5 × 104 single-to-triple mutants is sufficient to infer 25 of 26 base pairs including non-nested, nonhelical, and noncanonical base pairs with both sensitivity and precision at 96%. Such accurate inference was further confirmed by a twister ribozyme at 100% precision with only noncanonical base pairs as false negatives. The performance was resulted from analyzing covariation-induced deviation of activity by utilizing both functional and nonfunctional variants for unsupervised classification, followed by Monte Carlo (MC) simulated annealing with mutation-derived scores. Highly accurate inference can also be obtained by combining MC with evolution/direct coupling analysis, R-scape or epistasis analysis. The results highlight the usefulness of deep mutational scanning for high-accuracy structural inference of self-cleaving ribozymes with implications for other structured RNAs that permit high-throughput functional selections.
Collapse
Affiliation(s)
- Zhe Zhang
- High Magnetic Field Laboratory, Key Laboratory of High Magnetic Field and Ion Beam Physical Biology, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, Anhui, P. R. China
- University of Chinese Academy of Sciences, Beijing 101408, P. R. China
- Institute for Glycomics, Griffith University, Parklands Drive, Southport, QLD 4222, Australia
| | - Peng Xiong
- Institute for Glycomics, Griffith University, Parklands Drive, Southport, QLD 4222, Australia
| | - Tongchuan Zhang
- Institute for Glycomics, Griffith University, Parklands Drive, Southport, QLD 4222, Australia
| | - Junfeng Wang
- High Magnetic Field Laboratory, Key Laboratory of High Magnetic Field and Ion Beam Physical Biology, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, Anhui, P. R. China
- Institute of Physical Science and Information Technology, Anhui University, Hefei 230031, Anhui, P. R. China
| | - Jian Zhan
- Institute for Glycomics, Griffith University, Parklands Drive, Southport, QLD 4222, Australia
| | - Yaoqi Zhou
- Institute for Glycomics, Griffith University, Parklands Drive, Southport, QLD 4222, Australia
- School of Information and Communication Technology, Griffith University, Parklands Drive, Southport, QLD 4222, Australia
| |
Collapse
|
9
|
Rollins NJ, Brock KP, Poelwijk FJ, Stiffler MA, Gauthier NP, Sander C, Marks DS. Inferring protein 3D structure from deep mutation scans. Nat Genet 2019; 51:1170-1176. [PMID: 31209393 PMCID: PMC7295002 DOI: 10.1038/s41588-019-0432-9] [Citation(s) in RCA: 90] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2018] [Accepted: 04/29/2019] [Indexed: 11/09/2022]
Abstract
We describe an experimental method of three-dimensional (3D) structure determination that exploits the increasing ease of high-throughput mutational scans. Inspired by the success of using natural, evolutionary sequence covariation to compute protein and RNA folds, we explored whether 'laboratory', synthetic sequence variation might also yield 3D structures. We analyzed five large-scale mutational scans and discovered that the pairs of residues with the largest positive epistasis in the experiments are sufficient to determine the 3D fold. We show that the strongest epistatic pairings from genetic screens of three proteins, a ribozyme and a protein interaction reveal 3D contacts within and between macromolecules. Using these experimental epistatic pairs, we compute ab initio folds for a GB1 domain (within 1.8 Å of the crystal structure) and a WW domain (2.1 Å). We propose strategies that reduce the number of mutants needed for contact prediction, suggesting that genomics-based techniques can efficiently predict 3D structure.
Collapse
Affiliation(s)
- Nathan J Rollins
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
| | - Kelly P Brock
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
- Department of Cell Biology, Harvard Medical School, Boston, MA, USA
| | - Frank J Poelwijk
- cBio Center, Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Michael A Stiffler
- cBio Center, Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Nicholas P Gauthier
- Department of Cell Biology, Harvard Medical School, Boston, MA, USA
- cBio Center, Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Chris Sander
- Department of Cell Biology, Harvard Medical School, Boston, MA, USA
- cBio Center, Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Debora S Marks
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA.
- Broad Institute of Harvard and MIT, Cambridge, MA, USA.
| |
Collapse
|
10
|
Ponce-Salvatierra A, Astha, Merdas K, Nithin C, Ghosh P, Mukherjee S, Bujnicki JM. Computational modeling of RNA 3D structure based on experimental data. Biosci Rep 2019; 39:BSR20180430. [PMID: 30670629 PMCID: PMC6367127 DOI: 10.1042/bsr20180430] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2018] [Revised: 01/19/2019] [Accepted: 01/21/2019] [Indexed: 01/02/2023] Open
Abstract
RNA molecules are master regulators of cells. They are involved in a variety of molecular processes: they transmit genetic information, sense cellular signals and communicate responses, and even catalyze chemical reactions. As in the case of proteins, RNA function is dictated by its structure and by its ability to adopt different conformations, which in turn is encoded in the sequence. Experimental determination of high-resolution RNA structures is both laborious and difficult, and therefore the majority of known RNAs remain structurally uncharacterized. To address this problem, predictive computational methods were developed based on the accumulated knowledge of RNA structures determined so far, the physical basis of the RNA folding, and taking into account evolutionary considerations, such as conservation of functionally important motifs. However, all theoretical methods suffer from various limitations, and they are generally unable to accurately predict structures for RNA sequences longer than 100-nt residues unless aided by additional experimental data. In this article, we review experimental methods that can generate data usable by computational methods, as well as computational approaches for RNA structure prediction that can utilize data from experimental analyses. We outline methods and data types that can be potentially useful for RNA 3D structure modeling but are not commonly used by the existing software, suggesting directions for future development.
Collapse
Affiliation(s)
- Almudena Ponce-Salvatierra
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, Warsaw PL-02-109, Poland
| | - Astha
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, Warsaw PL-02-109, Poland
| | - Katarzyna Merdas
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, Warsaw PL-02-109, Poland
| | - Chandran Nithin
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, Warsaw PL-02-109, Poland
| | - Pritha Ghosh
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, Warsaw PL-02-109, Poland
| | - Sunandan Mukherjee
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, Warsaw PL-02-109, Poland
| | - Janusz M Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, ul. Ks. Trojdena 4, Warsaw PL-02-109, Poland
- Bioinformatics Laboratory, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, ul. Umultowska 89, Poznan PL-61-614, Poland
| |
Collapse
|
11
|
Mailler E, Paillart JC, Marquet R, Smyth RP, Vivet-Boudou V. The evolution of RNA structural probing methods: From gels to next-generation sequencing. WILEY INTERDISCIPLINARY REVIEWS-RNA 2018; 10:e1518. [PMID: 30485688 DOI: 10.1002/wrna.1518] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/04/2018] [Revised: 09/13/2018] [Accepted: 10/17/2018] [Indexed: 01/09/2023]
Abstract
RNA molecules are important players in all domains of life and the study of the relationship between their multiple flexible states and the associated biological roles has increased in recent years. For several decades, chemical and enzymatic structural probing experiments have been used to determine RNA structure. During this time, there has been a steady improvement in probing reagents and experimental methods, and today the structural biologist community has a large range of tools at its disposal to probe the secondary structure of RNAs in vitro and in cells. Early experiments used radioactive labeling and polyacrylamide gel electrophoresis as read-out methods. This was superseded by capillary electrophoresis, and more recently by next-generation sequencing. Today, powerful structural probing methods can characterize RNA structure on a genome-wide scale. In this review, we will provide an overview of RNA structural probing methodologies from a historical and technical perspective. This article is categorized under: RNA Structure and Dynamics > RNA Structure, Dynamics, and Chemistry RNA Methods > RNA Analyses in vitro and In Silico RNA Methods > RNA Analyses in Cells.
Collapse
Affiliation(s)
- Elodie Mailler
- Architecture et Réactivité de l'ARN, Université de Strasbourg, CNRS, Strasbourg, France
| | | | - Roland Marquet
- Architecture et Réactivité de l'ARN, Université de Strasbourg, CNRS, Strasbourg, France
| | - Redmond P Smyth
- Architecture et Réactivité de l'ARN, Université de Strasbourg, CNRS, Strasbourg, France
| | - Valerie Vivet-Boudou
- Architecture et Réactivité de l'ARN, Université de Strasbourg, CNRS, Strasbourg, France
| |
Collapse
|
12
|
Woods CT, Laederach A. Classification of RNA structure change by 'gazing' at experimental data. Bioinformatics 2018; 33:1647-1655. [PMID: 28130241 PMCID: PMC5447233 DOI: 10.1093/bioinformatics/btx041] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2016] [Accepted: 01/20/2017] [Indexed: 11/12/2022] Open
Abstract
Motivation Mutations (or Single Nucleotide Variants) in folded RiboNucleic Acid structures that cause local or global conformational change are riboSNitches. Predicting riboSNitches is challenging, as it requires making two, albeit related, structure predictions. The data most often used to experimentally validate riboSNitch predictions is Selective 2' Hydroxyl Acylation by Primer Extension, or SHAPE. Experimentally establishing a riboSNitch requires the quantitative comparison of two SHAPE traces: wild-type (WT) and mutant. Historically, SHAPE data was collected on electropherograms and change in structure was evaluated by 'gel gazing.' SHAPE data is now routinely collected with next generation sequencing and/or capillary sequencers. We aim to establish a classifier capable of simulating human 'gazing' by identifying features of the SHAPE profile that human experts agree 'looks' like a riboSNitch. Results We find strong quantitative agreement between experts when RNA scientists 'gaze' at SHAPE data and identify riboSNitches. We identify dynamic time warping and seven other features predictive of the human consensus. The classSNitch classifier reported here accurately reproduces human consensus for 167 mutant/WT comparisons with an Area Under the Curve (AUC) above 0.8. When we analyze 2019 mutant traces for 17 different RNAs, we find that features of the WT SHAPE reactivity allow us to improve thermodynamic structure predictions of riboSNitches. This is significant, as accurate RNA structural analysis and prediction is likely to become an important aspect of precision medicine. Availability and Implementation The classSNitch R package is freely available at http://classsnitch.r-forge.r-project.org . Contact alain@email.unc.edu. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Chanin Tolson Woods
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.,Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Alain Laederach
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.,Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| |
Collapse
|
13
|
Xu X, Chen SJ. Hierarchical Assembly of RNA Three-Dimensional Structures Based on Loop Templates. J Phys Chem B 2018; 122:5327-5335. [PMID: 29258305 DOI: 10.1021/acs.jpcb.7b10102] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The current RNA structure prediction methods cannot keep up the pace of the rapidly increasing number of sequences and the emerging new functions of RNAs. Template-based RNA three-dimensional structure prediction methods are restricted by the limited number of known RNA structures, and traditional motif-based search for the templates does not always lead to successful results. Here we report a new template search and assembly algorithm, the hierarchical loop template-assembly method (VfoldLA). The method searches for templates for single strand loop/junctions instead of the whole motifs, which often renders no available templates, or short fragments (several nucleotides), which requires a long computational time to assemble and refine. The VfoldLA method has the advantage of accounting for local and nonlocal interloop interactions. Benchmark tests indicate that this new method can provide low-resolution predictions for RNA conformations at different levels of structural complexities. Furthermore, the VfoldLA-predicted conformations may also serve as reliable putative models for further structure prediction and refinements. VfoldLA is accessible at http://rna.physics.missouri.edu/vfoldLA .
Collapse
Affiliation(s)
- Xiaojun Xu
- Institute of Bioinformatics and Medical Engineering , Jiangsu University of Technology , Changzhou , Jiangsu 213001 , China.,Department of Physics, Department of Biochemistry, and Informatics Institute , University of Missouri , Columbia , Missouri 65211 , United States
| | - Shi-Jie Chen
- Department of Physics, Department of Biochemistry, and Informatics Institute , University of Missouri , Columbia , Missouri 65211 , United States
| |
Collapse
|
14
|
Abstract
In addition to continuous rapid progress in RNA structure determination, probing, and biophysical studies, the past decade has seen remarkable advances in the development of a new generation of RNA folding theories and models. In this article, we review RNA structure prediction models and models for ion-RNA and ligand-RNA interactions. These new models are becoming increasingly important for a mechanistic understanding of RNA function and quantitative design of RNA nanotechnology. We focus on new methods for physics-based, knowledge-based, and experimental data-directed modeling for RNA structures and explore the new theories for the predictions of metal ion and ligand binding sites and metal ion-dependent RNA stabilities. The integration of these new methods with theories about the cellular environment effects in RNA folding, such as molecular crowding and cotranscriptional kinetic effects, may ultimately lead to an all-encompassing RNA folding model.
Collapse
Affiliation(s)
- Li-Zhen Sun
- Department of Physics, Department of Biochemistry, and MU Informatics Institute, University of Missouri, Columbia, Missouri 65211;
| | - Dong Zhang
- Department of Physics, Department of Biochemistry, and MU Informatics Institute, University of Missouri, Columbia, Missouri 65211;
| | - Shi-Jie Chen
- Department of Physics, Department of Biochemistry, and MU Informatics Institute, University of Missouri, Columbia, Missouri 65211;
| |
Collapse
|
15
|
Abstract
The discoveries of myriad non-coding RNA molecules, each transiting through multiple flexible states in cells or virions, present major challenges for structure determination. Advances in high-throughput chemical mapping give new routes for characterizing entire transcriptomes in vivo, but the resulting one-dimensional data generally remain too information-poor to allow accurate de novo structure determination. Multidimensional chemical mapping (MCM) methods seek to address this challenge. Mutate-and-map (M2), RNA interaction groups by mutational profiling (RING-MaP and MaP-2D analysis) and multiplexed •OH cleavage analysis (MOHCA) measure how the chemical reactivities of every nucleotide in an RNA molecule change in response to modifications at every other nucleotide. A growing body of in vitro blind tests and compensatory mutation/rescue experiments indicate that MCM methods give consistently accurate secondary structures and global tertiary structures for ribozymes, ribosomal domains and ligand-bound riboswitch aptamers up to 200 nucleotides in length. Importantly, MCM analyses provide detailed information on structurally heterogeneous RNA states, such as ligand-free riboswitches that are functionally important but difficult to resolve with other approaches. The sequencing requirements of currently available MCM protocols scale at least quadratically with RNA length, precluding general application to transcriptomes or viral genomes at present. We propose a modify-cross-link-map (MXM) expansion to overcome this and other current limitations to resolving the in vivo 'RNA structurome'.
Collapse
|
16
|
Kinetic and thermodynamic framework for P4-P6 RNA reveals tertiary motif modularity and modulation of the folding preferred pathway. Proc Natl Acad Sci U S A 2016; 113:E4956-65. [PMID: 27493222 DOI: 10.1073/pnas.1525082113] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
The past decade has seen a wealth of 3D structural information about complex structured RNAs and identification of functional intermediates. Nevertheless, developing a complete and predictive understanding of the folding and function of these RNAs in biology will require connection of individual rate and equilibrium constants to structural changes that occur in individual folding steps and further relating these steps to the properties and behavior of isolated, simplified systems. To accomplish these goals we used the considerable structural knowledge of the folded, unfolded, and intermediate states of P4-P6 RNA. We enumerated structural states and possible folding transitions and determined rate and equilibrium constants for the transitions between these states using single-molecule FRET with a series of mutant P4-P6 variants. Comparisons with simplified constructs containing an isolated tertiary contact suggest that a given tertiary interaction has a stereotyped rate for breaking that may help identify structural transitions within complex RNAs and simplify the prediction of folding kinetics and thermodynamics for structured RNAs from their parts. The preferred folding pathway involves initial formation of the proximal tertiary contact. However, this preference was only ∼10 fold and could be reversed by a single point mutation, indicating that a model akin to a protein-folding contact order model will not suffice to describe RNA folding. Instead, our results suggest a strong analogy with a modified RNA diffusion-collision model in which tertiary elements within preformed secondary structures collide, with the success of these collisions dependent on whether the tertiary elements are in their rare binding-competent conformations.
Collapse
|
17
|
Coarse-grained modeling of RNA 3D structure. Methods 2016; 103:138-56. [PMID: 27125734 DOI: 10.1016/j.ymeth.2016.04.026] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2015] [Revised: 04/21/2016] [Accepted: 04/22/2016] [Indexed: 12/21/2022] Open
Abstract
Functional RNA molecules depend on three-dimensional (3D) structures to carry out their tasks within the cell. Understanding how these molecules interact to carry out their biological roles requires a detailed knowledge of RNA 3D structure and dynamics as well as thermodynamics, which strongly governs the folding of RNA and RNA-RNA interactions as well as a host of other interactions within the cellular environment. Experimental determination of these properties is difficult, and various computational methods have been developed to model the folding of RNA 3D structures and their interactions with other molecules. However, computational methods also have their limitations, especially when the biological effects demand computation of the dynamics beyond a few hundred nanoseconds. For the researcher confronted with such challenges, a more amenable approach is to resort to coarse-grained modeling to reduce the number of data points and computational demand to a more tractable size, while sacrificing as little critical information as possible. This review presents an introduction to the topic of coarse-grained modeling of RNA 3D structures and dynamics, covering both high- and low-resolution strategies. We discuss how physics-based approaches compare with knowledge based methods that rely on databases of information. In the course of this review, we discuss important aspects in the reasoning process behind building different models and the goals and pitfalls that can result.
Collapse
|
18
|
Yan K, Arfat Y, Li D, Zhao F, Chen Z, Yin C, Sun Y, Hu L, Yang T, Qian A. Structure Prediction: New Insights into Decrypting Long Noncoding RNAs. Int J Mol Sci 2016; 17:ijms17010132. [PMID: 26805815 PMCID: PMC4730372 DOI: 10.3390/ijms17010132] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2015] [Revised: 12/18/2015] [Accepted: 01/12/2016] [Indexed: 12/31/2022] Open
Abstract
Long noncoding RNAs (lncRNAs), which form a diverse class of RNAs, remain the least understood type of noncoding RNAs in terms of their nature and identification. Emerging evidence has revealed that a small number of newly discovered lncRNAs perform important and complex biological functions such as dosage compensation, chromatin regulation, genomic imprinting, and nuclear organization. However, understanding the wide range of functions of lncRNAs related to various processes of cellular networks remains a great experimental challenge. Structural versatility is critical for RNAs to perform various functions and provides new insights into probing the functions of lncRNAs. In recent years, the computational method of RNA structure prediction has been developed to analyze the structure of lncRNAs. This novel methodology has provided basic but indispensable information for the rapid, large-scale and in-depth research of lncRNAs. This review focuses on mainstream RNA structure prediction methods at the secondary and tertiary levels to offer an additional approach to investigating the functions of lncRNAs.
Collapse
Affiliation(s)
- Kun Yan
- Key Laboratory for Space Bioscience & Biotechnology, Institute of Special Environmental Biophysics, School of Life Sciences, Northwestern Polytechnical University, 127 Youyi Xilu, Xi'an 710072, China.
| | - Yasir Arfat
- Key Laboratory for Space Bioscience & Biotechnology, Institute of Special Environmental Biophysics, School of Life Sciences, Northwestern Polytechnical University, 127 Youyi Xilu, Xi'an 710072, China.
| | - Dijie Li
- Key Laboratory for Space Bioscience & Biotechnology, Institute of Special Environmental Biophysics, School of Life Sciences, Northwestern Polytechnical University, 127 Youyi Xilu, Xi'an 710072, China.
| | - Fan Zhao
- Key Laboratory for Space Bioscience & Biotechnology, Institute of Special Environmental Biophysics, School of Life Sciences, Northwestern Polytechnical University, 127 Youyi Xilu, Xi'an 710072, China.
| | - Zhihao Chen
- Key Laboratory for Space Bioscience & Biotechnology, Institute of Special Environmental Biophysics, School of Life Sciences, Northwestern Polytechnical University, 127 Youyi Xilu, Xi'an 710072, China.
| | - Chong Yin
- Key Laboratory for Space Bioscience & Biotechnology, Institute of Special Environmental Biophysics, School of Life Sciences, Northwestern Polytechnical University, 127 Youyi Xilu, Xi'an 710072, China.
| | - Yulong Sun
- Key Laboratory for Space Bioscience & Biotechnology, Institute of Special Environmental Biophysics, School of Life Sciences, Northwestern Polytechnical University, 127 Youyi Xilu, Xi'an 710072, China.
| | - Lifang Hu
- Key Laboratory for Space Bioscience & Biotechnology, Institute of Special Environmental Biophysics, School of Life Sciences, Northwestern Polytechnical University, 127 Youyi Xilu, Xi'an 710072, China.
| | - Tuanmin Yang
- Department of Bone Disease Oncology, Hong-Hui Hospital, Xi'an Jiaotong University College of Medicine, South Door slightly Friendship Road 555, Xi'an 710054, China.
| | - Airong Qian
- Key Laboratory for Space Bioscience & Biotechnology, Institute of Special Environmental Biophysics, School of Life Sciences, Northwestern Polytechnical University, 127 Youyi Xilu, Xi'an 710072, China.
| |
Collapse
|
19
|
Somarowthu S. Progress and Current Challenges in Modeling Large RNAs. J Mol Biol 2015; 428:736-747. [PMID: 26585404 DOI: 10.1016/j.jmb.2015.11.011] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2015] [Revised: 11/03/2015] [Accepted: 11/08/2015] [Indexed: 12/21/2022]
Abstract
Recent breakthroughs in next-generation sequencing technologies have led to the discovery of several classes of non-coding RNAs (ncRNAs). It is now apparent that RNA molecules are not only just carriers of genetic information but also key players in many cellular processes. While there has been a rapid increase in the number of ncRNA sequences deposited in various databases over the past decade, the biological functions of these ncRNAs are largely not well understood. Similar to proteins, RNA molecules carry out a function by forming specific three-dimensional structures. Understanding the function of a particular RNA therefore requires a detailed knowledge of its structure. However, determining experimental structures of RNA is extremely challenging. In fact, RNA-only structures represent just 1% of the total structures deposited in the PDB. Thus, computational methods that predict three-dimensional RNA structures are in high demand. Computational models can provide valuable insights into structure-function relationships in ncRNAs and can aid in the development of functional hypotheses and experimental designs. In recent years, a set of diverse RNA structure prediction tools have become available, which differ in computational time, input data and accuracy. This review discusses the recent progress and challenges in RNA structure prediction methods.
Collapse
Affiliation(s)
- Srinivas Somarowthu
- Department of Molecular, Cellular and Developmental Biology, Yale University, 219 Prospect Street, Kline Biology Tower, New Haven, CT 06511, USA.
| |
Collapse
|
20
|
Boudard M, Bernauer J, Barth D, Cohen J, Denise A. GARN: Sampling RNA 3D Structure Space with Game Theory and Knowledge-Based Scoring Strategies. PLoS One 2015; 10:e0136444. [PMID: 26313379 PMCID: PMC4551674 DOI: 10.1371/journal.pone.0136444] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2015] [Accepted: 08/03/2015] [Indexed: 11/19/2022] Open
Abstract
Cellular processes involve large numbers of RNA molecules. The functions of these RNA molecules and their binding to molecular machines are highly dependent on their 3D structures. One of the key challenges in RNA structure prediction and modeling is predicting the spatial arrangement of the various structural elements of RNA. As RNA folding is generally hierarchical, methods involving coarse-grained models hold great promise for this purpose. We present here a novel coarse-grained method for sampling, based on game theory and knowledge-based potentials. This strategy, GARN (Game Algorithm for RNa sampling), is often much faster than previously described techniques and generates large sets of solutions closely resembling the native structure. GARN is thus a suitable starting point for the molecular modeling of large RNAs, particularly those with experimental constraints. GARN is available from: http://garn.lri.fr/.
Collapse
Affiliation(s)
- Mélanie Boudard
- PRiSM, CNRS UMR 8144, Université de Versailles-St-Quentin-en-Yvelines, 78000 Versailles, France
- LRI, CNRS UMR 8623, Université Paris-Sud, 91405 Orsay, France
- * E-mail: (MB); (JC)
| | - Julie Bernauer
- AMIB, Inria Saclay-Ile de France, 91120 Palaiseau, France
- LIX, CNRS UMR 7161, Ecole Polytechnique, 91120 Palaiseau, France
| | - Dominique Barth
- PRiSM, CNRS UMR 8144, Université de Versailles-St-Quentin-en-Yvelines, 78000 Versailles, France
| | - Johanne Cohen
- LRI, CNRS UMR 8623, Université Paris-Sud, 91405 Orsay, France
- * E-mail: (MB); (JC)
| | - Alain Denise
- LRI, CNRS UMR 8623, Université Paris-Sud, 91405 Orsay, France
- AMIB, Inria Saclay-Ile de France, 91120 Palaiseau, France
- I2BC, CNRS, Université Paris-Sud, 91405 Orsay, France
| |
Collapse
|
21
|
Abstract
Despite the success of RNA secondary structure prediction for simple, short RNAs, the problem of predicting RNAs with long-range tertiary folds remains. Furthermore, RNA 3D structure prediction is hampered by the lack of the knowledge about the tertiary contacts and their thermodynamic parameters. Low-resolution structural modeling enables us to estimate the conformational entropies for a number of tertiary folds through rigorous statistical mechanical calculations. The models lead to 3D tertiary folds at coarse-grained level. The coarse-grained structures serve as the initial structures for all-atom molecular dynamics refinement to build the final all-atom 3D structures. In this paper, we present an overview of RNA computational models for secondary and tertiary structures’ predictions and then focus on a recently developed RNA statistical mechanical model—the Vfold model. The main emphasis is placed on the physics behind the models, including the treatment of the non-canonical interactions in secondary and tertiary structure modelings, and the correlations to RNA functions.
Collapse
Affiliation(s)
- Xiaojun Xu
- />Department of Physics, University of Missouri, Columbia, MO 65211 USA
- />Department of Biochemistry, University of Missouri, Columbia, MO 65211 USA
- />Informatics Institute, University of Missouri, Columbia, MO 65211 USA
| | - Shi-Jie Chen
- />Department of Physics, University of Missouri, Columbia, MO 65211 USA
- />Department of Biochemistry, University of Missouri, Columbia, MO 65211 USA
- />Informatics Institute, University of Missouri, Columbia, MO 65211 USA
| |
Collapse
|
22
|
Cheng CY, Chou FC, Kladwang W, Tian S, Cordero P, Das R. Consistent global structures of complex RNA states through multidimensional chemical mapping. eLife 2015; 4:e07600. [PMID: 26035425 PMCID: PMC4495719 DOI: 10.7554/elife.07600] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2015] [Accepted: 06/02/2015] [Indexed: 11/13/2022] Open
Abstract
Accelerating discoveries of non-coding RNA (ncRNA) in myriad biological processes pose major challenges to structural and functional analysis. Despite progress in secondary structure modeling, high-throughput methods have generally failed to determine ncRNA tertiary structures, even at the 1-nm resolution that enables visualization of how helices and functional motifs are positioned in three dimensions. We report that integrating a new method called MOHCA-seq (Multiplexed •OHCleavage Analysis with paired-end sequencing) with mutate-and-map secondary structure inference guides Rosetta 3D modeling to consistent 1-nm accuracy for intricately folded ncRNAs with lengths up to 188 nucleotides, including a blind RNA-puzzle challenge, the lariat-capping ribozyme. This multidimensional chemical mapping (MCM) pipeline resolves unexpected tertiary proximities for cyclic-di-GMP, glycine, and adenosylcobalamin riboswitch aptamers without their ligands and a loose structure for the recently discovered human HoxA9D internal ribosome entry site regulon. MCM offers a sequencing-based route to uncovering ncRNA 3D structure, applicable to functionally important but potentially heterogeneous states. DOI:http://dx.doi.org/10.7554/eLife.07600.001 Our genetic material, in the form of molecules of DNA, provides instructions for many different processes in our cells. To issue these instructions, particular sections of DNA are copied to make a type of molecule called ribonucleic acid (RNA). Some of these RNA molecules contain instructions to make proteins, but others—known as non-coding RNAs—regulate the activity of genes in cells. The genetic information within RNA is encoded by the sequence of four different chemical parts called ‘nucleotides’. RNA can exist as a single strand of nucleotides, but the nucleotides can also pair up in specific combinations to form sections of double-stranded RNA. Therefore, a single strand of non-coding RNA can fold into a complex three-dimensional shape that contains loops, twists, and bulges. The three-dimensional structures of non-coding RNAs are crucial for their roles in cells, but the variety and complexity of shapes that they can form makes it technically difficult to study them. In 2008, researchers developed a new method called MOHCA that can map the positions of nucleotides that are close together in the three-dimensional structure. Highly reactive chemicals are attached to the nucleotides and these can react with, and damage, other nearby nucleotides. By detecting which nucleotides have been damaged, it is possible to map the positions of these nucleotides and decipher the structure of the RNA molecule using computer algorithms. MOHCA is a promising approach, but the initial methods to find the damaged nucleotides were tedious and required specialized equipment. Now, Cheng, Das et al.—including some of the researchers involved in the 2008 work—have developed an improved version of MOHCA that uses readily available RNA sequencing techniques to find the damaged nucleotides. The RNA sequencing data are then analyzed by a new algorithm in the Rosetta computer modeling software. Cheng, Das et al. used this newly developed ‘MOHCA-seq’ and Rosetta to reveal the structures of a human non-coding RNA and several other non-coding RNA molecules to a much higher level of detail than before. Together, MOHCA-seq and Rosetta provide a rapid method for researchers to decipher the three-dimensional structure of non-coding RNAs. This method is likely to speed up the analysis of the complex structures of non-coding RNAs. It will be useful in future efforts to work out what roles these RNAs play in cells, including their activity in cancer, neurodegeneration, and other diseases. DOI:http://dx.doi.org/10.7554/eLife.07600.002
Collapse
Affiliation(s)
- Clarence Yu Cheng
- Department of Biochemistry, Stanford University, Stanford, United States
| | - Fang-Chieh Chou
- Department of Biochemistry, Stanford University, Stanford, United States
| | - Wipapat Kladwang
- Department of Biochemistry, Stanford University, Stanford, United States
| | - Siqi Tian
- Department of Biochemistry, Stanford University, Stanford, United States
| | - Pablo Cordero
- Biomedical Informatics Program, Stanford University, Stanford, United States
| | - Rhiju Das
- Department of Biochemistry, Stanford University, Stanford, United States
| |
Collapse
|
23
|
Kerpedjiev P, Höner Zu Siederdissen C, Hofacker IL. Predicting RNA 3D structure using a coarse-grain helix-centered model. RNA (NEW YORK, N.Y.) 2015; 21:1110-1121. [PMID: 25904133 PMCID: PMC4436664 DOI: 10.1261/rna.047522.114] [Citation(s) in RCA: 54] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/20/2014] [Accepted: 02/13/2015] [Indexed: 06/04/2023]
Abstract
A 3D model of RNA structure can provide information about its function and regulation that is not possible with just the sequence or secondary structure. Current models suffer from low accuracy and long running times and either neglect or presume knowledge of the long-range interactions which stabilize the tertiary structure. Our coarse-grained, helix-based, tertiary structure model operates with only a few degrees of freedom compared with all-atom models while preserving the ability to sample tertiary structures given a secondary structure. It strikes a balance between the precision of an all-atom tertiary structure model and the simplicity and effectiveness of a secondary structure representation. It provides a simplified tool for exploring global arrangements of helices and loops within RNA structures. We provide an example of a novel energy function relying only on the positions of stems and loops. We show that coupling our model to this energy function produces predictions as good as or better than the current state of the art tools. We propose that given the wide range of conformational space that needs to be explored, a coarse-grain approach can explore more conformations in less iterations than an all-atom model coupled to a fine-grain energy function. Finally, we emphasize the overarching theme of providing an ensemble of predicted structures, something which our tool excels at, rather than providing a handful of the lowest energy structures.
Collapse
Affiliation(s)
| | - Christian Höner Zu Siederdissen
- Institute for Theoretical Chemistry, A-1090 Vienna, Austria Bioinformatics Group, Department of Computer Science, Universität Leipzig, D-04107 Leipzig, Germany Interdisciplinary Center for Bioinformatics, Universität Leipzig, D-04107 Leipzig, Germany
| | - Ivo L Hofacker
- Institute for Theoretical Chemistry, A-1090 Vienna, Austria Research Group Bioinformatics and Computational Biology, University of Vienna, A-1090 Vienna, Austria Center for non-coding RNA in Technology and Health, Department of Veterinary Clinical and Animal Science, University of Copenhagen, DK-1870 Frederiksberg, Denmark
| |
Collapse
|
24
|
Ge P, Zhang S. Computational analysis of RNA structures with chemical probing data. Methods 2015; 79-80:60-6. [PMID: 25687190 DOI: 10.1016/j.ymeth.2015.02.003] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2014] [Revised: 01/16/2015] [Accepted: 02/09/2015] [Indexed: 11/28/2022] Open
Abstract
RNAs play various roles, not only as the genetic codes to synthesize proteins, but also as the direct participants of biological functions determined by their underlying high-order structures. Although many computational methods have been proposed for analyzing RNA structures, their accuracy and efficiency are limited, especially when applied to the large RNAs and the genome-wide data sets. Recently, advances in parallel sequencing and high-throughput chemical probing technologies have prompted the development of numerous new algorithms, which can incorporate the auxiliary structural information obtained from those experiments. Their potential has been revealed by the secondary structure prediction of ribosomal RNAs and the genome-wide ncRNA function annotation. In this review, the existing probing-directed computational methods for RNA secondary and tertiary structure analysis are discussed.
Collapse
Affiliation(s)
- Ping Ge
- Department of Electrical Engineering and Computer Science, University of Central Florida, Orlando, FL 32816-2362, USA
| | - Shaojie Zhang
- Department of Electrical Engineering and Computer Science, University of Central Florida, Orlando, FL 32816-2362, USA.
| |
Collapse
|
25
|
Abstract
Reliable modeling of RNA tertiary structures is key to both understanding these structures' roles in complex biological machines and to eventually facilitating their design for molecular computing and robotics. In recent years, a concerted effort to improve computational prediction of RNA structure through the RNA-Puzzles blind prediction trials has accelerated advances in the field. Among other approaches, the versatile and expanding Rosetta molecular modeling software now permits modeling of RNAs in the 100-300 nucleotide size range at consistent subhelical (~1 nm) resolution. Our laboratory's current state-of-the-art methods for RNAs in this size range involve Fragment Assembly of RNA with Full-Atom Refinement (FARFAR), which optimizes RNA conformations in the context of a physically realistic energy function, as well as hybrid techniques that leverage experimental data to inform computational modeling. In this chapter, we give a practical guide to our current workflow for modeling RNA three-dimensional structures using FARFAR, including strategies for using data from multidimensional chemical mapping experiments to focus sampling and select accurate conformations.
Collapse
Affiliation(s)
- Clarence Yu Cheng
- Department of Biochemistry, Stanford University, Stanford, California, USA
| | - Fang-Chieh Chou
- Department of Biochemistry, Stanford University, Stanford, California, USA
| | - Rhiju Das
- Department of Biochemistry, Stanford University, Stanford, California, USA; Department of Physics, Stanford University, Stanford, California, USA.
| |
Collapse
|
26
|
Dufour D, Marti-Renom MA. Software for predicting the 3D structure of RNA molecules. WILEY INTERDISCIPLINARY REVIEWS: COMPUTATIONAL MOLECULAR SCIENCE 2015. [DOI: 10.1002/wcms.1198] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Affiliation(s)
- David Dufour
- Genome Biology Group, Centre Nacional d'Anàlisi Genòmica (CNAG) and Gene Regulacion, Stem Cells and Cancer Program; Centre de Regulació Genòmica (CRG); Barcelona Spain
| | - Marc A. Marti-Renom
- Institució Catalana de Recerca i Estudis Avançats (ICREA); Genome Biology Group, Centre Nacional d'Anàlisi Genòmica (CNAG) and Gene Regulacion, Stem Cells and Cancer Program, Centre de Regulació Genòmica (CRG); Barcelona Spain
| |
Collapse
|
27
|
Tian S, Cordero P, Kladwang W, Das R. High-throughput mutate-map-rescue evaluates SHAPE-directed RNA structure and uncovers excited states. RNA (NEW YORK, N.Y.) 2014; 20:1815-26. [PMID: 25183835 PMCID: PMC4201832 DOI: 10.1261/rna.044321.114] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
The three-dimensional conformations of noncoding RNAs underpin their biochemical functions but have largely eluded experimental characterization. Here, we report that integrating a classic mutation/rescue strategy with high-throughput chemical mapping enables rapid RNA structure inference with unusually strong validation. We revisit a 16S rRNA domain for which SHAPE (selective 2'-hydroxyl acylation with primer extension) and limited mutational analysis suggested a conformational change between apo- and holo-ribosome conformations. Computational support estimates, data from alternative chemical probes, and mutate-and-map (M(2)) experiments highlight issues of prior methodology and instead give a near-crystallographic secondary structure. Systematic interrogation of single base pairs via a high-throughput mutation/rescue approach then permits incisive validation and refinement of the M(2)-based secondary structure. The data further uncover the functional conformation as an excited state (20 ± 10% population) accessible via a single-nucleotide register shift. These results correct an erroneous SHAPE inference of a ribosomal conformational change, expose critical limitations of conventional structure mapping methods, and illustrate practical steps for more incisively dissecting RNA dynamic structure landscapes.
Collapse
Affiliation(s)
- Siqi Tian
- Department of Biochemistry, Stanford University, Stanford, California 94305, USA
| | - Pablo Cordero
- Biomedical Informatics Program, Stanford University, Stanford, California 94305, USA
| | - Wipapat Kladwang
- Department of Physics, Stanford University, Stanford, California 94305, USA
| | - Rhiju Das
- Department of Biochemistry, Stanford University, Stanford, California 94305, USA Biomedical Informatics Program, Stanford University, Stanford, California 94305, USA Department of Physics, Stanford University, Stanford, California 94305, USA
| |
Collapse
|
28
|
Shi YZ, Wang FH, Wu YY, Tan ZJ. A coarse-grained model with implicit salt for RNAs: Predicting 3D structure, stability and salt effect. J Chem Phys 2014; 141:105102. [DOI: 10.1063/1.4894752] [Citation(s) in RCA: 64] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Affiliation(s)
- Ya-Zhou Shi
- Department of Physics and Key Laboratory of Artificial Micro- and Nano-Structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Feng-Hua Wang
- Department of Physics and Key Laboratory of Artificial Micro- and Nano-Structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Yuan-Yan Wu
- Department of Physics and Key Laboratory of Artificial Micro- and Nano-Structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Zhi-Jie Tan
- Department of Physics and Key Laboratory of Artificial Micro- and Nano-Structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| |
Collapse
|
29
|
Abstract
A comprehensive understanding of RNA structure will provide fundamental insights into the cellular function of both coding and non-coding RNAs. Although many RNA structures have been analysed by traditional biophysical and biochemical methods, the low-throughput nature of these approaches has prevented investigation of the vast majority of cellular transcripts. Triggered by advances in sequencing technology, genome-wide approaches for probing the transcriptome are beginning to reveal how RNA structure affects each step of protein expression and RNA stability. In this Review, we discuss the emerging relationships between RNA structure and the regulation of gene expression.
Collapse
|
30
|
Magnus M, Matelska D, Łach G, Chojnowski G, Boniecki MJ, Purta E, Dawson W, Dunin-Horkawicz S, Bujnicki JM. Computational modeling of RNA 3D structures, with the aid of experimental restraints. RNA Biol 2014; 11:522-36. [PMID: 24785264 PMCID: PMC4152360 DOI: 10.4161/rna.28826] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2014] [Revised: 04/01/2014] [Accepted: 04/08/2014] [Indexed: 11/19/2022] Open
Abstract
In addition to mRNAs whose primary function is transmission of genetic information from DNA to proteins, numerous other classes of RNA molecules exist, which are involved in a variety of functions, such as catalyzing biochemical reactions or performing regulatory roles. In analogy to proteins, the function of RNAs depends on their structure and dynamics, which are largely determined by the ribonucleotide sequence. Experimental determination of high-resolution RNA structures is both laborious and difficult, and therefore, the majority of known RNAs remain structurally uncharacterized. To address this problem, computational structure prediction methods were developed that simulate either the physical process of RNA structure formation ("Greek science" approach) or utilize information derived from known structures of other RNA molecules ("Babylonian science" approach). All computational methods suffer from various limitations that make them generally unreliable for structure prediction of long RNA sequences. However, in many cases, the limitations of computational and experimental methods can be overcome by combining these two complementary approaches with each other. In this work, we review computational approaches for RNA structure prediction, with emphasis on implementations (particular programs) that can utilize restraints derived from experimental analyses. We also list experimental approaches, whose results can be relatively easily used by computational methods. Finally, we describe case studies where computational and experimental analyses were successfully combined to determine RNA structures that would remain out of reach for each of these approaches applied separately.
Collapse
Affiliation(s)
- Marcin Magnus
- Laboratory of Bioinformatics and Protein Engineering; International Institute of Molecular and Cell Biology; Warsaw, Poland
| | - Dorota Matelska
- Laboratory of Bioinformatics and Protein Engineering; International Institute of Molecular and Cell Biology; Warsaw, Poland
| | - Grzegorz Łach
- Laboratory of Bioinformatics and Protein Engineering; International Institute of Molecular and Cell Biology; Warsaw, Poland
| | - Grzegorz Chojnowski
- Laboratory of Bioinformatics and Protein Engineering; International Institute of Molecular and Cell Biology; Warsaw, Poland
| | - Michal J Boniecki
- Laboratory of Bioinformatics and Protein Engineering; International Institute of Molecular and Cell Biology; Warsaw, Poland
| | - Elzbieta Purta
- Laboratory of Bioinformatics and Protein Engineering; International Institute of Molecular and Cell Biology; Warsaw, Poland
| | - Wayne Dawson
- Laboratory of Bioinformatics and Protein Engineering; International Institute of Molecular and Cell Biology; Warsaw, Poland
| | - Stanislaw Dunin-Horkawicz
- Laboratory of Bioinformatics and Protein Engineering; International Institute of Molecular and Cell Biology; Warsaw, Poland
| | - Janusz M Bujnicki
- Laboratory of Bioinformatics and Protein Engineering; International Institute of Molecular and Cell Biology; Warsaw, Poland
- Laboratory of Structural Bioinformatics; Institute of Molecular Biology and Biotechnology; Faculty of Biology; Adam Mickiewicz University; Poznan, Poland
| |
Collapse
|
31
|
|
32
|
Abstract
Self-assembling RNA molecules present compelling substrates for the rational interrogation and control of living systems. However, imperfect in silico models--even at the secondary structure level--hinder the design of new RNAs that function properly when synthesized. Here, we present a unique and potentially general approach to such empirical problems: the Massive Open Laboratory. The EteRNA project connects 37,000 enthusiasts to RNA design puzzles through an online interface. Uniquely, EteRNA participants not only manipulate simulated molecules but also control a remote experimental pipeline for high-throughput RNA synthesis and structure mapping. We show herein that the EteRNA community leveraged dozens of cycles of continuous wet laboratory feedback to learn strategies for solving in vitro RNA design problems on which automated methods fail. The top strategies--including several previously unrecognized negative design rules--were distilled by machine learning into an algorithm, EteRNABot. Over a rigorous 1-y testing phase, both the EteRNA community and EteRNABot significantly outperformed prior algorithms in a dozen RNA secondary structure design tests, including the creation of dendrimer-like structures and scaffolds for small molecule sensors. These results show that an online community can carry out large-scale experiments, hypothesis generation, and algorithm design to create practical advances in empirical science.
Collapse
|
33
|
Novikova IV, Hennelly SP, Sanbonmatsu KY. Tackling structures of long noncoding RNAs. Int J Mol Sci 2013; 14:23672-84. [PMID: 24304541 PMCID: PMC3876070 DOI: 10.3390/ijms141223672] [Citation(s) in RCA: 68] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2013] [Revised: 11/15/2013] [Accepted: 11/25/2013] [Indexed: 11/16/2022] Open
Abstract
RNAs are important catalytic machines and regulators at every level of gene expression. A new class of RNAs has emerged called long non-coding RNAs, providing new insights into evolution, development and disease. Long non-coding RNAs (lncRNAs) predominantly found in higher eukaryotes, have been implicated in the regulation of transcription factors, chromatin-remodeling, hormone receptors and many other processes. The structural versatility of RNA allows it to perform various functions, ranging from precise protein recognition to catalysis and metabolite sensing. While major housekeeping RNA molecules have long been the focus of structural studies, lncRNAs remain the least characterized class, both structurally and functionally. Here, we review common methodologies used to tackle RNA structure, emphasizing their potential application to lncRNAs. When considering the complexity of lncRNAs and lack of knowledge of their structure, chemical probing appears to be an indispensable tool, with few restrictions in terms of size, quantity and heterogeneity of the RNA molecule. Probing is not constrained to in vitro analysis and can be adapted to high-throughput sequencing platforms. Significant efforts have been applied to develop new in vivo chemical probing reagents, new library construction protocols for sequencing platforms and improved RNA prediction software based on the experimental evidence.
Collapse
|
34
|
Frank AT, Bae SH, Stelzer AC. Prediction of RNA 1H and 13C chemical shifts: a structure based approach. J Phys Chem B 2013; 117:13497-506. [PMID: 24033307 DOI: 10.1021/jp407254m] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The use of NMR-derived chemical shifts in protein structure determination and prediction has received much attention, and, as such, many methods have been developed to predict protein chemical shifts from three-dimensional (3D) coordinates. In contrast, little attention has been paid to predicting chemical shifts from RNA coordinates. Using the random forest machine learning approach, we developed RAMSEY, which is capable of predicting both (1)H and protonated (13)C chemical shifts from RNA coordinates. In this report, we introduce RAMSEY, assess its accuracy, and demonstrate the sensitivity of RAMSEY-predicted chemical shifts to RNA 3D structure.
Collapse
Affiliation(s)
- Aaron T Frank
- Nymirum , 3510 West Liberty Road, Ann Arbor, Michigan 48103, United States
| | | | | |
Collapse
|
35
|
Large-scale study of long non-coding RNA functions based on structure and expression features. SCIENCE CHINA-LIFE SCIENCES 2013; 56:953-9. [PMID: 24091687 DOI: 10.1007/s11427-013-4556-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/14/2013] [Accepted: 09/02/2013] [Indexed: 02/01/2023]
Abstract
Mammals and other complex organisms can transcribe an abundance of long non-coding RNAs (lncRNAs) that fulfill a wide variety of regulatory roles in many biological processes. These roles, including as scaffolds and as guides for protein-coding genes, mainly depend on the structure and expression level of lncRNAs. In this review, we focus on the current methods for analyzing lncRNA structure and expression, which is basic but necessary information for in-depth, large-scale analysis of lncRNA functions.
Collapse
|
36
|
GPU-based acceleration of an RNA tertiary structure prediction algorithm. Comput Biol Med 2013; 43:1011-22. [DOI: 10.1016/j.compbiomed.2013.05.007] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2012] [Revised: 05/08/2013] [Accepted: 05/14/2013] [Indexed: 11/22/2022]
|
37
|
Kim H, Cordero P, Das R, Yoon S. HiTRACE-Web: an online tool for robust analysis of high-throughput capillary electrophoresis. Nucleic Acids Res 2013; 41:W492-8. [PMID: 23761448 PMCID: PMC3692083 DOI: 10.1093/nar/gkt501] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2013] [Revised: 05/08/2013] [Accepted: 05/15/2013] [Indexed: 01/14/2023] Open
Abstract
To facilitate the analysis of large-scale high-throughput capillary electrophoresis data, we previously proposed a suite of efficient analysis software named HiTRACE (High Throughput Robust Analysis of Capillary Electrophoresis). HiTRACE has been used extensively for quantitating data from RNA and DNA structure mapping experiments, including mutate-and-map contact inference, chromatin footprinting, the Eterna RNA design project and other high-throughput applications. However, HiTRACE is based on a suite of command-line MATLAB scripts that requires nontrivial efforts to learn, use and extend. Here, we present HiTRACE-Web, an online version of HiTRACE that includes standard features previously available in the command-line version and additional features such as automated band annotation and flexible adjustment of annotations, all via a user-friendly environment. By making use of parallelization, the on-line workflow is also faster than software implementations available to most users on their local computers. Free access: http://hitrace.org.
Collapse
Affiliation(s)
- Hanjoo Kim
- Department of Electrical and Computer Engineering, Seoul National University, Seoul 151-744, Korea, Bioinformatics Institute, Seoul National University, Seoul 151-747, Korea, Program in Biomedical Informatics, School of Medicine, Stanford University, Stanford CA 94305, USA, Department of Biochemistry, Stanford University, Stanford, CA 94305, USA and Department of Physics, Stanford University, Stanford, CA 94305, USA
| | - Pablo Cordero
- Department of Electrical and Computer Engineering, Seoul National University, Seoul 151-744, Korea, Bioinformatics Institute, Seoul National University, Seoul 151-747, Korea, Program in Biomedical Informatics, School of Medicine, Stanford University, Stanford CA 94305, USA, Department of Biochemistry, Stanford University, Stanford, CA 94305, USA and Department of Physics, Stanford University, Stanford, CA 94305, USA
| | - Rhiju Das
- Department of Electrical and Computer Engineering, Seoul National University, Seoul 151-744, Korea, Bioinformatics Institute, Seoul National University, Seoul 151-747, Korea, Program in Biomedical Informatics, School of Medicine, Stanford University, Stanford CA 94305, USA, Department of Biochemistry, Stanford University, Stanford, CA 94305, USA and Department of Physics, Stanford University, Stanford, CA 94305, USA
| | - Sungroh Yoon
- Department of Electrical and Computer Engineering, Seoul National University, Seoul 151-744, Korea, Bioinformatics Institute, Seoul National University, Seoul 151-747, Korea, Program in Biomedical Informatics, School of Medicine, Stanford University, Stanford CA 94305, USA, Department of Biochemistry, Stanford University, Stanford, CA 94305, USA and Department of Physics, Stanford University, Stanford, CA 94305, USA
| |
Collapse
|
38
|
Ouyang Z, Snyder MP, Chang HY. SeqFold: genome-scale reconstruction of RNA secondary structure integrating high-throughput sequencing data. Genome Res 2012; 23:377-87. [PMID: 23064747 PMCID: PMC3561878 DOI: 10.1101/gr.138545.112] [Citation(s) in RCA: 85] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
We present an integrative approach, SeqFold, that combines high-throughput RNA structure profiling data with computational prediction for genome-scale reconstruction of RNA secondary structures. SeqFold transforms experimental RNA structure information into a structure preference profile (SPP) and uses it to select stable RNA structure candidates representing the structure ensemble. Under a high-dimensional classification framework, SeqFold efficiently matches a given SPP to the most likely cluster of structures sampled from the Boltzmann-weighted ensemble. SeqFold is able to incorporate diverse types of RNA structure profiling data, including parallel analysis of RNA structure (PARS), selective 2′-hydroxyl acylation analyzed by primer extension sequencing (SHAPE-Seq), fragmentation sequencing (FragSeq) data generated by deep sequencing, and conventional SHAPE data. Using the known structures of a wide range of mRNAs and noncoding RNAs as benchmarks, we demonstrate that SeqFold outperforms or matches existing approaches in accuracy and is more robust to noise in experimental data. Application of SeqFold to reconstruct the secondary structures of the yeast transcriptome reveals the diverse impact of RNA secondary structure on gene regulation, including translation efficiency, transcription initiation, and protein-RNA interactions. SeqFold can be easily adapted to incorporate any new types of high-throughput RNA structure profiling data and is widely applicable to analyze RNA structures in any transcriptome.
Collapse
Affiliation(s)
- Zhengqing Ouyang
- Howard Hughes Medical Institute and Program in Epithelial Biology, Stanford University School of Medicine, Stanford, California 94305, USA.
| | | | | |
Collapse
|
39
|
Cirillo D, Agostini F, Tartaglia GG. Predictions of protein-RNA interactions. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2012. [DOI: 10.1002/wcms.1119] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
|
40
|
Ohmayer U, Perez-Fernandez J, Hierlmeier T, Pöll G, Williams L, Griesenbeck J, Tschochner H, Milkereit P. Local tertiary structure probing of ribonucleoprotein particles by nuclease fusion proteins. PLoS One 2012; 7:e42449. [PMID: 22876323 PMCID: PMC3411627 DOI: 10.1371/journal.pone.0042449] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2012] [Accepted: 07/06/2012] [Indexed: 11/19/2022] Open
Abstract
Analyses of the conformational dynamics of the numerous cellular ribonucleoprotein particles (RNP) significantly contribute to the understanding of their modes of action. Here, we tested whether ribonuclease fusion proteins incorporated into RNPs can be used as molecular probes to characterize the local RNA environment of these proteins. Fusion proteins of micrococcal nuclease (MNase) with ribosomal proteins were expressed in S. cerevisae to produce in vivo recombinant ribosomes which have a ribonuclease tethered to specific sites. Activation of the MNase activity by addition of calcium led to specific rRNA cleavage events in proximity to the ribosomal binding sites of the fusion proteins. The dimensions of the RNP environment which could be probed by this approach varied with the size of the linker sequence between MNase and the fused protein. Advantages and disadvantages of the use of MNase fusion proteins for local tertiary structure probing of RNPs as well as alternative applications for this type of approach in RNP research are discussed.
Collapse
Affiliation(s)
- Uli Ohmayer
- Lehrstuhl für Biochemie III, Universität Regensburg, Regensburg, Germany
| | | | - Thomas Hierlmeier
- Lehrstuhl für Biochemie III, Universität Regensburg, Regensburg, Germany
| | - Gisela Pöll
- Lehrstuhl für Biochemie III, Universität Regensburg, Regensburg, Germany
| | - Lydia Williams
- Lehrstuhl für Biochemie III, Universität Regensburg, Regensburg, Germany
| | | | - Herbert Tschochner
- Lehrstuhl für Biochemie III, Universität Regensburg, Regensburg, Germany
| | - Philipp Milkereit
- Lehrstuhl für Biochemie III, Universität Regensburg, Regensburg, Germany
| |
Collapse
|
41
|
Ritz J, Martin JS, Laederach A. Evaluating our ability to predict the structural disruption of RNA by SNPs. BMC Genomics 2012; 13 Suppl 4:S6. [PMID: 22759654 PMCID: PMC3303743 DOI: 10.1186/1471-2164-13-s4-s6] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
The structure of RiboNucleic Acid (RNA) has the potential to be altered by a Single Nucleotide Polymorphism (SNP). Disease-associated SNPs mapping to non-coding regions of the genome that are transcribed into RiboNucleic Acid (RNA) can potentially affect cellular regulation (and cause disease) by altering the structure of the transcript. We performed a large-scale meta-analysis of Selective 2'-Hydroxyl Acylation analyzed by Primer Extension (SHAPE) data, which probes the structure of RNA. We found that several single point mutations exist that significantly disrupt RNA secondary structure in the five transcripts we analyzed. Thus, every RNA that is transcribed has the potential to be a “RiboSNitch;” where a SNP causes a large conformational change that alters regulatory function. Predicting the SNPs that will have the largest effect on RNA structure remains a contemporary computational challenge. We therefore benchmarked the most popular RNA structure prediction algorithms for their ability to identify mutations that maximally affect structure. We also evaluated metrics for rank ordering the extent of the structural change. Although no single algorithm/metric combination dramatically outperformed the others, small differences in AUC (Area Under the Curve) values reveal that certain approaches do provide better agreement with experiment. The experimental data we analyzed nonetheless show that multiple single point mutations exist in all RNA transcripts that significantly disrupt structure in agreement with the predictions.
Collapse
Affiliation(s)
- Justin Ritz
- Department of Biology, University of North Carolina, Chapel Hill, NC 27599, USA
| | | | | |
Collapse
|
42
|
Sim AYL, Minary P, Levitt M. Modeling nucleic acids. Curr Opin Struct Biol 2012; 22:273-8. [PMID: 22538125 DOI: 10.1016/j.sbi.2012.03.012] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2012] [Accepted: 03/25/2012] [Indexed: 01/24/2023]
Abstract
Nucleic acids are an important class of biological macromolecules that carry out a variety of cellular roles. For many functions, naturally occurring DNA and RNA molecules need to fold into precise three-dimensional structures. Due to their self-assembling characteristics, nucleic acids have also been widely studied in the field of nanotechnology, and a diverse range of intricate three-dimensional nanostructures have been designed and synthesized. Different physical terms such as base-pairing and stacking interactions, tertiary contacts, electrostatic interactions and entropy all affect nucleic acid folding and structure. Here we review general computational approaches developed to model nucleic acid systems. We focus on four key areas of nucleic acid modeling: molecular representation, potential energy function, degrees of freedom and sampling algorithm. Appropriate choices in each of these key areas in nucleic acid modeling can effectively combine to aid interpretation of experimental data and facilitate prediction of nucleic acid structure.
Collapse
Affiliation(s)
- Adelene Y L Sim
- Department of Applied Physics, Stanford University, Stanford, CA 94305, USA
| | | | | |
Collapse
|
43
|
Ding F, Lavender CA, Weeks KM, Dokholyan NV. Three-dimensional RNA structure refinement by hydroxyl radical probing. Nat Methods 2012; 9:603-8. [PMID: 22504587 PMCID: PMC3422565 DOI: 10.1038/nmeth.1976] [Citation(s) in RCA: 69] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2012] [Accepted: 03/20/2012] [Indexed: 01/08/2023]
Abstract
Molecular modeling guided by experimentally-derived structural information is an attractive approach for three-dimensional structure determination of complex RNAs that are not amenable to study by high-resolution methods. Hydroxyl radical probing (HRP), performed routinely in many laboratories, provides a measure of solvent accessibility at individual nucleotides. HRP measurements have, to date, only been used to evaluate RNA models qualitatively. Here, we report development of a quantitative structure refinement approach using HRP measurements to drive discrete molecular dynamics simulations for RNAs ranging in size from 80 to 230 nucleotides. HRP reactivities were first used to identify RNAs that form extensive helical packing interactions. For these RNAs, we achieved highly significant structure predictions, given inputs of RNA sequence and base pairing. This HRP-directed tertiary structure refinement approach generates robust structural hypotheses useful for guiding explorations of structure-function interrelationships in RNA.
Collapse
Affiliation(s)
- Feng Ding
- Department of Biochemistry and Biophysics, University of North Carolina, USA
| | | | | | | |
Collapse
|
44
|
Cao S, Chen SJ. Statistical mechanical modeling of RNA folding: from free energy landscape to tertiary structural prediction. NUCLEIC ACIDS AND MOLECULAR BIOLOGY 2012; 27:185-212. [PMID: 27293312 DOI: 10.1007/978-3-642-25740-7_10] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
In spite of the success of computational methods for predicting RNA secondary structure, the problem of predicting RNA tertiary structure folding remains. Low-resolution structural models show promise as they allow for rigorous statistical mechanical computation for the conformational entropies, free energies, and the coarse-grained structures of tertiary folds. Molecular dynamics refinement of coarse-grained structures leads to all-atom 3D structures. Modeling based on statistical mechanics principles also has the unique advantage of predicting the full free energy landscape, including local minima and the global free energy minimum. The energy landscapes combined with the 3D structures form the basis for quantitative predictions of RNA functions. In this chapter, we present an overview of statistical mechanical models for RNA folding and then focus on a recently developed RNA statistical mechanical model -- the Vfold model. The main emphasis is placed on the physics underpinning the models, the computational strategies, and the connections to RNA biology.
Collapse
Affiliation(s)
- Song Cao
- Department of Physics and Department of Biochemistry, University of Missouri, Columbia, MO 65211
| | - Shi-Jie Chen
- Department of Physics and Department of Biochemistry, University of Missouri, Columbia, MO 65211
| |
Collapse
|
45
|
Parisien M, Major F. Determining RNA three-dimensional structures using low-resolution data. J Struct Biol 2012; 179:252-60. [PMID: 22387042 DOI: 10.1016/j.jsb.2011.12.024] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2011] [Revised: 11/29/2011] [Accepted: 12/06/2011] [Indexed: 11/25/2022]
Abstract
Knowing the 3-D structure of an RNA is fundamental to understand its biological function. Nowadays X-ray crystallography and NMR spectroscopy are systematically applied to newly discovered RNAs. However, the application of these high-resolution techniques is not always possible, and thus scientists must turn to lower resolution alternatives. Here, we introduce a pipeline to systematically generate atomic resolution 3-D structures that are consistent with low-resolution data sets. We compare and evaluate the discriminative power of a number of low-resolution experimental techniques to reproduce the structure of the Escherichia coli tRNA(VAL) and P4-P6 domain of the Tetrahymena thermophila group I intron. We test single and combinations of the most accessible low-resolution techniques, i.e. hydroxyl radical footprinting (OH), methidiumpropyl-EDTA (MPE), multiplexed hydroxyl radical cleavage (MOHCA), and small-angle X-ray scattering (SAXS). We show that OH-derived constraints are accurate to discriminate structures at the atomic level, whereas EDTA-based constraints apply to global shape determination. We provide a guide for choosing which experimental techniques or combination of thereof is best in which context. The pipeline represents an important step towards high-throughput low-resolution RNA structure determination.
Collapse
Affiliation(s)
- Marc Parisien
- Biochemistry Department, The University of Chicago, 929 E. 57th Street, Chicago, IL 60637, USA
| | | |
Collapse
|
46
|
Quarta G, Sin K, Schlick T. Dynamic energy landscapes of riboswitches help interpret conformational rearrangements and function. PLoS Comput Biol 2012; 8:e1002368. [PMID: 22359488 PMCID: PMC3280964 DOI: 10.1371/journal.pcbi.1002368] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2011] [Accepted: 12/19/2011] [Indexed: 11/23/2022] Open
Abstract
Riboswitches are RNAs that modulate gene expression by ligand-induced conformational changes. However, the way in which sequence dictates alternative folding pathways of gene regulation remains unclear. In this study, we compute energy landscapes, which describe the accessible secondary structures for a range of sequence lengths, to analyze the transcriptional process as a given sequence elongates to full length. In line with experimental evidence, we find that most riboswitch landscapes can be characterized by three broad classes as a function of sequence length in terms of the distribution and barrier type of the conformational clusters: low-barrier landscape with an ensemble of different conformations in equilibrium before encountering a substrate; barrier-free landscape in which a direct, dominant “downhill” pathway to the minimum free energy structure is apparent; and a barrier-dominated landscape with two isolated conformational states, each associated with a different biological function. Sharing concepts with the “new view” of protein folding energy landscapes, we term the three sequence ranges above as the sensing, downhill folding, and functional windows, respectively. We find that these energy landscape patterns are conserved in various riboswitch classes, though the order of the windows may vary. In fact, the order of the three windows suggests either kinetic or thermodynamic control of ligand binding. These findings help understand riboswitch structure/function relationships and open new avenues to riboswitch design. Riboswitches are RNAs that modulate gene expression by ligand-induced conformational changes. However, the way that sequence dictates alternative folding pathways of gene regulation remains unclear. In this study, we mimic transcription by computing energy landscapes which describe accessible secondary structures for a range of sequence lengths. Consistent with experimental evidence, we find that most riboswitch landscapes can be characterized by three broad classes as a function of sequence length in terms of the distribution and barrier type of the conformational clusters: Low-barrier landscape with an ensemble of conformations in equilibrium before encountering a substrate; barrier-free landscape with a dominant “downhill” pathway to the minimum free energy structure; and barrier-dominated landscape with two isolated conformational states with different functions. Sharing concepts with the “new view” of protein folding energy landscapes, we term the three sequence ranges above as the sensing, downhill folding, and functional windows, respectively. We find that these energy landscape patterns are conserved between riboswitch classes, though the order of the windows may vary. In fact, the order of the three windows suggests either kinetic or thermodynamic control of ligand binding. These findings help understand riboswitch structure/function relationships and open new avenues to riboswitch design.
Collapse
Affiliation(s)
- Giulio Quarta
- Department of Chemistry, New York University, New York, New York, United States of America
- Howard Hughes Medical Institute - Medical Research Fellows Program, Chevy Chase, Maryland, United States of America
| | - Ken Sin
- Department of Chemistry, New York University, New York, New York, United States of America
| | - Tamar Schlick
- Department of Chemistry, New York University, New York, New York, United States of America
- Courant Institute of Mathematical Sciences, New York University, New York, New York, United States of America
- * E-mail:
| |
Collapse
|
47
|
Abstract
The function of RNA depends on its ability to adopt complex and dynamic structures, and the incorporation of site-specific cross-linking probes is a powerful method for providing distance constraints that are valuable in RNA structural biology. Here we describe a new RNA-RNA cross-linking strategy based on Pt(II) targeting of specific phosphorothioate substitutions. In this strategy cis-diammine Pt(II) complexes are kinetically recruited and anchored to a phosphorothioate substitution embedded within a structured RNA. Substitution of the remaining exchangeable Pt(II) ligand with a nucleophile supplied by a nearby RNA nucleobase results in metal-mediated cross-links that are stable during isolation. This type of cross-linking strategy was explored within the catalytic core of the Hammerhead ribozyme (HHRz). When a phosphorothioate substitution is installed at the scissile bond normally cleaved by the HHRz, Pt(II) cross-linking takes place to nucleotides G8 and G10 in the ribozyme active site. Both of these positions are predicted to be within ~8 Å of a phosphorothioate-bound Pt(II) metal center. Cross-linking depends on Mg(2+) ion concentration, reaching yields as high as 30%, with rates that indicate cation competition within the RNA three-helix junction. Cross-linking efficiency depends on accurate formation of the HHRz tertiary structure, and cross-links are not observed for RNA helices. Combined, these results show promise for using kinetically inert Pt(II) complexes as new site-specific cross-linking tools for exploring RNA structure and dynamics.
Collapse
Affiliation(s)
- Erich G. Chapman
- Department of Chemistry and Institute of Molecular Biology, University of Oregon, Eugene, Oregon 97403
| | - Victoria J. DeRose
- Department of Chemistry and Institute of Molecular Biology, University of Oregon, Eugene, Oregon 97403
| |
Collapse
|
48
|
Flores SC, Bernauer J, Shin S, Zhou R, Huang X. Multiscale modeling of macromolecular biosystems. Brief Bioinform 2012; 13:395-405. [DOI: 10.1093/bib/bbr077] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
49
|
Kladwang W, VanLang CC, Cordero P, Das R. A two-dimensional mutate-and-map strategy for non-coding RNA structure. Nat Chem 2011; 3:954-62. [PMID: 22109276 PMCID: PMC3725140 DOI: 10.1038/nchem.1176] [Citation(s) in RCA: 92] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2011] [Accepted: 09/15/2011] [Indexed: 12/24/2022]
Abstract
Non-coding RNAs fold into precise base-pairing patterns to carry out critical roles in genetic regulation and protein synthesis, but determining RNA structure remains difficult. Here, we show that coupling systematic mutagenesis with high-throughput chemical mapping enables accurate base-pair inference of domains from ribosomal RNA, ribozymes and riboswitches. For a six-RNA benchmark that has challenged previous chemical/computational methods, this 'mutate-and-map' strategy gives secondary structures that are in agreement with crystallography (helix error rates, 2%), including a blind test on a double-glycine riboswitch. Through modelling of partially ordered states, the method enables the first test of an interdomain helix-swap hypothesis for ligand-binding cooperativity in a glycine riboswitch. Finally, the data report on tertiary contacts within non-coding RNAs, and coupling to the Rosetta/FARFAR algorithm gives nucleotide-resolution three-dimensional models (helix root-mean-squared deviation, 5.7 Å) of an adenine riboswitch. These results establish a promising two-dimensional chemical strategy for inferring the secondary and tertiary structures that underlie non-coding RNA behaviour.
Collapse
Affiliation(s)
- Wipapat Kladwang
- Department of Biochemistry, Stanford University, Stanford, California 94305, USA
| | - Christopher C. VanLang
- Department of Chemical Engineering, Stanford University, Stanford, California 94305, USA
| | - Pablo Cordero
- Program in Biomedical Informatics, Stanford University, Stanford, California 94305, USA
| | - Rhiju Das
- Department of Biochemistry, Stanford University, Stanford, California 94305, USA
- Program in Biomedical Informatics, Stanford University, Stanford, California 94305, USA
- Department of Physics, Stanford University, Stanford, California 94305, USA
| |
Collapse
|
50
|
Computational methods for prediction of protein-RNA interactions. J Struct Biol 2011; 179:261-8. [PMID: 22019768 DOI: 10.1016/j.jsb.2011.10.001] [Citation(s) in RCA: 83] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2011] [Revised: 09/28/2011] [Accepted: 10/04/2011] [Indexed: 12/21/2022]
Abstract
Understanding the molecular mechanism of protein-RNA recognition and complex formation is a major challenge in structural biology. Unfortunately, the experimental determination of protein-RNA complexes by X-ray crystallography and nuclear magnetic resonance spectroscopy (NMR) is tedious and difficult. Alternatively, protein-RNA interactions can be predicted by computational methods. Although less accurate than experimental observations, computational predictions can be sufficiently accurate to prompt functional hypotheses and guide experiments, e.g. to identify individual amino acid or nucleotide residues. In this article we review 10 methods for predicting protein-RNA interactions, seven of which predict RNA-binding sites from protein sequences and three from structures. We also developed a meta-predictor that uses the output of top three sequence-based primary predictors to calculate a consensus prediction, which outperforms all the primary predictors. In order to fully cover the software for predicting protein-RNA interactions, we also describe five methods for protein-RNA docking. The article highlights the strengths and shortcomings of existing methods for the prediction of protein-RNA interactions and provides suggestions for their further development.
Collapse
|