1
|
Su JJ, Xu XL, Sun TT, Shen Y, Wang Y. Cotranscriptional folding of RNA pseudoknots with different rates. Chem Phys Lett 2021. [DOI: 10.1016/j.cplett.2021.138946] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
2
|
Zhao Q, Zhao Z, Fan X, Yuan Z, Mao Q, Yao Y. Review of machine learning methods for RNA secondary structure prediction. PLoS Comput Biol 2021; 17:e1009291. [PMID: 34437528 PMCID: PMC8389396 DOI: 10.1371/journal.pcbi.1009291] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/05/2022] Open
Abstract
Secondary structure plays an important role in determining the function of noncoding RNAs. Hence, identifying RNA secondary structures is of great value to research. Computational prediction is a mainstream approach for predicting RNA secondary structure. Unfortunately, even though new methods have been proposed over the past 40 years, the performance of computational prediction methods has stagnated in the last decade. Recently, with the increasing availability of RNA structure data, new methods based on machine learning (ML) technologies, especially deep learning, have alleviated the issue. In this review, we provide a comprehensive overview of RNA secondary structure prediction methods based on ML technologies and a tabularized summary of the most important methods in this field. The current pending challenges in the field of RNA secondary structure prediction and future trends are also discussed.
Collapse
Affiliation(s)
- Qi Zhao
- College of Medicine and Biological Information Engineering, Northeastern University, Shenyang, Liaoning, China
| | - Zheng Zhao
- School of Information Science and Technology, Dalian Maritime University, Dalian, Liaoning, China
| | - Xiaoya Fan
- School of Software, Key Laboratory for Ubiquitous Network and Service Software of Liaoning Province, Dalian University of Technology, Dalian, Liaoning, China
| | - Zhengwei Yuan
- Key Laboratory of Health Ministry for Congenital Malformation, Shengjing Hospital of China Medical University, Shenyang, Liaoning, China
| | - Qian Mao
- College of Light Industry, Liaoning University, Shenyang, Liaoning, China
- Key Laboratory of Agroproducts Processing Technology, Changchun University, Changchun, Jilin, China
| | - Yudong Yao
- Department of Electrical and Computer Engineering, Stevens Institute of Technology, Hoboken, New Jersey, United States of America
| |
Collapse
|
3
|
Perret G, Boschetti E. Aptamer-Based Affinity Chromatography for Protein Extraction and Purification. ADVANCES IN BIOCHEMICAL ENGINEERING/BIOTECHNOLOGY 2020; 174:93-139. [PMID: 31485702 DOI: 10.1007/10_2019_106] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Aptamers are oligonucleotide molecules able to recognize very specifically proteins. Among the possible applications, aptamers have been used for affinity chromatography with effective results and advantages over most advanced protein separation technologies. This chapter first discusses the context of the affinity chromatography with aptamer ligands. With the adaptation of SELEX, the chemical modifications of aptamers to comply with the covalent coupling and the separation process are then extensively presented. A focus is then made about the most important applications for protein separation with real-life examples and the comparison with immunoaffinity chromatography. In spite of well-advanced demonstrations and the extraordinary potential developments, a significant optimization work is still due to deserve large-scale applications with all necessary validations. Graphical Abstract Aptamer-protein complexes by X-ray crystallography.
Collapse
|
4
|
Maximova T, Plaku E, Shehu A. Structure-Guided Protein Transition Modeling with a Probabilistic Roadmap Algorithm. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 15:1783-1796. [PMID: 27411226 DOI: 10.1109/tcbb.2016.2586044] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Proteins are macromolecules in perpetual motion, switching between structural states to modulate their function. A detailed characterization of the precise yet complex relationship between protein structure, dynamics, and function requires elucidating transitions between functionally-relevant states. Doing so challenges both wet and dry laboratories, as protein dynamics involves disparate temporal scales. In this paper, we present a novel, sampling-based algorithm to compute transition paths. The algorithm exploits two main ideas. First, it leverages known structures to initialize its search and define a reduced conformation space for rapid sampling. This is key to address the insufficient sampling issue suffered by sampling-based algorithms. Second, the algorithm embeds samples in a nearest-neighbor graph where transition paths can be efficiently computed via queries. The algorithm adapts the probabilistic roadmap framework that is popular in robot motion planning. In addition to efficiently computing lowest-cost paths between any given structures, the algorithm allows investigating hypotheses regarding the order of experimentally-known structures in a transition event. This novel contribution is likely to open up new venues of research. Detailed analysis is presented on multiple-basin proteins of relevance to human disease. Multiscaling and the AMBER ff14SB force field are used to obtain energetically-credible paths at atomistic detail.
Collapse
|
5
|
Wang Y, Wang Z, Liu T, Gong S, Zhang W. Effects of flanking regions on HDV cotranscriptional folding kinetics. RNA (NEW YORK, N.Y.) 2018; 24:1229-1240. [PMID: 29954950 PMCID: PMC6097654 DOI: 10.1261/rna.065961.118] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/06/2018] [Accepted: 06/25/2018] [Indexed: 05/20/2023]
Abstract
Hepatitis delta virus (HDV) ribozyme performs the self-cleavage activity through folding to a double pseudoknot structure. The folding of functional RNA structures is often coupled with the transcription process. In this work, we developed a new approach for predicting the cotranscriptional folding kinetics of RNA secondary structures with pseudoknots. We theoretically studied the cotranscriptional folding behavior of the 99-nucleotide (nt) HDV sequence, two upstream flanking sequences, and one downstream flanking sequence. During transcription, the 99-nt HDV can effectively avoid the trap intermediates and quickly fold to the cleavage-active state. It is different from its refolding kinetics, which folds into an intermediate trap state. For all the sequences, the ribozyme regions (from 1 to 73) all fold to the same structure during transcription. However, the existence of the 30-nt upstream flanking sequence can inhibit the ribozyme region folding into the active native state through forming an alternative helix Alt1 with the segments 70-90. The longer upstream flanking sequence of 54 nt itself forms a stable hairpin structure, which sequesters the formation of the Alt1 helix and leads to rapid formation of the cleavage-active structure. Although the 55-nt downstream flanking sequence could invade the already folded active structure during transcription by forming a more stable helix with the ribozyme region, the slow transition rate could keep the structure in the cleavage-active structure to perform the activity.
Collapse
Affiliation(s)
- Yanli Wang
- Department of Physics, Wuhan University, Wuhan, Hubei 430072, P.R. China
| | - Zhen Wang
- Department of Physics, Wuhan University, Wuhan, Hubei 430072, P.R. China
| | - Taigang Liu
- Department of Physics, Wuhan University, Wuhan, Hubei 430072, P.R. China
| | - Sha Gong
- Department of Physics, Wuhan University, Wuhan, Hubei 430072, P.R. China
| | - Wenbing Zhang
- Department of Physics, Wuhan University, Wuhan, Hubei 430072, P.R. China
| |
Collapse
|
6
|
Sun TT, Zhao C, Chen SJ. Predicting Cotranscriptional Folding Kinetics For Riboswitch. J Phys Chem B 2018; 122:7484-7496. [PMID: 29985608 DOI: 10.1021/acs.jpcb.8b04249] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
On the basis of a helix-based transition rate model, we developed a new method for sampling cotranscriptional RNA conformational ensemble and the prediction of cotranscriptional folding kinetics. Applications to E. coli. SRP RNA and pbuE riboswitch indicate that the model may provide reliable predictions for the cotranscriptional folding pathways and population kinetics. For E. coli. SRP RNA, the predicted population kinetics and the folding pathway are consistent with the SHAPE profiles in the recent cotranscriptional SHAPE-seq experiments. For the pbuE riboswitch, the model predicts the transcriptional termination efficiency as a function of the force. The theoretical results show (a) a force-induced transition from the aptamer (antiterminator) to the terminator structure and (b) the different folding pathways for the riboswitch with and without the ligand (adenine). More specifically, without adenine, the aptamer structure emerges as a short-lived kinetic transient state instead of a thermodynamically stable intermediate state. Furthermore, from the predicted extension-time curves, the model identifies a series of conformational switches in the pulling process, where the predicted relative residence times for the different structures are in accordance with the experimental data. The model may provide a new tool for quantitative predictions of cotranscriptional folding kinetics, and results can offer useful insights into cotranscriptional folding-related RNA functions such as regulation of gene expression with riboswitches.
Collapse
Affiliation(s)
- Ting-Ting Sun
- Department of Physics , Zhejiang University of Science and Technology , Hangzhou 310023 , P. R. China.,Department of Physics, Department of Biochemistry, and University of Missouri Informatics Institute , University of Missouri , Columbia , Missouri 65211 , United States
| | - Chenhan Zhao
- Department of Physics, Department of Biochemistry, and University of Missouri Informatics Institute , University of Missouri , Columbia , Missouri 65211 , United States
| | - Shi-Jie Chen
- Department of Physics, Department of Biochemistry, and University of Missouri Informatics Institute , University of Missouri , Columbia , Missouri 65211 , United States
| |
Collapse
|
7
|
Fukunaga T, Hamada M. Computational approaches for alternative and transient secondary structures of ribonucleic acids. Brief Funct Genomics 2018; 18:182-191. [PMID: 30689706 DOI: 10.1093/bfgp/ely042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Transient and alternative structures of ribonucleic acids (RNAs) play essential roles in various regulatory processes, such as translation regulation in living cells. Because experimental analyses for RNA structures are difficult and time-consuming, computational approaches based on RNA secondary structures are promising. In this article, we review computational methods for detecting and analyzing transient/alternative secondary structures of RNAs, including static approaches based on probabilistic distributions of RNA secondary structures and dynamic approaches such as kinetic folding and folding pathway predictions.
Collapse
|
8
|
Efficient computation of co-transcriptional RNA-ligand interaction dynamics. Methods 2018; 143:70-76. [PMID: 29730250 DOI: 10.1016/j.ymeth.2018.04.036] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2017] [Revised: 04/26/2018] [Accepted: 04/29/2018] [Indexed: 11/23/2022] Open
Abstract
Riboswitches form an abundant class of cis-regulatory RNA elements that mediate gene expression by binding a small metabolite. For synthetic biology applications, they are becoming cheap and accessible systems for selectively triggering transcription or translation of downstream genes. Many riboswitches are kinetically controlled, hence knowledge of their co-transcriptional mechanisms is essential. We present here an efficient implementation for analyzing co-transcriptional RNA-ligand interaction dynamics. This approach allows for the first time to model concentration-dependent metabolite binding/unbinding kinetics. We exemplify this novel approach by means of the recently studied I-A 2'-deoxyguanosine (2'dG)-sensing riboswitch from Mesoplasma florum.
Collapse
|
9
|
Maximova T, Zhang Z, Carr DB, Plaku E, Shehu A. Sample-Based Models of Protein Energy Landscapes and Slow Structural Rearrangements. J Comput Biol 2018; 25:33-50. [DOI: 10.1089/cmb.2017.0158] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Affiliation(s)
- Tatiana Maximova
- Department of Computer Science, George Mason University, Fairfax, Virginia
| | - Zijing Zhang
- Department of Statistics, George Mason University, Fairfax, Virginia
| | - Daniel B. Carr
- Department of Statistics, George Mason University, Fairfax, Virginia
| | - Erion Plaku
- Department of Electrical Engineering and Computer Science, The Catholic University of America, Washington, D.C
| | - Amarda Shehu
- Department of Computer Science, George Mason University, Fairfax, Virginia
- Department of Bioengineering, George Mason University, Fairfax, Virginia
- School of Systems Biology, George Mason University, Manassas, Virginia
| |
Collapse
|
10
|
Clote P, Bayegan AH. RNA folding kinetics using Monte Carlo and Gillespie algorithms. J Math Biol 2017; 76:1195-1227. [PMID: 28780735 DOI: 10.1007/s00285-017-1169-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2016] [Revised: 07/09/2017] [Indexed: 11/26/2022]
Abstract
RNA secondary structure folding kinetics is known to be important for the biological function of certain processes, such as the hok/sok system in E. coli. Although linear algebra provides an exact computational solution of secondary structure folding kinetics with respect to the Turner energy model for tiny ([Formula: see text]20 nt) RNA sequences, the folding kinetics for larger sequences can only be approximated by binning structures into macrostates in a coarse-grained model, or by repeatedly simulating secondary structure folding with either the Monte Carlo algorithm or the Gillespie algorithm. Here we investigate the relation between the Monte Carlo algorithm and the Gillespie algorithm. We prove that asymptotically, the expected time for a K-step trajectory of the Monte Carlo algorithm is equal to [Formula: see text] times that of the Gillespie algorithm, where [Formula: see text] denotes the Boltzmann expected network degree. If the network is regular (i.e. every node has the same degree), then the mean first passage time (MFPT) computed by the Monte Carlo algorithm is equal to MFPT computed by the Gillespie algorithm multiplied by [Formula: see text]; however, this is not true for non-regular networks. In particular, RNA secondary structure folding kinetics, as computed by the Monte Carlo algorithm, is not equal to the folding kinetics, as computed by the Gillespie algorithm, although the mean first passage times are roughly correlated. Simulation software for RNA secondary structure folding according to the Monte Carlo and Gillespie algorithms is publicly available, as is our software to compute the expected degree of the network of secondary structures of a given RNA sequence-see http://bioinformatics.bc.edu/clote/RNAexpNumNbors .
Collapse
Affiliation(s)
- Peter Clote
- Department of Biology, Boston College, Chestnut Hill, MA, 02467, USA.
| | - Amir H Bayegan
- Department of Biology, Boston College, Chestnut Hill, MA, 02467, USA
| |
Collapse
|
11
|
Lorenz R, Wolfinger MT, Tanzer A, Hofacker IL. Predicting RNA secondary structures from sequence and probing data. Methods 2016; 103:86-98. [PMID: 27064083 DOI: 10.1016/j.ymeth.2016.04.004] [Citation(s) in RCA: 66] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2015] [Revised: 03/29/2016] [Accepted: 04/04/2016] [Indexed: 01/08/2023] Open
Abstract
RNA secondary structures have proven essential for understanding the regulatory functions performed by RNA such as microRNAs, bacterial small RNAs, or riboswitches. This success is in part due to the availability of efficient computational methods for predicting RNA secondary structures. Recent advances focus on dealing with the inherent uncertainty of prediction by considering the ensemble of possible structures rather than the single most stable one. Moreover, the advent of high-throughput structural probing has spurred the development of computational methods that incorporate such experimental data as auxiliary information.
Collapse
Affiliation(s)
- Ronny Lorenz
- University of Vienna, Faculty of Chemistry, Department of Theoretical Chemistry, Währingerstrasse 17, 1090 Vienna, Austria.
| | - Michael T Wolfinger
- University of Vienna, Faculty of Chemistry, Department of Theoretical Chemistry, Währingerstrasse 17, 1090 Vienna, Austria; Medical University of Vienna, Center for Anatomy and Cell Biology, Währingerstraße 13, 1090 Vienna, Austria.
| | - Andrea Tanzer
- University of Vienna, Faculty of Chemistry, Department of Theoretical Chemistry, Währingerstrasse 17, 1090 Vienna, Austria.
| | - Ivo L Hofacker
- University of Vienna, Faculty of Chemistry, Department of Theoretical Chemistry, Währingerstrasse 17, 1090 Vienna, Austria; University of Vienna, Faculty of Computer Science, Research Group Bioinformatics and Computational Biology, Währingerstr. 29, 1090 Vienna, Austria.
| |
Collapse
|
12
|
Abstract
SUMMARYEvidence is emerging that the role of protein structure in disease needs to be rethought. Sequence mutations in proteins are often found to affect the rate at which a protein switches between structures. Modeling structural transitions in wildtype and variant proteins is central to understanding the molecular basis of disease. This paper investigates an efficient algorithmic realization of the stochastic roadmap simulation framework to model structural transitions in wildtype and variants of proteins implicated in human disorders. Our results indicate that the algorithm is able to extract useful information on the impact of mutations on protein structure and function.
Collapse
|
13
|
Dykeman EC. An implementation of the Gillespie algorithm for RNA kinetics with logarithmic time update. Nucleic Acids Res 2015; 43:5708-15. [PMID: 25990741 PMCID: PMC4499123 DOI: 10.1093/nar/gkv480] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2014] [Accepted: 05/01/2015] [Indexed: 12/17/2022] Open
Abstract
In this paper I outline a fast method called KFOLD for implementing the Gillepie algorithm to stochastically sample the folding kinetics of an RNA molecule at single base-pair resolution. In the same fashion as the KINFOLD algorithm, which also uses the Gillespie algorithm to predict folding kinetics, KFOLD stochastically chooses a new RNA secondary structure state that is accessible from the current state by a single base-pair addition/deletion following the Gillespie procedure. However, unlike KINFOLD, the KFOLD algorithm utilizes the fact that many of the base-pair addition/deletion reactions and their corresponding rates do not change between each step in the algorithm. This allows KFOLD to achieve a substantial speed-up in the time required to compute a prediction of the folding pathway and, for a fixed number of base-pair moves, performs logarithmically with sequence size. This increase in speed opens up the possibility of studying the kinetics of much longer RNA sequences at single base-pair resolution while also allowing for the RNA folding statistics of smaller RNA sequences to be computed much more quickly.
Collapse
Affiliation(s)
- Eric C Dykeman
- York Centre for Complex Systems Analysis, Department of Mathematics and Biology University of York, Deramore Lane, York, YO10 5GE, UK
| |
Collapse
|
14
|
Górska A, Jasiński M, Trylska J. MINT: software to identify motifs and short-range interactions in trajectories of nucleic acids. Nucleic Acids Res 2015; 43:e114. [PMID: 26024667 PMCID: PMC4787793 DOI: 10.1093/nar/gkv559] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2015] [Accepted: 05/15/2015] [Indexed: 12/18/2022] Open
Abstract
Structural biology experiments and structure prediction tools have provided many
high-resolution three-dimensional structures of nucleic acids. Also, molecular
dynamics force field parameters have been adapted to simulating charged and flexible
nucleic acid structures on microsecond time scales. Therefore, we can generate the
dynamics of DNA or RNA molecules, but we still lack adequate tools for the analysis
of the resulting huge amounts of data. We present MINT (Motif
Identifier for Nucleic acids Trajectory) — an automatic tool for analyzing
three-dimensional structures of RNA and DNA, and their full-atom molecular dynamics
trajectories or other conformation sets (e.g. X-ray or nuclear magnetic
resonance-derived structures). For each RNA or DNA conformation
MINT determines the hydrogen bonding network resolving the
base pairing patterns, identifies secondary structure motifs (helices, junctions,
loops, etc.) and pseudoknots. MINT also estimates the energy
of stacking and phosphate anion-base interactions. For many conformations, as in a
molecular dynamics trajectory, MINT provides averages of the
above structural and energetic features and their evolution. We show
MINT functionality based on all-atom explicit solvent
molecular dynamics trajectory of the 30S ribosomal subunit.
Collapse
Affiliation(s)
- Anna Górska
- Centre of New Technologies, University of Warsaw, Banacha 2c, 02-097 Warsaw, Poland Master studies at the Faculty of Mathematics, Informatics, and Mechanics, University of Warsaw, Banacha 2, Warsaw, Poland
| | - Maciej Jasiński
- Centre of New Technologies, University of Warsaw, Banacha 2c, 02-097 Warsaw, Poland College of Inter-Faculty Individual Studies in Mathematics and Natural Sciences, University of Warsaw, Al. Żwirki i Wigury 93, 02-089 Warsaw, Poland
| | - Joanna Trylska
- Centre of New Technologies, University of Warsaw, Banacha 2c, 02-097 Warsaw, Poland
| |
Collapse
|
15
|
Abstract
In this article, we introduce the software suite Hermes, which provides fast, novel algorithms for RNA secondary structure kinetics. Using the fast Fourier transform to efficiently compute the Boltzmann probability that a secondary structure S of a given RNA sequence has base pair distance x (resp. y) from reference structure A (resp. B), Hermes computes the exact kinetics of folding from A to B in this coarse-grained model. In particular, Hermes computes the mean first passage time from the transition probability matrix by using matrix inversion, and also computes the equilibrium time from the rate matrix by using spectral decomposition. Due to the model granularity and the speed of Hermes, it is capable of determining secondary structure refolding kinetics for large RNA sequences, beyond the range of other methods. Comparative benchmarking of Hermes with other methods indicates that Hermes provides refolding kinetics of accuracy suitable for use in the computational design of RNA, an important area of synthetic biology. Source code and documentation for Hermes are available.
Collapse
Affiliation(s)
- Evan Senter
- Department of Biology, Boston College , Chestnut Hill, Massachusetts
| | | |
Collapse
|
16
|
RNA folding: structure prediction, folding kinetics and ion electrostatics. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2015; 827:143-83. [PMID: 25387965 DOI: 10.1007/978-94-017-9245-5_11] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Beyond the "traditional" functions such as gene storage, transport and protein synthesis, recent discoveries reveal that RNAs have important "new" biological functions including the RNA silence and gene regulation of riboswitch. Such functions of noncoding RNAs are strongly coupled to the RNA structures and proper structure change, which naturally leads to the RNA folding problem including structure prediction and folding kinetics. Due to the polyanionic nature of RNAs, RNA folding structure, stability and kinetics are strongly coupled to the ion condition of solution. The main focus of this chapter is to review the recent progress in the three major aspects in RNA folding problem: structure prediction, folding kinetics and ion electrostatics. This chapter will introduce both the recent experimental and theoretical progress, while emphasize the theoretical modelling on the three aspects in RNA folding.
Collapse
|
17
|
Mann M, Kucharík M, Flamm C, Wolfinger MT. Memory-efficient RNA energy landscape exploration. Bioinformatics 2014; 30:2584-91. [PMID: 24833804 PMCID: PMC4155248 DOI: 10.1093/bioinformatics/btu337] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2014] [Revised: 04/25/2014] [Accepted: 05/08/2014] [Indexed: 02/01/2023] Open
Abstract
MOTIVATION Energy landscapes provide a valuable means for studying the folding dynamics of short RNA molecules in detail by modeling all possible structures and their transitions. Higher abstraction levels based on a macro-state decomposition of the landscape enable the study of larger systems; however, they are still restricted by huge memory requirements of exact approaches. RESULTS We present a highly parallelizable local enumeration scheme that enables the computation of exact macro-state transition models with highly reduced memory requirements. The approach is evaluated on RNA secondary structure landscapes using a gradient basin definition for macro-states. Furthermore, we demonstrate the need for exact transition models by comparing two barrier-based approaches, and perform a detailed investigation of gradient basins in RNA energy landscapes. AVAILABILITY AND IMPLEMENTATION Source code is part of the C++ Energy Landscape Library available at http://www.bioinf.uni-freiburg.de/Software/.
Collapse
Affiliation(s)
- Martin Mann
- Bioinformatics Group, Department of Computer Science, University of Freiburg, 79110 Freiburg, Germany, Institute for Theoretical Chemistry, University of Vienna, 1090 Vienna, Center for Integrative Bioinformatics Vienna, Max F. Perutz Laboratories, University of Vienna, Medical University of Vienna, and Department of Biochemistry and Molecular Cell Biology, Max F. Perutz Laboratories, University of Vienna, A-1030 Vienna, Austria
| | - Marcel Kucharík
- Bioinformatics Group, Department of Computer Science, University of Freiburg, 79110 Freiburg, Germany, Institute for Theoretical Chemistry, University of Vienna, 1090 Vienna, Center for Integrative Bioinformatics Vienna, Max F. Perutz Laboratories, University of Vienna, Medical University of Vienna, and Department of Biochemistry and Molecular Cell Biology, Max F. Perutz Laboratories, University of Vienna, A-1030 Vienna, Austria
| | - Christoph Flamm
- Bioinformatics Group, Department of Computer Science, University of Freiburg, 79110 Freiburg, Germany, Institute for Theoretical Chemistry, University of Vienna, 1090 Vienna, Center for Integrative Bioinformatics Vienna, Max F. Perutz Laboratories, University of Vienna, Medical University of Vienna, and Department of Biochemistry and Molecular Cell Biology, Max F. Perutz Laboratories, University of Vienna, A-1030 Vienna, Austria
| | - Michael T Wolfinger
- Bioinformatics Group, Department of Computer Science, University of Freiburg, 79110 Freiburg, Germany, Institute for Theoretical Chemistry, University of Vienna, 1090 Vienna, Center for Integrative Bioinformatics Vienna, Max F. Perutz Laboratories, University of Vienna, Medical University of Vienna, and Department of Biochemistry and Molecular Cell Biology, Max F. Perutz Laboratories, University of Vienna, A-1030 Vienna, Austria Bioinformatics Group, Department of Computer Science, University of Freiburg, 79110 Freiburg, Germany, Institute for Theoretical Chemistry, University of Vienna, 1090 Vienna, Center for Integrative Bioinformatics Vienna, Max F. Perutz Laboratories, University of Vienna, Medical University of Vienna, and Department of Biochemistry and Molecular Cell Biology, Max F. Perutz Laboratories, University of Vienna, A-1030 Vienna, Austria Bioinformatics Group, Department of Computer Science, University of Freiburg, 79110 Freiburg, Germany, Institute for Theoretical Chemistry, University of Vienna, 1090 Vienna, Center for Integrative Bioinformatics Vienna, Max F. Perutz Laboratories, University of Vienna, Medical University of Vienna, and Department of Biochemistry and Molecular Cell Biology, Max F. Perutz Laboratories, University of Vienna, A-1030 Vienna, Austria
| |
Collapse
|
18
|
Kucharík M, Hofacker IL, Stadler PF, Qin J. Basin Hopping Graph: a computational framework to characterize RNA folding landscapes. ACTA ACUST UNITED AC 2014; 30:2009-17. [PMID: 24648041 DOI: 10.1093/bioinformatics/btu156] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
MOTIVATION RNA folding is a complicated kinetic process. The minimum free energy structure provides only a static view of the most stable conformational state of the system. It is insufficient to give detailed insights into the dynamic behavior of RNAs. A sufficiently sophisticated analysis of the folding free energy landscape, however, can provide the relevant information. RESULTS We introduce the Basin Hopping Graph (BHG) as a novel coarse-grained model of folding landscapes. Each vertex of the BHG is a local minimum, which represents the corresponding basin in the landscape. Its edges connect basins when the direct transitions between them are 'energetically favorable'. Edge weights endcode the corresponding saddle heights and thus measure the difficulties of these favorable transitions. BHGs can be approximated accurately and efficiently for RNA molecules well beyond the length range accessible to enumerative algorithms. AVAILABILITY AND IMPLEMENTATION The algorithms described here are implemented in C++ as standalone programs. Its source code and supplemental material can be freely downloaded from http://www.tbi.univie.ac.at/bhg.html.
Collapse
Affiliation(s)
- Marcel Kucharík
- Institute for Theoretical Chemistry and Research group BCB, Faculty of Computer Science, University of Vienna, Währinger Straße 17, 1090 Vienna, Austria, Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark, Department of Computer Science & IZBI & iDiv & LIFE, Härtelstraße 16-18, D-04107 University of Leipzig, Max Planck Institute for Mathematics in the Sciences and Fraunhofer Institute IZI, Leipzig, Germany, Santa Fe Institute, Santa Fe, NM 87501, USA and Department of Mathematics and Computer Science, University Of Southern Denmark, Odense, Denmark
| | - Ivo L Hofacker
- Institute for Theoretical Chemistry and Research group BCB, Faculty of Computer Science, University of Vienna, Währinger Straße 17, 1090 Vienna, Austria, Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark, Department of Computer Science & IZBI & iDiv & LIFE, Härtelstraße 16-18, D-04107 University of Leipzig, Max Planck Institute for Mathematics in the Sciences and Fraunhofer Institute IZI, Leipzig, Germany, Santa Fe Institute, Santa Fe, NM 87501, USA and Department of Mathematics and Computer Science, University Of Southern Denmark, Odense, DenmarkInstitute for Theoretical Chemistry and Research group BCB, Faculty of Computer Science, University of Vienna, Währinger Straße 17, 1090 Vienna, Austria, Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark, Department of Computer Science & IZBI & iDiv & LIFE, Härtelstraße 16-18, D-04107 University of Leipzig, Max Planck Institute for Mathematics in the Sciences and Fraunhofer Institute IZI, Leipzig, Germany, Santa Fe Institute, Santa Fe, NM 87501, USA and Department of Mathematics and Computer Science, University Of Southern Denmark, Odense, DenmarkInstitute for Theoretical Chemistry and Research group BCB, Faculty of Computer Science, University of Vienna, Währinger Straße 17, 1090 Vienna, Austria, Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark, Department of Computer Science & IZBI & iDiv & LIFE, Härtelstraße 16-18, D-04107 University of Leipzig, Max Planck Institute for Mathematics in the Sciences and Fraunhofer Institute IZI, Leipzig, Germany, Santa Fe Institute, Santa Fe, NM 87501, USA and Department of Mathematics and Computer Science, University Of Southern Denmark, Odense, Denmark
| | - Peter F Stadler
- Institute for Theoretical Chemistry and Research group BCB, Faculty of Computer Science, University of Vienna, Währinger Straße 17, 1090 Vienna, Austria, Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark, Department of Computer Science & IZBI & iDiv & LIFE, Härtelstraße 16-18, D-04107 University of Leipzig, Max Planck Institute for Mathematics in the Sciences and Fraunhofer Institute IZI, Leipzig, Germany, Santa Fe Institute, Santa Fe, NM 87501, USA and Department of Mathematics and Computer Science, University Of Southern Denmark, Odense, DenmarkInstitute for Theoretical Chemistry and Research group BCB, Faculty of Computer Science, University of Vienna, Währinger Straße 17, 1090 Vienna, Austria, Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark, Department of Computer Science & IZBI & iDiv & LIFE, Härtelstraße 16-18, D-04107 University of Leipzig, Max Planck Institute for Mathematics in the Sciences and Fraunhofer Institute IZI, Leipzig, Germany, Santa Fe Institute, Santa Fe, NM 87501, USA and Department of Mathematics and Computer Science, University Of Southern Denmark, Odense, DenmarkInstitute for Theoretical Chemistry and Research group BCB, Faculty of Computer Science, University of Vienna, Währinger Straße 17, 1090 Vienna, Austria, Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark, Department of Computer Science & IZBI & iDiv & LIFE, Härtelstraße 16-18, D-04107 University of Leipzig, Max Planck Institute for Mathematics in the Sciences and Fraunhofer Institute IZI, Leipzig, Germany, Santa Fe Institute, Santa Fe, NM 87501, USA and Department of Mathematics and Computer Science, University Of Southern Denmark, Odense, DenmarkInstitute for Theoretical Chemistry and Research group BCB, Faculty of Computer Science, Univer
| | - Jing Qin
- Institute for Theoretical Chemistry and Research group BCB, Faculty of Computer Science, University of Vienna, Währinger Straße 17, 1090 Vienna, Austria, Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark, Department of Computer Science & IZBI & iDiv & LIFE, Härtelstraße 16-18, D-04107 University of Leipzig, Max Planck Institute for Mathematics in the Sciences and Fraunhofer Institute IZI, Leipzig, Germany, Santa Fe Institute, Santa Fe, NM 87501, USA and Department of Mathematics and Computer Science, University Of Southern Denmark, Odense, DenmarkInstitute for Theoretical Chemistry and Research group BCB, Faculty of Computer Science, University of Vienna, Währinger Straße 17, 1090 Vienna, Austria, Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark, Department of Computer Science & IZBI & iDiv & LIFE, Härtelstraße 16-18, D-04107 University of Leipzig, Max Planck Institute for Mathematics in the Sciences and Fraunhofer Institute IZI, Leipzig, Germany, Santa Fe Institute, Santa Fe, NM 87501, USA and Department of Mathematics and Computer Science, University Of Southern Denmark, Odense, Denmark
| |
Collapse
|
19
|
Huang J, Voß B. Analysing RNA-kinetics based on folding space abstraction. BMC Bioinformatics 2014; 15:60. [PMID: 24575751 PMCID: PMC3974018 DOI: 10.1186/1471-2105-15-60] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2013] [Accepted: 02/24/2014] [Indexed: 11/23/2022] Open
Abstract
BACKGROUND RNA molecules, especially non-coding RNAs, play vital roles in the cell and their biological functions are mostly determined by structural properties. Often, these properties are related to dynamic changes in the structure, as in the case of riboswitches, and thus the analysis of RNA folding kinetics is crucial for their study. Exact approaches to kinetic folding are computationally expensive and, thus, limited to short sequences. In a previous study, we introduced a position-specific abstraction based on helices which we termed helix index shapes (hishapes) and a hishape-based algorithm for near-optimal folding pathway computation, called HiPath. The combination of these approaches provides an abstract view of the folding space that offers information about the global features. RESULTS In this paper we present HiKinetics, an algorithm that can predict RNA folding kinetics for sequences up to several hundred nucleotides long. This algorithm is based on RNAHeliCes, which decomposes the folding space into abstract classes, namely hishapes, and an improved version of HiPath, namely HiPath2, which estimates plausible folding pathways that connect these classes. Furthermore, we analyse the relationship of hishapes to locally optimal structures, the results of which strengthen the use of the hishape abstraction for studying folding kinetics. Finally, we show the application of HiKinetics to the folding kinetics of two well-studied RNAs. CONCLUSIONS HiKinetics can calculate kinetic folding based on a novel hishape decomposition. HiKinetics, together with HiPath2 and RNAHeliCes, is available for download at http://www.cyanolab.de/software/RNAHeliCes.htm.
Collapse
Affiliation(s)
- Jiabin Huang
- Genetics & Experimental Bioinformatics, Faculty of Biology, University of Freiburg, Schänzlestr. 1, 79104, Freiburg, Germany
| | - Björn Voß
- Genetics & Experimental Bioinformatics, Faculty of Biology, University of Freiburg, Schänzlestr. 1, 79104, Freiburg, Germany
| |
Collapse
|
20
|
Senter E, Dotu I, Clote P. RNA folding pathways and kinetics using 2D energy landscapes. J Math Biol 2014; 70:173-96. [PMID: 24515409 DOI: 10.1007/s00285-014-0760-4] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2013] [Revised: 09/26/2013] [Indexed: 11/27/2022]
Abstract
RNA folding pathways play an important role in various biological processes, such as (i) the hok/sok (host-killing/suppression of killing) system in E. coli to check for sufficient plasmid copy number, (ii) the conformational switch in spliced leader (SL) RNA from Leptomonas collosoma, which controls trans splicing of a portion of the '5 exon, and (iii) riboswitches--portions of the 5' untranslated region of messenger RNA that regulate genes by allostery. Since RNA folding pathways are determined by the energy landscape, we describe a novel algorithm, FFTbor2D, which computes the 2D projection of the energy landscape for a given RNA sequence. Given two metastable secondary structures A, B for a given RNA sequence, FFTbor2D computes the Boltzmann probability p(x, y) = Z(x,y)/Z that a secondary structure has base pair distance x from A and distance y from B. Using polynomial interpolationwith the fast Fourier transform,we compute p(x, y) in O(n(5)) time and O(n(2)) space, which is an improvement over an earlier method, which runs in O(n(7)) time and O(n(4)) space. FFTbor2D has potential applications in synthetic biology, where one might wish to design bistable switches having target metastable structures A, B with favorable pathway kinetics. By inverting the transition probability matrix determined from FFTbor2D output, we show that L. collosoma spliced leader RNA has larger mean first passage time from A to B on the 2D energy landscape, than 97.145% of 20,000 sequences, each having metastable structures A, B. Source code and binaries are freely available for download at http://bioinformatics.bc.edu/clotelab/FFTbor2D. The program FFTbor2D is implemented in C++, with optional OpenMP parallelization primitives.
Collapse
Affiliation(s)
- Evan Senter
- Department of Biology, Boston College, Chestnut Hill, MA, 02467, USA
| | | | | |
Collapse
|
21
|
Chen J, Gong S, Wang Y, Zhang W. Kinetic partitioning mechanism of HDV ribozyme folding. J Chem Phys 2014; 140:025102. [DOI: 10.1063/1.4861037] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open
|
22
|
Abstract
Proteins are at the root of many biological functions, often performing complex tasks as the result of large changes in their structure. Describing the exact details of these conformational changes, however, remains a central challenge for computational biology due the enormous computational requirements of the problem. This has engendered the development of a rich variety of useful methods designed to answer specific questions at different levels of spatial, temporal, and energetic resolution. These methods fall largely into two classes: physically accurate, but computationally demanding methods and fast, approximate methods. We introduce here a new hybrid modeling tool, the Structured Intuitive Move Selector (sims), designed to bridge the divide between these two classes, while allowing the benefits of both to be seamlessly integrated into a single framework. This is achieved by applying a modern motion planning algorithm, borrowed from the field of robotics, in tandem with a well-established protein modeling library. sims can combine precise energy calculations with approximate or specialized conformational sampling routines to produce rapid, yet accurate, analysis of the large-scale conformational variability of protein systems. Several key advancements are shown, including the abstract use of generically defined moves (conformational sampling methods) and an expansive probabilistic conformational exploration. We present three example problems that sims is applied to and demonstrate a rapid solution for each. These include the automatic determination of “active” residues for the hinge-based system Cyanovirin-N, exploring conformational changes involving long-range coordinated motion between non-sequential residues in Ribose-Binding Protein, and the rapid discovery of a transient conformational state of Maltose-Binding Protein, previously only determined by Molecular Dynamics. For all cases we provide energetic validations using well-established energy fields, demonstrating this framework as a fast and accurate tool for the analysis of a wide range of protein flexibility problems.
Collapse
|
23
|
Kirkpatrick B, Hajiaghayi M, Condon A. A new model for approximating RNA folding trajectories and population kinetics. ACTA ACUST UNITED AC 2013. [DOI: 10.1088/1749-4699/6/1/014003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
|
24
|
Chen J, Zhang W. Kinetic analysis of the effects of target structure on siRNA efficiency. J Chem Phys 2012; 137:225102. [DOI: 10.1063/1.4769821] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
25
|
Washietl S, Will S, Hendrix DA, Goff LA, Rinn JL, Berger B, Kellis M. Computational analysis of noncoding RNAs. WILEY INTERDISCIPLINARY REVIEWS-RNA 2012; 3:759-78. [PMID: 22991327 DOI: 10.1002/wrna.1134] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Noncoding RNAs have emerged as important key players in the cell. Understanding their surprisingly diverse range of functions is challenging for experimental and computational biology. Here, we review computational methods to analyze noncoding RNAs. The topics covered include basic and advanced techniques to predict RNA structures, annotation of noncoding RNAs in genomic data, mining RNA-seq data for novel transcripts and prediction of transcript structures, computational aspects of microRNAs, and database resources.
Collapse
Affiliation(s)
- Stefan Washietl
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA.
| | | | | | | | | | | | | |
Collapse
|
26
|
Efficient procedures for the numerical simulation of mid-size RNA kinetics. Algorithms Mol Biol 2012; 7:24. [PMID: 22958879 PMCID: PMC3463434 DOI: 10.1186/1748-7188-7-24] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2009] [Accepted: 08/22/2012] [Indexed: 01/02/2023] Open
Abstract
Motivation Methods for simulating the kinetic folding of RNAs by numerically solving the chemical master equation have been developed since the late 90's, notably the programs Kinfold and Treekin with Barriers that are available in the Vienna RNA package. Our goal is to formulate extensions to the algorithms used, starting from the Gillespie algorithm, that will allow numerical simulations of mid-size (~ 60–150 nt) RNA kinetics in some practical cases where numerous distributions of folding times are desired. These extensions can contribute to analyses and predictions of RNA folding in biologically significant problems. Results By describing in a particular way the reduction of numerical simulations of RNA folding kinetics into the Gillespie stochastic simulation algorithm for chemical reactions, it is possible to formulate extensions to the basic algorithm that will exploit memoization and parallelism for efficient computations. These can be used to advance forward from the small examples demonstrated to larger examples of biological interest. Software The implementation that is described and used for the Gillespie algorithm is freely available by contacting the authors, noting that the efficient procedures suggested may also be applicable along with Vienna's Kinfold.
Collapse
|
27
|
Zhao P, Zhang W, Chen SJ. Cotranscriptional folding kinetics of ribonucleic acid secondary structures. J Chem Phys 2012; 135:245101. [PMID: 22225186 DOI: 10.1063/1.3671644] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
We develop a systematic helix-based computational method to predict RNA folding kinetics during transcription. In our method, the transcription is modeled as stepwise process, where each step is the transcription of a nucleotide. For each step, the kinetics algorithm predicts the population kinetics, transition pathways, folding intermediates, and the transcriptional folding products. The folding pathways, rate constants, and the conformational populations for cotranscription folding show contrastingly different features than the refolding kinetics for a fully transcribed chain. The competition between the transcription speed and rate constants for the transitions between the different nascent structures determines the RNA folding pathway and the end product of folding. For example, fast transcription favors the formation of branch-like structures than rod-like structures and chain elongation in the folding process may reduce the probability of the formation of misfolded structures. Furthermore, good theory-experiment agreements suggest that our method may provide a reliable tool for quantitative prediction for cotranscriptional RNA folding, including the kinetics for the population distribution for the whole conformational ensemble.
Collapse
Affiliation(s)
- Peinan Zhao
- Department of Physics, Wuhan University, Wuhan, People's Republic of China
| | | | | |
Collapse
|
28
|
Sahoo S, Albrecht AA. Approximating the set of local minima in partial RNA folding landscapes. ACTA ACUST UNITED AC 2011; 28:523-30. [PMID: 22210870 DOI: 10.1093/bioinformatics/btr715] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
MOTIVATION We study a stochastic method for approximating the set of local minima in partial RNA folding landscapes associated with a bounded-distance neighbourhood of folding conformations. The conformations are limited to RNA secondary structures without pseudoknots. The method aims at exploring partial energy landscapes pL induced by folding simulations and their underlying neighbourhood relations. It combines an approximation of the number of local optima devised by Garnier and Kallel (2002) with a run-time estimation for identifying sets of local optima established by Reeves and Eremeev (2004). RESULTS The method is tested on nine sequences of length between 50 nt and 400 nt, which allows us to compare the results with data generated by RNAsubopt and subsequent barrier tree calculations. On the nine sequences, the method captures on average 92% of local minima with settings designed for a target of 95%. The run-time of the heuristic can be estimated by O(n(2)Dνlnν), where n is the sequence length, ν is the number of local minima in the partial landscape pL under consideration and D is the maximum number of steepest descent steps in attraction basins associated with pL.
Collapse
Affiliation(s)
- S Sahoo
- Centre for Cancer Research and Cell Biology, Queen's University Belfast, Belfast BT9 7BL, UK
| | | |
Collapse
|
29
|
Efficient Traversal of Beta-Sheet Protein Folding Pathways Using Ensemble Models. J Comput Biol 2011; 18:1635-47. [DOI: 10.1089/cmb.2011.0176] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
|
30
|
Lorenz WA, Clote P. Computing the partition function for kinetically trapped RNA secondary structures. PLoS One 2011; 6:e16178. [PMID: 21297972 PMCID: PMC3030561 DOI: 10.1371/journal.pone.0016178] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2010] [Accepted: 12/15/2010] [Indexed: 12/17/2022] Open
Abstract
An RNA secondary structure is locally optimal if there is no lower energy structure that can be obtained by the addition or removal of a single base pair, where energy is defined according to the widely accepted Turner nearest neighbor model. Locally optimal structures form kinetic traps, since any evolution away from a locally optimal structure must involve energetically unfavorable folding steps. Here, we present a novel, efficient algorithm to compute the partition function over all locally optimal secondary structures of a given RNA sequence. Our software, RNAlocopt runs in time and space. Additionally, RNAlocopt samples a user-specified number of structures from the Boltzmann subensemble of all locally optimal structures. We apply RNAlocopt to show that (1) the number of locally optimal structures is far fewer than the total number of structures – indeed, the number of locally optimal structures approximately equal to the square root of the number of all structures, (2) the structural diversity of this subensemble may be either similar to or quite different from the structural diversity of the entire Boltzmann ensemble, a situation that depends on the type of input RNA, (3) the (modified) maximum expected accuracy structure, computed by taking into account base pairing frequencies of locally optimal structures, is a more accurate prediction of the native structure than other current thermodynamics-based methods. The software RNAlocopt constitutes a technical breakthrough in our study of the folding landscape for RNA secondary structures. For the first time, locally optimal structures (kinetic traps in the Turner energy model) can be rapidly generated for long RNA sequences, previously impossible with methods that involved exhaustive enumeration. Use of locally optimal structure leads to state-of-the-art secondary structure prediction, as benchmarked against methods involving the computation of minimum free energy and of maximum expected accuracy. Web server and source code available at http://bioinformatics.bc.edu/clotelab/RNAlocopt/.
Collapse
Affiliation(s)
- William A. Lorenz
- Department of Mathematics and Computer Science, Denison University, Granville, Ohio, United States of America
| | - Peter Clote
- Biology Department, Boston College, Chestnut Hill, Massachusetts, United States of America
- * E-mail:
| |
Collapse
|
31
|
Andronescu M, Condon A, Hoos HH, Mathews DH, Murphy KP. Computational approaches for RNA energy parameter estimation. RNA (NEW YORK, N.Y.) 2010; 16:2304-18. [PMID: 20940338 PMCID: PMC2995392 DOI: 10.1261/rna.1950510] [Citation(s) in RCA: 65] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/07/2023]
Abstract
Methods for efficient and accurate prediction of RNA structure are increasingly valuable, given the current rapid advances in understanding the diverse functions of RNA molecules in the cell. To enhance the accuracy of secondary structure predictions, we developed and refined optimization techniques for the estimation of energy parameters. We build on two previous approaches to RNA free-energy parameter estimation: (1) the Constraint Generation (CG) method, which iteratively generates constraints that enforce known structures to have energies lower than other structures for the same molecule; and (2) the Boltzmann Likelihood (BL) method, which infers a set of RNA free-energy parameters that maximize the conditional likelihood of a set of reference RNA structures. Here, we extend these approaches in two main ways: We propose (1) a max-margin extension of CG, and (2) a novel linear Gaussian Bayesian network that models feature relationships, which effectively makes use of sparse data by sharing statistical strength between parameters. We obtain significant improvements in the accuracy of RNA minimum free-energy pseudoknot-free secondary structure prediction when measured on a comprehensive set of 2518 RNA molecules with reference structures. Our parameters can be used in conjunction with software that predicts RNA secondary structures, RNA hybridization, or ensembles of structures. Our data, software, results, and parameter sets in various formats are freely available at http://www.cs.ubc.ca/labs/beta/Projects/RNA-Params.
Collapse
Affiliation(s)
- Mirela Andronescu
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA.
| | | | | | | | | |
Collapse
|
32
|
Portella G, Orozco M. Multiple Routes to Characterize the Folding of a Small DNA Hairpin. Angew Chem Int Ed Engl 2010. [DOI: 10.1002/ange.201003816] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
33
|
Portella G, Orozco M. Multiple Routes to Characterize the Folding of a Small DNA Hairpin. Angew Chem Int Ed Engl 2010; 49:7673-6. [DOI: 10.1002/anie.201003816] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
34
|
Hofacker IL, Flamm C, Heine C, Wolfinger MT, Scheuermann G, Stadler PF. BarMap: RNA folding on dynamic energy landscapes. RNA (NEW YORK, N.Y.) 2010; 16:1308-1316. [PMID: 20504954 PMCID: PMC2885680 DOI: 10.1261/rna.2093310] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/17/2010] [Accepted: 03/24/2010] [Indexed: 05/29/2023]
Abstract
Dynamical changes of RNA secondary structures play an important role in the function of many regulatory RNAs. Such kinetic effects, especially in time-variable and externally triggered systems, are usually investigated by means of extensive and expensive simulations of large sets of individual folding trajectories. Here we describe the theoretical foundations of a generic approach that not only allows the direct computation of approximate population densities but also reduces the efforts required to analyze the folding energy landscapes to a one-time preprocessing step. The basic idea is to consider the kinetics on individual landscapes and to model external triggers and environmental changes as small but discrete changes in the landscapes. A "barmap" links macrostates of temporally adjacent landscapes and defines the transfer of population densities from one "snapshot" to the next. Implemented in the BarMap software, this approach makes it feasible to study folding processes at the level of basins, saddle points, and barriers for many nonstationary scenarios, including temperature changes, cotranscriptional folding, refolding in consequence to degradation, and mechanically constrained kinetics, as in the case of the translocation of a polymer through a pore.
Collapse
Affiliation(s)
- Ivo L Hofacker
- Institute for Theoretical Chemistry, University of Vienna, 1090 Wien, Austria.
| | | | | | | | | | | |
Collapse
|
35
|
Ditzler MA, Otyepka M, Šponer J, Walter NG. Molecular dynamics and quantum mechanics of RNA: conformational and chemical change we can believe in. Acc Chem Res 2010; 43:40-7. [PMID: 19754142 PMCID: PMC2808146 DOI: 10.1021/ar900093g] [Citation(s) in RCA: 142] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
Structure and dynamics are both critical to RNA’s vital functions in biology. Numerous techniques can elucidate the structural dynamics of RNA, but computational approaches based on experimental data arguably hold the promise of providing the most detail. In this Account, we highlight areas wherein molecular dynamics (MD) and quantum mechanical (QM) techniques are applied to RNA, particularly in relation to complementary experimental studies.
We have expanded on atomic-resolution crystal structures of RNAs in functionally relevant states by applying explicit solvent MD simulations to explore their dynamics and conformational changes on the submicrosecond time scale. MD relies on simplified atomistic, pairwise additive interaction potentials (force fields). Because of limited sampling, due to the finite accessible simulation time scale and the approximated force field, high-quality starting structures are required. Despite their imperfection, we find that currently available force fields empower MD to provide meaningful and predictive information on RNA dynamics around a crystallographically defined energy minimum. The performance of force fields can be estimated by precise QM calculations on small model systems. Such calculations agree reasonably well with the Cornell et al. AMBER force field, particularly for stacking and hydrogen-bonding interactions. A final verification of any force field is accomplished by simulations of complex nucleic acid structures. The performance of the Cornell et al. AMBER force field generally corresponds well with and augments experimental data, but one notable exception could be the capping loops of double-helical stems. In addition, the performance of pairwise additive force fields is obviously unsatisfactory for inclusion of divalent cations, because their interactions lead to major polarization and charge-transfer effects neglected by the force field. Neglect of polarization also limits, albeit to a lesser extent, the description accuracy of other contributions, such as interactions with monovalent ions, conformational flexibility of the anionic sugar−phosphate backbone, hydrogen bonding, and solute polarization by solvent. Still, despite limitations, MD simulations are a valid tool for analyzing the structural dynamics of existing experimental structures. Careful analysis of MD simulations can identify problematic aspects of an experimental RNA structure, unveil structural characteristics masked by experimental constraints, reveal functionally significant stochastic fluctuations, evaluate the structural role of base ionization, and predict structurally and potentially functionally important details of the solvent behavior, including the presence of tightly bound water molecules. Moreover, combining classical MD simulations with QM calculations in hybrid QM/MM approaches helps in the assessment of the plausibility of chemical mechanisms of catalytic RNAs (ribozymes). In contrast, the reliable prediction of structure from sequence information is beyond the applicability of MD tools. The ultimate utility of computational studies in understanding RNA function thus requires that the results are neither blindly accepted nor flatly rejected, but rather considered in the context of all available experimental data, with great care given to assessing limitations through the available starting structures, force field approximations, and sampling limitations. The examples given in this Account showcase how the judicious use of basic MD simulations has already served as a powerful tool to help evaluate the role of structural dynamics in biological function of RNA.
Collapse
Affiliation(s)
- Mark A. Ditzler
- Biophysics, University of Michigan, 930 North University Avenue, Ann Arbor, Michigan 48109-1055
- Department of Chemistry, Single Molecule Analysis Group, University of Michigan, 930 North University Avenue, Ann Arbor, Michigan 48109-1055
| | - Michal Otyepka
- Department of Physical Chemistry, Faculty of Science, Palacky University Olomouc, tr. Svobody 26, 771 46 Olomouc, Czech Republic
- Institute of Biophysics, Academy of Sciences of the Czech Republic, Kralovopolska 135, 612 65 Brno, Czech Republic
| | - Jiřì Šponer
- Institute of Biophysics, Academy of Sciences of the Czech Republic, Kralovopolska 135, 612 65 Brno, Czech Republic
| | - Nils G. Walter
- Department of Chemistry, Single Molecule Analysis Group, University of Michigan, 930 North University Avenue, Ann Arbor, Michigan 48109-1055
| |
Collapse
|
36
|
Gillespie J, Mayne M, Jiang M. RNA folding on the 3D triangular lattice. BMC Bioinformatics 2009; 10:369. [PMID: 19891777 PMCID: PMC2780420 DOI: 10.1186/1471-2105-10-369] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2009] [Accepted: 11/05/2009] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Difficult problems in structural bioinformatics are often studied in simple exact models to gain insights and to derive general principles. Protein folding, for example, has long been studied in the lattice model. Recently, researchers have also begun to apply the lattice model to the study of RNA folding. RESULTS We present a novel method for predicting RNA secondary structures with pseudoknots: first simulate the folding dynamics of the RNA sequence on the 3D triangular lattice, next extract and select a set of disjoint base pairs from the best lattice conformation found by the folding simulation. Experiments on sequences from PseudoBase show that our prediction method outperforms the HotKnot algorithm of Ren, Rastegari, Condon and Hoos, a leading method for RNA pseudoknot prediction. Our method for RNA secondary structure prediction can be adapted into an efficient reconstruction method that, given an RNA sequence and an associated secondary structure, finds a conformation of the sequence on the 3D triangular lattice that realizes the base pairs in the secondary structure. We implemented a suite of computer programs for the simulation and visualization of RNA folding on the 3D triangular lattice. These programs come with detailed documentation and are accessible from the companion website of this paper at http://www.cs.usu.edu/~mjiang/rna/DeltaIS/. CONCLUSION Folding simulation on the 3D triangular lattice is effective method for RNA secondary structure prediction and lattice conformation reconstruction. The visualization software for the lattice conformations of RNA structures is a valuable tool for the study of RNA folding and is a great pedagogic device.
Collapse
Affiliation(s)
- Joel Gillespie
- Department of Computer Science, Utah State University, Logan, Utah 84322-4205, USA.
| | | | | |
Collapse
|
37
|
Analysis of riboswitch structure and function by an energy landscape framework. J Mol Biol 2009; 393:993-1003. [PMID: 19733179 DOI: 10.1016/j.jmb.2009.08.062] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2009] [Revised: 08/15/2009] [Accepted: 08/27/2009] [Indexed: 01/15/2023]
Abstract
The thiamine pyrophosphate (TPP) riboswitch employs modular domains for binding TPP to form a platform for gene expression regulation. Specifically, TPP binding triggers a conformational switch in the RNA from a transcriptionally active "on" state to an inactive "off" state that concomitantly causes the formation of a terminator hairpin and halting of transcription. Here, clustering analysis of energy landscapes at different nucleotide lengths suggests a novel computational tool for analysis of the mechanics of transcription elongation in the presence or absence of the ligand. Namely, we suggest that the riboswitch's kinetics are tightly governed by a length-dependent switch, whereby the energy landscape has two clusters available during transcription elongation and where TPP's binding shifts the preference to one form. Significantly, the biologically active and inactive structures determined experimentally matched well the structures predominant in each computational set. These clustering/structural analyses combined with modular computational design suggest design principles that exploit the above features to analyze as well as create new functions and structures of RNA systems.
Collapse
|
38
|
Cao S, Chen SJ. A new computational approach for mechanical folding kinetics of RNA hairpins. Biophys J 2009; 96:4024-34. [PMID: 19450474 DOI: 10.1016/j.bpj.2009.02.044] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2008] [Revised: 01/30/2009] [Accepted: 02/20/2009] [Indexed: 01/03/2023] Open
Abstract
Based on an ensemble of kinetically accessible conformations, we propose a new analytical model for RNA folding kinetics. The model gives populational kinetics, kinetic rates, transition states, and pathways from the rate matrix. Applications of the new kinetic model to mechanical folding of RNA hairpins such as trans-activation-responsive RNA reveal distinct kinetic behaviors in different force regimes, from zero force to forces much stronger than the critical force for the folding-unfolding transition. In the absence of force or a low force, folding can be initiated (nucleated) at any position by forming the first base stack and there exist many pathways for the folding process. In contrast, for a higher force, the folding/unfolding would predominantly proceed along a single zipping/unzipping pathway. Studies for different hairpin-forming sequences indicate that depending on the nucleotide sequence, a kinetic intermediate can emerge in the low force regime but disappear in high force regime, and a new kinetic intermediate, which is absent in the low and high force regimes, can emerge in the medium force range. Variations of the force lead to changes in folding cooperativity and rate-limiting steps. The predicted network of pathways for trans-activation-responsive RNA suggests two parallel dominant pathways. The rate-limiting folding steps (at f = 8 pN) are the formation of specific basepairs that are 2-4 basepairs away from the loop. At a higher force (f = 11 pN), the folding rate is controlled by the formation of the bulge loop. The predicted rates and transition states are in good agreement with the experimental data for a broad force regime.
Collapse
Affiliation(s)
- Song Cao
- Department of Physics and Astronomy, University of Missouri, Columbia, Missouri, USA
| | | |
Collapse
|