1
|
Bahai A, Kwoh CK, Mu Y, Li Y. Systematic benchmarking of deep-learning methods for tertiary RNA structure prediction. PLoS Comput Biol 2024; 20:e1012715. [PMID: 39775239 PMCID: PMC11723642 DOI: 10.1371/journal.pcbi.1012715] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2024] [Revised: 01/10/2025] [Accepted: 12/10/2024] [Indexed: 01/11/2025] Open
Abstract
The 3D structure of RNA critically influences its functionality, and understanding this structure is vital for deciphering RNA biology. Experimental methods for determining RNA structures are labour-intensive, expensive, and time-consuming. Computational approaches have emerged as valuable tools, leveraging physics-based-principles and machine learning to predict RNA structures rapidly. Despite advancements, the accuracy of computational methods remains modest, especially when compared to protein structure prediction. Deep learning methods, while successful in protein structure prediction, have shown some promise for RNA structure prediction as well, but face unique challenges. This study systematically benchmarks state-of-the-art deep learning methods for RNA structure prediction across diverse datasets. Our aim is to identify factors influencing performance variation, such as RNA family diversity, sequence length, RNA type, multiple sequence alignment (MSA) quality, and deep learning model architecture. We show that generally ML-based methods perform much better than non-ML methods on most RNA targets, although the performance difference isn't substantial when working with unseen novel or synthetic RNAs. The quality of the MSA and secondary structure prediction both play an important role and most methods aren't able to predict non-Watson-Crick pairs in the RNAs. Overall among the automated 3D RNA structure prediction methods, DeepFoldRNA has the best prediction results followed by DRFold as the second best method. Finally, we also suggest possible mitigations to improve the quality of the prediction for future method development.
Collapse
Affiliation(s)
- Akash Bahai
- School of Biological Sciences (SBS), Nanyang Technological University, Singapore, Singapore
| | - Chee Keong Kwoh
- School of Computer Science and Engineering, Nanyang Technological University, Singapore, Singapore
| | - Yuguang Mu
- School of Biological Sciences (SBS), Nanyang Technological University, Singapore, Singapore
| | - Yinghui Li
- School of Biological Sciences (SBS), Nanyang Technological University, Singapore, Singapore
| |
Collapse
|
2
|
Bukina V, Božič A. Context-dependent structure formation of hairpin motifs in bacteriophage MS2 genomic RNA. Biophys J 2024; 123:3397-3407. [PMID: 39118324 PMCID: PMC11480767 DOI: 10.1016/j.bpj.2024.08.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2024] [Revised: 07/17/2024] [Accepted: 08/05/2024] [Indexed: 08/10/2024] Open
Abstract
Many functions of ribonucleic acid (RNA) rely on its ability to assume specific sequence-structure motifs. Packaging signals found in certain RNA viruses are one such prominent example of functional RNA motifs. These signals are short hairpin loops that interact with coat proteins and drive viral self-assembly. As they are found in different positions along the much longer genomic RNA, the formation of their correct structure occurs as a part of a larger context. Any changes to this context can consequently lead to changes in the structure of the motifs themselves. In fact, previous studies have shown that structure and function of RNA motifs can be highly context sensitive to the flanking sequence surrounding them. However, in what ways different flanking sequences influence the structure of an RNA motif they surround has yet to be studied in detail. We focus on a hairpin-rich region of the RNA genome of bacteriophage MS2-a well-studied RNA virus with a wide potential for use in biotechnology-and systematically examine context-dependent structural stability of 14 previously identified hairpin motifs, which include putative and confirmed packaging signals. Combining secondary and tertiary RNA structure prediction of the hairpin motifs placed in different contexts, ranging from the native genomic sequence to random RNA sequences and unstructured poly-U sequences, we determine different measures of motif structural stability. In this way, we show that while some motif structures can be stable in any context, others require specific context provided by the genome. Our results demonstrate the importance of context in RNA structure formation and how changes in the flanking sequence of an RNA motif sometimes lead to drastic changes in its structure. Structural stability of a motif in different contexts could provide additional insights into its functionality as well as assist in determining whether it remains functional when intentionally placed in other contexts.
Collapse
Affiliation(s)
- Veronika Bukina
- Department of Theoretical Physics, Jožef Stefan Institute, Ljubljana, Slovenia; Department of Physics, Faculty of Mathematics and Physics, University of Ljubljana, Ljubljana, Slovenia
| | - Anže Božič
- Department of Theoretical Physics, Jožef Stefan Institute, Ljubljana, Slovenia.
| |
Collapse
|
3
|
Zhang S, Li J, Chen SJ. Machine learning in RNA structure prediction: Advances and challenges. Biophys J 2024; 123:2647-2657. [PMID: 38297836 PMCID: PMC11393687 DOI: 10.1016/j.bpj.2024.01.026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 01/08/2024] [Accepted: 01/24/2024] [Indexed: 02/02/2024] Open
Abstract
RNA molecules play a crucial role in various biological processes, with their functionality closely tied to their structures. The remarkable advancements in machine learning techniques for protein structure prediction have shown promise in the field of RNA structure prediction. In this perspective, we discuss the advances and challenges encountered in constructing machine learning-based models for RNA structure prediction. We explore topics including model building strategies, specific challenges involved in predicting RNA secondary (2D) and tertiary (3D) structures, and approaches to these challenges. In addition, we highlight the advantages and challenges of constructing RNA language models. Given the rapid advances of machine learning techniques, we anticipate that machine learning-based models will serve as important tools for predicting RNA structures, thereby enriching our understanding of RNA structures and their corresponding functions.
Collapse
Affiliation(s)
- Sicheng Zhang
- Department of Physics and Institute of Data Science and Informatics, University of Missouri, Columbia, Missouri
| | - Jun Li
- Department of Physics and Institute of Data Science and Informatics, University of Missouri, Columbia, Missouri
| | - Shi-Jie Chen
- Department of Physics and Institute of Data Science and Informatics, University of Missouri, Columbia, Missouri; Department of Biochemistry, University of Missouri, Columbia, Missouri.
| |
Collapse
|
4
|
Bosch A, Guzman HV, Pérez R. Adsorption-Driven Deformation and Footprints of the RBD Proteins in SARS-CoV-2 Variants on Biological and Inanimate Surfaces. J Chem Inf Model 2024; 64:5977-5990. [PMID: 39083670 PMCID: PMC11323246 DOI: 10.1021/acs.jcim.4c00460] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Revised: 06/03/2024] [Accepted: 06/04/2024] [Indexed: 08/02/2024]
Abstract
Respiratory viruses, carried through airborne microdroplets, frequently adhere to surfaces, including plastics and metals. However, our understanding of the interactions between viruses and materials remains limited, particularly in scenarios involving polarizable surfaces. Here, we investigate the role of the receptor-binding domain (RBD) of the spike protein mutations on the adsorption of SARS-CoV-2 to hydrophobic and hydrophilic surfaces employing molecular simulations. To contextualize our findings, we contrast the interactions on inanimate surfaces with those on native biological interfaces, specifically the angiotensin-converting enzyme 2. Notably, we identify a 2-fold increase in structural deformations for the protein's receptor binding motif (RBM) onto inanimate surfaces, indicative of enhanced shock-absorbing mechanisms. Furthermore, the distribution of adsorbed amino acids (landing footprints) on the inanimate surface reveals a distinct regional asymmetry relative to the biological interface, with roughly half of the adsorbed amino acids arranged in opposite sites. In spite of the H-bonds formed at the hydrophilic substrate, the simulations consistently show a higher number of contacts and interfacial area with the hydrophobic surface, where the wild-type RBD adsorbs more strongly than the Delta or Omicron RBDs. In contrast, the adsorption of Delta and Omicron to hydrophilic surfaces was characterized by a distinctive hopping-pattern. The novel shock-absorbing mechanisms identified in the virus adsorption on inanimate surfaces show the embedded high-deformation capacity of the RBD without losing its secondary structure, which could lead to current experimental strategies in the design of virucidal surfaces.
Collapse
Affiliation(s)
- Antonio
M. Bosch
- Departamento
de Física Teórica de la Materia Condensada, Universidad Autónoma de Madrid, E-28049 Madrid, Spain
| | - Horacio V. Guzman
- Departamento
de Física Teórica de la Materia Condensada, Universidad Autónoma de Madrid, E-28049 Madrid, Spain
- Department
of Theoretical Physics, Jožef Stefan
Institute, SI-1000 Ljubljana, Slovenia
| | - Rubén Pérez
- Departamento
de Física Teórica de la Materia Condensada, Universidad Autónoma de Madrid, E-28049 Madrid, Spain
- Condensed
Matter Physics Center (IFIMAC), Universidad
Autónoma de Madrid, E-28049 Madrid, Spain
| |
Collapse
|
5
|
Wang X, Terashi G, Kihara D. CryoREAD: de novo structure modeling for nucleic acids in cryo-EM maps using deep learning. Nat Methods 2023; 20:1739-1747. [PMID: 37783885 PMCID: PMC10841814 DOI: 10.1038/s41592-023-02032-5] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Accepted: 08/24/2023] [Indexed: 10/04/2023]
Abstract
DNA and RNA play fundamental roles in various cellular processes, where their three-dimensional structures provide information critical to understanding the molecular mechanisms of their functions. Although an increasing number of nucleic acid structures and their complexes with proteins are determined by cryogenic electron microscopy (cryo-EM), structure modeling for DNA and RNA remains challenging particularly when the map is determined at a resolution coarser than atomic level. Moreover, computational methods for nucleic acid structure modeling are relatively scarce. Here, we present CryoREAD, a fully automated de novo DNA/RNA atomic structure modeling method using deep learning. CryoREAD identifies phosphate, sugar and base positions in a cryo-EM map using deep learning, which are traced and modeled into a three-dimensional structure. When tested on cryo-EM maps determined at 2.0 to 5.0 Å resolution, CryoREAD built substantially more accurate models than existing methods. We also applied the method to cryo-EM maps of biomolecular complexes in severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2).
Collapse
Affiliation(s)
- Xiao Wang
- Department of Computer Science, Purdue University, West Lafayette, IN, USA
| | - Genki Terashi
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
| | - Daisuke Kihara
- Department of Computer Science, Purdue University, West Lafayette, IN, USA.
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA.
| |
Collapse
|
6
|
Wang X, Yu S, Lou E, Tan YL, Tan ZJ. RNA 3D Structure Prediction: Progress and Perspective. Molecules 2023; 28:5532. [PMID: 37513407 PMCID: PMC10386116 DOI: 10.3390/molecules28145532] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 07/05/2023] [Accepted: 07/13/2023] [Indexed: 07/30/2023] Open
Abstract
Ribonucleic acid (RNA) molecules play vital roles in numerous important biological functions such as catalysis and gene regulation. The functions of RNAs are strongly coupled to their structures or proper structure changes, and RNA structure prediction has been paid much attention in the last two decades. Some computational models have been developed to predict RNA three-dimensional (3D) structures in silico, and these models are generally composed of predicting RNA 3D structure ensemble, evaluating near-native RNAs from the structure ensemble, and refining the identified RNAs. In this review, we will make a comprehensive overview of the recent advances in RNA 3D structure modeling, including structure ensemble prediction, evaluation, and refinement. Finally, we will emphasize some insights and perspectives in modeling RNA 3D structures.
Collapse
Affiliation(s)
- Xunxun Wang
- Department of Physics, Key Laboratory of Artificial Micro & Nano-Structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Shixiong Yu
- Department of Physics, Key Laboratory of Artificial Micro & Nano-Structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - En Lou
- Department of Physics, Key Laboratory of Artificial Micro & Nano-Structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Ya-Lan Tan
- School of Bioengineering and Health, Wuhan Textile University, Wuhan 430200, China
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences, Wuhan Textile University, Wuhan 430200, China
| | - Zhi-Jie Tan
- Department of Physics, Key Laboratory of Artificial Micro & Nano-Structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| |
Collapse
|
7
|
Wang X, Tan YL, Yu S, Shi YZ, Tan ZJ. Predicting 3D structures and stabilities for complex RNA pseudoknots in ion solutions. Biophys J 2023; 122:1503-1516. [PMID: 36924021 PMCID: PMC10147842 DOI: 10.1016/j.bpj.2023.03.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2022] [Revised: 03/05/2023] [Accepted: 03/10/2023] [Indexed: 03/17/2023] Open
Abstract
RNA pseudoknots are a kind of important tertiary motif, and the structures and stabilities of pseudoknots are generally critical to the biological functions of RNAs with the motifs. In this work, we have carefully refined our previously developed coarse-grained model with salt effect through involving a new coarse-grained force field and a replica-exchange Monte Carlo algorithm, and employed the model to predict structures and stabilities of complex RNA pseudoknots in ion solutions beyond minimal H-type pseudoknots. Compared with available experimental data, the newly refined model can successfully predict 3D structures from sequences for the complex RNA pseudoknots including SARS-CoV-2 programming-1 ribosomal frameshifting element and Zika virus xrRNA, and can reliably predict the thermal stabilities of RNA pseudoknots with various sequences and lengths over broad ranges of monovalent/divalent salts. In addition, for complex pseudoknots including SARS-CoV-2 frameshifting element, our analyses show that their thermally unfolding pathways are mainly dependent on the relative stabilities of unfolded intermediate states, in analogy to those of minimal H-type pseudoknots.
Collapse
Affiliation(s)
- Xunxun Wang
- Department of Physics and Key Laboratory of Artificial Micro & Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China
| | - Ya-Lan Tan
- Research Center of Nonlinear Science and School of Mathematical and Physical Sciences, Wuhan Textile University, Wuhan, China
| | - Shixiong Yu
- Department of Physics and Key Laboratory of Artificial Micro & Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China
| | - Ya-Zhou Shi
- Research Center of Nonlinear Science and School of Mathematical and Physical Sciences, Wuhan Textile University, Wuhan, China
| | - Zhi-Jie Tan
- Department of Physics and Key Laboratory of Artificial Micro & Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China.
| |
Collapse
|
8
|
Ma H, Pham P, Luo B, Rangan R, Kappel K, Su Z, Das R. Auto-DRRAFTER: Automated RNA Modeling Based on Cryo-EM Density. Methods Mol Biol 2023; 2568:193-211. [PMID: 36227570 DOI: 10.1007/978-1-0716-2687-0_13] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/14/2023]
Abstract
RNA three-dimensional structures provide rich and vital information for understanding their functions. Recent advances in cryogenic electron microscopy (cryo-EM) allow structure determination of RNAs and ribonucleoprotein (RNP) complexes. However, limited global and local resolutions of RNA cryo-EM maps pose great challenges in tracing RNA coordinates. The Rosetta-based "auto-DRRAFTER" method builds RNA models into moderate-resolution RNA cryo-EM density as part of the Ribosolve pipeline. Here, we describe a step-by-step protocol for auto-DRRAFTER using a glycine riboswitch from Fusobacterium nucleatum as an example. Successful implementation of this protocol allows automated RNA modeling into RNA cryo-EM density, accelerating our understanding of RNA structure-function relationships. Input and output files are being made available at https://github.com/auto-DRRAFTER/springer-chapter .
Collapse
Affiliation(s)
- Haiyun Ma
- The State Key Laboratory of Biotherapy, Department of Geriatrics and National Clinical Research Center for Geriatrics, West China Hospital, Sichuan University, Chengdu, Sichuan, China
| | - Phillip Pham
- Biophysics Program, Stanford University, Stanford, CA, USA
| | - Bingnan Luo
- The State Key Laboratory of Biotherapy, Department of Geriatrics and National Clinical Research Center for Geriatrics, West China Hospital, Sichuan University, Chengdu, Sichuan, China
| | - Ramya Rangan
- Biophysics Program, Stanford University, Stanford, CA, USA
| | - Kalli Kappel
- Biophysics Program, Stanford University, Stanford, CA, USA
| | - Zhaoming Su
- The State Key Laboratory of Biotherapy, Department of Geriatrics and National Clinical Research Center for Geriatrics, West China Hospital, Sichuan University, Chengdu, Sichuan, China.
| | - Rhiju Das
- Biophysics Program, Stanford University, Stanford, CA, USA.
| |
Collapse
|
9
|
Zhou L, Wang X, Yu S, Tan YL, Tan ZJ. FebRNA: An automated fragment-ensemble-based model for building RNA 3D structures. Biophys J 2022; 121:3381-3392. [PMID: 35978551 PMCID: PMC9515226 DOI: 10.1016/j.bpj.2022.08.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2022] [Revised: 07/19/2022] [Accepted: 08/15/2022] [Indexed: 11/23/2022] Open
Abstract
Knowledge of RNA three-dimensional (3D) structures is critical to understanding the important biological functions of RNAs. Although various structure prediction models have been developed, the high-accuracy predictions of RNA 3D structures are still limited to the RNAs with short lengths or with simple topology. In this work, we proposed a new model, namely FebRNA, for building RNA 3D structures through fragment assembly based on coarse-grained (CG) fragment ensembles. Specifically, FebRNA is composed of four processes: establishing the library of different types of non-redundant CG fragment ensembles regardless of the sequences, building CG 3D structure ensemble through fragment assembly, identifying top-scored CG structures through a specific CG scoring function, and rebuilding the all-atom structures from the top-scored CG ones. Extensive examination against different types of RNA structures indicates that FebRNA consistently gives the reliable predictions on RNA 3D structures, including pseudoknots, three-way junctions, four-way and five-way junctions, and RNAs in the RNA-Puzzles. FebRNA is available on the Web site: https://github.com/Tan-group/FebRNA.
Collapse
Affiliation(s)
- Li Zhou
- Department of Physics and Key Laboratory of Artificial Micro & Nano-structures of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Xunxun Wang
- Department of Physics and Key Laboratory of Artificial Micro & Nano-structures of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Shixiong Yu
- Department of Physics and Key Laboratory of Artificial Micro & Nano-structures of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Ya-Lan Tan
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences, Wuhan Textile University, Wuhan 430073, China.
| | - Zhi-Jie Tan
- Department of Physics and Key Laboratory of Artificial Micro & Nano-structures of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China.
| |
Collapse
|
10
|
Mukherjee D, Maiti S, Gouda PK, Sharma R, Roy P, Bhattacharyya D. RNABPDB: Molecular Modeling of RNA Structure-From Base Pair Analysis in Crystals to Structure Prediction. Interdiscip Sci 2022; 14:759-774. [PMID: 35705797 DOI: 10.1007/s12539-022-00528-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Revised: 05/05/2022] [Accepted: 05/17/2022] [Indexed: 06/15/2023]
Abstract
The stable three-dimensional structure of RNA is known to play several important biochemical roles, from post-transcriptional gene regulation to enzymatic action. These structures contain double-helical regions, which often have different types of non-canonical base pairs in addition to Watson-Crick base pairs. Hence, it is important to study their structures from experimentally obtained or even predicted ones, to understand their role, or to develop a drug against the potential targets. Molecular Modeling of RNA double helices containing non-canonical base pairs is a difficult process, particularly due to the unavailability of structural features of non-Watson-Crick base pairs. Here we show a composite web-server with an associated database that allows one to generate the structure of RNA double helix containing non-canonical base pairs using consensus parameters obtained from the database. The database classification is followed by an evaluation of the central tendency of the structural parameters as well as a quantitative estimation of interaction strengths. These parameters are used to construct three-dimensional structures of double helices composed of Watson-Crick and/or non-canonical base pairs. Our benchmark study to regenerate double-helical fragments of many experimentally derived RNA structures indicate very high accuracy. This composite server is expected to be highly useful in understanding functions of various pre-miRNA by modeling structures of the molecules and estimating binding efficiency. The database can be accessed from http://hdrnas.saha.ac.in/rnabpdb .
Collapse
Affiliation(s)
- Debasish Mukherjee
- Computational Science Division, Saha Institute of Nuclear Physics, 1/AF Bidhannagar, Kolkata, 700064, India.
- Institute of Molecular Biology gGmbH (IMB), Ackermannweg 4, 55128, Mainz, Germany.
| | - Satyabrata Maiti
- Computational Science Division, Saha Institute of Nuclear Physics, 1/AF Bidhannagar, Kolkata, 700064, India
- Homi Bhaba National Institute, Anushaktinagar, Mumbai, 400094, India
| | - Prasanta Kumar Gouda
- Computational Science Division, Saha Institute of Nuclear Physics, 1/AF Bidhannagar, Kolkata, 700064, India
| | - Richa Sharma
- Computational Science Division, Saha Institute of Nuclear Physics, 1/AF Bidhannagar, Kolkata, 700064, India
| | - Parthajit Roy
- Department of Computer Science, The University of Burdwan, Golapbag, Burdwan, 713104, India
| | - Dhananjay Bhattacharyya
- Computational Science Division, Saha Institute of Nuclear Physics, 1/AF Bidhannagar, Kolkata, 700064, India
- Homi Bhaba National Institute, Anushaktinagar, Mumbai, 400094, India
| |
Collapse
|
11
|
Evtugyn G, Porfireva A, Tsekenis G, Oravczova V, Hianik T. Electrochemical Aptasensors for Antibiotics Detection: Recent Achievements and Applications for Monitoring Food Safety. SENSORS (BASEL, SWITZERLAND) 2022; 22:3684. [PMID: 35632093 PMCID: PMC9143886 DOI: 10.3390/s22103684] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Revised: 05/05/2022] [Accepted: 05/07/2022] [Indexed: 06/15/2023]
Abstract
Antibiotics are often used in human and veterinary medicine for the treatment of bacterial diseases. However, extensive use of antibiotics in agriculture can result in the contamination of common food staples such as milk. Consumption of contaminated products can cause serious illness and a rise in antibiotic resistance. Conventional methods of antibiotics detection such are microbiological assays chromatographic and mass spectroscopy methods are sensitive; however, they require qualified personnel, expensive instruments, and sample pretreatment. Biosensor technology can overcome these drawbacks. This review is focused on the recent achievements in the electrochemical biosensors based on nucleic acid aptamers for antibiotic detection. A brief explanation of conventional methods of antibiotic detection is also provided. The methods of the aptamer selection are explained, together with the approach used for the improvement of aptamer affinity by post-SELEX modification and computer modeling. The substantial focus of this review is on the explanation of the principles of the electrochemical detection of antibiotics by aptasensors and on recent achievements in the development of electrochemical aptasensors. The current trends and problems in practical applications of aptasensors are also discussed.
Collapse
Affiliation(s)
- Gennady Evtugyn
- A.M. Butlerov’ Chemistry Institute, Kazan Federal University, 18 Kremlevskaya Street, 420008 Kazan, Russia; (G.E.); (A.P.)
- Analytical Chemistry Department, Chemical Technology Institute, Ural Federal University, 19 Mira Street, 620002 Ekaterinburg, Russia
| | - Anna Porfireva
- A.M. Butlerov’ Chemistry Institute, Kazan Federal University, 18 Kremlevskaya Street, 420008 Kazan, Russia; (G.E.); (A.P.)
| | - George Tsekenis
- Biomedical Research Foundation, Academy of Athens, 4 Soranou Ephessiou Street, 115 27 Athens, Greece;
| | - Veronika Oravczova
- Department of Nuclear Physics and Biophysics, Comenius University, Mlynska Dolina F1, 842 48 Bratislava, Slovakia;
| | - Tibor Hianik
- Department of Nuclear Physics and Biophysics, Comenius University, Mlynska Dolina F1, 842 48 Bratislava, Slovakia;
| |
Collapse
|
12
|
Li Y, Baptista RP, Mei X, Kissinger JC. Small and intermediate size structural RNAs in the unicellular parasite Cryptosporidium parvum as revealed by sRNA-seq and comparative genomics. Microb Genom 2022; 8. [PMID: 35536609 PMCID: PMC9465071 DOI: 10.1099/mgen.0.000821] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
Abstract
Small and intermediate-size noncoding RNAs (sRNAs and is-ncRNAs) have been shown to play important regulatory roles in the development of several eukaryotic organisms. However, they have not been thoroughly explored in Cryptosporidium parvum, an obligate zoonotic protist parasite responsible for the diarrhoeal disease cryptosporidiosis. Using Illumina sequencing of a small RNA library, a systematic identification of novel small and is-ncRNAs was performed in C. parvum excysted sporozoites. A total of 79 novel is-ncRNA candidates, including antisense, intergenic and intronic is-ncRNAs, were identified, including 7 new small nucleolar RNAs (snoRNAs). Expression of select novel is-ncRNAs was confirmed by RT-PCR. Phylogenetic conservation was analysed using covariance models (CMs) in related Cryptosporidium and apicomplexan parasite genome sequences. A potential new type of small ncRNA derived from tRNA fragments was observed. Overall, a deep profiling analysis of novel is-ncRNAs in C. parvum and related species revealed structural features and conservation of these novel is-ncRNAs. Covariance models can be used to detect is-ncRNA genes in other closely related parasites. These findings provide important new sequences for additional functional characterization of novel is-ncRNAs in the protist pathogen C. parvum.
Collapse
Affiliation(s)
- Yiran Li
- Institute of Bioinformatics, University of Georgia, Athens, GA, USA
| | - Rodrigo P Baptista
- Institute of Bioinformatics, University of Georgia, Athens, GA, USA.,Center for Tropical and Emerging Global Diseases, University of Georgia, Athens, GA, USA.,Present address: Houston Methodist Research Institute, Houston, TX, USA
| | - Xiaohan Mei
- Department of Physiology and Pharmacology, University of Georgia, Athens, GA, USA
| | - Jessica C Kissinger
- Institute of Bioinformatics, University of Georgia, Athens, GA, USA.,Center for Tropical and Emerging Global Diseases, University of Georgia, Athens, GA, USA.,Department of Genetics, University of Georgia, Athens, GA, USA
| |
Collapse
|
13
|
Dykstra PB, Kaplan M, Smolke CD. Engineering synthetic RNA devices for cell control. Nat Rev Genet 2022; 23:215-228. [PMID: 34983970 PMCID: PMC9554294 DOI: 10.1038/s41576-021-00436-7] [Citation(s) in RCA: 42] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/15/2021] [Indexed: 12/16/2022]
Abstract
The versatility of RNA in sensing and interacting with small molecules, proteins and other nucleic acids while encoding genetic instructions for protein translation makes it a powerful substrate for engineering biological systems. RNA devices integrate cellular information sensing, processing and actuation of specific signals into defined functions and have yielded programmable biological systems and novel therapeutics of increasing sophistication. However, challenges centred on expanding the range of analytes that can be sensed and adding new mechanisms of action have hindered the full realization of the field's promise. Here, we describe recent advances that address these limitations and point to a significant maturation of synthetic RNA-based devices.
Collapse
Affiliation(s)
- Peter B. Dykstra
- Department of Bioengineering, Stanford University, Stanford, CA, USA
| | - Matias Kaplan
- Department of Bioengineering, Stanford University, Stanford, CA, USA
| | - Christina D. Smolke
- Department of Bioengineering, Stanford University, Stanford, CA, USA.,Chan Zuckerberg Biohub, San Francisco, CA, USA.,
| |
Collapse
|
14
|
Mardia KV, Wiechers H, Eltzner B, Huckemann SF. Principal component analysis and clustering on manifolds. J MULTIVARIATE ANAL 2022. [DOI: 10.1016/j.jmva.2021.104862] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
15
|
Gianti E, Percec S. Machine Learning at the Interface of Polymer Science and Biology: How Far Can We Go? Biomacromolecules 2022; 23:576-591. [PMID: 35133143 DOI: 10.1021/acs.biomac.1c01436] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
This Perspective outlines recent progress and future directions for using machine learning (ML), a data-driven method, to address critical questions in the design, synthesis, processing, and characterization of biomacromolecules. The achievement of these tasks requires the navigation of vast and complex chemical and biological spaces, difficult to accomplish with reasonable speed. Using modern algorithms and supercomputers, quantum physics methods are able to examine systems containing a few hundred interacting species and determine the probability of finding them in a particular region of phase space, thereby anticipating their properties. Likewise, modern approaches in chemistry and biomolecular simulation, supported by high performance computing, have culminated in producing data sets of escalating size and intrinsically high complexity. Hence, using ML to extract relevant information from these fields is of paramount importance to advance our understanding of chemical and biomolecular systems. At the heart of ML approaches lie statistical algorithms, which by evaluating a portion of a given data set, identify, learn, and manipulate the underlying rules that govern the whole data set. The assembly of a quality model to represent the data followed by the predictions and elimination of error sources are the key steps in ML. In addition to a growing infrastructure of ML tools to address complex problems, an increasing number of aspects related to our understanding of the fundamental properties of biomacromolecules are exposed to ML. These fields, including those residing at the interface of polymer science and biology (i.e., structure determination, de novo design, folding, and dynamics), strive to adopt and take advantage of the transformative power offered by approaches in the ML domain, which clearly has the potential of accelerating research in the field of biomacromolecules.
Collapse
Affiliation(s)
- Eleonora Gianti
- Institute for Computational Molecular Science (ICMS), Temple University, Philadelphia, Pennsylvania 19122, United States.,Department of Chemistry, Temple University, Philadelphia, Pennsylvania 19122, United States
| | - Simona Percec
- Department of Chemistry, Temple University, Philadelphia, Pennsylvania 19122, United States
| |
Collapse
|
16
|
Guo ZH, Yuan L, Tan YL, Zhang BG, Shi YZ. RNAStat: An Integrated Tool for Statistical Analysis of RNA 3D Structures. FRONTIERS IN BIOINFORMATICS 2022; 1:809082. [PMID: 36303785 PMCID: PMC9580920 DOI: 10.3389/fbinf.2021.809082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2021] [Accepted: 12/17/2021] [Indexed: 11/13/2022] Open
Abstract
The 3D architectures of RNAs are essential for understanding their cellular functions. While an accurate scoring function based on the statistics of known RNA structures is a key component for successful RNA structure prediction or evaluation, there are few tools or web servers that can be directly used to make comprehensive statistical analysis for RNA 3D structures. In this work, we developed RNAStat, an integrated tool for making statistics on RNA 3D structures. For given RNA structures, RNAStat automatically calculates RNA structural properties such as size and shape, and shows their distributions. Based on the RNA structure annotation from DSSR, RNAStat provides statistical information of RNA secondary structure motifs including canonical/non-canonical base pairs, stems, and various loops. In particular, the geometry of base-pairing/stacking can be calculated in RNAStat by constructing a local coordinate system for each base. In addition, RNAStat also supplies the distribution of distance between any atoms to the users to help build distance-based RNA statistical potentials. To test the usability of the tool, we established a non-redundant RNA 3D structure dataset, and based on the dataset, we made a comprehensive statistical analysis on RNA structures, which could have the guiding significance for RNA structure modeling. The python code of RNAStat, the dataset used in this work, and corresponding statistical data files are freely available at GitHub (https://github.com/RNA-folding-lab/RNAStat).
Collapse
Affiliation(s)
- Zhi-Hao Guo
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences, Wuhan Textile University, Wuhan, China
- School of Computer Science and Artificial Intelligence, Wuhan Textile University, Wuhan, China
| | - Li Yuan
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences, Wuhan Textile University, Wuhan, China
- School of Computer Science and Artificial Intelligence, Wuhan Textile University, Wuhan, China
| | - Ya-Lan Tan
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences, Wuhan Textile University, Wuhan, China
| | - Ben-Gong Zhang
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences, Wuhan Textile University, Wuhan, China
| | - Ya-Zhou Shi
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences, Wuhan Textile University, Wuhan, China
- *Correspondence: Ya-Zhou Shi,
| |
Collapse
|
17
|
Dingle K, Ghaddar F, Šulc P, Louis AA. Phenotype Bias Determines How Natural RNA Structures Occupy the Morphospace of All Possible Shapes. Mol Biol Evol 2022; 39:msab280. [PMID: 34542628 PMCID: PMC8763027 DOI: 10.1093/molbev/msab280] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Morphospaces-representations of phenotypic characteristics-are often populated unevenly, leaving large parts unoccupied. Such patterns are typically ascribed to contingency, or else to natural selection disfavoring certain parts of the morphospace. The extent to which developmental bias, the tendency of certain phenotypes to preferentially appear as potential variation, also explains these patterns is hotly debated. Here we demonstrate quantitatively that developmental bias is the primary explanation for the occupation of the morphospace of RNA secondary structure (SS) shapes. Upon random mutations, some RNA SS shapes (the frequent ones) are much more likely to appear than others. By using the RNAshapes method to define coarse-grained SS classes, we can directly compare the frequencies that noncoding RNA SS shapes appear in the RNAcentral database to frequencies obtained upon a random sampling of sequences. We show that: 1) only the most frequent structures appear in nature; the vast majority of possible structures in the morphospace have not yet been explored; 2) remarkably small numbers of random sequences are needed to produce all the RNA SS shapes found in nature so far; and 3) perhaps most surprisingly, the natural frequencies are accurately predicted, over several orders of magnitude in variation, by the likelihood that structures appear upon a uniform random sampling of sequences. The ultimate cause of these patterns is not natural selection, but rather a strong phenotype bias in the RNA genotype-phenotype map, a type of developmental bias or "findability constraint," which limits evolutionary dynamics to a hugely reduced subset of structures that are easy to "find."
Collapse
Affiliation(s)
- Kamaludin Dingle
- Centre for Applied Mathematics and Bioinformatics, Department of Mathematics and Natural Sciences, Gulf University for Science and Technology, Hawally, Kuwait
| | - Fatme Ghaddar
- Centre for Applied Mathematics and Bioinformatics, Department of Mathematics and Natural Sciences, Gulf University for Science and Technology, Hawally, Kuwait
| | - Petr Šulc
- School of Molecular Sciences and Center for Molecular Design and Biomimetics at the Biodesign Institute, Arizona State University, Tempe, AZ, USA
| | - Ard A Louis
- Rudolf Peierls Centre for Theoretical Physics, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
18
|
Poblete S, Božič A, Kanduč M, Podgornik R, Guzman HV. RNA Secondary Structures Regulate Adsorption of Fragments onto Flat Substrates. ACS OMEGA 2021; 6:32823-32831. [PMID: 34901632 PMCID: PMC8655909 DOI: 10.1021/acsomega.1c04774] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Accepted: 11/04/2021] [Indexed: 06/14/2023]
Abstract
RNA is a functionally rich molecule with multilevel, hierarchical structures whose role in the adsorption to molecular substrates is only beginning to be elucidated. Here, we introduce a multiscale simulation approach that combines a tractable coarse-grained RNA structural model with an interaction potential of a structureless flat adsorbing substrate. Within this approach, we study the specific role of stem-hairpin and multibranch RNA secondary structure motifs on its adsorption phenomenology. Our findings identify a dual regime of adsorption for short RNA fragments with and without the secondary structure and underline the adsorption efficiency in both cases as a function of the surface interaction strength. The observed behavior results from an interplay between the number of contacts formed at the surface and the conformational entropy of the RNA molecule. The adsorption phenomenology of RNA seems to persist also for much longer RNAs as qualitatively observed by comparing the trends of our simulations with a theoretical approach based on an ideal semiflexible polymer chain.
Collapse
Affiliation(s)
- Simón Poblete
- Instituto
de Ciencias Físicas y Matemáticas, Universidad Austral de Chile, Valdivia 5091000, Chile
- Computational
Biology Lab, Fundación Ciencia &
Vida, Santiago 7780272, Chile
| | - Anže Božič
- Department
of Theoretical Physics, Jožef Stefan
Institute, SI-1000 Ljubljana, Slovenia
| | - Matej Kanduč
- Department
of Theoretical Physics, Jožef Stefan
Institute, SI-1000 Ljubljana, Slovenia
| | - Rudolf Podgornik
- School
of Physical Sciences and Kavli Institute for Theoretical Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
- Institute
of Physics, Chinese Academy of Sciences, Beijing 100190, China
- Wenzhou
Institute of the University of Chinese Academy of Sciences, Wenzhou, Zhejiang 325000, China
- Department
of Physics, Faculty of Mathematics and Physics, University of Ljubljana, SI-1000 Ljubljana, Slovenia
| | - Horacio V. Guzman
- Department
of Theoretical Physics, Jožef Stefan
Institute, SI-1000 Ljubljana, Slovenia
| |
Collapse
|
19
|
Levintov L, Vashisth H. Role of conformational heterogeneity in ligand recognition by viral RNA molecules. Phys Chem Chem Phys 2021; 23:11211-11223. [PMID: 34010381 DOI: 10.1039/d1cp00679g] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Ribonucleic acid (RNA) molecules are known to undergo conformational changes in response to various environmental stimuli including temperature, pH, and ligands. In particular, viral RNA molecules are a key example of conformationally adapting molecules that have evolved to switch between many functional conformations. The transactivation response element (TAR) RNA from the type-1 human immunodeficiency virus (HIV-1) is a viral RNA molecule that is being increasingly explored as a potential therapeutic target due to its role in the viral replication process. In this work, we have studied the dynamics in TAR RNA in apo and liganded states by performing explicit-solvent molecular dynamics (MD) simulations initiated with 27 distinct structures. We determined that the TAR RNA structure is significantly stabilized on ligand binding with especially decreased fluctuations in its two helices. This rigidity is further coupled with the decreased flipping of bulge nucleotides, which were observed to flip more frequently in the absence of ligands. We found that initially-distinct structures of TAR RNA converged to similar conformations on removing ligands. We also report that conformational dynamics in unliganded TAR structures leads to the formation of binding pockets capable of accommodating ligands of various sizes.
Collapse
Affiliation(s)
- Lev Levintov
- Department of Chemical Engineering, University of New Hampshire, Durham 03824, New Hampshire, USA.
| | - Harish Vashisth
- Department of Chemical Engineering, University of New Hampshire, Durham 03824, New Hampshire, USA.
| |
Collapse
|
20
|
Schlick T, Portillo-Ledesma S, Myers CG, Beljak L, Chen J, Dakhel S, Darling D, Ghosh S, Hall J, Jan M, Liang E, Saju S, Vohr M, Wu C, Xu Y, Xue E. Biomolecular Modeling and Simulation: A Prospering Multidisciplinary Field. Annu Rev Biophys 2021; 50:267-301. [PMID: 33606945 PMCID: PMC8105287 DOI: 10.1146/annurev-biophys-091720-102019] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
We reassess progress in the field of biomolecular modeling and simulation, following up on our perspective published in 2011. By reviewing metrics for the field's productivity and providing examples of success, we underscore the productive phase of the field, whose short-term expectations were overestimated and long-term effects underestimated. Such successes include prediction of structures and mechanisms; generation of new insights into biomolecular activity; and thriving collaborations between modeling and experimentation, including experiments driven by modeling. We also discuss the impact of field exercises and web games on the field's progress. Overall, we note tremendous success by the biomolecular modeling community in utilization of computer power; improvement in force fields; and development and application of new algorithms, notably machine learning and artificial intelligence. The combined advances are enhancing the accuracy andscope of modeling and simulation, establishing an exemplary discipline where experiment and theory or simulations are full partners.
Collapse
Affiliation(s)
- Tamar Schlick
- Department of Chemistry, New York University, New York, New York 10003, USA;
- Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, USA
- New York University-East China Normal University Center for Computational Chemistry, New York University Shanghai, Shanghai 200122, China
| | | | - Christopher G Myers
- Department of Chemistry, New York University, New York, New York 10003, USA;
| | - Lauren Beljak
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Justin Chen
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Sami Dakhel
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Daniel Darling
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Sayak Ghosh
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Joseph Hall
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Mikaeel Jan
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Emily Liang
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Sera Saju
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Mackenzie Vohr
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Chris Wu
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Yifan Xu
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Eva Xue
- College of Arts and Science, New York University, New York, New York 10003, USA
| |
Collapse
|
21
|
Feng C, Tan YL, Cheng YX, Shi YZ, Tan ZJ. Salt-Dependent RNA Pseudoknot Stability: Effect of Spatial Confinement. Front Mol Biosci 2021; 8:666369. [PMID: 33928126 PMCID: PMC8078894 DOI: 10.3389/fmolb.2021.666369] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2021] [Accepted: 03/17/2021] [Indexed: 12/27/2022] Open
Abstract
Macromolecules, such as RNAs, reside in crowded cell environments, which could strongly affect the folded structures and stability of RNAs. The emergence of RNA-driven phase separation in biology further stresses the potential functional roles of molecular crowding. In this work, we employed the coarse-grained model that was previously developed by us to predict 3D structures and stability of the mouse mammary tumor virus (MMTV) pseudoknot under different spatial confinements over a wide range of salt concentrations. The results show that spatial confinements can not only enhance the compactness and stability of MMTV pseudoknot structures but also weaken the dependence of the RNA structure compactness and stability on salt concentration. Based on our microscopic analyses, we found that the effect of spatial confinement on the salt-dependent RNA pseudoknot stability mainly comes through the spatial suppression of extended conformations, which are prevalent in the partially/fully unfolded states, especially at low ion concentrations. Furthermore, our comprehensive analyses revealed that the thermally unfolding pathway of the pseudoknot can be significantly modulated by spatial confinements, since the intermediate states with more extended conformations would loss favor when spatial confinements are introduced.
Collapse
Affiliation(s)
- Chenjie Feng
- Key Laboratory of Artificial Micro and Nano-structures of Ministry of Education, Center for Theoretical Physics, School of Physics and Technology, Wuhan University, Wuhan, China
| | - Ya-Lan Tan
- Key Laboratory of Artificial Micro and Nano-structures of Ministry of Education, Center for Theoretical Physics, School of Physics and Technology, Wuhan University, Wuhan, China
| | - Yu-Xuan Cheng
- Key Laboratory of Artificial Micro and Nano-structures of Ministry of Education, Center for Theoretical Physics, School of Physics and Technology, Wuhan University, Wuhan, China
| | - Ya-Zhou Shi
- Research Center of Nonlinear Science, School of Mathematics and Computer Science, Wuhan Textile University, Wuhan, China
| | - Zhi-Jie Tan
- Key Laboratory of Artificial Micro and Nano-structures of Ministry of Education, Center for Theoretical Physics, School of Physics and Technology, Wuhan University, Wuhan, China
| |
Collapse
|
22
|
Abstract
Novel RNA motif design is of great practical importance for technology and medicine. Increasingly, computational design plays an important role in such efforts. Our coarse-grained RAG (RNA-As-Graphs) framework offers strategies for enumerating the universe of RNA 2D folds, selecting "RNA-like" candidates for design, and determining sequences that fold onto these candidates. In RAG, RNA secondary structures are represented as tree or dual graphs. Graphs with known RNA structures are called "existing", and the others are labeled "hypothetical". By using simplified features for RNA graphs, we have clustered the hypothetical graphs into "RNA-like" and "non-RNA-like" groups and proposed RNA-like graphs as candidates for design. Here, we propose a new way of designing graph features by using Fiedler vectors. The new features reflect graph shapes better, and they lead to a more clustered organization of existing graphs. We show significant increases in K-means clustering accuracy by using the new features (e.g., up to 95% and 98% accuracy for tree and dual graphs, respectively). In addition, we propose a scoring model for top graph candidate selection. This scoring model allows users to set a threshold for candidates, and it incorporates weighing of existing graphs based on their corresponding number of known RNAs. We include a list of top scored RNA-like candidates, which we hope will stimulate future novel RNA design.
Collapse
Affiliation(s)
- Qiyao Zhu
- Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, United States
| | - Tamar Schlick
- Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, United States
- Department of Chemistry, New York University, New York, New York 10003, United States
- NYU-ECNU Center for Computational Chemistry, NYU Shanghai, Shanghai 200062, P. R. China
| |
Collapse
|
23
|
Huber RG, Marzinek JK, Boon PLS, Yue W, Bond PJ. Computational modelling of flavivirus dynamics: The ins and outs. Methods 2021; 185:28-38. [PMID: 32526282 PMCID: PMC7278654 DOI: 10.1016/j.ymeth.2020.06.004] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2019] [Revised: 04/24/2020] [Accepted: 06/04/2020] [Indexed: 02/06/2023] Open
Abstract
Enveloped viruses such as the flaviviruses represent a significant burden to human health around the world, with hundreds of millions of people each year affected by dengue alone. In an effort to improve our understanding of the molecular basis for the infective mechanisms of these viruses, extensive computational modelling approaches have been applied to elucidate their conformational dynamics. Multiscale protocols have been developed to simulate flavivirus envelopes in close accordance with biophysical data, in particular derived from cryo-electron microscopy, enabling high-resolution refinement of their structures and elucidation of the conformational changes associated with adaptation both to host environments and to immunological factors such as antibodies. Likewise, integrative modelling efforts combining data from biophysical experiments and from genome sequencing with chemical modification are providing unparalleled insights into the architecture of the previously unresolved nucleocapsid complex. Collectively, this work provides the basis for the future rational design of new antiviral therapeutics and vaccine development strategies targeting enveloped viruses.
Collapse
Affiliation(s)
- Roland G Huber
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, Matrix #07-01, 138671, Singapore
| | - Jan K Marzinek
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, Matrix #07-01, 138671, Singapore
| | - Priscilla L S Boon
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, Matrix #07-01, 138671, Singapore; NUS Graduate School for Integrative Sciences and Engineering, National University of Singapore (NUS), University Hall, Tan Chin Tuan Wing #04-02, 119077, Singapore; Department of Biological Sciences (DBS), National University of Singapore (NUS), 16 Science Drive 4, Building S3, Singapore
| | - Wan Yue
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), 60 Biopolis Street, Genome #02-01, 138672, Singapore
| | - Peter J Bond
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, Matrix #07-01, 138671, Singapore; Department of Biological Sciences (DBS), National University of Singapore (NUS), 16 Science Drive 4, Building S3, Singapore.
| |
Collapse
|
24
|
Graf J, Kretz M. From structure to function: Route to understanding lncRNA mechanism. Bioessays 2020; 42:e2000027. [PMID: 33164244 DOI: 10.1002/bies.202000027] [Citation(s) in RCA: 57] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2020] [Revised: 09/03/2020] [Indexed: 12/13/2022]
Abstract
RNAs have emerged as a major target for diagnostics and therapeutics approaches. Regulatory nonprotein-coding RNAs (ncRNAs) in particular display remarkable versatility. They can fold into complex structures and interact with proteins, DNA, and other RNAs, thus modulating activity, localization, or interactome of multi-protein complexes. Thus, ncRNAs confer regulatory plasticity and represent a new layer of regulatory control. Interestingly, long noncoding RNAs (lncRNAs) tend to acquire complex secondary and tertiary structures and their function-in many cases-is dependent on structural conservation rather than primary sequence conservation. Whereas for many proteins, structure and its associated function are closely connected, for lncRNAs, the structural domains that determine functionality and its interactome are still not well understood. Numerous approaches for analyzing the structural configuration of lncRNAs have been developed recently. Here, will provide an overview of major experimental approaches used in the field, and discuss the potential benefit of using combinatorial strategies to analyze lncRNA modes of action based on structural information.
Collapse
Affiliation(s)
- Johannes Graf
- Institute of Biochemistry, Genetics and Microbiology, University of Regensburg, Regensburg, Germany
| | - Markus Kretz
- Institute of Biochemistry, Genetics and Microbiology, University of Regensburg, Regensburg, Germany
| |
Collapse
|
25
|
Li B, Cao Y, Westhof E, Miao Z. Advances in RNA 3D Structure Modeling Using Experimental Data. Front Genet 2020; 11:574485. [PMID: 33193680 PMCID: PMC7649352 DOI: 10.3389/fgene.2020.574485] [Citation(s) in RCA: 45] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Accepted: 09/02/2020] [Indexed: 12/26/2022] Open
Abstract
RNA is a unique bio-macromolecule that can both record genetic information and perform biological functions in a variety of molecular processes, including transcription, splicing, translation, and even regulating protein function. RNAs adopt specific three-dimensional conformations to enable their functions. Experimental determination of high-resolution RNA structures using x-ray crystallography is both laborious and demands expertise, thus, hindering our comprehension of RNA structural biology. The computational modeling of RNA structure was a milestone in the birth of bioinformatics. Although computational modeling has been greatly improved over the last decade showing many successful cases, the accuracy of such computational modeling is not only length-dependent but also varies according to the complexity of the structure. To increase credibility, various experimental data were integrated into computational modeling. In this review, we summarize the experiments that can be integrated into RNA structure modeling as well as the computational methods based on these experimental data. We also demonstrate how computational modeling can help the experimental determination of RNA structure. We highlight the recent advances in computational modeling which can offer reliable structure models using high-throughput experimental data.
Collapse
Affiliation(s)
- Bing Li
- Center of Growth, Metabolism and Aging, Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, China
| | - Yang Cao
- Center of Growth, Metabolism and Aging, Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, China
| | - Eric Westhof
- Architecture et Réactivité de l’ARN, Institut de Biologie Moléculaire et Cellulaire du CNRS, Université de Strasbourg, Strasbourg, France
| | - Zhichao Miao
- Translational Research Institute of Brain and Brain-Like Intelligence, Department of Anesthesiology, Shanghai Fourth People’s Hospital Affiliated to Tongji University School of Medicine, Shanghai, China
- Newcastle Fibrosis Research Group, Institute of Cellular Medicine, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, United Kingdom
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, United Kingdom
| |
Collapse
|
26
|
Schlick T. RNA Back to the Spotlight. J Struct Biol 2020; 212:107555. [PMID: 32592854 PMCID: PMC7311913 DOI: 10.1016/j.jsb.2020.107555] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
27
|
Jain S, Zhu Q, Paz ASP, Schlick T. Identification of novel RNA design candidates by clustering the extended RNA-As-Graphs library. Biochim Biophys Acta Gen Subj 2020; 1864:129534. [PMID: 31954797 DOI: 10.1016/j.bbagen.2020.129534] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2019] [Revised: 01/10/2020] [Accepted: 01/14/2020] [Indexed: 12/31/2022]
Abstract
BACKGROUND We re-evaluate our RNA-As-Graphs clustering approach, using our expanded graph library and new RNA structures, to identify potential RNA-like topologies for design. Our coarse-grained approach represents RNA secondary structures as tree and dual graphs, with vertices and edges corresponding to RNA helices and loops. The graph theoretical framework facilitates graph enumeration, partitioning, and clustering approaches to study RNA structure and its applications. METHODS Clustering graph topologies based on features derived from graph Laplacian matrices and known RNA structures allows us to classify topologies into 'existing' or hypothetical, and the latter into, 'RNA-like' or 'non RNA-like' topologies. Here we update our list of existing tree graph topologies and RAG-3D database of atomic fragments to include newly determined RNA structures. We then use linear and quadratic regression, optionally with dimensionality reduction, to derive graph features and apply several clustering algorithms on our tree-graph library and recently expanded dual-graph library to classify them into the three groups. RESULTS The unsupervised PAM and K-means clustering approaches correctly classify 72-77% of all existing graph topologies and 75-82% of newly added ones as RNA-like. For supervised k-NN clustering, the cross-validation accuracy ranges from 57 to 81%. CONCLUSIONS Using linear regression with unsupervised clustering, or quadratic regression with supervised clustering, provides better accuracies than supervised/linear clustering. All accuracies are better than random, especially for newly added existing topologies, thus lending credibility to our approach. GENERAL SIGNIFICANCE Our updated RAG-3D database and motif classification by clustering present new RNA substructures and RNA-like motifs as novel design candidates.
Collapse
Affiliation(s)
- Swati Jain
- Department of Chemistry, New York University, 1021 Silver, 100 Washington Square East, New York, NY 10003, USA
| | - Qiyao Zhu
- Courant Institute of Mathematical Sciences, New York University, 251 Mercer Street, New York, NY 10012, USA
| | - Amiel S P Paz
- NYU Shanghai, 1555 Century Avenue, Shanghai 200135, China; NYU-ECNU Center for Computational Chemistry, NYU Shanghai, 3663 Zhongshang Road North, Shanghai 200062, China
| | - Tamar Schlick
- Department of Chemistry, New York University, 1021 Silver, 100 Washington Square East, New York, NY 10003, USA; Courant Institute of Mathematical Sciences, New York University, 251 Mercer Street, New York, NY 10012, USA; NYU-ECNU Center for Computational Chemistry, NYU Shanghai, 3663 Zhongshang Road North, Shanghai 200062, China.
| |
Collapse
|
28
|
Inverse folding with RNA-As-Graphs produces a large pool of candidate sequences with target topologies. J Struct Biol 2019; 209:107438. [PMID: 31874236 DOI: 10.1016/j.jsb.2019.107438] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2019] [Revised: 12/18/2019] [Accepted: 12/19/2019] [Indexed: 02/07/2023]
Abstract
We present an RNA-As-Graphs (RAG) based inverse folding algorithm, RAG-IF, to design novel RNA sequences that fold onto target tree graph topologies. The algorithm can be used to enhance our recently reported computational design pipeline (Jain et al., NAR 2018). The RAG approach represents RNA secondary structures as tree and dual graphs, where RNA loops and helices are coarse-grained as vertices and edges, opening the usage of graph theory methods to study, predict, and design RNA structures. Our recently developed computational pipeline for design utilizes graph partitioning (RAG-3D) and atomic fragment assembly (F-RAG) to design sequences to fold onto RNA-like tree graph topologies; the atomic fragments are taken from existing RNA structures that correspond to tree subgraphs. Because F-RAG may not produce the target folds for all designs, automated mutations by RAG-IF algorithm enhance the candidate pool markedly. The crucial residues for mutation are identified by differences between the predicted and the target topology. A genetic algorithm then mutates the selected residues, and the successful sequences are optimized to retain only the minimal or essential mutations. Here we evaluate RAG-IF for 6 RNA-like topologies and generate a large pool of successful candidate sequences with a variety of minimal mutations. We find that RAG-IF adds robustness and efficiency to our RNA design pipeline, making inverse folding motivated by graph topology rather than secondary structure more productive.
Collapse
|
29
|
Peran I, Mittag T. Molecular structure in biomolecular condensates. Curr Opin Struct Biol 2019; 60:17-26. [PMID: 31790873 DOI: 10.1016/j.sbi.2019.09.007] [Citation(s) in RCA: 85] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2019] [Accepted: 09/16/2019] [Indexed: 12/20/2022]
Abstract
Evidence accumulated over the past decade provides support for liquid-liquid phase separation as the mechanism underlying the formation of biomolecular condensates, which include not only 'membraneless' organelles such as nucleoli and RNA granules, but additional assemblies involved in transcription, translation and signaling. Understanding the molecular mechanisms of condensate function requires knowledge of the structures of their constituents. Current knowledge suggests that structures formed via multivalent domain-motif interactions remain largely unchanged within condensates. Two different viewpoints exist regarding structures of disordered low-complexity domains within condensates; one argues that low-complexity domains remain largely disordered in condensates and their multivalency is encoded in short motifs called 'stickers', while the other argues that the sequences form cross-β structures resembling amyloid fibrils. We review these viewpoints and highlight outstanding questions that will inform structure-function relationships for biomolecular condensates.
Collapse
Affiliation(s)
- Ivan Peran
- Department of Structural Biology, St. Jude Children's Research Hospital, Memphis, TN, USA
| | - Tanja Mittag
- Department of Structural Biology, St. Jude Children's Research Hospital, Memphis, TN, USA.
| |
Collapse
|
30
|
Singh J, Hanson J, Paliwal K, Zhou Y. RNA secondary structure prediction using an ensemble of two-dimensional deep neural networks and transfer learning. Nat Commun 2019; 10:5407. [PMID: 31776342 PMCID: PMC6881452 DOI: 10.1038/s41467-019-13395-9] [Citation(s) in RCA: 165] [Impact Index Per Article: 27.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2019] [Accepted: 11/01/2019] [Indexed: 01/03/2023] Open
Abstract
The majority of our human genome transcribes into noncoding RNAs with unknown structures and functions. Obtaining functional clues for noncoding RNAs requires accurate base-pairing or secondary-structure prediction. However, the performance of such predictions by current folding-based algorithms has been stagnated for more than a decade. Here, we propose the use of deep contextual learning for base-pair prediction including those noncanonical and non-nested (pseudoknot) base pairs stabilized by tertiary interactions. Since only [Formula: see text]250 nonredundant, high-resolution RNA structures are available for model training, we utilize transfer learning from a model initially trained with a recent high-quality bpRNA dataset of [Formula: see text]10,000 nonredundant RNAs made available through comparative analysis. The resulting method achieves large, statistically significant improvement in predicting all base pairs, noncanonical and non-nested base pairs in particular. The proposed method (SPOT-RNA), with a freely available server and standalone software, should be useful for improving RNA structure modeling, sequence alignment, and functional annotations.
Collapse
Affiliation(s)
- Jaswinder Singh
- Signal Processing Laboratory, School of Engineering and Built Environment, Griffith University, Brisbane, QLD, 4111, Australia
| | - Jack Hanson
- Signal Processing Laboratory, School of Engineering and Built Environment, Griffith University, Brisbane, QLD, 4111, Australia
| | - Kuldip Paliwal
- Signal Processing Laboratory, School of Engineering and Built Environment, Griffith University, Brisbane, QLD, 4111, Australia.
| | - Yaoqi Zhou
- Institute for Glycomics and School of Information and Communication Technology, Griffith University, Parklands Dr., Southport, QLD, 4222, Australia.
| |
Collapse
|
31
|
Refining RNA solution structures with the integrative use of label-free paramagnetic relaxation enhancement NMR. BIOPHYSICS REPORTS 2019. [DOI: 10.1007/s41048-019-00099-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022] Open
Abstract
AbstractNMR structure calculation is inherently integrative, and can incorporate new experimental data as restraints. As RNAs have lower proton densities and are more conformational heterogenous than proteins, the refinement of RNA structures can benefit from additional types of restraints. Paramagnetic relaxation enhancement (PRE) provides distance information between a paramagnetic probe and protein or RNA nuclei. However, covalent conjugation of a paramagnetic probe is difficult for RNAs, thus limiting the use of PRE NMR for RNA structure characterization. Here, we show that the solvent PRE can be accurately measured for RNA labile imino protons, simply with the addition of an inert paramagnetic cosolute. Demonstrated on three RNAs that have increasingly complex topologies, we show that the incorporation of the solvent PRE restraints can significantly improve the precision and accuracy of RNA structures. Importantly, the solvent PRE data can be collected for RNAs without isotope enrichment. Thus, the solvent PRE method can work integratively with other biophysical techniques for better characterization of RNA structures.
Collapse
|
32
|
Jin L, Tan YL, Wu Y, Wang X, Shi YZ, Tan ZJ. Structure folding of RNA kissing complexes in salt solutions: predicting 3D structure, stability, and folding pathway. RNA (NEW YORK, N.Y.) 2019; 25:1532-1548. [PMID: 31391217 PMCID: PMC6795135 DOI: 10.1261/rna.071662.119] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/30/2019] [Accepted: 08/02/2019] [Indexed: 05/08/2023]
Abstract
RNA kissing complexes are essential for genomic RNA dimerization and regulation of gene expression, and their structures and stability are critical to their biological functions. In this work, we used our previously developed coarse-grained model with an implicit structure-based electrostatic potential to predict three-dimensional (3D) structures and stability of RNA kissing complexes in salt solutions. For extensive RNA kissing complexes, our model shows great reliability in predicting 3D structures from their sequences, and our additional predictions indicate that the model can capture the dependence of 3D structures of RNA kissing complexes on monovalent/divalent ion concentrations. Moreover, the comparisons with extensive experimental data show that the model can make reliable predictions on the stability for various RNA kissing complexes over wide ranges of monovalent/divalent ion concentrations. Notably, for RNA kissing complexes, our further analyses show the important contribution of coaxial stacking to the 3D structures and stronger stability than the corresponding kissing-interface duplexes at high salts. Furthermore, our comprehensive analyses for RNA kissing complexes reveal that the thermally folding pathway for a complex sequence is mainly determined by the relative stability of two possible folded states of kissing complex and extended duplex, which can be significantly modulated by its sequence.
Collapse
Affiliation(s)
- Lei Jin
- Center for Theoretical Physics and Key Laboratory of Artificial Micro and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Ya-Lan Tan
- Center for Theoretical Physics and Key Laboratory of Artificial Micro and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Yao Wu
- Center for Theoretical Physics and Key Laboratory of Artificial Micro and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Xunxun Wang
- Center for Theoretical Physics and Key Laboratory of Artificial Micro and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Ya-Zhou Shi
- Research Center of Nonlinear Science, School of Mathematics and Computer Science, Wuhan Textile University, Wuhan 430073, China
| | - Zhi-Jie Tan
- Center for Theoretical Physics and Key Laboratory of Artificial Micro and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| |
Collapse
|
33
|
Zhang BG, Qiu HH, Jiang J, Liu J, Shi YZ. 3D structure stability of the HIV-1 TAR RNA in ion solutions: A coarse-grained model study. J Chem Phys 2019; 151:165101. [PMID: 31675878 DOI: 10.1063/1.5126128] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
As an extremely common structural motif, RNA hairpins with bulge loops [e.g., the human immunodeficiency virus type 1 (HIV-1) transactivation response (TAR) RNA] can play essential roles in normal cellular processes by binding to proteins and small ligands, which could be very dependent on their three-dimensional (3D) structures and stability. Although the structures and conformational dynamics of the HIV-1 TAR RNA have been extensively studied, there are few investigations on the thermodynamic stability of the TAR RNA, especially in ion solutions, and the existing studies also have some divergence on the unfolding process of the RNA. Here, we employed our previously developed coarse-grained model with implicit salt to predict the 3D structure, stability, and unfolding pathway for the HIV-1 TAR RNA over a wide range of ion concentrations. As compared with the extensive experimental/theoretical results, the present model can give reliable predictions on the 3D structure stability of the TAR RNA from the sequence. Based on the predictions, our further comprehensive analyses on the stability of the TAR RNA as well as its variants revealed that the unfolding pathway of an RNA hairpin with a bulge loop is mainly determined by the relative stability between different states (folded state, intermediate state, and unfolded state) and the strength of the coaxial stacking between two stems in folded structures, both of which can be apparently modulated by the ion concentrations as well as the sequences.
Collapse
Affiliation(s)
- Ben-Gong Zhang
- Research Center of Nonlinear Science, School of Mathematics and Computer Science, Wuhan Textile University, Wuhan, China
| | - Hua-Hai Qiu
- Research Center of Nonlinear Science, School of Mathematics and Computer Science, Wuhan Textile University, Wuhan, China
| | - Jian Jiang
- Research Center of Nonlinear Science, School of Mathematics and Computer Science, Wuhan Textile University, Wuhan, China
| | - Jie Liu
- Research Center of Nonlinear Science, School of Mathematics and Computer Science, Wuhan Textile University, Wuhan, China
| | - Ya-Zhou Shi
- Research Center of Nonlinear Science, School of Mathematics and Computer Science, Wuhan Textile University, Wuhan, China
| |
Collapse
|
34
|
Tan YL, Feng CJ, Jin L, Shi YZ, Zhang W, Tan ZJ. What is the best reference state for building statistical potentials in RNA 3D structure evaluation? RNA (NEW YORK, N.Y.) 2019; 25:793-812. [PMID: 30996105 PMCID: PMC6573789 DOI: 10.1261/rna.069872.118] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/08/2018] [Accepted: 04/06/2019] [Indexed: 05/14/2023]
Abstract
Knowledge-based statistical potentials have been shown to be efficient in protein structure evaluation/prediction, and the core difference between various statistical potentials is attributed to the choice of reference states. However, for RNA 3D structure evaluation, a comprehensive examination on reference states is still lacking. In this work, we built six statistical potentials based on six reference states widely used in protein structure evaluation, including averaging, quasi-chemical approximation, atom-shuffled, finite-ideal-gas, spherical-noninteracting, and random-walk-chain reference states, and we examined the six reference states against three RNA test sets including six subsets. Our extensive examinations show that, overall, for identifying native structures and ranking decoy structures, the finite-ideal-gas and random-walk-chain reference states are slightly superior to others, while for identifying near-native structures, there is only a slight difference between these reference states. Our further analyses show that the performance of a statistical potential is apparently dependent on the quality of the training set. Furthermore, we found that the performance of a statistical potential is closely related to the origin of test sets, and for the three realistic test subsets, the six statistical potentials have overall unsatisfactory performance. This work presents a comprehensive examination on the existing reference states and statistical potentials for RNA 3D structure evaluation.
Collapse
Affiliation(s)
- Ya-Lan Tan
- Center for Theoretical Physics and Key Laboratory of Artificial Micro and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Chen-Jie Feng
- Center for Theoretical Physics and Key Laboratory of Artificial Micro and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Lei Jin
- Center for Theoretical Physics and Key Laboratory of Artificial Micro and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Ya-Zhou Shi
- Research Center of Nonlinear Science, School of Mathematics and Computer Science, Wuhan Textile University, Wuhan 430073, China
| | - Wenbing Zhang
- Center for Theoretical Physics and Key Laboratory of Artificial Micro and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Zhi-Jie Tan
- Center for Theoretical Physics and Key Laboratory of Artificial Micro and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| |
Collapse
|
35
|
Morgan BS, Forte JE, Hargrove AE. Insights into the development of chemical probes for RNA. Nucleic Acids Res 2019; 46:8025-8037. [PMID: 30102391 PMCID: PMC6144806 DOI: 10.1093/nar/gky718] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2018] [Accepted: 07/27/2018] [Indexed: 12/18/2022] Open
Abstract
Over the past decade, the RNA revolution has revealed thousands of non-coding RNAs that are essential for cellular regulation and are misregulated in disease. While the development of methods and tools to study these RNAs has been challenging, the power and promise of small molecule chemical probes is increasingly recognized. To harness existing knowledge, we compiled a list of 116 ligands with reported activity against RNA targets in biological systems (R-BIND). In this survey, we examine the RNA targets, design and discovery strategies, and chemical probe characterization techniques of these ligands. We discuss the applicability of current tools to identify and evaluate RNA-targeted chemical probes, suggest criteria to assess the quality of RNA chemical probes and targets, and propose areas where new tools are particularly needed. We anticipate that this knowledge will expedite the discovery of RNA-targeted ligands and the next phase of the RNA revolution.
Collapse
Affiliation(s)
| | - Jordan E Forte
- Department of Chemistry, Duke University, Durham, NC 27708, USA
| | - Amanda E Hargrove
- Department of Chemistry, Duke University, Durham, NC 27708, USA.,Department of Biochemistry, Duke University School of Medicine, Durham, NC 27710, USA
| |
Collapse
|
36
|
Investigating targets for neuropharmacological intervention by molecular dynamics simulations. Biochem Soc Trans 2019; 47:909-918. [PMID: 31085614 DOI: 10.1042/bst20190048] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2019] [Revised: 04/11/2019] [Accepted: 04/23/2019] [Indexed: 01/09/2023]
Abstract
Medical research has identified over 500 brain disorders. Among these, there are still only very few neuropathologies whose causes are fully understood and, consequently, very few drugs whose mechanism of action is known. No FDA drug has been identified for major neurodegenerative diseases, such as Alzheimer's and Parkinson's. We still lack effective treatments and strategies for modulating progression or even early neurodegenerative disease onset diagnostic tools. A great support toward the highly needed identification of neuroactive drugs comes from computer simulation methods and, in particular, from molecular dynamics (MD). This provides insight into structure-function relationship of a target and predicts structure, dynamics and energetics of ligand/target complexes under biologically relevant conditions like temperature and physiological saline concentration. Here, we present examples of the predictive power of MD for neuroactive ligands/target complexes. This brief survey from our own research shows the usefulness of partnerships between academia and industry, and from joint efforts between experimental and theoretical groups.
Collapse
|
37
|
Cheng MSQ, Su MXX, Wang MXN, Sun MZY, Ou TM. Probes and drugs that interfere with protein translation via targeting to the RNAs or RNA-protein interactions. Methods 2019; 167:124-133. [PMID: 31185274 DOI: 10.1016/j.ymeth.2019.06.004] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2018] [Revised: 04/08/2019] [Accepted: 06/05/2019] [Indexed: 11/18/2022] Open
Abstract
Protein synthesis is critical to cell survival and translation regulation is essential to post-transcriptional gene expression regulation. Disorders of this process, particularly through RNA-binding proteins, is associated with the development and progression of a number of diseases, including cancers. However, the molecular mechanisms underlying the initiation of protein synthesis are intricate, making it difficult to find a drug that interferes with this process. Chemical probes are useful in elucidating the structures of RNA-protein complex and molecular mechanism of biological events. Moreover, some of these chemical probes show certain therapeutic benefits and can be further developed as leading compounds. Here, we will briefly review the general process and mechanism of protein synthesis, and emphasis on chemical probes in examples of probing the RNA structural changes and RNA-protein interactions. Moreover, the therapeutic potential of these probes is also discussed to give a comprehensive understanding.
Collapse
Affiliation(s)
- Miss Sui-Qi Cheng
- Sun Yat-Sen University, School of Pharmaceutical Sciences, Guangzhou, Guangdong 510006, China
| | - Miss Xiao-Xuan Su
- Sun Yat-Sen University, School of Pharmaceutical Sciences, Guangzhou, Guangdong 510006, China.
| | - Miss Xiao-Na Wang
- Sun Yat-Sen University, School of Pharmaceutical Sciences, Guangzhou, Guangdong 510006, China
| | - Miss Zhi-Yin Sun
- Sun Yat-Sen University, School of Pharmaceutical Sciences, Guangzhou, Guangdong 510006, China
| | - Tian-Miao Ou
- Sun Yat-Sen University, School of Pharmaceutical Sciences, 132 Waihuan East Road, Guangzhou University City, Guangzhou, Guangdong, China.
| |
Collapse
|
38
|
Umuhire Juru A, Patwardhan NN, Hargrove AE. Understanding the Contributions of Conformational Changes, Thermodynamics, and Kinetics of RNA-Small Molecule Interactions. ACS Chem Biol 2019; 14:824-838. [PMID: 31042354 DOI: 10.1021/acschembio.8b00945] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
The implication of RNA in multiple cellular processes beyond protein coding has revitalized interest in the development of small molecules for therapeutically targeting RNA and for further probing its cellular biology. However, the process of rationally designing such small molecule probes is hampered by the paucity of information about fundamental molecular recognition principles of RNA. In this Review, we summarize two important and often underappreciated aspects of RNA-small molecule recognition: RNA conformational dynamics and the biophysical properties of interactions of small molecules with RNA, specifically thermodynamics and kinetics. While conformational flexibility is often said to impede RNA ligand development, the ability of small molecules to influence the RNA conformational landscape can have a significant effect on the cellular functions of RNA. An analysis of the conformational landscape of RNA and the interactions of individual conformations with ligands can thus guide the development of new small molecule probes, which needs to be investigated further. Additionally, while it is common practice to quantify the binding affinities ( Ka or Kd) of small molecules for biomacromolecules as a measure of their activity, further biophysical characterization of their interaction can provide a deeper understanding. Studies that focus on the thermodynamic and kinetic parameters for interaction between RNA and ligands are next discussed. Finally, this Review provides the reader with a perspective on how such in-depth analysis of biophysical characteristics of the interaction of RNA and small molecules can impact our understanding of these interactions and how they will benefit the future design of small molecule probes.
Collapse
Affiliation(s)
- Aline Umuhire Juru
- Department of Chemistry, Duke University, Durham, North Carolina 27708, United States
| | - Neeraj N. Patwardhan
- Department of Chemistry, Duke University, Durham, North Carolina 27708, United States
| | - Amanda E. Hargrove
- Department of Chemistry, Duke University, Durham, North Carolina 27708, United States
| |
Collapse
|
39
|
Jain S, Saju S, Petingi L, Schlick T. An extended dual graph library and partitioning algorithm applicable to pseudoknotted RNA structures. Methods 2019; 162-163:74-84. [PMID: 30928508 DOI: 10.1016/j.ymeth.2019.03.022] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2019] [Revised: 02/28/2019] [Accepted: 03/22/2019] [Indexed: 12/18/2022] Open
Abstract
Exploring novel RNA topologies is imperative for understanding RNA structure and pursuing its design. Our RNA-As-Graphs (RAG) approach exploits graph theory tools and uses coarse-grained tree and dual graphs to represent RNA helices and loops by vertices and edges. Only dual graphs represent pseudoknotted RNAs fully. Here we develop a dual graph enumeration algorithm to generate an expanded library of dual graph topologies for 2-9 vertices, and extend our dual graph partitioning algorithm to identify all possible RNA subgraphs. Our enumeration algorithm connects smaller-vertex graphs, using all possible edge combinations, to build larger-vertex graphs and retain all non-isomorphic graph topologies, thereby more than doubling the size of our prior library to a total of 110,667 dual graph topologies. We apply our dual graph partitioning algorithm, which keeps pseudoknots and junctions intact, to all existing RNA structures to identify all possible substructures up to 9 vertices. In addition, our expanded dual graph library assigns graph topologies to all RNA graphs and subgraphs, rectifying prior inconsistencies. We update our RAG-3Dual database of RNA atomic fragments with all newly identified substructures and their graph IDs, increasing its size by more than 50 times. The enlarged dual graph library and RAG-3Dual database provide a comprehensive repertoire of graph topologies and atomic fragments to study yet undiscovered RNA molecules and design RNA sequences with novel topologies, including a variety of pseudoknotted RNAs.
Collapse
Affiliation(s)
- Swati Jain
- Department of Chemistry, New York University, 1021 Silver, 100 Washington Square East, New York, NY 10003, USA
| | - Sera Saju
- Department of Chemistry, New York University, 1021 Silver, 100 Washington Square East, New York, NY 10003, USA
| | - Louis Petingi
- Computer Science Department, College of Staten Island, City University of New York, Staten Island, New York, NY 10314, USA
| | - Tamar Schlick
- Department of Chemistry, New York University, 1021 Silver, 100 Washington Square East, New York, NY 10003, USA; Courant Institute of Mathematical Sciences, New York University, 251 Mercer Street, New York, NY 10012, USA; NYU-East China Normal University Center for Computational Chemistry at New York University Shanghai, Room 340, Geography Building, North Zhongshan Road, 3663 Shanghai, China.
| |
Collapse
|
40
|
Andrews RJ, Roche J, Moss WN. ScanFold: an approach for genome-wide discovery of local RNA structural elements-applications to Zika virus and HIV. PeerJ 2018; 6:e6136. [PMID: 30627482 PMCID: PMC6317755 DOI: 10.7717/peerj.6136] [Citation(s) in RCA: 51] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2018] [Accepted: 11/15/2018] [Indexed: 12/24/2022] Open
Abstract
In addition to encoding RNA primary structures, genomes also encode RNA secondary and tertiary structures that play roles in gene regulation and, in the case of RNA viruses, genome replication. Methods for the identification of functional RNA structures in genomes typically rely on scanning analysis windows, where multiple partially-overlapping windows are used to predict RNA structures and folding metrics to deduce regions likely to form functional structure. Separate structural models are produced for each window, where the step size can greatly affect the returned model. This makes deducing unique local structures challenging, as the same nucleotides in each window can be alternatively base paired. We are presenting here a new approach where all base pairs from analysis windows are considered and weighted by favorable folding. This results in unique base pairing throughout the genome and the generation of local regions/structures that can be ranked by their propensity to form unusually thermodynamically stable folds. We applied this approach to the Zika virus (ZIKV) and HIV-1 genomes. ZIKV is linked to a variety of neurological ailments including microcephaly and Guillain-Barré syndrome and its (+)-sense RNA genome encodes two, previously described, functionally essential structured RNA regions. HIV, the cause of AIDS, contains multiple functional RNA motifs in its genome, which have been extensively studied. Our approach is able to successfully identify and model the structures of known functional motifs in both viruses, while also finding additional regions likely to form functional structures. All data have been archived at the RNAStructuromeDB (www.structurome.bb.iastate.edu), a repository of RNA folding data for humans and their pathogens.
Collapse
Affiliation(s)
- Ryan J. Andrews
- Roy J. Carver Department of Biophysics, Biochemistry and Molecular Biology, Iowa State University, Ames, IA, USA
| | - Julien Roche
- Roy J. Carver Department of Biophysics, Biochemistry and Molecular Biology, Iowa State University, Ames, IA, USA
| | - Walter N. Moss
- Roy J. Carver Department of Biophysics, Biochemistry and Molecular Biology, Iowa State University, Ames, IA, USA
| |
Collapse
|
41
|
Jin L, Shi YZ, Feng CJ, Tan YL, Tan ZJ. Modeling Structure, Stability, and Flexibility of Double-Stranded RNAs in Salt Solutions. Biophys J 2018; 115:1403-1416. [PMID: 30236782 DOI: 10.1016/j.bpj.2018.08.030] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2018] [Revised: 08/10/2018] [Accepted: 08/24/2018] [Indexed: 11/16/2022] Open
Abstract
Double-stranded (ds) RNAs play essential roles in many processes of cell metabolism. The knowledge of three-dimensional (3D) structure, stability, and flexibility of dsRNAs in salt solutions is important for understanding their biological functions. In this work, we further developed our previously proposed coarse-grained model to predict 3D structure, stability, and flexibility for dsRNAs in monovalent and divalent ion solutions through involving an implicit structure-based electrostatic potential. The model can make reliable predictions for 3D structures of extensive dsRNAs with/without bulge/internal loops from their sequences, and the involvement of the structure-based electrostatic potential and corresponding ion condition can improve the predictions for 3D structures of dsRNAs in ion solutions. Furthermore, the model can make good predictions for thermal stability for extensive dsRNAs over the wide range of monovalent/divalent ion concentrations, and our analyses show that the thermally unfolding pathway of dsRNA is generally dependent on its length as well as its sequence. In addition, the model was employed to examine the salt-dependent flexibility of a dsRNA helix, and the calculated salt-dependent persistence lengths are in good accordance with experiments.
Collapse
Affiliation(s)
- Lei Jin
- Center for Theoretical Physics and Key Laboratory of Artificial Micro- & Nanostructures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China
| | - Ya-Zhou Shi
- Research Center of Nonlinear Science, School of Mathematics and Computer Science, Wuhan Textile University, Wuhan, China
| | - Chen-Jie Feng
- Center for Theoretical Physics and Key Laboratory of Artificial Micro- & Nanostructures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China
| | - Ya-Lan Tan
- Center for Theoretical Physics and Key Laboratory of Artificial Micro- & Nanostructures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China
| | - Zhi-Jie Tan
- Center for Theoretical Physics and Key Laboratory of Artificial Micro- & Nanostructures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China.
| |
Collapse
|
42
|
Jain S, Laederach A, Ramos SBV, Schlick T. A pipeline for computational design of novel RNA-like topologies. Nucleic Acids Res 2018; 46:7040-7051. [PMID: 30137633 PMCID: PMC6101589 DOI: 10.1093/nar/gky524] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2017] [Revised: 05/22/2018] [Accepted: 05/24/2018] [Indexed: 12/11/2022] Open
Abstract
Designing novel RNA topologies is a challenge, with important therapeutic and industrial applications. We describe a computational pipeline for design of novel RNA topologies based on our coarse-grained RNA-As-Graphs (RAG) framework. RAG represents RNA structures as tree graphs and describes RNA secondary (2D) structure topologies (currently up to 13 vertices, ≈260 nucleotides). We have previously identified novel graph topologies that are RNA-like among these. Here we describe a systematic design pipeline and illustrate design for six broad design problems using recently developed tools for graph-partitioning and fragment assembly (F-RAG). Following partitioning of the target graph, corresponding atomic fragments from our RAG-3D database are combined using F-RAG, and the candidate atomic models are scored using a knowledge-based potential developed for 3D structure prediction. The sequences of the top scoring models are screened further using available tools for 2D structure prediction. The results indicate that our modular approach based on RNA-like topologies rather than specific 2D structures allows for greater flexibility in the design process, and generates a large number of candidate sequences quickly. Experimental structure probing using SHAPE-MaP for two sequences agree with our predictions and suggest that our combined tools yield excellent candidates for further sequence and experimental screening.
Collapse
Affiliation(s)
- Swati Jain
- Department of Chemistry, New York University, 1001 Silver, 100 Washington Square East, New York, NY 10003, USA
| | - Alain Laederach
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Silvia B V Ramos
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Tamar Schlick
- Department of Chemistry, New York University, 1001 Silver, 100 Washington Square East, New York, NY 10003, USA
- Courant Institute of Mathematical Sciences, New York University, 251 Mercer Street, New York, NY 10012, USA
- NYU-ECNU Center for Computational Chemistry at New York University Shanghai, Room 340, Geography Building, North Zhongshan Road, 3663 Shanghai, China
| |
Collapse
|
43
|
Jain S, Bayrak CS, Petingi L, Schlick T. Dual Graph Partitioning Highlights a Small Group of Pseudoknot-Containing RNA Submotifs. Genes (Basel) 2018; 9:E371. [PMID: 30044451 PMCID: PMC6115904 DOI: 10.3390/genes9080371] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2018] [Revised: 06/26/2018] [Accepted: 06/26/2018] [Indexed: 12/31/2022] Open
Abstract
RNA molecules are composed of modular architectural units that define their unique structural and functional properties. Characterization of these building blocks can help interpret RNA structure/function relationships. We present an RNA secondary structure motif and submotif library using dual graph representation and partitioning. Dual graphs represent RNA helices as vertices and loops as edges. Unlike tree graphs, dual graphs can represent RNA pseudoknots (intertwined base pairs). For a representative set of RNA structures, we construct dual graphs from their secondary structures, and apply our partitioning algorithm to identify non-separable subgraphs (or blocks) without breaking pseudoknots. We report 56 subgraph blocks up to nine vertices; among them, 22 are frequently occurring, 15 of which contain pseudoknots. We then catalog atomic fragments corresponding to the subgraph blocks to define a library of building blocks that can be used for RNA design, which we call RAG-3Dual, as we have done for tree graphs. As an application, we analyze the distribution of these subgraph blocks within ribosomal RNAs of various prokaryotic and eukaryotic species to identify common subgraphs and possible ancestry relationships. Other applications of dual graph partitioning and motif library can be envisioned for RNA structure analysis and design.
Collapse
Affiliation(s)
- Swati Jain
- Department of Chemistry, New York University, New York, NY 10003, USA.
| | - Cigdem S Bayrak
- Department of Chemistry, New York University, New York, NY 10003, USA.
| | - Louis Petingi
- Computer Science Department, College of Staten Island, City University of New York, Staten Island, New York, NY 10314, USA.
| | - Tamar Schlick
- Department of Chemistry, New York University, New York, NY 10003, USA.
- Courant Institute of Mathematical Sciences, New York University, New York, NY 10012, USA.
- NYU-East China Normal University Center for Computational Chemistry, New York University Shanghai, Shanghai 3663, China.
| |
Collapse
|
44
|
Abstract
The structure of RNA has been a natural subject for mathematical modeling, inviting many innovative computational frameworks. This single-stranded polynucleotide chain can fold upon itself in numerous ways to form hydrogen-bonded segments, imperfect with single-stranded loops. Illustrating these paired and non-paired interaction networks, known as RNA's secondary (2D) structure, using mathematical graph objects has been illuminating for RNA structure analysis. Building upon such seminal work from the 1970s and 1980s, graph models are now used to study not only RNA structure but also describe RNA's recurring modular units, sample the conformational space accessible to RNAs, predict RNA's three-dimensional folds, and apply the combined aspects to novel RNA design. In this article, we outline the development of the RNA-As-Graphs (or RAG) approach and highlight current applications to RNA structure prediction and design.
Collapse
Affiliation(s)
- Tamar Schlick
- Department of Chemistry, 100 Washington Square East, Silver Building, New York University, New York, NY 10003, USA; Courant Institute of Mathematical Sciences, New York University, 251 Mercer St., New York, NY 10012, USA; New York University ECNU - Center for Computational Chemistry at NYU Shanghai, 3663 North Zhongshan Road, Shanghai, 200062, China.
| |
Collapse
|
45
|
Shi YZ, Jin L, Feng CJ, Tan YL, Tan ZJ. Predicting 3D structure and stability of RNA pseudoknots in monovalent and divalent ion solutions. PLoS Comput Biol 2018; 14:e1006222. [PMID: 29879103 PMCID: PMC6007934 DOI: 10.1371/journal.pcbi.1006222] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2018] [Revised: 06/19/2018] [Accepted: 05/22/2018] [Indexed: 01/30/2023] Open
Abstract
RNA pseudoknots are a kind of minimal RNA tertiary structural motifs, and their three-dimensional (3D) structures and stability play essential roles in a variety of biological functions. Therefore, to predict 3D structures and stability of RNA pseudoknots is essential for understanding their functions. In the work, we employed our previously developed coarse-grained model with implicit salt to make extensive predictions and comprehensive analyses on the 3D structures and stability for RNA pseudoknots in monovalent/divalent ion solutions. The comparisons with available experimental data show that our model can successfully predict the 3D structures of RNA pseudoknots from their sequences, and can also make reliable predictions for the stability of RNA pseudoknots with different lengths and sequences over a wide range of monovalent/divalent ion concentrations. Furthermore, we made comprehensive analyses on the unfolding pathway for various RNA pseudoknots in ion solutions. Our analyses for extensive pseudokonts and the wide range of monovalent/divalent ion concentrations verify that the unfolding pathway of RNA pseudoknots is mainly dependent on the relative stability of unfolded intermediate states, and show that the unfolding pathway of RNA pseudoknots can be significantly modulated by their sequences and solution ion conditions.
Collapse
Affiliation(s)
- Ya-Zhou Shi
- Research Center of Nonlinear Science, School of Mathematics and Computer Science, Wuhan Textile University, Wuhan, China
- Department of Physics and Key Laboratory of Artificial Micro- and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China
| | - Lei Jin
- Department of Physics and Key Laboratory of Artificial Micro- and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China
| | - Chen-Jie Feng
- Department of Physics and Key Laboratory of Artificial Micro- and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China
| | - Ya-Lan Tan
- Department of Physics and Key Laboratory of Artificial Micro- and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China
| | - Zhi-Jie Tan
- Department of Physics and Key Laboratory of Artificial Micro- and Nano-structures of Ministry of Education, School of Physics and Technology, Wuhan University, Wuhan, China
| |
Collapse
|
46
|
Jain S, Schlick T. F-RAG: Generating Atomic Coordinates from RNA Graphs by Fragment Assembly. J Mol Biol 2017; 429:3587-3605. [PMID: 28988954 PMCID: PMC5693719 DOI: 10.1016/j.jmb.2017.09.017] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2017] [Revised: 09/12/2017] [Accepted: 09/22/2017] [Indexed: 10/18/2022]
Abstract
Coarse-grained models represent attractive approaches to analyze and simulate ribonucleic acid (RNA) molecules, for example, for structure prediction and design, as they simplify the RNA structure to reduce the conformational search space. Our structure prediction protocol RAGTOP (RNA-As-Graphs Topology Prediction) represents RNA structures as tree graphs and samples graph topologies to produce candidate graphs. However, for a more detailed study and analysis, construction of atomic from coarse-grained models is required. Here we present our graph-based fragment assembly algorithm (F-RAG) to convert candidate three-dimensional (3D) tree graph models, produced by RAGTOP into atomic structures. We use our related RAG-3D utilities to partition graphs into subgraphs and search for structurally similar atomic fragments in a data set of RNA 3D structures. The fragments are edited and superimposed using common residues, full atomic models are scored using RAGTOP's knowledge-based potential, and geometries of top scoring models is optimized. To evaluate our models, we assess all-atom RMSDs and Interaction Network Fidelity (a measure of residue interactions) with respect to experimentally solved structures and compare our results to other fragment assembly programs. For a set of 50 RNA structures, we obtain atomic models with reasonable geometries and interactions, particularly good for RNAs containing junctions. Additional improvements to our protocol and databases are outlined. These results provide a good foundation for further work on RNA structure prediction and design applications.
Collapse
Affiliation(s)
- Swati Jain
- Department of Chemistry, New York University, 1001 Silver, 100 Washington Square East, New York, NY 10003, USA
| | - Tamar Schlick
- Department of Chemistry, New York University, 1001 Silver, 100 Washington Square East, New York, NY 10003, USA; Courant Institute of Mathematical Sciences, New York University, 251 Mercer Street, New York, NY 10012, USA; New York University-East China Normal University Center for Computational Chemistry at New York University Shanghai, Room 340, Geography Building, North Zhongshan Road, 3663 Shanghai, China.
| |
Collapse
|
47
|
Biomolecular coevolution and its applications: Going from structure prediction toward signaling, epistasis, and function. Biochem Soc Trans 2017; 45:1253-1261. [DOI: 10.1042/bst20170063] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2017] [Revised: 08/30/2017] [Accepted: 09/04/2017] [Indexed: 01/01/2023]
Abstract
Evolution leads to considerable changes in the sequence of biomolecules, while their overall structure and function remain quite conserved. The wealth of genomic sequences, the ‘Biological Big Data’, modern sequencing techniques provide allows us to investigate biomolecular evolution with unprecedented detail. Sophisticated statistical models can infer residue pair mutations resulting from spatial proximity. The introduction of predicted spatial adjacencies as constraints in biomolecular structure prediction workflows has transformed the field of protein and RNA structure prediction toward accuracies approaching the experimental resolution limit. Going beyond structure prediction, the same mathematical framework allows mimicking evolutionary fitness landscapes to infer signaling interactions, epistasis, or mutational landscapes.
Collapse
|
48
|
Morgan BS, Forte JE, Culver RN, Zhang Y, Hargrove AE. Discovery of Key Physicochemical, Structural, and Spatial Properties of RNA-Targeted Bioactive Ligands. Angew Chem Int Ed Engl 2017; 56:13498-13502. [PMID: 28810078 PMCID: PMC5752130 DOI: 10.1002/anie.201707641] [Citation(s) in RCA: 78] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2017] [Indexed: 01/20/2023]
Abstract
While a myriad non-coding RNAs are known to be essential in cellular processes and misregulated in diseases, the development of RNA-targeted small molecule probes has met with limited success. To elucidate the guiding principles for selective small molecule/RNA recognition, we analyzed cheminformatic and shape-based descriptors for 104 RNA-targeted ligands with demonstrated biological activity (RNA-targeted BIoactive ligaNd Database, R-BIND). We then compared R-BIND to both FDA-approved small molecule drugs and RNA ligands without reported bioactivity. Several striking trends emerged for bioactive RNA ligands, including: 1) Compliance to medicinal chemistry rules, 2) distinctive structural features, and 3) enrichment in rod-like shapes over others. This work provides unique insights that directly facilitate the selection and synthesis of RNA-targeted libraries with the goal of efficiently identifying selective small molecule ligands for therapeutically relevant RNAs.
Collapse
Affiliation(s)
- Brittany S Morgan
- Department of Chemistry, Duke University, Durham, NC, 27708-0346, USA
| | - Jordan E Forte
- Department of Chemistry, Duke University, Durham, NC, 27708-0346, USA
| | - Rebecca N Culver
- Department of Chemistry, Duke University, Durham, NC, 27708-0346, USA
| | - Yuqi Zhang
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA, 92037, USA
| | - Amanda E Hargrove
- Department of Chemistry, Duke University, Durham, NC, 27708-0346, USA
| |
Collapse
|
49
|
Bayrak CS, Kim N, Schlick T. Using sequence signatures and kink-turn motifs in knowledge-based statistical potentials for RNA structure prediction. Nucleic Acids Res 2017; 45:5414-5422. [PMID: 28158755 PMCID: PMC5435971 DOI: 10.1093/nar/gkx045] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2016] [Accepted: 01/22/2017] [Indexed: 12/15/2022] Open
Abstract
Kink turns are widely occurring motifs in RNA, located in internal loops and associated with many biological functions including translation, regulation and splicing. The associated sequence pattern, a 3-nt bulge and G-A, A-G base-pairs, generates an angle of ∼50° along the helical axis due to A-minor interactions. The conserved sequence and distinct secondary structures of kink-turns (k-turn) suggest computational folding rules to predict k-turn-like topologies from sequence. Here, we annotate observed k-turn motifs within a non-redundant RNA dataset based on sequence signatures and geometrical features, analyze bending and torsion angles, and determine distinct knowledge-based potentials with and without k-turn motifs. We apply these scoring potentials to our RAGTOP (RNA-As-Graph-Topologies) graph sampling protocol to construct and sample coarse-grained graph representations of RNAs from a given secondary structure. We present graph-sampling results for 35 RNAs, including 12 k-turn and 23 non k-turn internal loops, and compare the results to solved structures and to RAGTOP results without special k-turn potentials. Significant improvements are observed with the updated scoring potentials compared to the k-turn-free potentials. Because k-turns represent a classic example of sequence/structure motif, our study suggests that other such motifs with sequence signatures and unique geometrical features can similarly be utilized for RNA structure prediction and design.
Collapse
Affiliation(s)
- Cigdem Sevim Bayrak
- Department of Chemistry and Courant Institute of Mathematical Sciences, New York University, 251 Mercer Street, New York, NY 10012, USA
| | - Namhee Kim
- Department of Chemistry and Courant Institute of Mathematical Sciences, New York University, 251 Mercer Street, New York, NY 10012, USA
| | - Tamar Schlick
- Department of Chemistry and Courant Institute of Mathematical Sciences, New York University, 251 Mercer Street, New York, NY 10012, USA
| |
Collapse
|
50
|
Morgan BS, Forte JE, Culver RN, Zhang Y, Hargrove AE. Discovery of Key Physicochemical, Structural, and Spatial Properties of RNA-Targeted Bioactive Ligands. Angew Chem Int Ed Engl 2017. [DOI: 10.1002/ange.201707641] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Affiliation(s)
| | - Jordan E. Forte
- Department of Chemistry; Duke University; Durham NC 27708-0346 USA
| | | | - Yuqi Zhang
- Department of Integrative Structural and Computational Biology; The Scripps Research Institute; La Jolla CA 92037 USA
| | | |
Collapse
|