1
|
Sabei A, Hognon C, Martin J, Frezza E. Dynamics of Protein-RNA Interfaces Using All-Atom Molecular Dynamics Simulations. J Phys Chem B 2024; 128:4865-4886. [PMID: 38740056 DOI: 10.1021/acs.jpcb.3c07698] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
Facing the current challenges posed by human health diseases requires the understanding of cell machinery at a molecular level. The interplay between proteins and RNA is key for any physiological phenomenon, as well protein-RNA interactions. To understand these interactions, many experimental techniques have been developed, spanning a very wide range of spatial and temporal resolutions. In particular, the knowledge of tridimensional structures of protein-RNA complexes provides structural, mechanical, and dynamical pieces of information essential to understand their functions. To get insights into the dynamics of protein-RNA complexes, we carried out all-atom molecular dynamics simulations in explicit solvent on nine different protein-RNA complexes with different functions and interface size by taking into account the bound and unbound forms. First, we characterized structural changes upon binding and, for the RNA part, the change in the puckering. Second, we extensively analyzed the interfaces, their dynamics and structural properties, and the structural waters involved in the binding, as well as the contacts mediated by them. Based on our analysis, the interfaces rearranged during the simulation time showing alternative and stable residue-residue contacts with respect to the experimental structure.
Collapse
Affiliation(s)
- Afra Sabei
- Université Paris Cité, CiTCoM, CNRS, Paris F-75006, France
| | - Cécilia Hognon
- Université Paris Cité, CiTCoM, CNRS, Paris F-75006, France
| | - Juliette Martin
- Univ Lyon, Université Claude Bernard Lyon 1, CNRS, UMR 5086 MMSB, Lyon 69367, France
- Laboratory of Biology and Modeling of the Cell, Université de Lyon, ENS de Lyon, Université Claude Bernard, CNRS UMR 5239, Inserm U1293, Lyon 69367, France
| | - Elisa Frezza
- Université Paris Cité, CiTCoM, CNRS, Paris F-75006, France
| |
Collapse
|
2
|
Rey Redondo E, Xu Y, Yung CCM. Genomic characterisation and ecological distribution of Mantoniella tinhauana: a novel Mamiellophycean green alga from the Western Pacific. Front Microbiol 2024; 15:1358574. [PMID: 38774501 PMCID: PMC11106453 DOI: 10.3389/fmicb.2024.1358574] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Accepted: 04/12/2024] [Indexed: 05/24/2024] Open
Abstract
Mamiellophyceae are dominant marine algae in much of the ocean, the most prevalent genera belonging to the order Mamiellales: Micromonas, Ostreococcus and Bathycoccus, whose genetics and global distributions have been extensively studied. Conversely, the genus Mantoniella, despite its potential ecological importance, remains relatively under-characterised. In this study, we isolated and characterised a novel species of Mamiellophyceae, Mantoniella tinhauana, from subtropical coastal waters in the South China Sea. Morphologically, it resembles other Mantoniella species; however, a comparative analysis of the 18S and ITS2 marker genes revealed its genetic distinctiveness. Furthermore, we sequenced and assembled the first genome of Mantoniella tinhauana, uncovering significant differences from previously studied Mamiellophyceae species. Notably, the genome lacked any detectable outlier chromosomes and exhibited numerous unique orthogroups. We explored gene groups associated with meiosis, scale and flagella formation, shedding light on species divergence, yet further investigation is warranted. To elucidate the biogeography of Mantoniella tinhauana, we conducted a comprehensive analysis using global metagenomic read mapping to the newly sequenced genome. Our findings indicate this species exhibits a cosmopolitan distribution with a low-level prevalence worldwide. Understanding the intricate dynamics between Mamiellophyceae and the environment is crucial for comprehending their impact on the ocean ecosystem and accurately predicting their response to forthcoming environmental changes.
Collapse
Affiliation(s)
| | | | - Charmaine Cheuk Man Yung
- Department of Ocean Science, The Hong Kong University of Science and Technology, Kowloon, Hong Kong SAR, China
| |
Collapse
|
3
|
Bu F, Lin X, Liao W, Lu Z, He Y, Luo Y, Peng X, Li M, Huang Y, Chen X, Xiao B, Jiang J, Deng J, Huang J, Lin T, Miao Z, Huang L. Ribocentre-switch: a database of riboswitches. Nucleic Acids Res 2024; 52:D265-D272. [PMID: 37855663 PMCID: PMC10767811 DOI: 10.1093/nar/gkad891] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Revised: 09/19/2023] [Accepted: 10/04/2023] [Indexed: 10/20/2023] Open
Abstract
Riboswitches are regulatory elements found in the untranslated regions (UTRs) of certain mRNA molecules. They typically comprise two distinct domains: an aptamer domain that can bind to specific small molecules, and an expression platform that controls gene expression. Riboswitches work by undergoing a conformational change upon binding to their specific ligand, thus activating or repressing the genes downstream. This mechanism allows gene expression regulation in response to metabolites or small molecules. To systematically summarise riboswitch structures and their related ligand binding functions, we present Ribocentre-switch, a comprehensive database of riboswitches, including the information as follows: sequences, structures, functions, ligand binding pockets and biological applications. It encompasses 56 riboswitches and 26 orphan riboswitches from over 430 references, with a total of 89 591 sequences. It serves as a good resource for comparing different riboswitches and facilitating the identification of potential riboswitch candidates. Therefore, it may facilitate the understanding of RNA structural conformational changes in response to ligand signaling. The database is publicly available at https://riboswitch.ribocentre.org.
Collapse
Affiliation(s)
- Fan Bu
- Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei 230027, China
- GMU-GIBH Joint School of Life Sciences, The Guangdong-Hong Kong-Macau Joint Laboratory for Cell Fate Regulation and Diseases,Guangzhou National Laboratory, Medical University, Guangzhou 510180, China
| | - Xiaowei Lin
- Department of Urology, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou 510120, China
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Guangdong-Hong Kong Joint Laboratory for RNA Medicine, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou 510120, China
| | - Wenjian Liao
- Department of Urology, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou 510120, China
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Guangdong-Hong Kong Joint Laboratory for RNA Medicine, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou 510120, China
| | - Zhizhong Lu
- School of Life Sciences and Biopharmaceutics, Guangdong Pharmaceutical University, Guangzhou 510006, China
| | - Yuanlin He
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Guangdong-Hong Kong Joint Laboratory for RNA Medicine, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou 510120, China
| | - Yuhang Luo
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Guangdong-Hong Kong Joint Laboratory for RNA Medicine, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou 510120, China
| | - Xuemei Peng
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Guangdong-Hong Kong Joint Laboratory for RNA Medicine, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou 510120, China
| | - Mengxiao Li
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Guangdong-Hong Kong Joint Laboratory for RNA Medicine, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou 510120, China
| | - Yuanyin Huang
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Guangdong-Hong Kong Joint Laboratory for RNA Medicine, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou 510120, China
| | - Xiaoxue Chen
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Guangdong-Hong Kong Joint Laboratory for RNA Medicine, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou 510120, China
| | - Bowen Xiao
- GMU-GIBH Joint School of Life Sciences, The Guangdong-Hong Kong-Macau Joint Laboratory for Cell Fate Regulation and Diseases,Guangzhou National Laboratory, Medical University, Guangzhou 510180, China
| | - Jiuhong Jiang
- GMU-GIBH Joint School of Life Sciences, The Guangdong-Hong Kong-Macau Joint Laboratory for Cell Fate Regulation and Diseases,Guangzhou National Laboratory, Medical University, Guangzhou 510180, China
| | - Jie Deng
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Guangdong-Hong Kong Joint Laboratory for RNA Medicine, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou 510120, China
| | - Jian Huang
- Department of Urology, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou 510120, China
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Guangdong-Hong Kong Joint Laboratory for RNA Medicine, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou 510120, China
| | - Tianxin Lin
- Department of Urology, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou 510120, China
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Guangdong-Hong Kong Joint Laboratory for RNA Medicine, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou 510120, China
| | - Zhichao Miao
- Shanghai Key Laboratory of Anesthesiology and Brain Functional Modulation, Clinical Research Center for Anesthesiology and Perioperative Medicine, Translational Research Institute of Brain and Brain-Like Intelligence, Shanghai Fourth People's Hospital, School of Medicine, Tongji University, Shanghai 200434, China
- GMU-GIBH Joint School of Life Sciences, The Guangdong-Hong Kong-Macau Joint Laboratory for Cell Fate Regulation and Diseases,Guangzhou National Laboratory, Medical University, Guangzhou 510180, China
| | - Lin Huang
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Guangdong-Hong Kong Joint Laboratory for RNA Medicine, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou 510120, China
| |
Collapse
|
4
|
Thiel BC, Poblete S, Hofacker IL. The Multiscale Ernwin/SPQR RNA Structure Prediction Pipeline. Methods Mol Biol 2024; 2726:377-399. [PMID: 38780739 DOI: 10.1007/978-1-0716-3519-3_15] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/25/2024]
Abstract
Aside from the well-known role in protein synthesis, RNA can perform catalytic, regulatory, and other essential biological functions which are determined by its three-dimensional structure. In this regard, a great effort has been made during the past decade to develop computational tools for the prediction of the structure of RNAs from the knowledge of their sequence, incorporating experimental data to refine or guide the modeling process. Nevertheless, this task can become exceptionally challenging when dealing with long noncoding RNAs, constituted by more than 200 nucleotides, due to their large size and the specific interactions involved. In this chapter, we describe a multiscale approach to predict such structures, incorporating SAXS experimental data into a hierarchical procedure which couples two coarse-grained representations: Ernwin, a helix-based approach, which deals with the global arrangement of secondary structure elements, and SPQR, a nucleotide-centered coarse-grained model, which corrects and refines the structures predicted at the coarser level.We describe the methodology through its application on the Braveheart long noncoding RNA, starting from the SAXS and secondary structure data to propose a refined, all-atom structure.
Collapse
Affiliation(s)
- Bernhard C Thiel
- Department of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Vienna, Austria
| | - Simón Poblete
- Instituto de Ciencias Físicas y Matemáticas, Universidad Austral de Chile, Valdivia, Chile
- Computational Biology Lab, Fundación Ciencia & Vida, Santiago, Chile
- Facultad de Ingeniería, Arquitectura y Diseño, Universidad SanSebastián, Santiago, Chile
| | - Ivo L Hofacker
- Department of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Vienna, Austria.
- Research Group Bioinformatics and Computational Biology, Faculty of Computer Science, University of Vienna, Vienna, Austria.
| |
Collapse
|
5
|
Sarrazin-Gendron R, Waldispühl J, Reinharz V. Classification and Identification of Non-canonical Base Pairs and Structural Motifs. Methods Mol Biol 2024; 2726:143-168. [PMID: 38780731 DOI: 10.1007/978-1-0716-3519-3_7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/25/2024]
Abstract
The 3D structures of many ribonucleic acid (RNA) loops are characterized by highly organized networks of non-canonical interactions. Multiple computational methods have been developed to annotate structures with those interactions or automatically identify recurrent interaction networks. By contrast, the reverse problem that aims to retrieve the geometry of a look from its sequence or ensemble of interactions remains much less explored. In this chapter, we will describe how to retrieve and build families of conserved structural motifs using their underlying network of non-canonical interactions. Then, we will show how to assign sequence alignments to those families and use the software BayesPairing to build statistical models of structural motifs with their associated sequence alignments. From this model, we will apply BayesPairing to identify in new sequences regions where those loop geometries can occur.
Collapse
Affiliation(s)
| | | | - Vladimir Reinharz
- Department of Computer Science, Université du Québec à Montréal, Montreal, QC, Canada.
| |
Collapse
|
6
|
Das R, Kretsch RC, Simpkin AJ, Mulvaney T, Pham P, Rangan R, Bu F, Keegan RM, Topf M, Rigden DJ, Miao Z, Westhof E. Assessment of three-dimensional RNA structure prediction in CASP15. Proteins 2023; 91:1747-1770. [PMID: 37876231 PMCID: PMC10841292 DOI: 10.1002/prot.26602] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Revised: 08/21/2023] [Accepted: 09/07/2023] [Indexed: 10/26/2023]
Abstract
The prediction of RNA three-dimensional structures remains an unsolved problem. Here, we report assessments of RNA structure predictions in CASP15, the first CASP exercise that involved RNA structure modeling. Forty-two predictor groups submitted models for at least one of twelve RNA-containing targets. These models were evaluated by the RNA-Puzzles organizers and, separately, by a CASP-recruited team using metrics (GDT, lDDT) and approaches (Z-score rankings) initially developed for assessment of proteins and generalized here for RNA assessment. The two assessments independently ranked the same predictor groups as first (AIchemy_RNA2), second (Chen), and third (RNAPolis and GeneSilico, tied); predictions from deep learning approaches were significantly worse than these top ranked groups, which did not use deep learning. Further analyses based on direct comparison of predicted models to cryogenic electron microscopy (cryo-EM) maps and x-ray diffraction data support these rankings. With the exception of two RNA-protein complexes, models submitted by CASP15 groups correctly predicted the global fold of the RNA targets. Comparisons of CASP15 submissions to designed RNA nanostructures as well as molecular replacement trials highlight the potential utility of current RNA modeling approaches for RNA nanotechnology and structural biology, respectively. Nevertheless, challenges remain in modeling fine details such as noncanonical pairs, in ranking among submitted models, and in prediction of multiple structures resolved by cryo-EM or crystallography.
Collapse
Affiliation(s)
- Rhiju Das
- Department of Biochemistry, Stanford University School of Medicine, CA USA
- Biophysics Program, Stanford University School of Medicine, CA USA
- Howard Hughes Medical Institute, Stanford University, CA USA
| | | | - Adam J. Simpkin
- Institute of Systems, Molecular & Integrative Biology, The University of Liverpool, UK
| | - Thomas Mulvaney
- Centre for Structural Systems Biology (CSSB), Leibniz-Institut für Virologie (LIV), Hamburg, Germany
- University Medical Center Hamburg-Eppendorf (UKE), Hamburg, Germany
| | - Phillip Pham
- Department of Biochemistry, Stanford University School of Medicine, CA USA
| | - Ramya Rangan
- Biophysics Program, Stanford University School of Medicine, CA USA
| | - Fan Bu
- Guangzhou Laboratory, Guangzhou International Bio Island, Guangzhou 510005, China
- Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei 230036, Anhui, China
| | - Ronan M. Keegan
- Institute of Systems, Molecular & Integrative Biology, The University of Liverpool, UK
- Life Science, Diamond Light Source, Harwell Science, UK
| | - Maya Topf
- Centre for Structural Systems Biology (CSSB), Leibniz-Institut für Virologie (LIV), Hamburg, Germany
- University Medical Center Hamburg-Eppendorf (UKE), Hamburg, Germany
| | - Daniel J. Rigden
- Institute of Systems, Molecular & Integrative Biology, The University of Liverpool, UK
| | - Zhichao Miao
- GMU-GIBH Joint School of Life Sciences, The Guangdong-Hong Kong-Macau Joint Laboratory for Cell Fate Regulation and Diseases, Guangzhou National Laboratory, Guangzhou Medical University
- Shanghai Key Laboratory of Anesthesiology and Brain Functional Modulation, Clinical Research Center for Anesthesiology and Perioperative Medicine, Translational Research Institute of Brain and Brain-Like Intelligence, Shanghai Fourth People's Hospital, School of Medicine, Tongji University, Shanghai 200434, China
| | - Eric Westhof
- Architecture et Réactivité de l’ARN, Institut de Biologie Moléculaire et Cellulaire du CNRS, Université de Strasbourg, F-67084, Strasbourg, France
| |
Collapse
|
7
|
Kryshtafovych A, Antczak M, Szachniuk M, Zok T, Kretsch RC, Rangan R, Pham P, Das R, Robin X, Studer G, Durairaj J, Eberhardt J, Sweeney A, Topf M, Schwede T, Fidelis K, Moult J. New prediction categories in CASP15. Proteins 2023; 91:1550-1557. [PMID: 37306011 PMCID: PMC10713864 DOI: 10.1002/prot.26515] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2023] [Accepted: 05/10/2023] [Indexed: 06/13/2023]
Abstract
Prediction categories in the Critical Assessment of Structure Prediction (CASP) experiments change with the need to address specific problems in structure modeling. In CASP15, four new prediction categories were introduced: RNA structure, ligand-protein complexes, accuracy of oligomeric structures and their interfaces, and ensembles of alternative conformations. This paper lists technical specifications for these categories and describes their integration in the CASP data management system.
Collapse
Affiliation(s)
| | - Maciej Antczak
- Institute of Computing Science, Poznan University of TechnologyPoznanPoland
- Institute of Bioorganic Chemistry, Polish Academy of SciencesPoznanPoland
| | - Marta Szachniuk
- Institute of Computing Science, Poznan University of TechnologyPoznanPoland
- Institute of Bioorganic Chemistry, Polish Academy of SciencesPoznanPoland
| | - Tomasz Zok
- Institute of Computing Science, Poznan University of TechnologyPoznanPoland
- Institute of Bioorganic Chemistry, Polish Academy of SciencesPoznanPoland
| | - Rachael C. Kretsch
- Biophysics Program, Stanford University School of MedicineStanfordCaliforniaUSA
| | - Ramya Rangan
- Biophysics Program, Stanford University School of MedicineStanfordCaliforniaUSA
| | - Phillip Pham
- Biochemistry DepartmentStanford University School of MedicineStanfordCaliforniaUSA
| | - Rhiju Das
- Biochemistry DepartmentStanford University School of MedicineStanfordCaliforniaUSA
- Howard Hughes Medical Institute, Stanford UniversityStanfordCaliforniaUSA
| | - Xavier Robin
- Biozentrum, University of BaselBaselSwitzerland
- SIB Swiss Institute of BioinformaticsBaselSwitzerland
| | - Gabriel Studer
- Biozentrum, University of BaselBaselSwitzerland
- SIB Swiss Institute of BioinformaticsBaselSwitzerland
| | - Janani Durairaj
- Biozentrum, University of BaselBaselSwitzerland
- SIB Swiss Institute of BioinformaticsBaselSwitzerland
| | - Jerome Eberhardt
- Biozentrum, University of BaselBaselSwitzerland
- SIB Swiss Institute of BioinformaticsBaselSwitzerland
| | - Aaron Sweeney
- Centre for Structural Systems Biology (CSSB), Leibniz‐Institut für Virologie (LIV)HamburgGermany
| | - Maya Topf
- Centre for Structural Systems Biology (CSSB), Leibniz‐Institut für Virologie (LIV)HamburgGermany
- Universitätsklinikum Hamburg Eppendorf (UKE)HamburgGermany
| | - Torsten Schwede
- Biozentrum, University of BaselBaselSwitzerland
- SIB Swiss Institute of BioinformaticsBaselSwitzerland
| | | | - John Moult
- Institute for Bioscience and Biotechnology Research, Department of Cell Biology and Molecular genetics, University of MarylandRockvilleMarylandUSA
| |
Collapse
|
8
|
Singh M, Kumar S. Effect of single nucleotide polymorphisms on the structure of long noncoding RNAs and their interaction with RNA binding proteins. Biosystems 2023; 233:105021. [PMID: 37703988 DOI: 10.1016/j.biosystems.2023.105021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Revised: 07/25/2023] [Accepted: 09/06/2023] [Indexed: 09/15/2023]
Abstract
Long non-coding RNAs (lncRNA) are emerging as a new class of regulatory RNAs with remarkable potential to be utilized as therapeutic targets against many human diseases. Several genome-wide association studies (GWAS) have catalogued Single Nucleotide Polymorphisms (SNPs) present in the noncoding regions of the genome from where lncRNAs originate. In this study, we have selected 67 lncRNAs with GWAS-tagged SNPs and have also investigated their role in affecting the local secondary structures. Majority of the SNPs lead to changes in the secondary structure of lncRNAs to a different extent by altering the base pairing patterns. These structural changes in lncRNA are also manifested in form of alteration in the binding site for RNA binding proteins (RBPs) along with affecting their binding efficacies. Ultimately, these structural modifications may influence the transcriptional and post-transcriptional pathways of these RNAs, leading to the causation of diseases. Hence, it is important to understand the possible underlying mechanism of RBPs in association with GWAS-tagged SNPs in human diseases.
Collapse
Affiliation(s)
- Mandakini Singh
- Department of Life Science, National Institute of Technology, Rourkela, Odisha, 769008, India
| | - Santosh Kumar
- Department of Life Science, National Institute of Technology, Rourkela, Odisha, 769008, India.
| |
Collapse
|
9
|
Schneider B, Sweeney BA, Bateman A, Cerny J, Zok T, Szachniuk M. When will RNA get its AlphaFold moment? Nucleic Acids Res 2023; 51:9522-9532. [PMID: 37702120 PMCID: PMC10570031 DOI: 10.1093/nar/gkad726] [Citation(s) in RCA: 19] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2023] [Revised: 08/13/2023] [Accepted: 08/22/2023] [Indexed: 09/14/2023] Open
Abstract
The protein structure prediction problem has been solved for many types of proteins by AlphaFold. Recently, there has been considerable excitement to build off the success of AlphaFold and predict the 3D structures of RNAs. RNA prediction methods use a variety of techniques, from physics-based to machine learning approaches. We believe that there are challenges preventing the successful development of deep learning-based methods like AlphaFold for RNA in the short term. Broadly speaking, the challenges are the limited number of structures and alignments making data-hungry deep learning methods unlikely to succeed. Additionally, there are several issues with the existing structure and sequence data, as they are often of insufficient quality, highly biased and missing key information. Here, we discuss these challenges in detail and suggest some steps to remedy the situation. We believe that it is possible to create an accurate RNA structure prediction method, but it will require solving several data quality and volume issues, usage of data beyond simple sequence alignments, or the development of new less data-hungry machine learning methods.
Collapse
Affiliation(s)
- Bohdan Schneider
- Institute of Biotechnology of the Czech Academy of Sciences, Prumyslova 595, CZ-252 50 Vestec, Czech Republic
| | - Blake Alexander Sweeney
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, CB10 1SD, UK
| | - Alex Bateman
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, CB10 1SD, UK
| | - Jiri Cerny
- Institute of Biotechnology of the Czech Academy of Sciences, Prumyslova 595, CZ-252 50 Vestec, Czech Republic
| | - Tomasz Zok
- Institute of Computing Science and European Centre for Bioinformatics and Genomics, Poznan University of Technology, Piotrowo 2, 60-965 Poznan, Poland
| | - Marta Szachniuk
- Institute of Computing Science and European Centre for Bioinformatics and Genomics, Poznan University of Technology, Piotrowo 2, 60-965 Poznan, Poland
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznan, Poland
| |
Collapse
|
10
|
Das R, Kretsch RC, Simpkin AJ, Mulvaney T, Pham P, Rangan R, Bu F, Keegan RM, Topf M, Rigden DJ, Miao Z, Westhof E. Assessment of three-dimensional RNA structure prediction in CASP15. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.25.538330. [PMID: 37162955 PMCID: PMC10168427 DOI: 10.1101/2023.04.25.538330] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
The prediction of RNA three-dimensional structures remains an unsolved problem. Here, we report assessments of RNA structure predictions in CASP15, the first CASP exercise that involved RNA structure modeling. Forty two predictor groups submitted models for at least one of twelve RNA-containing targets. These models were evaluated by the RNA-Puzzles organizers and, separately, by a CASP-recruited team using metrics (GDT, lDDT) and approaches (Z-score rankings) initially developed for assessment of proteins and generalized here for RNA assessment. The two assessments independently ranked the same predictor groups as first (AIchemy_RNA2), second (Chen), and third (RNAPolis and GeneSilico, tied); predictions from deep learning approaches were significantly worse than these top ranked groups, which did not use deep learning. Further analyses based on direct comparison of predicted models to cryogenic electron microscopy (cryo-EM) maps and X-ray diffraction data support these rankings. With the exception of two RNA-protein complexes, models submitted by CASP15 groups correctly predicted the global fold of the RNA targets. Comparisons of CASP15 submissions to designed RNA nanostructures as well as molecular replacement trials highlight the potential utility of current RNA modeling approaches for RNA nanotechnology and structural biology, respectively. Nevertheless, challenges remain in modeling fine details such as non-canonical pairs, in ranking among submitted models, and in prediction of multiple structures resolved by cryo-EM or crystallography.
Collapse
Affiliation(s)
- Rhiju Das
- Department of Biochemistry, Stanford University School of Medicine, CA USA
- Biophysics Program, Stanford University School of Medicine, CA USA
- Howard Hughes Medical Institute, Stanford University, CA USA
| | | | - Adam J. Simpkin
- Institute of Systems, Molecular & Integrative Biology, The University of Liverpool, UK
| | - Thomas Mulvaney
- Centre for Structural Systems Biology (CSSB), Leibniz-Institut für Virologie (LIV)
- University Medical Center Hamburg-Eppendorf (UKE), Hamburg, Germany
| | - Phillip Pham
- Department of Biochemistry, Stanford University School of Medicine, CA USA
| | - Ramya Rangan
- Biophysics Program, Stanford University School of Medicine, CA USA
| | - Fan Bu
- Guangzhou Laboratory, Guangzhou International Bio Island, Guangzhou 510005, China
- Division of Life Sciences and Medicine,University of Science and Technology of China, Hefei 230036, Anhui, China
| | - Ronan M. Keegan
- Institute of Systems, Molecular & Integrative Biology, The University of Liverpool, UK
- Life Science, Diamond Light Source, Harwell Science, UK
| | - Maya Topf
- Centre for Structural Systems Biology (CSSB), Leibniz-Institut für Virologie (LIV)
- University Medical Center Hamburg-Eppendorf (UKE), Hamburg, Germany
| | - Daniel J. Rigden
- Institute of Systems, Molecular & Integrative Biology, The University of Liverpool, UK
| | - Zhichao Miao
- GMU-GIBH Joint School of Life Sciences, The Guangdong-Hong Kong-Macau Joint Laboratory for Cell Fate Regulation and Diseases, Guangzhou National Laboratory, Guangzhou Medical University
- Shanghai Key Laboratory of Anesthesiology and Brain Functional Modulation, Clinical Research Center for Anesthesiology and Perioperative Medicine, Translational Research Institute of Brain and Brain-Like Intelligence, Shanghai Fourth People’s Hospital, School of Medicine, Tongji University, Shanghai 200434, China
| | - Eric Westhof
- Architecture et Réactivité de l’ARN, Institut de Biologie Moléculaire et Cellulaire du CNRS, Université de Strasbourg, F-67084, Strasbourg, France
| |
Collapse
|
11
|
Gao X, Bai Y, Jiang X, Long X, Wei D, He Z, Zeng X, Yu Y. Complete Mitochondrial Genome Characterization of Schrankia costaestrigalis (Insecta: Erebidae: Hypenodinae) and Its Phylogenetic Implication. Genes (Basel) 2023; 14:1867. [PMID: 37895216 PMCID: PMC10606299 DOI: 10.3390/genes14101867] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2023] [Revised: 09/22/2023] [Accepted: 09/25/2023] [Indexed: 10/29/2023] Open
Abstract
The pinion-streaked snout Schrankia costaestrigalis is a new potato pest that has recently been recorded in China. In this study, we analyzed the complete mitochondrial genome of S. costaestrigalis. The results revealed the mitogenome (GenBank: OQ181231) to occur as a circular DNA molecule of 16,376 bp with 51.001% AT content, including 13 protein-coding genes (PCGs), 22 transfer RNA (tRNA) genes, 2 ribosomal RNA (rRNA) genes, and 1 control region. Notably, the PCGs exhibited typical ATN (Met) start codons, including cox1, which deviated from the usual CGA start codon observed in other lepidopteran mitogenomes, and followed the conventional TAN stop codons. The 22 tRNA genes demonstrated the ability to form a cloverleaf structure, with the exception of trnS1-NCU, which lacked the DHU arm present in other Erebidae mitogenomes. Additionally, conserved motifs like "ATAGA + poly-T (19 bp) stretch" and five microsatellite-like elements (TA) were identified in the AT-rich region. The phylogenetic trees revealed that the Hypenodinae subfamily forms an independent lineage closely related to Erebinae and Catocalinae. The comprehensive mitogenome of S. costaestrigalis will greatly enhance future studies focused on the molecular classification and phylogenetic understanding of the Hypenodinae subfamily within the larger family Erebidae.
Collapse
Affiliation(s)
- Xuyuan Gao
- Key Laboratory of Green Prevention and Control on Fruits and Vegetables in South China Ministry of Agriculture and Rural Affairs/Guangxi Key Laboratory of Biology for Crop Diseases and Insect Pests/Plant Protection Research Institute, Guangxi Academy of Agricultural Sciences, Nanning 530007, China; (X.G.); (X.J.); (X.L.); (D.W.); (Z.H.)
| | - Yu Bai
- College of Mathematics & Information Science, Guiyang University, Guiyang 550005, China;
| | - Xiaodong Jiang
- Key Laboratory of Green Prevention and Control on Fruits and Vegetables in South China Ministry of Agriculture and Rural Affairs/Guangxi Key Laboratory of Biology for Crop Diseases and Insect Pests/Plant Protection Research Institute, Guangxi Academy of Agricultural Sciences, Nanning 530007, China; (X.G.); (X.J.); (X.L.); (D.W.); (Z.H.)
| | - Xiuzhen Long
- Key Laboratory of Green Prevention and Control on Fruits and Vegetables in South China Ministry of Agriculture and Rural Affairs/Guangxi Key Laboratory of Biology for Crop Diseases and Insect Pests/Plant Protection Research Institute, Guangxi Academy of Agricultural Sciences, Nanning 530007, China; (X.G.); (X.J.); (X.L.); (D.W.); (Z.H.)
| | - Dewei Wei
- Key Laboratory of Green Prevention and Control on Fruits and Vegetables in South China Ministry of Agriculture and Rural Affairs/Guangxi Key Laboratory of Biology for Crop Diseases and Insect Pests/Plant Protection Research Institute, Guangxi Academy of Agricultural Sciences, Nanning 530007, China; (X.G.); (X.J.); (X.L.); (D.W.); (Z.H.)
| | - Zhan He
- Key Laboratory of Green Prevention and Control on Fruits and Vegetables in South China Ministry of Agriculture and Rural Affairs/Guangxi Key Laboratory of Biology for Crop Diseases and Insect Pests/Plant Protection Research Institute, Guangxi Academy of Agricultural Sciences, Nanning 530007, China; (X.G.); (X.J.); (X.L.); (D.W.); (Z.H.)
| | - Xianru Zeng
- Key Laboratory of Green Prevention and Control on Fruits and Vegetables in South China Ministry of Agriculture and Rural Affairs/Guangxi Key Laboratory of Biology for Crop Diseases and Insect Pests/Plant Protection Research Institute, Guangxi Academy of Agricultural Sciences, Nanning 530007, China; (X.G.); (X.J.); (X.L.); (D.W.); (Z.H.)
| | - Yonghao Yu
- Key Laboratory of Green Prevention and Control on Fruits and Vegetables in South China Ministry of Agriculture and Rural Affairs/Guangxi Key Laboratory of Biology for Crop Diseases and Insect Pests/Plant Protection Research Institute, Guangxi Academy of Agricultural Sciences, Nanning 530007, China; (X.G.); (X.J.); (X.L.); (D.W.); (Z.H.)
| |
Collapse
|
12
|
Li Y, Zhang C, Feng C, Pearce R, Lydia Freddolino P, Zhang Y. Integrating end-to-end learning with deep geometrical potentials for ab initio RNA structure prediction. Nat Commun 2023; 14:5745. [PMID: 37717036 PMCID: PMC10505173 DOI: 10.1038/s41467-023-41303-9] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2023] [Accepted: 08/22/2023] [Indexed: 09/18/2023] Open
Abstract
RNAs are fundamental in living cells and perform critical functions determined by their tertiary architectures. However, accurate modeling of 3D RNA structure remains a challenging problem. We present a novel method, DRfold, to predict RNA tertiary structures by simultaneous learning of local frame rotations and geometric restraints from experimentally solved RNA structures, where the learned knowledge is converted into a hybrid energy potential to guide RNA structure assembly. The method significantly outperforms previous approaches by >73.3% in TM-score on a sequence-nonredundant dataset containing recently released structures. Detailed analyses showed that the major contribution to the improvements arise from the deep end-to-end learning supervised with the atom coordinates and the composite energy function integrating complementary information from geometry restraints and end-to-end learning models. The open-source DRfold program with fast training protocol allows large-scale application of high-resolution RNA structure modeling and can be further improved with future expansion of RNA structure databases.
Collapse
Affiliation(s)
- Yang Li
- Cancer Science Institute of Singapore, National University of Singapore, 117599, Singapore, Singapore
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, 48109, USA
| | - Chengxin Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, 48109, USA
- Department of Molecular, Cellular, and Developmental Biology, Yale University, New Haven, CT, 06511, USA
| | - Chenjie Feng
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, 48109, USA
- School of Science, Ningxia Medical University, Yinchuan, 750004, China
| | - Robin Pearce
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, 48109, USA
- Department of Computer Science, School of Computing, National University of Singapore, 117417, Singapore, Singapore
| | - P Lydia Freddolino
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, 48109, USA.
- Department of Biological Chemistry, University of Michigan Medical School, Ann Arbor, MI, 48109, USA.
| | - Yang Zhang
- Cancer Science Institute of Singapore, National University of Singapore, 117599, Singapore, Singapore.
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, 48109, USA.
- Department of Computer Science, School of Computing, National University of Singapore, 117417, Singapore, Singapore.
- Department of Biological Chemistry, University of Michigan Medical School, Ann Arbor, MI, 48109, USA.
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, 117596, Singapore, Singapore.
| |
Collapse
|
13
|
Bohdan DR, Voronina VV, Bujnicki JM, Baulin EF. A comprehensive survey of long-range tertiary interactions and motifs in non-coding RNA structures. Nucleic Acids Res 2023; 51:8367-8382. [PMID: 37471030 PMCID: PMC10484739 DOI: 10.1093/nar/gkad605] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Accepted: 07/07/2023] [Indexed: 07/21/2023] Open
Abstract
Understanding the 3D structure of RNA is key to understanding RNA function. RNA 3D structure is modular and can be seen as a composition of building blocks of various sizes called tertiary motifs. Currently, long-range motifs formed between distant loops and helical regions are largely less studied than the local motifs determined by the RNA secondary structure. We surveyed long-range tertiary interactions and motifs in a non-redundant set of non-coding RNA 3D structures. A new dataset of annotated LOng-RAnge RNA 3D modules (LORA) was built using an approach that does not rely on the automatic annotations of non-canonical interactions. An original algorithm, ARTEM, was developed for annotation-, sequence- and topology-independent superposition of two arbitrary RNA 3D modules. The proposed methods allowed us to identify and describe the most common long-range RNA tertiary motifs. Along with the prevalent canonical A-minor interactions, a large number of previously undescribed staple interactions were observed. The most frequent long-range motifs were found to belong to three main motif families: planar staples, tilted staples, and helical packing motifs.
Collapse
Affiliation(s)
- Davyd R Bohdan
- Department of Innovation and High Technology, Moscow Institute of Physics and Technology, Dolgoprudny 141701, Russia
| | - Valeria V Voronina
- Department of Information Systems, Ulyanovsk State Technical University, Ulyanovsk 432027, Russia
| | - Janusz M Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw 02-109, Poland
| | - Eugene F Baulin
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Warsaw 02-109, Poland
| |
Collapse
|
14
|
Shekhovtsov SV, Vasiliev GV, Latif R, Poluboyarova TV, Peltek SE, Rapoport IB. The mitochondrial genome of Dendrobaena tellermanica Perel, 1966 (Annelida: Lumbricidae) and its phylogenetic position. Vavilovskii Zhurnal Genet Selektsii 2023; 27:146-152. [PMID: 37063518 PMCID: PMC10090101 DOI: 10.18699/vjgb-23-20] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Revised: 09/30/2022] [Accepted: 09/30/2022] [Indexed: 04/18/2023] Open
Abstract
Earthworms are an important ecological group that has a significant impact on soil fauna as well as plant communities. Despite their importance, genetic diversity and phylogeny of earthworms are still insufficiently studied. Most studies on earthworm genetic diversity are currently based on a few mitochondrial and nuclear genes. Mitochondrial genomes are becoming a promising target for phylogeny reconstruction in earthworms. However, most studies on earthworm mitochondrial genomes were made on West European and East Asian species, with much less sampling from other regions. In this study, we performed sequencing, assembly, and analysis of the mitochondrial genome of Dendrobaena tellermanica Perel, 1966 from the Northern Caucasus. This species was earlier included into D. schmidti (Michaelsen, 1907), a polytypic species with many subspecies. The genome was assembled as a single contig 15,298 bp long which contained a typical gene set: 13 protein-coding genes (three subunits of cytochrome c oxidase, seven subunits of NADH dehydrogenase, two subunits of ATP synthetase, and cytochrome b), 12S and 16S ribosomal RNA genes, and 22 tRNA genes. All genes were located on one DNA strand. The assembled part of the control region, located between the tRNA-Arg and tRNA-His genes, was 727 bp long. The control region contained multiple hairpins, as well as tandem repeats of the AACGCTT monomer. Phylogenetic analysis based on the complete mitochondrial genomes indicated that the genus Dendrobaena occupied the basal position within Lumbricidae. D. tellermanica was a rather distant relative of the cosmopolitan D. octaedra, suggesting high genetic diversity in this genus. D. schmidti turned out to be paraphyletic with respect to D. tellermanica. Since D. schmidti is known to contain very high genetic diversity, these results may indicate that it may be split into several species.
Collapse
Affiliation(s)
- S V Shekhovtsov
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia Institute of Biological Problems of the North of the Far Eastern Branch of the Russian Academy of Sciences, Magadan, Russia
| | - G V Vasiliev
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - R Latif
- Semnan University, Semnan, Iran
| | - T V Poluboyarova
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - S E Peltek
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - I B Rapoport
- Tembotov Institute of Ecology of Mountain Territories of Russian Academy of Sciences, Nalchik, Russia
| |
Collapse
|
15
|
Rivarez MPS, Pecman A, Bačnik K, Maksimović O, Vučurović A, Seljak G, Mehle N, Gutiérrez-Aguirre I, Ravnikar M, Kutnjak D. In-depth study of tomato and weed viromes reveals undiscovered plant virus diversity in an agroecosystem. MICROBIOME 2023; 11:60. [PMID: 36973750 PMCID: PMC10042675 DOI: 10.1186/s40168-023-01500-6] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/15/2022] [Accepted: 02/20/2023] [Indexed: 06/17/2023]
Abstract
BACKGROUND In agroecosystems, viruses are well known to influence crop health and some cause phytosanitary and economic problems, but their diversity in non-crop plants and role outside the disease perspective is less known. Extensive virome explorations that include both crop and diverse weed plants are therefore needed to better understand roles of viruses in agroecosystems. Such unbiased exploration is available through viromics, which could generate biological and ecological insights from immense high-throughput sequencing (HTS) data. RESULTS Here, we implemented HTS-based viromics to explore viral diversity in tomatoes and weeds in farming areas at a nation-wide scale. We detected 125 viruses, including 79 novel species, wherein 65 were found exclusively in weeds. This spanned 21 higher-level plant virus taxa dominated by Potyviridae, Rhabdoviridae, and Tombusviridae, and four non-plant virus families. We detected viruses of non-plant hosts and viroid-like sequences and demonstrated infectivity of a novel tobamovirus in plants of Solanaceae family. Diversities of predominant tomato viruses were variable, in some cases, comparable to that of global isolates of the same species. We phylogenetically classified novel viruses and showed links between a subgroup of phylogenetically related rhabdoviruses to their taxonomically related host plants. Ten classified viruses detected in tomatoes were also detected in weeds, which might indicate possible role of weeds as their reservoirs and that these viruses could be exchanged between the two compartments. CONCLUSIONS We showed that even in relatively well studied agroecosystems, such as tomato farms, a large part of very diverse plant viromes can still be unknown and is mostly present in understudied non-crop plants. The overlapping presence of viruses in tomatoes and weeds implicate possible presence of virus reservoir and possible exchange between the weed and crop compartments, which may influence weed management decisions. The observed variability and widespread presence of predominant tomato viruses and the infectivity of a novel tobamovirus in solanaceous plants, provided foundation for further investigation of virus disease dynamics and their effect on tomato health. The extensive insights we generated from such in-depth agroecosystem virome exploration will be valuable in anticipating possible emergences of plant virus diseases and would serve as baseline for further post-discovery characterization studies. Video Abstract.
Collapse
Affiliation(s)
- Mark Paul Selda Rivarez
- Department of Biotechnology and Systems Biology, National Institute of Biology, Večna pot 111, Ljubljana, 1000 Slovenia
- Jožef Stefan International Postgraduate School, Jamova cesta 39, Ljubljana, 1000 Slovenia
- Present Address: College of Agriculture and Agri-Industries, Caraga State University, Ampayon, Butuan City, 8600 Philippines
| | - Anja Pecman
- Department of Biotechnology and Systems Biology, National Institute of Biology, Večna pot 111, Ljubljana, 1000 Slovenia
- Jožef Stefan International Postgraduate School, Jamova cesta 39, Ljubljana, 1000 Slovenia
| | - Katarina Bačnik
- Department of Biotechnology and Systems Biology, National Institute of Biology, Večna pot 111, Ljubljana, 1000 Slovenia
| | - Olivera Maksimović
- Department of Biotechnology and Systems Biology, National Institute of Biology, Večna pot 111, Ljubljana, 1000 Slovenia
- Jožef Stefan International Postgraduate School, Jamova cesta 39, Ljubljana, 1000 Slovenia
| | - Ana Vučurović
- Department of Biotechnology and Systems Biology, National Institute of Biology, Večna pot 111, Ljubljana, 1000 Slovenia
| | - Gabrijel Seljak
- Department of Biotechnology and Systems Biology, National Institute of Biology, Večna pot 111, Ljubljana, 1000 Slovenia
| | - Nataša Mehle
- Department of Biotechnology and Systems Biology, National Institute of Biology, Večna pot 111, Ljubljana, 1000 Slovenia
- School for Viticulture and Enology, University of Nova Gorica, Dvorec Lanthieri Glavni trg 8, Vipava, 5271 Slovenia
| | - Ion Gutiérrez-Aguirre
- Department of Biotechnology and Systems Biology, National Institute of Biology, Večna pot 111, Ljubljana, 1000 Slovenia
| | - Maja Ravnikar
- Department of Biotechnology and Systems Biology, National Institute of Biology, Večna pot 111, Ljubljana, 1000 Slovenia
| | - Denis Kutnjak
- Department of Biotechnology and Systems Biology, National Institute of Biology, Večna pot 111, Ljubljana, 1000 Slovenia
| |
Collapse
|
16
|
Ali Z, Goyal A, Jhunjhunwala A, Mitra A, Trant JF, Sharma P. Structural and Energetic Features of Base-Base Stacking Contacts in RNA. J Chem Inf Model 2023; 63:655-669. [PMID: 36635230 DOI: 10.1021/acs.jcim.2c01116] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Abstract
Nucleobase π-π stacking is one of the crucial organizing interactions within three-dimensional (3D) RNA architectures. Characterizing the structural variability of these contacts in RNA crystal structures will help delineate their subtleties and their role in determining function. This analysis of different stacking geometries found in RNA X-ray crystal structures is the largest such survey to date; coupled with quantum-mechanical calculations on typical representatives of each possible stacking arrangement, we determined the distribution of stacking interaction energies. A total of 1,735,481 stacking contacts, spanning 359 of the 384 theoretically possible distinct stacking geometries, were identified. Our analysis reveals preferential occurrences of specific consecutive stacking arrangements in certain regions of RNA architectures. Quantum chemical calculations suggest that 88 of the 359 contacts possess intrinsically stable stacking geometries, whereas the remaining stacks require the RNA backbone or surrounding macromolecular environment to force their formation and maintain their stability. Our systematic analysis of π-π stacks in RNA highlights trends in the occurrence and localization of these noncovalent interactions and may help better understand the structural intricacies of functional RNA-based molecular architectures.
Collapse
Affiliation(s)
- Zakir Ali
- Computational Biochemistry Laboratory, Department of Chemistry and Centre for Advanced Studies in Chemistry, Panjab University, Chandigarh160014, India
| | - Ambika Goyal
- Computational Biochemistry Laboratory, Department of Chemistry and Centre for Advanced Studies in Chemistry, Panjab University, Chandigarh160014, India
| | - Ayush Jhunjhunwala
- Center for Computational Natural Sciences and Bioinformatics, International Institute of Information Technology, Hyderabad, Gachibowli, Hyderabad, Telangana500032, India
| | - Abhijit Mitra
- Center for Computational Natural Sciences and Bioinformatics, International Institute of Information Technology, Hyderabad, Gachibowli, Hyderabad, Telangana500032, India
| | - John F Trant
- Department of Chemistry and Biochemistry, University of Windsor, 401 Sunset Avenue, Windsor, OntarioN9B 3P4, Canada
- Binary Star Research Services, LaSalle, OntarioN9J 3X8, Canada
| | - Purshotam Sharma
- Computational Biochemistry Laboratory, Department of Chemistry and Centre for Advanced Studies in Chemistry, Panjab University, Chandigarh160014, India
- Department of Chemistry and Biochemistry, University of Windsor, 401 Sunset Avenue, Windsor, OntarioN9B 3P4, Canada
| |
Collapse
|
17
|
Hoffmann PC, Kreysing JP, Khusainov I, Tuijtel MW, Welsch S, Beck M. Structures of the eukaryotic ribosome and its translational states in situ. Nat Commun 2022; 13:7435. [PMID: 36460643 PMCID: PMC9718845 DOI: 10.1038/s41467-022-34997-w] [Citation(s) in RCA: 34] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2022] [Accepted: 11/15/2022] [Indexed: 12/03/2022] Open
Abstract
Ribosomes translate genetic information into primary structure. During translation, various cofactors transiently bind to the ribosome that undergoes prominent conformational and structural changes. Different translational states of ribosomes have been well characterized in vitro. However, to which extent the known translational states are representative of the native situation inside cells has thus far only been addressed in prokaryotes. Here, we apply cryo-electron tomography to cryo-FIB milled Dictyostelium discoideum cells combined with subtomogram averaging and classification. We obtain an in situ structure that is locally resolved up to 3 Angstrom, the distribution of eukaryotic ribosome translational states, and unique arrangement of rRNA expansion segments. Our work demonstrates the use of in situ structural biology techniques for identifying distinct ribosome states within the cellular environment.
Collapse
Affiliation(s)
- Patrick C. Hoffmann
- grid.419494.50000 0001 1018 9466Department of Molecular Sociology, Max Planck Institute of Biophysics, Max-von-Laue-Straße 3, 60438 Frankfurt am Main, Germany
| | - Jan Philipp Kreysing
- grid.419494.50000 0001 1018 9466Department of Molecular Sociology, Max Planck Institute of Biophysics, Max-von-Laue-Straße 3, 60438 Frankfurt am Main, Germany ,Department of Molecular Sociology, IMPRS on Cellular Biophysics, Max-von-Laue-Straße 3, 60438 Frankfurt am Main, Germany
| | - Iskander Khusainov
- grid.419494.50000 0001 1018 9466Department of Molecular Sociology, Max Planck Institute of Biophysics, Max-von-Laue-Straße 3, 60438 Frankfurt am Main, Germany
| | - Maarten W. Tuijtel
- grid.419494.50000 0001 1018 9466Department of Molecular Sociology, Max Planck Institute of Biophysics, Max-von-Laue-Straße 3, 60438 Frankfurt am Main, Germany
| | - Sonja Welsch
- grid.419494.50000 0001 1018 9466Central Electron Microscopy Facility, Max Planck Institute of Biophysics, Max-von-Laue-Straße 3, 60438 Frankfurt am Main, Germany
| | - Martin Beck
- grid.419494.50000 0001 1018 9466Department of Molecular Sociology, Max Planck Institute of Biophysics, Max-von-Laue-Straße 3, 60438 Frankfurt am Main, Germany
| |
Collapse
|
18
|
Bai Y, Yang K, Ye L, Gao X. Complete Mitogenome and Phylogenetic Analyses of Galerita orientalis Schmidt-Goebel, 1846 (Insecta: Coleoptera: Carabidae: Galeritini). Genes (Basel) 2022; 13:genes13122199. [PMID: 36553466 PMCID: PMC9777712 DOI: 10.3390/genes13122199] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2022] [Revised: 11/15/2022] [Accepted: 11/22/2022] [Indexed: 11/25/2022] Open
Abstract
The genus Galerita Fabricius, 1801 belongs to the tribe Galeritini of the family Carabidae. In this study, the complete mitochondrial genome (GenBank: ON920164.1) of G. orientalis is newly sequenced, annotated, characterized, and composed of 37 typical genes, and one control region. Mitogenome is a circular DNA molecule of 16,137 bp with a 78.79% AT content. All 13 protein-coding genes are initiated using a typical ATN (Met) as the start codon, except for nad1, which has a TTG as the start codon, and are terminated using a typical TAN stop codon. Twenty-two tRNAs could fold into a typical cloverleaf structure, including trnS1-GCU, which lacks the DHU stem observed in other mitogenomes of the subfamily Harpalinae. Both rrnS and rrnL contain many helices. A conserved poly-T stretch (19 bp) and seven tandem repeats are observed in the control region, and a phylogenetic analysis indicated that the genus Galerita is an independent lineage. The complete mitogenome of G. orientalis will contribute to further studies on the molecular basis of the classification and phylogeny of Harpalinae, and even Carabidae.
Collapse
Affiliation(s)
- Yu Bai
- College of Mathematics & Information Science, Guiyang University, Guiyang 550005, China
| | - Kang Yang
- College of Biology and Environmental Engineering, Guiyang University, Guiyang 550005, China
| | - Lin Ye
- College of Biology and Environmental Engineering, Guiyang University, Guiyang 550005, China
| | - Xuyuan Gao
- Guangxi Key Laboratory of Biology for Crop Diseases and Insect Pests, Institute of Plant Protection, Guangxi Academy of Agricultural Sciences, Nanning 530007, China
- Key Laboratory of Green Prevention and Control on Fruits and Vegetables in South China Ministry of Agriculture and Rural Affairs, Institute of Plant Protection, Guangxi Academy of Agricultural Sciences, Nanning 530007, China
- Correspondence:
| |
Collapse
|
19
|
Zhou L, Wang X, Yu S, Tan YL, Tan ZJ. FebRNA: An automated fragment-ensemble-based model for building RNA 3D structures. Biophys J 2022; 121:3381-3392. [PMID: 35978551 PMCID: PMC9515226 DOI: 10.1016/j.bpj.2022.08.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2022] [Revised: 07/19/2022] [Accepted: 08/15/2022] [Indexed: 11/23/2022] Open
Abstract
Knowledge of RNA three-dimensional (3D) structures is critical to understanding the important biological functions of RNAs. Although various structure prediction models have been developed, the high-accuracy predictions of RNA 3D structures are still limited to the RNAs with short lengths or with simple topology. In this work, we proposed a new model, namely FebRNA, for building RNA 3D structures through fragment assembly based on coarse-grained (CG) fragment ensembles. Specifically, FebRNA is composed of four processes: establishing the library of different types of non-redundant CG fragment ensembles regardless of the sequences, building CG 3D structure ensemble through fragment assembly, identifying top-scored CG structures through a specific CG scoring function, and rebuilding the all-atom structures from the top-scored CG ones. Extensive examination against different types of RNA structures indicates that FebRNA consistently gives the reliable predictions on RNA 3D structures, including pseudoknots, three-way junctions, four-way and five-way junctions, and RNAs in the RNA-Puzzles. FebRNA is available on the Web site: https://github.com/Tan-group/FebRNA.
Collapse
Affiliation(s)
- Li Zhou
- Department of Physics and Key Laboratory of Artificial Micro & Nano-structures of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Xunxun Wang
- Department of Physics and Key Laboratory of Artificial Micro & Nano-structures of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Shixiong Yu
- Department of Physics and Key Laboratory of Artificial Micro & Nano-structures of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China
| | - Ya-Lan Tan
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences, Wuhan Textile University, Wuhan 430073, China.
| | - Zhi-Jie Tan
- Department of Physics and Key Laboratory of Artificial Micro & Nano-structures of Education, School of Physics and Technology, Wuhan University, Wuhan 430072, China.
| |
Collapse
|
20
|
Matarrese MAG, Loppini A, Nicoletti M, Filippi S, Chiodo L. Assessment of tools for RNA secondary structure prediction and extraction: a final-user perspective. J Biomol Struct Dyn 2022:1-20. [DOI: 10.1080/07391102.2022.2116110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2022]
Affiliation(s)
- Margherita A. G. Matarrese
- Engineering Department, Campus Bio-Medico University of Rome, Rome, Italy
- Jane and John Justin Neurosciences Center, Cook Children’s Health Care System, TX, USA
- Department of Bioengineering, The University of Texas at Arlington, Arlington, TX, USA
| | - Alessandro Loppini
- Engineering Department, Campus Bio-Medico University of Rome, Rome, Italy
- Center for Life Nano & Neuroscience, Italian Institute of Technology, Rome, Italy
| | - Martina Nicoletti
- Engineering Department, Campus Bio-Medico University of Rome, Rome, Italy
- Center for Life Nano & Neuroscience, Italian Institute of Technology, Rome, Italy
| | - Simonetta Filippi
- Engineering Department, Campus Bio-Medico University of Rome, Rome, Italy
| | - Letizia Chiodo
- Engineering Department, Campus Bio-Medico University of Rome, Rome, Italy
| |
Collapse
|
21
|
Mukherjee D, Maiti S, Gouda PK, Sharma R, Roy P, Bhattacharyya D. RNABPDB: Molecular Modeling of RNA Structure-From Base Pair Analysis in Crystals to Structure Prediction. Interdiscip Sci 2022; 14:759-774. [PMID: 35705797 DOI: 10.1007/s12539-022-00528-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Revised: 05/05/2022] [Accepted: 05/17/2022] [Indexed: 06/15/2023]
Abstract
The stable three-dimensional structure of RNA is known to play several important biochemical roles, from post-transcriptional gene regulation to enzymatic action. These structures contain double-helical regions, which often have different types of non-canonical base pairs in addition to Watson-Crick base pairs. Hence, it is important to study their structures from experimentally obtained or even predicted ones, to understand their role, or to develop a drug against the potential targets. Molecular Modeling of RNA double helices containing non-canonical base pairs is a difficult process, particularly due to the unavailability of structural features of non-Watson-Crick base pairs. Here we show a composite web-server with an associated database that allows one to generate the structure of RNA double helix containing non-canonical base pairs using consensus parameters obtained from the database. The database classification is followed by an evaluation of the central tendency of the structural parameters as well as a quantitative estimation of interaction strengths. These parameters are used to construct three-dimensional structures of double helices composed of Watson-Crick and/or non-canonical base pairs. Our benchmark study to regenerate double-helical fragments of many experimentally derived RNA structures indicate very high accuracy. This composite server is expected to be highly useful in understanding functions of various pre-miRNA by modeling structures of the molecules and estimating binding efficiency. The database can be accessed from http://hdrnas.saha.ac.in/rnabpdb .
Collapse
Affiliation(s)
- Debasish Mukherjee
- Computational Science Division, Saha Institute of Nuclear Physics, 1/AF Bidhannagar, Kolkata, 700064, India.
- Institute of Molecular Biology gGmbH (IMB), Ackermannweg 4, 55128, Mainz, Germany.
| | - Satyabrata Maiti
- Computational Science Division, Saha Institute of Nuclear Physics, 1/AF Bidhannagar, Kolkata, 700064, India
- Homi Bhaba National Institute, Anushaktinagar, Mumbai, 400094, India
| | - Prasanta Kumar Gouda
- Computational Science Division, Saha Institute of Nuclear Physics, 1/AF Bidhannagar, Kolkata, 700064, India
| | - Richa Sharma
- Computational Science Division, Saha Institute of Nuclear Physics, 1/AF Bidhannagar, Kolkata, 700064, India
| | - Parthajit Roy
- Department of Computer Science, The University of Burdwan, Golapbag, Burdwan, 713104, India
| | - Dhananjay Bhattacharyya
- Computational Science Division, Saha Institute of Nuclear Physics, 1/AF Bidhannagar, Kolkata, 700064, India
- Homi Bhaba National Institute, Anushaktinagar, Mumbai, 400094, India
| |
Collapse
|
22
|
Ward JC, Lasecka-Dykes L, Neil C, Adeyemi OO, Gold S, McLean-Pell N, Wright C, Herod MR, Kealy D, Warner E, Jackson T, King DP, Tuthill TJ, Rowlands DJ, Stonehouse NJ. The RNA pseudoknots in foot-and-mouth disease virus are dispensable for genome replication, but essential for the production of infectious virus. PLoS Pathog 2022; 18:e1010589. [PMID: 35666744 PMCID: PMC9203018 DOI: 10.1371/journal.ppat.1010589] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2022] [Revised: 06/16/2022] [Accepted: 05/12/2022] [Indexed: 11/22/2022] Open
Abstract
Non-coding regions of viral RNA (vRNA) genomes are critically important in the regulation of gene expression. In particular, pseudoknot (PK) structures, which are present in a wide range of RNA molecules, have a variety of roles. The 5' untranslated region (5' UTR) of foot-and-mouth disease virus (FMDV) vRNA is considerably longer than in other viruses from the picornavirus family and consists of a number of distinctive structural motifs that includes multiple (2, 3 or 4 depending on the virus strain) putative PKs linked in tandem. The role(s) of the PKs in the FMDV infection are not fully understood. Here, using bioinformatics, sub-genomic replicons and recombinant viruses we have investigated the structural conservation and importance of the PKs in the FMDV lifecycle. Our results show that despite the conservation of two or more PKs across all FMDVs, a replicon lacking PKs was replication competent, albeit at reduced levels. Furthermore, in competition experiments, GFP FMDV replicons with less than two (0 or 1) PK structures were outcompeted by a mCherry FMDV wt replicon that had 4 PKs, whereas GFP replicons with 2 or 4 PKs were not. This apparent replicative advantage offered by the additional PKs correlates with the maintenance of at least two PKs in the genomes of FMDV field isolates. Despite a replicon lacking any PKs retaining the ability to replicate, viruses completely lacking PK were not viable and at least one PK was essential for recovery of infections virus, suggesting a role for the PKs in virion assembly. Thus, our study points to roles for the PKs in both vRNA replication and virion assembly, thereby improving understanding the molecular biology of FMDV replication and the wider roles of PK in RNA functions.
Collapse
Affiliation(s)
- Joseph C. Ward
- School of Molecular and Cellular Biology, Faculty of Biological Sciences and Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds, United Kingdom
| | | | - Chris Neil
- Pirbright Institute, Ash Road, Pirbright, Surrey, United Kingdom
| | - Oluwapelumi O. Adeyemi
- School of Molecular and Cellular Biology, Faculty of Biological Sciences and Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds, United Kingdom
| | - Sarah Gold
- Pirbright Institute, Ash Road, Pirbright, Surrey, United Kingdom
| | - Niall McLean-Pell
- School of Molecular and Cellular Biology, Faculty of Biological Sciences and Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds, United Kingdom
| | - Caroline Wright
- Pirbright Institute, Ash Road, Pirbright, Surrey, United Kingdom
| | - Morgan R. Herod
- School of Molecular and Cellular Biology, Faculty of Biological Sciences and Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds, United Kingdom
| | - David Kealy
- School of Molecular and Cellular Biology, Faculty of Biological Sciences and Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds, United Kingdom
| | - Emma Warner
- School of Molecular and Cellular Biology, Faculty of Biological Sciences and Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds, United Kingdom
| | - Terry Jackson
- Pirbright Institute, Ash Road, Pirbright, Surrey, United Kingdom
| | - Donald P. King
- Pirbright Institute, Ash Road, Pirbright, Surrey, United Kingdom
| | | | - David J. Rowlands
- School of Molecular and Cellular Biology, Faculty of Biological Sciences and Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds, United Kingdom
- * E-mail: (DJR); (NJS)
| | - Nicola J. Stonehouse
- School of Molecular and Cellular Biology, Faculty of Biological Sciences and Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds, United Kingdom
- * E-mail: (DJR); (NJS)
| |
Collapse
|
23
|
Tumor Suppressive Effects of GAS5 in Cancer Cells. Noncoding RNA 2022; 8:ncrna8030039. [PMID: 35736636 PMCID: PMC9228804 DOI: 10.3390/ncrna8030039] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2022] [Revised: 03/28/2022] [Accepted: 03/30/2022] [Indexed: 12/19/2022] Open
Abstract
In recent years, long non-coding RNAs (lncRNAs) have been shown to play important regulatory roles in cellular processes. Growth arrests specific transcript 5 (GAS5) is a lncRNA that is highly expressed during the cell cycle arrest phase but is downregulated in actively growing cells. Growth arrests specific transcript 5 was discovered to be downregulated in several cancers, primarily solid tumors, and it is known as a tumor suppressor gene that regulates cell proliferation, invasion, migration, and apoptosis via multiple molecular mechanisms. Furthermore, GAS5 polymorphism was found to affect GAS5 expression and functionality in a cell-specific manner. This review article focuses on GAS5’s tumor-suppressive effects in regulating oncogenic signaling pathways, cell cycle, apoptosis, tumor-associated genes, and treatment-resistant cells. We also discussed genetic polymorphisms of GAS5 and their association with cancer susceptibility.
Collapse
|
24
|
Ghani NSA, Emrizal R, Moffit SM, Hamdani HY, Ramlan EI, Firdaus-Raih M. GrAfSS: a webserver for substructure similarity searching and comparisons in the structures of proteins and RNA. Nucleic Acids Res 2022; 50:W375-W383. [PMID: 35639505 PMCID: PMC9252811 DOI: 10.1093/nar/gkac402] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Revised: 04/28/2022] [Accepted: 05/08/2022] [Indexed: 12/03/2022] Open
Abstract
The GrAfSS (Graph theoretical Applications for Substructure Searching) webserver is a platform to search for three-dimensional substructures of: (i) amino acid side chains in protein structures; and (ii) base arrangements in RNA structures. The webserver interfaces the functions of five different graph theoretical algorithms – ASSAM, SPRITE, IMAAAGINE, NASSAM and COGNAC – into a single substructure searching suite. Users will be able to identify whether a three-dimensional (3D) arrangement of interest, such as a ligand binding site or 3D motif, observed in a protein or RNA structure can be found in other structures available in the Protein Data Bank (PDB). The webserver also allows users to determine whether a protein or RNA structure of interest contains substructural arrangements that are similar to known motifs or 3D arrangements. These capabilities allow for the functional annotation of new structures that were either experimentally determined or computationally generated (such as the coordinates generated by AlphaFold2) and can provide further insights into the diversity or conservation of functional mechanisms of structures in the PDB. The computed substructural superpositions are visualized using integrated NGL viewers. The GrAfSS server is available at http://mfrlab.org/grafss/.
Collapse
Affiliation(s)
- Nur Syatila Ab Ghani
- Institute of Systems Biology, Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia
| | - Reeki Emrizal
- Department of Applied Physics, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia
| | - Sabrina Mohamed Moffit
- Department of Applied Physics, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia
| | - Hazrina Yusof Hamdani
- Advanced Medical and Dental Institute, Universiti Sains Malaysia, Bertam, Kepala Batas 13200, Pulau Pinang, Malaysia
| | | | - Mohd Firdaus-Raih
- Institute of Systems Biology, Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia.,Department of Applied Physics, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia
| |
Collapse
|
25
|
Tran AN, Chandler M, Halman J, Beasock D, Fessler A, McKeough RQ, Lam PA, Furr DP, Wang J, Cedrone E, Dobrovolskaia MA, Dokholyan NV, Trammell SR, Afonin KA. Anhydrous Nucleic Acid Nanoparticles for Storage and Handling at Broad Range of Temperatures. SMALL (WEINHEIM AN DER BERGSTRASSE, GERMANY) 2022; 18:e2104814. [PMID: 35128787 PMCID: PMC8976831 DOI: 10.1002/smll.202104814] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Revised: 12/17/2021] [Indexed: 05/13/2023]
Abstract
Recent advances in nanotechnology now allow for the methodical implementation of therapeutic nucleic acids (TNAs) into modular nucleic acid nanoparticles (NANPs) with tunable physicochemical properties which can match the desired biological effects, provide uniformity, and regulate the delivery of multiple TNAs for combinatorial therapy. Despite the potential of novel NANPs, the maintenance of their structural integrity during storage and shipping remains a vital issue that impedes their broader applications. Cold chain storage is required to maintain the potency of NANPs in the liquid phase, which greatly increases transportation costs. To promote long-term storage and retention of biological activities at higher temperatures (e.g., +50 °C), a panel of representative NANPs is first exposed to three different drying mechanisms-vacuum concentration (SpeedVac), lyophilization (Lyo), and light-assisted drying (LAD)-and then rehydrated and analyzed. While SpeedVac primarily operates using heat, Lyo avoids temperature increases by taking advantage of pressure reduction and LAD involves a near-infrared laser for uniform drying in the presence of trehalose. This work compares and defines refinements crucial in formulating an optimal strategy for producing stable, fully functional NANPs and presents a forward advancement in their development for clinical applications.
Collapse
Affiliation(s)
- Allison N Tran
- Nanoscale Science Program, Department of Chemistry, The University of North Carolina at Charlotte, Charlotte, NC, 28223, USA
| | - Morgan Chandler
- Nanoscale Science Program, Department of Chemistry, The University of North Carolina at Charlotte, Charlotte, NC, 28223, USA
| | - Justin Halman
- Nanoscale Science Program, Department of Chemistry, The University of North Carolina at Charlotte, Charlotte, NC, 28223, USA
| | - Damian Beasock
- Nanoscale Science Program, Department of Chemistry, The University of North Carolina at Charlotte, Charlotte, NC, 28223, USA
| | - Adam Fessler
- Nanoscale Science Program, Department of Chemistry, The University of North Carolina at Charlotte, Charlotte, NC, 28223, USA
| | - Riley Q McKeough
- Department of Physics and Optical Science, The University of North Carolina at Charlotte, Charlotte, NC, 28223, USA
| | - Phuong Anh Lam
- Department of Physics and Optical Science, The University of North Carolina at Charlotte, Charlotte, NC, 28223, USA
| | - Daniel P Furr
- Department of Physics and Optical Science, The University of North Carolina at Charlotte, Charlotte, NC, 28223, USA
| | - Jian Wang
- Department of Pharmacology, Department of Biochemistry & Molecular Biology, Penn State College of Medicine, Hershey, PA, 17033, USA
| | - Edward Cedrone
- Nanotechnology Characterization Lab, Frederick National Laboratory for Cancer Research sponsored by the National Cancer Institute, Frederick, MD, 21702, USA
| | - Marina A Dobrovolskaia
- Nanotechnology Characterization Lab, Frederick National Laboratory for Cancer Research sponsored by the National Cancer Institute, Frederick, MD, 21702, USA
| | - Nikolay V Dokholyan
- Department of Pharmacology, Department of Biochemistry & Molecular Biology, Penn State College of Medicine, Hershey, PA, 17033, USA
| | - Susan R Trammell
- Department of Physics and Optical Science, The University of North Carolina at Charlotte, Charlotte, NC, 28223, USA
| | - Kirill A Afonin
- Nanoscale Science Program, Department of Chemistry, The University of North Carolina at Charlotte, Charlotte, NC, 28223, USA
| |
Collapse
|
26
|
Zhang C, Pyle AM. CSSR: assignment of secondary structure to coarse-grained RNA tertiary structures. ACTA CRYSTALLOGRAPHICA SECTION D STRUCTURAL BIOLOGY 2022; 78:466-471. [PMID: 35362469 PMCID: PMC8972804 DOI: 10.1107/s2059798322001292] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/06/2021] [Accepted: 02/02/2022] [Indexed: 11/16/2022]
Abstract
CSSR, an algorithm for assigning secondary structures to RNA 3D structures with missing atoms, has been developed. The base-pair assignment accuracy is close to 90% for 3D structures in which only one atom per nucleotide can be empirically identified. RNA secondary-structure (rSS) assignment is one of the most routine forms of analysis of RNA 3D structures. However, traditional rSS assignment programs require full-atomic structures of the individual RNA nucleotides. This prevents their application to the modeling of RNA structures in which base atoms are missing. To address this issue, Coarse-grained Secondary Structure of RNA (CSSR), an algorithm for the assignment of rSS for structures in which nucleobase atomic positions are incomplete, has been developed. Using CSSR, an rSS assignment accuracy of ∼90% is achieved even for RNA structures in which only one backbone atom per nucleotide is known. Thus, CSSR will be useful for the analysis of experimentally determined and computationally predicted RNA 3D structures alike. The source code of CSSR is available at https://github.com/pylelab/CSSR.
Collapse
|
27
|
Guo ZH, Yuan L, Tan YL, Zhang BG, Shi YZ. RNAStat: An Integrated Tool for Statistical Analysis of RNA 3D Structures. FRONTIERS IN BIOINFORMATICS 2022; 1:809082. [PMID: 36303785 PMCID: PMC9580920 DOI: 10.3389/fbinf.2021.809082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2021] [Accepted: 12/17/2021] [Indexed: 11/13/2022] Open
Abstract
The 3D architectures of RNAs are essential for understanding their cellular functions. While an accurate scoring function based on the statistics of known RNA structures is a key component for successful RNA structure prediction or evaluation, there are few tools or web servers that can be directly used to make comprehensive statistical analysis for RNA 3D structures. In this work, we developed RNAStat, an integrated tool for making statistics on RNA 3D structures. For given RNA structures, RNAStat automatically calculates RNA structural properties such as size and shape, and shows their distributions. Based on the RNA structure annotation from DSSR, RNAStat provides statistical information of RNA secondary structure motifs including canonical/non-canonical base pairs, stems, and various loops. In particular, the geometry of base-pairing/stacking can be calculated in RNAStat by constructing a local coordinate system for each base. In addition, RNAStat also supplies the distribution of distance between any atoms to the users to help build distance-based RNA statistical potentials. To test the usability of the tool, we established a non-redundant RNA 3D structure dataset, and based on the dataset, we made a comprehensive statistical analysis on RNA structures, which could have the guiding significance for RNA structure modeling. The python code of RNAStat, the dataset used in this work, and corresponding statistical data files are freely available at GitHub (https://github.com/RNA-folding-lab/RNAStat).
Collapse
Affiliation(s)
- Zhi-Hao Guo
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences, Wuhan Textile University, Wuhan, China
- School of Computer Science and Artificial Intelligence, Wuhan Textile University, Wuhan, China
| | - Li Yuan
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences, Wuhan Textile University, Wuhan, China
- School of Computer Science and Artificial Intelligence, Wuhan Textile University, Wuhan, China
| | - Ya-Lan Tan
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences, Wuhan Textile University, Wuhan, China
| | - Ben-Gong Zhang
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences, Wuhan Textile University, Wuhan, China
| | - Ya-Zhou Shi
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences, Wuhan Textile University, Wuhan, China
- *Correspondence: Ya-Zhou Shi,
| |
Collapse
|
28
|
Identifying proximal RNA interactions from cDNA-encoded crosslinks with ShapeJumper. PLoS Comput Biol 2021; 17:e1009632. [PMID: 34905538 PMCID: PMC8670686 DOI: 10.1371/journal.pcbi.1009632] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2021] [Accepted: 11/11/2021] [Indexed: 01/07/2023] Open
Abstract
SHAPE-JuMP is a concise strategy for identifying close-in-space interactions in RNA molecules. Nucleotides in close three-dimensional proximity are crosslinked with a bi-reactive reagent that covalently links the 2'-hydroxyl groups of the ribose moieties. The identities of crosslinked nucleotides are determined using an engineered reverse transcriptase that jumps across crosslinked sites, resulting in a deletion in the cDNA that is detected using massively parallel sequencing. Here we introduce ShapeJumper, a bioinformatics pipeline to process SHAPE-JuMP sequencing data and to accurately identify through-space interactions, as observed in complex JuMP datasets. ShapeJumper identifies proximal interactions with near-nucleotide resolution using an alignment strategy that is optimized to tolerate the unique non-templated reverse-transcription profile of the engineered crosslink-traversing reverse-transcriptase. JuMP-inspired strategies are now poised to replace adapter-ligation for detecting RNA-RNA interactions in most crosslinking experiments.
Collapse
|
29
|
Progress toward SHAPE Constrained Computational Prediction of Tertiary Interactions in RNA Structure. Noncoding RNA 2021; 7:ncrna7040071. [PMID: 34842779 PMCID: PMC8628965 DOI: 10.3390/ncrna7040071] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Revised: 10/29/2021] [Accepted: 11/02/2021] [Indexed: 01/04/2023] Open
Abstract
As more sequencing data accumulate and novel puzzling genetic regulations are discovered, the need for accurate automated modeling of RNA structure increases. RNA structure modeling from chemical probing experiments has made tremendous progress, however accurately predicting large RNA structures is still challenging for several reasons: RNA are inherently flexible and often adopt many energetically similar structures, which are not reliably distinguished by the available, incomplete thermodynamic model. Moreover, computationally, the problem is aggravated by the relevance of pseudoknots and non-canonical base pairs, which are hardly predicted efficiently. To identify nucleotides involved in pseudoknots and non-canonical interactions, we scrutinized the SHAPE reactivity of each nucleotide of the 188 nt long lariat-capping ribozyme under multiple conditions. Reactivities analyzed in the light of the X-ray structure were shown to report accurately the nucleotide status. Those that seemed paradoxical were rationalized by the nucleotide behavior along molecular dynamic simulations. We show that valuable information on intricate interactions can be deduced from probing with different reagents, and in the presence or absence of Mg2+. Furthermore, probing at increasing temperature was remarkably efficient at pointing to non-canonical interactions and pseudoknot pairings. The possibilities of following such strategies to inform structure modeling software are discussed.
Collapse
|
30
|
Tulsiyan K, Jena S, González-Viegas M, Kar RK, Biswal HS. Structural Dynamics of RNA in the Presence of Choline Amino Acid Based Ionic Liquid: A Spectroscopic and Computational Outlook. ACS CENTRAL SCIENCE 2021; 7:1688-1697. [PMID: 34729412 PMCID: PMC8554839 DOI: 10.1021/acscentsci.1c00768] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/26/2021] [Indexed: 05/03/2023]
Abstract
Ribonucleic acid (RNA) is exceedingly sensitive to degradation compared to DNA. The current protocol for storage of purified RNA requires freezing conditions below -20 °C. Recent advancements in biological chemistry have identified amino acid-based ionic liquids as suitable preservation media for RNA, even in the presence of degrading enzymes. However, the mechanistic insight into the interaction between ILs and RNA is unclear. To the best of our knowledge, no attempts are made so far to provide a molecular view. This work aims to establish a detailed understanding of how ILs enable structural stability to RNA sourced from Torula yeast. Herein, we manifest the hypothesis of multimodal binding of IL and its minimal perturbation to the macromolecular structure, with several spectroscopic techniques such as time-resolved fluorescence and fluorescence correlation spectroscopy (FCS) aided with molecular dynamics at microsecond time scales. Relevant structural and thermodynamic details from biophysical experiments confirm that even long-term RNA preservation with ILs is a possible alternative devoid of any structural deformation. These results establish a unifying mechanism of how ILs are maintaining conformational integrity and thermal stability. The atomistic insights are transferable for their potential applications in drug delivery and biomaterials by considering the advantages of having maximum structural retention and minimum toxicity.
Collapse
Affiliation(s)
- Kiran
Devi Tulsiyan
- School
of Chemical Sciences, National Institute
of Science Education and Research (NISER), Bhimpur-Padanpur, Via-Jatni, District, Khurda, 752050, Bhubaneswar, India
- Homi
Bhabha National Institute, Training School Complex, Anushakti Nagar, Mumbai 400094, India
| | - Subhrakant Jena
- School
of Chemical Sciences, National Institute
of Science Education and Research (NISER), Bhimpur-Padanpur, Via-Jatni, District, Khurda, 752050, Bhubaneswar, India
- Homi
Bhabha National Institute, Training School Complex, Anushakti Nagar, Mumbai 400094, India
| | - María González-Viegas
- Institut
für Biologie, Humboldt Universität zu Berlin, Invalidenstraße 42, 10115 Berlin, Germany
| | - Rajiv K. Kar
- Faculty
II-Mathematics and Natural Sciences, Technische
Universität Berlin, Sekr. PC 14, Strasse des 17, Juni 135, D-10623 Berlin, Germany
| | - Himansu S. Biswal
- School
of Chemical Sciences, National Institute
of Science Education and Research (NISER), Bhimpur-Padanpur, Via-Jatni, District, Khurda, 752050, Bhubaneswar, India
- Homi
Bhabha National Institute, Training School Complex, Anushakti Nagar, Mumbai 400094, India
- . Phone: +91-674-2494 185/186
| |
Collapse
|
31
|
Baulin EF. Features and Functions of the A-Minor Motif, the Most Common Motif in RNA Structure. BIOCHEMISTRY (MOSCOW) 2021; 86:952-961. [PMID: 34488572 DOI: 10.1134/s000629792108006x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
A-minor motifs are RNA tertiary structure motifs that generally involve a canonical base pair and an adenine base forming hydrogen bonds with the minor groove of the base pair. Such motifs are among the most common tertiary interactions in known RNA structures, comparable in number with the non-canonical base pairs. They are often found in functionally important regions of non-coding RNAs and, in particular, play a central role in protein synthesis. Here, we review local variations of the A-minor geometry and discuss difficulties associated with their annotation, as well as various structural contexts and common A-minor co-motifs, and diverse functions of A-minors in various processes in a living cell.
Collapse
Affiliation(s)
- Eugene F Baulin
- Institute of Mathematical Problems of Biology RAS - the Branch of Keldysh Institute of Applied Mathematics of the Russian Academy of Sciences, Pushchino, Moscow Region, 142290, Russia. .,Moscow Institute of Physics and Technology, Dolgoprudny, Moscow Region, 141701, Russia
| |
Collapse
|
32
|
Emrizal R, Hamdani HY, Firdaus-Raih M. Graph Theoretical Methods and Workflows for Searching and Annotation of RNA Tertiary Base Motifs and Substructures. Int J Mol Sci 2021; 22:ijms22168553. [PMID: 34445259 PMCID: PMC8395288 DOI: 10.3390/ijms22168553] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2021] [Revised: 08/01/2021] [Accepted: 08/06/2021] [Indexed: 12/12/2022] Open
Abstract
The increasing number and complexity of structures containing RNA chains in the Protein Data Bank (PDB) have led to the need for automated structure annotation methods to replace or complement expert visual curation. This is especially true when searching for tertiary base motifs and substructures. Such base arrangements and motifs have diverse roles that range from contributions to structural stability to more direct involvement in the molecule's functions, such as the sites for ligand binding and catalytic activity. We review the utility of computational approaches in annotating RNA tertiary base motifs in a dataset of PDB structures, particularly the use of graph theoretical algorithms that can search for such base motifs and annotate them or find and annotate clusters of hydrogen-bond-connected bases. We also demonstrate how such graph theoretical algorithms can be integrated into a workflow that allows for functional analysis and comparisons of base arrangements and sub-structures, such as those involved in ligand binding. The capacity to carry out such automatic curations has led to the discovery of novel motifs and can give new context to known motifs as well as enable the rapid compilation of RNA 3D motifs into a database.
Collapse
Affiliation(s)
- Reeki Emrizal
- Department of Applied Physics, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, UKM Bangi, Bangi 43600, Selangor, Malaysia;
- Institute of Systems Biology, Universiti Kebangsaan Malaysia, UKM Bangi, Bangi 43600, Selangor, Malaysia
| | - Hazrina Yusof Hamdani
- Advanced Medical and Dental Institute, Universiti Sains Malaysia, Bertam, Kepala Batas 13200, Pulau Pinang, Malaysia
- Correspondence: (H.Y.H.); (M.F.-R.)
| | - Mohd Firdaus-Raih
- Department of Applied Physics, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, UKM Bangi, Bangi 43600, Selangor, Malaysia;
- Institute of Systems Biology, Universiti Kebangsaan Malaysia, UKM Bangi, Bangi 43600, Selangor, Malaysia
- Correspondence: (H.Y.H.); (M.F.-R.)
| |
Collapse
|
33
|
Soulé A, Reinharz V, Sarrazin-Gendron R, Denise A, Waldispühl J. Finding recurrent RNA structural networks with fast maximal common subgraphs of edge-colored graphs. PLoS Comput Biol 2021; 17:e1008990. [PMID: 34048427 PMCID: PMC8191989 DOI: 10.1371/journal.pcbi.1008990] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Revised: 06/10/2021] [Accepted: 04/22/2021] [Indexed: 11/25/2022] Open
Abstract
RNA tertiary structure is crucial to its many non-coding molecular functions. RNA architecture is shaped by its secondary structure composed of stems, stacked canonical base pairs, enclosing loops. While stems are precisely captured by free-energy models, loops composed of non-canonical base pairs are not. Nor are distant interactions linking together those secondary structure elements (SSEs). Databases of conserved 3D geometries (a.k.a. modules) not captured by energetic models are leveraged for structure prediction and design, but the computational complexity has limited their study to local elements, loops. Representing the RNA structure as a graph has recently allowed to expend this work to pairs of SSEs, uncovering a hierarchical organization of these 3D modules, at great computational cost. Systematically capturing recurrent patterns on a large scale is a main challenge in the study of RNA structures. In this paper, we present an efficient algorithm to compute maximal isomorphisms in edge colored graphs. We extend this algorithm to a framework well suited to identify RNA modules, and fast enough to considerably generalize previous approaches. To exhibit the versatility of our framework, we first reproduce results identifying all common modules spanning more than 2 SSEs, in a few hours instead of weeks. The efficiency of our new algorithm is demonstrated by computing the maximal modules between any pair of entire RNA in the non-redundant corpus of known RNA 3D structures. We observe that the biggest modules our method uncovers compose large shared sub-structure spanning hundreds of nucleotides and base pairs between the ribosomes of Thermus thermophilus, Escherichia Coli, and Pseudomonas aeruginosa. Ribonucleic Acids (RNAs) are performing a broad range of essential molecular functions in cells, many of which rely on intricate folding properties of the molecule. Watson-Crick and Wobble base pairs form early, stack onto each other to create stems connected by loops, which are themselves stabilized by more sophisticated base interaction patterns. These networks are essential to shape RNA 3D structures but unfortunately still poorly understood. Here, we undertake the task to build a catalog of base interaction networks occurring in multiple structures. However, a pairwise comparison of all RNA structures is computationally heavy. Therefore, we devise an algorithm leveraging intrinsic properties of RNA base interaction networks that enables us to quickly mine full databases of 3D structures. Compared to previous methods, our techniques bring the total running time of the analysis from months to hours while performing more general searches. The data collected though this work will benefit molecular evolution studies and serve in structure prediction tools.
Collapse
Affiliation(s)
- Antoine Soulé
- School of Computer Science, McGill University, Montréal, Canada
- LiX, École Polytechnique, Paris, France
| | - Vladimir Reinharz
- Department of Computer Science, Université du Québec à Montréal, Montréal, Canada
| | | | - Alain Denise
- Laboratoire de recherche en informatique, Université Paris-Saclay - CNRS, Orsay, France
- Institute for Integrative Biology of the Cell (I2BC), Université Paris-Saclay - CEA - CNRS, Gif-sur-Yvette, France
| | - Jérôme Waldispühl
- School of Computer Science, McGill University, Montréal, Canada
- * E-mail:
| |
Collapse
|
34
|
Zhang K, Abdallah K, Ajmera P, Finos K, Looka A, Mekhael J, Frank AT. CS-Annotate: A Tool for Using NMR Chemical Shifts to Annotate RNA Structure. J Chem Inf Model 2021; 61:1545-1549. [PMID: 33797909 DOI: 10.1021/acs.jcim.1c00006] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Here, we introduce CS-Annotate, a tool that uses assigned NMR chemical shifts to annotate structural features in RNA. At its core, CS-Annotate is a deployment of a multitask deep learning model that simultaneously classifies the solvent exposure, base-stacking and -pairing status, and conformation of individual RNA residues from their chemical shift fingerprint. Here, we briefly describe how we trained and tested the classifier and demonstrate its application to a model RNA system. CS-Annotate can be accessed via the SMALTR (Structure-based MAchine Learning Tools for RNA) Science Gateway (smaltr.org).
Collapse
Affiliation(s)
- Kexin Zhang
- Chemistry Department, University of Michigan, 930 North University Avenue, Ann Arbor, Michigan 48109, United States
| | - Kyrillos Abdallah
- Biophysics Program, University of Michigan, 930 North University Avenue, Ann Arbor, Michigan 48109, United States
| | - Pujan Ajmera
- Biophysics Program, University of Michigan, 930 North University Avenue, Ann Arbor, Michigan 48109, United States
| | - Kyle Finos
- Biophysics Program, University of Michigan, 930 North University Avenue, Ann Arbor, Michigan 48109, United States
| | - Andrew Looka
- Biophysics Program, University of Michigan, 930 North University Avenue, Ann Arbor, Michigan 48109, United States
| | - Joseph Mekhael
- Biophysics Program, University of Michigan, 930 North University Avenue, Ann Arbor, Michigan 48109, United States
| | - Aaron T Frank
- Biophysics Program, University of Michigan, 930 North University Avenue, Ann Arbor, Michigan 48109, United States
| |
Collapse
|
35
|
Jhunjhunwala A, Ali Z, Bhattacharya S, Halder A, Mitra A, Sharma P. On the Nature of Nucleobase Stacking in RNA: A Comprehensive Survey of Its Structural Variability and a Systematic Classification of Associated Interactions. J Chem Inf Model 2021; 61:1470-1480. [PMID: 33570947 DOI: 10.1021/acs.jcim.0c01225] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The astonishing diversity in folding patterns of RNA three-dimensional (3D) structures is crafted by myriads of noncovalent contacts, of which base pairing and stacking are the most prominent. A systematic and comprehensive classification and annotation of these interactions is necessary for a molecular-level understanding of their roles. However, unlike in the case of base pairing, where a widely accepted nomenclature and classification scheme exists in the public domain, currently available classification schemes for base-base stacking need major enhancements to comprehensively capture the necessary features underlying the rich stacking diversity in RNA. Here, we extend the previous stacking classification based on nucleobase interacting faces by introducing a structurally intuitive geometry-cum topology-based scheme. Specifically, a stack is first classified in terms of the geometry described by the relative orientation of the glycosidic bonds, which generates eight basic stacking geometric families for heterodimeric stacks and six of those for homodimeric stacks. Further annotation in terms of the identity of the bases and the region of involvement of purines (five-membered, six-membered, or both rings) leads to the enumeration of 384 distinct RNA base stacks. Based on our classification scheme, we present an algorithm for automated identification of stacks in RNA crystal structures and analyze the stacking context in selected RNA structures. Overall, the work described here is expected to greatly facilitate the structure-based RNA research.
Collapse
Affiliation(s)
- Ayush Jhunjhunwala
- Center for Computational Natural Sciences and Bioinformatics, International Institute of Information Technology, Gachibowli, Hyderabad, Telangana 500032, India
| | - Zakir Ali
- Computational Biochemistry Laboratory, Department of Chemistry and Centre for Advanced Studies in Chemistry, Panjab University, Chandigarh 160014, India
| | - Sohini Bhattacharya
- Center for Computational Natural Sciences and Bioinformatics, International Institute of Information Technology, Gachibowli, Hyderabad, Telangana 500032, India
| | - Antarip Halder
- Center for Computational Natural Sciences and Bioinformatics, International Institute of Information Technology, Gachibowli, Hyderabad, Telangana 500032, India
| | - Abhijit Mitra
- Center for Computational Natural Sciences and Bioinformatics, International Institute of Information Technology, Gachibowli, Hyderabad, Telangana 500032, India
| | - Purshotam Sharma
- Computational Biochemistry Laboratory, Department of Chemistry and Centre for Advanced Studies in Chemistry, Panjab University, Chandigarh 160014, India
| |
Collapse
|
36
|
Richardson KE, Kirkpatrick CC, Znosko BM. RNA CoSSMos 2.0: an improved searchable database of secondary structure motifs in RNA three-dimensional structures. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2020; 2020:5707338. [PMID: 31950189 PMCID: PMC6966092 DOI: 10.1093/database/baz153] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/24/2019] [Revised: 11/12/2019] [Accepted: 12/13/2019] [Indexed: 01/29/2023]
Abstract
The RNA Characterization of Secondary Structure Motifs, RNA CoSSMos, database is a freely accessible online database that allows users to identify secondary structure motifs among RNA 3D structures and explore their structural features. RNA CoSSMos 2.0 now requires two closing base pairs for all RNA loop motifs to create a less redundant database of secondary structures. Furthermore, RNA CoSSMos 2.0 represents an upgraded database with new features that summarize search findings and aid in the search for 3D structural patterns among RNA secondary structure motifs. Previously, users were limited to viewing search results individually, with no built-in tools to compare search results. RNA CoSSMos 2.0 provides two new features, allowing users to summarize, analyze and compare their search result findings. A function has been added to the website that calculates the average and representative structures of the search results. Additionally, users can now view a summary page of their search results that reports percentages of each structural feature found, including sugar pucker, glycosidic linkage, hydrogen bonding patterns and stacking interactions. Other upgrades include a newly embedded NGL structural viewer, the option to download the clipped structure coordinates in *.pdb format and improved NMR structure results. RNA CoSSMos 2.0 is no longer simply a search engine for a structure database; it now has the capability of analyzing, comparing and summarizing search results. Database URL: http://rnacossmos.com
Collapse
Affiliation(s)
- Katherine E Richardson
- Saint Louis University, Department of Chemistry, 3501 Laclede Avenue, St. Louis, MO 63103 USA
| | - Charles C Kirkpatrick
- Saint Louis University, Department of Chemistry, 3501 Laclede Avenue, St. Louis, MO 63103 USA
| | - Brent M Znosko
- Saint Louis University, Department of Chemistry, 3501 Laclede Avenue, St. Louis, MO 63103 USA
| |
Collapse
|
37
|
Abstract
Systematics is described for annotation of variations in RNA molecules. The conceptual framework is part of Variation Ontology (VariO) and facilitates depiction of types of variations, their functional and structural effects and other consequences in any RNA molecule in any organism. There are more than 150 RNA related VariO terms in seven levels, which can be further combined to generate even more complicated and detailed annotations. The terms are described together with examples, usually for variations and effects in human and in diseases. RNA variation type has two subcategories: variation classification and origin with subterms. Altogether six terms are available for function description. Several terms are available for affected RNA properties. The ontology contains also terms for structural description for affected RNA type, post-transcriptional RNA modifications, secondary and tertiary structure effects and RNA sugar variations. Together with the DNA and protein concepts and annotations, RNA terms allow comprehensive description of variations of genetic and non-genetic origin at all possible levels. The VariO annotations are readable both for humans and computer programs for advanced data integration and mining.
Collapse
Affiliation(s)
- Mauno Vihinen
- Department of Experimental Medical Science, Lund University, Lund, Sweden
| |
Collapse
|
38
|
Chen X, Khan NS, Zhang S. LocalSTAR3D: a local stack-based RNA 3D structural alignment tool. Nucleic Acids Res 2020; 48:e77. [PMID: 32496533 PMCID: PMC7367197 DOI: 10.1093/nar/gkaa453] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2020] [Revised: 05/15/2020] [Accepted: 05/27/2020] [Indexed: 11/29/2022] Open
Abstract
A fast-growing number of non-coding RNA structures have been resolved and deposited in Protein Data Bank (PDB). In contrast to the wide range of global alignment and motif search tools, there is still a lack of local alignment tools. Among all the global alignment tools for RNA 3D structures, STAR3D has become a valuable tool for its unprecedented speed and accuracy. STAR3D compares the 3D structures of RNA molecules using consecutive base-pairs (stacks) as anchors and generates an optimal global alignment. In this article, we developed a local RNA 3D structural alignment tool, named LocalSTAR3D, which was extended from STAR3D and designed to report multiple local alignments between two RNAs. The benchmarking results show that LocalSTAR3D has better accuracy and coverage than other local alignment tools. Furthermore, the utility of this tool has been demonstrated by rediscovering kink-turn motif instances, conserved domains in group II intron RNAs, and the tRNA mimicry of IRES RNAs.
Collapse
Affiliation(s)
- Xiaoli Chen
- Department of Computer Science, University of Central Florida, Orlando, FL 32816, USA
| | - Nabila Shahnaz Khan
- Department of Computer Science, University of Central Florida, Orlando, FL 32816, USA
| | - Shaojie Zhang
- Department of Computer Science, University of Central Florida, Orlando, FL 32816, USA
| |
Collapse
|
39
|
Becquey L, Angel E, Tahi F. BiORSEO: a bi-objective method to predict RNA secondary structures with pseudoknots using RNA 3D modules. Bioinformatics 2020; 36:2451-2457. [DOI: 10.1093/bioinformatics/btz962] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2019] [Revised: 11/15/2019] [Accepted: 01/02/2020] [Indexed: 11/12/2022] Open
Abstract
Abstract
Motivation
RNA loops have been modelled and clustered from solved 3D structures into ordered collections of recurrent non-canonical interactions called ‘RNA modules’, available in databases. This work explores what information from such modules can be used to improve secondary structure prediction. We propose a bi-objective method for predicting RNA secondary structures by minimizing both an energy-based and a knowledge-based potential. The tool, called BiORSEO, outputs secondary structures corresponding to the optimal solutions from the Pareto set.
Results
We compare several approaches to predict secondary structures using inserted RNA modules information: two module data sources, Rna3Dmotif and the RNA 3D Motif Atlas, and different ways to score the module insertions: module size, module complexity or module probability according to models like JAR3D and BayesPairing. We benchmark them against a large set of known secondary structures, including some state-of-the-art tools, and comment on the usefulness of the half physics-based, half data-based approach.
Availability and implementation
The software is available for download on the EvryRNA website, as well as the datasets.
Supplementary information
Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Louis Becquey
- Université Paris-Saclay, Univ Evry, IBISC, 91020, Evry, France
| | - Eric Angel
- Université Paris-Saclay, Univ Evry, IBISC, 91020, Evry, France
| | - Fariza Tahi
- Université Paris-Saclay, Univ Evry, IBISC, 91020, Evry, France
| |
Collapse
|
40
|
Magnus M, Antczak M, Zok T, Wiedemann J, Lukasiak P, Cao Y, Bujnicki JM, Westhof E, Szachniuk M, Miao Z. RNA-Puzzles toolkit: a computational resource of RNA 3D structure benchmark datasets, structure manipulation, and evaluation tools. Nucleic Acids Res 2020; 48:576-588. [PMID: 31799609 PMCID: PMC7145511 DOI: 10.1093/nar/gkz1108] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2019] [Revised: 11/06/2019] [Accepted: 11/15/2019] [Indexed: 12/12/2022] Open
Abstract
Significant improvements have been made in the efficiency and accuracy of RNA 3D structure prediction methods during the succeeding challenges of RNA-Puzzles, a community-wide effort on the assessment of blind prediction of RNA tertiary structures. The RNA-Puzzles contest has shown, among others, that the development and validation of computational methods for RNA fold prediction strongly depend on the benchmark datasets and the structure comparison algorithms. Yet, there has been no systematic benchmark set or decoy structures available for the 3D structure prediction of RNA, hindering the standardization of comparative tests in the modeling of RNA structure. Furthermore, there has not been a unified set of tools that allows deep and complete RNA structure analysis, and at the same time, that is easy to use. Here, we present RNA-Puzzles toolkit, a computational resource including (i) decoy sets generated by different RNA 3D structure prediction methods (raw, for-evaluation and standardized datasets), (ii) 3D structure normalization, analysis, manipulation, visualization tools (RNA_format, RNA_normalizer, rna-tools) and (iii) 3D structure comparison metric tools (RNAQUA, MCQ4Structures). This resource provides a full list of computational tools as well as a standard RNA 3D structure prediction assessment protocol for the community.
Collapse
Affiliation(s)
- Marcin Magnus
- International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
- ReMedy-International Research Agenda Unit, Centre of New Technologies, University of Warsaw, 02-097 Warsaw, Poland
| | - Maciej Antczak
- Institute of Computing Science & European Centre for Bioinformatics and Genomics, Poznan University of Technology, 60-965 Poznan, Poland
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Tomasz Zok
- Institute of Computing Science & European Centre for Bioinformatics and Genomics, Poznan University of Technology, 60-965 Poznan, Poland
| | - Jakub Wiedemann
- Institute of Computing Science & European Centre for Bioinformatics and Genomics, Poznan University of Technology, 60-965 Poznan, Poland
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Piotr Lukasiak
- Institute of Computing Science & European Centre for Bioinformatics and Genomics, Poznan University of Technology, 60-965 Poznan, Poland
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Yang Cao
- Center of Growth, Metabolism and Aging, Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu 610065, PR China
| | - Janusz M Bujnicki
- International Institute of Molecular and Cell Biology in Warsaw, 02-109 Warsaw, Poland
- Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, Poznan, Poland
| | - Eric Westhof
- Architecture et Réactivité de l’ARN, Université de Strasbourg, Institut de biologie moléculaire et cellulaire du CNRS, 12 allée Konrad Roentgen, 67084 Strasbourg, France
| | - Marta Szachniuk
- Institute of Computing Science & European Centre for Bioinformatics and Genomics, Poznan University of Technology, 60-965 Poznan, Poland
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Zhichao Miao
- Translational Research Institute of Brain and Brain-Like Intelligence and Department of Anesthesiology, Shanghai Fourth People's Hospital Affiliated to Tongji University School of Medicine, Shanghai 200081, China
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Cambridge CB10 1SD, UK
- Newcastle Fibrosis Research Group, Institute of Cellular Medicine, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK
| |
Collapse
|
41
|
Antczak M, Zablocki M, Zok T, Rybarczyk A, Blazewicz J, Szachniuk M. RNAvista: a webserver to assess RNA secondary structures with non-canonical base pairs. Bioinformatics 2019; 35:152-155. [PMID: 29985979 PMCID: PMC6298044 DOI: 10.1093/bioinformatics/bty609] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2018] [Accepted: 07/06/2018] [Indexed: 01/23/2023] Open
Abstract
Motivation In the study of 3D RNA structure, information about non-canonical interactions between nucleobases is increasingly important. Specialized databases support investigation of this issue based on experimental data, and several programs can annotate non-canonical base pairs in the RNA 3D structure. However, predicting the extended RNA secondary structure which describes both canonical and non-canonical interactions remains difficult. Results Here, we present RNAvista that allows predicting an extended RNA secondary structure from sequence or from the list enumerating canonical base pairs only. RNAvista is implemented as a publicly available webserver with user-friendly interface. It runs on all major web browsers. Availability and implementation http://rnavista.cs.put.poznan.pl
Collapse
Affiliation(s)
- Maciej Antczak
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland.,Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
| | - Marcin Zablocki
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland
| | - Tomasz Zok
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland
| | - Agnieszka Rybarczyk
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland.,Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
| | - Jacek Blazewicz
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland.,Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
| | - Marta Szachniuk
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland.,Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
| |
Collapse
|
42
|
Zok T, Antczak M, Zurkowski M, Popenda M, Blazewicz J, Adamiak RW, Szachniuk M. RNApdbee 2.0: multifunctional tool for RNA structure annotation. Nucleic Acids Res 2019; 46:W30-W35. [PMID: 29718468 PMCID: PMC6031003 DOI: 10.1093/nar/gky314] [Citation(s) in RCA: 70] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2018] [Accepted: 04/14/2018] [Indexed: 01/07/2023] Open
Abstract
In the field of RNA structural biology and bioinformatics, an access to correctly annotated RNA structure is of crucial importance, especially in the secondary and 3D structure predictions. RNApdbee webserver, introduced in 2014, primarily aimed to address the problem of RNA secondary structure extraction from the PDB files. Its new version, RNApdbee 2.0, is a highly advanced multifunctional tool for RNA structure annotation, revealing the relationship between RNA secondary and 3D structure given in the PDB or PDBx/mmCIF format. The upgraded version incorporates new algorithms for recognition and classification of high-ordered pseudoknots in large RNA structures. It allows analysis of isolated base pairs impact on RNA structure. It can visualize RNA secondary structures—including that of quadruplexes—with depiction of non-canonical interactions. It also annotates motifs to ease identification of stems, loops and single-stranded fragments in the input RNA structure. RNApdbee 2.0 is implemented as a publicly available webserver with an intuitive interface and can be freely accessed at http://rnapdbee.cs.put.poznan.pl/
Collapse
Affiliation(s)
- Tomasz Zok
- Institute of Computing Science, and European Centre for Bioinformatics and Genomics, Poznan University of Technology, Piotrowo 2, 60-965 Poznan, Poland.,Poznan Supercomputing and Networking Center, Jana Pawla II 10, 61-139 Poznan, Poland
| | - Maciej Antczak
- Institute of Computing Science, and European Centre for Bioinformatics and Genomics, Poznan University of Technology, Piotrowo 2, 60-965 Poznan, Poland
| | - Michal Zurkowski
- Institute of Computing Science, and European Centre for Bioinformatics and Genomics, Poznan University of Technology, Piotrowo 2, 60-965 Poznan, Poland
| | - Mariusz Popenda
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznan, Poland
| | - Jacek Blazewicz
- Institute of Computing Science, and European Centre for Bioinformatics and Genomics, Poznan University of Technology, Piotrowo 2, 60-965 Poznan, Poland.,Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznan, Poland
| | - Ryszard W Adamiak
- Institute of Computing Science, and European Centre for Bioinformatics and Genomics, Poznan University of Technology, Piotrowo 2, 60-965 Poznan, Poland.,Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznan, Poland
| | - Marta Szachniuk
- Institute of Computing Science, and European Centre for Bioinformatics and Genomics, Poznan University of Technology, Piotrowo 2, 60-965 Poznan, Poland.,Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznan, Poland
| |
Collapse
|
43
|
Reinharz V, Soulé A, Westhof E, Waldispühl J, Denise A. Mining for recurrent long-range interactions in RNA structures reveals embedded hierarchies in network families. Nucleic Acids Res 2019; 46:3841-3851. [PMID: 29608773 PMCID: PMC5934684 DOI: 10.1093/nar/gky197] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2017] [Accepted: 03/22/2018] [Indexed: 11/14/2022] Open
Abstract
The wealth of the combinatorics of nucleotide base pairs enables RNA molecules to assemble into sophisticated interaction networks, which are used to create complex 3D substructures. These interaction networks are essential to shape the 3D architecture of the molecule, and also to provide the key elements to carry molecular functions such as protein or ligand binding. They are made of organised sets of long-range tertiary interactions which connect distinct secondary structure elements in 3D structures. Here, we present a de novo data-driven approach to extract automatically from large data sets of full RNA 3D structures the recurrent interaction networks (RINs). Our methodology enables us for the first time to detect the interaction networks connecting distinct components of the RNA structure, highlighting their diversity and conservation through non-related functional RNAs. We use a graphical model to perform pairwise comparisons of all RNA structures available and to extract RINs and modules. Our analysis yields a complete catalog of RNA 3D structures available in the Protein Data Bank and reveals the intricate hierarchical organization of the RNA interaction networks and modules. We assembled our results in an online database (http://carnaval.lri.fr) which will be regularly updated. Within the site, a tool allows users with a novel RNA structure to detect automatically whether the novel structure contains previously observed RINs.
Collapse
Affiliation(s)
- Vladimir Reinharz
- Department of Computer Science, Ben-Gurion University of the Negev, P.O.B. 653 Beer-Sheva, 84105, Israel.,School of Computer Science, McGill University, 3480 University, Montreal, Quebec H3A 0E9, Canada
| | - Antoine Soulé
- School of Computer Science, McGill University, 3480 University, Montreal, Quebec H3A 0E9, Canada.,LIX, École Polytechnique, CNRS, Inria, Palaiseau 91120, France
| | - Eric Westhof
- ARN, Université de Strasbourg, IBMC-CNRS, 15 rue René Descartes, Strasbourg Cedex 67084, France
| | - Jérôme Waldispühl
- School of Computer Science, McGill University, 3480 University, Montreal, Quebec H3A 0E9, Canada
| | - Alain Denise
- LRI, Université Paris-Sud, CNRS, Université Paris-Saclay, Bâtiment 650, Orsay cedex 91405, France.,I2BC, Université Paris-Sud, CNRS, CEA, Université Paris-Saclay, Bâtiment 400, Orsay cedex 91405, France
| |
Collapse
|
44
|
Thiel BC, Beckmann IK, Kerpedjiev P, Hofacker IL. 3D based on 2D: Calculating helix angles and stacking patterns using forgi 2.0, an RNA Python library centered on secondary structure elements. F1000Res 2019; 8:ISCB Comm J-287. [PMID: 31069053 PMCID: PMC6480952 DOI: 10.12688/f1000research.18458.2] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 04/04/2019] [Indexed: 01/01/2023] Open
Abstract
We present forgi, a Python library to analyze the tertiary structure of RNA secondary structure elements. Our representation of an RNA molecule is centered on secondary structure elements (stems, bulges and loops). By fitting a cylinder to the helix axis, these elements are carried over into a coarse-grained 3D structure representation. Integration with Biopython allows for handling of all-atom 3D information. forgi can deal with a variety of file formats including dotbracket strings, PDB and MMCIF files. We can handle modified residues, missing residues, cofold and multifold structures as well as nucleotide numbers starting at arbitrary positions. We apply this library to the study of stacking helices in junctions and pseudoknots and investigate how far stacking helices in solved experimental structures can divert from coaxial geometries.
Collapse
Affiliation(s)
- Bernhard C. Thiel
- Department of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Vienna, 1090, Austria
| | - Irene K. Beckmann
- Department of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Vienna, 1090, Austria
| | - Peter Kerpedjiev
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, 02115, USA
| | - Ivo L. Hofacker
- Department of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Vienna, 1090, Austria
- Research Group Bioinformatics and Computational Biology, Faculty of Computer Science, University of Vienna, Vienna, 1090, Austria
| |
Collapse
|
45
|
Thiel BC, Beckmann IK, Kerpedjiev P, Hofacker IL. 3D based on 2D: Calculating helix angles and stacking patterns using forgi 2.0, an RNA Python library centered on secondary structure elements. F1000Res 2019; 8:ISCB Comm J-287. [PMID: 31069053 PMCID: PMC6480952 DOI: 10.12688/f1000research.18458.1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 03/06/2019] [Indexed: 10/12/2023] Open
Abstract
We present forgi, a Python library to analyze the tertiary structure of RNA secondary structure elements. Our representation of an RNA molecule is centered on secondary structure elements (stems, bulges and loops). By fitting a cylinder to the helix axis, these elements are carried over into a coarse-grained 3D structure representation. Integration with Biopython allows for handling of all-atom 3D information. forgi can deal with a variety of file formats including dotbracket strings, PDB and MMCIF files. We can handle modified residues, missing residues, cofold and multifold structures as well as nucleotide numbers starting at arbitrary positions. We apply this library to the study of stacking helices in junctions and pseudo knots and investigate how far stacking helices in solved experimental structures can divert from coaxial geometries.
Collapse
Affiliation(s)
- Bernhard C. Thiel
- Department of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Vienna, 1090, Austria
| | - Irene K. Beckmann
- Department of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Vienna, 1090, Austria
| | - Peter Kerpedjiev
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, 02115, USA
| | - Ivo L. Hofacker
- Department of Theoretical Chemistry, Faculty of Chemistry, University of Vienna, Vienna, 1090, Austria
- Research Group Bioinformatics and Computational Biology, Faculty of Computer Science, University of Vienna, Vienna, 1090, Austria
| |
Collapse
|
46
|
Bottaro S, Bussi G, Pinamonti G, Reißer S, Boomsma W, Lindorff-Larsen K. Barnaba: software for analysis of nucleic acid structures and trajectories. RNA (NEW YORK, N.Y.) 2019; 25:219-231. [PMID: 30420522 PMCID: PMC6348988 DOI: 10.1261/rna.067678.118] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/26/2018] [Accepted: 11/06/2018] [Indexed: 06/09/2023]
Abstract
RNA molecules are highly dynamic systems characterized by a complex interplay between sequence, structure, dynamics, and function. Molecular simulations can potentially provide powerful insights into the nature of these relationships. The analysis of structures and molecular trajectories of nucleic acids can be nontrivial because it requires processing very high-dimensional data that are not easy to visualize and interpret. Here we introduce Barnaba, a Python library aimed at facilitating the analysis of nucleic acid structures and molecular simulations. The software consists of a variety of analysis tools that allow the user to (i) calculate distances between three-dimensional structures using different metrics, (ii) back-calculate experimental data from three-dimensional structures, (iii) perform cluster analysis and dimensionality reductions, (iv) search three-dimensional motifs in PDB structures and trajectories, and (v) construct elastic network models for nucleic acids and nucleic acids-protein complexes. In addition, Barnaba makes it possible to calculate torsion angles, pucker conformations, and to detect base-pairing/base-stacking interactions. Barnaba produces graphics that conveniently visualize both extended secondary structure and dynamics for a set of molecular conformations. The software is available as a command-line tool as well as a library, and supports a variety of file formats such as PDB, dcd, and xtc files. Source code, documentation, and examples are freely available at https://github.com/srnas/barnaba under GNU GPLv3 license.
Collapse
Affiliation(s)
- Sandro Bottaro
- Structural Biology and NMR Laboratory and Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen 2200, Denmark
- International School for Advanced Studies, 34136 Trieste, Italy
| | - Giovanni Bussi
- International School for Advanced Studies, 34136 Trieste, Italy
| | - Giovanni Pinamonti
- International School for Advanced Studies, 34136 Trieste, Italy
- Department of Mathematics and Computer Science, Freie Universität, 14195 Berlin, Germany
| | - Sabine Reißer
- International School for Advanced Studies, 34136 Trieste, Italy
| | - Wouter Boomsma
- Department of Computer Science, University of Copenhagen, Copenhagen 2200, Denmark
| | - Kresten Lindorff-Larsen
- Structural Biology and NMR Laboratory and Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen 2200, Denmark
| |
Collapse
|
47
|
Tolosa S, Sansón J, Hidalgo A. Theoretical study of mechanisms for double proton transfer in adenine–uracil base pair via steered molecular dynamic simulations. J Mol Liq 2018. [DOI: 10.1016/j.molliq.2018.06.018] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
48
|
Williams B, Zhao B, Tandon A, Ding F, Weeks KM, Zhang Q, Dokholyan NV. Structure modeling of RNA using sparse NMR constraints. Nucleic Acids Res 2018; 45:12638-12647. [PMID: 29165648 PMCID: PMC5728392 DOI: 10.1093/nar/gkx1058] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2017] [Accepted: 10/18/2017] [Indexed: 01/04/2023] Open
Abstract
RNAs fold into distinct molecular conformations that are often essential for their functions. Accurate structure modeling of complex RNA motifs, including ubiquitous non-canonical base pairs and pseudoknots, remains a challenge. Here, we present an NMR-guided all-atom discrete molecular dynamics (DMD) platform, iFoldNMR, for rapid and accurate structure modeling of complex RNAs. We show that sparse distance constraints from imino resonances, which can be readily obtained from routine NMR experiments and easier to compile than laborious assignments of non-solvent-exchangeable protons, are sufficient to direct a DMD search for low-energy RNA conformers. Benchmarking on a set of RNAs with complex folds spanning up to 56 nucleotides in length yields structural models that recapitulate experimentally determined structures with all-heavy-atom RMSDs ranging from 2.4 to 6.5 Å. This platform represents an efficient approach for high-throughput RNA structure modeling and will facilitate analysis of diverse, newly discovered functional RNAs.
Collapse
Affiliation(s)
- Benfeard Williams
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.,Molecular and Cellular Biophysics Program, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Bo Zhao
- Molecular and Cellular Biophysics Program, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.,Department of Chemistry, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Arpit Tandon
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.,Molecular and Cellular Biophysics Program, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Feng Ding
- Department of Physics and Astronomy, Clemson University, Clemson, SC 29634, USA
| | - Kevin M Weeks
- Department of Chemistry, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Qi Zhang
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.,Molecular and Cellular Biophysics Program, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Nikolay V Dokholyan
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.,Molecular and Cellular Biophysics Program, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| |
Collapse
|
49
|
Escudero JA, Nivina A, Cambray G, López-Igual R, Loot C, Mazel D. Recoding of synonymous genes to expand evolutionary landscapes requires control of secondary structure affecting translation. Biotechnol Bioeng 2018; 115:184-191. [DOI: 10.1002/bit.26450] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2017] [Revised: 08/23/2017] [Accepted: 09/08/2017] [Indexed: 11/08/2022]
Affiliation(s)
- Jose A. Escudero
- Institut Pasteur; Unité de Plasticité du Génome Bactérien; Département Génomes et Génétique; Paris France
- CNRS; UMR3525; Paris France
- Departamento de Sanidad Animal; Facultad de Veterinaria; Universidad Complutense de Madrid; Madrid Spain
- VISAVET Health Surveillance Centre; Universidad Complutense Madrid; Madrid Spain
| | - Aleksandra Nivina
- Institut Pasteur; Unité de Plasticité du Génome Bactérien; Département Génomes et Génétique; Paris France
- CNRS; UMR3525; Paris France
- Université Paris Descartes; Sorbonne Paris Cité; Paris France
| | | | - Rocío López-Igual
- Institut Pasteur; Unité de Plasticité du Génome Bactérien; Département Génomes et Génétique; Paris France
- CNRS; UMR3525; Paris France
| | - Celine Loot
- Institut Pasteur; Unité de Plasticité du Génome Bactérien; Département Génomes et Génétique; Paris France
- CNRS; UMR3525; Paris France
| | - Didier Mazel
- Institut Pasteur; Unité de Plasticité du Génome Bactérien; Département Génomes et Génétique; Paris France
- CNRS; UMR3525; Paris France
| |
Collapse
|
50
|
Jain S, Schlick T. F-RAG: Generating Atomic Coordinates from RNA Graphs by Fragment Assembly. J Mol Biol 2017; 429:3587-3605. [PMID: 28988954 PMCID: PMC5693719 DOI: 10.1016/j.jmb.2017.09.017] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2017] [Revised: 09/12/2017] [Accepted: 09/22/2017] [Indexed: 10/18/2022]
Abstract
Coarse-grained models represent attractive approaches to analyze and simulate ribonucleic acid (RNA) molecules, for example, for structure prediction and design, as they simplify the RNA structure to reduce the conformational search space. Our structure prediction protocol RAGTOP (RNA-As-Graphs Topology Prediction) represents RNA structures as tree graphs and samples graph topologies to produce candidate graphs. However, for a more detailed study and analysis, construction of atomic from coarse-grained models is required. Here we present our graph-based fragment assembly algorithm (F-RAG) to convert candidate three-dimensional (3D) tree graph models, produced by RAGTOP into atomic structures. We use our related RAG-3D utilities to partition graphs into subgraphs and search for structurally similar atomic fragments in a data set of RNA 3D structures. The fragments are edited and superimposed using common residues, full atomic models are scored using RAGTOP's knowledge-based potential, and geometries of top scoring models is optimized. To evaluate our models, we assess all-atom RMSDs and Interaction Network Fidelity (a measure of residue interactions) with respect to experimentally solved structures and compare our results to other fragment assembly programs. For a set of 50 RNA structures, we obtain atomic models with reasonable geometries and interactions, particularly good for RNAs containing junctions. Additional improvements to our protocol and databases are outlined. These results provide a good foundation for further work on RNA structure prediction and design applications.
Collapse
Affiliation(s)
- Swati Jain
- Department of Chemistry, New York University, 1001 Silver, 100 Washington Square East, New York, NY 10003, USA
| | - Tamar Schlick
- Department of Chemistry, New York University, 1001 Silver, 100 Washington Square East, New York, NY 10003, USA; Courant Institute of Mathematical Sciences, New York University, 251 Mercer Street, New York, NY 10012, USA; New York University-East China Normal University Center for Computational Chemistry at New York University Shanghai, Room 340, Geography Building, North Zhongshan Road, 3663 Shanghai, China.
| |
Collapse
|