1
Johnson PZ, Simon AE. RNAcanvas: interactive drawing and exploration of nucleic acid structures. Nucleic Acids Res 2023:7137443. [PMID: 37094080 DOI: 10.1093/nar/gkad302] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Received: 02/20/2023] [Revised: 03/15/2023] [Accepted: 04/21/2023] [Indexed: 04/26/2023] Open
Abstract
Two-dimensional drawing of nucleic acid structures, particularly RNA structures, is fundamental to the communication of nucleic acids research. However, manually drawing structures is laborious and infeasible for structures thousands of nucleotides long. RNAcanvas automatically arranges residues into strictly shaped stems and loops while providing robust interactive editing features, including click-and-drag layout adjustment. Drawn elements are highly customizable in a point-and-click manner, including colours, fonts, size and shading, flexible numbering, and outlining of bases. Tertiary interactions can be drawn as draggable, curved lines. Leontis-Westhof notation for depicting non-canonical base-pairs is fully supported, as well as text labels for structural features (e.g. hairpins). RNAcanvas also has many unique features and performance optimizations for large structures that cannot be correctly predicted and require manual refinement based on the researcher's own analyses and expertise. To this end, RNAcanvas has point-and-click structure editing with real-time highlighting of complementary sequences and motif search functionality, novel features that greatly aid in the identification of putative long-range tertiary interactions, de novo analysis of local structures, and phylogenetic comparisons. For ease in producing publication quality figures, drawings can be exported in both SVG and PowerPoint formats. URL: https://rnacanvas.app.
Affiliation(s)
- Philip Z Johnson
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD 20742, USA
- Anne E Simon
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD 20742, USA
2
Sweeney BA, Hoksza D, Nawrocki EP, Ribas CE, Madeira F, Cannone JJ, Gutell R, Maddala A, Meade CD, Williams LD, Petrov AS, Chan PP, Lowe TM, Finn RD, Petrov AI. R2DT is a framework for predicting and visualising RNA secondary structure using templates. Nat Commun 2021; 12:3494. [PMID: 34108470 PMCID: PMC8190129 DOI: 10.1038/s41467-021-23555-5] [Citation(s) in RCA: 56] [Impact Index Per Article: 18.7] [Received: 09/28/2020] [Accepted: 05/04/2021] [Indexed: 02/05/2023] Open
Abstract
Non-coding RNAs (ncRNA) are essential for all life, and their functions often depend on their secondary (2D) and tertiary structure. Despite the abundance of software for the visualisation of ncRNAs, few automatically generate consistent and recognisable 2D layouts, which makes it challenging for users to construct, compare and analyse structures. Here, we present R2DT, a method for predicting and visualising a wide range of RNA structures in standardised layouts. R2DT is based on a library of 3,647 templates representing the majority of known structured RNAs. R2DT has been applied to ncRNA sequences from the RNAcentral database and produced >13 million diagrams, creating the world's largest RNA 2D structure dataset. The software is amenable to community expansion and is freely available at https://github.com/rnacentral/R2DT, and a web server is found at https://rnacentral.org/r2dt.
Affiliation(s)
- Blake A Sweeney
- European Molecular Biology Laboratory, European Bioinformatics Institute, Cambridge, UK
- David Hoksza
- Department of Software Engineering, Faculty of Mathematics and Physics, Charles University, Prague, Czech Republic
- Eric P Nawrocki
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
- Carlos Eduardo Ribas
- European Molecular Biology Laboratory, European Bioinformatics Institute, Cambridge, UK
- Fábio Madeira
- European Molecular Biology Laboratory, European Bioinformatics Institute, Cambridge, UK
- Jamie J Cannone
- Department of Integrative Biology, The University of Texas at Austin, Austin, TX, USA
- Robin Gutell
- Department of Integrative Biology, The University of Texas at Austin, Austin, TX, USA
- Aparna Maddala
- School of Chemistry and Biochemistry, Center for the Origins of Life, Georgia Institute of Technology, Atlanta, GA, USA
- Caeden D Meade
- School of Chemistry and Biochemistry, Center for the Origins of Life, Georgia Institute of Technology, Atlanta, GA, USA
- Loren Dean Williams
- School of Chemistry and Biochemistry, Center for the Origins of Life, Georgia Institute of Technology, Atlanta, GA, USA
- Anton S Petrov
- School of Chemistry and Biochemistry, Center for the Origins of Life, Georgia Institute of Technology, Atlanta, GA, USA
- Patricia P Chan
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA, USA
- Todd M Lowe
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA, USA
- Robert D Finn
- European Molecular Biology Laboratory, European Bioinformatics Institute, Cambridge, UK
- Anton I Petrov
- European Molecular Biology Laboratory, European Bioinformatics Institute, Cambridge, UK
3
Spaccasassi C, Lakin MR, Phillips A. A Logic Programming Language for Computational Nucleic Acid Devices. ACS Synth Biol 2019; 8:1530-1547. [PMID: 30372611 DOI: 10.1021/acssynbio.8b00229] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Indexed: 01/04/2023]
Abstract
Computational nucleic acid devices show great potential for enabling a broad range of biotechnology applications, including smart probes for molecular biology research, in vitro assembly of complex compounds, high-precision in vitro disease diagnosis and, ultimately, computational theranostics inside living cells. This diversity of applications is supported by a range of implementation strategies, including nucleic acid strand displacement, localization to substrates, and the use of enzymes with polymerase, nickase, and exonuclease functionality. However, existing computational design tools are unable to account for these strategies in a unified manner. This paper presents a logic programming language that allows a broad range of computational nucleic acid systems to be designed and analyzed. The language extends standard logic programming with a novel equational theory to express nucleic acid molecular motifs. It automatically identifies matching motifs present in the full system, in order to apply a specified transformation expressed as a logical rule. The language supports the definition of logic predicates, which provide constraints that need to be satisfied in order for a given rule to be applied. The language is sufficiently expressive to encode the semantics of nucleic acid strand displacement systems with complex topologies, together with computation performed by a broad range of enzymes, and is readily extensible to new implementation strategies. Our approach lays the foundation for a unifying framework for the design of computational nucleic acid devices.
Affiliation(s)
- Matthew R. Lakin
- Department of Computer Science, University of New Mexico, Albuquerque, New Mexico 87131, United States
- Center for Biomedical Engineering, University of New Mexico, Albuquerque, New Mexico 87131, United States
4
Shabash B, Wiese KC. jViz.RNA 4.0-Visualizing pseudoknots and RNA editing employing compressed tree graphs. PLoS One 2019; 14:e0210281. [PMID: 31059508 PMCID: PMC6502502 DOI: 10.1371/journal.pone.0210281] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Received: 08/12/2018] [Accepted: 12/19/2018] [Indexed: 11/18/2022] Open
Abstract
Previously, we have introduced an improved version of jViz.RNA which enabled faster and more stable RNA visualization by employing compressed tree graphs. However, the new RNA representation and visualization method required a sophisticated mechanism of pseudoknot visualization. In this work, we present our novel pseudoknot classification and implementation of pseudoknot visualization in the context of the new RNA graph model. We then compare our approach with other RNA visualization software, and demonstrate jViz.RNA 4.0's benefits compared to other software. Additionally, we introduce interactive editing functionality into jViz.RNA and demonstrate its benefits in exploring and building RNA structures. The results presented highlight the new high degree of utility jViz.RNA 4.0 now offers. Users are now able to visualize pseudoknotted RNA, manipulate the resulting automatic layouts to suit their individual needs, and change both positioning and connectivity of the RNA molecules examined. Care was taken to limit overlap between structural elements, particularly in the case of pseudoknots to ensure an intuitive and informative layout of the final RNA structure. Availability: The software is freely available at: https://jviz.cs.sfu.ca/.
Affiliation(s)
- Boris Shabash
- School of Computing Science, Simon Fraser University, Burnaby, British Columbia, Canada
- Kay C. Wiese
- School of Computing Science, Simon Fraser University, Burnaby, British Columbia, Canada
5
Abstract
Motivation
RNA secondary structure is a useful representation for studying the function of RNA, which captures most of the free energy of RNA folding. Using empirically determined energy parameters, secondary structures of nucleic acids can be efficiently computed by recursive algorithms. Several software packages supporting this task are readily available. As RNA secondary structures are outerplanar graphs, they can be drawn without intersection in the plane. Interpretation by the practitioner is eased when these drawings conform to a series of additional constraints beyond outerplanarity. These constraints are the reason why RNA drawing is difficult. Many RNA drawing algorithms therefore do not always produce intersection-free (outerplanar) drawings.
Results
To remedy this shortcoming we propose here the RNApuzzler algorithm which is guaranteed to produce intersection-free drawings. It is based on a drawing algorithm respecting constraints based on nucleotide distances (RNAturtle). We investigate relaxations of these constraints allowing for intersection-free drawings. Based on these relaxations, we implemented a fully automated, simple, and robust algorithm that produces aesthetic drawings adhering to previously established guidelines. We tested our algorithm using the RFAM database and found that we can compute intersection-free drawings of all RNAs therein efficiently.
Availability and implementation
The software can be accessed freely at: https://github.com/dwiegreffe/RNApuzzler.
Supplementary information
Supplementary data are available at Bioinformatics online.
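The outerplanarity claim in the abstract above rests on base pairs in a secondary structure being nested, so that no two pairs cross. The following is a minimal Python sketch of that property only, assuming a plain dot-bracket string that uses just `(`, `)` and `.`; the helper names `base_pairs` and `has_crossing` are hypothetical and are not part of RNApuzzler or RNAturtle.

```python
def base_pairs(dot_bracket: str) -> list[tuple[int, int]]:
    """Return sorted (i, j) index pairs from a dot-bracket string.

    Assumption: only '(', ')' and '.' are used, so the notation itself
    cannot express pseudoknots.
    """
    stack: list[int] = []
    pairs: list[tuple[int, int]] = []
    for j, symbol in enumerate(dot_bracket):
        if symbol == "(":
            stack.append(j)
        elif symbol == ")":
            if not stack:
                raise ValueError(f"unmatched ')' at position {j}")
            pairs.append((stack.pop(), j))
        elif symbol != ".":
            raise ValueError(f"unexpected symbol {symbol!r} at position {j}")
    if stack:
        raise ValueError(f"unmatched '(' at position {stack[-1]}")
    return sorted(pairs)


def has_crossing(pairs: list[tuple[int, int]]) -> bool:
    """Return True if two pairs (i, j) and (k, l) satisfy i < k < j < l."""
    ordered = sorted(pairs)
    for a, (i, j) in enumerate(ordered):
        for k, l in ordered[a + 1:]:
            if i < k < j < l:
                return True
    return False


if __name__ == "__main__":
    nested = base_pairs("((..))..(...)")
    print(nested, has_crossing(nested))
    # A hand-written pair list with a crossing, as in a pseudoknot.
    print(has_crossing([(0, 5), (3, 8)]))
```

Pairs parsed from a well-formed dot-bracket string are nested by construction, so `has_crossing` only reports `True` for pair lists supplied from another source. The quadratic check is chosen for clarity, not speed.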
Affiliation(s)
- Daniel Wiegreffe
- Image and Signal Processing Group, Department of Computer Science, Leipzig University, Leipzig, Germany
- Bioinformatics Group, Department of Computer Science, Leipzig University, Leipzig, Germany
- Daniel Alexander
- Image and Signal Processing Group, Department of Computer Science, Leipzig University, Leipzig, Germany
- Peter F Stadler
- Bioinformatics Group, Department of Computer Science, Leipzig University, Leipzig, Germany
- Dirk Zeckzer
- Image and Signal Processing Group, Department of Computer Science, Leipzig University, Leipzig, Germany
6
Wiegreffe D, Alexander D, Stadler PF, Zeckzer D. RNApuzzler: efficient outerplanar drawing of RNA-secondary structures. Bioinformatics 2018; 35:1342-1349. [DOI: 10.1093/bioinformatics/bty817] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Received: 04/09/2018] [Revised: 08/09/2018] [Accepted: 09/18/2018] [Indexed: 11/12/2022] Open
Affiliation(s)
- Daniel Wiegreffe
- Image and Signal Processing Group, Department of Computer Science, Leipzig University, Leipzig, Germany
- Bioinformatics Group, Department of Computer Science, Leipzig University, Leipzig, Germany
- Daniel Alexander
- Image and Signal Processing Group, Department of Computer Science, Leipzig University, Leipzig, Germany
- Peter F Stadler
- Bioinformatics Group, Department of Computer Science, Leipzig University, Leipzig, Germany
- Dirk Zeckzer
- Image and Signal Processing Group, Department of Computer Science, Leipzig University, Leipzig, Germany
7
Shabash B, Wiese KC. Numerical integration methods and layout improvements in the context of dynamic RNA visualization. BMC Bioinformatics 2017; 18:282. [PMID: 28558664 PMCID: PMC5450055 DOI: 10.1186/s12859-017-1682-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Received: 02/09/2017] [Accepted: 05/11/2017] [Indexed: 11/16/2022] Open
Abstract
Background
RNA visualization software tools have traditionally presented a static visualization of RNA molecules with limited ability for users to interact with the resulting image once it is complete. Only a few tools allowed for dynamic structures. One such tool is jViz.RNA. Currently, jViz.RNA employs a unique method for the creation of the RNA molecule layout by mapping the RNA nucleotides into vertexes in a graph, which we call the detailed graph, and then utilizes a Newtonian mechanics inspired system of forces to calculate a layout for the RNA molecule. The work presented here focuses on improvements to jViz.RNA that allow the drawing of RNA secondary structures according to common drawing conventions, as well as dramatic run-time performance improvements. This is done first by presenting an alternative method for mapping the RNA molecule into a graph, which we call the compressed graph, and then employing advanced numerical integration methods for the compressed graph representation.
Results
Comparing the compressed graph and detailed graph implementations, we find that the compressed graph produces results more consistent with RNA drawing conventions. However, we also find that employing the compressed graph method requires a more sophisticated initial layout to produce visualizations that would require minimal user interference. Comparing the two numerical integration methods demonstrates the higher stability of the Backward Euler method, and its resulting ability to handle much larger time steps, a high priority feature for any software which entails user interaction.
Conclusion
The work in this manuscript presents the preferred use of compressed graphs to detailed ones, as well as the advantages of employing the Backward Euler method over the Forward Euler method. These improvements produce more stable as well as visually aesthetic representations of the RNA secondary structures. The results presented demonstrate that both the compressed graph representation, as well as the Backward Euler integrator, greatly enhance the run-time performance and usability. The newest iteration of jViz.RNA is available at https://jviz.cs.sfu.ca/download/download.html.
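The stability difference between the two integrators named in this abstract can be seen on a toy problem. The sketch below is not jViz.RNA's force model: it assumes the scalar test equation dx/dt = -k·x (a single stiff restoring term) and uses hypothetical function names, purely to illustrate why Backward Euler tolerates time steps at which Forward Euler diverges.

```python
def forward_euler(x0: float, k: float, h: float, steps: int) -> float:
    """Explicit update x_{n+1} = x_n + h * (-k * x_n) = (1 - h*k) * x_n."""
    x = x0
    for _ in range(steps):
        x = x + h * (-k * x)
    return x


def backward_euler(x0: float, k: float, h: float, steps: int) -> float:
    """Implicit update x_{n+1} = x_n + h * (-k * x_{n+1}).

    For this linear test equation the implicit step has the closed form
    x_{n+1} = x_n / (1 + h*k), so no iterative solver is needed.
    """
    x = x0
    for _ in range(steps):
        x = x / (1.0 + h * k)
    return x


if __name__ == "__main__":
    # Assumed illustrative values: stiffness k = 10 and a large step h = 0.5,
    # so h*k = 5 exceeds the Forward Euler stability limit of 2.
    k, h, steps = 10.0, 0.5, 20
    print("forward :", forward_euler(1.0, k, h, steps))   # magnitude grows
    print("backward:", backward_euler(1.0, k, h, steps))  # decays toward 0
```

With h·k > 2 the explicit factor (1 - h·k) has magnitude above one and the iterate oscillates with growing amplitude, whereas the implicit factor 1/(1 + h·k) stays below one for every positive step. A force-directed layout with many coupled vertices would need a linear or nonlinear solve per implicit step, which this scalar sketch sidesteps.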
Affiliation(s)
- Boris Shabash
- School of Computing Science, Simon Fraser University, 8888 University Drive, Burnaby, BC, Canada
- Kay C Wiese
- School of Computing Science, Simon Fraser University, 8888 University Drive, Burnaby, BC, Canada