1
|
Narunsky A, Higgs GA, Torres BM, Yu D, de Andrade GB, Kavita K, Breaker RR. The discovery of novel noncoding RNAs in 50 bacterial genomes. Nucleic Acids Res 2024; 52:5152-5165. [PMID: 38647067 PMCID: PMC11109978 DOI: 10.1093/nar/gkae248] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2023] [Revised: 03/20/2024] [Accepted: 04/18/2024] [Indexed: 04/25/2024] Open
Abstract
Structured noncoding RNAs (ncRNAs) contribute to many important cellular processes involving chemical catalysis, molecular recognition and gene regulation. Few ncRNA classes are broadly distributed among organisms from all three domains of life, but the list of rarer classes that exhibit surprisingly diverse functions is growing. We previously developed a computational pipeline that enables the near-comprehensive identification of structured ncRNAs expressed from individual bacterial genomes. The regions between protein coding genes are first sorted based on length and the fraction of guanosine and cytidine nucleotides. Long, GC-rich intergenic regions are then examined for sequence and structural similarity to other bacterial genomes. Herein, we describe the implementation of this pipeline on 50 bacterial genomes from varied phyla. More than 4700 candidate intergenic regions with the desired characteristics were identified, which yielded 44 novel riboswitch candidates and numerous other putative ncRNA motifs. Although experimental validation studies have yet to be conducted, this rate of riboswitch candidate discovery is consistent with predictions that many hundreds of novel riboswitch classes remain to be discovered among the bacterial species whose genomes have already been sequenced. Thus, many thousands of additional novel ncRNA classes likely remain to be discovered in the bacterial domain of life.
Collapse
Affiliation(s)
- Aya Narunsky
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, CT 06511, USA
| | - Gadareth A Higgs
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, CT 06511, USA
| | - Blake M Torres
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, CT 06511, USA
| | - Diane Yu
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, CT 06511, USA
| | - Gabriel Belem de Andrade
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, CT 06511, USA
| | - Kumari Kavita
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, CT 06511, USA
| | - Ronald R Breaker
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, CT 06511, USA
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06511, USA
- Howard Hughes Medical Institute, Yale University, New Haven, CT 06511, USA
| |
Collapse
|
2
|
Rasmussen RA, Wang S, Camarillo JM, Sosnowski V, Cho BK, Goo Y, Lucks J, O’Halloran T. Zur and zinc increase expression of E. coli ribosomal protein L31 through RNA-mediated repression of the repressor L31p. Nucleic Acids Res 2022; 50:12739-12753. [PMID: 36533433 PMCID: PMC9825181 DOI: 10.1093/nar/gkac1086] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2022] [Revised: 10/11/2022] [Accepted: 12/12/2022] [Indexed: 12/23/2022] Open
Abstract
Bacteria can adapt in response to numerous stress conditions. One such stress condition is zinc depletion. The zinc-sensing transcription factor Zur regulates the way numerous bacterial species respond to severe changes in zinc availability. Under zinc sufficient conditions, Zn-loaded Zur (Zn2-Zur) is well-known to repress transcription of genes encoding zinc uptake transporters and paralogues of a few ribosomal proteins. Here, we report the discovery and mechanistic basis for the ability of Zur to up-regulate expression of the ribosomal protein L31 in response to zinc in E. coli. Through genetic mutations and reporter gene assays, we find that Zur achieves the up-regulation of L31 through a double repression cascade by which Zur first represses the transcription of L31p, a zinc-lacking paralogue of L31, which in turn represses the translation of L31. Mutational analyses show that translational repression by L31p requires an RNA hairpin structure within the l31 mRNA and involves the N-terminus of the L31p protein. This work uncovers a new genetic network that allows bacteria to respond to host-induced nutrient limiting conditions through a sophisticated ribosomal protein switching mechanism.
Collapse
Affiliation(s)
- Rebecca A Rasmussen
- Interdisciplinary Biological Sciences Graduate Program, Northwestern University, Evanston, IL 60208, USA
- Chemistry of Life Process Institute, Northwestern University, Evanston, IL 60208, USA
| | - Suning Wang
- Chemistry of Life Process Institute, Northwestern University, Evanston, IL 60208, USA
- Department of Chemistry, Northwestern University, Evanston, IL 60208, USA
| | - Jeannie M Camarillo
- Northwestern Proteomics Core, Northwestern University, Evanston, IL 60208, USA
| | - Victoria Sosnowski
- Northwestern Proteomics Core, Northwestern University, Evanston, IL 60208, USA
| | - Byoung-Kyu Cho
- Northwestern Proteomics Core, Northwestern University, Evanston, IL 60208, USA
- Mass Spectrometry Technology Access Center, Washington University in St Louis, School of Medicine, USA
| | - Young Ah Goo
- Northwestern Proteomics Core, Northwestern University, Evanston, IL 60208, USA
- Mass Spectrometry Technology Access Center, Washington University in St Louis, School of Medicine, USA
| | - Julius B Lucks
- Interdisciplinary Biological Sciences Graduate Program, Northwestern University, Evanston, IL 60208, USA
- Chemistry of Life Process Institute, Northwestern University, Evanston, IL 60208, USA
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL 60208, USA
- Center for Synthetic Biology, Northwestern University, Evanston, IL 60208, USA
| | - Thomas V O’Halloran
- Chemistry of Life Process Institute, Northwestern University, Evanston, IL 60208, USA
- Department of Chemistry, Northwestern University, Evanston, IL 60208, USA
- Department of Chemistry, Michigan State University, East Lansing, MI 48824, USA
- Department of Microbiology & Molecular Genetics, Michigan State University, East Lansing, MI 48824, USA
| |
Collapse
|
3
|
Wu Y, Zhang D, Duan A, Ao Y, Li S. The application of riboswitch sequencing for human gut bacterial classification and identification. Mol Phylogenet Evol 2022; 169:107409. [PMID: 35063674 DOI: 10.1016/j.ympev.2022.107409] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2021] [Revised: 11/29/2021] [Accepted: 11/29/2021] [Indexed: 11/15/2022]
Abstract
Bacteria, especially gut bacteria play important roles in human health and diseases. The classification of many bacterial genera by the 16S ribosomal RNA (rRNA) has failed due to its low inter-species resolution. Given the wide distribution of riboswitches in bacteria, they may help 16S rRNA differentiate closely related species. We found that among 28 groups of species that could not be distinguished by 16S rRNA, eight of them could be separated by the TPP riboswitch and other riboswitches. Moreover, the species in the 16S rRNA database and these riboswitch databases overlap, therefore, using riboswitch databases can help 16S rRNA better identify species. In addition, we used Klenow DNA polymerase and a pair of short primers to facilitate the library construction of TPP riboswitches for sequencing. The sequencing results showed that the TPP riboswitch could detect the major phyla similar to those detected by 16S rRNA. Therefore, the TPP riboswitch and other riboswitch classes could potentially be applied to gut bacteria classification.
Collapse
Affiliation(s)
- Yaoyao Wu
- Medical School, Engineering Research Center of Molecular Medicine of Ministry of Education, Key Laboratory of Precision Medicine and Molecular Diagnosis of Fujian Universities, Institute of Genomics, School of Biomedical Sciences, Huaqiao University, Xiamen, 361021, P. R. China
| | - Deying Zhang
- Medical School, Engineering Research Center of Molecular Medicine of Ministry of Education, Key Laboratory of Precision Medicine and Molecular Diagnosis of Fujian Universities, Institute of Genomics, School of Biomedical Sciences, Huaqiao University, Xiamen, 361021, P. R. China; State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, 510627, P. R. China(2)
| | - Anqi Duan
- Medical School, Engineering Research Center of Molecular Medicine of Ministry of Education, Key Laboratory of Precision Medicine and Molecular Diagnosis of Fujian Universities, Institute of Genomics, School of Biomedical Sciences, Huaqiao University, Xiamen, 361021, P. R. China
| | - Yaqi Ao
- Medical School, Engineering Research Center of Molecular Medicine of Ministry of Education, Key Laboratory of Precision Medicine and Molecular Diagnosis of Fujian Universities, Institute of Genomics, School of Biomedical Sciences, Huaqiao University, Xiamen, 361021, P. R. China
| | - Sanshu Li
- Medical School, Engineering Research Center of Molecular Medicine of Ministry of Education, Key Laboratory of Precision Medicine and Molecular Diagnosis of Fujian Universities, Institute of Genomics, School of Biomedical Sciences, Huaqiao University, Xiamen, 361021, P. R. China.
| |
Collapse
|
4
|
Weinberg CE, Olzog VJ, Eckert I, Weinberg Z. Identification of over 200-fold more hairpin ribozymes than previously known in diverse circular RNAs. Nucleic Acids Res 2021; 49:6375-6388. [PMID: 34096583 PMCID: PMC8216279 DOI: 10.1093/nar/gkab454] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2021] [Accepted: 05/12/2021] [Indexed: 11/18/2022] Open
Abstract
Self-cleaving ribozymes are catalytic RNAs that cut themselves at a specific inter-nucleotide linkage. They serve as a model of RNA catalysis, and as an important tool in biotechnology. For most of the nine known structural classes of self-cleaving ribozymes, at least hundreds of examples are known, and some are present in multiple domains of life. By contrast, only four unique examples of the hairpin ribozyme class are known, despite its discovery in 1986. We bioinformatically predicted 941 unique hairpin ribozymes of a different permuted form from the four previously known hairpin ribozymes, and experimentally confirmed several diverse predictions. These results profoundly expand the number of natural hairpin ribozymes, enabling biochemical analysis based on natural sequences, and suggest that a distinct permuted form is more biologically relevant. Moreover, all novel hairpins were discovered in metatranscriptomes. They apparently reside in RNA molecules that vary both in size—from 381 to 5170 nucleotides—and in protein content. The RNA molecules likely replicate as circular single-stranded RNAs, and potentially provide a dramatic increase in diversity of such RNAs. Moreover, these organisms have eluded previous attempts to isolate RNA viruses from metatranscriptomes—suggesting a significant untapped universe of viruses or other organisms hidden within metatranscriptome sequences.
Collapse
Affiliation(s)
- Christina E Weinberg
- Institute for Biochemistry, Leipzig University, Brüderstraße 34, 04103 Leipzig, Germany
| | - V Janett Olzog
- Institute for Biochemistry, Leipzig University, Brüderstraße 34, 04103 Leipzig, Germany
| | - Iris Eckert
- Bioinformatics Group, Department of Computer Science and Interdisciplinary Centre for Bioinformatics, Leipzig University, Härtelstraße 16-18, 04107 Leipzig, Germany
| | - Zasha Weinberg
- Bioinformatics Group, Department of Computer Science and Interdisciplinary Centre for Bioinformatics, Leipzig University, Härtelstraße 16-18, 04107 Leipzig, Germany
| |
Collapse
|
5
|
Gao W, Jones TA, Rivas E. Discovery of 17 conserved structural RNAs in fungi. Nucleic Acids Res 2021; 49:6128-6143. [PMID: 34086938 PMCID: PMC8216456 DOI: 10.1093/nar/gkab355] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2021] [Revised: 03/25/2021] [Accepted: 04/21/2021] [Indexed: 11/13/2022] Open
Abstract
Many non-coding RNAs with known functions are structurally conserved: their intramolecular secondary and tertiary interactions are maintained across evolutionary time. Consequently, the presence of conserved structure in multiple sequence alignments can be used to identify candidate functional non-coding RNAs. Here, we present a bioinformatics method that couples iterative homology search with covariation analysis to assess whether a genomic region has evidence of conserved RNA structure. We used this method to examine all unannotated regions of five well-studied fungal genomes (Saccharomyces cerevisiae, Candida albicans, Neurospora crassa, Aspergillus fumigatus, and Schizosaccharomyces pombe). We identified 17 novel structurally conserved non-coding RNA candidates, which include four H/ACA box small nucleolar RNAs, four intergenic RNAs and nine RNA structures located within the introns and untranslated regions (UTRs) of mRNAs. For the two structures in the 3' UTRs of the metabolic genes GLY1 and MET13, we performed experiments that provide evidence against them being eukaryotic riboswitches.
Collapse
Affiliation(s)
- William Gao
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, USA
| | - Thomas A Jones
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, USA
- Howard Hughes Medical Institute, Harvard University, Cambridge, USA
| | - Elena Rivas
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, USA
| |
Collapse
|
6
|
Hou L, Xie J, Wu Y, Wang J, Duan A, Ao Y, Liu X, Yu X, Yan H, Perreault J, Li S. Identification of 11 candidate structured noncoding RNA motifs in humans by comparative genomics. BMC Genomics 2021; 22:164. [PMID: 33750298 PMCID: PMC7941889 DOI: 10.1186/s12864-021-07474-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2020] [Accepted: 02/24/2021] [Indexed: 11/12/2022] Open
Abstract
Background Only 1.5% of the human genome encodes proteins, while large part of the remaining encodes noncoding RNAs (ncRNA). Many ncRNAs form structures and perform many important functions. Accurately identifying structured ncRNAs in the human genome and discovering their biological functions remain a major challenge. Results Here, we have established a pipeline (CM-line) with the following features for analyzing the large genomes of humans and other animals. First, we selected species with larger genetic distances to facilitate the discovery of covariations and compatible mutations. Second, we used CMfinder, which can generate useful alignments even with low sequence conservation. Third, we removed repetitive sequences and known structured ncRNAs to reduce the workload of CMfinder. Fourth, we used Infernal to find more representatives and refine the structure. We reported 11 classes of structured ncRNA candidates with significant covariations in humans. Functional analysis showed that these ncRNAs may have variable functions. Some may regulate circadian clock genes through poly (A) signals (PAS); some may regulate the elongation factor (EEF1A) and the T-cell receptor signaling pathway by cooperating with RNA binding proteins. Conclusions By searching for important features of RNA structure from large genomes, the CM-line has revealed the existence of a variety of novel structured ncRNAs. Functional analysis suggests that some newly discovered ncRNA motifs may have biological functions. The pipeline we have established for the discovery of structured ncRNAs and the identification of their functions can also be applied to analyze other large genomes. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-021-07474-9.
Collapse
Affiliation(s)
- Lijuan Hou
- Medical School, Molecular Medicine Engineering and Research Center of Ministry of Education, Key Laboratory of Precision Medicine and Molecular Diagnosis of Fujian Universities, Institute of Genomics, School of Biomedical Sciences, Huaqiao University, Xiamen, 361021, P. R. China
| | - Jin Xie
- Medical School, Molecular Medicine Engineering and Research Center of Ministry of Education, Key Laboratory of Precision Medicine and Molecular Diagnosis of Fujian Universities, Institute of Genomics, School of Biomedical Sciences, Huaqiao University, Xiamen, 361021, P. R. China
| | - Yaoyao Wu
- Medical School, Molecular Medicine Engineering and Research Center of Ministry of Education, Key Laboratory of Precision Medicine and Molecular Diagnosis of Fujian Universities, Institute of Genomics, School of Biomedical Sciences, Huaqiao University, Xiamen, 361021, P. R. China
| | - Jiaojiao Wang
- Medical School, Molecular Medicine Engineering and Research Center of Ministry of Education, Key Laboratory of Precision Medicine and Molecular Diagnosis of Fujian Universities, Institute of Genomics, School of Biomedical Sciences, Huaqiao University, Xiamen, 361021, P. R. China
| | - Anqi Duan
- Medical School, Molecular Medicine Engineering and Research Center of Ministry of Education, Key Laboratory of Precision Medicine and Molecular Diagnosis of Fujian Universities, Institute of Genomics, School of Biomedical Sciences, Huaqiao University, Xiamen, 361021, P. R. China
| | - Yaqi Ao
- Medical School, Molecular Medicine Engineering and Research Center of Ministry of Education, Key Laboratory of Precision Medicine and Molecular Diagnosis of Fujian Universities, Institute of Genomics, School of Biomedical Sciences, Huaqiao University, Xiamen, 361021, P. R. China
| | - Xuejiao Liu
- Medical School, Molecular Medicine Engineering and Research Center of Ministry of Education, Key Laboratory of Precision Medicine and Molecular Diagnosis of Fujian Universities, Institute of Genomics, School of Biomedical Sciences, Huaqiao University, Xiamen, 361021, P. R. China
| | - Xinmei Yu
- Medical School, Molecular Medicine Engineering and Research Center of Ministry of Education, Key Laboratory of Precision Medicine and Molecular Diagnosis of Fujian Universities, Institute of Genomics, School of Biomedical Sciences, Huaqiao University, Xiamen, 361021, P. R. China
| | - Hui Yan
- Medical School, Molecular Medicine Engineering and Research Center of Ministry of Education, Key Laboratory of Precision Medicine and Molecular Diagnosis of Fujian Universities, Institute of Genomics, School of Biomedical Sciences, Huaqiao University, Xiamen, 361021, P. R. China
| | - Jonathan Perreault
- INRS - Institut Armand-Frappier, 531 boul des Prairies, Laval, Québec, H7V1B7, Canada
| | - Sanshu Li
- Medical School, Molecular Medicine Engineering and Research Center of Ministry of Education, Key Laboratory of Precision Medicine and Molecular Diagnosis of Fujian Universities, Institute of Genomics, School of Biomedical Sciences, Huaqiao University, Xiamen, 361021, P. R. China.
| |
Collapse
|
7
|
Kalvari I, Nawrocki EP, Ontiveros-Palacios N, Argasinska J, Lamkiewicz K, Marz M, Griffiths-Jones S, Toffano-Nioche C, Gautheret D, Weinberg Z, Rivas E, Eddy SR, Finn RD, Bateman A, Petrov AI. Rfam 14: expanded coverage of metagenomic, viral and microRNA families. Nucleic Acids Res 2021; 49:D192-D200. [PMID: 33211869 PMCID: PMC7779021 DOI: 10.1093/nar/gkaa1047] [Citation(s) in RCA: 438] [Impact Index Per Article: 146.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Revised: 10/14/2020] [Accepted: 10/21/2020] [Indexed: 12/15/2022] Open
Abstract
Rfam is a database of RNA families where each of the 3444 families is represented by a multiple sequence alignment of known RNA sequences and a covariance model that can be used to search for additional members of the family. Recent developments have involved expert collaborations to improve the quality and coverage of Rfam data, focusing on microRNAs, viral and bacterial RNAs. We have completed the first phase of synchronising microRNA families in Rfam and miRBase, creating 356 new Rfam families and updating 40. We established a procedure for comprehensive annotation of viral RNA families starting with Flavivirus and Coronaviridae RNAs. We have also increased the coverage of bacterial and metagenome-based RNA families from the ZWD database. These developments have enabled a significant growth of the database, with the addition of 759 new families in Rfam 14. To facilitate further community contribution to Rfam, expert users are now able to build and submit new families using the newly developed Rfam Cloud family curation system. New Rfam website features include a new sequence similarity search powered by RNAcentral, as well as search and visualisation of families with pseudoknots. Rfam is freely available at https://rfam.org.
Collapse
Affiliation(s)
- Ioanna Kalvari
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Eric P Nawrocki
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Nancy Ontiveros-Palacios
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Joanna Argasinska
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Kevin Lamkiewicz
- RNA Bioinformatics and High-Throughput Analysis, Friedrich Schiller University Jena, Leutragraben 1, 07743 Jena, Germany.,European Virus Bioinformatics Center, Leutragraben 1, 07743 Jena, Germany
| | - Manja Marz
- RNA Bioinformatics and High-Throughput Analysis, Friedrich Schiller University Jena, Leutragraben 1, 07743 Jena, Germany.,European Virus Bioinformatics Center, Leutragraben 1, 07743 Jena, Germany
| | - Sam Griffiths-Jones
- Faculty of Biology, Medicine and Health, University of Manchester, Oxford Road, Manchester, M13 9PT, UK
| | - Claire Toffano-Nioche
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France
| | - Daniel Gautheret
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France
| | - Zasha Weinberg
- Bioinformatics Group, Department of Computer Science and Interdisciplinary Centre for Bioinformatics, Leipzig University, 04107 Leipzig, Germany
| | - Elena Rivas
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA 02138, USA
| | - Sean R Eddy
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA 02138, USA.,Howard Hughes Medical Institute, Harvard University, Cambridge, MA 02138, USA.,John A. Paulson School of Engineering and Applied Science, Harvard University, Cambridge, MA 02138, USA
| | - Robert D Finn
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Alex Bateman
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Anton I Petrov
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| |
Collapse
|
8
|
Lenkeit F, Eckert I, Hartig JS, Weinberg Z. Discovery and characterization of a fourth class of guanidine riboswitches. Nucleic Acids Res 2021; 48:12889-12899. [PMID: 33237283 PMCID: PMC7736828 DOI: 10.1093/nar/gkaa1102] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2020] [Revised: 10/20/2020] [Accepted: 10/28/2020] [Indexed: 12/20/2022] Open
Abstract
Riboswitches are RNAs that specifically sense a small molecule and regulate genes accordingly. The recent discovery of guanidine-binding riboswitches revealed the biological significance of this compound, and uncovered genes related to its biology. For example, certain sugE genes encode guanidine exporters and are activated by the riboswitches to reduce toxic levels of guanidine in the cell. In order to study guanidine biology and riboswitches, we applied a bioinformatics strategy for discovering additional guanidine riboswitches by searching for new candidate motifs associated with sugE genes. Based on in vitro and in vivo experiments, we determined that one of our six best candidates is a new structural class of guanidine riboswitches. The expression of a genetic reporter was induced 80-fold in response to addition of 5 mM guanidine in Staphylococcus aureus. This new class, called the guanidine-IV riboswitch, reveals additional guanidine-associated protein domains that are extremely rarely or never associated with previously established guanidine riboswitches. Among these protein domains are two transporter families that are structurally distinct from SugE, and could represent novel types of guanidine exporters. These results establish a new metabolite-binding RNA, further validate a bioinformatics method for finding riboswitches and suggest substrate specificities for as-yet uncharacterized transporter proteins.
Collapse
Affiliation(s)
- Felina Lenkeit
- Department of Chemistry, University of Konstanz, Universitätsstraße 10, 78457 Konstanz, Germany
| | - Iris Eckert
- Bioinformatics Group, Department of Computer Science and Interdisciplinary Centre for Bioinformatics, Leipzig University, Härtelstraße 16-18, 04107 Leipzig, Germany
| | - Jörg S Hartig
- Department of Chemistry, University of Konstanz, Universitätsstraße 10, 78457 Konstanz, Germany
| | - Zasha Weinberg
- Bioinformatics Group, Department of Computer Science and Interdisciplinary Centre for Bioinformatics, Leipzig University, Härtelstraße 16-18, 04107 Leipzig, Germany
| |
Collapse
|
9
|
Sweeney BA, Petrov AI, Ribas CE, Finn RD, Bateman A, Szymanski M, Karlowski WM, Seemann SE, Gorodkin J, Cannone JJ, Gutell RR, Kay S, Marygold S, dos Santos G, Frankish A, Mudge JM, Barshir R, Fishilevich S, Chan PP, Lowe TM, Seal R, Bruford E, Panni S, Porras P, Karagkouni D, Hatzigeorgiou AG, Ma L, Zhang Z, Volders PJ, Mestdagh P, Griffiths-Jones S, Fromm B, Peterson KJ, Kalvari I, Nawrocki EP, Petrov AS, Weng S, Bouchard-Bourelle P, Scott M, Lui LM, Hoksza D, Lovering RC, Kramarz B, Mani P, Ramachandran S, Weinberg Z. RNAcentral 2021: secondary structure integration, improved sequence search and new member databases. Nucleic Acids Res 2021; 49:D212-D220. [PMID: 33106848 PMCID: PMC7779037 DOI: 10.1093/nar/gkaa921] [Citation(s) in RCA: 143] [Impact Index Per Article: 47.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Accepted: 10/05/2020] [Indexed: 12/16/2022] Open
Abstract
RNAcentral is a comprehensive database of non-coding RNA (ncRNA) sequences that provides a single access point to 44 RNA resources and >18 million ncRNA sequences from a wide range of organisms and RNA types. RNAcentral now also includes secondary (2D) structure information for >13 million sequences, making RNAcentral the world's largest RNA 2D structure database. The 2D diagrams are displayed using R2DT, a new 2D structure visualization method that uses consistent, reproducible and recognizable layouts for related RNAs. The sequence similarity search has been updated with a faster interface featuring facets for filtering search results by RNA type, organism, source database or any keyword. This sequence search tool is available as a reusable web component, and has been integrated into several RNAcentral member databases, including Rfam, miRBase and snoDB. To allow for a more fine-grained assignment of RNA types and subtypes, all RNAcentral sequences have been annotated with Sequence Ontology terms. The RNAcentral database continues to grow and provide a central data resource for the RNA community. RNAcentral is freely available at https://rnacentral.org.
Collapse
|