201
|
Bersani C, Huss M, Giacomello S, Xu LD, Bianchi J, Eriksson S, Jerhammar F, Alexeyenko A, Vilborg A, Lundeberg J, Lui WO, Wiman KG. Genome-wide identification of Wig-1 mRNA targets by RIP-Seq analysis. Oncotarget 2016; 7:1895-911. [PMID: 26672765 PMCID: PMC4811505 DOI: 10.18632/oncotarget.6557] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2015] [Accepted: 11/15/2015] [Indexed: 02/06/2023] Open
Abstract
RNA-binding proteins (RBPs) play important roles in the regulation of gene expression through a variety of post-transcriptional mechanisms. The p53-induced RBP Wig-1 (Zmat3) binds RNA through its zinc finger domains and enhances stability of p53 and N-Myc mRNAs and decreases stability of FAS mRNA. To identify novel Wig-1-bound RNAs, we performed RNA-immunoprecipitation followed by high-throughput sequencing (RIP-Seq) in HCT116 and Saos-2 cells. We identified 286 Wig-1-bound mRNAs common between the two cell lines. Sequence analysis revealed that AU-rich elements (AREs) are highly enriched in the 3′UTR of these Wig-1-bound mRNAs. Network enrichment analysis showed that Wig-1 preferentially binds mRNAs involved in cell cycle regulation. Moreover, we identified a 2D Wig-1 binding motif in HIF1A mRNA. Our findings confirm that Wig-1 is an ARE-BP that regulates cell cycle-related processes and provide a novel view of how Wig-1 may bind mRNA through a putative structural motif. We also significantly extend the repertoire of Wig-1 target mRNAs. Since Wig-1 is a transcriptional target of the tumor suppressor p53, these results have implications for our understanding of p53-dependent stress responses and tumor suppression.
Collapse
Affiliation(s)
- Cinzia Bersani
- Department of Oncology-Pathology, Karolinska Institutet, Cancer Center Karolinska, Stockholm, Sweden
| | - Mikael Huss
- Science for Life Laboratory, School of Biotechnology, Royal Institute of Technology, Solna, Sweden
| | - Stefania Giacomello
- Science for Life Laboratory, School of Biotechnology, Royal Institute of Technology, Solna, Sweden
| | - Li-Di Xu
- Department of Oncology-Pathology, Karolinska Institutet, Cancer Center Karolinska, Stockholm, Sweden
| | - Julie Bianchi
- Department of Oncology-Pathology, Karolinska Institutet, Cancer Center Karolinska, Stockholm, Sweden
| | - Sofi Eriksson
- Department of Oncology-Pathology, Karolinska Institutet, Cancer Center Karolinska, Stockholm, Sweden
| | - Fredrik Jerhammar
- Department of Oncology-Pathology, Karolinska Institutet, Cancer Center Karolinska, Stockholm, Sweden
| | - Andrey Alexeyenko
- Department of Microbiology, Tumour and Cell biology, Bioinformatics Infrastructure for Life Sciences, Science for Life Laboratory, Karolinska Institutet, Stockholm, Sweden
| | - Anna Vilborg
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Joakim Lundeberg
- Science for Life Laboratory, School of Biotechnology, Royal Institute of Technology, Solna, Sweden
| | - Weng-Onn Lui
- Department of Oncology-Pathology, Karolinska Institutet, Cancer Center Karolinska, Stockholm, Sweden
| | - Klas G Wiman
- Department of Oncology-Pathology, Karolinska Institutet, Cancer Center Karolinska, Stockholm, Sweden
| |
Collapse
|
202
|
Zhang W, Yang S, Zhao H, Huang L. Using the ITS2 sequence-structure as a DNA mini-barcode: A case study in authenticating the traditional medicine“Fang Feng”. BIOCHEM SYST ECOL 2016. [DOI: 10.1016/j.bse.2016.10.007] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
203
|
Patra Bhattacharya D, Canzler S, Kehr S, Hertel J, Grosse I, Stadler PF. Phylogenetic distribution of plant snoRNA families. BMC Genomics 2016; 17:969. [PMID: 27881081 PMCID: PMC5122169 DOI: 10.1186/s12864-016-3301-2] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2016] [Accepted: 11/15/2016] [Indexed: 12/11/2022] Open
Abstract
Background Small nucleolar RNAs (snoRNAs) are one of the most ancient families amongst non-protein-coding RNAs. They are ubiquitous in Archaea and Eukarya but absent in bacteria. Their main function is to target chemical modifications of ribosomal RNAs. They fall into two classes, box C/D snoRNAs and box H/ACA snoRNAs, which are clearly distinguished by conserved sequence motifs and the type of chemical modification that they govern. Similarly to microRNAs, snoRNAs appear in distinct families of homologs that affect homologous targets. In animals, snoRNAs and their evolution have been studied in much detail. In plants, however, their evolution has attracted comparably little attention. Results In order to chart the phylogenetic distribution of individual snoRNA families in plants, we applied a sophisticated approach for identifying homologs of known plant snoRNAs across the plant kingdom. In response to the relatively fast evolution of snoRNAs, information on conserved sequence boxes, target sequences, and secondary structure is combined to identify additional snoRNAs. We identified 296 families of snoRNAs in 24 species and traced their evolution throughout the plant kingdom. Many of the plant snoRNA families comprise paralogs. We also found that targets are well-conserved for most snoRNA families. Conclusions The sequence conservation of snoRNAs is sufficient to establish homologies between phyla. The degree of this conservation tapers off, however, between land plants and algae. Plant snoRNAs are frequently organized in highly conserved spatial clusters. As a resource for further investigations we provide carefully curated and annotated alignments for each snoRNA family under investigation. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-3301-2) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Deblina Patra Bhattacharya
- Bioinformatics Group, Dept. Computer Science, and artin-Luther-Universität Halle-Wittenberg, Leipzig, D-04107, Germany.,Institut für Informatik, Halle (Saale), D-06120, Germany
| | - Sebastian Canzler
- Bioinformatics Group, Dept. Computer Science, and artin-Luther-Universität Halle-Wittenberg, Leipzig, D-04107, Germany
| | - Stephanie Kehr
- Bioinformatics Group, Dept. Computer Science, and artin-Luther-Universität Halle-Wittenberg, Leipzig, D-04107, Germany
| | - Jana Hertel
- Young Investigators Group Bioinformatics & Transcriptomics, Helmholtz Centre for Environmental Research - UFZ, Permoserstrasse 15, Leipzig, D-04318, Germany
| | - Ivo Grosse
- Institut für Informatik, Halle (Saale), D-06120, Germany.,German Centre for Integrative Biodiversity Research (iDiv), Halle-Jena-Leipzig, Leipzig, Germany
| | - Peter F Stadler
- Bioinformatics Group, Dept. Computer Science, and artin-Luther-Universität Halle-Wittenberg, Leipzig, D-04107, Germany. .,Max Planck Institute for Mathematics in the Sciences, Inselstraße 22, Leipzig, D-04103, Germany. .,Fraunhofer Institute for Cell Therapy and Immunology, Perlickstrasse 1, Leipzig, D-04103, Germany. .,Department of Theoretical Chemistry of the University of Vienna, Währingerstrasse 17, Leipzig, A-1090, Germany. .,Center for RNA in Technology and Health, Univ. Copenhagen, Grønnegårdsvej 3, Frederiksberg C, Copenhagen, Denmark. .,Santa Fe Institute, 1399 Hyde Park Road, Santa Fe, NM 87501, USA. .,German Centre for Integrative Biodiversity Research (iDiv), Halle-Jena-Leipzig, Leipzig, Germany.
| |
Collapse
|
204
|
Lisenkova AA, Grigorenko AP, Tyazhelova TV, Andreeva TV, Gusev FE, Manakhov AD, Goltsov AY, Piraino S, Miglietta MP, Rogaev EI. Complete mitochondrial genome and evolutionary analysis of Turritopsis dohrnii, the "immortal" jellyfish with a reversible life-cycle. Mol Phylogenet Evol 2016; 107:232-238. [PMID: 27845203 DOI: 10.1016/j.ympev.2016.11.007] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2016] [Revised: 10/10/2016] [Accepted: 11/10/2016] [Indexed: 12/30/2022]
Abstract
Turritopsis dohrnii (Cnidaria, Hydrozoa, Hydroidolina, Anthoathecata) is the only known metazoan that is capable of reversing its life cycle via morph rejuvenation from the adult medusa stage to the juvenile polyp stage. Here, we present a complete mitochondrial (mt) genome sequence of T. dohrnii, which harbors genes for 13 proteins, two transfer RNAs, and two ribosomal RNAs. The T. dohrnii mt genome is characterized by typical features of species in the Hydroidolina subclass, such as a high A+T content (71.5%), reversed transcriptional orientation for the large rRNA subunit gene, and paucity of CGN codons. An incomplete complementary duplicate of the cox1 gene was found at the 5' end of the T. dohrnii mt chromosome, as were variable repeat regions flanking the chromosome. We identified species-specific variations (nad5, nad6, cob, and cox1 genes) and putative selective constraints (atp8, nad1, nad2, and nad5 genes) in the mt genes of T. dohrnii, and predicted alterations in tertiary structures of respiratory chain proteins (NADH4, NADH5, and COX1 proteins) of T. dohrnii. Based on comparative analyses of available hydrozoan mt genomes, we also determined the taxonomic relationships of T. dohrnii, recovering Filifera IV as a paraphyletic taxon, and assessed intraspecific diversity of various Hydrozoa species.
Collapse
Affiliation(s)
- A A Lisenkova
- Department of Genomics and Human Genetics, Laboratory of Evolutionary Genomics, Vavilov Institute of General Genetics, Russian Academy of Sciences, Gubkina 3, Moscow 119991, Russia.
| | - A P Grigorenko
- Department of Genomics and Human Genetics, Laboratory of Evolutionary Genomics, Vavilov Institute of General Genetics, Russian Academy of Sciences, Gubkina 3, Moscow 119991, Russia; Brudnick Neuropsychiatric Research Institute, University of Massachusetts Medical School, 303 Belmont Street, Worcester, MA 01604, USA; Center for Brain Neurobiology and Neurogenetics, Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, Novosibirsk 630090, Russia
| | - T V Tyazhelova
- Department of Genomics and Human Genetics, Laboratory of Evolutionary Genomics, Vavilov Institute of General Genetics, Russian Academy of Sciences, Gubkina 3, Moscow 119991, Russia
| | - T V Andreeva
- Department of Genomics and Human Genetics, Laboratory of Evolutionary Genomics, Vavilov Institute of General Genetics, Russian Academy of Sciences, Gubkina 3, Moscow 119991, Russia; Center for Brain Neurobiology and Neurogenetics, Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, Novosibirsk 630090, Russia
| | - F E Gusev
- Department of Genomics and Human Genetics, Laboratory of Evolutionary Genomics, Vavilov Institute of General Genetics, Russian Academy of Sciences, Gubkina 3, Moscow 119991, Russia; Brudnick Neuropsychiatric Research Institute, University of Massachusetts Medical School, 303 Belmont Street, Worcester, MA 01604, USA
| | - A D Manakhov
- Department of Genomics and Human Genetics, Laboratory of Evolutionary Genomics, Vavilov Institute of General Genetics, Russian Academy of Sciences, Gubkina 3, Moscow 119991, Russia; Center of Genetics and Genetic Technologies, Lomonosov Moscow State University, GSP-1, Leninskie Gory, Moscow 119991, Russia
| | - A Yu Goltsov
- Department of Genomics and Human Genetics, Laboratory of Evolutionary Genomics, Vavilov Institute of General Genetics, Russian Academy of Sciences, Gubkina 3, Moscow 119991, Russia
| | - S Piraino
- Dipartimento di Scienze e Tecnologie Biologiche ed Ambientali, Università del Salento, I-73100 Lecce, Italy.
| | - M P Miglietta
- Texas A&M University at Galveston, Dept. of Marine Biology, OCSB, Galveston, TX 77553, United States.
| | - E I Rogaev
- Department of Genomics and Human Genetics, Laboratory of Evolutionary Genomics, Vavilov Institute of General Genetics, Russian Academy of Sciences, Gubkina 3, Moscow 119991, Russia; Brudnick Neuropsychiatric Research Institute, University of Massachusetts Medical School, 303 Belmont Street, Worcester, MA 01604, USA; Center for Brain Neurobiology and Neurogenetics, Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, Novosibirsk 630090, Russia; Center of Genetics and Genetic Technologies, Lomonosov Moscow State University, GSP-1, Leninskie Gory, Moscow 119991, Russia.
| |
Collapse
|
205
|
Weber L, Thoelken C, Volk M, Remes B, Lechner M, Klug G. The Conserved Dcw Gene Cluster of R. sphaeroides Is Preceded by an Uncommonly Extended 5' Leader Featuring the sRNA UpsM. PLoS One 2016; 11:e0165694. [PMID: 27802301 PMCID: PMC5089854 DOI: 10.1371/journal.pone.0165694] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2016] [Accepted: 10/17/2016] [Indexed: 11/18/2022] Open
Abstract
Cell division and cell wall synthesis mechanisms are similarly conserved among bacteria. Consequently some bacterial species have comparable sets of genes organized in the dcw (division andcellwall) gene cluster. Dcw genes, their regulation and their relative order within the cluster are outstandingly conserved among rod shaped and gram negative bacteria to ensure an efficient coordination of growth and division. A well studied representative is the dcw gene cluster of E. coli. The first promoter of the gene cluster (mraZ1p) gives rise to polycistronic transcripts containing a 38 nt long 5’ UTR followed by the first gene mraZ. Despite reported conservation we present evidence for a much longer 5’ UTR in the gram negative and rod shaped bacterium Rhodobacter sphaeroides and in the family of Rhodobacteraceae. This extended 268 nt long 5’ UTR comprises a Rho independent terminator, which in case of termination gives rise to a non-coding RNA (UpsM). This sRNA is conditionally cleaved by RNase E under stress conditions in an Hfq- and very likely target mRNA-dependent manner, implying its function in trans. These results raise the question for the regulatory function of this extended 5’ UTR. It might represent the rarely described case of a trans acting sRNA derived from a riboswitch with exclusive presence in the family of Rhodobacteraceae.
Collapse
Affiliation(s)
- Lennart Weber
- Institute of Microbiology and Molecular Biology, IFZ, Justus-Liebig-University Giessen, Giessen, Germany
| | - Clemens Thoelken
- Institute of Pharmaceutical Chemistry, Philipps-University Marburg, Marburg, Germany
| | - Marcel Volk
- Institute of Microbiology and Molecular Biology, IFZ, Justus-Liebig-University Giessen, Giessen, Germany
| | - Bernhard Remes
- Institute of Microbiology and Molecular Biology, IFZ, Justus-Liebig-University Giessen, Giessen, Germany
| | - Marcus Lechner
- Institute of Pharmaceutical Chemistry, Philipps-University Marburg, Marburg, Germany
| | - Gabriele Klug
- Institute of Microbiology and Molecular Biology, IFZ, Justus-Liebig-University Giessen, Giessen, Germany
- * E-mail:
| |
Collapse
|
206
|
Tripp V, Martin R, Orell A, Alkhnbashi OS, Backofen R, Randau L. Plasticity of archaeal C/D box sRNA biogenesis. Mol Microbiol 2016; 103:151-164. [PMID: 27743417 DOI: 10.1111/mmi.13549] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/04/2016] [Indexed: 01/11/2023]
Abstract
Archaeal and eukaryotic organisms contain sets of C/D box s(no)RNAs with guide sequences that determine ribose 2'-O-methylation sites of target RNAs. The composition of these C/D box sRNA sets is highly variable between organisms and results in varying RNA modification patterns which are important for ribosomal RNA folding and stability. Little is known about the genomic organization of C/D box sRNA genes in archaea. Here, we aimed to obtain first insights into the biogenesis of these archaeal C/D box sRNAs and analyzed the genetic context of more than 300 archaeal sRNA genes. We found that the majority of these genes do not possess independent promoters but are rather located at positions that allow for co-transcription with neighboring genes and their start or stop codons were frequently incorporated into the conserved boxC and D motifs. The biogenesis of plasmid-encoded C/D box sRNA variants was analyzed in vivo in Sulfolobus acidocaldarius. It was found that C/D box sRNA maturation occurs independent of their genetic context and relies solely on the presence of intact RNA kink-turn structures. The observed plasticity of C/D box sRNA biogenesis is suggested to enable their accelerated evolution and, consequently, allow for adjustments of the RNA modification landscape.
Collapse
Affiliation(s)
- Vanessa Tripp
- Max Planck Institute for Terrestrial Microbiology, Karl-von-Frisch Strasse 10, Marburg, 35043, Germany.,LOEWE Center for Synthetic Microbiology, SYNMIKRO, Karl-von-Frisch-Strasse 16, Marburg, 35043, Germany
| | - Roman Martin
- Max Planck Institute for Terrestrial Microbiology, Karl-von-Frisch Strasse 10, Marburg, 35043, Germany
| | - Alvaro Orell
- Max Planck Institute for Terrestrial Microbiology, Karl-von-Frisch Strasse 10, Marburg, 35043, Germany
| | - Omer S Alkhnbashi
- Bioinformatics group, Department of Computer Science, University of Freiburg, Georges-Köhler-Allee 106, Freiburg, 79110, Germany
| | - Rolf Backofen
- Bioinformatics group, Department of Computer Science, University of Freiburg, Georges-Köhler-Allee 106, Freiburg, 79110, Germany.,BIOSS Centre for Biological Signalling Studies, Cluster of Excellence, University of Freiburg, Germany
| | - Lennart Randau
- Max Planck Institute for Terrestrial Microbiology, Karl-von-Frisch Strasse 10, Marburg, 35043, Germany.,LOEWE Center for Synthetic Microbiology, SYNMIKRO, Karl-von-Frisch-Strasse 16, Marburg, 35043, Germany
| |
Collapse
|
207
|
Siqueira FM, de Morais GL, Higashi S, Beier LS, Breyer GM, de Sá Godinho CP, Sagot MF, Schrank IS, Zaha A, de Vasconcelos ATR. Mycoplasma non-coding RNA: identification of small RNAs and targets. BMC Genomics 2016; 17:743. [PMID: 27801290 PMCID: PMC5088518 DOI: 10.1186/s12864-016-3061-z] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Background Bacterial non-coding RNAs act by base-pairing as regulatory elements in crucial biological processes. We performed the identification of trans-encoded small RNAs (sRNA) from the genomes of Mycoplama hyopneumoniae, Mycoplasma flocculare and Mycoplasma hyorhinis, which are Mycoplasma species that have been identified in the porcine respiratory system. Results A total of 47, 15 and 11 putative sRNAs were predicted in M. hyopneumoniae, M. flocculare and M. hyorhinis, respectively. A comparative genomic analysis revealed the presence of species or lineage specific sRNA candidates. Furthermore, the expression profile of some M. hyopneumoniae sRNAs was determined by a reverse transcription amplification approach, in three different culture conditions. All tested sRNAs were transcribed in at least one condition. A detailed investigation revealed a differential expression profile for two M. hyopneumoniae sRNAs in response to oxidative and heat shock stress conditions, suggesting that their expression is influenced by environmental signals. Moreover, we analyzed sRNA-mRNA hybrids and accessed putative target genes for the novel sRNA candidates. The majority of the sRNAs showed interaction with multiple target genes, some of which could be linked to pathogenesis and cell homeostasis activity. Conclusion This study contributes to our knowledge of Mycoplasma sRNAs and their response to environmental changes. Furthermore, the mRNA target prediction provides a perspective for the characterization and comprehension of the function of the sRNA regulatory mechanisms. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-3061-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Franciele Maboni Siqueira
- Centro de Biotecnologia (CBiot), Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, Rio Grande do Sul, Brazil
| | - Guilherme Loss de Morais
- Laboratório Nacional de Computação Científica (LNCC), Laboratório de Bioinformática (LABINFO), Petrópolis, Rio de Janeiro, Brazil
| | - Susan Higashi
- Inria Grenoble Rhône-Alpes, 38330, Montbonnot Saint-Martin, France.,Université Lyon 1, Villeurbanne, France.,CNRS, UMR5558, Laboratoire de Biométrie et Biologie Évolutive, F-69622, Villeurbanne, France
| | - Laura Scherer Beier
- Centro de Biotecnologia (CBiot), Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, Rio Grande do Sul, Brazil
| | - Gabriela Merker Breyer
- Centro de Biotecnologia (CBiot), Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, Rio Grande do Sul, Brazil
| | - Caio Padoan de Sá Godinho
- Laboratório Nacional de Computação Científica (LNCC), Laboratório de Bioinformática (LABINFO), Petrópolis, Rio de Janeiro, Brazil
| | - Marie-France Sagot
- Inria Grenoble Rhône-Alpes, 38330, Montbonnot Saint-Martin, France.,Université Lyon 1, Villeurbanne, France.,CNRS, UMR5558, Laboratoire de Biométrie et Biologie Évolutive, F-69622, Villeurbanne, France
| | - Irene Silveira Schrank
- Centro de Biotecnologia (CBiot), Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, Rio Grande do Sul, Brazil
| | - Arnaldo Zaha
- Centro de Biotecnologia (CBiot), Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, Rio Grande do Sul, Brazil.
| | | |
Collapse
|
208
|
Alexandrova J, Paulus C, Rudinger-Thirion J, Jossinet F, Frugier M. Elaborate uORF/IRES features control expression and localization of human glycyl-tRNA synthetase. RNA Biol 2016; 12:1301-13. [PMID: 26327585 DOI: 10.1080/15476286.2015.1086866] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022] Open
Abstract
The canonical activity of glycyl-tRNA synthetase (GARS) is to charge glycine onto its cognate tRNAs. However, outside translation, GARS also participates in many other functions. A single gene encodes both the cytosolic and mitochondrial forms of GARS but 2 mRNA isoforms were identified. Using immunolocalization assays, in vitro translation assays and bicistronic constructs we provide experimental evidence that one of these mRNAs tightly controls expression and localization of human GARS. An intricate regulatory domain was found in its 5'-UTR which displays a functional Internal Ribosome Entry Site and an upstream Open Reading Frame. Together, these elements hinder the synthesis of the mitochondrial GARS and target the translation of the cytosolic enzyme to ER-bound ribosomes. This finding reveals a complex picture of GARS translation and localization in mammals. In this context, we discuss how human GARS expression could influence its moonlighting activities and its involvement in diseases.
Collapse
Affiliation(s)
- Jana Alexandrova
- a Architecture et Réactivité de l'ARN, Université de Strasbourg, CNRS ; IBMC ; 15 rue René Descartes; Strasbourg Cedex , France
| | - Caroline Paulus
- a Architecture et Réactivité de l'ARN, Université de Strasbourg, CNRS ; IBMC ; 15 rue René Descartes; Strasbourg Cedex , France
| | - Joëlle Rudinger-Thirion
- a Architecture et Réactivité de l'ARN, Université de Strasbourg, CNRS ; IBMC ; 15 rue René Descartes; Strasbourg Cedex , France
| | - Fabrice Jossinet
- a Architecture et Réactivité de l'ARN, Université de Strasbourg, CNRS ; IBMC ; 15 rue René Descartes; Strasbourg Cedex , France
| | - Magali Frugier
- a Architecture et Réactivité de l'ARN, Université de Strasbourg, CNRS ; IBMC ; 15 rue René Descartes; Strasbourg Cedex , France
| |
Collapse
|
209
|
Abstract
Coronaviruses have exceptionally large RNA genomes of approximately 30 kilobases. Genome replication and transcription is mediated by a multisubunit protein complex comprised of more than a dozen virus-encoded proteins. The protein complex is thought to bind specific cis-acting RNA elements primarily located in the 5'- and 3'-terminal genome regions and upstream of the open reading frames located in the 3'-proximal one-third of the genome. Here, we review our current understanding of coronavirus cis-acting RNA elements, focusing on elements required for genome replication and packaging. Recent bioinformatic, biochemical, and genetic studies suggest a previously unknown level of conservation of cis-acting RNA structures among different coronavirus genera and, in some cases, even beyond genus boundaries. Also, there is increasing evidence to suggest that individual cis-acting elements may be part of higher-order RNA structures involving long-range and dynamic RNA-RNA interactions between RNA structural elements separated by thousands of nucleotides in the viral genome. We discuss the structural and functional features of these cis-acting RNA elements and their specific functions in coronavirus RNA synthesis.
Collapse
Affiliation(s)
- R Madhugiri
- Institute of Medical Virology, Justus Liebig University Giessen, Giessen, Germany
| | - M Fricke
- Faculty of Mathematics and Computer Science, Friedrich Schiller University Jena, Jena, Germany
| | - M Marz
- Faculty of Mathematics and Computer Science, Friedrich Schiller University Jena, Jena, Germany; FLI Leibniz Institute for Age Research, Jena, Germany
| | - J Ziebuhr
- Institute of Medical Virology, Justus Liebig University Giessen, Giessen, Germany.
| |
Collapse
|
210
|
Hansen TA, Mollerup S, Nguyen NP, White NE, Coghlan M, Alquezar-Planas DE, Joshi T, Jensen RH, Fridholm H, Kjartansdóttir KR, Mourier T, Warnow T, Belsham GJ, Bunce M, Willerslev E, Nielsen LP, Vinner L, Hansen AJ. High diversity of picornaviruses in rats from different continents revealed by deep sequencing. Emerg Microbes Infect 2016; 5:e90. [PMID: 27530749 PMCID: PMC5034103 DOI: 10.1038/emi.2016.90] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2016] [Revised: 05/30/2016] [Accepted: 06/13/2016] [Indexed: 12/02/2022]
Abstract
Outbreaks of zoonotic diseases in humans and livestock are not uncommon, and an important component in containment of such emerging viral diseases is rapid and reliable diagnostics. Such methods are often PCR-based and hence require the availability of sequence data from the pathogen. Rattus norvegicus (R. norvegicus) is a known reservoir for important zoonotic pathogens. Transmission may be direct via contact with the animal, for example, through exposure to its faecal matter, or indirectly mediated by arthropod vectors. Here we investigated the viral content in rat faecal matter (n=29) collected from two continents by analyzing 2.2 billion next-generation sequencing reads derived from both DNA and RNA. Among other virus families, we found sequences from members of the Picornaviridae to be abundant in the microbiome of all the samples. Here we describe the diversity of the picornavirus-like contigs including near-full-length genomes closely related to the Boone cardiovirus and Theiler's encephalomyelitis virus. From this study, we conclude that picornaviruses within R. norvegicus are more diverse than previously recognized. The virome of R. norvegicus should be investigated further to assess the full potential for zoonotic virus transmission.
Collapse
Affiliation(s)
- Thomas Arn Hansen
- Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, DK-1350 Copenhagen, Denmark
| | - Sarah Mollerup
- Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, DK-1350 Copenhagen, Denmark
| | - Nam-Phuong Nguyen
- Carl R. Woese Institute for Genomic Biology, The University of Illinois at Urbana-Champaign, Urbana, IL 61801-2302, USA
| | - Nicole E White
- Trace and Environmental DNA Lab and Australian Wildlife Forensic Services, Curtin University, Perth, Western Australia 6102, Australia
| | - Megan Coghlan
- Trace and Environmental DNA Lab and Australian Wildlife Forensic Services, Curtin University, Perth, Western Australia 6102, Australia
| | - David E Alquezar-Planas
- Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, DK-1350 Copenhagen, Denmark
| | - Tejal Joshi
- Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, Kemitorvet, DK-2800 Kongens Lyngby, Denmark
| | - Randi Holm Jensen
- Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, DK-1350 Copenhagen, Denmark
| | - Helena Fridholm
- Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, DK-1350 Copenhagen, Denmark.,Virus Research and Development, Statens Serum Institut, DK-2300 Copenhagen, Denmark
| | - Kristín Rós Kjartansdóttir
- Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, DK-1350 Copenhagen, Denmark
| | - Tobias Mourier
- Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, DK-1350 Copenhagen, Denmark
| | - Tandy Warnow
- Departments of Bioengineering and Computer Science, The University of Illinois at Urbana-Champaign, Urbana, IL 61801-2302, USA
| | - Graham J Belsham
- National Veterinary Institute, Technical University of Denmark, Lindholm, DK-4771 Kalvehave, Denmark
| | - Michael Bunce
- Trace and Environmental DNA Lab and Australian Wildlife Forensic Services, Curtin University, Perth, Western Australia 6102, Australia
| | - Eske Willerslev
- Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, DK-1350 Copenhagen, Denmark
| | - Lars Peter Nielsen
- Department of Autoimmunology and Biomarkers, Statens Serum Institut, DK-2300 Copenhagen, Denmark
| | - Lasse Vinner
- Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, DK-1350 Copenhagen, Denmark
| | - Anders Johannes Hansen
- Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, DK-1350 Copenhagen, Denmark
| |
Collapse
|
211
|
Sen A, Cox RT. Fly Models of Human Diseases: Drosophila as a Model for Understanding Human Mitochondrial Mutations and Disease. Curr Top Dev Biol 2016; 121:1-27. [PMID: 28057297 DOI: 10.1016/bs.ctdb.2016.07.001] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Mitochondrial diseases are a prevalent, heterogeneous class of diseases caused by defects in oxidative phosphorylation, whose severity depends upon particular genetic mutations. These diseases can be difficult to diagnose, and current therapeutics have limited efficacy, primarily treating only symptoms. Because mitochondria play a pivotal role in numerous cellular functions, especially ATP production, their diminished activity has dramatic physiological consequences. While this in and of itself makes treating mitochondrial disease complex, these organelles contain their own DNA, mtDNA, whose products are required for ATP production, in addition to the hundreds of nucleus-encoded proteins. Drosophila offers a tractable whole-animal model to understand the mechanisms underlying loss of mitochondrial function, the subsequent cellular and tissue damage that results, and how these organelles are inherited. Human and Drosophila mtDNAs encode the same set of products, and the homologous nucleus-encoded genes required for mitochondrial function are conserved. In addition, Drosophila contain sufficiently complex organ systems to effectively recapitulate many basic symptoms of mitochondrial diseases, yet are relatively easy and fast to genetically manipulate. There are several Drosophila models for specific mitochondrial diseases, which have been recently reviewed (Foriel, Willems, Smeitink, Schenck, & Beyrath, 2015). In this review, we highlight the conservation between human and Drosophila mtDNA, the present and future techniques for creating mtDNA mutations for further study, and how Drosophila has contributed to our current understanding of mitochondrial inheritance.
Collapse
Affiliation(s)
- A Sen
- Uniformed Services University, Bethesda, MD, United States
| | - R T Cox
- Uniformed Services University, Bethesda, MD, United States.
| |
Collapse
|
212
|
Biswas AK, Gao JX. PR2S2Clust: Patched RNA-seq read segments' structure-oriented clustering. J Bioinform Comput Biol 2016; 14:1650027. [PMID: 27455882 DOI: 10.1142/s021972001650027x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
RNA-seq, the next generation sequencing platform, enables researchers to explore deep into the transcriptome of organisms, such as identifying functional non-coding RNAs (ncRNAs), and quantify their expressions on tissues. The functions of ncRNAs are mostly related to their secondary structures. Thus by exploring the clustering in terms of structural profiles of the corresponding read-segments would be essential and this fuels in our motivation behind this research. In this manuscript we proposed PR2S2Clust, Patched RNA-seq Read Segments' Structure-oriented Clustering, which is an analysis platform to extract features to prepare the secondary structure profiles of the RNA-seq read segments. It provides a strategy to employ the profiles to annotate the segments into ncRNA classes using several clustering strategies. The system considers seven pairwise structural distance metrics by considering short-read mappings onto each structure, which we term as the "patched structure" while clustering the segments. In this regard, we show applications of both classical and ensemble clusterings of the partitional and hierarchical variations. Extensive real-world experiments over three publicly available RNA-seq datasets and a comparative analysis over four competitive systems confirm the effectiveness and superiority of the proposed system. The source codes and dataset of PR2S2Clust are available at the http://biomecis.uta.edu/~ashis/res/PR2S2Clust-suppl/ .
Collapse
Affiliation(s)
- Ashis Kumer Biswas
- 1 Department of Computer Science and Engineering, The University of Texas at Arlington, Texas 76019, USA
| | - Jean X Gao
- 1 Department of Computer Science and Engineering, The University of Texas at Arlington, Texas 76019, USA
| |
Collapse
|
213
|
Nitsche A, Stadler PF. Evolutionary clues in lncRNAs. WILEY INTERDISCIPLINARY REVIEWS-RNA 2016; 8. [PMID: 27436689 DOI: 10.1002/wrna.1376] [Citation(s) in RCA: 43] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/24/2016] [Revised: 06/06/2016] [Accepted: 06/09/2016] [Indexed: 12/13/2022]
Abstract
The diversity of long non-coding RNAs (lncRNAs) in the human transcriptome is in stark contrast to the sparse exploration of their functions concomitant with their conservation and evolution. The pervasive transcription of the largely non-coding human genome makes the evolutionary age and conservation patterns of lncRNAs to a topic of interest. Yet it is a fairly unexplored field and not that easy to determine as for protein-coding genes. Although there are a few experimentally studied cases, which are conserved at the sequence level, most lncRNAs exhibit weak or untraceable primary sequence conservation. Recent studies shed light on the interspecies conservation of secondary structures among lncRNA homologs by using diverse computational methods. This highlights the importance of structure on functionality of lncRNAs as opposed to the poor impact of primary sequence changes. Further clues in the evolution of lncRNAs are given by selective constraints on non-coding gene structures (e.g., promoters or splice sites) as well as the conservation of prevalent spatio-temporal expression patterns. However, a rapid evolutionary turnover is observable throughout the heterogeneous group of lncRNAs. This still gives rise to questions about its functional meaning. WIREs RNA 2017, 8:e1376. doi: 10.1002/wrna.1376 For further resources related to this article, please visit the WIREs website.
Collapse
Affiliation(s)
- Anne Nitsche
- Bioinformatics Group, Department of Computer Science, University Leipzig, Leipzig, Germany.,Institute de Biologie Moléculaire et Cellulaire, Université de Strasbourg, Cedex, France
| | - Peter F Stadler
- Bioinformatics Group, Department of Computer Science, University Leipzig, Leipzig, Germany.,Interdisciplinary Center for Bioinformatics, University Leipzig, Leipzig, Germany.,Max Planck Institute for Mathematics in the Sciences, Leipzig, Germany.,Department of Diagnostics, Fraunhofer Institute for Cell Therapy and Immunology - IZI, Leipzig, Germany.,Center for Non-Coding RNA in Technology and Health, University of Copenhagen, Frederiksberg, Denmark.,Department of Theoretical Chemistry, University of Vienna, Wien, Austria.,Santa Fe Institute, Santa Fe, NM, USA
| |
Collapse
|
214
|
Kochan J, Wawro M, Kasza A. IF-combined smRNA FISH reveals interaction of MCPIP1 protein with IER3 mRNA. Biol Open 2016; 5:889-98. [PMID: 27256408 PMCID: PMC4958271 DOI: 10.1242/bio.018010] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
MCPIP1 and IER3 are recently described proteins essential for maintenance of immune homeostasis. IER3 is involved in the regulation of apoptosis and differentiation and has been shown lately to protect activated T cells and macrophages from apoptosis. MCPIP1 is an RNase critical for controlling inflammation-related mRNAs. MCPIP1 interacts with and degrades a set of stem-loop-containing mRNAs (including IL-6). Our results demonstrate the involvement of MCPIP1 in the regulation of IER3 mRNA levels. A dual luciferase assay revealed that over-expression of MCPIP1 resulted in a decrease of luciferase activity in the samples co-transfected with constructs containing luciferase CDS attached to IER3 3′UTR. We identified a stem-loop structure similar to that described to be important for destabilization of the IL-6 mRNA by MCPIP1. Examination of IER3 3′UTR sequence, structure and evolutionary conservation revealed that the identified stem-loop is buried within a bigger element. Deletion of this fragment abolished the regulation of IER3 3′UTR-containing transcript by MCPIP1. Finally, using immunofluorescence-combined single-molecule RNA FISH we have shown that the MCPIP1 protein co-localizes with IER3 mRNA. By this method we also proved that the presence of the wild-type NYN/PIN-like domain of MCPIP1 correlated with the decreased level of IER3 mRNA. RNA immunoprecipitation further confirmed the interaction of MCPIP1 with IER3 transcripts in vivo. Summary: We identify IER3 mRNA as a newly discovered MCPIP1 target using recently described IF-based procedures, and also identify a conserved element involved in MCPIP1-dependent IER3 transcript destabilization.
Collapse
Affiliation(s)
- Jakub Kochan
- Department of Cell Biochemistry, Faculty of Biochemistry, Biophysics and Biotechnology, Jagiellonian University, Cracow 30-387, Poland
| | - Mateusz Wawro
- Department of Cell Biochemistry, Faculty of Biochemistry, Biophysics and Biotechnology, Jagiellonian University, Cracow 30-387, Poland
| | - Aneta Kasza
- Department of Cell Biochemistry, Faculty of Biochemistry, Biophysics and Biotechnology, Jagiellonian University, Cracow 30-387, Poland
| |
Collapse
|
215
|
Hansen TB, Venø MT, Jensen TI, Schaefer A, Damgaard CK, Kjems J. Argonaute-associated short introns are a novel class of gene regulators. Nat Commun 2016; 7:11538. [PMID: 27173734 PMCID: PMC4869172 DOI: 10.1038/ncomms11538] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2015] [Accepted: 04/06/2016] [Indexed: 12/30/2022] Open
Abstract
MicroRNAs (miRNAs) are short (∼22 nucleotides) regulators of gene expression acting by direct base pairing to 3'-UTR target sites in messenger RNAs. Mature miRNAs are produced by two sequential endonucleolytic cleavages facilitated by Drosha in the nucleus and Dicer in the cytoplasm. A subclass of miRNAs, termed mirtrons, derives from short introns and enters the miRNA biogenesis pathway as Dicer substrates. Here we uncover a third biogenesis strategy that, similar to mirtron biogenesis, initiates from short introns but bypasses Dicer cleavage. These short introns (80-100 nucleotides), coined agotrons, are associated with and stabilized by Argonaute (Ago) proteins in the cytoplasm. Some agotrons are completely conserved in mammalian species, suggesting that they are functionally important. Furthermore, we demonstrate that the agotrons are capable of repressing mRNAs with seed-matching target sequences in the 3'-UTR. These data provide evidence for a novel RNA regulator of gene expression, which bypasses the canonical miRNA biogenesis machinery.
Collapse
Affiliation(s)
- Thomas B Hansen
- Department of Molecular Biology and Genetics (MBG), Interdisciplinary Nanoscience Center (iNANO), Aarhus University, C.F. Moellers Alle 3, Build 1130, Aarhus 8000, Denmark
| | - Morten T Venø
- Department of Molecular Biology and Genetics (MBG), Interdisciplinary Nanoscience Center (iNANO), Aarhus University, C.F. Moellers Alle 3, Build 1130, Aarhus 8000, Denmark
| | - Trine I Jensen
- Department of Molecular Biology and Genetics (MBG), Interdisciplinary Nanoscience Center (iNANO), Aarhus University, C.F. Moellers Alle 3, Build 1130, Aarhus 8000, Denmark
| | - Anne Schaefer
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, 1425 Madison Avenue, New York, New York 10029, USA
| | - Christian K Damgaard
- Department of Molecular Biology and Genetics (MBG), Interdisciplinary Nanoscience Center (iNANO), Aarhus University, C.F. Moellers Alle 3, Build 1130, Aarhus 8000, Denmark
| | - Jørgen Kjems
- Department of Molecular Biology and Genetics (MBG), Interdisciplinary Nanoscience Center (iNANO), Aarhus University, C.F. Moellers Alle 3, Build 1130, Aarhus 8000, Denmark
| |
Collapse
|
216
|
Lorenz R, Wolfinger MT, Tanzer A, Hofacker IL. Predicting RNA secondary structures from sequence and probing data. Methods 2016; 103:86-98. [PMID: 27064083 DOI: 10.1016/j.ymeth.2016.04.004] [Citation(s) in RCA: 66] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2015] [Revised: 03/29/2016] [Accepted: 04/04/2016] [Indexed: 01/08/2023] Open
Abstract
RNA secondary structures have proven essential for understanding the regulatory functions performed by RNA such as microRNAs, bacterial small RNAs, or riboswitches. This success is in part due to the availability of efficient computational methods for predicting RNA secondary structures. Recent advances focus on dealing with the inherent uncertainty of prediction by considering the ensemble of possible structures rather than the single most stable one. Moreover, the advent of high-throughput structural probing has spurred the development of computational methods that incorporate such experimental data as auxiliary information.
Collapse
Affiliation(s)
- Ronny Lorenz
- University of Vienna, Faculty of Chemistry, Department of Theoretical Chemistry, Währingerstrasse 17, 1090 Vienna, Austria.
| | - Michael T Wolfinger
- University of Vienna, Faculty of Chemistry, Department of Theoretical Chemistry, Währingerstrasse 17, 1090 Vienna, Austria; Medical University of Vienna, Center for Anatomy and Cell Biology, Währingerstraße 13, 1090 Vienna, Austria.
| | - Andrea Tanzer
- University of Vienna, Faculty of Chemistry, Department of Theoretical Chemistry, Währingerstrasse 17, 1090 Vienna, Austria.
| | - Ivo L Hofacker
- University of Vienna, Faculty of Chemistry, Department of Theoretical Chemistry, Währingerstrasse 17, 1090 Vienna, Austria; University of Vienna, Faculty of Computer Science, Research Group Bioinformatics and Computational Biology, Währingerstr. 29, 1090 Vienna, Austria.
| |
Collapse
|
217
|
Moreira S, Valach M, Aoulad-Aissa M, Otto C, Burger G. Novel modes of RNA editing in mitochondria. Nucleic Acids Res 2016; 44:4907-19. [PMID: 27001515 PMCID: PMC4889940 DOI: 10.1093/nar/gkw188] [Citation(s) in RCA: 43] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2016] [Accepted: 03/10/2016] [Indexed: 11/20/2022] Open
Abstract
Gene structure and expression in diplonemid mitochondria are unparalleled. Genes are fragmented in pieces (modules) that are separately transcribed, followed by the joining of module transcripts to contiguous RNAs. Some instances of unique uridine insertion RNA editing at module boundaries were noted, but the extent and potential occurrence of other editing types remained unknown. Comparative analysis of deep transcriptome and genome data from Diplonema papillatum mitochondria reveals ∼220 post-transcriptional insertions of uridines, but no insertions of other nucleotides nor deletions. In addition, we detect in total 114 substitutions of cytosine by uridine and adenosine by inosine, amassed into unusually compact clusters. Inosines in transcripts were confirmed experimentally. This is the first report of adenosine-to-inosine editing of mRNAs and ribosomal RNAs in mitochondria. In mRNAs, editing causes mostly amino-acid additions and non-synonymous substitutions; in ribosomal RNAs, it permits formation of canonical secondary structures. Two extensively edited transcripts were compared across four diplonemids. The pattern of uridine-insertion editing is strictly conserved, whereas substitution editing has diverged dramatically, but still rendering diplonemid proteins more similar to other eukaryotic orthologs. We posit that RNA editing not only compensates but also sustains, or even accelerates, ultra-rapid evolution of genome structure and sequence in diplonemid mitochondria.
Collapse
Affiliation(s)
- Sandrine Moreira
- Department of Biochemistry and Robert-Cedergren Centre for Bioinformatics and Genomics; Université de Montréal, Montreal, H3C 3J7, Canada
| | - Matus Valach
- Department of Biochemistry and Robert-Cedergren Centre for Bioinformatics and Genomics; Université de Montréal, Montreal, H3C 3J7, Canada
| | - Mohamed Aoulad-Aissa
- Department of Biochemistry and Robert-Cedergren Centre for Bioinformatics and Genomics; Université de Montréal, Montreal, H3C 3J7, Canada
| | - Christian Otto
- Bioinformatics Group, Department of Computer Science, University of Leipzig, Leipzig, D-04109, Germany
| | - Gertraud Burger
- Department of Biochemistry and Robert-Cedergren Centre for Bioinformatics and Genomics; Université de Montréal, Montreal, H3C 3J7, Canada
| |
Collapse
|
218
|
Pereira TJ, Baldwin JG. Contrasting evolutionary patterns of 28S and ITS rRNA genes reveal high intragenomic variation in Cephalenchus (Nematoda): Implications for species delimitation. Mol Phylogenet Evol 2016; 98:244-60. [PMID: 26926945 DOI: 10.1016/j.ympev.2016.02.016] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2016] [Revised: 02/20/2016] [Accepted: 02/20/2016] [Indexed: 01/05/2023]
Abstract
Concerted evolution is often assumed to be the evolutionary force driving multi-family genes, including those from ribosomal DNA (rDNA) repeat, to complete homogenization within a species, although cases of non-concerted evolution have been also documented. In this study, sequence variation of 28S and ITS ribosomal RNA (rRNA) genes in the genus Cephalenchus is assessed at three different levels, intragenomic, intraspecific, and interspecific. The findings suggest that not all Cephalenchus species undergo concerted evolution. High levels of intraspecific polymorphism, mostly due to intragenomic variation, are found in Cephalenchus sp1 (BRA-01). Secondary structure analyses of both rRNA genes and across different species show a similar substitution pattern, including mostly compensatory (CBC) and semi-compensatory (SBC) base changes, thus suggesting the functionality of these rRNA copies despite the variation found in some species. This view is also supported by low sequence variation in the 5.8S gene in relation to the flanking ITS-1 and ITS-2 as well as by the existence of conserved motifs in the former gene. It is suggested that potential cross-fertilization in some Cephalenchus species, based on inspection of female reproductive system, might contribute to both intragenomic and intraspecific polymorphism of their rRNA genes. These results reinforce the potential implications of intragenomic and intraspecific genetic diversity on species delimitation, especially in biodiversity studies based solely on metagenetic approaches. Knowledge of sequence variation will be crucial for accurate species diversity estimation using molecular methods.
Collapse
Affiliation(s)
- Tiago José Pereira
- Department of Nematology, University of California, Riverside, 900 University Avenue, Riverside, CA 92521, USA.
| | - James Gordon Baldwin
- Department of Nematology, University of California, Riverside, 900 University Avenue, Riverside, CA 92521, USA.
| |
Collapse
|
219
|
Papakostas S, Michaloudi E, Proios K, Brehm M, Verhage L, Rota J, Peña C, Stamou G, Pritchard VL, Fontaneto D, Declerck SAJ. Integrative Taxonomy Recognizes Evolutionary Units Despite Widespread Mitonuclear Discordance: Evidence from a Rotifer Cryptic Species Complex. Syst Biol 2016; 65:508-24. [DOI: 10.1093/sysbio/syw016] [Citation(s) in RCA: 77] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2015] [Accepted: 02/09/2016] [Indexed: 01/23/2023] Open
|
220
|
Pain A, Ott A, Amine H, Rochat T, Bouloc P, Gautheret D. An assessment of bacterial small RNA target prediction programs. RNA Biol 2016; 12:509-13. [PMID: 25760244 PMCID: PMC4615726 DOI: 10.1080/15476286.2015.1020269] [Citation(s) in RCA: 49] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
Most bacterial regulatory RNAs exert their function through base-pairing with target RNAs. Computational prediction of targets is a busy research field that offers biologists a variety of web sites and software. However, it is difficult for a non-expert to evaluate how reliable those programs are. Here, we provide a simple benchmark for bacterial sRNA target prediction based on trusted E. coli sRNA/target pairs. We use this benchmark to assess the most recent RNA target predictors as well as earlier programs for RNA-RNA hybrid prediction. Moreover, we consider how the definition of mRNA boundaries can impact overall predictions. Recent algorithms that exploit both conservation of targets and accessibility information offer improved accuracy over previous software. However, even with the best predictors, the number of true biological targets with low scores and non-targets with high scores remains puzzling.
Collapse
Affiliation(s)
- Adrien Pain
- a Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS ; Université Paris-Sud ; Orsay Cedex , France
| | | | | | | | | | | |
Collapse
|
221
|
Brenes-Álvarez M, Olmedo-Verd E, Vioque A, Muro-Pastor AM. Identification of Conserved and Potentially Regulatory Small RNAs in Heterocystous Cyanobacteria. Front Microbiol 2016; 7:48. [PMID: 26870012 PMCID: PMC4734099 DOI: 10.3389/fmicb.2016.00048] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2015] [Accepted: 01/12/2016] [Indexed: 12/13/2022] Open
Abstract
Small RNAs (sRNAs) are a growing class of non-protein-coding transcripts that participate in the regulation of virtually every aspect of bacterial physiology. Heterocystous cyanobacteria are a group of photosynthetic organisms that exhibit multicellular behavior and developmental alternatives involving specific transcriptomes exclusive of a given physiological condition or even a cell type. In the context of our ongoing effort to understand developmental decisions in these organisms we have undertaken an approach to the global identification of sRNAs. Using differential RNA-Seq we have previously identified transcriptional start sites for the model heterocystous cyanobacterium Nostoc sp. PCC 7120. Here we combine this dataset with a prediction of Rho-independent transcriptional terminators and an analysis of phylogenetic conservation of potential sRNAs among 89 available cyanobacterial genomes. In contrast to predictive genome-wide approaches, the use of an experimental dataset comprising all active transcriptional start sites (differential RNA-Seq) facilitates the identification of bona fide sRNAs. The output of our approach is a dataset of predicted potential sRNAs in Nostoc sp. PCC 7120, with different degrees of phylogenetic conservation across the 89 cyanobacterial genomes analyzed. Previously described sRNAs appear among the predicted sRNAs, demonstrating the performance of the algorithm. In addition, new predicted sRNAs are now identified that can be involved in regulation of different aspects of cyanobacterial physiology, including adaptation to nitrogen stress, the condition that triggers differentiation of heterocysts (specialized nitrogen-fixing cells). Transcription of several predicted sRNAs that appear exclusively in the genomes of heterocystous cyanobacteria is experimentally verified by Northern blot. Cell-specific transcription of one of these sRNAs, NsiR8 (nitrogen stress-induced RNA 8), in developing heterocysts is also demonstrated.
Collapse
Affiliation(s)
- Manuel Brenes-Álvarez
- Instituto de Bioquímica Vegetal y Fotosíntesis, Consejo Superior de Investigaciones Científicas and Universidad de Sevilla Sevilla, Spain
| | - Elvira Olmedo-Verd
- Instituto de Bioquímica Vegetal y Fotosíntesis, Consejo Superior de Investigaciones Científicas and Universidad de Sevilla Sevilla, Spain
| | - Agustín Vioque
- Instituto de Bioquímica Vegetal y Fotosíntesis, Consejo Superior de Investigaciones Científicas and Universidad de Sevilla Sevilla, Spain
| | - Alicia M Muro-Pastor
- Instituto de Bioquímica Vegetal y Fotosíntesis, Consejo Superior de Investigaciones Científicas and Universidad de Sevilla Sevilla, Spain
| |
Collapse
|
222
|
Matelska D, Kurkowska M, Purta E, Bujnicki JM, Dunin-Horkawicz S. Loss of Conserved Noncoding RNAs in Genomes of Bacterial Endosymbionts. Genome Biol Evol 2016; 8:426-38. [PMID: 26782934 PMCID: PMC4779614 DOI: 10.1093/gbe/evw007] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
The genomes of intracellular symbiotic or pathogenic bacteria, such as of Buchnera, Mycoplasma, and Rickettsia, are typically smaller compared with their free-living counterparts. Here we showed that noncoding RNA (ncRNA) families, which are conserved in free-living bacteria, frequently could not be detected by computational methods in the small genomes. Statistical tests demonstrated that their absence is not an artifact of low GC content or small deletions in these small genomes, and thus it was indicative of an independent loss of ncRNAs in different endosymbiotic lineages. By analyzing the synteny (conservation of gene order) between the reduced and nonreduced genomes, we revealed instances of protein-coding genes that were preserved in the reduced genomes but lost cis-regulatory elements. We found that the loss of cis-regulatory ncRNA sequences, which regulate the expression of cognate protein-coding genes, is characterized by the reduction of secondary structure formation propensity, GC content, and length of the corresponding genomic regions.
Collapse
Affiliation(s)
- Dorota Matelska
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Warsaw, Poland
| | - Malgorzata Kurkowska
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Warsaw, Poland
| | - Elzbieta Purta
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Warsaw, Poland
| | - Janusz M Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Warsaw, Poland Laboratory of Structural Bioinformatics, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University, Poznan, Poland
| | - Stanislaw Dunin-Horkawicz
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology, Warsaw, Poland
| |
Collapse
|
223
|
Bioinformatics tools for lncRNA research. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2016; 1859:23-30. [DOI: 10.1016/j.bbagrm.2015.07.014] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/30/2015] [Revised: 07/07/2015] [Accepted: 07/14/2015] [Indexed: 12/28/2022]
|
224
|
Chatzou M, Magis C, Chang JM, Kemena C, Bussotti G, Erb I, Notredame C. Multiple sequence alignment modeling: methods and applications. Brief Bioinform 2015; 17:1009-1023. [PMID: 26615024 DOI: 10.1093/bib/bbv099] [Citation(s) in RCA: 84] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2015] [Revised: 10/16/2015] [Indexed: 12/20/2022] Open
Abstract
This review provides an overview on the development of Multiple sequence alignment (MSA) methods and their main applications. It is focused on progress made over the past decade. The three first sections review recent algorithmic developments for protein, RNA/DNA and genomic alignments. The fourth section deals with benchmarks and explores the relationship between empirical and simulated data, along with the impact on method developments. The last part of the review gives an overview on available MSA local reliability estimators and their dependence on various algorithmic properties of available methods.
Collapse
|
225
|
Survey of Natural Language Processing Techniques in Bioinformatics. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2015; 2015:674296. [PMID: 26525745 PMCID: PMC4615216 DOI: 10.1155/2015/674296] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/10/2015] [Revised: 06/12/2015] [Accepted: 06/21/2015] [Indexed: 01/02/2023]
Abstract
Informatics methods, such as text mining and natural language processing, are always involved in bioinformatics research. In this study, we discuss text mining and natural language processing methods in bioinformatics from two perspectives. First, we aim to search for knowledge on biology, retrieve references using text mining methods, and reconstruct databases. For example, protein-protein interactions and gene-disease relationship can be mined from PubMed. Then, we analyze the applications of text mining and natural language processing techniques in bioinformatics, including predicting protein structure and function, detecting noncoding RNA. Finally, numerous methods and applications, as well as their contributions to bioinformatics, are discussed for future use by text mining and natural language processing researchers.
Collapse
|
226
|
Hecker N, Christensen-Dalsgaard M, Seemann SE, Havgaard JH, Stadler PF, Hofacker IL, Nielsen H, Gorodkin J. Optimizing RNA structures by sequence extensions using RNAcop. Nucleic Acids Res 2015; 43:8135-45. [PMID: 26283181 PMCID: PMC4787817 DOI: 10.1093/nar/gkv813] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2015] [Revised: 07/28/2015] [Accepted: 07/30/2015] [Indexed: 12/26/2022] Open
Abstract
A key aspect of RNA secondary structure prediction is the identification of novel functional elements. This is a challenging task because these elements typically are embedded in longer transcripts where the borders between the element and flanking regions have to be defined. The flanking sequences impact the folding of the functional elements both at the level of computational analyses and when the element is extracted as a transcript for experimental analysis. Here, we analyze how different flanking region lengths impact folding into a constrained structure by computing probabilities of folding for different sizes of flanking regions. Our method, RNAcop (RNA context optimization by probability), is tested on known and de novo predicted structures. In vitro experiments support the computational analysis and suggest that for a number of structures, choosing proper lengths of flanking regions is critical. RNAcop is available as web server and stand-alone software via http://rth.dk/resources/rnacop.
Collapse
Affiliation(s)
- Nikolai Hecker
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark Department of Veterinary Clinical and Animal Science, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark
| | - Mikkel Christensen-Dalsgaard
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark Department of Cellular and Molecular Medicine, Panum Institute, University of Copenhagen, Bledgamsvej 3, 2200 Copenhagen N, Denmark
| | - Stefan E Seemann
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark Department of Veterinary Clinical and Animal Science, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark
| | - Jakob H Havgaard
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark Department of Veterinary Clinical and Animal Science, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark
| | - Peter F Stadler
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark Bioinformatics Group, Department of Computer Science & IZBI-Interdisciplinary Center for Bioinformatics & LIFE-Leipzig Research Center for Civilization Diseases, University Leipzig, Härtelstraße 16-18, 04107 Leipzig, Germany Institute for Theoretical Chemistry, University of Vienna, Währingerstraße 17, 1090 Wien, Austria Max Planck Institute for Mathematics in the Sciences, Inselstraße 22, 04103 Leipzig, Germany Santa Fe Institute, 1399 Hyde Park Road, Santa Fe, NM 87501, USA
| | - Ivo L Hofacker
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark Institute for Theoretical Chemistry, University of Vienna, Währingerstraße 17, 1090 Wien, Austria
| | - Henrik Nielsen
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark Department of Cellular and Molecular Medicine, Panum Institute, University of Copenhagen, Bledgamsvej 3, 2200 Copenhagen N, Denmark
| | - Jan Gorodkin
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark Department of Veterinary Clinical and Animal Science, University of Copenhagen, Grønnegårdsvej 3, 1870 Frederiksberg C, Denmark
| |
Collapse
|
227
|
RC3H1 post-transcriptionally regulates A20 mRNA and modulates the activity of the IKK/NF-κB pathway. Nat Commun 2015; 6:7367. [PMID: 26170170 PMCID: PMC4510711 DOI: 10.1038/ncomms8367] [Citation(s) in RCA: 81] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2014] [Accepted: 04/30/2015] [Indexed: 12/20/2022] Open
Abstract
The RNA-binding protein RC3H1 (also known as ROQUIN) promotes TNFα mRNA decay via a 3′UTR constitutive decay element (CDE). Here we applied PAR-CLIP to human RC3H1 to identify ∼3,800 mRNA targets with >16,000 binding sites. A large number of sites are distinct from the consensus CDE and revealed a structure-sequence motif with U-rich sequences embedded in hairpins. RC3H1 binds preferentially short-lived and DNA damage-induced mRNAs, indicating a role of this RNA-binding protein in the post-transcriptional regulation of the DNA damage response. Intriguingly, RC3H1 affects expression of the NF-κB pathway regulators such as IκBα and A20. RC3H1 uses ROQ and Zn-finger domains to contact a binding site in the A20 3′UTR, demonstrating a not yet recognized mode of RC3H1 binding. Knockdown of RC3H1 resulted in increased A20 protein expression, thereby interfering with IκB kinase and NF-κB activities, demonstrating that RC3H1 can modulate the activity of the IKK/NF-κB pathway. The RNA-binding protein RC3H1/ROQUIN1 promotes the degradation of mRNA by binding to a consensus CDE present in the 3′UTR. Here the authors expand the set of consensus sequences through which RCH31 binds and regulates mRNA encoding members of the DNA damage response and IKK/NF-κB pathway.
Collapse
|
228
|
Fricke M, Dünnes N, Zayas M, Bartenschlager R, Niepmann M, Marz M. Conserved RNA secondary structures and long-range interactions in hepatitis C viruses. RNA (NEW YORK, N.Y.) 2015; 21:1219-32. [PMID: 25964384 PMCID: PMC4478341 DOI: 10.1261/rna.049338.114] [Citation(s) in RCA: 47] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/17/2014] [Accepted: 03/07/2015] [Indexed: 05/02/2023]
Abstract
Hepatitis C virus (HCV) is a hepatotropic virus with a plus-strand RNA genome of ∼9.600 nt. Due to error-prone replication by its RNA-dependent RNA polymerase (RdRp) residing in nonstructural protein 5B (NS5B), HCV isolates are grouped into seven genotypes with several subtypes. By using whole-genome sequences of 106 HCV isolates and secondary structure alignments of the plus-strand genome and its minus-strand replication intermediate, we established refined secondary structures of the 5' untranslated region (UTR), the cis-acting replication element (CRE) in NS5B, and the 3' UTR. We propose an alternative structure in the 5' UTR, conserved secondary structures of 5B stem-loop (SL)1 and 5BSL2, and four possible structures of the X-tail at the very 3' end of the HCV genome. We predict several previously unknown long-range interactions, most importantly a possible circularization interaction between distinct elements in the 5' and 3' UTR, reminiscent of the cyclization elements of the related flaviviruses. Based on analogy to these viruses, we propose that the 5'-3' UTR base-pairing in the HCV genome might play an important role in viral RNA replication. These results may have important implications for our understanding of the nature of the cis-acting RNA elements in the HCV genome and their possible role in regulating the mutually exclusive processes of viral RNA translation and replication.
Collapse
Affiliation(s)
- Markus Fricke
- Faculty of Mathematics and Computer Science, Friedrich Schiller University Jena, 07743 Jena, Germany
| | - Nadia Dünnes
- Institute of Biochemistry, Medical Faculty, Justus-Liebig-University, 35392 Giessen, Germany
| | - Margarita Zayas
- Department of Infectious Diseases, Molecular Virology, University of Heidelberg, 69120 Heidelberg, Germany
| | - Ralf Bartenschlager
- Department of Infectious Diseases, Molecular Virology, University of Heidelberg, 69120 Heidelberg, Germany
| | - Michael Niepmann
- Institute of Biochemistry, Medical Faculty, Justus-Liebig-University, 35392 Giessen, Germany
| | - Manja Marz
- Faculty of Mathematics and Computer Science, Friedrich Schiller University Jena, 07743 Jena, Germany FLI Leibniz Institute for Age Research, 07745 Jena, Germany
| |
Collapse
|
229
|
Zhang W, Yuan Y, Yang S, Huang J, Huang L. ITS2 Secondary Structure Improves Discrimination between Medicinal "Mu Tong" Species when Using DNA Barcoding. PLoS One 2015; 10:e0131185. [PMID: 26132382 PMCID: PMC4488503 DOI: 10.1371/journal.pone.0131185] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2015] [Accepted: 05/30/2015] [Indexed: 01/25/2023] Open
Abstract
DNA barcoding is a promising species identification method, but it has proved difficult to find a standardized DNA marker in plant. Although the ITS/ITS2 RNA transcript has been proposed as the core barcode for seed plants, it has been criticized for being too conserved in some species to provide enough information or too variable in some species to align it within the different taxa ranks. We selected 30 individuals, representing 16 species and four families, to explore whether ITS2 can successfully resolve species in terms of secondary structure. Secondary structure was predicted using Mfold software and sequence-structure was aligned by MARNA. RNAstat software transformed the secondary structures into 28 symbol code data for maximum parsimony (MP) analysis. The results showed that the ITS2 structures in our samples had a common four-helix folding type with some shared motifs. This conserved structure facilitated the alignment of ambiguous sequences from divergent families. The structure alignment yielded a MP tree, in which most topological relationships were congruent with the tree constructed using nucleotide sequence data. When the data was combined, we obtained a well-resolved and highly supported phylogeny, in which individuals of a same species were clustered together into a monophyletic group. As a result, the different species that are often referred to as the herb “Mu tong” were successfully identified using short fragments of 250 bp ITS2 sequences, together with their secondary structure. Thus our analysis strengthens the potential of ITS2 as a promising DNA barcode because it incorporates valuable secondary structure information that will help improve discrimination between species.
Collapse
Affiliation(s)
- Wei Zhang
- Marine College, Shandong University at Weihai, Weihai, Shandong, China; State Key Laboratory of Dao-di Herbs, National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, China
| | - Yuan Yuan
- State Key Laboratory of Dao-di Herbs, National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, China
| | - Shuo Yang
- Marine College, Shandong University at Weihai, Weihai, Shandong, China
| | - Jianjun Huang
- Marine College, Shandong University at Weihai, Weihai, Shandong, China
| | - Luqi Huang
- State Key Laboratory of Dao-di Herbs, National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, China
| |
Collapse
|
230
|
Zirbel CL, Roll J, Sweeney BA, Petrov AI, Pirrung M, Leontis NB. Identifying novel sequence variants of RNA 3D motifs. Nucleic Acids Res 2015; 43:7504-20. [PMID: 26130723 PMCID: PMC4551918 DOI: 10.1093/nar/gkv651] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2015] [Accepted: 05/29/2015] [Indexed: 02/06/2023] Open
Abstract
Predicting RNA 3D structure from sequence is a major challenge in biophysics. An important sub-goal is accurately identifying recurrent 3D motifs from RNA internal and hairpin loop sequences extracted from secondary structure (2D) diagrams. We have developed and validated new probabilistic models for 3D motif sequences based on hybrid Stochastic Context-Free Grammars and Markov Random Fields (SCFG/MRF). The SCFG/MRF models are constructed using atomic-resolution RNA 3D structures. To parameterize each model, we use all instances of each motif found in the RNA 3D Motif Atlas and annotations of pairwise nucleotide interactions generated by the FR3D software. Isostericity relations between non-Watson–Crick basepairs are used in scoring sequence variants. SCFG techniques model nested pairs and insertions, while MRF ideas handle crossing interactions and base triples. We use test sets of randomly-generated sequences to set acceptance and rejection thresholds for each motif group and thus control the false positive rate. Validation was carried out by comparing results for four motif groups to RMDetect. The software developed for sequence scoring (JAR3D) is structured to automatically incorporate new motifs as they accumulate in the RNA 3D Motif Atlas when new structures are solved and is available free for download.
Collapse
Affiliation(s)
- Craig L Zirbel
- Department of Mathematics and Statistics, Bowling Green State University, Bowling Green, OH 43403, USA
| | - James Roll
- Department of Mathematics and Statistics, Bowling Green State University, Bowling Green, OH 43403, USA
| | - Blake A Sweeney
- Department of Biology, Bowling Green State University, Bowling Green, OH 43403, USA
| | - Anton I Petrov
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Meg Pirrung
- Department of Pharmacology, University of Colorado Denver, Aurora, CO 80045, USA
| | - Neocles B Leontis
- Department of Chemistry, Bowling Green State University, Bowling Green, OH 43403, USA
| |
Collapse
|
231
|
Moss WN, Steitz JA. In silico discovery and modeling of non-coding RNA structure in viruses. Methods 2015; 91:48-56. [PMID: 26116541 DOI: 10.1016/j.ymeth.2015.06.015] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2015] [Revised: 06/17/2015] [Accepted: 06/22/2015] [Indexed: 11/30/2022] Open
Abstract
This review covers several computational methods for discovering structured non-coding RNAs in viruses and modeling their putative secondary structures. Here we will use examples from two target viruses to highlight these approaches: influenza A virus-a relatively small, segmented RNA virus; and Epstein-Barr virus-a relatively large DNA virus with a complex transcriptome. Each system has unique challenges to overcome and unique characteristics to exploit. From these particular cases, generically useful approaches can be derived for the study of additional viral targets.
Collapse
Affiliation(s)
- Walter N Moss
- Department of Molecular Biophysics and Biochemistry, Howard Hughes Medical Institute, Yale University School of Medicine, New Haven, CT 06536, USA
| | - Joan A Steitz
- Department of Molecular Biophysics and Biochemistry, Howard Hughes Medical Institute, Yale University School of Medicine, New Haven, CT 06536, USA.
| |
Collapse
|
232
|
Hoksza D, Svozil D. Multiple 3D RNA Structure Superposition Using Neighbor Joining. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2015; 12:520-530. [PMID: 26357263 DOI: 10.1109/tcbb.2014.2351810] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Recent advances in RNA research and the steady growth of available RNA structures call for bioinformatics methods for handling and analyzing RNA structural data. Recently, we introduced SETTER-a fast and accurate method for RNA pairwise structure alignment. In this paper, we describe MultiSETTER, SETTER extension for multiple RNA structure alignment. MultiSETTER combines SETTER's decomposition of RNA structures into non-overlapping structural subunits with the multiple sequence alignment algorithm ClustalW adapted for the structure alignment. The accuracy of MultiSETTER was assessed by the automatic classification of RNA structures and its comparison to SCOR annotations. In addition, MultiSETTER classification was also compared to multiple sequence alignment-based and secondary structure alignment-based classifications provided by LocARNA and RNADistance tools, respectively. MultiSETTER precompiled Windows libraries, as well as the C++ source code, are freely available from http://siret.cz/multisetter.
Collapse
|
233
|
Will S, Otto C, Miladi M, Möhl M, Backofen R. SPARSE: quadratic time simultaneous alignment and folding of RNAs without sequence-based heuristics. Bioinformatics 2015; 31:2489-96. [PMID: 25838465 PMCID: PMC4514930 DOI: 10.1093/bioinformatics/btv185] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2014] [Accepted: 03/25/2015] [Indexed: 01/19/2023] Open
Abstract
Motivation: RNA-Seq experiments have revealed a multitude of novel ncRNAs. The gold standard for their analysis based on simultaneous alignment and folding suffers from extreme time complexity of O(n6). Subsequently, numerous faster ‘Sankoff-style’ approaches have been suggested. Commonly, the performance of such methods relies on sequence-based heuristics that restrict the search space to optimal or near-optimal sequence alignments; however, the accuracy of sequence-based methods breaks down for RNAs with sequence identities below 60%. Alignment approaches like LocARNA that do not require sequence-based heuristics, have been limited to high complexity (≥ quartic time). Results: Breaking this barrier, we introduce the novel Sankoff-style algorithm ‘sparsified prediction and alignment of RNAs based on their structure ensembles (SPARSE)’, which runs in quadratic time without sequence-based heuristics. To achieve this low complexity, on par with sequence alignment algorithms, SPARSE features strong sparsification based on structural properties of the RNA ensembles. Following PMcomp, SPARSE gains further speed-up from lightweight energy computation. Although all existing lightweight Sankoff-style methods restrict Sankoff’s original model by disallowing loop deletions and insertions, SPARSE transfers the Sankoff algorithm to the lightweight energy model completely for the first time. Compared with LocARNA, SPARSE achieves similar alignment and better folding quality in significantly less time (speedup: 3.7). At similar run-time, it aligns low sequence identity instances substantially more accurate than RAF, which uses sequence-based heuristics. Availability and implementation: SPARSE is freely available at http://www.bioinf.uni-freiburg.de/Software/SPARSE. Contact:backofen@informatik.uni-freiburg.de Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Sebastian Will
- Bioinformatics, Department of Computer Science, University of Freiburg, Freiburg, Germany, Bioinformatics, Department of Computer Science, University of Leipzig, Leipzig, Germany
| | - Christina Otto
- Bioinformatics, Department of Computer Science, University of Freiburg, Freiburg, Germany
| | - Milad Miladi
- Bioinformatics, Department of Computer Science, University of Freiburg, Freiburg, Germany
| | - Mathias Möhl
- Bioinformatics, Department of Computer Science, University of Freiburg, Freiburg, Germany
| | - Rolf Backofen
- Bioinformatics, Department of Computer Science, University of Freiburg, Freiburg, Germany, Centre for Biological Systems Analysis (ZBSA), University of Freiburg, Freiburg, Germany, Centre for Non-coding RNA in Technology and Health, University of Copenhagen, Copenhagen, Denmark and Centre for Biological Signalling Studies (BIOSS), University of Freiburg, Freiburg, Germany
| |
Collapse
|
234
|
Small regulatory RNA-induced growth rate heterogeneity of Bacillus subtilis. PLoS Genet 2015; 11:e1005046. [PMID: 25790031 PMCID: PMC4366234 DOI: 10.1371/journal.pgen.1005046] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2014] [Accepted: 02/01/2015] [Indexed: 11/26/2022] Open
Abstract
Isogenic bacterial populations can consist of cells displaying heterogeneous physiological traits. Small regulatory RNAs (sRNAs) could affect this heterogeneity since they act by fine-tuning mRNA or protein levels to coordinate the appropriate cellular behavior. Here we show that the sRNA RnaC/S1022 from the Gram-positive bacterium Bacillus subtilis can suppress exponential growth by modulation of the transcriptional regulator AbrB. Specifically, the post-transcriptional abrB-RnaC/S1022 interaction allows B. subtilis to increase the cell-to-cell variation in AbrB protein levels, despite strong negative autoregulation of the abrB promoter. This behavior is consistent with existing mathematical models of sRNA action, thus suggesting that induction of protein expression noise could be a new general aspect of sRNA regulation. Importantly, we show that the sRNA-induced diversity in AbrB levels generates heterogeneity in growth rates during the exponential growth phase. Based on these findings, we hypothesize that the resulting subpopulations of fast- and slow-growing B. subtilis cells reflect a bet-hedging strategy for enhanced survival of unfavorable conditions. Bacterial cells that share the same genetic information can display very different phenotypes, even if they grow under identical conditions. Despite the relevance of this population heterogeneity for processes like drug resistance and development, the molecular players that induce heterogenic phenotypes are often not known. Here we report that in the Gram-positive model bacterium Bacillus subtilis a small regulatory RNA (sRNA) can induce heterogeneity in growth rates by increasing cell-to-cell variation in the levels of the transcriptional regulator AbrB, which is important for rapid growth. Remarkably, the observed variation in AbrB levels is induced post-transcriptionally because of AbrB’s negative autoregulation, and is not observed at the abrB promoter level. We show that our observations are consistent with mathematical models of sRNA action, thus suggesting that induction of protein expression noise could be a new general aspect of sRNA regulation. Since a low growth rate can be beneficial for cellular survival, we propose that the observed subpopulations of fast- and slow-growing B. subtilis cells reflect a bet-hedging strategy for enhanced survival of unfavorable conditions.
Collapse
|
235
|
Song Y, Hua L, Shapiro BA, Wang JTL. Effective alignment of RNA pseudoknot structures using partition function posterior log-odds scores. BMC Bioinformatics 2015; 16:39. [PMID: 25727492 PMCID: PMC4339682 DOI: 10.1186/s12859-015-0464-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2014] [Accepted: 01/13/2015] [Indexed: 11/18/2022] Open
Abstract
Background RNA pseudoknots play important roles in many biological processes. Previous methods for comparative pseudoknot analysis mainly focus on simultaneous folding and alignment of RNA sequences. Little work has been done to align two known RNA secondary structures with pseudoknots taking into account both sequence and structure information of the two RNAs. Results In this article we present a novel method for aligning two known RNA secondary structures with pseudoknots. We adopt the partition function methodology to calculate the posterior log-odds scores of the alignments between bases or base pairs of the two RNAs with a dynamic programming algorithm. The posterior log-odds scores are then used to calculate the expected accuracy of an alignment between the RNAs. The goal is to find an optimal alignment with the maximum expected accuracy. We present a heuristic to achieve this goal. The performance of our method is investigated and compared with existing tools for RNA structure alignment. An extension of the method to multiple alignment of pseudoknot structures is also discussed. Conclusions The method described here has been implemented in a tool named RKalign, which is freely accessible on the Internet. As more and more pseudoknots are revealed, collected and stored in public databases, we anticipate a tool like RKalign will play a significant role in data comparison, annotation, analysis, and retrieval in these databases. Electronic supplementary material The online version of this article (doi:10.1186/s12859-015-0464-9) contains supplementary material, which is available to authorized users.
Collapse
|
236
|
Yang LY, Machado CA, Dang XD, Peng YQ, Yang DR, Zhang DY, Liao WJ. The incidence and pattern of copollinator diversification in dioecious and monoecious figs. Evolution 2015; 69:294-304. [PMID: 25495152 PMCID: PMC4328460 DOI: 10.1111/evo.12584] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2014] [Accepted: 11/21/2014] [Indexed: 01/08/2023]
Abstract
Differences in breeding system are associated with correlated ecological and morphological changes in plants. In Ficus, dioecy and monoecy are strongly associated with different suites of traits (tree height, population density, fruiting frequency, pollinator dispersal ecology). Although approximately 30% of fig species are pollinated by multiple species of fig-pollinating wasps, it has been suggested that copollinators are rare in dioecious figs. Here, we test whether there is a connection between the fig breeding system and copollinator incidence and diversification by conducting a meta-analysis of molecular data from pollinators of 119 fig species that includes new data from 15 Asian fig species. We find that the incidence of copollinators is not significantly different between monoecious and dioecious Ficus. Surprisingly, while all copollinators in dioecious figs are sister taxa, only 32.1% in monoecious figs are sister taxa. We present hypotheses to explain those patterns and discuss their consequences on the evolution of this mutualism.
Collapse
Affiliation(s)
- Li-Yuan Yang
- State Key Laboratory of Earth Surface Processes and Resource Ecology and MOE Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal UniversityBeijing, 100875, China
| | - Carlos A Machado
- Department of Biology, University of Maryland1210 Biology-Psychology Building, College Park, Maryland, 20742
| | - Xiao-Dong Dang
- State Key Laboratory of Earth Surface Processes and Resource Ecology and MOE Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal UniversityBeijing, 100875, China
| | - Yan-Qiong Peng
- Key Laboratory of Tropical Forest Ecology, Xishuangbanna Tropical Botanical Garden, Chinese Academy of SciencesKunming, 650223, China
| | - Da-Rong Yang
- Key Laboratory of Tropical Forest Ecology, Xishuangbanna Tropical Botanical Garden, Chinese Academy of SciencesKunming, 650223, China
| | - Da-Yong Zhang
- State Key Laboratory of Earth Surface Processes and Resource Ecology and MOE Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal UniversityBeijing, 100875, China
| | - Wan-Jin Liao
- State Key Laboratory of Earth Surface Processes and Resource Ecology and MOE Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal UniversityBeijing, 100875, China
| |
Collapse
|
237
|
Webb TE, Hughes A, Smalley DS, Spriggs KA. An internal ribosome entry site in the 5' untranslated region of epidermal growth factor receptor allows hypoxic expression. Oncogenesis 2015; 4:e134. [PMID: 25622307 PMCID: PMC4275558 DOI: 10.1038/oncsis.2014.43] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2014] [Revised: 09/29/2014] [Accepted: 10/15/2014] [Indexed: 12/25/2022] Open
Abstract
The expression of epidermal growth factor receptor (EGFR/ERBB1/HER1) is implicated in the progress of numerous cancers, a feature that has been exploited in the development of EGFR antibodies and EGFR tyrosine kinase inhibitors as anti-cancer drugs. However, EGFR also has important normal cellular functions, leading to serious side effects when EGFR is inhibited. One damaging characteristic of many oncogenes is the ability to be expressed in the hypoxic conditions associated with the tumour interior. It has previously been demonstrated that expression of EGFR is maintained in hypoxic conditions via an unknown mechanism of translational control, despite global translation rates generally being attenuated under hypoxic conditions. In this report, we demonstrate that the human EGFR 5′ untranslated region (UTR) sequence can initiate the expression of a downstream open reading frame via an internal ribosome entry site (IRES). We show that this effect is not due to either cryptic promoter activity or splicing events. We have investigated the requirement of the EGFR IRES for eukaryotic initiation factor 4A (eIF4A), which is an RNA helicase responsible for processing RNA secondary structure as part of translation initiation. Treatment with hippuristanol (a potent inhibitor of eIF4A) caused a decrease in EGFR 5′ UTR-driven reporter activity and also a reduction in EGFR protein level. Importantly, we show that expression of a reporter gene under the control of the EGFR IRES is maintained under hypoxic conditions despite a fall in global translation rates.
Collapse
Affiliation(s)
- T E Webb
- School of Pharmacy, University of Nottingham, Nottingham, UK
| | - A Hughes
- School of Pharmacy, University of Nottingham, Nottingham, UK
| | - D S Smalley
- School of Pharmacy, University of Nottingham, Nottingham, UK
| | - K A Spriggs
- School of Pharmacy, University of Nottingham, Nottingham, UK
| |
Collapse
|
238
|
Alkhnbashi OS, Costa F, Shah SA, Garrett RA, Saunders SJ, Backofen R. CRISPRstrand: predicting repeat orientations to determine the crRNA-encoding strand at CRISPR loci. ACTA ACUST UNITED AC 2015; 30:i489-96. [PMID: 25161238 PMCID: PMC4147912 DOI: 10.1093/bioinformatics/btu459] [Citation(s) in RCA: 58] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
MOTIVATION The discovery of CRISPR-Cas systems almost 20 years ago rapidly changed our perception of the bacterial and archaeal immune systems. CRISPR loci consist of several repetitive DNA sequences called repeats, inter-spaced by stretches of variable length sequences called spacers. This CRISPR array is transcribed and processed into multiple mature RNA species (crRNAs). A single crRNA is integrated into an interference complex, together with CRISPR-associated (Cas) proteins, to bind and degrade invading nucleic acids. Although existing bioinformatics tools can recognize CRISPR loci by their characteristic repeat-spacer architecture, they generally output CRISPR arrays of ambiguous orientation and thus do not determine the strand from which crRNAs are processed. Knowledge of the correct orientation is crucial for many tasks, including the classification of CRISPR conservation, the detection of leader regions, the identification of target sites (protospacers) on invading genetic elements and the characterization of protospacer-adjacent motifs. RESULTS We present a fast and accurate tool to determine the crRNA-encoding strand at CRISPR loci by predicting the correct orientation of repeats based on an advanced machine learning approach. Both the repeat sequence and mutation information were encoded and processed by an efficient graph kernel to learn higher-order correlations. The model was trained and tested on curated data comprising >4500 CRISPRs and yielded a remarkable performance of 0.95 AUC ROC (area under the curve of the receiver operator characteristic). In addition, we show that accurate orientation information greatly improved detection of conserved repeat sequence families and structure motifs. We integrated CRISPRstrand predictions into our CRISPRmap web server of CRISPR conservation and updated the latter to version 2.0. AVAILABILITY CRISPRmap and CRISPRstrand are available at http://rna.informatik.uni-freiburg.de/CRISPRmap. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Omer S Alkhnbashi
- Department of Computer Science, University of Freiburg, Georges-Köhler-Allee 106, 79110 Freiburg, Germany, Department of Biology, University of Copenhagen, Archaea Centre, Ole Maaloes Vej 5, DK2200 Copenhagen, Denmark and BIOSS Centre for Biological Signalling Studies, Cluster of Excellence, University of Freiburg, Germany
| | - Fabrizio Costa
- Department of Computer Science, University of Freiburg, Georges-Köhler-Allee 106, 79110 Freiburg, Germany, Department of Biology, University of Copenhagen, Archaea Centre, Ole Maaloes Vej 5, DK2200 Copenhagen, Denmark and BIOSS Centre for Biological Signalling Studies, Cluster of Excellence, University of Freiburg, Germany
| | - Shiraz A Shah
- Department of Computer Science, University of Freiburg, Georges-Köhler-Allee 106, 79110 Freiburg, Germany, Department of Biology, University of Copenhagen, Archaea Centre, Ole Maaloes Vej 5, DK2200 Copenhagen, Denmark and BIOSS Centre for Biological Signalling Studies, Cluster of Excellence, University of Freiburg, Germany
| | - Roger A Garrett
- Department of Computer Science, University of Freiburg, Georges-Köhler-Allee 106, 79110 Freiburg, Germany, Department of Biology, University of Copenhagen, Archaea Centre, Ole Maaloes Vej 5, DK2200 Copenhagen, Denmark and BIOSS Centre for Biological Signalling Studies, Cluster of Excellence, University of Freiburg, Germany
| | - Sita J Saunders
- Department of Computer Science, University of Freiburg, Georges-Köhler-Allee 106, 79110 Freiburg, Germany, Department of Biology, University of Copenhagen, Archaea Centre, Ole Maaloes Vej 5, DK2200 Copenhagen, Denmark and BIOSS Centre for Biological Signalling Studies, Cluster of Excellence, University of Freiburg, Germany
| | - Rolf Backofen
- Department of Computer Science, University of Freiburg, Georges-Köhler-Allee 106, 79110 Freiburg, Germany, Department of Biology, University of Copenhagen, Archaea Centre, Ole Maaloes Vej 5, DK2200 Copenhagen, Denmark and BIOSS Centre for Biological Signalling Studies, Cluster of Excellence, University of Freiburg, Germany Department of Computer Science, University of Freiburg, Georges-Köhler-Allee 106, 79110 Freiburg, Germany, Department of Biology, University of Copenhagen, Archaea Centre, Ole Maaloes Vej 5, DK2200 Copenhagen, Denmark and BIOSS Centre for Biological Signalling Studies, Cluster of Excellence, University of Freiburg, Germany
| |
Collapse
|
239
|
Otto C, Möhl M, Heyne S, Amit M, Landau GM, Backofen R, Will S. ExpaRNA-P: simultaneous exact pattern matching and folding of RNAs. BMC Bioinformatics 2014; 15:404. [PMID: 25551362 PMCID: PMC4302096 DOI: 10.1186/s12859-014-0404-0] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2014] [Accepted: 12/01/2014] [Indexed: 01/26/2023] Open
Abstract
Background Identifying sequence-structure motifs common to two RNAs can speed up the comparison of structural RNAs substantially. The core algorithm of the existent approach ExpaRNA solves this problem for a priori known input structures. However, such structures are rarely known; moreover, predicting them computationally is no rescue, since single sequence structure prediction is highly unreliable. Results The novel algorithm ExpaRNA-P computes exactly matching sequence-structure motifs in entire Boltzmann-distributed structure ensembles of two RNAs; thereby we match and fold RNAs simultaneously, analogous to the well-known “simultaneous alignment and folding” of RNAs. While this implies much higher flexibility compared to ExpaRNA, ExpaRNA-P has the same very low complexity (quadratic in time and space), which is enabled by its novel structure ensemble-based sparsification. Furthermore, we devise a generalized chaining algorithm to compute compatible subsets of ExpaRNA-P’s sequence-structure motifs. Resulting in the very fast RNA alignment approach ExpLoc-P, we utilize the best chain as anchor constraints for the sequence-structure alignment tool LocARNA. ExpLoc-P is benchmarked in several variants and versus state-of-the-art approaches. In particular, we formally introduce and evaluate strict and relaxed variants of the problem; the latter makes the approach sensitive to compensatory mutations. Across a benchmark set of typical non-coding RNAs, ExpLoc-P has similar accuracy to LocARNA but is four times faster (in both variants), while it achieves a speed-up over 30-fold for the longest benchmark sequences (≈400nt). Finally, different ExpLoc-P variants enable tailoring of the method to specific application scenarios. ExpaRNA-P and ExpLoc-P are distributed as part of the LocARNA package. The source code is freely available at http://www.bioinf.uni-freiburg.de/Software/ExpaRNA-P. Conclusions ExpaRNA-P’s novel ensemble-based sparsification reduces its complexity to quadratic time and space. Thereby, ExpaRNA-P significantly speeds up sequence-structure alignment while maintaining the alignment quality. Different ExpaRNA-P variants support a wide range of applications. Electronic supplementary material The online version of this article (doi:10.1186/s12859-014-0404-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Christina Otto
- Bioinformatics, Institute of Computer Science, University of Freiburg, Freiburg, Germany.
| | - Mathias Möhl
- Bioinformatics, Institute of Computer Science, University of Freiburg, Freiburg, Germany.
| | - Steffen Heyne
- Max Planck Institute of Immunobiology and Epigenetics, Stuebeweg 51, Freiburg, 79108, Germany.
| | - Mika Amit
- Department of Computer Science, University of Haifa, Mount Carmel, Haifa, Israel.
| | - Gad M Landau
- Department of Computer Science, University of Haifa, Mount Carmel, Haifa, Israel. .,Department of Computer Science and Engineering, NYU-Poly, Brooklyn, NY, USA.
| | - Rolf Backofen
- Bioinformatics, Institute of Computer Science, University of Freiburg, Freiburg, Germany. .,Center for Biological Signaling Studies (BIOSS), University of Freiburg, Freiburg, Germany. .,Centre for Biological Systems Analysis (ZBSA), University of Freiburg, Freiburg, Germany. .,Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, Frederiksberg C, DK-1870, Denmark.
| | - Sebastian Will
- Bioinformatics, Institute of Computer Science, University of Freiburg, Freiburg, Germany. .,Bioinformatics, Department of Computer Science, University of Leipzig, Leipzig, Germany.
| |
Collapse
|
240
|
Rampersad SN. ITS1, 5.8S and ITS2 secondary structure modelling for intra-specific differentiation among species of the Colletotrichum gloeosporioides sensu lato species complex. SPRINGERPLUS 2014; 3:684. [PMID: 25512885 PMCID: PMC4254888 DOI: 10.1186/2193-1801-3-684] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/19/2014] [Accepted: 11/12/2014] [Indexed: 11/10/2022]
Abstract
The Colletotrichum gloeosporioides species complex is among the most destructive fungal plant pathogens in the world, however, identification of member species which are of quarantine importance is impacted by a number of factors that negatively affect species identification. Structural information of the rRNA marker may be considered to be a conserved marker which can be used as supplementary information for possible species identification. The difficulty in using ITS rDNA sequences for identification lies in the low level of sequence variation at the intra-specific level and the generation of artificially-induced sequence variation due to errors in polymerization of the ITS array during DNA replication. Type and query ITS sequences were subjected to sequence analyses prior to generation of predicted consensus secondary structures, including the pattern of nucleotide polymorphisms and number of indel haplotypes, GC content, and detection of artificially-induced sequence variation. Data pertaining to structure stability, the presence of conserved motifs in secondary structures and mapping of all sequences onto the consensus C. gloeosporioides sensu stricto secondary structure for ITS1, 5.8S and ITS2 markers was then carried out. Motifs that are evolutionarily conserved among eukaryotes were found for all ITS1, 5.8S and ITS2 sequences. The sequences exhibited conserved features typical of functional rRNAs. Generally, polymorphisms occurred within less conserved regions and were seen as bulges, internal and terminal loops or non-canonical G–U base-pairs within regions of the double stranded helices. Importantly, there were also taxonomic motifs and base changes that were unique to specific taxa and which may be used to support intra-specific identification of members of the C. gloeosporioides sensu lato species complex.
Collapse
Affiliation(s)
- Sephra N Rampersad
- Department of Life Sciences, Faculty of Science and Technology, The University of the West Indies, St. Augustine, West Indies, Trinidad and Tobago
| |
Collapse
|
241
|
Madhugiri R, Fricke M, Marz M, Ziebuhr J. RNA structure analysis of alphacoronavirus terminal genome regions. Virus Res 2014; 194:76-89. [PMID: 25307890 PMCID: PMC7114417 DOI: 10.1016/j.virusres.2014.10.001] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2014] [Revised: 09/30/2014] [Accepted: 10/01/2014] [Indexed: 02/07/2023]
Abstract
Review of current knowledge of cis-acting RNA elements essential to coronavirus replication. Identification of RNA structural elements in alphacoronavirus terminal genome regions. Discussion of intra- and intergeneric conservation of genomic cis-acting RNA elements in alpha- and betacoronaviruses.
Coronavirus genome replication is mediated by a multi-subunit protein complex that is comprised of more than a dozen virally encoded and several cellular proteins. Interactions of the viral replicase complex with cis-acting RNA elements located in the 5′ and 3′-terminal genome regions ensure the specific replication of viral RNA. Over the past years, boundaries and structures of cis-acting RNA elements required for coronavirus genome replication have been extensively characterized in betacoronaviruses and, to a lesser extent, other coronavirus genera. Here, we review our current understanding of coronavirus cis-acting elements located in the terminal genome regions and use a combination of bioinformatic and RNA structure probing studies to identify and characterize putative cis-acting RNA elements in alphacoronaviruses. The study suggests significant RNA structure conservation among members of the genus Alphacoronavirus but also across genus boundaries. Overall, the conservation pattern identified for 5′ and 3′-terminal RNA structural elements in the genomes of alpha- and betacoronaviruses is in agreement with the widely used replicase polyprotein-based classification of the Coronavirinae, suggesting co-evolution of the coronavirus replication machinery with cognate cis-acting RNA elements.
Collapse
Affiliation(s)
- Ramakanth Madhugiri
- Institute of Medical Virology, Justus Liebig University Giessen, Schubertstrasse 81, 35392 Giessen, Germany
| | - Markus Fricke
- Faculty of Mathematics and Computer Science, Friedrich Schiller University Jena, Leutragraben 1, 07743 Jena, Germany
| | - Manja Marz
- Faculty of Mathematics and Computer Science, Friedrich Schiller University Jena, Leutragraben 1, 07743 Jena, Germany
| | - John Ziebuhr
- Institute of Medical Virology, Justus Liebig University Giessen, Schubertstrasse 81, 35392 Giessen, Germany.
| |
Collapse
|
242
|
Abstract
The Alu domain of the signal recognition particle (SRP) arrests protein biosynthesis by competition with elongation factor binding on the ribosome. The mammalian Alu domain is a protein-RNA complex, while prokaryotic Alu domains are protein-free with significant extensions of the RNA. Here we report the crystal structure of the complete Alu domain of Bacillus subtilis SRP RNA at 2.5 Å resolution. The bacterial Alu RNA reveals a compact fold, which is stabilized by prokaryote-specific extensions and interactions. In this 'closed' conformation, the 5' and 3' regions are clamped together by the additional helix 1, the connecting 3-way junction and a novel minor groove interaction, which we term the 'minor-saddle motif' (MSM). The 5' region includes an extended loop-loop pseudoknot made of five consecutive Watson-Crick base pairs. Homology modeling with the human Alu domain in context of the ribosome shows that an additional lobe in the pseudoknot approaches the large subunit, while the absence of protein results in the detachment from the small subunit. Our findings provide the structural basis for purely RNA-driven elongation arrest in prokaryotes, and give insights into the structural adaption of SRP RNA during evolution.
Collapse
Affiliation(s)
- Georg Kempf
- Heidelberg University Biochemistry Center (BZH), INF 328, D-69120 Heidelberg, Germany
| | - Klemens Wild
- Heidelberg University Biochemistry Center (BZH), INF 328, D-69120 Heidelberg, Germany
| | - Irmgard Sinning
- Heidelberg University Biochemistry Center (BZH), INF 328, D-69120 Heidelberg, Germany
| |
Collapse
|
243
|
Liu S, Vijayendran D, Carrillo-Tripp J, Miller WA, Bonning BC. Analysis of new aphid lethal paralysis virus (ALPV) isolates suggests evolution of two ALPV species. J Gen Virol 2014; 95:2809-2819. [PMID: 25170050 DOI: 10.1099/vir.0.069765-0] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Aphid lethal paralysis virus (ALPV; family Dicistroviridae) was first isolated from the bird cherry-oat aphid, Rhopalosiphum padi. ALPV-like virus sequences have been reported from many insects and insect predators. We identified a new isolate of ALPV (ALPV-AP) from the pea aphid, Acyrthosiphon pisum, and a new isolate (ALPV-DvV) from western corn rootworm, Diabrotica virgifera virgifera. ALPV-AP has an ssRNA genome of 9940 nt. Based on phylogenetic analysis, ALPV-AP was closely related to ALPV-AM, an ALPV isolate from honeybees, Apis mellifera, in Spain and Brookings, SD, USA. The distinct evolutionary branches suggested the existence of two lineages of the ALPV virus. One consisted of ALPV-AP and ALPV-AM, whilst all other isolates of ALPV grouped into the other lineage. The similarity of ALPV-AP and ALPV-AM was up to 88 % at the RNA level, compared with 78-79 % between ALPV-AP and other ALPV isolates. The sequence identity of proteins between ALPV-AP and ALPV-AM was 98-99 % for both ORF1 and ORF2, whilst only 85-87 % for ORF1 and 91-92 % for ORF2 between ALPV-AP and other ALPV isolates. Sequencing of RACE (rapid amplification of cDNA ends) products and cDNA clones of the virus genome revealed sequence variation in the 5' UTRs and in ORF1, indicating that ALPV may be under strong selection pressure, which could have important biological implications for ALPV host range and infectivity. Our results indicated that ALPV-like viruses infect insects in the order Coleoptera, in addition to the orders Hemiptera and Hymenoptera, and we propose that ALPV isolates be classified as two separate viral species.
Collapse
Affiliation(s)
- Sijun Liu
- Department of Entomology, Iowa State University, Ames, IA 50011, USA
| | | | - Jimena Carrillo-Tripp
- Department of Plant Pathology and Microbiology, Iowa State University, Ames, IA 50011, USA
| | - W Allen Miller
- Department of Plant Pathology and Microbiology, Iowa State University, Ames, IA 50011, USA
| | - Bryony C Bonning
- Department of Entomology, Iowa State University, Ames, IA 50011, USA
| |
Collapse
|
244
|
Bouwman RD, Palser A, Parry CM, Coulter E, Rasaiyaah J, Kellam P, Jenner RG. Human immunodeficiency virus Tat associates with a specific set of cellular RNAs. Retrovirology 2014; 11:53. [PMID: 24990269 PMCID: PMC4086691 DOI: 10.1186/1742-4690-11-53] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2013] [Accepted: 06/18/2014] [Indexed: 01/04/2023] Open
Abstract
BACKGROUND Human Immunodeficiency Virus 1 (HIV-1) exhibits a wide range of interactions with the host cell but whether viral proteins interact with cellular RNA is not clear. A candidate interacting factor is the trans-activator of transcription (Tat) protein. Tat is required for expression of virus genes but activates transcription through an unusual mechanism; binding to an RNA stem-loop, the transactivation response element (TAR), with the host elongation factor P-TEFb. HIV-1 Tat has also been shown to alter the expression of host genes during infection, contributing to viral pathogenesis but, whether Tat also interacts with cellular RNAs is unknown. RESULTS Using RNA immunoprecipitation coupled with microarray analysis, we have discovered that HIV-1 Tat is associated with a specific set of human mRNAs in T cells. mRNAs bound by Tat share a stem-loop structural element and encode proteins with common biological roles. In contrast, we do not find evidence that Tat associates with microRNAs or the RNA-induced silencing complex (RISC). The interaction of Tat with cellular RNA requires an intact RNA binding domain and Tat RNA binding is linked to an increase in RNA abundance in cell lines and during infection of primary CD4+ T cells by HIV. CONCLUSIONS We conclude that Tat interacts with a specific set of human mRNAs in T cells, many of which show changes in abundance in response to Tat and HIV infection. This work uncovers a previously unrecognised interaction between HIV and its host that may contribute to viral alteration of the host cellular environment.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Richard G Jenner
- MRC Centre for Medical Molecular Virology, Division of Infection and Immunity, University College London, London WC1E 6BT, UK.
| |
Collapse
|
245
|
Wolf M, Koetschan C, Müller T. ITS2, 18S, 16S or any other RNA - simply aligning sequences and their individual secondary structures simultaneously by an automatic approach. Gene 2014; 546:145-9. [PMID: 24881812 DOI: 10.1016/j.gene.2014.05.065] [Citation(s) in RCA: 55] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2014] [Revised: 05/28/2014] [Accepted: 05/29/2014] [Indexed: 11/29/2022]
Abstract
Secondary structures of RNA sequences are increasingly being used as additional information in reconstructing phylogenies and/or in distinguishing species by compensatory base change (CBC) analyses. However, in most cases just one secondary structure is used in manually correcting an automatically generated multiple sequence alignment and/or just one secondary structure is used in guiding a sequence alignment still completely generated by hand. With the advent of databases and tools offering individual RNA secondary structures, here we re-introduce a twelve letter code already implemented in 4SALE - a tool for synchronous sequence and secondary structure alignment and editing - that enables one to align RNA sequences and their individual secondary structures synchronously and fully automatic, while dramatically increasing the phylogenetic information content. We further introduce a scaled down non-GUI version of 4SALE particularly designed for big data analysis, and available at: http://4sale.bioapps.biozentrum.uni-wuerzburg.de.
Collapse
Affiliation(s)
- Matthias Wolf
- Department of Bioinformatics, Biocenter, University of Würzburg, 97074 Würzburg, Germany.
| | - Christian Koetschan
- Department of Bioinformatics, Biocenter, University of Würzburg, 97074 Würzburg, Germany
| | - Tobias Müller
- Department of Bioinformatics, Biocenter, University of Würzburg, 97074 Würzburg, Germany
| |
Collapse
|
246
|
Lertampaiporn S, Thammarongtham C, Nukoolkit C, Kaewkamnerdpong B, Ruengjitchatchawalya M. Identification of non-coding RNAs with a new composite feature in the Hybrid Random Forest Ensemble algorithm. Nucleic Acids Res 2014; 42:e93. [PMID: 24771344 PMCID: PMC4066759 DOI: 10.1093/nar/gku325] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2014] [Revised: 04/02/2014] [Accepted: 04/07/2014] [Indexed: 12/13/2022] Open
Abstract
To identify non-coding RNA (ncRNA) signals within genomic regions, a classification tool was developed based on a hybrid random forest (RF) with a logistic regression model to efficiently discriminate short ncRNA sequences as well as long complex ncRNA sequences. This RF-based classifier was trained on a well-balanced dataset with a discriminative set of features and achieved an accuracy, sensitivity and specificity of 92.11%, 90.7% and 93.5%, respectively. The selected feature set includes a new proposed feature, SCORE. This feature is generated based on a logistic regression function that combines five significant features-structure, sequence, modularity, structural robustness and coding potential-to enable improved characterization of long ncRNA (lncRNA) elements. The use of SCORE improved the performance of the RF-based classifier in the identification of Rfam lncRNA families. A genome-wide ncRNA classification framework was applied to a wide variety of organisms, with an emphasis on those of economic, social, public health, environmental and agricultural significance, such as various bacteria genomes, the Arthrospira (Spirulina) genome, and rice and human genomic regions. Our framework was able to identify known ncRNAs with sensitivities of greater than 90% and 77.7% for prokaryotic and eukaryotic sequences, respectively. Our classifier is available at http://ncrna-pred.com/HLRF.htm.
Collapse
Affiliation(s)
- Supatcha Lertampaiporn
- Biological Engineering Program, Faculty of Engineering, King Mongkut's University of Technology Thonburi, 126 Pracha Uthit Rd, Bangmod, Thung Khru, Bangkok 10140, Thailand
| | - Chinae Thammarongtham
- Biochemical Engineering and Pilot Plant Research and Development Unit, National Center for Genetic Engineering and Biotechnology at King Mongkut's University of Technology Thonburi (Bang Khun Thian Campus), 49 Soi Thian Thale 25, Bang Khun Thian Chai Thale Rd, Tha Kham, Bangkok 10150, Thailand
| | - Chakarida Nukoolkit
- School of Information Technology, King Mongkut's University of Technology Thonburi, 126 Pracha Uthit Rd, Bangmod, Thung Khru, Bangkok 10140, Thailand
| | - Boonserm Kaewkamnerdpong
- Biological Engineering Program, Faculty of Engineering, King Mongkut's University of Technology Thonburi, 126 Pracha Uthit Rd, Bangmod, Thung Khru, Bangkok 10140, Thailand
| | - Marasri Ruengjitchatchawalya
- Biotechnology Program, School of Bioresources and Technology, King Mongkut's University of Technology Thonburi (Bang Khun Thian Campus), 49 Soi Thian Thale 25, Bang Khun Thian Chai Thale Rd, Tha Kham, Bangkok 10150, Thailand Bioinformatics and Systems Biology Program, King Mongkut's University of Technology Thonburi (Bang Khun Thian Campus), 49 Soi Thian Thale 25, Bang Khun Thian Chai Thale Rd, Tha Kham, Bangkok 10150, Thailand
| |
Collapse
|
247
|
Backofen R, Amman F, Costa F, Findeiß S, Richter AS, Stadler PF. Bioinformatics of prokaryotic RNAs. RNA Biol 2014; 11:470-83. [PMID: 24755880 PMCID: PMC4152356 DOI: 10.4161/rna.28647] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2014] [Revised: 03/17/2014] [Accepted: 03/25/2014] [Indexed: 02/02/2023] Open
Abstract
The genome of most prokaryotes gives rise to surprisingly complex transcriptomes, comprising not only protein-coding mRNAs, often organized as operons, but also harbors dozens or even hundreds of highly structured small regulatory RNAs and unexpectedly large levels of anti-sense transcripts. Comprehensive surveys of prokaryotic transcriptomes and the need to characterize also their non-coding components is heavily dependent on computational methods and workflows, many of which have been developed or at least adapted specifically for the use with bacterial and archaeal data. This review provides an overview on the state-of-the-art of RNA bioinformatics focusing on applications to prokaryotes.
Collapse
Affiliation(s)
- Rolf Backofen
- Bioinformatics Group; Department of Computer Science; University of Freiburg; Georges-Köhler-Allee 106; D-79110 Freiburg, Germany
- Center for non-coding RNA in Technology and Health; University of Copenhagen; Grønnegårdsvej 3; DK-1870 Frederiksberg C, Denmark
| | - Fabian Amman
- Institute for Theoretical Chemistry; University of Vienna; Währingerstraße 17; A-1090 Wien, Austria
- Bioinformatics Group; Department of Computer Science, and Interdisciplinary Center for Bioinformatics; University of Leipzig; Härtelstraße 16-18; D-04107 Leipzig, Germany
| | - Fabrizio Costa
- Bioinformatics Group; Department of Computer Science; University of Freiburg; Georges-Köhler-Allee 106; D-79110 Freiburg, Germany
| | - Sven Findeiß
- Institute for Theoretical Chemistry; University of Vienna; Währingerstraße 17; A-1090 Wien, Austria
- Bioinformatics and Computational Biology Research Group; University of Vienna; Währingerstraße 29; A-1090 Wien, Austria
| | - Andreas S Richter
- Bioinformatics Group; Department of Computer Science; University of Freiburg; Georges-Köhler-Allee 106; D-79110 Freiburg, Germany
- Max Planck Institute of Immunobiology and Epigenetics; Stübeweg 51; D-79108 Freiburg, Germany
| | - Peter F Stadler
- Center for non-coding RNA in Technology and Health; University of Copenhagen; Grønnegårdsvej 3; DK-1870 Frederiksberg C, Denmark
- Institute for Theoretical Chemistry; University of Vienna; Währingerstraße 17; A-1090 Wien, Austria
- Bioinformatics Group; Department of Computer Science, and Interdisciplinary Center for Bioinformatics; University of Leipzig; Härtelstraße 16-18; D-04107 Leipzig, Germany
- Max Planck Institute for Mathematics in the Sciences; Inselstraße 22; D-04103 Leipzig, Germany
- Fraunhofer Institute for Cell Therapy and Immunology – IZI; Perlickstraße 1; D-04103 Leipzig, Germany
- Santa Fe Institute; Santa Fe, NM USA
| |
Collapse
|
248
|
Lu Z, Matera AG. Vicinal: a method for the determination of ncRNA ends using chimeric reads from RNA-seq experiments. Nucleic Acids Res 2014; 42:e79. [PMID: 24623808 PMCID: PMC4027162 DOI: 10.1093/nar/gku207] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Non-coding (nc)RNAs are important structural and regulatory molecules. Accurate determination of the primary sequence and secondary structure of ncRNAs is important for understanding their functions. During cDNA synthesis, RNA 3' end stem-loops can self-prime reverse transcription, creating RNA-cDNA chimeras. We found that chimeric RNA-cDNA fragments can also be detected at 5' end stem-loops, although at much lower frequency. Using the Gubler-Hoffman method, both types of chimeric fragments can be converted to cDNA during library construction, and they are readily detectable in high-throughput RNA sequencing (RNA-seq) experiments. Here, we show that these chimeric reads contain valuable information about the boundaries of ncRNAs. We developed a bioinformatic method, called Vicinal, to precisely map the ends of numerous fruitfly, mouse and human ncRNAs. Using this method, we analyzed chimeric reads from over 100 RNA-seq datasets, the results of which we make available for users to find RNAs of interest. In summary, we show that Vicinal is a useful tool for determination of the precise boundaries of uncharacterized ncRNAs, facilitating further structure/function studies.
Collapse
Affiliation(s)
- Zhipeng Lu
- Department of Biology, University of North Carolina, Chapel Hill, NC 27599-3280, USA Integrative Program for Biological and Genome Sciences, University of North Carolina, Chapel Hill, NC 27599-3280, USA
| | - A Gregory Matera
- Department of Biology, University of North Carolina, Chapel Hill, NC 27599-3280, USA Department of Genetics, University of North Carolina, Chapel Hill, NC 27599-3280, USA Integrative Program for Biological and Genome Sciences, University of North Carolina, Chapel Hill, NC 27599-3280, USA
| |
Collapse
|
249
|
Abendroth U, Schmidtke C, Bonas U. Small non-coding RNAs in plant-pathogenic Xanthomonas spp. RNA Biol 2014; 11:457-63. [PMID: 24667380 DOI: 10.4161/rna.28240] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
The genus Xanthomonas comprises a large group of plant-pathogenic bacteria. The infection and bacterial multiplication in the plant tissue depends on the type III secretion system and other virulence determinants. Recent studies revealed that bacterial virulence is also controlled at the post-transcriptional level by small non-coding RNAs (sRNAs). In this review, we highlight our current knowledge about sRNAs and RNA-binding proteins in Xanthomonas species.
Collapse
Affiliation(s)
- Ulrike Abendroth
- Dept. of Genetics; Martin-Luther-Universität Halle-Wittenberg; Halle, Germany
| | - Cornelius Schmidtke
- Dept. of Genetics; Martin-Luther-Universität Halle-Wittenberg; Halle, Germany
| | - Ulla Bonas
- Dept. of Genetics; Martin-Luther-Universität Halle-Wittenberg; Halle, Germany
| |
Collapse
|
250
|
Maticzka D, Lange SJ, Costa F, Backofen R. GraphProt: modeling binding preferences of RNA-binding proteins. Genome Biol 2014; 15:R17. [PMID: 24451197 PMCID: PMC4053806 DOI: 10.1186/gb-2014-15-1-r17] [Citation(s) in RCA: 187] [Impact Index Per Article: 18.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2013] [Accepted: 01/22/2014] [Indexed: 12/01/2022] Open
Abstract
We present GraphProt, a computational framework for learning sequence- and structure-binding preferences of RNA-binding proteins (RBPs) from high-throughput experimental data. We benchmark GraphProt, demonstrating that the modeled binding preferences conform to the literature, and showcase the biological relevance and two applications of GraphProt models. First, estimated binding affinities correlate with experimental measurements. Second, predicted Ago2 targets display higher levels of expression upon Ago2 knockdown, whereas control targets do not. Computational binding models, such as those provided by GraphProt, are essential for predicting RBP binding sites and affinities in all tissues. GraphProt is freely available at http://www.bioinf.uni-freiburg.de/Software/GraphProt.
Collapse
|