1
|
Diallo I, Ho J, Lalaouna D, Massé E, Provost P. RNA Sequencing Unveils Very Small RNAs With Potential Regulatory Functions in Bacteria. Front Mol Biosci 2022; 9:914991. [PMID: 35720117 PMCID: PMC9203972 DOI: 10.3389/fmolb.2022.914991] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2022] [Accepted: 05/02/2022] [Indexed: 12/21/2022] Open
Abstract
RNA sequencing (RNA-seq) is the gold standard for the discovery of small non-coding RNAs. Following a long-standing approach, reads shorter than 16 nucleotides (nt) are removed from the small RNA sequencing libraries or datasets. The serendipitous discovery of an eukaryotic 12 nt-long RNA species capable of modulating the microRNA from which they derive prompted us to challenge this dogma and, by expanding the window of RNA sizes down to 8 nt, to confirm the existence of functional very small RNAs (vsRNAs <16 nt). Here we report the detailed profiling of vsRNAs in Escherichia coli, E. coli-derived outer membrane vesicles (OMVs) and five other bacterial strains (Pseudomonas aeruginosa PA7, P. aeruginosa PAO1, Salmonella enterica serovar Typhimurium 14028S, Legionella pneumophila JR32 Philadelphia-1 and Staphylococcus aureus HG001). vsRNAs of 8–15 nt in length [RNAs (8-15 nt)] were found to be more abundant than RNAs of 16–30 nt in length [RNAs (16–30 nt)]. vsRNA biotypes were distinct and varied within and across bacterial species and accounted for one third of reads identified in the 8–30 nt window. The tRNA-derived fragments (tRFs) have appeared as a major biotype among the vsRNAs, notably Ile-tRF and Ala-tRF, and were selectively loaded in OMVs. tRF-derived vsRNAs appear to be thermodynamically stable with at least 2 G-C basepairs and stem-loop structure. The analyzed tRF-derived vsRNAs are predicted to target several human host mRNAs with diverse functions. Bacterial vsRNAs and OMV-derived vsRNAs could be novel players likely modulating the intricate relationship between pathogens and their hosts.
Collapse
Affiliation(s)
- Idrissa Diallo
- CHU de Québec Research Center/CHUL Pavilion, Department of Microbiology, Infectious Diseases and Immunology, Faculty of Medicine, Université Laval, Quebec City, QC, Canada
| | - Jeffrey Ho
- CHU de Québec Research Center/CHUL Pavilion, Department of Microbiology, Infectious Diseases and Immunology, Faculty of Medicine, Université Laval, Quebec City, QC, Canada
| | - David Lalaouna
- CRCHUS, RNA Group, Department of Biochemistry and Functional Genomics, Faculty of Medicine and Health Sciences, Université de Sherbrooke, Sherbrooke, QC, Canada
| | - Eric Massé
- CRCHUS, RNA Group, Department of Biochemistry and Functional Genomics, Faculty of Medicine and Health Sciences, Université de Sherbrooke, Sherbrooke, QC, Canada
| | - Patrick Provost
- CHU de Québec Research Center/CHUL Pavilion, Department of Microbiology, Infectious Diseases and Immunology, Faculty of Medicine, Université Laval, Quebec City, QC, Canada
- *Correspondence: Patrick Provost,
| |
Collapse
|
2
|
Zhuang J, Liu D, Lin M, Qiu W, Liu J, Chen S. PseUdeep: RNA Pseudouridine Site Identification with Deep Learning Algorithm. Front Genet 2021; 12:773882. [PMID: 34868261 PMCID: PMC8637112 DOI: 10.3389/fgene.2021.773882] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2021] [Accepted: 10/04/2021] [Indexed: 11/16/2022] Open
Abstract
Background: Pseudouridine (Ψ) is a common ribonucleotide modification that plays a significant role in many biological processes. The identification of Ψ modification sites is of great significance for disease mechanism and biological processes research in which machine learning algorithms are desirable as the lab exploratory techniques are expensive and time-consuming. Results: In this work, we propose a deep learning framework, called PseUdeep, to identify Ψ sites of three species: H. sapiens, S. cerevisiae, and M. musculus. In this method, three encoding methods are used to extract the features of RNA sequences, that is, one-hot encoding, K-tuple nucleotide frequency pattern, and position-specific nucleotide composition. The three feature matrices are convoluted twice and fed into the capsule neural network and bidirectional gated recurrent unit network with a self-attention mechanism for classification. Conclusion: Compared with other state-of-the-art methods, our model gets the highest accuracy of the prediction on the independent testing data set S-200; the accuracy improves 12.38%, and on the independent testing data set H-200, the accuracy improves 0.68%. Moreover, the dimensions of the features we derive from the RNA sequences are only 109,109, and 119 in H. sapiens, M. musculus, and S. cerevisiae, which is much smaller than those used in the traditional algorithms. On evaluation via tenfold cross-validation and two independent testing data sets, PseUdeep outperforms the best traditional machine learning model available. PseUdeep source code and data sets are available at https://github.com/dan111262/PseUdeep.
Collapse
Affiliation(s)
- Jujuan Zhuang
- College of Science, Dalian Maritime University, Dalian, China
| | - Danyang Liu
- College of Science, Dalian Maritime University, Dalian, China
| | - Meng Lin
- College of Science, Dalian Maritime University, Dalian, China
| | - Wenjing Qiu
- Electrical and Information Engineering, Anhui University of Technology, Anhui, China
- Geneis (Beijing) Co., Ltd., Beijing, China
| | | | - Size Chen
- Department of Oncology, The First Affiliated Hospital of Guangdong Pharmaceutical University, Guangzhou, China
- Guangdong Provincial Engineering Research Center for Esophageal Cancer Precise Therapy, The First Affiliated Hospital of Guangdong Pharmaceutical University, Guangzhou, China
- Central Laboratory, The First Affiliated Hospital of Guangdong Pharmaceutical University, Guangzhou, China
- *Correspondence: Size Chen,
| |
Collapse
|
3
|
Khan SM, He F, Wang D, Chen Y, Xu D. MU-PseUDeep: A deep learning method for prediction of pseudouridine sites. Comput Struct Biotechnol J 2020; 18:1877-1883. [PMID: 32774783 PMCID: PMC7387732 DOI: 10.1016/j.csbj.2020.07.010] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2020] [Revised: 07/09/2020] [Accepted: 07/10/2020] [Indexed: 01/18/2023] Open
Abstract
Pseudouridine synthase binds to uridine sites and catalyzes the conversion of uridine to pseudouridine (Ψ). This binding takes place in a specific context and in the conformation of nucleotides. Most machine-learning methods for Ψ site classification use nucleotide frequency as a feature, which may not fully depict the relevant conformation around a Ψ site. Using the power of deep learning and raw sequence, as well as secondary structure features, our tool MU-PseUDeep is designed to capture both the sequence and secondary structure context, which inputs the raw RNA sequence and the predicted secondary structure to two sets of convolutional neural networks. It has shown considerable improvement in Ψ site prediction over existing tools, XG-PseU, PseUI, and iRNA-PseU for both balanced and imbalanced datasets. To the best of our knowledge, this is the most accurate tool for Ψ site prediction. We also used MU-PseUDeep to scan the human transcriptome, which shows that the genes with predicted Ψ sites are enriched in nucleotide and protein binding, as well as in neurodegeneration pathways. The tool is open source, available at https://github.com/smk5g5/MU-PseUDeep.
Collapse
Affiliation(s)
- Saad M. Khan
- Informatics Institute, University of Missouri, Columbia, MO 65211, United States
| | - Fei He
- Department of Electrical Engineering and Computer Science and Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO 65211, United States
- School of Information Science and Technology, Northeast Normal University, Changchun 130117, China
| | - Duolin Wang
- Department of Electrical Engineering and Computer Science and Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO 65211, United States
| | - Yongbing Chen
- School of Information Science and Technology, Northeast Normal University, Changchun 130117, China
| | - Dong Xu
- Informatics Institute, University of Missouri, Columbia, MO 65211, United States
- Department of Electrical Engineering and Computer Science and Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO 65211, United States
- Corresponding author.
| |
Collapse
|
4
|
Evolution of Eukaryal and Archaeal Pseudouridine Synthase Pus10. J Mol Evol 2018; 86:77-89. [PMID: 29349599 DOI: 10.1007/s00239-018-9827-y] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2017] [Accepted: 01/03/2018] [Indexed: 10/18/2022]
Abstract
In archaea, pseudouridine (Ψ) synthase Pus10 modifies uridine (U) to Ψ at positions 54 and 55 of tRNA. In contrast, Pus10 is not found in bacteria, where modifications at those two positions are carried out by TrmA (U54 to m5U54) and TruB (U55 to Ψ55). Many eukaryotes have an apparent redundancy; their genomes contain orthologs of archaeal Pus10 and bacterial TrmA and TruB. Although eukaryal Pus10 genes share a conserved catalytic domain with archaeal Pus10 genes, their biological roles are not clear for the two reasons. First, experimental evidence suggests that human Pus10 participates in apoptosis induced by the tumor necrosis factor-related apoptosis-inducing ligand. Whether the function of human Pus10 is in place or in addition to of Ψ synthesis in tRNA is unknown. Second, Pus10 is found in earlier evolutionary branches of fungi (such as chytrid Batrachochytrium) but is absent in all dikaryon fungi surveyed (Ascomycetes and Basidiomycetes). We did a comprehensive analysis of sequenced genomes and found that orthologs of Pus10, TrmA, and TruB were present in all the animals, plants, and protozoa surveyed. This indicates that the common eukaryotic ancestor possesses all the three genes. Next, we examined 116 archaeal and eukaryotic Pus10 protein sequences to find that Pus10 existed as a single copy gene in all the surveyed genomes despite ancestral whole genome duplications had occurred. This indicates a possible deleterious gene dosage effect. Our results suggest that functional redundancy result in gene loss or neofunctionalization in different evolutionary lineages.
Collapse
|
5
|
Abstract
All types of nucleic acids in cells undergo naturally occurring chemical modifications, including DNA, rRNA, mRNA, snRNA, and most prominently tRNA. Over 100 different modifications have been described and every position in the purine and pyrimidine bases can be modified; often the sugar is also modified [1]. In tRNA, the function of modifications varies; some modulate global and/or local RNA structure, and others directly impact decoding and may be essential for viability. Whichever the case, the overall importance of modifications is highlighted by both their evolutionary conservation and the fact that organisms use a substantial portion of their genomes to encode modification enzymes, far exceeding what is needed for the de novo synthesis of the canonical nucleotides themselves [2]. Although some modifications occur at exactly the same nucleotide position in tRNAs from the three domains of life, many can be found at various positions in a particular tRNA and their location may vary between and within different tRNAs. With this wild array of chemical diversity and substrate specificities, one of the big challenges in the tRNA modification field has been to better understand at a molecular level the modes of substrate recognition by the different modification enzymes; in this realm RNA binding rests at the heart of the problem. This chapter will focus on several examples of modification enzymes where their mode of RNA binding is well understood; from these, we will try to draw general conclusions and highlight growing themes that may be applicable to the RNA modification field at large.
Collapse
|
6
|
Becker Y, Eaton CJ, Brasell E, May KJ, Becker M, Hassing B, Cartwright GM, Reinhold L, Scott B. The Fungal Cell-Wall Integrity MAPK Cascade Is Crucial for Hyphal Network Formation and Maintenance of Restrictive Growth of Epichloë festucae in Symbiosis With Lolium perenne. MOLECULAR PLANT-MICROBE INTERACTIONS : MPMI 2015; 28:69-85. [PMID: 25303335 DOI: 10.1094/mpmi-06-14-0183-r] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
Epichloë festucae is a mutualistic symbiont that systemically colonizes the intercellular spaces of Lolium perenne leaves to form a highly structured and interconnected hyphal network. In an Agrobacterium tumefaciens T-DNA forward genetic screen, we identified a mutant TM1066 that had a severe host interaction phenotype, causing stunting and premature senescence of the host. Molecular analysis revealed that the mutation responsible for this phenotype was in the cell-wall integrity (CWI) mitogen-activated protein kinase kinase (MAPKK), mkkA. Mutants generated by targeted deletion of the mkkA or the downstream mpkA kinase recapitulated the phenotypes observed for TM1066. Both mutants were defective in hyphal cell–cell fusion, formed intrahyphal hyphae, had enhanced conidiation, and showed microcyclic conidiation. Transmission electron microscopy and confocal microscopy analysis of leaf tissue showed that mutant hyphae were more abundant than the wild type in the intercellular spaces and colonized the vascular bundles. Hyphal branches failed to fuse but, instead, grew past one another to form bundles of convoluted hyphae. Mutant hyphae showed increased fluorescence with AF488-WGA, indicative of increased accessibility of chitin, a hypothesis supported by changes in the cell-wall ultrastructure. These results show that the CWI MAPK pathway is a key signaling pathway for controlling the mutualistic symbiotic interaction between E. festucae and L. perenne.
Collapse
|
7
|
Friedt J, Leavens FMV, Mercier E, Wieden HJ, Kothe U. An arginine-aspartate network in the active site of bacterial TruB is critical for catalyzing pseudouridine formation. Nucleic Acids Res 2014; 42:3857-70. [PMID: 24371284 PMCID: PMC3973310 DOI: 10.1093/nar/gkt1331] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2013] [Revised: 11/27/2013] [Accepted: 11/30/2013] [Indexed: 11/12/2022] Open
Abstract
Pseudouridine synthases introduce the most common RNA modification and likely use the same catalytic mechanism. Besides a catalytic aspartate residue, the contributions of other residues for catalysis of pseudouridine formation are poorly understood. Here, we have tested the role of a conserved basic residue in the active site for catalysis using the bacterial pseudouridine synthase TruB targeting U55 in tRNAs. Substitution of arginine 181 with lysine results in a 2500-fold reduction of TruB's catalytic rate without affecting tRNA binding. Furthermore, we analyzed the function of a second-shell aspartate residue (D90) that is conserved in all TruB enzymes and interacts with C56 of tRNA. Site-directed mutagenesis, biochemical and kinetic studies reveal that this residue is not critical for substrate binding but influences catalysis significantly as replacement of D90 with glutamate or asparagine reduces the catalytic rate 30- and 50-fold, respectively. In agreement with molecular dynamics simulations of TruB wild type and TruB D90N, we propose an electrostatic network composed of the catalytic aspartate (D48), R181 and D90 that is important for catalysis by fine-tuning the D48-R181 interaction. Conserved, negatively charged residues similar to D90 are found in a number of pseudouridine synthases, suggesting that this might be a general mechanism.
Collapse
Affiliation(s)
- Jenna Friedt
- Department of Chemistry and Biochemistry, Alberta RNA Research and Training Institute, University of Lethbridge, Lethbridge AB T1K 3M4, Canada
| | - Fern M. V. Leavens
- Department of Chemistry and Biochemistry, Alberta RNA Research and Training Institute, University of Lethbridge, Lethbridge AB T1K 3M4, Canada
| | - Evan Mercier
- Department of Chemistry and Biochemistry, Alberta RNA Research and Training Institute, University of Lethbridge, Lethbridge AB T1K 3M4, Canada
| | - Hans-Joachim Wieden
- Department of Chemistry and Biochemistry, Alberta RNA Research and Training Institute, University of Lethbridge, Lethbridge AB T1K 3M4, Canada
| | - Ute Kothe
- Department of Chemistry and Biochemistry, Alberta RNA Research and Training Institute, University of Lethbridge, Lethbridge AB T1K 3M4, Canada
| |
Collapse
|
8
|
Spenkuch F, Motorin Y, Helm M. Pseudouridine: still mysterious, but never a fake (uridine)! RNA Biol 2014; 11:1540-54. [PMID: 25616362 PMCID: PMC4615568 DOI: 10.4161/15476286.2014.992278] [Citation(s) in RCA: 146] [Impact Index Per Article: 14.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2014] [Revised: 09/23/2014] [Accepted: 10/10/2014] [Indexed: 01/15/2023] Open
Abstract
Pseudouridine (Ψ) is the most abundant of >150 nucleoside modifications in RNA. Although Ψ was discovered as the first modified nucleoside more than half a century ago, neither the enzymatic mechanism of its formation, nor the function of this modification are fully elucidated. We present the consistent picture of Ψ synthases, their substrates and their substrate positions in model organisms of all domains of life as it has emerged to date and point out the challenges that remain concerning higher eukaryotes and the elucidation of the enzymatic mechanism.
Collapse
MESH Headings
- Escherichia coli/genetics
- Escherichia coli/metabolism
- Humans
- Intramolecular Transferases/genetics
- Intramolecular Transferases/metabolism
- Isoenzymes/genetics
- Isoenzymes/metabolism
- Nucleic Acid Conformation
- Pseudouridine/metabolism
- RNA/genetics
- RNA/metabolism
- RNA Processing, Post-Transcriptional
- RNA, Mitochondrial
- RNA, Ribosomal/genetics
- RNA, Ribosomal/metabolism
- RNA, Transfer, Amino Acid-Specific/chemistry
- RNA, Transfer, Amino Acid-Specific/genetics
- RNA, Transfer, Amino Acid-Specific/metabolism
- Ribonucleoproteins, Small Nuclear/genetics
- Ribonucleoproteins, Small Nuclear/metabolism
- Ribosomes/chemistry
- Ribosomes/metabolism
- Saccharomyces cerevisiae/genetics
- Saccharomyces cerevisiae/metabolism
- Uridine/metabolism
- RNA, Guide, CRISPR-Cas Systems
Collapse
Affiliation(s)
- Felix Spenkuch
- Institute of Pharmacy and Biochemistry; Johannes Gutenberg-University of Mainz; Mainz, Germany
| | - Yuri Motorin
- Laboratoire IMoPA; Ingénierie Moléculaire et Physiopathologie Articulaire; BioPôle de l'Université de Lorraine; Campus Biologie-Santé; Faculté de Médecine; Vandoeuvre-les-Nancy Cedex, France
| | - Mark Helm
- Institute of Pharmacy and Biochemistry; Johannes Gutenberg-University of Mainz; Mainz, Germany
| |
Collapse
|