1
|
Khomarbaghi Z, Ngan WY, Ayan GB, Lim S, Dechow-Seligmann G, Nandy P, Gallie J. Large-scale duplication events underpin population-level flexibility in tRNA gene copy number in Pseudomonas fluorescens SBW25. Nucleic Acids Res 2024; 52:2446-2462. [PMID: 38296823 PMCID: PMC10954465 DOI: 10.1093/nar/gkae049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Revised: 01/10/2024] [Accepted: 01/15/2024] [Indexed: 02/02/2024] Open
Abstract
The complement of tRNA genes within a genome is typically considered to be a (relatively) stable characteristic of an organism. Here, we demonstrate that bacterial tRNA gene set composition can be more flexible than previously appreciated, particularly regarding tRNA gene copy number. We report the high-rate occurrence of spontaneous, large-scale, tandem duplication events in laboratory populations of the bacterium Pseudomonas fluorescens SBW25. The identified duplications are up to ∼1 Mb in size (∼15% of the wildtype genome) and are predicted to change the copy number of up to 917 genes, including several tRNA genes. The observed duplications are inherently unstable: they occur, and are subsequently lost, at extremely high rates. We propose that this unusually plastic type of mutation provides a mechanism by which tRNA gene set diversity can be rapidly generated, while simultaneously preserving the underlying tRNA gene set in the absence of continued selection. That is, if a tRNA set variant provides no fitness advantage, then high-rate segregation of the duplication ensures the maintenance of the original tRNA gene set. However, if a tRNA gene set variant is beneficial, the underlying duplication fragment(s) may persist for longer and provide raw material for further, more stable, evolutionary change.
Collapse
Affiliation(s)
- Zahra Khomarbaghi
- Microbial Evolutionary Dynamics Research Group, Department of Theoretical Biology, Max Planck Institute for Evolutionary Biology, Plön 24306, Germany
| | - Wing Y Ngan
- Microbial Evolutionary Dynamics Research Group, Department of Theoretical Biology, Max Planck Institute for Evolutionary Biology, Plön 24306, Germany
| | - Gökçe B Ayan
- Microbial Evolutionary Dynamics Research Group, Department of Theoretical Biology, Max Planck Institute for Evolutionary Biology, Plön 24306, Germany
| | - Sungbin Lim
- Microbial Evolutionary Dynamics Research Group, Department of Theoretical Biology, Max Planck Institute for Evolutionary Biology, Plön 24306, Germany
| | - Gunda Dechow-Seligmann
- Microbial Evolutionary Dynamics Research Group, Department of Theoretical Biology, Max Planck Institute for Evolutionary Biology, Plön 24306, Germany
| | - Pabitra Nandy
- Microbial Evolutionary Dynamics Research Group, Department of Theoretical Biology, Max Planck Institute for Evolutionary Biology, Plön 24306, Germany
| | - Jenna Gallie
- Microbial Evolutionary Dynamics Research Group, Department of Theoretical Biology, Max Planck Institute for Evolutionary Biology, Plön 24306, Germany
| |
Collapse
|
2
|
van der Gulik PT, Egas M, Kraaijeveld K, Dombrowski N, Groot AT, Spang A, Hoff WD, Gallie J. On distinguishing between canonical tRNA genes and tRNA gene fragments in prokaryotes. RNA Biol 2023; 20:48-58. [PMID: 36727270 PMCID: PMC9897764 DOI: 10.1080/15476286.2023.2172370] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
Automated genome annotation is essential for extracting biological information from sequence data. The identification and annotation of tRNA genes is frequently performed by the software package tRNAscan-SE, the output of which is listed for selected genomes in the Genomic tRNA database (GtRNAdb). Here, we highlight a pervasive error in prokaryotic tRNA gene sets on GtRNAdb: the mis-categorization of partial, non-canonical tRNA genes as standard, canonical tRNA genes. Firstly, we demonstrate the issue using the tRNA gene sets of 20 organisms from the archaeal taxon Thermococcaceae. According to GtRNAdb, these organisms collectively deviate from the expected set of tRNA genes in 15 instances, including the listing of eleven putative canonical tRNA genes. However, after detailed manual annotation, only one of these eleven remains; the others are either partial, non-canonical tRNA genes resulting from the integration of genetic elements or CRISPR-Cas activity (seven instances), or attributable to ambiguities in input sequences (three instances). Secondly, we show that similar examples of the mis-categorization of predicted tRNA sequences occur throughout the prokaryotic sections of GtRNAdb. While both canonical and non-canonical prokaryotic tRNA gene sequences identified by tRNAscan-SE are biologically interesting, the challenge of reliably distinguishing between them remains. We recommend employing a combination of (i) screening input sequences for the genetic elements typically associated with non-canonical tRNA genes, and ambiguities, (ii) activating the tRNAscan-SE automated pseudogene detection function, and (iii) scrutinizing predicted tRNA genes with low isotype scores. These measures greatly reduce manual annotation efforts, and lead to improved prokaryotic tRNA gene set predictions.
Collapse
Affiliation(s)
- Peter T.S. van der Gulik
- Department of Algorithms and Complexity, Centrum Wiskunde & Informatica, Amsterdam, The Netherlands,CONTACT Peter T.S. van der Gulik Centrum Wiskunde & Informatica, Amsterdam, The Netherlands
| | - Martijn Egas
- Department of Evolutionary and Population Biology, Institute for Biodiversity and Ecosystem Dynamics, University of Amsterdam, Amsterdam, The Netherlands
| | - Ken Kraaijeveld
- Leiden Centre for Applied Bioscience, University of Applied Sciences Leiden, Leiden, The Netherlands
| | - Nina Dombrowski
- Department of Marine Microbiology and Biogeochemistry, NIOZ, Royal Netherlands Institute for Sea Research, Den Burg, The Netherlands
| | - Astrid T. Groot
- Department of Evolutionary and Population Biology, Institute for Biodiversity and Ecosystem Dynamics, University of Amsterdam, Amsterdam, The Netherlands
| | - Anja Spang
- Department of Evolutionary and Population Biology, Institute for Biodiversity and Ecosystem Dynamics, University of Amsterdam, Amsterdam, The Netherlands,Department of Marine Microbiology and Biogeochemistry, NIOZ, Royal Netherlands Institute for Sea Research, Den Burg, The Netherlands
| | - Wouter D. Hoff
- Department of Microbiology and Molecular Genetics, Oklahoma State University, Stillwater, Oklahoma, USA,Wouter Hoff
| | - Jenna Gallie
- Department of Evolutionary Theory, Max Planck Institute for Evolutionary Biology, Plön, Germany,Jenna Gallie
| |
Collapse
|
3
|
Wang YC, Lu MC, Li YT, Tang HL, Hsiao PY, Chen BH, Teng RH, Chiou CS, Lai YC. Microevolution of CG23-I Hypervirulent Klebsiella pneumoniae during Recurrent Infections in a Single Patient. Microbiol Spectr 2022; 10:e0207722. [PMID: 36129301 PMCID: PMC9602619 DOI: 10.1128/spectrum.02077-22] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2022] [Accepted: 09/05/2022] [Indexed: 12/31/2022] Open
Abstract
CG23-I lineage constitutes the majority of hypervirulent Klebsiella pneumoniae. A diabetic patient suffered six episodes of infections caused by CG23-I K. pneumoniae. A total of nine isolates were collected in 2020. We performed whole-genome sequencing to elucidate the within-patient evolution of CG23-I K. pneumoniae. The maximum pairwise difference among the nine longitudinally collected isolates was five single nucleotide polymorphisms. One of the mutations was at the Asp87 position of GyrA. Four indels were identified, including an initiator tRNAfMet duplication, a tRNAArg deletion, a 7-bp insertion, and a 22-bp deletion. All 9 isolates had the genomic features of CG23-I K. pneumoniae, a chromosome-borne ICEKp10, and a large virulence plasmid. The carriage of a complete set of genes for the biosynthesis of colibactin by ICEKp10 gave the nine isolates an ability to cause DNA damage to RAW264.7 cells. Compared with the initial isolate, the last isolate with an additional copy of initiator tRNAfMet grew faster in a nutrient-limiting condition and exhibited enhanced virulence in BALB/c mice. Collectively, we characterized the within-patient microevolution of CG23-I K. pneumoniae through an in-depth comparison of genome sequences. Using the in vitro experiments and mouse models, we also demonstrated that these genomic alterations endowed the isolates with advantages to pass through in vivo selection. IMPORTANCE CG23-I is a significant lineage of hypervirulent Klebsiella pneumoniae. This study characterizes the within-patient microevolution of CG23-I K. pneumoniae. Selective pressures from continuous use of antibiotics favored point mutations contributing to bacterial resistance to antibiotics. The duplication of an initiator tRNAfMet gene helped CG23-I K. pneumoniae proliferate to reach a maximal population size during infections. For longer persistence inside a human host, the large virulence plasmid evolved with more flexible control of replication through duplication of the iteron-1 region. With the genomic alterations, the last isolate had a growth advantage over the initial isolate and exhibited enhanced virulence in BALB/c mice. This study gives us a deeper understanding of the genome evolution during the within-patient pathoadaptation of CG23-I K. pneumoniae.
Collapse
Affiliation(s)
- Yao-Chen Wang
- Department of Internal Medicine, Chung Shan Medical University Hospital, Taichung, Taiwan
- School of Medicine, Chung Shan Medical University, Taichung, Taiwan
| | - Min-Chi Lu
- Department of Microbiology and Immunology, School of Medicine, China Medical University, Taichung, Taiwan
- Division of Infectious Diseases, Department of Internal Medicine, China Medical University Hospital, Taichung, Taiwan
| | - Yia-Ting Li
- Division of Respiratory Therapy, Department of Internal Medicine, Chung Shan Medical University Hospital, Taichung, Taiwan
| | - Hui-Ling Tang
- Department of Microbiology and Immunology, School of Medicine, China Medical University, Taichung, Taiwan
| | - Pei-Yi Hsiao
- Department of Microbiology and Immunology, School of Medicine, Chung Shan Medical University, Taichung, Taiwan
| | - Bo-Han Chen
- Central Region Laboratory, Center for Diagnostics and Vaccine Development, Centers for Disease Control, Ministry of Health and Welfare, Taipei, Taiwan
| | - Ru-Hsiou Teng
- Central Region Laboratory, Center for Diagnostics and Vaccine Development, Centers for Disease Control, Ministry of Health and Welfare, Taipei, Taiwan
| | - Chien-Shun Chiou
- Central Region Laboratory, Center for Diagnostics and Vaccine Development, Centers for Disease Control, Ministry of Health and Welfare, Taipei, Taiwan
| | - Yi-Chyi Lai
- Department of Internal Medicine, Chung Shan Medical University Hospital, Taichung, Taiwan
- Department of Microbiology and Immunology, School of Medicine, Chung Shan Medical University, Taichung, Taiwan
| |
Collapse
|
4
|
Chua M, Tan A, Tremblay-Savard O. BOPAL 2.0 and a study of tRNA and rRNA gene evolution in Clostridium. J Bioinform Comput Biol 2021; 19:2140007. [PMID: 34775921 DOI: 10.1142/s0219720021400072] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
We present BOPAL 2.0, an improved version of the BOPAL algorithm for the evolutionary history inference of tRNA and rRNA genes in bacterial genomes. Our approach can infer complete evolutionary scenarios and ancestral gene orders on a phylogeny and considers a wide range of events such as duplications, deletions, substitutions, inversions and transpositions. It is based on the fact that tRNA and rRNA genes are often organized in operons/clusters in bacteria, and this information is used to help identify orthologous genes for each genome comparison. BOPAL 2.0 introduces new features, such as a triple-wise alignment step, context-aware singleton matching and a second pass of the algorithm. Evaluation on simulated datasets shows that BOPAL 2.0 outperforms the original BOPAL in terms of the accuracy of inferred events and ancestral genomes. We also present a study of the tRNA/rRNA gene evolution in the Clostridium genus, in which the organization of these genes is very divergent. Our results indicate that tRNA and rRNA genes in Clostridium have evolved through numerous duplications, losses, transpositions and substitutions, but very few inversions were inferred.
Collapse
Affiliation(s)
- Meghan Chua
- Department of Computer Science, University of Manitoba, 103 Dafoe Rd W, Winnipeg, Manitoba, Canada R3T 5V6, Canada
| | - Anthony Tan
- Department of Computer Science, University of Manitoba, 103 Dafoe Rd W, Winnipeg, Manitoba, Canada R3T 5V6, Canada
| | - Olivier Tremblay-Savard
- Department of Computer Science, University of Manitoba, 103 Dafoe Rd W, Winnipeg, Manitoba, Canada R3T 5V6, Canada
| |
Collapse
|
5
|
Villarreal LP, Witzany G. Social Networking of Quasi-Species Consortia drive Virolution via Persistence. AIMS Microbiol 2021; 7:138-162. [PMID: 34250372 PMCID: PMC8255905 DOI: 10.3934/microbiol.2021010] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2021] [Accepted: 04/25/2021] [Indexed: 12/31/2022] Open
Abstract
The emergence of cooperative quasi-species consortia (QS-C) thinking from the more accepted quasispecies equations of Manfred Eigen, provides a conceptual foundation from which concerted action of RNA agents can now be understood. As group membership becomes a basic criteria for the emergence of living systems, we also start to understand why the history and context of social RNA networks become crucial for survival and function. History and context of social RNA networks also lead to the emergence of a natural genetic code. Indeed, this QS-C thinking can also provide us with a transition point between the chemical world of RNA replicators and the living world of RNA agents that actively differentiate self from non-self and generate group identity with membership roles. Importantly the social force of a consortia to solve complex, multilevel problems also depend on using opposing and minority functions. The consortial action of social networks of RNA stem-loops subsequently lead to the evolution of cellular organisms representing a tree of life.
Collapse
|
6
|
Ayan GB, Park HJ, Gallie J. The birth of a bacterial tRNA gene by large-scale, tandem duplication events. eLife 2020; 9:57947. [PMID: 33124983 PMCID: PMC7661048 DOI: 10.7554/elife.57947] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2020] [Accepted: 10/29/2020] [Indexed: 12/20/2022] Open
Abstract
Organisms differ in the types and numbers of tRNA genes that they carry. While the evolutionary mechanisms behind tRNA gene set evolution have been investigated theoretically and computationally, direct observations of tRNA gene set evolution remain rare. Here, we report the evolution of a tRNA gene set in laboratory populations of the bacterium Pseudomonas fluorescens SBW25. The growth defect caused by deleting the single-copy tRNA gene, serCGA, is rapidly compensated by large-scale (45–290 kb) duplications in the chromosome. Each duplication encompasses a second, compensatory tRNA gene (serTGA) and is associated with a rise in tRNA-Ser(UGA) in the mature tRNA pool. We postulate that tRNA-Ser(CGA) elimination increases the translational demand for tRNA-Ser(UGA), a pressure relieved by increasing serTGA copy number. This work demonstrates that tRNA gene sets can evolve through duplication of existing tRNA genes, a phenomenon that may contribute to the presence of multiple, identical tRNA gene copies within genomes.
Collapse
Affiliation(s)
- Gökçe B Ayan
- Department of Evolutionary Theory, Max Planck Institute for Evolutionary Biology, Plön, Germany
| | - Hye Jin Park
- Department of Evolutionary Theory, Max Planck Institute for Evolutionary Biology, Plön, Germany.,Asia Pacific Center for Theoretical Physics, Pohang, Republic of Korea
| | - Jenna Gallie
- Department of Evolutionary Theory, Max Planck Institute for Evolutionary Biology, Plön, Germany
| |
Collapse
|
7
|
Wang SE, Brooks AES, Poole AM, Simoes-Barbosa A. Determinants of translation efficiency in the evolutionarily-divergent protist Trichomonas vaginalis. BMC Mol Cell Biol 2020; 21:54. [PMID: 32689943 PMCID: PMC7370421 DOI: 10.1186/s12860-020-00297-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2020] [Accepted: 06/29/2020] [Indexed: 01/08/2023] Open
Abstract
BACKGROUND Trichomonas vaginalis, the causative agent of a prevalent urogenital infection in humans, is an evolutionarily divergent protozoan. Protein-coding genes in T. vaginalis are largely controlled by two core promoter elements, producing mRNAs with short 5' UTRs. The specific mechanisms adopted by T. vaginalis to fine-tune the translation efficiency (TE) of mRNAs remain largely unknown. RESULTS Using both computational and experimental approaches, this study investigated two key factors influencing TE in T. vaginalis: codon usage and mRNA secondary structure. Statistical dependence between TE and codon adaptation index (CAI) highlighted the impact of codon usage on mRNA translation in T. vaginalis. A genome-wide interrogation revealed that low structural complexity at the 5' end of mRNA followed closely by a highly structured downstream region correlates with TE variation in this organism. To validate these findings, a synthetic library of 15 synonymous iLOV genes was created, representing five mRNA folding profiles and three codon usage profiles. Fluorescence signals produced by the expression of these synonymous iLOV genes in T. vaginalis were consistent with and validated our in silico predictions. CONCLUSIONS This study demonstrates the role of codon usage bias and mRNA secondary structure in TE of T. vaginalis mRNAs, contributing to a better understanding of the factors that influence, and possibly regulate, gene expression in this human pathogen.
Collapse
Affiliation(s)
- Shuqi E Wang
- School of Biological Sciences, The University of Auckland, Auckland, New Zealand
- Department of Microbiology, Immunology, and Molecular Genetics, University of California, Los Angeles, Los Angeles, USA
| | - Anna E S Brooks
- School of Biological Sciences, The University of Auckland, Auckland, New Zealand
- Maurice Wilkins Centre, The University of Auckland, Auckland, New Zealand
| | - Anthony M Poole
- School of Biological Sciences, The University of Auckland, Auckland, New Zealand
- Bioinformatics Institute, The University of Auckland, Auckland, New Zealand
| | | |
Collapse
|
8
|
Pawliszak T, Chua M, Leung CK, Tremblay-Savard O. Operon-based approach for the inference of rRNA and tRNA evolutionary histories in bacteria. BMC Genomics 2020; 21:252. [PMID: 32299351 PMCID: PMC7160887 DOI: 10.1186/s12864-020-6612-2] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
Abstract
Background In bacterial genomes, rRNA and tRNA genes are often organized into operons, i.e. segments of closely located genes that share a single promoter and are transcribed as a single unit. Analyzing how these genes and operons evolve can help us understand what are the most common evolutionary events affecting them and give us a better picture of ancestral codon usage and protein synthesis. Results We introduce BOPAL, a new approach for the inference of evolutionary histories of rRNA and tRNA genes in bacteria, which is based on the identification of orthologous operons. Since operons can move around in the genome but are rarely transformed (e.g. rarely broken into different parts), this approach allows for a better inference of orthologous genes in genomes that have been affected by many rearrangements, which in turn helps with the inference of more realistic evolutionary scenarios and ancestors. Conclusions From our comparisons of BOPAL with other gene order alignment programs using simulated data, we have found that BOPAL infers evolutionary events and ancestral gene orders more accurately than other methods based on alignments. An analysis of 12 Bacillus genomes also showed that BOPAL performs just as well as other programs at building ancestral histories in a minimal amount of events.
Collapse
Affiliation(s)
- Tomasz Pawliszak
- Department of Computer Science, University of Manitoba, Winnipeg, Canada
| | - Meghan Chua
- Department of Computer Science, University of Manitoba, Winnipeg, Canada
| | - Carson K Leung
- Department of Computer Science, University of Manitoba, Winnipeg, Canada
| | | |
Collapse
|
9
|
tRNA Genes Affect Chromosome Structure and Function via Local Effects. Mol Cell Biol 2019; 39:MCB.00432-18. [PMID: 30718362 DOI: 10.1128/mcb.00432-18] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2018] [Accepted: 01/18/2019] [Indexed: 11/20/2022] Open
Abstract
The genome is packaged and organized in an ordered, nonrandom manner, and specific chromatin segments contact nuclear substructures to mediate this organization. tRNA genes (tDNAs) are binding sites for transcription factors and architectural proteins and are thought to play an important role in the organization of the genome. In this study, we investigate the roles of tDNAs in genomic organization and chromosome function by editing a chromosome so that it lacked any tDNAs. Surprisingly our analyses of this tDNA-less chromosome show that loss of tDNAs does not grossly affect chromatin architecture or chromosome tethering and mobility. However, loss of tDNAs affects local nucleosome positioning and the binding of SMC proteins at these loci. The absence of tDNAs also leads to changes in centromere clustering and a reduction in the frequency of long-range HML-HMR heterochromatin clustering with concomitant effects on gene silencing. We propose that the tDNAs primarily affect local chromatin structure, which results in effects on long-range chromosome architecture.
Collapse
|
10
|
Sun Y, Tamarit D, Andersson SGE. Switches in Genomic GC Content Drive Shifts of Optimal Codons under Sustained Selection on Synonymous Sites. Genome Biol Evol 2018; 9:2560-2579. [PMID: 27540085 PMCID: PMC5629928 DOI: 10.1093/gbe/evw201] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/12/2016] [Indexed: 12/16/2022] Open
Abstract
The major codon preference model suggests that codons read by tRNAs in high concentrations are preferentially utilized in highly expressed genes. However, the identity of the optimal codons differs between species although the forces driving such changes are poorly understood. We suggest that these questions can be tackled by placing codon usage studies in a phylogenetic framework and that bacterial genomes with extreme nucleotide composition biases provide informative model systems. Switches in the background substitution biases from GC to AT have occurred in Gardnerella vaginalis (GC = 32%), and from AT to GC in Lactobacillus delbrueckii (GC = 62%) and Lactobacillus fermentum (GC = 63%). We show that despite the large effects on codon usage patterns by these switches, all three species evolve under selection on synonymous sites. In G. vaginalis, the dramatic codon frequency changes coincide with shifts of optimal codons. In contrast, the optimal codons have not shifted in the two Lactobacillus genomes despite an increased fraction of GC-ending codons. We suggest that all three species are in different phases of an on-going shift of optimal codons, and attribute the difference to a stronger background substitution bias and/or longer time since the switch in G. vaginalis. We show that comparative and correlative methods for optimal codon identification yield conflicting results for genomes in flux and discuss possible reasons for the mispredictions. We conclude that switches in the direction of the background substitution biases can drive major shifts in codon preference patterns even under sustained selection on synonymous codon sites.
Collapse
Affiliation(s)
- Yu Sun
- Department of Molecular Evolution, Cell and Molecular Biology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Daniel Tamarit
- Department of Molecular Evolution, Cell and Molecular Biology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Siv G E Andersson
- Department of Molecular Evolution, Cell and Molecular Biology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| |
Collapse
|
11
|
Mohanta TK, Syed AS, Ameen F, Bae H. Novel Genomic and Evolutionary Perspective of Cyanobacterial tRNAs. Front Genet 2017; 8:200. [PMID: 29321793 PMCID: PMC5733544 DOI: 10.3389/fgene.2017.00200] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2017] [Accepted: 11/21/2017] [Indexed: 11/30/2022] Open
Abstract
Transfer RNA (tRNA) plays a central role in protein synthesis and acts as an adaptor molecule between an mRNA and an amino acid. A tRNA has an L-shaped clover leaf-like structure and contains an acceptor arm, D-arm, D-loop, anti-codon arm, anti-codon loop, variable loop, Ψ-arm and Ψ-loop. All of these arms and loops are important in protein translation. Here, we aimed to delineate the genomic architecture of these arms and loops in cyanobacterial tRNA. Studies from tRNA sequences from 61 cyanobacterial species showed that, except for few tRNAs (tRNAAsn, tRNALeu, tRNAGln, and tRNAMet), all contained a G nucleotide at the 1st position in the acceptor arm. tRNALeu and tRNAMet did not contain any conserved nucleotides at the 1st position whereas tRNAAsn and tRNAGln contained a conserved U1 nucleotide. In several tRNA families, the variable region also contained conserved nucleotides. Except for tRNAMet and tRNAGlu, all other tRNAs contained a conserved A nucleotide at the 1st position in the D-loop. The Ψ-loop contained a conserved U1-U2-C3-x-A5-x-U7 sequence, except for tRNAGly, tRNAAla, tRNAVal, tRNAPhe, tRNAThr, and tRNAGln in which the U7 nucleotide was not conserved. However, in tRNAAsp, the U7 nucleotide was substituted with a C7 nucleotide. Additionally, tRNAArg, tRNAGly, and tRNALys of cyanobacteria contained a group I intron within the anti-codon loop region. Maximum composite likelihood study on the transition/transversion of cyanobacterial tRNA revealed that the rate of transition was higher than the rate of transversion. An evolutionary tree was constructed to understand the evolution of cyanobacterial tRNA and analyses revealed that cyanobacterial tRNA may have evolved polyphyletically with high rate of gene loss.
Collapse
Affiliation(s)
- Tapan K Mohanta
- School of Biotechnology, Yeungnam University, Gyeongsan, South Korea
| | - Asad S Syed
- Department of Botany and Microbiology, College of Science, King Saud University, Riyadh, Saudi Arabia
| | - Fuad Ameen
- Department of Botany and Microbiology, College of Science, King Saud University, Riyadh, Saudi Arabia
| | - Hanhong Bae
- School of Biotechnology, Yeungnam University, Gyeongsan, South Korea
| |
Collapse
|
12
|
Tran TTT, Belahbib H, Bonnefoy V, Talla E. A Comprehensive tRNA Genomic Survey Unravels the Evolutionary History of tRNA Arrays in Prokaryotes. Genome Biol Evol 2015; 8:282-95. [PMID: 26710853 PMCID: PMC4758250 DOI: 10.1093/gbe/evv254] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/13/2015] [Indexed: 01/12/2023] Open
Abstract
Considering the importance of tRNAs in the translation machinery, scant attention has been paid to tRNA array units defined as genomic regions containing at least 20 tRNA genes with a minimal tRNA gene density of two tRNA genes per kilobase. Our analysis of Acidithiobacillus ferrivorans CF27 and Acidithiobacillus ferrooxidans ATCC 23270(T) genomes showed that both display a tRNA array unit with syntenic conservation which mainly contributed to the tRNA gene redundancy in these two organisms. Our investigations into the occurrence and distribution of tRNA array units revealed that 1) this tRNA organization is limited to few phyla and mainly found in Gram-positive bacteria; and 2) the presence of tRNA arrays favors the redundancy of tRNA genes, in particular those encoding the core tRNA isoacceptors. Finally, comparative array organization revealed that tRNA arrays were acquired through horizontal gene transfer (from Firmicutes or unknown donor), before being subjected to tRNA rearrangements, deletions, and duplications. In Bacilli, the most parsimonious evolutionary history involved two common ancestors and the acquisition of their arrays arose late in evolution, in the genera branches. Functional roles of the array units in organism lifestyle, selective genetic advantage and translation efficiency, as well as the evolutionary advantages of organisms harboring them were proposed. Our study offers new insight into the structural organization and evolution of tRNA arrays in prokaryotic organisms.
Collapse
Affiliation(s)
- Tam T T Tran
- Aix Marseille Université, CNRS, IGS, UMR 7256, IMM, France
| | | | | | - Emmanuel Talla
- Aix Marseille Université, CNRS, IGS, UMR 7256, IMM, France
| |
Collapse
|
13
|
McDonald MJ, Chou CH, Swamy KBS, Huang HD, Leu JY. The evolutionary dynamics of tRNA-gene copy number and codon-use in E. coli. BMC Evol Biol 2015; 15:163. [PMID: 26282127 PMCID: PMC4539685 DOI: 10.1186/s12862-015-0441-y] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2015] [Accepted: 07/29/2015] [Indexed: 11/25/2022] Open
Abstract
Background The introduction of foreign DNA by Lateral Gene Transfer (LGT) can quickly and drastically alter genome composition. Problems can arise if the genes introduced by LGT use codons that are not suited to the host’s translational machinery. Here we investigate compensatory adaptation of E. coli in response to the introduction of large volumes of codons that are rarely used by the host genome. Results We analyze genome sequences from the E. coli/Shigella complex, and find that certain tRNA genes are present in multiple copies in two pathogenic Shigella and O157:H7 subgroups of E. coli. Furthermore, we show that the codons that correspond to these multi-copy number tRNA genes are enriched in the high copy number Selfish Genetic Elements (SGE’s) in Shigella and laterally introduced genes in O157:H7. We analyze the duplicate copies and find evidence for the selective retention of tRNA genes introduced by LGT in response to the changed codon content of the genome. Conclusion These data support a model where the relatively rapid influx of LGT genes and SGE’s introduces a large number of genes maladapted to the host’s translational machinery. Under these conditions, it becomes advantageous for the host to retain tRNA genes that are required for the incorporation of amino acids at these codons. Subsequently, the increased number of copies of these specific tRNA genes adjusts the cellular tRNA pool to the demands set by global shifts in codon usage. Electronic supplementary material The online version of this article (doi:10.1186/s12862-015-0441-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | - Chih-Hung Chou
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, Taiwan. .,Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, Taiwan.
| | | | - Hsien-Da Huang
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, Taiwan. .,Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, Taiwan.
| | - Jun-Yi Leu
- Institute of Molecular Biology, Academia Sinica, Taipei, Taiwan.
| |
Collapse
|
14
|
López-Madrigal S, Latorre A, Moya A, Gil R. The link between independent acquisition of intracellular gamma-endosymbionts and concerted evolution in Tremblaya princeps. Front Microbiol 2015; 6:642. [PMID: 26161080 PMCID: PMC4479817 DOI: 10.3389/fmicb.2015.00642] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2015] [Accepted: 06/12/2015] [Indexed: 02/05/2023] Open
Abstract
Many insect species establish mutualistic symbiosis with intracellular bacteria that complement their unbalanced diets. The betaproteobacterium "Candidatus Tremblaya" maintains an ancient symbiosis with mealybugs (Hemiptera: Pseudococcidae), which are classified in subfamilies Phenacoccinae and Pseudococcinae. Most Phenacoccinae mealybugs have "Candidatus Tremblaya phenacola" as their unique endosymbiont, while most Pseudococcinae mealybugs show a nested symbiosis (a bacterial symbiont placed inside another one) where every "Candidatus Tremblaya princeps" cell harbors several cells of a gammaproteobacterium. Genomic characterization of the endosymbiotic consortium from Planococcus citri, composed by "Ca. Tremblaya princeps" and "Candidatus Moranella endobia," unveiled several atypical features of the former's genome, including the concerted evolution of paralogous loci. Its comparison with the genome of "Ca. Tremblaya phenacola" PAVE, single endosymbiont of Phenacoccus avenae, suggests that the atypical reductive evolution of "Ca. Tremblaya princeps" could be linked to the acquisition of "Ca. Moranella endobia," which possess an almost complete set of genes encoding proteins involved in homologous recombination. In order to test this hypothesis, we performed comparative genomics between "Ca. Tremblaya phenacola" and "Ca. Tremblaya princeps" and searched for the co-occurrence of concerted evolution and homologous recombination genes in endosymbiotic consortia from four unexplored mealybug species, Dysmicoccus boninsis, Planococcus ficus, Pseudococcus longispinus, and Pseudococcus viburni. Our results support a link between concerted evolution and nested endosymbiosis.
Collapse
Affiliation(s)
- Sergio López-Madrigal
- Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de ValènciaValència, Spain
| | - Amparo Latorre
- Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de ValènciaValència, Spain
- Área de Genómica y Salud de la Fundación para el Fomento de la Investigación Sanitaria y Biomédica de la Comunitat Valenciana (FISABIO) – Salud PúblicaValència, Spain
| | - Andrés Moya
- Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de ValènciaValència, Spain
- Área de Genómica y Salud de la Fundación para el Fomento de la Investigación Sanitaria y Biomédica de la Comunitat Valenciana (FISABIO) – Salud PúblicaValència, Spain
| | - Rosario Gil
- Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de ValènciaValència, Spain
- *Correspondence: Rosario Gil, Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de València, C/Catedrático José Beltrán 2, 46980 Paterna, Valencia, Spain
| |
Collapse
|
15
|
Andreotti S, Reinert K, Canzar S. The duplication-loss small phylogeny problem: from cherries to trees. J Comput Biol 2014; 20:643-59. [PMID: 24000925 DOI: 10.1089/cmb.2013.0057] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The reconstruction of the history of evolutionary genome-wide events among a set of related organisms is of great biological interest since it can help to reveal the genomic basis of phenotypes. The sequencing of whole genomes faciliates the study of gene families that vary in size through duplication and loss events, like transfer RNA. However, a high sequence similarity often does not allow one to distinguish between orthologs and paralogs. Previous methods have addressed this difficulty by taking into account flanking regions of members of a family independently. We go one step further by inferring the order of genes of (a set of) families for ancestral genomes by considering the order of these genes on sequenced genomes. We present a novel branch-and-cut algorithm to solve the two species small phylogeny problem in the evolutionary model of duplications and losses. On average, our implementation, DupLoCut, improves the running time of a recently proposed method in the experiments on six Vibrionaceae lineages by a factor of ∼200. Besides the mere improvement in running time, the efficiency of our approach allows us to extend our model from cherries of a species tree, that is, subtrees with two leaves, to the median of three species setting. Being able to determine the median of three species is of key importance to one of the most common approaches to ancestral reconstruction, and our experiments show that its repeated computation considerably reduces the number of duplications and losses along the tree both on simulated instances comprising 128 leaves and a set of Bacillus genomes. Furthermore, in our simulations we show that a reduction in cost goes hand in hand with an improvement of the predicted ancestral genomes. Finally, we prove that the small phylogeny problem in the duplication-loss model is NP-complete already for two species.
Collapse
Affiliation(s)
- Sandro Andreotti
- Department of Mathematics and Computer Science, Institute of Computer Science, Freie Universität Berlin, Berlin, Germany
| | | | | |
Collapse
|
16
|
Yona AH, Bloom-Ackermann Z, Frumkin I, Hanson-Smith V, Charpak-Amikam Y, Feng Q, Boeke JD, Dahan O, Pilpel Y. tRNA genes rapidly change in evolution to meet novel translational demands. eLife 2013; 2:e01339. [PMID: 24363105 PMCID: PMC3868979 DOI: 10.7554/elife.01339] [Citation(s) in RCA: 68] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/05/2022] Open
Abstract
Changes in expression patterns may occur when organisms are presented with new environmental challenges, for example following migration or genetic changes. To elucidate the mechanisms by which the translational machinery adapts to such changes, we perturbed the tRNA pool of Saccharomyces cerevisiae by tRNA gene deletion. We then evolved the deletion strain and observed that the genetic adaptation was recurrently based on a strategic mutation that changed the anticodon of other tRNA genes to match that of the deleted one. Strikingly, a systematic search in hundreds of genomes revealed that anticodon mutations occur throughout the tree of life. We further show that the evolution of the tRNA pool also depends on the need to properly couple translation to protein folding. Together, our observations shed light on the evolution of the tRNA pool, demonstrating that mutation in the anticodons of tRNA genes is a common adaptive mechanism when meeting new translational demands. DOI:http://dx.doi.org/10.7554/eLife.01339.001 Genes contain the blueprints for the proteins that are essential for countless biological functions and processes, and the path that leads from a particular gene to the corresponding protein is long and complex. The genetic information stored in the DNA must first be transcribed to produce a messenger RNA molecule, which then has to be translated to produce a string of amino acids that fold to form a protein. The translation step is performed by a molecular machine called the ribosome, with transfer RNA molecules bringing the amino acids that are needed to make the protein. The information in messenger RNA is stored as a series of letters, with groups of three letters called codons representing the different amino acids. Since there are four letters—A, C, G and U—it is possible to form 64 different codons. And since there are only 20 amino acids, two or more different codons can specify the same amino acid (for example, AGU and AGC both specify serine), and two or more different transfer RNA molecules can take this amino acid to the ribosome. Moreover, some codons are found more often than others in the messenger RNA molecules, so the genes that encode the related transfer RNA molecules are more common than the genes for other transfer RNA molecules. Environmental pressures mean that organisms must adapt to survive, with some genes and proteins increasing in importance, and others becoming less important. Clearly the relative numbers of the different transfer RNA molecules will also need to change to reflect these evolutionary changes, but the details of how this happens were not understood. Now Yona et al. have explored this issue by studying yeast cells that lack a gene for one of the less common transfer RNA molecules (corresponding to the codon AGG, which specifies the amino acid arginine). At first this mutation resulted in slower growth of the yeast cells, but after being allowed to evolve over 200 generations, the rate of growth matched that of a normal strain with all transfer RNA genes. Yona et al. found that the gene for a more common transfer RNA molecule, corresponding to the codon AGA, which also specifies arginine, had mutated to AGG. As a result, the mutated yeast was eventually able to produce proteins as quickly as wild type yeast. Moreover, further experiments showed that the levels of some transfer RNAs are kept deliberately low in order to slow down the production of proteins so as to ensure that the proteins assume their correct structure. But does the way these cells evolved in the lab resemble what happened in nature? To address this question Yona et al. examined a database of transfer RNA sequences from more than 500 species, and found evidence for the same codon-based switching mechanism in many species across the tree of life. DOI:http://dx.doi.org/10.7554/eLife.01339.002
Collapse
Affiliation(s)
- Avihu H Yona
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel
| | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Shabalina SA, Spiridonov NA, Kashina A. Sounds of silence: synonymous nucleotides as a key to biological regulation and complexity. Nucleic Acids Res 2013; 41:2073-94. [PMID: 23293005 PMCID: PMC3575835 DOI: 10.1093/nar/gks1205] [Citation(s) in RCA: 187] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
Messenger RNA is a key component of an intricate regulatory network of its own. It accommodates numerous nucleotide signals that overlap protein coding sequences and are responsible for multiple levels of regulation and generation of biological complexity. A wealth of structural and regulatory information, which mRNA carries in addition to the encoded amino acid sequence, raises the question of how these signals and overlapping codes are delineated along non-synonymous and synonymous positions in protein coding regions, especially in eukaryotes. Silent or synonymous codon positions, which do not determine amino acid sequences of the encoded proteins, define mRNA secondary structure and stability and affect the rate of translation, folding and post-translational modifications of nascent polypeptides. The RNA level selection is acting on synonymous sites in both prokaryotes and eukaryotes and is more common than previously thought. Selection pressure on the coding gene regions follows three-nucleotide periodic pattern of nucleotide base-pairing in mRNA, which is imposed by the genetic code. Synonymous positions of the coding regions have a higher level of hybridization potential relative to non-synonymous positions, and are multifunctional in their regulatory and structural roles. Recent experimental evidence and analysis of mRNA structure and interspecies conservation suggest that there is an evolutionary tradeoff between selective pressure acting at the RNA and protein levels. Here we provide a comprehensive overview of the studies that define the role of silent positions in regulating RNA structure and processing that exert downstream effects on proteins and their functions.
Collapse
Affiliation(s)
- Svetlana A Shabalina
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20984, USA.
| | | | | |
Collapse
|
18
|
Iriarte A, Baraibar JD, Romero H, Castro-Sowinski S, Musto H. Evolution of optimal codon choices in the family Enterobacteriaceae. MICROBIOLOGY-SGM 2013; 159:555-564. [PMID: 23288542 DOI: 10.1099/mic.0.061952-0] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
The Enterobacteriaceae are a large family of Proteobacteria that include many well-known prokaryotic genera, such as Escherichia, Yersinia and Salmonella. The main ideas of synonymous codon usage (CU) evolution and translational selection have been deeply influenced by studies with these bacterial groups. In this work we report the analysis of the CU pattern of completely sequenced bacterial genomes that belong to the Enterobacteriaceae. The effect of selection in translation acting at the levels of speed and accuracy, and phylogenetic trends within this group are described. Preferred (optimal) codons were identified. The evolutionary dynamics of these codons were studied and following a Bayesian approach these preferences were traced back to the common ancestor of the family. We found that there is some level of variation in selection among the analysed micro-organisms that is probably associated with lineage-specific trends. The codon bias was largely conserved across the evolutionary time of the family in highly expressed genes and protein conserved regions, suggesting a major role of negative selection. In this sense, the results support the idea that the extant CU bias is finely tuned over the ancestral well-conserved pool of tRNAs.
Collapse
Affiliation(s)
- Andrés Iriarte
- Área Genética, Depto. de Genética y Mejora Animal, Facultad de Veterinaria (UDELAR), Av. A. Lasplaces 1550, CP 11600, Montevideo, Uruguay.,Laboratorio de Evolución, Facultad de Ciencias (UDELAR), Iguá 4225, 11400 Montevideo, Uruguay.,Laboratorio de Organización y Evolución del Genoma, Facultad de Ciencias (UDELAR), Iguá 4225, 11400 Montevideo, Uruguay
| | - Juan Diego Baraibar
- Laboratorio de Organización y Evolución del Genoma, Facultad de Ciencias (UDELAR), Iguá 4225, 11400 Montevideo, Uruguay
| | - Héctor Romero
- Laboratorio de Organización y Evolución del Genoma, Facultad de Ciencias (UDELAR), Iguá 4225, 11400 Montevideo, Uruguay
| | - Susana Castro-Sowinski
- Sección Bioquímica y Biología Molecular, Facultad de Ciencias (UDELAR), Iguá 4225, 11400 Montevideo, Uruguay
| | - Héctor Musto
- Laboratorio de Organización y Evolución del Genoma, Facultad de Ciencias (UDELAR), Iguá 4225, 11400 Montevideo, Uruguay
| |
Collapse
|
19
|
Bobay LM, Rocha EPC, Touchon M. The adaptation of temperate bacteriophages to their host genomes. Mol Biol Evol 2012; 30:737-51. [PMID: 23243039 PMCID: PMC3603311 DOI: 10.1093/molbev/mss279] [Citation(s) in RCA: 129] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Rapid turnover of mobile elements drives the plasticity of bacterial genomes. Integrated bacteriophages (prophages) encode host-adaptive traits and represent a sizable fraction of bacterial chromosomes. We hypothesized that natural selection shapes prophage integration patterns relative to the host genome organization. We tested this idea by detecting and studying 500 prophages of 69 strains of Escherichia and Salmonella. Phage integrases often target not only conserved genes but also intergenic positions, suggesting purifying selection for integration sites. Furthermore, most integration hotspots are conserved between the two host genera. Integration sites seem also selected at the large chromosomal scale, as they are nonrandomly organized in terms of the origin-terminus axis and the macrodomain structure. The genes of lambdoid prophages are systematically co-oriented with the bacterial replication fork and display the host high frequency of polarized FtsK-orienting polar sequences motifs required for chromosome segregation. matS motifs are strongly avoided by prophages suggesting counter selection of motifs disrupting macrodomains. These results show how natural selection for seamless integration of prophages in the chromosome shapes the evolution of the bacterium and the phage. First, integration sites are highly conserved for many millions of years favoring lysogeny over the lytic cycle for temperate phages. Second, the global distribution of prophages is intimately associated with the chromosome structure and the patterns of gene expression. Third, the phage endures selection for DNA motifs that pertain exclusively to the biology of the prophage in the bacterial chromosome. Understanding prophage genetic adaptation sheds new lights on the coexistence of horizontal transfer and organized bacterial genomes.
Collapse
Affiliation(s)
- Louis-Marie Bobay
- Microbial Evolutionary Genomics Group, Institut Pasteur, Paris, France.
| | | | | |
Collapse
|
20
|
Novoa EM, Pavon-Eternod M, Pan T, Ribas de Pouplana L. A role for tRNA modifications in genome structure and codon usage. Cell 2012; 149:202-13. [PMID: 22464330 DOI: 10.1016/j.cell.2012.01.050] [Citation(s) in RCA: 186] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2011] [Revised: 11/23/2011] [Accepted: 01/12/2012] [Indexed: 11/17/2022]
Abstract
Transfer RNA (tRNA) gene content is a differentiating feature of genomes that contributes to the efficiency of the translational apparatus, but the principles shaping tRNA gene copy number and codon composition are poorly understood. Here, we report that the emergence of two specific tRNA modifications shaped the structure and composition of all extant genomes. Through the analysis of more than 500 genomes, we identify two kingdom-specific tRNA modifications as major contributors that separated archaeal, bacterial, and eukaryal genomes in terms of their tRNA gene composition. We show that, contrary to prior observations, genomic codon usage and tRNA gene frequencies correlate in all kingdoms if these two modifications are taken into account and that presence or absence of these modifications explains patterns of gene expression observed in previous studies. Finally, we experimentally demonstrate that human gene expression levels correlate well with genomic codon composition if these identified modifications are considered.
Collapse
Affiliation(s)
- Eva Maria Novoa
- Institute for Research in Biomedicine, c/ Baldiri Reixac 15-21, 08028 Barcelona, Catalonia, Spain
| | | | | | | |
Collapse
|
21
|
Raab JR, Chiu J, Zhu J, Katzman S, Kurukuti S, Wade PA, Haussler D, Kamakaka RT. Human tRNA genes function as chromatin insulators. EMBO J 2011; 31:330-50. [PMID: 22085927 DOI: 10.1038/emboj.2011.406] [Citation(s) in RCA: 102] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2011] [Accepted: 10/07/2011] [Indexed: 11/09/2022] Open
Abstract
Insulators help separate active chromatin domains from silenced ones. In yeast, gene promoters act as insulators to block the spread of Sir and HP1 mediated silencing while in metazoans most insulators are multipartite autonomous entities. tDNAs are repetitive sequences dispersed throughout the human genome and we now show that some of these tDNAs can function as insulators in human cells. Using computational methods, we identified putative human tDNA insulators. Using silencer blocking, transgene protection and repressor blocking assays we show that some of these tDNA-containing fragments can function as barrier insulators in human cells. We find that these elements also have the ability to block enhancers from activating RNA pol II transcribed promoters. Characterization of a putative tDNA insulator in human cells reveals that the site possesses chromatin signatures similar to those observed at other better-characterized eukaryotic insulators. Enhanced 4C analysis demonstrates that the tDNA insulator makes long-range chromatin contacts with other tDNAs and ETC sites but not with intervening or flanking RNA pol II transcribed genes.
Collapse
Affiliation(s)
- Jesse R Raab
- Department of MCD Biology, University of California, Santa Cruz, CA, USA
| | | | | | | | | | | | | | | |
Collapse
|
22
|
Retchless AC, Lawrence JG. Quantification of codon selection for comparative bacterial genomics. BMC Genomics 2011; 12:374. [PMID: 21787402 PMCID: PMC3162537 DOI: 10.1186/1471-2164-12-374] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2011] [Accepted: 07/25/2011] [Indexed: 11/16/2022] Open
Abstract
Background Statistics measuring codon selection seek to compare genes by their sensitivity to selection for translational efficiency, but existing statistics lack a model for testing the significance of differences between genes. Here, we introduce a new statistic for measuring codon selection, the Adaptive Codon Enrichment (ACE). Results This statistic represents codon usage bias in terms of a probabilistic distribution, quantifying the extent that preferred codons are over-represented in the gene of interest relative to the mean and variance that would result from stochastic sampling of codons. Expected codon frequencies are derived from the observed codon usage frequencies of a broad set of genes, such that they are likely to reflect nonselective, genome wide influences on codon usage (e.g. mutational biases). The relative adaptiveness of synonymous codons is deduced from the frequency of codon usage in a pre-selected set of genes relative to the expected frequency. The ACE can predict both transcript abundance during rapid growth and the rate of synonymous substitutions, with accuracy comparable to or greater than existing metrics. We further examine how the composition of reference gene sets affects the accuracy of the statistic, and suggest methods for selecting appropriate reference sets for any genome, including bacteriophages. Finally, we demonstrate that the ACE may naturally be extended to quantify the genome-wide influence of codon selection in a manner that is sensitive to a large fraction of codons in the genome. This reveals substantial variation among genomes, correlated with the tRNA gene number, even among groups of bacteria where previously proposed whole-genome measures show little variation. Conclusions The statistical framework of the ACE allows rigorous comparison of the level of codon selection acting on genes, both within a genome and between genomes.
Collapse
Affiliation(s)
- Adam C Retchless
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | | |
Collapse
|
23
|
Abstract
The structure and function of transfer RNA (tRNA) genes have been extensively studied for several decades, yet the general mechanisms controlling tRNA gene family evolution remain unclear, primarily because previous phylogenetics-based methods fail to distinguish between paralogs and orthologs that are highly similar in sequence. We have developed a system for identifying orthologs of tRNAs using flanking sequences to identify regions of conserved synteny and used it to annotate sets of orthologous tRNA genes across the 12 sequenced species of Drosophila. These data have allowed us to place the gains and losses of individual tRNA genes on each branch of the Drosophila tree and estimate rates of tRNA gene turnover. Our results show extensive rearrangement of the Drosophila tRNA gene complement over the last 60 My. We estimate a combined average rate of 2.18 ± 0.10 tRNA gene gains and losses per million years across the Drosophila lineage. We have identified 192 tRNAs that are ancestral to the genus, of which 157 are “core” tRNAs conserved in at least 11 of 12 extant species. We provide evidence that the core set of tRNA genes encode a nearly complete set of anticodons and have different properties from other “peripheral” tRNA genes, such as preferential location outside large tRNA clusters and higher sequence conservation. We also demonstrate that tRNA isoacceptor and alloacceptor changes by anticodon shifts have occurred several times in Drosophila, annotating 16 such events in functional tRNAs during the evolution of the genus.
Collapse
Affiliation(s)
- Hubert H Rogers
- Faculty of Life Sciences, University of Manchester, Manchester, United Kingdom
| | | | | |
Collapse
|
24
|
Bermudez-Santana C, Attolini CSO, Kirsten T, Engelhardt J, Prohaska SJ, Steigele S, Stadler PF. Genomic organization of eukaryotic tRNAs. BMC Genomics 2010; 11:270. [PMID: 20426822 PMCID: PMC2888827 DOI: 10.1186/1471-2164-11-270] [Citation(s) in RCA: 62] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2010] [Accepted: 04/28/2010] [Indexed: 01/20/2023] Open
Abstract
BACKGROUND Surprisingly little is known about the organization and distribution of tRNA genes and tRNA-related sequences on a genome-wide scale. While tRNA gene complements are usually reported in passing as part of genome annotation efforts, and peculiar features such as the tandem arrangements of tRNA gene in Entamoeba histolytica have been described in some detail, systematic comparative studies are rare and mostly restricted to bacteria. We therefore set out to survey the genomic arrangement of tRNA genes and pseudogenes in a wide range of eukaryotes to identify common patterns and taxon-specific peculiarities. RESULTS In line with previous reports, we find that tRNA complements evolve rapidly and tRNA gene and pseudogene locations are subject to rapid turnover. At phylum level, the distributions of the number of tRNA genes and pseudogenes numbers are very broad, with standard deviations on the order of the mean. Even among closely related species we observe dramatic changes in local organization. For instance, 65% and 87% of the tRNA genes and pseudogenes are located in genomic clusters in zebrafish and stickleback, resp., while such arrangements are relatively rare in the other three sequenced teleost fish genomes. Among basal metazoa, Trichoplax adherens has hardly any duplicated tRNA gene, while the sea anemone Nematostella vectensis boasts more than 17000 tRNA genes and pseudogenes. Dramatic variations are observed even within the eutherian mammals. Higher primates, for instance, have 616 +/- 120 tRNA genes and pseudogenes of which 17% to 36% are arranged in clusters, while the genome of the bushbaby Otolemur garnetti has 45225 tRNA genes and pseudogenes of which only 5.6% appear in clusters. In contrast, the distribution is surprisingly uniform across plant genomes. Consistent with this variability, syntenic conservation of tRNA genes and pseudogenes is also poor in general, with turn-over rates comparable to those of unconstrained sequence elements. Despite this large variation in abundance in Eukarya we observe a significant correlation between the number of tRNA genes, tRNA pseudogenes, and genome size. CONCLUSIONS The genomic organization of tRNA genes and pseudogenes shows complex lineage-specific patterns characterized by an extensive variability that is in striking contrast to the extreme levels of sequence-conservation of the tRNAs themselves. The comprehensive analysis of the genomic organization of tRNA genes and pseudogenes in Eukarya provides a basis for further studies into the interplay of tRNA gene arrangements and genome organization in general.
Collapse
Affiliation(s)
- Clara Bermudez-Santana
- Bioinformatics Group, Department of Computer Science and Interdisciplinary Center for Bioinformatics, University of Leipzig, Härtelstraße 16-18, D-04107, Leipzig, Germany
- Department of Biology, Universidad Nacional de Colombia. Carrera45 # 26-85 - Edificio Uriel Gutiérrez, Bogotá D.C., Colombia
| | - Camille Stephan-Otto Attolini
- Bioinformatics Group, Department of Computer Science and Interdisciplinary Center for Bioinformatics, University of Leipzig, Härtelstraße 16-18, D-04107, Leipzig, Germany
- Biostatistics and Bioinformatics unit, Institute for Research in Biomedicine (IRB Barcelona), Barcelona, Spain
| | - Toralf Kirsten
- Bioinformatics Group, Department of Computer Science and Interdisciplinary Center for Bioinformatics, University of Leipzig, Härtelstraße 16-18, D-04107, Leipzig, Germany
| | - Jan Engelhardt
- Bioinformatics Group, Department of Computer Science and Interdisciplinary Center for Bioinformatics, University of Leipzig, Härtelstraße 16-18, D-04107, Leipzig, Germany
| | - Sonja J Prohaska
- Bioinformatics Group, Department of Computer Science and Interdisciplinary Center for Bioinformatics, University of Leipzig, Härtelstraße 16-18, D-04107, Leipzig, Germany
| | | | - Peter F Stadler
- Bioinformatics Group, Department of Computer Science and Interdisciplinary Center for Bioinformatics, University of Leipzig, Härtelstraße 16-18, D-04107, Leipzig, Germany
- Max Planck Institute for Mathematics in the Sciences, Inselstraß 22 D-04103 Leipzig, Germany
- Fraunhofer Institute for Cell Therapy and Immunology, Perlickstraße 1, D-04103 Leipzig, Germany
- Santa Fe Institute, 1399 Hyde Park Rd, Santa Fe, NM 87501, USA
- Institute for Theoretical Chemistry, University of Vienna, Währingerstraße 17, A-1090 Wien, Austria
| |
Collapse
|
25
|
Ponnala L. On finding poorly translated codons based on their usage frequency. Bioinformation 2009; 4:63-5. [PMID: 20198170 PMCID: PMC2823382 DOI: 10.6026/97320630004063] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2009] [Accepted: 07/18/2009] [Indexed: 11/23/2022] Open
Abstract
Long stretches of “rare” codons are known to severely inhibit the efficiency of translation. Understanding the distribution of such rare codons is
of critical importance in improving the efficiency of heterologous gene expression systems. Accurate estimates of codon usage take the
abundance of each protein into consideration. In this paper, we analyze the correlation between approximate measures of codon usage and the
availability of tRNA at various growth rates in E coli. We show that the computationally derived estimates of tRNA isoacceptor concentration
enable the finding of poorly translated codons.
Collapse
Affiliation(s)
- Lalit Ponnala
- Computational Biology Service Unit, Cornell University, Ithaca, NY 14853, USA.
| |
Collapse
|
26
|
Abstract
Chromatin insulators separate active from repressed chromatin domains. In yeast the RNA pol III transcription machinery bound to tRNA genes function with histone acetylases and chromatin remodelers to restrict the spread of heterochromatin. Our results collectively demonstrate that binding of TFIIIC is necessary for insulation but binding of TFIIIB along with TFIIIC likely improves the probability of complex formation at an insulator. Insulation by this transcription factor occurs in the absence of RNA polymerase III or polymerase II but requires specific histone acetylases and chromatin remodelers. This analysis identifies a minimal set of factors required for insulation.
Collapse
|
27
|
Merkl R, Wiezer A. GO4genome: a prokaryotic phylogeny based on genome organization. J Mol Evol 2009; 68:550-62. [PMID: 19436929 PMCID: PMC3085772 DOI: 10.1007/s00239-009-9233-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2008] [Revised: 03/10/2009] [Accepted: 04/03/2009] [Indexed: 11/24/2022]
Abstract
Determining the phylogeny of closely related prokaryotes may fail in an analysis of rRNA or a small set of sequences. Whole-genome phylogeny utilizes the maximally available sample space. For a precise determination of genome similarity, two aspects have to be considered when developing an algorithm of whole-genome phylogeny: (1) gene order conservation is a more precise signal than gene content; and (2) when using sequence similarity, failures in identifying orthologues or the in situ replacement of genes via horizontal gene transfer may give misleading results. GO4genome is a new paradigm, which is based on a detailed analysis of gene function and the location of the respective genes. For characterization of genes, the algorithm uses gene ontology enabling a comparison of function independent of evolutionary relationship. After the identification of locally optimal series of gene functions, their length distribution is utilized to compute a phylogenetic distance. The outcome is a classification of genomes based on metabolic capabilities and their organization. Thus, the impact of effects on genome organization that are not covered by methods of molecular phylogeny can be studied. Genomes of strains belonging to Escherichia coli, Shigella, Streptococcus, Methanosarcina, and Yersinia were analyzed. Differences from the findings of classical methods are discussed.
Collapse
Affiliation(s)
- Rainer Merkl
- Institut für Biophysik und Physikalische Biochemie, Universität Regensburg, 93040, Regensburg, Germany.
| | | |
Collapse
|
28
|
Tang DTP, Glazov EA, McWilliam SM, Barris WC, Dalrymple BP. Analysis of the complement and molecular evolution of tRNA genes in cow. BMC Genomics 2009; 10:188. [PMID: 19393063 PMCID: PMC2680898 DOI: 10.1186/1471-2164-10-188] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2008] [Accepted: 04/24/2009] [Indexed: 12/21/2022] Open
Abstract
Background Detailed information regarding the number and organization of transfer RNA (tRNA) genes at the genome level is becoming readily available with the increase of DNA sequencing of whole genomes. However the identification of functional tRNA genes is challenging for species that have large numbers of repetitive elements containing tRNA derived sequences, such as Bos taurus. Reliable identification and annotation of entire sets of tRNA genes allows the evolution of tRNA genes to be understood on a genomic scale. Results In this study, we explored the B. taurus genome using bioinformatics and comparative genomics approaches to catalogue and analyze cow tRNA genes. The initial analysis of the cow genome using tRNAscan-SE identified 31,868 putative tRNA genes and 189,183 pseudogenes, where 28,830 of the 31,868 predicted tRNA genes were classified as repetitive elements by the RepeatMasker program. We then used comparative genomics to further discriminate between functional tRNA genes and tRNA-derived sequences for the remaining set of 3,038 putative tRNA genes. For our analysis, we used the human, chimpanzee, mouse, rat, horse, dog, chicken and fugu genomes to predict that the number of active tRNA genes in cow lies in the vicinity of 439. Of this set, 150 tRNA genes were 100% identical in their sequences across all nine vertebrate genomes studied. Using clustering analyses, we identified a new tRNA-GlyCCC subfamily present in all analyzed mammalian genomes. We suggest that this subfamily originated from an ancestral tRNA-GlyGCC gene via a point mutation prior to the radiation of the mammalian lineages. Lastly, in a separate analysis we created phylogenetic profiles for each putative cow tRNA gene using a representative set of genomes to gain an overview of common evolutionary histories of tRNA genes. Conclusion The use of a combination of bioinformatics and comparative genomics approaches has allowed the confident identification of a set of cow tRNA genes that will facilitate further studies in understanding the molecular evolution of cow tRNA genes.
Collapse
Affiliation(s)
- Dave T P Tang
- CSIRO Livestock Industries, Queensland Biosciences Precinct, St Lucia, QLD, Australia.
| | | | | | | | | |
Collapse
|
29
|
Chen H, Xu L, Gu Z. Regulation dynamics of WGD genes during yeast metabolic oscillation. Mol Biol Evol 2008; 25:2513-6. [PMID: 18815125 DOI: 10.1093/molbev/msn212] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
Saccharomyces cerevisiae and its close relatives are characterized by their propensity to ferment even in the presence of oxygen. It was hypothesized that whole-genome duplication (WGD) led to the development of this efficient fermentative lifestyle (WGD-fermentation hypothesis, Piskur 2001. In this study, we found that a significantly higher proportion of WGD genes than non-WGD genes are dynamically regulated during metabolic oscillation in response to oxygen change. The same data set also shows that the WGD genes, as compared with the smaller scale duplicate genes, are enriched with pairs where both copies have cyclic expression during the metabolic oscillation (either with the same or different phases). These results provide new evidences for the WGD-fermentation hypothesis and new insights into the relationship between the genome duplication and the evolution of new lifestyles in eukaryotic organisms.
Collapse
|
30
|
Higgs PG, Ran W. Coevolution of Codon Usage and tRNA Genes Leads to Alternative Stable States of Biased Codon Usage. Mol Biol Evol 2008; 25:2279-91. [DOI: 10.1093/molbev/msn173] [Citation(s) in RCA: 114] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
|
31
|
Hughes T, Ekman D, Ardawatia H, Elofsson A, Liberles DA. Evaluating dosage compensation as a cause of duplicate gene retention in Paramecium tetraurelia. Genome Biol 2007; 8:213. [PMID: 17521457 PMCID: PMC1929130 DOI: 10.1186/gb-2007-8-5-213] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Can dosage compensation completely explain gene retention after whole-genome duplication? The high retention of duplicate genes in the genome of Paramecium tetraurelia has led to the hypothesis that most of the retained genes have persisted because of constraints due to gene dosage. This and other possible mechanisms are discussed in the light of expectations from population genetics and systems biology.
Collapse
Affiliation(s)
- Timothy Hughes
- Computational Biology Unit, Bergen Center for Computational Science, University of Bergen, 5020 Bergen, Norway
| | - Diana Ekman
- Department of Biochemistry and Biophysics, Stockholm University, 10691 Stockholm, Sweden
| | - Himanshu Ardawatia
- Computational Biology Unit, Bergen Center for Computational Science, University of Bergen, 5020 Bergen, Norway
- Department of Biochemistry and Biophysics, Stockholm University, 10691 Stockholm, Sweden
| | - Arne Elofsson
- Department of Biochemistry and Biophysics, Stockholm University, 10691 Stockholm, Sweden
| | - David A Liberles
- Department of Molecular Biology, University of Wyoming, Laramie, WY 82071, USA
| |
Collapse
|
32
|
Abstract
We compare the diversity of chromosomal-encoded transfer RNA (tRNA) genes from 11 eukaryotes as identified by tRNAScan-SE of their respective genomes. They include the budding and fission yeast, worm, fruit fly, fugu, chicken, dog, rat, mouse, chimp and human. The number of tRNA genes are between 170 and 570 and the number of tRNA isoacceptors range from 41 to 55. Unexpectedly, the number of tRNA genes having the same anticodon but different sequences elsewhere in the tRNA body (defined here as tRNA isodecoder genes) varies significantly (10-246). tRNA isodecoder genes allow up to 274 different tRNA species to be produced from 446 genes in humans, but only up to 51 from 275 genes in the budding yeast. The fraction of tRNA isodecoder genes among all tRNA genes increases across the phylogenetic spectrum. A large number of sequence differences in human tRNA isodecoder genes occurs in the internal promoter regions for RNA polymerase III. We also describe a systematic, ligation-based method to detect and quantify tRNA isodecoder molecules in human samples, and show differential expression of three tRNA isodecoders in six human tissues. The large number of tRNA isodecoder genes in eukaryotes suggests that tRNA function may be more diverse than previously appreciated.
Collapse
Affiliation(s)
| | - Tao Pan
- Department of Biochemistry and Molecular Biology929 East 57th street, Chicago, IL 60637, USA
- To whom correspondence should be addressed. Tel: +1 773 702 4179; Fax: +1 773 702 0439;
| |
Collapse
|