1
|
McGowan J, Kilias ES, Alacid E, Lipscombe J, Jenkins BH, Gharbi K, Kaithakottil GG, Macaulay IC, McTaggart S, Warring SD, Richards TA, Hall N, Swarbreck D. Identification of a non-canonical ciliate nuclear genetic code where UAA and UAG code for different amino acids. PLoS Genet 2023; 19:e1010913. [PMID: 37796765 PMCID: PMC10553269 DOI: 10.1371/journal.pgen.1010913] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Accepted: 08/10/2023] [Indexed: 10/07/2023] Open
Abstract
The genetic code is one of the most highly conserved features across life. Only a few lineages have deviated from the "universal" genetic code. Amongst the few variants of the genetic code reported to date, the codons UAA and UAG virtually always have the same translation, suggesting that their evolution is coupled. Here, we report the genome and transcriptome sequencing of a novel uncultured ciliate, belonging to the Oligohymenophorea class, where the translation of the UAA and UAG stop codons have changed to specify different amino acids. Genomic and transcriptomic analyses revealed that UAA has been reassigned to encode lysine, while UAG has been reassigned to encode glutamic acid. We identified multiple suppressor tRNA genes with anticodons complementary to the reassigned codons. We show that the retained UGA stop codon is enriched in the 3'UTR immediately downstream of the coding region of genes, suggesting that there is functional drive to maintain tandem stop codons. Using a phylogenomics approach, we reconstructed the ciliate phylogeny and mapped genetic code changes, highlighting the remarkable number of independent genetic code changes within the Ciliophora group of protists. According to our knowledge, this is the first report of a genetic code variant where UAA and UAG encode different amino acids.
Collapse
Affiliation(s)
- Jamie McGowan
- Earlham Institute, Norwich Research Park, Norwich, United Kingdom
| | | | - Elisabet Alacid
- Department of Biology, University of Oxford, Oxford, United Kingdom
| | - James Lipscombe
- Earlham Institute, Norwich Research Park, Norwich, United Kingdom
| | | | - Karim Gharbi
- Earlham Institute, Norwich Research Park, Norwich, United Kingdom
| | | | - Iain C. Macaulay
- Earlham Institute, Norwich Research Park, Norwich, United Kingdom
| | - Seanna McTaggart
- Earlham Institute, Norwich Research Park, Norwich, United Kingdom
| | - Sally D. Warring
- Earlham Institute, Norwich Research Park, Norwich, United Kingdom
| | | | - Neil Hall
- Earlham Institute, Norwich Research Park, Norwich, United Kingdom
- School of Biological Sciences, University of East Anglia, Norwich, United Kingdom
| | - David Swarbreck
- Earlham Institute, Norwich Research Park, Norwich, United Kingdom
| |
Collapse
|
2
|
Fonseca PLC, De-Paula RB, Araújo DS, Tomé LMR, Mendes-Pereira T, Rodrigues WFC, Del-Bem LE, Aguiar ERGR, Góes-Neto A. Global Characterization of Fungal Mitogenomes: New Insights on Genomic Diversity and Dynamism of Coding Genes and Accessory Elements. Front Microbiol 2021; 12:787283. [PMID: 34925295 PMCID: PMC8672057 DOI: 10.3389/fmicb.2021.787283] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Accepted: 11/11/2021] [Indexed: 01/13/2023] Open
Abstract
Fungi comprise a great diversity of species with distinct ecological functions and lifestyles. Similar to other eukaryotes, fungi rely on interactions with prokaryotes and one of the most important symbiotic events was the acquisition of mitochondria. Mitochondria are organelles found in eukaryotic cells whose main function is to generate energy through aerobic respiration. Mitogenomes (mtDNAs) are double-stranded circular or linear DNA from mitochondria that may contain core genes and accessory elements that can be replicated, transcribed, and independently translated from the nuclear genome. Despite their importance, investigative studies on the diversity of fungal mitogenomes are scarce. Herein, we have evaluated 788 curated fungal mitogenomes available at NCBI database to assess discrepancies and similarities among them and to better understand the mechanisms involved in fungal mtDNAs variability. From a total of 12 fungal phyla, four do not have any representative with available mitogenomes, which highlights the underrepresentation of some groups in the current available data. We selected representative and non-redundant mitogenomes based on the threshold of 90% similarity, eliminating 81 mtDNAs. Comparative analyses revealed considerable size variability of mtDNAs with a difference of up to 260 kb in length. Furthermore, variation in mitogenome length and genomic composition are generally related to the number and length of accessory elements (introns, HEGs, and uORFs). We identified an overall average of 8.0 (0–39) introns, 8.0 (0–100) HEGs, and 8.2 (0–102) uORFs per genome, with high variation among phyla. Even though the length of the core protein-coding genes is considerably conserved, approximately 36.3% of the mitogenomes evaluated have at least one of the 14 core coding genes absent. Also, our results revealed that there is not even a single gene shared among all mitogenomes. Other unusual genes in mitogenomes were also detected in many mitogenomes, such as dpo and rpo, and displayed diverse evolutionary histories. Altogether, the results presented in this study suggest that fungal mitogenomes are diverse, contain accessory elements and are absent of a conserved gene that can be used for the taxonomic classification of the Kingdom Fungi.
Collapse
Affiliation(s)
- Paula L C Fonseca
- Department of Genetics, Ecology and Evolution, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil.,Department of Biological Science (DCB), Center of Biotechnology and Genetics (CBG), Universidade Estadual de Santa Cruz (UESC), Ilhéus, Brazil
| | - Ruth B De-Paula
- Graduate School of Biomedical Sciences, Baylor College of Medicine, Houston, TX, United States
| | - Daniel S Araújo
- Program in Bioinformatics, Loyola University Chicago, Chicago, IL, United States
| | - Luiz Marcelo Ribeiro Tomé
- Molecular and Computational Biology of Fungi Laboratory, Department of Microbiology, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | - Thairine Mendes-Pereira
- Molecular and Computational Biology of Fungi Laboratory, Department of Microbiology, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | | | - Luiz-Eduardo Del-Bem
- Program of Bioinformatics, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil.,Department of Botany, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | - Eric R G R Aguiar
- Department of Biological Science (DCB), Center of Biotechnology and Genetics (CBG), Universidade Estadual de Santa Cruz (UESC), Ilhéus, Brazil
| | - Aristóteles Góes-Neto
- Molecular and Computational Biology of Fungi Laboratory, Department of Microbiology, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil.,Program of Bioinformatics, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| |
Collapse
|
3
|
Lui LM, Nielsen TN, Arkin AP. A method for achieving complete microbial genomes and improving bins from metagenomics data. PLoS Comput Biol 2021; 17:e1008972. [PMID: 33961626 PMCID: PMC8172020 DOI: 10.1371/journal.pcbi.1008972] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2020] [Revised: 06/02/2021] [Accepted: 04/16/2021] [Indexed: 11/19/2022] Open
Abstract
Metagenomics facilitates the study of the genetic information from uncultured microbes and complex microbial communities. Assembling complete genomes from metagenomics data is difficult because most samples have high organismal complexity and strain diversity. Some studies have attempted to extract complete bacterial, archaeal, and viral genomes and often focus on species with circular genomes so they can help confirm completeness with circularity. However, less than 100 circularized bacterial and archaeal genomes have been assembled and published from metagenomics data despite the thousands of datasets that are available. Circularized genomes are important for (1) building a reference collection as scaffolds for future assemblies, (2) providing complete gene content of a genome, (3) confirming little or no contamination of a genome, (4) studying the genomic context and synteny of genes, and (5) linking protein coding genes to ribosomal RNA genes to aid metabolic inference in 16S rRNA gene sequencing studies. We developed a semi-automated method called Jorg to help circularize small bacterial, archaeal, and viral genomes using iterative assembly, binning, and read mapping. In addition, this method exposes potential misassemblies from k-mer based assemblies. We chose species of the Candidate Phyla Radiation (CPR) to focus our initial efforts because they have small genomes and are only known to have one ribosomal RNA operon. In addition to 34 circular CPR genomes, we present one circular Margulisbacteria genome, one circular Chloroflexi genome, and two circular megaphage genomes from 19 public and published datasets. We demonstrate findings that would likely be difficult without circularizing genomes, including that ribosomal genes are likely not operonic in the majority of CPR, and that some CPR harbor diverged forms of RNase P RNA. Code and a tutorial for this method is available at https://github.com/lmlui/Jorg and is available on the DOE Systems Biology KnowledgeBase as a beta app.
Collapse
Affiliation(s)
- Lauren M. Lui
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States of America
| | - Torben N. Nielsen
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States of America
| | - Adam P. Arkin
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States of America
- Department of Bioengineering, University of California, Berkeley, California, United States of America
- Innovative Genomics Institute, Berkeley, CA, United States of America
| |
Collapse
|
4
|
A search for the physical basis of the genetic code. Biosystems 2020; 195:104148. [DOI: 10.1016/j.biosystems.2020.104148] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2020] [Revised: 04/09/2020] [Accepted: 04/09/2020] [Indexed: 01/01/2023]
|
5
|
Evolutionary Diversity in the Intracellular Microsporidian Parasite Nosema sp. Infecting Wild Silkworm Revealed by IGS Nucleotide Sequence Diversity. J Mol Evol 2020; 88:345-360. [PMID: 32166385 DOI: 10.1007/s00239-020-09936-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2019] [Accepted: 02/27/2020] [Indexed: 10/24/2022]
Abstract
Intracellular microsporidian Nosema mylitta infects Indian wild silkworm Antheraea mylitta causing pebrine disease. Genetic structure and phylogeny of N. mylitta are analysed using nucleotide variability in 5S ribosomal DNA and intergenic spacer (IGS) sequence from 20 isolates collected from Southern, Northern and Central regions of Jharkhand State. Nucleotide diversity (π) and genetic differentiation Gst were highest in the Central isolates whereas lowest in the North. Among the isolates, absence of nucleotides, transitions and transversions were observed. Haplotyping showed nucleotide variability at 83 positions in IGS and 13 positions in 5S rDNA. Haplotype-based genetic differentiation was 0.96 to 0.97 whereas nucleotide sequence-based genetic differentiation was higher (Ks = 22.29) between Southern and Central isolates. Bottleneck analysis showed negative value for Tajima's D and other summary statistics revealing induction of loss of rare alleles and population explosion. From IGS, 17 ancestral sequences were inferred by Network algorithm. Core of nine closely related nodes having ancient nucleotides and peripheral nodes with highly divergent nucleotides were derived. Most diverged peripheral haplotype was Bero (H11) from the Central region whereas Deoghar (H3) of the Northern region diverged early. Phylogeny of N. mylitta grouped Southern and Northern isolates together revealed weak phylogenetic signal for these locations. Phylogeny of N. mylitta with Nosema sp. infecting other lepidopterans clustered N. mylitta isolates with N. antheraea and N. philosamiae of China indicating genetic similarity whereas other species were dissimilar showing diversity irrespective of country of origin.
Collapse
|
6
|
Iost I, Chabas S, Darfeuille F. Maturation of atypical ribosomal RNA precursors in Helicobacter pylori. Nucleic Acids Res 2019; 47:5906-5921. [PMID: 31006803 PMCID: PMC6582327 DOI: 10.1093/nar/gkz258] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2019] [Revised: 03/28/2019] [Accepted: 04/18/2019] [Indexed: 01/01/2023] Open
Abstract
In most bacteria, ribosomal RNA is transcribed as a single polycistronic precursor that is first processed by RNase III. This double-stranded specific RNase cleaves two large stems flanking the 23S and 16S rRNA mature sequences, liberating three 16S, 23S and 5S rRNA precursors, which are further processed by other ribonucleases. Here, we investigate the rRNA maturation pathway of the human gastric pathogen Helicobacter pylori. This bacterium has an unusual arrangement of its rRNA genes, the 16S rRNA gene being separated from a 23S-5S rRNA cluster. We show that RNase III also initiates processing in this organism, by cleaving two typical stem structures encompassing 16S and 23S rRNAs and an atypical stem–loop located upstream of the 5S rRNA. Deletion of RNase III leads to the accumulation of a large 23S-5S precursor that is found in polysomes, suggesting that it can function in translation. Finally, we characterize a cis-encoded antisense RNA overlapping the leader of the 23S-5S rRNA precursor. We present evidence that this antisense RNA interacts with this precursor, forming an intermolecular complex that is cleaved by RNase III. This pairing induces additional specific cleavages of the rRNA precursor coupled with a rapid degradation of the antisense RNA.
Collapse
Affiliation(s)
- Isabelle Iost
- ARNA Laboratory, Inserm U1212, CNRS UMR 5320, Université de Bordeaux, France
| | - Sandrine Chabas
- ARNA Laboratory, Inserm U1212, CNRS UMR 5320, Université de Bordeaux, France
| | - Fabien Darfeuille
- ARNA Laboratory, Inserm U1212, CNRS UMR 5320, Université de Bordeaux, France
| |
Collapse
|
7
|
Noutahi E, Calderon V, Blanchette M, El-Mabrouk N, Lang BF. Rapid Genetic Code Evolution in Green Algal Mitochondrial Genomes. Mol Biol Evol 2019; 36:766-783. [PMID: 30698742 PMCID: PMC6551751 DOI: 10.1093/molbev/msz016] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
Genetic code deviations involving stop codons have been previously reported in mitochondrial genomes of several green plants (Viridiplantae), most notably chlorophyte algae (Chlorophyta). However, as changes in codon recognition from one amino acid to another are more difficult to infer, such changes might have gone unnoticed in particular lineages with high evolutionary rates that are otherwise prone to codon reassignments. To gain further insight into the evolution of the mitochondrial genetic code in green plants, we have conducted an in-depth study across mtDNAs from 51 green plants (32 chlorophytes and 19 streptophytes). Besides confirming known stop-to-sense reassignments, our study documents the first cases of sense-to-sense codon reassignments in Chlorophyta mtDNAs. In several Sphaeropleales, we report the decoding of AGG codons (normally arginine) as alanine, by tRNA(CCU) of various origins that carry the recognition signature for alanine tRNA synthetase. In Chromochloris, we identify tRNA variants decoding AGG as methionine and the synonymous codon CGG as leucine. Finally, we find strong evidence supporting the decoding of AUA codons (normally isoleucine) as methionine in Pycnococcus. Our results rely on a recently developed conceptual framework (CoreTracker) that predicts codon reassignments based on the disparity between DNA sequence (codons) and the derived protein sequence. These predictions are then validated by an evaluation of tRNA phylogeny, to identify the evolution of new tRNAs via gene duplication and loss, and structural modifications that lead to the assignment of new tRNA identities and a change in the genetic code.
Collapse
Affiliation(s)
- Emmanuel Noutahi
- Département d'Informatique et de Recherche opérationnelle (DIRO), Université de Montréal, CP 6128 succursale Centre-Ville, Montreal, QC, Canada
| | - Virginie Calderon
- Institut de Recherches Cliniques de Montréal, Montreal, Quebec, Canada
| | - Mathieu Blanchette
- School of Computer Science, McGill University, McConnell Engineering Bldg., Montréal, QC H3A 0E9, Canada
- McGill Centre for Bioinformatics, McGill University, Montréal, QC, Canada
| | - Nadia El-Mabrouk
- Département d'Informatique et de Recherche opérationnelle (DIRO), Université de Montréal, CP 6128 succursale Centre-Ville, Montreal, QC, Canada
| | - Bernd Franz Lang
- Département de Biochimie, Centre Robert Cedergren, Université de Montréal, CP 6128 succursale Centre-Ville, Montreal, QC, Canada
| |
Collapse
|
8
|
Many alternative and theoretical genetic codes are more robust to amino acid replacements than the standard genetic code. J Theor Biol 2019; 464:21-32. [DOI: 10.1016/j.jtbi.2018.12.030] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2018] [Revised: 12/17/2018] [Accepted: 12/19/2018] [Indexed: 02/07/2023]
|
9
|
Błażej P, Wnętrzak M, Mackiewicz D, Mackiewicz P. Optimization of the standard genetic code according to three codon positions using an evolutionary algorithm. PLoS One 2018; 13:e0201715. [PMID: 30092017 PMCID: PMC6084934 DOI: 10.1371/journal.pone.0201715] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2018] [Accepted: 07/21/2018] [Indexed: 12/28/2022] Open
Abstract
Many biological systems are typically examined from the point of view of adaptation to certain conditions or requirements. One such system is the standard genetic code (SGC), which generally minimizes the cost of amino acid replacements resulting from mutations or mistranslations. However, no full consensus has been reached on the factors that caused the evolution of this feature. One of the hypotheses suggests that code optimality was directly selected as an advantage to preserve information about encoded proteins. An important feature that should be considered when studying the SGC is the different roles of the three codon positions. Therefore, we investigated the robustness of this code regarding the cost of amino acid replacements resulting from substitutions in these positions separately and the sum of these costs. We applied a modified evolutionary algorithm and included four models of the genetic code assuming various restrictions on its structure. The SGC was compared both with the codes that minimize the objective function and those that maximize it. This approach allowed us to place the SGC in the global space of possible codes, which is a more appropriate and unbiased comparison than that with randomly generated codes because they are characterized by relatively uniform amino acid assignments to codons. The SGC appeared to be well optimized at the global scale, but its individual positions were not fully optimized because there were codes that were optimized for only one codon position and simultaneously outperformed the SGC at the other positions. We also found that different code structures may lead to the same optimality and that random codes can show a tendency to minimize costs under some of the genetic code models. Our results suggest that the optimality of SGC could be a by-product of other processes.
Collapse
Affiliation(s)
- Paweł Błażej
- Department of Genomics, Faculty of Biotechnology, University of Wrocław, Wrocław, Poland
| | - Małgorzata Wnętrzak
- Department of Genomics, Faculty of Biotechnology, University of Wrocław, Wrocław, Poland
| | - Dorota Mackiewicz
- Department of Genomics, Faculty of Biotechnology, University of Wrocław, Wrocław, Poland
| | - Paweł Mackiewicz
- Department of Genomics, Faculty of Biotechnology, University of Wrocław, Wrocław, Poland
- * E-mail:
| |
Collapse
|
10
|
Anacker ML, Drecktrah D, LeCoultre RD, Lybecker M, Samuels DS. RNase III Processing of rRNA in the Lyme Disease Spirochete Borrelia burgdorferi. J Bacteriol 2018; 200:e00035-18. [PMID: 29632096 PMCID: PMC5996687 DOI: 10.1128/jb.00035-18] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2018] [Accepted: 04/04/2018] [Indexed: 02/08/2023] Open
Abstract
The rRNA genes of Borrelia (Borreliella) burgdorferi are unusually organized; the spirochete has a single 16S rRNA gene that is more than 3 kb from a tandem pair of 23S-5S rRNA operons. We generated an rnc null mutant in B. burgdorferi that exhibits a pleiotropic phenotype, including decreased growth rate and increased cell length. Here, we demonstrate that endoribonuclease III (RNase III) is, as expected, involved in processing the 23S rRNA in B. burgdorferi The 5' and 3' ends of the three rRNAs were determined in the wild type and rncBb mutants; the results suggest that RNase III in B. burgdorferi is required for the full maturation of the 23S rRNA but not for the 5S rRNA nor, curiously, for the 16S rRNA.IMPORTANCE Lyme disease, the most common tick-borne zoonosis in the Northern Hemisphere, is caused by the bacterium Borrelia (Borreliella) burgdorferi, a member of the deeply branching spirochete phylum. B. burgdorferi carries a limited suite of ribonucleases, enzymes that cleave RNA during processing and degradation. Several ribonucleases, including RNase III, are involved in the production of ribosomes, which catalyze translation and are a major target of antibiotics. This is the first study to dissect the role of an RNase in any spirochete. We demonstrate that an RNase III mutant is viable but has altered processing of rRNA.
Collapse
MESH Headings
- Bacterial Proteins/genetics
- Bacterial Proteins/metabolism
- Borrelia burgdorferi/enzymology
- Borrelia burgdorferi/genetics
- Borrelia burgdorferi/metabolism
- Humans
- Lyme Disease/microbiology
- Operon
- RNA, Bacterial/genetics
- RNA, Bacterial/metabolism
- RNA, Ribosomal, 16S/genetics
- RNA, Ribosomal, 16S/metabolism
- RNA, Ribosomal, 23S/genetics
- RNA, Ribosomal, 23S/metabolism
- RNA, Ribosomal, 5S/genetics
- RNA, Ribosomal, 5S/metabolism
- Ribonuclease III/genetics
- Ribonuclease III/metabolism
Collapse
Affiliation(s)
- Melissa L Anacker
- Division of Biological Sciences, University of Montana, Missoula, Montana, USA
| | - Dan Drecktrah
- Division of Biological Sciences, University of Montana, Missoula, Montana, USA
| | - Richard D LeCoultre
- Division of Biological Sciences, University of Montana, Missoula, Montana, USA
| | - Meghan Lybecker
- Division of Biological Sciences, University of Montana, Missoula, Montana, USA
- Department of Biology, University of Colorado, Colorado Springs, Colorado, USA
| | - D Scott Samuels
- Division of Biological Sciences, University of Montana, Missoula, Montana, USA
- Center for Biomolecular Structure and Dynamics, University of Montana, Missoula, Montana, USA
| |
Collapse
|
11
|
Lin X, Yu ACS, Chan TF. Efforts and Challenges in Engineering the Genetic Code. Life (Basel) 2017; 7:life7010012. [PMID: 28335420 PMCID: PMC5370412 DOI: 10.3390/life7010012] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2017] [Revised: 03/09/2017] [Accepted: 03/10/2017] [Indexed: 12/15/2022] Open
Abstract
This year marks the 48th anniversary of Francis Crick’s seminal work on the origin of the genetic code, in which he first proposed the “frozen accident” hypothesis to describe evolutionary selection against changes to the genetic code that cause devastating global proteome modification. However, numerous efforts have demonstrated the viability of both natural and artificial genetic code variations. Recent advances in genetic engineering allow the creation of synthetic organisms that incorporate noncanonical, or even unnatural, amino acids into the proteome. Currently, successful genetic code engineering is mainly achieved by creating orthogonal aminoacyl-tRNA/synthetase pairs to repurpose stop and rare codons or to induce quadruplet codons. In this review, we summarize the current progress in genetic code engineering and discuss the challenges, current understanding, and future perspectives regarding genetic code modification.
Collapse
Affiliation(s)
- Xiao Lin
- School of Life Sciences, The Chinese University of Hong Kong, Sha Tin, NT, Hong Kong, China.
| | - Allen Chi Shing Yu
- School of Life Sciences, The Chinese University of Hong Kong, Sha Tin, NT, Hong Kong, China.
| | - Ting Fung Chan
- School of Life Sciences, The Chinese University of Hong Kong, Sha Tin, NT, Hong Kong, China.
| |
Collapse
|
12
|
Smith DR, Keeling PJ. Protists and the Wild, Wild West of Gene Expression: New Frontiers, Lawlessness, and Misfits. Annu Rev Microbiol 2016; 70:161-78. [PMID: 27359218 DOI: 10.1146/annurev-micro-102215-095448] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The DNA double helix has been called one of life's most elegant structures, largely because of its universality, simplicity, and symmetry. The expression of information encoded within DNA, however, can be far from simple or symmetric and is sometimes surprisingly variable, convoluted, and wantonly inefficient. Although exceptions to the rules exist in certain model systems, the true extent to which life has stretched the limits of gene expression is made clear by nonmodel systems, particularly protists (microbial eukaryotes). The nuclear and organelle genomes of protists are subject to the most tangled forms of gene expression yet identified. The complicated and extravagant picture of the underlying genetics of eukaryotic microbial life changes how we think about the flow of genetic information and the evolutionary processes shaping it. Here, we discuss the origins, diversity, and growing interest in noncanonical protist gene expression and its relationship to genomic architecture.
Collapse
Affiliation(s)
- David Roy Smith
- Department of Biology, University of Western Ontario, London, Ontario, Canada N6A 5B7;
| | - Patrick J Keeling
- Canadian Institute for Advanced Research, Department of Botany, University of British Columbia, Vancouver, British Columbia, Canada V6T 1Z4;
| |
Collapse
|
13
|
Mühlhausen S, Findeisen P, Plessmann U, Urlaub H, Kollmar M. A novel nuclear genetic code alteration in yeasts and the evolution of codon reassignment in eukaryotes. Genome Res 2016; 26:945-55. [PMID: 27197221 PMCID: PMC4937558 DOI: 10.1101/gr.200931.115] [Citation(s) in RCA: 50] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2015] [Accepted: 04/28/2016] [Indexed: 01/12/2023]
Abstract
The genetic code is the cellular translation table for the conversion of nucleotide sequences into amino acid sequences. Changes to the meaning of sense codons would introduce errors into almost every translated message and are expected to be highly detrimental. However, reassignment of single or multiple codons in mitochondria and nuclear genomes, although extremely rare, demonstrates that the code can evolve. Several models for the mechanism of alteration of nuclear genetic codes have been proposed (including “codon capture,” “genome streamlining,” and “ambiguous intermediate” theories), but with little resolution. Here, we report a novel sense codon reassignment in Pachysolen tannophilus, a yeast related to the Pichiaceae. By generating proteomics data and using tRNA sequence comparisons, we show that Pachysolen translates CUG codons as alanine and not as the more usual leucine. The Pachysolen tRNACAG is an anticodon-mutated tRNAAla containing all major alanine tRNA recognition sites. The polyphyly of the CUG-decoding tRNAs in yeasts is best explained by a tRNA loss driven codon reassignment mechanism. Loss of the CUG-tRNA in the ancient yeast is followed by gradual decrease of respective codons and subsequent codon capture by tRNAs whose anticodon is not part of the aminoacyl-tRNA synthetase recognition region. Our hypothesis applies to all nuclear genetic code alterations and provides several testable predictions. We anticipate more codon reassignments to be uncovered in existing and upcoming genome projects.
Collapse
Affiliation(s)
- Stefanie Mühlhausen
- Group Systems Biology of Motor Proteins, Department of NMR-Based Structural Biology, Max-Planck-Institute for Biophysical Chemistry, 37077 Göttingen, Germany
| | - Peggy Findeisen
- Group Systems Biology of Motor Proteins, Department of NMR-Based Structural Biology, Max-Planck-Institute for Biophysical Chemistry, 37077 Göttingen, Germany
| | - Uwe Plessmann
- Bioanalytical Mass Spectrometry, Max-Planck-Institute for Biophysical Chemistry, 37077 Göttingen, Germany
| | - Henning Urlaub
- Bioanalytical Mass Spectrometry, Max-Planck-Institute for Biophysical Chemistry, 37077 Göttingen, Germany; Bioanalytics Group, Department of Clinical Chemistry, University Medical Center Göttingen, 37075 Göttingen, Germany
| | - Martin Kollmar
- Group Systems Biology of Motor Proteins, Department of NMR-Based Structural Biology, Max-Planck-Institute for Biophysical Chemistry, 37077 Göttingen, Germany
| |
Collapse
|
14
|
Bezerra AR, Guimarães AR, Santos MAS. Non-Standard Genetic Codes Define New Concepts for Protein Engineering. Life (Basel) 2015; 5:1610-28. [PMID: 26569314 PMCID: PMC4695839 DOI: 10.3390/life5041610] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2015] [Revised: 10/12/2015] [Accepted: 10/21/2015] [Indexed: 11/16/2022] Open
Abstract
The essential feature of the genetic code is the strict one-to-one correspondence between codons and amino acids. The canonical code consists of three stop codons and 61 sense codons that encode 20% of the amino acid repertoire observed in nature. It was originally designated as immutable and universal due to its conservation in most organisms, but sequencing of genes from the human mitochondrial genomes revealed deviations in codon assignments. Since then, alternative codes have been reported in both nuclear and mitochondrial genomes and genetic code engineering has become an important research field. Here, we review the most recent concepts arising from the study of natural non-standard genetic codes with special emphasis on codon re-assignment strategies that are relevant to engineering genetic code in the laboratory. Recent tools for synthetic biology and current attempts to engineer new codes for incorporation of non-standard amino acids are also reviewed in this article.
Collapse
Affiliation(s)
- Ana R Bezerra
- Health Sciences Department, Institute for Biomedicine-iBiMED, University of Aveiro, Campus de Santiago, Aveiro 3810-193, Portugal.
| | - Ana R Guimarães
- Health Sciences Department, Institute for Biomedicine-iBiMED, University of Aveiro, Campus de Santiago, Aveiro 3810-193, Portugal.
| | - Manuel A S Santos
- Health Sciences Department, Institute for Biomedicine-iBiMED, University of Aveiro, Campus de Santiago, Aveiro 3810-193, Portugal.
| |
Collapse
|
15
|
Acosta S, Carela M, Garcia-Gonzalez A, Gines M, Vicens L, Cruet R, Massey SE. DNA Repair Is Associated with Information Content in Bacteria, Archaea, and DNA Viruses. J Hered 2015; 106:644-59. [PMID: 26320243 DOI: 10.1093/jhered/esv055] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2015] [Accepted: 07/07/2015] [Indexed: 11/13/2022] Open
Abstract
The concept of a "proteomic constraint" proposes that DNA repair capacity is positively correlated with the information content of a genome, which can be approximated to the size of the proteome (P). This in turn implies that DNA repair genes are more likely to be present in genomes with larger values of P. This stands in contrast to the common assumption that informational genes have a core function and so are evenly distributed across organisms. We examined the presence/absence of 18 DNA repair genes in bacterial genomes. A positive relationship between gene presence and P was observed for 17 genes in the total dataset, and 16 genes when only nonintracellular bacteria were examined. A marked reduction of DNA repair genes was observed in intracellular bacteria, consistent with their reduced value of P. We also examined archaeal and DNA virus genomes, and show that the presence of DNA repair genes is likewise related to a larger value of P. In addition, the products of the bacterial genes mutY, vsr, and ndk, involved in the correction of GC/AT mutations, are strongly associated with reduced genome GC content. We therefore propose that a reduction in information content leads to a loss of DNA repair genes and indirectly to a reduction in genome GC content in bacteria by exposure to the underlying AT mutation bias. The reduction in P may also indirectly lead to the increase in substitution rates observed in intracellular bacteria via loss of DNA repair genes.
Collapse
Affiliation(s)
- Sharlene Acosta
- From the Department of Biology, University of Puerto Rico-Rio Piedras, PO Box 23360, San Juan 00931, Puerto Rico (Acosta, Carela, Garcia-Gonzalez, Gines, Vicens, Cruet, and Massey)
| | - Miguelina Carela
- From the Department of Biology, University of Puerto Rico-Rio Piedras, PO Box 23360, San Juan 00931, Puerto Rico (Acosta, Carela, Garcia-Gonzalez, Gines, Vicens, Cruet, and Massey)
| | - Aurian Garcia-Gonzalez
- From the Department of Biology, University of Puerto Rico-Rio Piedras, PO Box 23360, San Juan 00931, Puerto Rico (Acosta, Carela, Garcia-Gonzalez, Gines, Vicens, Cruet, and Massey)
| | - Mariela Gines
- From the Department of Biology, University of Puerto Rico-Rio Piedras, PO Box 23360, San Juan 00931, Puerto Rico (Acosta, Carela, Garcia-Gonzalez, Gines, Vicens, Cruet, and Massey)
| | - Luis Vicens
- From the Department of Biology, University of Puerto Rico-Rio Piedras, PO Box 23360, San Juan 00931, Puerto Rico (Acosta, Carela, Garcia-Gonzalez, Gines, Vicens, Cruet, and Massey)
| | - Ricardo Cruet
- From the Department of Biology, University of Puerto Rico-Rio Piedras, PO Box 23360, San Juan 00931, Puerto Rico (Acosta, Carela, Garcia-Gonzalez, Gines, Vicens, Cruet, and Massey)
| | - Steven E Massey
- From the Department of Biology, University of Puerto Rico-Rio Piedras, PO Box 23360, San Juan 00931, Puerto Rico (Acosta, Carela, Garcia-Gonzalez, Gines, Vicens, Cruet, and Massey).
| |
Collapse
|
16
|
Zhou JH, Ding YZ, He Y, Chu YF, Zhao P, Ma LY, Wang XJ, Li XR, Liu YS. The effect of multiple evolutionary selections on synonymous codon usage of genes in the Mycoplasma bovis genome. PLoS One 2014; 9:e108949. [PMID: 25350396 PMCID: PMC4211681 DOI: 10.1371/journal.pone.0108949] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2014] [Accepted: 08/26/2014] [Indexed: 11/19/2022] Open
Abstract
Mycoplasma bovis is a major pathogen causing arthritis, respiratory disease and mastitis in cattle. A better understanding of its genetic features and evolution might represent evidences of surviving host environments. In this study, multiple factors influencing synonymous codon usage patterns in M. bovis (three strains’ genomes) were analyzed. The overall nucleotide content of genes in the M. bovis genome is AT-rich. Although the G and C contents at the third codon position of genes in the leading strand differ from those in the lagging strand (p<0.05), the 59 synonymous codon usage patterns of genes in the leading strand are highly similar to those in the lagging strand. The over-represented codons and the under-represented codons were identified. A comparison of the synonymous codon usage pattern of M. bovis and cattle (susceptible host) indicated the independent formation of synonymous codon usage of M. bovis. Principal component analysis revealed that (i) strand-specific mutational bias fails to affect the synonymous codon usage pattern in the leading and lagging strands, (ii) mutation pressure from nucleotide content plays a role in shaping the overall codon usage, and (iii) the major trend of synonymous codon usage has a significant correlation with the gene expression level that is estimated by the codon adaptation index. The plot of the effective number of codons against the G+C content at the third codon position also reveals that mutation pressure undoubtedly contributes to the synonymous codon usage pattern of M. bovis. Additionally, the formation of the overall codon usage is determined by certain evolutionary selections for gene function classification (30S protein, 50S protein, transposase, membrane protein, and lipoprotein) and translation elongation region of genes in M. bovis. The information could be helpful in further investigations of evolutionary mechanisms of the Mycoplasma family and heterologous expression of its functionally important proteins.
Collapse
Affiliation(s)
- Jian-hua Zhou
- State Key Laboratory of Veterinary Etiological Biology, National Foot-and-Mouth Disease Reference Laboratory, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, Gansu, P.R. China
| | - Yao-zhong Ding
- State Key Laboratory of Veterinary Etiological Biology, National Foot-and-Mouth Disease Reference Laboratory, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, Gansu, P.R. China
| | - Ying He
- State Key Laboratory of Veterinary Etiological Biology, National Foot-and-Mouth Disease Reference Laboratory, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, Gansu, P.R. China
| | - Yue-feng Chu
- State Key Laboratory of Veterinary Etiological Biology, National Foot-and-Mouth Disease Reference Laboratory, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, Gansu, P.R. China
| | - Ping Zhao
- State Key Laboratory of Veterinary Etiological Biology, National Foot-and-Mouth Disease Reference Laboratory, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, Gansu, P.R. China
| | - Li-ya Ma
- State Key Laboratory of Veterinary Etiological Biology, National Foot-and-Mouth Disease Reference Laboratory, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, Gansu, P.R. China
| | - Xin-jun Wang
- State Key Laboratory of Veterinary Etiological Biology, National Foot-and-Mouth Disease Reference Laboratory, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, Gansu, P.R. China
| | - Xue-rui Li
- State Key Laboratory of Veterinary Etiological Biology, National Foot-and-Mouth Disease Reference Laboratory, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, Gansu, P.R. China
- * E-mail: (XRL); (YSL)
| | - Yong-sheng Liu
- State Key Laboratory of Veterinary Etiological Biology, National Foot-and-Mouth Disease Reference Laboratory, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, Gansu, P.R. China
- * E-mail: (XRL); (YSL)
| |
Collapse
|
17
|
McCutcheon JP, Moran NA. Extreme genome reduction in symbiotic bacteria. Nat Rev Microbiol 2011; 10:13-26. [PMID: 22064560 DOI: 10.1038/nrmicro2670] [Citation(s) in RCA: 933] [Impact Index Per Article: 71.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Since 2006, numerous cases of bacterial symbionts with extraordinarily small genomes have been reported. These organisms represent independent lineages from diverse bacterial groups. They have diminutive gene sets that rival some mitochondria and chloroplasts in terms of gene numbers and lack genes that are considered to be essential in other bacteria. These symbionts have numerous features in common, such as extraordinarily fast protein evolution and a high abundance of chaperones. Together, these features point to highly degenerate genomes that retain only the most essential functions, often including a considerable fraction of genes that serve the hosts. These discoveries have implications for the concept of minimal genomes, the origins of cellular organelles, and studies of symbiosis and host-associated microbiota.
Collapse
Affiliation(s)
- John P McCutcheon
- University of Montana, Division of Biological Sciences, 32 Campus Drive, HS104, Missoula, Montana 59812, USA.
| | | |
Collapse
|
18
|
Seaborg DM. Was Wright right? The canonical genetic code is an empirical example of an adaptive peak in nature; deviant genetic codes evolved using adaptive bridges. J Mol Evol 2010; 71:87-99. [PMID: 20711776 PMCID: PMC2924497 DOI: 10.1007/s00239-010-9373-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2010] [Accepted: 07/02/2010] [Indexed: 11/30/2022]
Abstract
The canonical genetic code is on a sub-optimal adaptive peak with respect to its ability to minimize errors, and is close to, but not quite, optimal. This is demonstrated by the near-total adjacency of synonymous codons, the similarity of adjacent codons, and comparisons of frequency of amino acid usage with number of codons in the code for each amino acid. As a rare empirical example of an adaptive peak in nature, it shows adaptive peaks are real, not merely theoretical. The evolution of deviant genetic codes illustrates how populations move from a lower to a higher adaptive peak. This is done by the use of "adaptive bridges," neutral pathways that cross over maladaptive valleys by virtue of masking of the phenotypic expression of some maladaptive aspects in the genotype. This appears to be the general mechanism by which populations travel from one adaptive peak to another. There are multiple routes a population can follow to cross from one adaptive peak to another. These routes vary in the probability that they will be used, and this probability is determined by the number and nature of the mutations that happen along each of the routes. A modification of the depiction of adaptive landscapes showing genetic distances and probabilities of travel along their multiple possible routes would throw light on this important concept.
Collapse
Affiliation(s)
- David M Seaborg
- Foundation for Biological Conservation and Research, 1888 Pomar Way, Walnut Creek, CA 94598-1424, USA.
| |
Collapse
|
19
|
Sammet SG, Bastolla U, Porto M. Comparison of translation loads for standard and alternative genetic codes. BMC Evol Biol 2010; 10:178. [PMID: 20546599 PMCID: PMC2909233 DOI: 10.1186/1471-2148-10-178] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2009] [Accepted: 06/14/2010] [Indexed: 11/25/2022] Open
Abstract
Background The (almost) universality of the genetic code is one of the most intriguing properties of cellular life. Nevertheless, several variants of the standard genetic code have been observed, which differ in one or several of 64 codon assignments and occur mainly in mitochondrial genomes and in nuclear genomes of some bacterial and eukaryotic parasites. These variants are usually considered to be the result of non-adaptive evolution. It has been shown that the standard genetic code is preferential to randomly assembled codes for its ability to reduce the effects of errors in protein translation. Results Using a genotype-to-phenotype mapping based on a quantitative model of protein folding, we compare the standard genetic code to seven of its naturally occurring variants with respect to the fitness loss associated to mistranslation and mutation. These fitness losses are computed through computer simulations of protein evolution with mutations that are either neutral or lethal, and different mutation biases, which influence the balance between unfolding and misfolding stability. We show that the alternative codes may produce significantly different mutation and translation loads, particularly for genomes evolving with a rather large mutation bias. Most of the alternative genetic codes are found to be disadvantageous to the standard code, in agreement with the view that the change of genetic code is a mutationally driven event. Nevertheless, one of the studied alternative genetic codes is predicted to be preferable to the standard code for a broad range of mutation biases. Conclusions Our results show that, with one exception, the standard genetic code is generally better able to reduce the translation load than the naturally occurring variants studied here. Besides this exception, some of the other alternative genetic codes are predicted to be better adapted for extreme mutation biases. Hence, the fixation of alternative genetic codes might be a neutral or nearly-neutral event in the majority of the cases, but adaptation cannot be excluded for some of the studied cases.
Collapse
Affiliation(s)
- Stefanie Gabriele Sammet
- Institut für Festkörperphysik, Technische Universität Darmstadt, Hochschulstr, 8, 64289 Darmstadt, Germany
| | | | | |
Collapse
|
20
|
Abstract
Phylogenomics reveals extreme gene loss in typhus group (TG) rickettsiae relative to the levels for other rickettsial lineages. We report here a curious protease-encoding gene (ppcE) that is conserved only in TG rickettsiae. As a possible determinant of host pathogenicity, ppcE warrants consideration in the development of therapeutics against epidemic and murine typhus.
Collapse
|
21
|
Koonin EV, Novozhilov AS. Origin and evolution of the genetic code: the universal enigma. IUBMB Life 2009; 61:99-111. [PMID: 19117371 DOI: 10.1002/iub.146] [Citation(s) in RCA: 199] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
The genetic code is nearly universal, and the arrangement of the codons in the standard codon table is highly nonrandom. The three main concepts on the origin and evolution of the code are the stereochemical theory, according to which codon assignments are dictated by physicochemical affinity between amino acids and the cognate codons (anticodons); the coevolution theory, which posits that the code structure coevolved with amino acid biosynthesis pathways; and the error minimization theory under which selection to minimize the adverse effect of point mutations and translation errors was the principal factor of the code's evolution. These theories are not mutually exclusive and are also compatible with the frozen accident hypothesis, that is, the notion that the standard code might have no special properties but was fixed simply because all extant life forms share a common ancestor, with subsequent changes to the code, mostly, precluded by the deleterious effect of codon reassignment. Mathematical analysis of the structure and possible evolutionary trajectories of the code shows that it is highly robust to translational misreading but there are numerous more robust codes, so the standard code potentially could evolve from a random code via a short sequence of codon series reassignments. Thus, much of the evolution that led to the standard code could be a combination of frozen accident with selection for error minimization although contributions from coevolution of the code with metabolic pathways and weak affinities between amino acids and nucleotide triplets cannot be ruled out. However, such scenarios for the code evolution are based on formal schemes whose relevance to the actual primordial evolution is uncertain. A real understanding of the code origin and evolution is likely to be attainable only in conjunction with a credible scenario for the evolution of the coding principle itself and the translation system.
Collapse
Affiliation(s)
- Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA.
| | | |
Collapse
|
22
|
|
23
|
Gillespie JJ, Ammerman NC, Dreher-Lesnick SM, Rahman MS, Worley MJ, Setubal JC, Sobral BS, Azad AF. An anomalous type IV secretion system in Rickettsia is evolutionarily conserved. PLoS One 2009; 4:e4833. [PMID: 19279686 PMCID: PMC2653234 DOI: 10.1371/journal.pone.0004833] [Citation(s) in RCA: 82] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2009] [Accepted: 01/28/2009] [Indexed: 01/06/2023] Open
Abstract
BACKGROUND Bacterial type IV secretion systems (T4SSs) comprise a diverse transporter family functioning in conjugation, competence, and effector molecule (DNA and/or protein) translocation. Thirteen genome sequences from Rickettsia, obligate intracellular symbionts/pathogens of a wide range of eukaryotes, have revealed a reduced T4SS relative to the Agrobacterium tumefaciens archetype (vir). However, the Rickettsia T4SS has not been functionally characterized for its role in symbiosis/virulence, and none of its substrates are known. RESULTS Superimposition of T4SS structural/functional information over previously identified Rickettsia components implicate a functional Rickettsia T4SS. virB4, virB8 and virB9 are duplicated, yet only one copy of each has the conserved features of similar genes in other T4SSs. An extraordinarily duplicated VirB6 gene encodes five hydrophobic proteins conserved only in a short region known to be involved in DNA transfer in A. tumefaciens. virB1, virB2 and virB7 are newly identified, revealing a Rickettsia T4SS lacking only virB5 relative to the vir archetype. Phylogeny estimation suggests vertical inheritance of all components, despite gene rearrangements into an archipelago of five islets. Similarities of Rickettsia VirB7/VirB9 to ComB7/ComB9 proteins of epsilon-proteobacteria, as well as phylogenetic affinities to the Legionella lvh T4SS, imply the Rickettsiales ancestor acquired a vir-like locus from distantly related bacteria, perhaps while residing in a protozoan host. Modern modifications of these systems likely reflect diversification with various eukaryotic host cells. CONCLUSION We present the rvh (Rickettsiales vir homolog) T4SS, an evolutionary conserved transporter with an unknown role in rickettsial biology. This work lays the foundation for future laboratory characterization of this system, and also identifies the Legionella lvh T4SS as a suitable genetic model.
Collapse
Affiliation(s)
- Joseph J Gillespie
- Virginia Bioinformatics Institute at Virginia Tech, Blacksburg, Virginia, United States of America.
| | | | | | | | | | | | | | | |
Collapse
|
24
|
de Koning AP, Noble GP, Heiss AA, Wong J, Keeling PJ. Environmental PCR survey to determine the distribution of a non-canonical genetic code in uncultivable oxymonads. Environ Microbiol 2008; 10:65-74. [PMID: 18211267 DOI: 10.1111/j.1462-2920.2007.01430.x] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The universal genetic code is conserved throughout most living systems, but a non-canonical code where TAA and TAG encode glutamine has evolved in several eukaryotes, including oxymonad protists. Most oxymonads are uncultivable, so environmental RT-PCR and PCR was used to examine the distribution of this rare character. A total of 253 unique isolates of four protein-coding genes were sampled from the hindgut community of the cockroach, Cryptocercus punctulatus, an environment rich in diversity from two of the five subgroups of oxymonad, saccinobaculids and polymastigids. Four alpha-tubulins were found with non-canonical glutamine codons. Environmental RACE confirmed that these and related genes used only TGA as stop codons, as expected for the non-canonical code, whereas other genes used TAA or TAG as stop codons, as expected for the universal code. We characterized alpha-tubulin from manually isolated Saccinobaculus ambloaxostylus, confirming it uses the universal code and suggesting, by elimination, that the non-canonical code is used by a polymastigid. HSP90 and EF-1alpha phylogenies also showed environmental sequences falling into two distinct groups, and are generally consistent with previous hypotheses that polymastigids and Streblomastix are closely related. Overall, we propose that the non-canonical genetic code arose once in a common ancestor of Streblomastix and a subgroup of polymastigids.
Collapse
Affiliation(s)
- Audrey P de Koning
- Canadian Institute for Advanced Research, Department of Botany, University of British Columbia, 3529-6270 University Boulevard, Vancouver, BC V6T 1Z4, Canada
| | | | | | | | | |
Collapse
|
25
|
|
26
|
Sengupta S, Yang X, Higgs PG. The mechanisms of codon reassignments in mitochondrial genetic codes. J Mol Evol 2007; 64:662-88. [PMID: 17541678 PMCID: PMC1894752 DOI: 10.1007/s00239-006-0284-7] [Citation(s) in RCA: 86] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2006] [Accepted: 03/07/2007] [Indexed: 11/26/2022]
Abstract
Many cases of nonstandard genetic codes are known in mitochondrial genomes. We carry out analysis of phylogeny and codon usage of organisms for which the complete mitochondrial genome is available, and we determine the most likely mechanism for codon reassignment in each case. Reassignment events can be classified according to the gain-loss framework. The “gain” represents the appearance of a new tRNA for the reassigned codon or the change of an existing tRNA such that it gains the ability to pair with the codon. The “loss” represents the deletion of a tRNA or the change in a tRNA so that it no longer translates the codon. One possible mechanism is codon disappearance (CD), where the codon disappears from the genome prior to the gain and loss events. In the alternative mechanisms the codon does not disappear. In the unassigned codon mechanism, the loss occurs first, whereas in the ambiguous intermediate mechanism, the gain occurs first. Codon usage analysis gives clear evidence of cases where the codon disappeared at the point of the reassignment and also cases where it did not disappear. CD is the probable explanation for stop to sense reassignments and a small number of reassignments of sense codons. However, the majority of sense-to-sense reassignments cannot be explained by CD. In the latter cases, by analysis of the presence or absence of tRNAs in the genome and of the changes in tRNA sequences, it is sometimes possible to distinguish between the unassigned codon and the ambiguous intermediate mechanisms. We emphasize that not all reassignments follow the same scenario and that it is necessary to consider the details of each case carefully.
Collapse
Affiliation(s)
- Supratim Sengupta
- Department of Physics and Astronomy, McMaster University, Hamilton, Ontario L8S 4M1 Canada
- Department of Physics and Atmospheric Science, Dalhousie University, Halifax, Nova Scotia B3H 3J5 Canada
| | - Xiaoguang Yang
- Department of Physics and Astronomy, McMaster University, Hamilton, Ontario L8S 4M1 Canada
| | - Paul G. Higgs
- Department of Physics and Astronomy, McMaster University, Hamilton, Ontario L8S 4M1 Canada
| |
Collapse
|
27
|
Massey SE, Garey JR. A comparative genomics analysis of codon reassignments reveals a link with mitochondrial proteome size and a mechanism of genetic code change via suppressor tRNAs. J Mol Evol 2007; 64:399-410. [PMID: 17390094 DOI: 10.1007/s00239-005-0260-7] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2005] [Accepted: 12/12/2006] [Indexed: 10/23/2022]
Abstract
Using a comparative genomics approach we demonstrate a negative correlation between the number of codon reassignments undergone by 222 mitochondrial genomes and the mitochondrial genome size, the number of mitochondrial ORFs, and the sizes of the large and small subunit mitochondrial rRNAs. In addition, we show that the TGA-to-tryptophan codon reassignment, which has occurred 11 times in mitochondrial genomes, is found in mitochondrial genomes smaller than those which have not undergone the reassignment. We therefore propose that mitochondrial codon reassignments occur in a wide range of phyla, particularly in Metazoa, due to a reduced "proteomic constraint" on the mitochondrial genetic code, compared to the nuclear genetic code. The reduced proteomic constraint reflects the small size of the mitochondrial-encoded proteome and allows codon reassignments to occur with less likelihood of lethality. In addition, we demonstrate a striking link between nonsense codon reassignments and the decoding properties of naturally occurring nonsense suppressor tRNAs. This suggests that natural preexisting nonsense suppression facilitated nonsense codon reassignments and constitutes a novel mechanism of genetic code change. These findings explain for the first time the identity of the stop codons and amino acids reassigned in mitochondrial and nuclear genomes. Nonsense suppressor tRNAs provided the raw material for nonsense codon reassignments, implying that the properties of the tRNA anticodon have dictated the identity of nonsense codon reassignments.
Collapse
Affiliation(s)
- Steven E Massey
- Department of Biology, University of South Florida, 4202 East Fowler Avenue, Tampa, FL 33620, USA.
| | | |
Collapse
|
28
|
Audia JP, Winkler HH. Study of the five Rickettsia prowazekii proteins annotated as ATP/ADP translocases (Tlc): Only Tlc1 transports ATP/ADP, while Tlc4 and Tlc5 transport other ribonucleotides. J Bacteriol 2006; 188:6261-8. [PMID: 16923893 PMCID: PMC1595366 DOI: 10.1128/jb.00371-06] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The obligate intracytoplasmic pathogen Rickettsia prowazekii relies on the transport of many essential compounds from the cytoplasm of the eukaryotic host cell in lieu of de novo synthesis, an evolutionary outcome undoubtedly linked to obligatory growth in this metabolite-replete niche. The paradigm for the study of rickettsial transport systems is the ATP/ADP translocase Tlc1, which exchanges bacterial ADP for host cell ATP as a source of energy, rather than as a source of adenylate. Interestingly, the R. prowazekii genome encodes four open reading frames that are highly homologous to the well-characterized ATP/ADP translocase Tlc1. Therefore, by annotation, the R. prowazekii genome encodes a total of five ATP/ADP translocases: Tlc1, Tlc2, Tlc3, Tlc4, and Tlc5. We have confirmed by quantitative reverse transcriptase PCR that mRNAs corresponding to all five tlc homologues are expressed in R. prowazekii growing in L-929 cells and have shown their heterologous protein expression in Escherichia coli, suggesting that none of the tlc genes are pseudogenes in the process of evolutionary meltdown. However, we demonstrate by heterologous expression in E. coli that only Tlc1 functions as an ATP/ADP transporter. A survey of nucleotides and nucleosides has determined that Tlc4 transports CTP, UTP, and GDP. Intriguingly, although GTP was not transported by Tlc4, it was an inhibitor of CTP and UTP uptake and demonstrated a K(i) similar to that of GDP. In addition, we demonstrate that Tlc5 transports GTP and GDP. We postulate that Tlc4 and Tlc5 serve the primary function of maintaining intracellular pools of nucleotides for rickettsial nucleic acid biosynthesis and do not provide the cell with nucleoside triphosphates as an energy source, as is the case for Tlc1. Although heterologous expression of Tlc2 and Tlc3 was observed in E. coli, we were unable to identify substrates for these proteins.
Collapse
Affiliation(s)
- Jonathon P Audia
- Laboratory of Molecular Biology, Department of Microbiology and Immunology, University of South Alabama College of Medicine, Mobile, AL 36688, USA.
| | | |
Collapse
|
29
|
Affiliation(s)
- Dieter Söll
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520-8114, USA.
| | | |
Collapse
|
30
|
Abstract
Many modified genetic codes are found in specific genomes in which one or more codons have been reassigned to a different amino acid from that in the canonical code. We present a new framework for codon reassignment that incorporates two previously proposed mechanisms (codon disappearance and ambiguous intermediate) and introduces two further mechanisms (unassigned codon and compensatory change). Our theory is based on the observation that reassignment involves a gain and a loss. The loss could be the deletion or loss of function of a tRNA or release factor. The gain could be the gain of a new type of tRNA or the gain of function of an existing tRNA due to mutation or base modification. The four mechanisms are distinguished by whether the codon disappears from the genome during the reassignment and by the order of the gain and loss events. We present simulations of the gain-loss model showing that all four mechanisms can occur within the same framework as the parameters are varied. We investigate the way the frequencies of the mechanisms are influenced by selection strengths, the number of codons undergoing reassignment, directional mutation pressure, and selection for reduced genome size.
Collapse
Affiliation(s)
- Supratim Sengupta
- Department of Physics and Astronomy, McMaster University, Hamilton, Ontario, Canada
| | | |
Collapse
|
31
|
Teyssier C, Marchandin H, Jumas-Bilak E. [The genome of alpha-proteobacteria : complexity, reduction, diversity and fluidity]. Can J Microbiol 2004; 50:383-96. [PMID: 15284884 DOI: 10.1139/w04-033] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
The alpha-proteobacteria displayed diverse and often unconventional life-styles. In particular, they keep close relationships with the eucaryotic cell. Their genomic organization is often atypical. Indeed, complex genomes, with two or more chromosomes that could be linear and sometimes associated with plasmids larger than one megabase, have been described. Moreover, polymorphism in genome size and topology as well as in replicon number was observed among very related bacteria, even in a same species. Alpha-proteobacteria provide a good model to study the reductive evolution, the role and origin of multiple chromosomes, and the genomic fluidity. The amount of new data harvested in the last decade should lead us to better understand emergence of bacterial life-styles and to build the conceptual basis to improve the definition of the bacterial species.
Collapse
Affiliation(s)
- Corinne Teyssier
- Laboratoire de bactériologie, Faculté de pharmacie, Montpellier CEDEX 5, France
| | | | | |
Collapse
|
32
|
Desai D, Zhang K, Barik S, Srivastava A, Bolander MEME, Sarkar G. Intragenic codon bias in a set of mouse and human genes. J Theor Biol 2004; 230:215-25. [PMID: 15302553 DOI: 10.1016/j.jtbi.2004.05.003] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2004] [Revised: 05/06/2004] [Accepted: 05/06/2004] [Indexed: 11/20/2022]
Abstract
To better conceptualize the mechanism underlying the evolution of synonymous codons, we have analysed intragenic codon usage in chosen "regions" of some mouse and human genes. We divided a given gene into two regions: one consisting of a trinucleotide repeat (TNR) and the other consisting of the "rest of the coding region" (RCR). Usually, a TNR is composed of a repetitive single codon, which may reflect its frequency in a gene. In contrast, a non-random frequency of a codon in the RCR versus TNR (or vice versa) of a gene should indicate a bias for that codon within the TNR. We examined this scenario by comparing codon frequency between the RCR and the cognate TNR(s) for a set of human and mouse genes. A TNR length of six amino acids or more was used to identify genes from the Genbank database. Twenty nine human and twenty one mouse genes containing TNRs coding for nine different amino acid runs were identified. The ratio of codon frequency in a TNR versus the corresponding RCR was expressed as "fold change" which was also regarded as a measure of codon bias (defined as preferential use either in TNR or in RCR). Chi-square values were then determined from the distribution of codon frequency in a TNR vs. the cognate RCR. At p<0.001, 22% and 27%, respectively, of human and mouse TNRs showed codon bias. Greater than 40% of the TNRs (29 out of 69 in human, and 18 of 42 in mouse) showed codon bias at p<0.05. In addition, we identify eight single-codon TNRs in mouse and ten in human genes. Thus, our results show intragenic codon bias in both mouse and human genes expressed in diverse tissue types. Since our results are independent of the Codon Adaptation Index (CAI) and starvation CAI, and since the tRNA repertoire in a cell or in a tissue is constant, our data suggest that other constraints besides tRNA abundance played a role in creating intragenic codon bias in these genes.
Collapse
Affiliation(s)
- Dinakar Desai
- Department of Orthopedics, Mayo Clinic and Foundation, Medical Science Building 3-69, 200 1st Street, SW, Rochester, MN 55905, USA
| | | | | | | | | | | |
Collapse
|
33
|
Bacher JM, Bull JJ, Ellington AD. Evolution of phage with chemically ambiguous proteomes. BMC Evol Biol 2003; 3:24. [PMID: 14667253 PMCID: PMC317279 DOI: 10.1186/1471-2148-3-24] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2002] [Accepted: 12/10/2003] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The widespread introduction of amino acid substitutions into organismal proteomes has occurred during natural evolution, but has been difficult to achieve by directed evolution. The adaptation of the translation apparatus represents one barrier, but the multiple mutations that may be required throughout a proteome in order to accommodate an alternative amino acid or analogue is an even more daunting problem. The evolution of a small bacteriophage proteome to accommodate an unnatural amino acid analogue can provide insights into the number and type of substitutions that individual proteins will require to retain functionality. RESULTS The bacteriophage Qbeta initially grows poorly in the presence of the amino acid analogue 6-fluorotryptophan. After 25 serial passages, the fitness of the phage on the analogue was substantially increased; there was no loss of fitness when the evolved phage were passaged in the presence of tryptophan. Seven mutations were fixed throughout the phage in two independent lines of descent. None of the mutations changed a tryptophan residue. CONCLUSIONS A relatively small number of mutations allowed an unnatural amino acid to be functionally incorporated into a highly interdependent set of proteins. These results support the 'ambiguous intermediate' hypothesis for the emergence of divergent genetic codes, in which the adoption of a new genetic code is preceded by the evolution of proteins that can simultaneously accommodate more than one amino acid at a given codon. It may now be possible to direct the evolution of organisms with novel genetic codes using methods that promote ambiguous intermediates.
Collapse
Affiliation(s)
- Jamie M Bacher
- Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, TX, USA 78712
- The Skaggs Institute for Chemical Biology, The Scripps Research Institute, La Jolla, CA, USA 92037
| | - James J Bull
- Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, TX, USA 78712
- Section of Integrative Biology, University of Texas at Austin, Austin, TX, USA 78712
| | - Andrew D Ellington
- Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, TX, USA 78712
- Department of Chemistry and Biochemistry, University of Texas at Austin, Austin, TX, USA 78712
| |
Collapse
|
34
|
Abstract
The primordial genetic code probably has been a drastically simplified ancestor of the canonical code that is used by contemporary cells. In order to understand how the present-day code came about we first need to explain how the language of the building plan can change without destroying the encoded information. In this work we introduce a minimal organism model that is based on biophysically reasonable descriptions of RNA and protein, namely secondary structure folding and knowledge based potentials. The evolution of a population of such organism under competition for a common resource is simulated explicitly at the level of individual replication events. Starting with very simple codes, and hence greatly reduced amino acid alphabets, we observe a diversification of the codes in most simulation runs. The driving force behind this effect is the possibility to produce fitter proteins when the repertoire of amino acids is enlarged.
Collapse
Affiliation(s)
- Günter Weberndorfer
- Institut für Theoretische Chemie und Molekulare Strukturbiologie, Universität Wien, Wien, Austria
| | | | | |
Collapse
|
35
|
Vitorino L, Zé-Zé L, Sousa A, Bacellar F, Tenreiro R. rRNA intergenic spacer regions for phylogenetic analysis of Rickettsia species. Ann N Y Acad Sci 2003; 990:726-33. [PMID: 12860714 DOI: 10.1111/j.1749-6632.2003.tb07451.x] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Species of the genus Rickettsia are responsible for several human diseases, namely epidemic typhus, Rocky Mountain spotted fever, and tick-borne typhus transmitted by arthropod vectors. The rrl-rrf intergenic spacer region (rrl-rrf ITS) was sequenced for 12 Rickettsia strains, including R. typhi, 6 untested species, R. aeschlimannii, Bar29, R. helvetica, R. honei, R. massilae, and R. slovaca as well as 5 Portuguese isolates. Phylogenetic trees inferred from rrl-rrf spacer sequences using maximum-parsimony and distance methods provided largely congruent tree topologies, supported by significant bootstrap values, enabling the identification of five distinct rickettsiae clusters.
Collapse
Affiliation(s)
- Liliana Vitorino
- Departamento de Biologia Vegetal/Centro de Genética e Biologia Molecular, Faculdade de Ciências, Universidade de Lisboa, Campo Grande, 1749-016 Lisboa, Portugal
| | | | | | | | | |
Collapse
|
36
|
Massung RF, Lee K, Mauel M, Gusa A. Characterization of the rRNA genes of Ehrlichia chaffeensis and Anaplasma phagocytophila. DNA Cell Biol 2002; 21:587-96. [PMID: 12215262 DOI: 10.1089/104454902320308960] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The rRNA genes of Ehrlichia chaffeensis and Anaplasma phagocytophila have been analyzed. The 16S rRNA genes were previously characterized for both of these agents. Southern hybridization was used to show that there are single copies of both the 16S and 23S rRNA genes in the genomes of each organism, and that the 16S rRNA genes were upstream from the 23S rRNA genes by at least 16 and 11 Kb for E. chaffeensis and A. phagocytophila, respectively. PCR amplification and gene walking was used to sequence the 23S and 5S rRNA genes, and show that these genes are contiguous and are likely expressed as a single operon. The level of homology between the E. chaffeensis and A. phagocytophila 23S and 5S rRNA genes, and 23S-5S spacers, was 91.8, 81.5, and 40%, respectively. To confirm the hybridization data, genome walking was used to sequence downstream of the 16S rRNA genes, and although no tRNA genes were identified, open reading frames encoding homologues of the Escherichia coli succinate dehydrogenase, subunit C, were found in both E. chaffeensis and A. phagocytophila. Phylogenetic analysis using the 23S rRNA gene suggests that reorganization of the phylum Proteobacteria by division of the class Alphaproteobacteria into two separate subclasses, may be appropriate.
Collapse
Affiliation(s)
- Robert F Massung
- Division of Viral and Rickettsial Diseases, National Center for Infectious Diseases, Centers for Disease Control and Prevention, Atlanta, Georgia 30333, USA.
| | | | | | | |
Collapse
|
37
|
Amiri H, Alsmark CM, Andersson SGE. Proliferation and deterioration of Rickettsia palindromic elements. Mol Biol Evol 2002; 19:1234-43. [PMID: 12140235 DOI: 10.1093/oxfordjournals.molbev.a004184] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
It has been suggested that Rickettsia Palindromic Elements (RPEs) have evolved as selfish DNA that mediate protein sequence evolution by being targeted to genes that code for RNA and proteins. Here, we have examined the phylogenetic depth of two RPEs that are located close to the genes encoding elongation factors Tu (tuf) and G (fus) in Rickettsia. An exceptional organization of the elongation factor genes was found in all 11 species examined, with complete or partial RPEs identified downstream of the tuf gene (RPE-tuf) in six species and of the fus gene (RPE-fus) in 10 species. A phylogenetic reconstruction shows that both RPE-tuf and RPE-fus have evolved in a manner that is consistent with the expected species divergence. The analysis provides evidence for independent loss of RPE-tuf in several species, possibly mediated by short repetitive sequences flanking the site of excision. The remaining RPE-tuf sequences evolve as neutral sequences in different stages of deterioration. Likewise, highly fragmented remnants of the RPE-fus sequence were identified in two species. This suggests that genome-specific differences in the content of RPEs are the result of recent loss rather than recent proliferation.
Collapse
Affiliation(s)
- Haleh Amiri
- Department of Molecular Evolution, University of Uppsala, Sweden
| | | | | |
Collapse
|
38
|
Abstract
Our thesis is that the DNA composition and structure of genomes are selected in part by mutation bias (GC pressure) and in part by ecology. To illustrate this point, we compare and contrast the oligonucleotide composition and the mosaic structure in 36 complete genomes and in 27 long genomic sequences from archaea and eubacteria. We report the following findings (1) High-GC-content genomes show a large underrepresentation of short distances between G(n) and C(n) homopolymers with respect to distances between A(n) and T(n) homopolymers; we discuss selection versus mutation bias hypotheses. (2) The oligonucleotide compositions of the genomes of Neisseria (meningitidis and gonorrhoea), Helicobacter pylori and Rhodobacter capsulatus are more biased than the other sequenced genomes. (3) The genomes of free-living species or nonchronic pathogens show more mosaic-like structure than genomes of chronic pathogens or intracellular symbionts. (4) Genome mosaicity of intracellular parasites has a maximum corresponding to the average gene length; in the genomes of free-living and nonchronic pathogens the maximum occurs at larger length scales. This suggests that free-living species can incorporate large pieces of DNA from the environment, whereas for intracellular parasites there are recombination events between homologous genes. We discuss the consequences in terms of evolution of genome size. (5) Intracellular symbionts and obligate pathogens show small, but not zero, amount of chromosome mosaicity, suggesting that recombination events occur in these species.
Collapse
Affiliation(s)
- Pietro Liò
- Department of Zoology, University of Cambridge, United Kingdom.
| |
Collapse
|
39
|
Abstract
A simple method is presented for reconstructing phylogenetic trees on the basis of gene transposition. It is shown that differences in gene arrangements among genomes could allow us to determine whether a gene transposition event has occurred before or after species divergence from parsimonious considerations. The method is applied to evolutionary relationships among the bacterial class Proteobacteria, for which complete genomic sequences most densely accumulate and comprehensive gene order comparisons are possible. We were able to infer the emergence order of proteobacterial subclasses as epsilon-->beta-->gamma. This order is consistent with sequence-based inferences, which conversely confirms the usefulness of the approach presented here.
Collapse
Affiliation(s)
- T Kunisawa
- Department of Applied Biological Sciences, Science University of Tokyo, Noda, 278-8510, Japan.
| |
Collapse
|
40
|
Abstract
Although bacteria increase their DNA content through horizontal transfer and gene duplication, their genomes remain small and, in particular, lack nonfunctional sequences. This pattern is most readily explained by a pervasive bias towards higher numbers of deletions than insertions. When selection is not strong enough to maintain them, genes are lost in large deletions or inactivated and subsequently eroded. Gene inactivation and loss are particularly apparent in obligate parasites and symbionts, in which dramatic reductions in genome size can result not from selection to lose DNA, but from decreased selection to maintain gene functionality. Here we discuss the evidence showing that deletional bias is a major force that shapes bacterial genomes.
Collapse
Affiliation(s)
- A Mira
- Dept of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, USA
| | | | | |
Collapse
|
41
|
Abstract
Studies of neutrally evolving sequences suggest that differences in eukaryotic genome sizes result from different rates of DNA loss. However, very few pseudogenes have been identified in microbial species, and the processes whereby genes and genomes deteriorate in bacteria remain largely unresolved. The typhus-causing agent, Rickettsia prowazekii, is exceptional in that as much as 24% of its 1.1-Mb genome consists of noncoding DNA and pseudogenes. To test the hypothesis that the noncoding DNA in the R. prowazekii genome represents degraded remnants of ancestral genes, we systematically examined all of the identified pseudogenes and their flanking sequences in three additional Rickettsia species. Consistent with the hypothesis, we observe sequence similarities between genes and pseudogenes in one species and intergenic DNA in another species. We show that the frequencies and average sizes of deletions are larger than insertions in neutrally evolving pseudogene sequences. Our results suggest that inactivated genetic material in the Rickettsia genomes deteriorates spontaneously due to a mutation bias for deletions and that the noncoding sequences represent DNA in the final stages of this degenerative process.
Collapse
Affiliation(s)
- J O Andersson
- Department of Molecular Evolution, University of Uppsala, Uppsala, Sweden
| | | |
Collapse
|
42
|
Knight RD, Freeland SJ, Landweber LF. Rewiring the keyboard: evolvability of the genetic code. Nat Rev Genet 2001; 2:49-58. [PMID: 11253070 DOI: 10.1038/35047500] [Citation(s) in RCA: 264] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The genetic code evolved in two distinct phases. First, the 'canonical' code emerged before the last universal ancestor; subsequently, this code diverged in numerous nuclear and organelle lineages. Here, we examine the distribution and causes of these secondary deviations from the canonical genetic code. The majority of non-standard codes arise from alterations in the tRNA, with most occurring by post-transcriptional modifications, such as base modification or RNA editing, rather than by substitutions within tRNA anticodons.
Collapse
Affiliation(s)
- R D Knight
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, New Jersey 08544, USA.
| | | | | |
Collapse
|
43
|
Abstract
The endosymbiotic theory for the origin of mitochondria requires substantial modification. The three identifiable ancestral sources to the proteome of mitochondria are proteins descended from the ancestral alpha-proteobacteria symbiont, proteins with no homology to bacterial orthologs, and diverse proteins with bacterial affinities not derived from alpha-proteobacteria. Random mutations in the form of deletions large and small seem to have eliminated nonessential genes from the endosymbiont-mitochondrial genome lineages. This process, together with the transfer of genes from the endosymbiont-mitochondrial genome to nuclei, has led to a marked reduction in the size of mitochondrial genomes. All proteins of bacterial descent that are encoded by nuclear genes were probably transferred by the same mechanism, involving the disintegration of mitochondria or bacteria by the intracellular membranous vacuoles of cells to release nucleic acid fragments that transform the nuclear genome. This ongoing process has intermittently introduced bacterial genes to nuclear genomes. The genomes of the last common ancestor of all organisms, in particular of mitochondria, encoded cytochrome oxidase homologues. There are no phylogenetic indications either in the mitochondrial proteome or in the nuclear genomes that the initial or subsequent function of the ancestor to the mitochondria was anaerobic. In contrast, there are indications that relatively advanced eukaryotes adapted to anaerobiosis by dismantling their mitochondria and refitting them as hydrogenosomes. Accordingly, a continuous history of aerobic respiration seems to have been the fate of most mitochondrial lineages. The initial phases of this history may have involved aerobic respiration by the symbiont functioning as a scavenger of toxic oxygen. The transition to mitochondria capable of active ATP export to the host cell seems to have required recruitment of eukaryotic ATP transport proteins from the nucleus. The identity of the ancestral host of the alpha-proteobacterial endosymbiont is unclear, but there is no indication that it was an autotroph. There are no indications of a specific alpha-proteobacterial origin to genes for glycolysis. In the absence of data to the contrary, it is assumed that the ancestral host cell was a heterotroph.
Collapse
Affiliation(s)
- C G Kurland
- Department of Molecular Evolution, Evolutionary Biology Centre, University of Uppsala, Uppsala SE 752 36, Lund University, Lund SE 223 62, Sweden.
| | | |
Collapse
|
44
|
Andersson SG, Dehio C. Rickettsia prowazekii and Bartonella henselae: differences in the intracellular life styles revisited. Int J Med Microbiol 2000; 290:135-41. [PMID: 11045918 DOI: 10.1016/s1438-4221(00)80081-8] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/15/2022] Open
Abstract
Within the alpha subdivision of proteobacteria, the arthropod-borne human pathogens Rickettsia prowazekii and Bartonella henselae provide examples of bacteria with obligate and facultative intracellular life styles, respectively. The complete genome sequence of R. prowazekii has been published, whereas the sequencing of the B. henselae genome is in its final stage. Here, we provide a brief overview of a comparative analysis of both genomes based on the delineated metabolic properties. The relative proportion of genes devoted to basic information processes is similar in the two genomes. In contrast, a full set of genes encoding proteins involved in the biosynthesis of amino acids and nucleotides is present in B. henselae, while the majority of these genes is absent from R. prowazekii. This suggests that B. henselae has a better potential for growth in the free-living mode, whereas R. prowazekii is more specialised to growth in an intracellular environment. Functional genomics will provide the potential to further resolve the genetic basis for successful human infections by these important parasites.
Collapse
Affiliation(s)
- S G Andersson
- Department of Molecular Evolution, Evolutionary Biology Center, Uppsala University, Sweden.
| | | |
Collapse
|
45
|
Abstract
At the beginning of the 20th century, it was discovered at the Pasteur Institute in Tunis that epidemic typhus is transmitted by the human body louse. The complete genome sequence of its causative agent, Rickettsia prowazekii, was determined at Uppsala University in Sweden at the end of the century. In this mini-review, we discuss insights gained from the genome sequence of this fascinating and deadly organism.
Collapse
Affiliation(s)
- J O Andersson
- Department of Molecular Evolution, Uppsala University, Evolutionary Biology Center, Sweden
| | | |
Collapse
|
46
|
Abstract
A model for the developmental pathway of the genetic code, grounded on group theory and the thermodynamics of codon-anticodon interaction is presented. At variance with previous models, it takes into account not only the optimization with respect to amino acid attributes but, also physicochemical constraints and initial conditions. A 'simple-first' rule is introduced after ranking the amino acids with respect to two current measures of chemical complexity. It is shown that a primeval code of only seven amino acids is enough to build functional proteins. It is assumed that these proteins drive the further expansion of the code. The proposed primeval code is compared with surrogate codes randomly generated and with another proposal for primeval code found in the literature. The departures from the 'universal' code, observed in many organisms and cellular compartments, fit naturally in the proposed evolutionary scheme. A strong correlation is found between, on one side, the two classes of aminoacyl-tRNA synthetases, and on the other, the amino acids grouped by end-atom-type and by codon type. An inverse of Davydov's rules, to associate the amino acid end atoms (O/N and non-O/non-N) of 18 amino acids with codons containing a weak base (A/U), extended to the 20 amino acids, is derived.
Collapse
Affiliation(s)
- M A Jiménez-Montaño
- Innovationskolleg Theoretische Biologie, Humboldt-Universität zu Berlin, Germany.
| |
Collapse
|
47
|
Zomorodipour A, Andersson SG. Obligate intracellular parasites: Rickettsia prowazekii and Chlamydia trachomatis. FEBS Lett 1999; 452:11-5. [PMID: 10376669 DOI: 10.1016/s0014-5793(99)00563-3] [Citation(s) in RCA: 71] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Transitions to obligate intracellular parasitism have occurred at numerous times in the evolutionary past. The genome sequences of two obligate intracellular parasites, Rickettsia prowazekii and Chlamydia trachomatis, were published last year. A comparative analysis of these two genomes has revealed examples of reductive convergent evolution, such as a massive loss of genes involved in biosynthetic functions. In addition, both genomes were found to encode transport systems for ATP and ADP, not otherwise found in bacteria. Here, we discuss adaptations to intracellular habitats by comparing the information obtained from the recently published genome sequences of R. prowazekii and C. trachomatis.
Collapse
Affiliation(s)
- A Zomorodipour
- Department of Molecular Evolution, Biomedical Center, Uppsala, Sweden
| | | |
Collapse
|
48
|
Andersson SG, Kurland CG. Ancient and recent horizontal transfer events: the origins of mitochondria. APMIS. SUPPLEMENTUM 1998; 84:5-14. [PMID: 9850675 DOI: 10.1111/j.1600-0463.1998.tb05641.x] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/16/2022]
Affiliation(s)
- S G Andersson
- Department of Molecular Biology, Uppsala University, Sweden
| | | |
Collapse
|
49
|
|
50
|
Abstract
The recent sequencing of the entire genomes of Mycoplasma genitalium and M. pneumoniae has attracted considerable attention to the molecular biology of mycoplasmas, the smallest self-replicating organisms. It appears that we are now much closer to the goal of defining, in molecular terms, the entire machinery of a self-replicating cell. Comparative genomics based on comparison of the genomic makeup of mycoplasmal genomes with those of other bacteria, has opened new ways of looking at the evolutionary history of the mycoplasmas. There is now solid genetic support for the hypothesis that mycoplasmas have evolved as a branch of gram-positive bacteria by a process of reductive evolution. During this process, the mycoplasmas lost considerable portions of their ancestors' chromosomes but retained the genes essential for life. Thus, the mycoplasmal genomes carry a high percentage of conserved genes, greatly facilitating gene annotation. The significant genome compaction that occurred in mycoplasmas was made possible by adopting a parasitic mode of life. The supply of nutrients from their hosts apparently enabled mycoplasmas to lose, during evolution, the genes for many assimilative processes. During their evolution and adaptation to a parasitic mode of life, the mycoplasmas have developed various genetic systems providing a highly plastic set of variable surface proteins to evade the host immune system. The uniqueness of the mycoplasmal systems is manifested by the presence of highly mutable modules combined with an ability to expand the antigenic repertoire by generating structural alternatives, all compressed into limited genomic sequences. In the absence of a cell wall and a periplasmic space, the majority of surface variable antigens in mycoplasmas are lipoproteins. Apart from providing specific antimycoplasmal defense, the host immune system is also involved in the development of pathogenic lesions and exacerbation of mycoplasma induced diseases. Mycoplasmas are able to stimulate as well as suppress lymphocytes in a nonspecific, polyclonal manner, both in vitro and in vivo. As well as to affecting various subsets of lymphocytes, mycoplasmas and mycoplasma-derived cell components modulate the activities of monocytes/macrophages and NK cells and trigger the production of a wide variety of up-regulating and down-regulating cytokines and chemokines. Mycoplasma-mediated secretion of proinflammatory cytokines, such as tumor necrosis factor alpha, interleukin-1 (IL-1), and IL-6, by macrophages and of up-regulating cytokines by mitogenically stimulated lymphocytes plays a major role in mycoplasma-induced immune system modulation and inflammatory responses.
Collapse
Affiliation(s)
- S Razin
- Department of Membrane and Ultrastructure Research, The Hebrew University-Hadassah Medical School, Jerusalem 91120, Israel.
| | | | | |
Collapse
|