1
|
Nemzer LR. Shannon information entropy in the canonical genetic code. J Theor Biol 2017; 415:158-170. [DOI: 10.1016/j.jtbi.2016.12.010] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2016] [Revised: 11/30/2016] [Accepted: 12/12/2016] [Indexed: 11/15/2022]
|
2
|
Clues to tRNA Evolution from the Distribution of Class II tRNAs and Serine Codons in the Genetic Code. Life (Basel) 2016; 6:life6010010. [PMID: 26927183 PMCID: PMC4810241 DOI: 10.3390/life6010010] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2015] [Revised: 02/20/2016] [Accepted: 02/22/2016] [Indexed: 11/30/2022] Open
Abstract
We have previously proposed that tRNAGly was the first tRNA and glycine was the first amino acid incorporated into the genetic code. The next two amino acids incorporated would have been the other two small hydrophilic amino acids serine and aspartic acid, which occurred through the duplication of the tRNAGly sequence, followed by mutation of its anticodon by single C to U transition mutations, possibly through spontaneous deamination. Interestingly, however, tRNASer has a different structure than most other tRNAs, possessing a long variable arm; because of this tRNASer is classified as a class II tRNA. Also, serine codons are found not only in the bottom right-hand corner of the genetic code table next to those for glycine and aspartic acid, but also in the top row of the table, next to those for two of the most hydrophobic amino acids, leucine and phenylalanine. In the following, I propose that the class II tRNA structure of tRNASer and the arrangement of serine codons in the genetic code provide clues to the early evolution of tRNA and the genetic code. In addition, I address Di Giulio’s recent criticism of our proposal that tRNAGly was the first tRNA, and discuss how early peptides produced from a restricted amino acid alphabet of glycine, serine and aspartic acid might have possessed proteolytic activity, which is possibly important for the early recycling of amino acid monomers.
Collapse
|
3
|
Ancestral Reconstruction of a Pre-LUCA Aminoacyl-tRNA Synthetase Ancestor Supports the Late Addition of Trp to the Genetic Code. J Mol Evol 2015; 80:171-85. [PMID: 25791872 DOI: 10.1007/s00239-015-9672-1] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2015] [Accepted: 03/09/2015] [Indexed: 01/14/2023]
Abstract
The genetic code was likely complete in its current form by the time of the last universal common ancestor (LUCA). Several scenarios have been proposed for explaining the code's pre-LUCA emergence and expansion, and the relative order of the appearance of amino acids used in translation. One co-evolutionary model of genetic code expansion proposes that at least some amino acids were added to the code by the ancient divergence of aminoacyl-tRNA synthetase (aaRS) families. Of all the amino acids used within the genetic code, Trp is most frequently claimed as a relatively recent addition. We observe that, since TrpRS and TyrRS are paralogous protein families retaining significant sequence similarity, the inferred sequence composition of their ancestor can be used to evaluate this co-evolutionary model of genetic code expansion. We show that ancestral sequence reconstructions of the pre-LUCA paralog ancestor of TyrRS and TrpRS have several sites containing Tyr, yet a complete absence of sites containing Trp. This is consistent with the paralog ancestor being specific for the utilization of Tyr, with Trp being a subsequent addition to the genetic code facilitated by a process of aaRS divergence and neofunctionalization. Only after this divergence could Trp be specifically encoded and incorporated into proteins, including the TyrRS and TrpRS descendant lineages themselves. This early absence of Trp is observed under both homogeneous and non-homogeneous models of ancestral sequence reconstruction. Simulations support that this observed absence of Trp is unlikely to be due to chance or model bias. These results support that the final stages of genetic code evolution occurred well within the "protein world," and that the presence-absence of Trp within conserved sites of ancient protein domains is a likely measure of their relative antiquity, permitting the relative timing of extremely early events within protein evolution before LUCA.
Collapse
|
4
|
Genetic code evolution started with the incorporation of glycine, followed by other small hydrophilic amino acids. J Mol Evol 2014; 78:307-9. [PMID: 24916657 DOI: 10.1007/s00239-014-9627-y] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2014] [Accepted: 05/23/2014] [Indexed: 10/25/2022]
Abstract
We propose that glycine was the first amino acid to be incorporated into the genetic code, followed by serine, aspartic and/or glutamic acid-small hydrophilic amino acids that all have codons in the bottom right-hand corner of the standard genetic code table. Because primordial ribosomal synthesis is presumed to have been rudimentary, this stage would have been characterized by the synthesis of short, water-soluble peptides, the first of which would have comprised polyglycine. Evolution of the code is proposed to have occurred by the duplication and mutation of tRNA sequences, which produced a radiation of codon assignment outwards from the bottom right-hand corner. As a result of this expansion, we propose a trend from small hydrophilic to hydrophobic amino acids, with selection for longer polypeptides requiring a hydrophobic core for folding and stability driving the incorporation of hydrophobic amino acids into the code.
Collapse
|
5
|
Kawahara-Kobayashi A, Hitotsuyanagi M, Amikura K, Kiga D. Experimental evolution of a green fluorescent protein composed of 19 unique amino acids without tryptophan. ORIGINS LIFE EVOL B 2014; 44:75-86. [PMID: 25399308 DOI: 10.1007/s11084-014-9371-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2013] [Accepted: 09/25/2014] [Indexed: 10/24/2022]
Abstract
At some stage of evolution, genes of organisms may have encoded proteins that were synthesized using fewer than 20 unique amino acids. Similar to evolution of the natural 19-amino-acid proteins GroEL/ES, proteins composed of 19 unique amino acids would have been able to evolve by accumulating beneficial mutations within the 19-amino-acid repertoire encoded in an ancestral genetic code. Because Trp is thought to be the last amino acid included in the canonical 20-amino-acid repertoire, this late stage of protein evolution could be mimicked by experimental evolution of 19-amino-acid proteins without tryptophan (Trp). To further understand the evolution of proteins, we tried to mimic the evolution of a 19-amino-acid protein involving the accumulation of beneficial mutations using directed evolution by random mutagenesis on the whole targeted gene sequence. We created active 19-amino-acid green fluorescent proteins (GFPs) without Trp from a poorly fluorescent 19-amino-acid mutant, S1-W57F, by using directed evolution with two rounds of mutagenesis and selection. The N105I and S205T mutations showed beneficial effects on the S1-W57F mutant. When these two mutations were combined on S1-W57F, we observed an additive effect on the fluorescence intensity. In contrast, these mutations showed no clear improvement individually or in combination on GFPS1, which is the parental GFP mutant composed of 20 amino acids. Our results provide an additional example for the experimental evolution of 19-amino-acid proteins without Trp, and would help understand the mechanisms underlying the evolution of 19-amino-acid proteins. (236 words).
Collapse
Affiliation(s)
- Akio Kawahara-Kobayashi
- Department of Computational Intelligence and Systems Science, Interdisciplinary Graduate School of Science and Engineering, Tokyo Institute of Technology, Yokohama, Kanagawa, 226-8503, Japan
| | | | | | | |
Collapse
|
6
|
Groussin M, Boussau B, Charles S, Blanquart S, Gouy M. The molecular signal for the adaptation to cold temperature during early life on Earth. Biol Lett 2013; 9:20130608. [PMID: 24046876 PMCID: PMC3971708 DOI: 10.1098/rsbl.2013.0608] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Several lines of evidence such as the basal location of thermophilic lineages in large-scale phylogenetic trees and the ancestral sequence reconstruction of single enzymes or large protein concatenations support the conclusion that the ancestors of the bacterial and archaeal domains were thermophilic organisms which were adapted to hot environments during the early stages of the Earth. A parsimonious reasoning would therefore suggest that the last universal common ancestor (LUCA) was also thermophilic. Various authors have used branch-wise non-homogeneous evolutionary models that better capture the variation of molecular compositions among lineages to accurately reconstruct the ancestral G + C contents of ribosomal RNAs and the ancestral amino acid composition of highly conserved proteins. They confirmed the thermophilic nature of the ancestors of Bacteria and Archaea but concluded that LUCA, their last common ancestor, was a mesophilic organism having a moderate optimal growth temperature. In this letter, we investigate the unknown nature of the phylogenetic signal that informs ancestral sequence reconstruction to support this non-parsimonious scenario. We find that rate variation across sites of molecular sequences provides information at different time scales by recording the oldest adaptation to temperature in slow-evolving regions and subsequent adaptations in fast-evolving ones.
Collapse
Affiliation(s)
- Mathieu Groussin
- Laboratoire de Biométrie et Biologie Evolutive, Université de Lyon, Université Lyon 1, CNRS, UMR5558, Villeurbanne, France.
| | | | | | | | | |
Collapse
|
7
|
An D, Brown D, Chatterjee I, Dong X, Ramos-Padron E, Wilson S, Bordenave S, Caffrey SM, Gieg LM, Sensen CW, Voordouw G. Microbial community and potential functional gene diversity involved in anaerobic hydrocarbon degradation and methanogenesis in an oil sands tailings pond. Genome 2013; 56:612-8. [DOI: 10.1139/gen-2013-0083] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Oil sands tailings ponds harbor large amounts of tailings resulting from surface mining of bitumen and consist of water, sand, clays, residual bitumen, and hydrocarbon diluent. Oxygen ingress in these ponds is limited to the surface layers, causing most hydrocarbon degradation to be catalyzed by anaerobic, methanogenic microbial communities. This causes the evolution of large volumes of methane of up to 104m3/day. A pyrosequencing survey of 16S rRNA amplicons from 10 samples obtained from different depths indicated the presence of a wide variety of taxa involved in anaerobic hydrocarbon degradation and methanogenesis, including the phyla Proteobacteria, Euryarchaeota, Firmicutes, Actinobacteria, Chloroflexi, and Bacteroidetes. Metagenomic sequencing of DNA isolated from one of these samples indicated a more diverse community than indicated by the 16S rRNA amplicon survey. Both methods indicated the same major phyla to be present. The metagenomic dataset indicated the presence of genes involved in the three stages of anaerobic aromatic hydrocarbon degradation, including genes for enzymes of the peripheral (upper), the central (lower), and the methanogenesis pathways. Upper pathway genes showed broad phylogenetic affiliation (Proteobacteria, Firmicutes, and Actinobacteria), whereas lower pathway genes were mostly affiliated with the Deltaproteobacteria. Genes for both hydrogenotrophic and acetotrophic methanogenesis were also found. The wide variety of taxa involved in initial hydrocarbon degradation through upper pathways may reflect the variety of residual bitumen and diluent components present in the tailings pond.
Collapse
Affiliation(s)
- Dongshan An
- Department of Biological Sciences, University of Calgary, Calgary, AB T2N 1N4, Canada
| | - Damon Brown
- Department of Biological Sciences, University of Calgary, Calgary, AB T2N 1N4, Canada
| | - Indranil Chatterjee
- Department of Biological Sciences, University of Calgary, Calgary, AB T2N 1N4, Canada
| | - Xiaoli Dong
- Visual Genomics Centre, Faculty of Medicine, University of Calgary, Calgary, AB T2N 4N1, Canada
| | - Esther Ramos-Padron
- Department of Biological Sciences, University of Calgary, Calgary, AB T2N 1N4, Canada
| | - Sandra Wilson
- Department of Biological Sciences, University of Calgary, Calgary, AB T2N 1N4, Canada
| | - Sylvain Bordenave
- Department of Biological Sciences, University of Calgary, Calgary, AB T2N 1N4, Canada
| | - Sean M. Caffrey
- Department of Biological Sciences, University of Calgary, Calgary, AB T2N 1N4, Canada
| | - Lisa M. Gieg
- Department of Biological Sciences, University of Calgary, Calgary, AB T2N 1N4, Canada
| | - Christoph W. Sensen
- Visual Genomics Centre, Faculty of Medicine, University of Calgary, Calgary, AB T2N 4N1, Canada
| | - Gerrit Voordouw
- Department of Biological Sciences, University of Calgary, Calgary, AB T2N 1N4, Canada
| |
Collapse
|
8
|
Fournier GP, Neumann JE, Gogarten JP. Inferring the ancient history of the translation machinery and genetic code via recapitulation of ribosomal subunit assembly orders. PLoS One 2010; 5:e9437. [PMID: 20208990 PMCID: PMC2830423 DOI: 10.1371/journal.pone.0009437] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2009] [Accepted: 12/23/2009] [Indexed: 11/29/2022] Open
Abstract
Universally conserved positions in ribosomal proteins have significant biases in amino acid usage, likely indicating the expansion of the genetic code at the time leading up to the most recent common ancestor(s) (MRCA). Here, we apply this principle to the evolutionary history of the ribosome before the MRCA. It has been proposed that the experimentally determined order of assembly for ribosomal subunits recapitulates their evolutionary chronology. Given this model, we produce a probabilistic evolutionary ordering of the universally conserved small subunit (SSU) and large subunit (LSU) ribosomal proteins. Optimizing the relative ordering of SSU and LSU evolutionary chronologies with respect to minimizing differences in amino acid usage bias, we find strong compositional evidence for a more ancient origin for early LSU proteins. Furthermore, we find that this ordering produces several trends in specific amino acid usages compatible with models of genetic code evolution.
Collapse
Affiliation(s)
- Gregory P Fournier
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, United States of America.
| | | | | |
Collapse
|
9
|
|
10
|
Grosjean H, de Crécy-Lagard V, Marck C. Deciphering synonymous codons in the three domains of life: co-evolution with specific tRNA modification enzymes. FEBS Lett 2010; 584:252-64. [PMID: 19931533 DOI: 10.1016/j.febslet.2009.11.052] [Citation(s) in RCA: 203] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2009] [Revised: 11/11/2009] [Accepted: 11/16/2009] [Indexed: 10/20/2022]
Abstract
The strategies organisms use to decode synonymous codons in cytosolic protein synthesis are not uniform. The complete isoacceptor tRNA repertoire and the type of modified nucleoside found at the wobble position 34 of their anticodons were analyzed in all kingdoms of life. This led to the identification of four main decoding strategies that are diversely used in Bacteria, Archaea and Eukarya. Many of the modern tRNA modification enzymes acting at position 34 of tRNAs are present only in specific domains and obviously have arisen late during evolution. In an evolutionary fine-tuning process, these enzymes must have played an essential role in the progressive introduction of new amino acids, and in the refinement and standardization of the canonical nuclear genetic code observed in all extant organisms (functional convergent evolutionary hypothesis).
Collapse
Affiliation(s)
- Henri Grosjean
- Université Paris-Sud, CNRS, UMR8621, Institut de Génétique et de Microbiologie, Orsay F-91405, France.
| | | | | |
Collapse
|
11
|
Boussau B, Blanquart S, Necsulea A, Lartillot N, Gouy M. Parallel adaptations to high temperatures in the Archaean eon. Nature 2008; 456:942-5. [DOI: 10.1038/nature07393] [Citation(s) in RCA: 173] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2008] [Accepted: 09/01/2008] [Indexed: 11/09/2022]
|
12
|
Sun FJ, Caetano-Anollés G. Evolutionary patterns in the sequence and structure of transfer RNA: a window into early translation and the genetic code. PLoS One 2008; 3:e2799. [PMID: 18665254 PMCID: PMC2474678 DOI: 10.1371/journal.pone.0002799] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2008] [Accepted: 07/02/2008] [Indexed: 01/06/2023] Open
Abstract
Transfer RNA (tRNA) molecules play vital roles during protein synthesis. Their acceptor arms are aminoacylated with specific amino acid residues while their anticodons delimit codon specificity. The history of these two functions has been generally linked in evolutionary studies of the genetic code. However, these functions could have been differentially recruited as evolutionary signatures were left embedded in tRNA molecules. Here we built phylogenies derived from the sequence and structure of tRNA, we forced taxa into monophyletic groups using constraint analyses, tested competing evolutionary hypotheses, and generated timelines of amino acid charging and codon discovery. Charging of Sec, Tyr, Ser and Leu appeared ancient, while specificities related to Asn, Met, and Arg were derived. The timelines also uncovered an early role of the second and then first codon bases, identified codons for Ala and Pro as the most ancient, and revealed important evolutionary take-overs related to the loss of the long variable arm in tRNA. The lack of correlation between ancestries of amino acid charging and encoding indicated that the separate discoveries of these functions reflected independent histories of recruitment. These histories were probably curbed by co-options and important take-overs during early diversification of the living world.
Collapse
Affiliation(s)
- Feng-Jie Sun
- Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Gustavo Caetano-Anollés
- Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| |
Collapse
|
13
|
Sun FJ, Caetano-Anollés G. Evolutionary patterns in the sequence and structure of transfer RNA: early origins of archaea and viruses. PLoS Comput Biol 2008; 4:e1000018. [PMID: 18369418 PMCID: PMC2265525 DOI: 10.1371/journal.pcbi.1000018] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2007] [Accepted: 02/01/2008] [Indexed: 02/06/2023] Open
Abstract
Transfer RNAs (tRNAs) are ancient molecules that are central to translation. Since they probably carry evolutionary signatures that were left behind when the living world diversified, we reconstructed phylogenies directly from the sequence and structure of tRNA using well-established phylogenetic methods. The trees placed tRNAs with long variable arms charging Sec, Tyr, Ser, and Leu consistently at the base of the rooted phylogenies, but failed to reveal groupings that would indicate clear evolutionary links to organismal origin or molecular functions. In order to uncover evolutionary patterns in the trees, we forced tRNAs into monophyletic groups using constraint analyses to generate timelines of organismal diversification and test competing evolutionary hypotheses. Remarkably, organismal timelines showed Archaea was the most ancestral superkingdom, followed by viruses, then superkingdoms Eukarya and Bacteria, in that order, supporting conclusions from recent phylogenomic studies of protein architecture. Strikingly, constraint analyses showed that the origin of viruses was not only ancient, but was linked to Archaea. Our findings have important implications. They support the notion that the archaeal lineage was very ancient, resulted in the first organismal divide, and predated diversification of tRNA function and specificity. Results are also consistent with the concept that viruses contributed to the development of the DNA replication machinery during the early diversification of the living world.
Collapse
Affiliation(s)
- Feng-Jie Sun
- Department of Crop Sciences, University of Illinois Urbana-Champaign, Urbana, Illinois, United States of America
| | - Gustavo Caetano-Anollés
- Department of Crop Sciences, University of Illinois Urbana-Champaign, Urbana, Illinois, United States of America
| |
Collapse
|
14
|
Abstract
The evolution of the transfer RNA (tRNA) molecule is controversial but embeds the history of protein biosynthesis, the genetic code, and the origins of diversified life. A new phylogenetic method based on RNA structure that we developed provides new lines of evidence to support the genome tag hypothesis and confirms that the 'top half' of tRNA is more ancient than the 'bottom half'. Timelines of amino acid charging function generated from constraint analyses showed that selenocysteine, tyrosine, serine, and leucine specificities were ancient, while those related to asparagine, methionine, and arginine were more recent. The timelines also uncovered an early role of the second and then first codon bases, identified codons for alanine and proline as the most ancient, and revealed important evolutionary take-overs related to the loss of the long variable arm of tRNA. Furthermore, organismal timelines showed Archaea was the oldest superkingdom, followed by viruses, and superkingdoms Eukarya and Bacteria in that order supporting conclusions from recent phylogenomic studies of protein architecture. Strikingly, results showed that the origin of viruses was not only ancient but was linked to Archaea, supporting the notion that the archaeal lineage is the most ancient on earth and its origin predated diversification of tRNA function and specificity.
Collapse
Affiliation(s)
- Feng-Jie Sun
- Department of Crop Sciences at the University of Illinois at Urbana-Champaign, 61801, USA
| | - Gustavo Caetano-Anollés
- Department of Crop Sciences, University of Illinois, 332 NSRC, 1101 West Peabody Drive, Urbana, Illinois, 61801, USA
| |
Collapse
|