1
|
Di Giulio M. The polyphyletic origins of glycyl-tRNA synthetase and lysyl-tRNA synthetase and their implications. Biosystems 2024; 244:105287. [PMID: 39127441 DOI: 10.1016/j.biosystems.2024.105287] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2024] [Revised: 08/07/2024] [Accepted: 08/07/2024] [Indexed: 08/12/2024]
Abstract
I analyzed the polyphyletic origin of glycyl-tRNA synthetase (GlyRS) and lysyl-tRNA synthetase (LysRS), making plausible the following implications. The fact that the genetic code needed to evolve aminoacyl-tRNA synthetases (ARSs) only very late would be in perfect agreement with a late origin, in the main phyletic lineages, of both GlyRS and LysRS. Indeed, as suggested by the coevolution theory, since the genetic code was structured by biosynthetic relationships between amino acids and as these occurred on tRNA-like molecules which were evidently already loaded with amino acids during its structuring, this made possible a late origin of ARSs. All this corroborates the coevolution theory of the origin of the genetic code to the detriment of theories which would instead predict an early intervention of the action of ARSs in organizing the genetic code. Furthermore, the assembly of the GlyRS and LysRS protein domains in main phyletic lineages is itself at least evidence of the possibility that ancestral genes were assembled using pieces of genetic material that coded these protein domains. This is in accordance with the exon theory of genes which postulates that ancestral exons coded for protein domains or modules that were assembled to form the first genes. This theory is exemplified precisely in the evolution of both GlyRS and LysRS which occurred through the assembly of protein domains in the main phyletic lineages, as analyzed here. Furthermore, this late assembly of protein domains of these proteins into the two main phyletic lineages, i.e. a polyphyletic origin of both GlyRS and LysRS, appears to corroborate the progenote evolutionary stage for both LUCA and at least the first part of the evolutionary stages of the ancestor of bacteria and that of archaea. Indeed, this polyphyletic origin would imply that the genetic code was still evolving because at least two ARSs, i.e. proteins that make the genetic code possible today, were still evolving. This would imply that the evolutionary stages involved were characterized not by cells but by protocells, that is, by progenotes because this is precisely the definition of a progenote. This conclusion would be strengthened by the observation that both GlyRS and LysRS originating in the phyletic lineages leading to bacteria and archaea, would demonstrate that, more generally, proteins were most likely still in rapid and progressive evolution. Namely, a polyphyletic origin of proteins which would qualify at least the initial phase of the evolutionary stage of the ancestor of bacteria and that of archaea as stages belonging to the progenote.
Collapse
Affiliation(s)
- Massimo Di Giulio
- The Ionian School, Early Evolution of Life Department, Genetic Code and tRNA Origin Laboratory, Via Roma 19, 67030, Alfedena, L'Aquila, Italy.
| |
Collapse
|
2
|
Di Giulio M. Theories of the origin of the genetic code: Strong corroboration for the coevolution theory. Biosystems 2024; 239:105217. [PMID: 38663520 DOI: 10.1016/j.biosystems.2024.105217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2024] [Revised: 04/16/2024] [Accepted: 04/18/2024] [Indexed: 04/29/2024]
Abstract
I analyzed all the theories and models of the origin of the genetic code, and over the years, I have considered the main suggestions that could explain this origin. The conclusion of this analysis is that the coevolution theory of the origin of the genetic code is the theory that best captures the majority of observations concerning the organization of the genetic code. In other words, the biosynthetic relationships between amino acids would have heavily influenced the origin of the organization of the genetic code, as supported by the coevolution theory. Instead, the presence in the genetic code of physicochemical properties of amino acids, which have also been linked to the physicochemical properties of anticodons or codons or bases by stereochemical and physicochemical theories, would simply be the result of natural selection. More explicitly, I maintain that these correlations between codons, anticodons or bases and amino acids are in fact the result not of a real correlation between amino acids and codons, for example, but are only the effect of the intervention of natural selection. Specifically, in the genetic code table we expect, for example, that the most similar codons - that is, those that differ by only one base - will have more similar physicochemical properties. Therefore, the 64 codons of the genetic code table ordered in a certain way would also represent an ordering of some of their physicochemical properties. Now, a study aimed at clarifying which physicochemical property of amino acids has influenced the allocation of amino acids in the genetic code has established that the partition energy of amino acids has played a role decisive in this. Indeed, under some conditions, the genetic code was found to be approximately 98% optimized on its columns. In this same work, it was shown that this was most likely the result of the action of natural selection. If natural selection had truly allocated the amino acids in the genetic code in such a way that similar amino acids also have similar codons - this, not through a mechanism of physicochemical interaction between, for example, codons and amino acids - then it might turn out that even different physicochemical properties of codons (or anticodons or bases) show some correlation with the physicochemical properties of amino acids, simply because the partition energy of amino acids is correlated with other physicochemical properties of amino acids. It is very likely that this would inevitably lead to a correlation between codons (or anticodons or bases) and amino acids. In other words, since the codons (anticodons or bases) are ordered in the genetic code, that is to say, some of their physicochemical properties should also be ordered by a similar order, and given that the amino acids would also appear to have been ordered in the genetic code by selection natural, then it should inevitably turn out that there is a correlation between, for example, the hydrophobicity of anticodons and that of amino acids. Instead, the intervention of natural selection in organizing the genetic code would appear to be highly compatible with the main mechanism of structuring the genetic code as supported by the coevolution theory. This would make the coevolution theory the only plausible explanation for the origin of the genetic code.
Collapse
Affiliation(s)
- Massimo Di Giulio
- The Ionian School, Early Evolution of Life Department, Genetic Code and tRNA Origin Laboratory, Via Roma 19, 67030, Alfedena, L'Aquila, Italy.
| |
Collapse
|
3
|
Abstract
Wobble coding is inevitable during evolution of the Standard Genetic Code (SGC). It ultimately splits half of NN U/C/A/G coding boxes with different assignments. Further, it contributes to pervasive SGC order by reinforcing close spacing for identical SGC assignments. But wobble cannot appear too soon, or it will inhibit encoding and more decisively, obstruct evolution of full coding tables. However, these prior results assumed Crick wobble, NN U/C and NN A/G, read by a single adaptor RNA. Superwobble translates NN U/C/A/G codons, using one adaptor RNA with an unmodified 5' anticodon U (appropriate to earliest coding) in modern mitochondria, plastids, and mycoplasma. Assuming the SGC was selected when evolving codes most resembled it, characteristics of the critical selection events can be calculated. For example, continuous superwobble infrequently evolves SGC-like coding tables. So, continuous superwobble is a very improbable origin hypothesis. In contrast, late-arising superwobble shares late Crick wobble's frequent resemblance to SGC order. Thus late superwobble is possible, but yields SGC-like assignments less frequently than late Crick wobble. Ancient coding ambiguity, most simply, arose from Crick wobble alone. This is consistent with SGC assignments to NAN codons.
Collapse
Affiliation(s)
- Michael Yarus
- Department of Molecular, Cellular and Developmental Biology, University of Colorado Boulder, Boulder, CO, 80309-0347, USA.
| |
Collapse
|
4
|
Abstract
Diverse models have been advanced for the evolution of the genetic code. Here, models for tRNA, aminoacyl-tRNA synthetase (aaRS) and genetic code evolution were combined with an understanding of EF-Tu suppression of tRNA 3rd anticodon position wobbling. The result is a highly detailed scheme that describes the placements of all amino acids in the standard genetic code. The model describes evolution of 6-, 4-, 3-, 2- and 1-codon sectors. Innovation in column 3 of the code is explained. Wobbling and code degeneracy are explained. Separate distribution of serine sectors between columns 2 and 4 of the code is described. We conclude that very little chaos contributed to evolution of the genetic code and that the pattern of evolution of aaRS enzymes describes a history of the evolution of the code. A model is proposed to describe the biological selection for the earliest evolution of the code and for protocell evolution.
Collapse
Affiliation(s)
- Lei Lei
- Department of Biology, University of New England, Biddeford, ME, USA
| | - Zachary Frome Burton
- Department of Biochemistry and Molecular Biology, Michigan State University, E. Lansing, MI, USA
| |
Collapse
|
5
|
A search for the physical basis of the genetic code. Biosystems 2020; 195:104148. [DOI: 10.1016/j.biosystems.2020.104148] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2020] [Revised: 04/09/2020] [Accepted: 04/09/2020] [Indexed: 01/01/2023]
|
6
|
The phylogenetic distribution of the glutaminyl-tRNA synthetase and Glu-tRNA Gln amidotransferase in the fundamental lineages would imply that the ancestor of archaea, that of eukaryotes and LUCA were progenotes. Biosystems 2020; 196:104174. [PMID: 32535177 DOI: 10.1016/j.biosystems.2020.104174] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2020] [Revised: 05/25/2020] [Accepted: 05/25/2020] [Indexed: 12/21/2022]
Abstract
The function of the glutaminyl-tRNA synthetase and Glu-tRNAGln amidotransferase might be related to the origin of the genetic code because, for example, glutaminyl-tRNA synthetase catalyses the fundamental reaction that makes the genetic code. If the evolutionary stage of the origin of these two enzymes could be unambiguously identified, then the genetic code should still have been originating at that particular evolutionary stage because the fundamental reaction that makes the code itself was still evidently evolving. This would result in that particular evolutionary moment being attributed to the evolutionary stage of the progenote because it would have a relationship between the genotype and the phenotype not yet fully realized because the genetic code was precisely still originating. I then analyzed the distribution of the glutaminyl-tRNA synthetase and Glu-tRNAGln aminodotrasferase in the main phyletic lineages. Since in some cases the origin of these two enzymes can be related to the evolutionary stages of ancestors of archaea and eukaryotes, this would indicate these ancestors as progenotes because at that evolutionary moment the genetic code was evidently still evolving, thus realizing the definition of progenote. The conclusion that the ancestor of archaea and that of eukaryotes were progenotes would imply that even the last universal common ancestor (LUCA) was a progenote because it appeared, on the tree of life, temporally before these ancestors.
Collapse
|
7
|
Rakočević MM. The Cipher of the Genetic Code. Biosystems 2018; 171:31-47. [PMID: 29870756 DOI: 10.1016/j.biosystems.2018.05.009] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2018] [Revised: 05/14/2018] [Accepted: 05/29/2018] [Indexed: 10/14/2022]
Abstract
A new approach to understanding of the genetic code is developed. In order to overcome the key paradox (and Darwinian selection problem) that the highly complex amino acid Phe is encoded by the simplest codons (UUY), and the simplest Gly encoded by the most complex codons (GGN); as well as the paradox of the duplication of some amino acids in the encoding process (Leu, Ser, Arg), we proposed an extension of the notion (and concept) of genetic code. For a better (and lighter) understanding of genetic coding, we proposed a hypothesis after that (under the conditions of allowed metaphoricity and modeling in biology) genetic code has to be understood, analogously in cryptology, as the unity of three entities: the code, the cipher of the code and the key of the cipher. In this hierarchy the term (and notion) "genetic code" remains what has been from the beginning: a connection between four-letter alphabet (four Py-Pu nucleotides, in form of codons) and a twenty-letter alphabet (twenty amino acids); the cipher is a specific chemical complementarity in chemical properties of molecules in the form: similarity in dissimilarity versus dissimilarity in similarity ("Sim in Diss vs Diss in Sim") and the key of cipher: the complementarity on the binary tree of the genetic code in the form: 0-15, 1-14, 2-13, …, 6-9, 7-8. These concepts improve understanding that within the two main Genetic Code Tables (of the nucleotide doublets and nucleotide Triplets) exists a sophisticated nuancing and balancing in the properties of the constituents of GC, including the balance of the number of molecules, atoms, and nucleons.
Collapse
|
8
|
Froese T, Campos JI, Fujishima K, Kiga D, Virgo N. Horizontal transfer of code fragments between protocells can explain the origins of the genetic code without vertical descent. Sci Rep 2018; 8:3532. [PMID: 29476089 PMCID: PMC5824800 DOI: 10.1038/s41598-018-21973-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2018] [Accepted: 02/14/2018] [Indexed: 11/09/2022] Open
Abstract
Theories of the origin of the genetic code typically appeal to natural selection and/or mutation of hereditable traits to explain its regularities and error robustness, yet the present translation system presupposes high-fidelity replication. Woese's solution to this bootstrapping problem was to assume that code optimization had played a key role in reducing the effect of errors caused by the early translation system. He further conjectured that initially evolution was dominated by horizontal exchange of cellular components among loosely organized protocells ("progenotes"), rather than by vertical transmission of genes. Here we simulated such communal evolution based on horizontal transfer of code fragments, possibly involving pairs of tRNAs and their cognate aminoacyl tRNA synthetases or a precursor tRNA ribozyme capable of catalysing its own aminoacylation, by using an iterated learning model. This is the first model to confirm Woese's conjecture that regularity, optimality, and (near) universality could have emerged via horizontal interactions alone.
Collapse
Affiliation(s)
- Tom Froese
- Institute for Applied Mathematics and Systems Research (IIMAS), National Autonomous University of Mexico (UNAM), Mexico City, 04510, Mexico. .,Center for the Sciences of Complexity (C3), National Autonomous University of Mexico (UNAM), Mexico City, 04510, Mexico.
| | - Jorge I Campos
- Center for the Sciences of Complexity (C3), National Autonomous University of Mexico (UNAM), Mexico City, 04510, Mexico.,Faculty of Higher Education Aragon, National Autonomous University of Mexico (UNAM), Nezahualcoyotl City, State of Mexico, 57130, Mexico
| | - Kosuke Fujishima
- Earth-Life Science Institute, Tokyo Institute of Technology, Meguro-ku, Tokyo, 152-8550, Japan.,Institute for Advanced Biosciences, Keio University, Tsuruoka, 9970035, Japan
| | - Daisuke Kiga
- Faculty of Science and Engineering, School of Advanced Science and Engineering, Waseda University, Shinjuku, Tokyo, 169-8555, Japan
| | - Nathaniel Virgo
- Earth-Life Science Institute, Tokyo Institute of Technology, Meguro-ku, Tokyo, 152-8550, Japan
| |
Collapse
|
9
|
Di Giulio M. The aminoacyl-tRNA synthetases had only a marginal role in the origin of the organization of the genetic code: Evidence in favor of the coevolution theory. J Theor Biol 2017; 432:14-24. [DOI: 10.1016/j.jtbi.2017.08.005] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2017] [Revised: 08/01/2017] [Accepted: 08/03/2017] [Indexed: 10/19/2022]
|
10
|
Nemzer LR. Shannon information entropy in the canonical genetic code. J Theor Biol 2017; 415:158-170. [DOI: 10.1016/j.jtbi.2016.12.010] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2016] [Revised: 11/30/2016] [Accepted: 12/12/2016] [Indexed: 11/15/2022]
|
11
|
Fang P, Guo M. Evolutionary Limitation and Opportunities for Developing tRNA Synthetase Inhibitors with 5-Binding-Mode Classification. Life (Basel) 2015; 5:1703-25. [PMID: 26670257 PMCID: PMC4695845 DOI: 10.3390/life5041703] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2015] [Revised: 11/24/2015] [Accepted: 11/25/2015] [Indexed: 12/30/2022] Open
Abstract
Aminoacyl-tRNA synthetases (aaRSs) are enzymes that catalyze the transfer of amino acids to their cognate tRNAs as building blocks for translation. Each of the aaRS families plays a pivotal role in protein biosynthesis and is indispensable for cell growth and survival. In addition, aaRSs in higher species have evolved important non-translational functions. These translational and non-translational functions of aaRS are attractive for developing antibacterial, antifungal, and antiparasitic agents and for treating other human diseases. The interplay between amino acids, tRNA, ATP, EF-Tu and non-canonical binding partners, had shaped each family with distinct pattern of key sites for regulation, with characters varying among species across the path of evolution. These sporadic variations in the aaRSs offer great opportunity to target these essential enzymes for therapy. Up to this day, growing numbers of aaRS inhibitors have been discovered and developed. Here, we summarize the latest developments and structural studies of aaRS inhibitors, and classify them with distinct binding modes into five categories.
Collapse
Affiliation(s)
- Pengfei Fang
- State Key Laboratory of Bioorganic and Natural Products Chemistry, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 Lingling Road, Shanghai 200032, China.
- Department of Cancer Biology, The Scripps Research Institute, Scripps Florida, 130 Scripps Way, Jupiter, FL 33458, USA.
| | - Min Guo
- Department of Cancer Biology, The Scripps Research Institute, Scripps Florida, 130 Scripps Way, Jupiter, FL 33458, USA.
| |
Collapse
|
12
|
Abstract
The impressive body of work on the major evolutionary transitions in the last 20 y calls for a reconstruction of the theory although a 2D account (evolution of informational systems and transitions in individuality) remains. Significant advances include the concept of fraternal and egalitarian transitions (lower-level units like and unlike, respectively). Multilevel selection, first without, then with, the collectives in focus is an important explanatory mechanism. Transitions are decomposed into phases of origin, maintenance, and transformation (i.e., further evolution) of the higher level units, which helps reduce the number of transitions in the revised list by two so that it is less top-heavy. After the transition, units show strong cooperation and very limited realized conflict. The origins of cells, the emergence of the genetic code and translation, the evolution of the eukaryotic cell, multicellularity, and the origin of human groups with language are reconsidered in some detail in the light of new data and considerations. Arguments are given why sex is not in the revised list as a separate transition. Some of the transitions can be recursive (e.g., plastids, multicellularity) or limited (transitions that share the usual features of major transitions without a massive phylogenetic impact, such as the micro- and macronuclei in ciliates). During transitions, new units of reproduction emerge, and establishment of such units requires high fidelity of reproduction (as opposed to mere replication).
Collapse
Affiliation(s)
- Eörs Szathmáry
- Center for the Conceptual Foundations of Science, Parmenides Foundation, D-82049 Munich, Germany; Department of Plant Systematics, Ecology and Theoretical Biology, Biological Institute, Eötvös University, H-1117 Budapest, Hungary; and MTA-ELTE Theoretical Biology and Evolutionary Ecology Research Group, H-1117 Budapest, Hungary
| |
Collapse
|
13
|
Ancestral Reconstruction of a Pre-LUCA Aminoacyl-tRNA Synthetase Ancestor Supports the Late Addition of Trp to the Genetic Code. J Mol Evol 2015; 80:171-85. [PMID: 25791872 DOI: 10.1007/s00239-015-9672-1] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2015] [Accepted: 03/09/2015] [Indexed: 01/14/2023]
Abstract
The genetic code was likely complete in its current form by the time of the last universal common ancestor (LUCA). Several scenarios have been proposed for explaining the code's pre-LUCA emergence and expansion, and the relative order of the appearance of amino acids used in translation. One co-evolutionary model of genetic code expansion proposes that at least some amino acids were added to the code by the ancient divergence of aminoacyl-tRNA synthetase (aaRS) families. Of all the amino acids used within the genetic code, Trp is most frequently claimed as a relatively recent addition. We observe that, since TrpRS and TyrRS are paralogous protein families retaining significant sequence similarity, the inferred sequence composition of their ancestor can be used to evaluate this co-evolutionary model of genetic code expansion. We show that ancestral sequence reconstructions of the pre-LUCA paralog ancestor of TyrRS and TrpRS have several sites containing Tyr, yet a complete absence of sites containing Trp. This is consistent with the paralog ancestor being specific for the utilization of Tyr, with Trp being a subsequent addition to the genetic code facilitated by a process of aaRS divergence and neofunctionalization. Only after this divergence could Trp be specifically encoded and incorporated into proteins, including the TyrRS and TrpRS descendant lineages themselves. This early absence of Trp is observed under both homogeneous and non-homogeneous models of ancestral sequence reconstruction. Simulations support that this observed absence of Trp is unlikely to be due to chance or model bias. These results support that the final stages of genetic code evolution occurred well within the "protein world," and that the presence-absence of Trp within conserved sites of ancient protein domains is a likely measure of their relative antiquity, permitting the relative timing of extremely early events within protein evolution before LUCA.
Collapse
|
14
|
Salinas DG, Gallardo MO, Osorio MI. Probable relationship between partitions of the set of codons and the origin of the genetic code. Biosystems 2014; 117:77-81. [PMID: 24495914 DOI: 10.1016/j.biosystems.2014.01.007] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2013] [Revised: 12/26/2013] [Accepted: 01/24/2014] [Indexed: 11/16/2022]
Abstract
Here we study the distribution of randomly generated partitions of the set of amino acid-coding codons. Some results are an application from a previous work, about the Stirling numbers of the second kind and triplet codes, both to the cases of triplet codes having four stop codons, as in mammalian mitochondrial genetic code, and hypothetical doublet codes. Extending previous results, in this work it is found that the most probable number of blocks of synonymous codons, in a genetic code, is similar to the number of amino acids when there are four stop codons, as well as it could be for a primigenious doublet code. Also it is studied the integer partitions associated to patterns of synonymous codons and it is shown, for the canonical code, that the standard deviation inside an integer partition is one of the most probable. We think that, in some early epoch, the genetic code might have had a maximum of the disorder or entropy, independent of the assignment between codons and amino acids, reaching a state similar to "code freeze" proposed by Francis Crick. In later stages, maybe deterministic rules have reassigned codons to amino acids, forming the natural codes, such as the canonical code, but keeping the numerical features describing the set partitions and the integer partitions, like a "fossil numbers"; both kinds of partitions about the set of amino acid-coding codons.
Collapse
Affiliation(s)
- Dino G Salinas
- Centro de Investigación Biomédica, Facultad de Medicina, Universidad Diego Portales, Avda. Ejército 141, Santiago, Chile.
| | - Mauricio O Gallardo
- Centro de Investigación Biomédica, Facultad de Medicina, Universidad Diego Portales, Avda. Ejército 141, Santiago, Chile.
| | - Manuel I Osorio
- Centro de Investigación Biomédica, Facultad de Medicina, Universidad Diego Portales, Avda. Ejército 141, Santiago, Chile.
| |
Collapse
|
15
|
The protein invasion: a broad review on the origin of the translational system. J Mol Evol 2013; 77:185-96. [PMID: 24145863 DOI: 10.1007/s00239-013-9592-x] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2013] [Accepted: 10/12/2013] [Indexed: 12/25/2022]
Abstract
Translation, coded peptide synthesis, arguably exists at the heart of modern cellular life. By orchestrating an incredibly complex interaction between tRNAs, mRNAs, aaRSs, the ribosome, and numerous other small molecules, the translational system allows the interpretation of data in the form of DNA to create massively complex proteins which control and enact almost every cellular function. A natural question then, is how did this system evolve? Here we present a broad review of the existing theories of the last two decades on the origin of the translational system. We attempt to synthesize the wide variety of ideas as well as organize them into modular components, addressing the evolution of the peptide-RNA interaction, tRNA, mRNA, the ribosome, and the first proteins separately. We hope to provide both a comprehensive overview of the literature as well as a framework for future discussions and novel theories.
Collapse
|
16
|
The Biological “Invariant of Motion” vs. “Struggle for Life”? On the Possible Quantum Mechanical Origin and Evolution of Semiotic Controls in Biology. INFORMATION 2013. [DOI: 10.3390/info4040367] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
|
17
|
Rodin AS, Szathmáry E, Rodin SN. On origin of genetic code and tRNA before translation. Biol Direct 2011; 6:14. [PMID: 21342520 PMCID: PMC3050877 DOI: 10.1186/1745-6150-6-14] [Citation(s) in RCA: 73] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2010] [Accepted: 02/22/2011] [Indexed: 01/12/2023] Open
Abstract
BACKGROUND Synthesis of proteins is based on the genetic code - a nearly universal assignment of codons to amino acids (aas). A major challenge to the understanding of the origins of this assignment is the archetypal "key-lock vs. frozen accident" dilemma. Here we re-examine this dilemma in light of 1) the fundamental veto on "foresight evolution", 2) modular structures of tRNAs and aminoacyl-tRNA synthetases, and 3) the updated library of aa-binding sites in RNA aptamers successfully selected in vitro for eight amino acids. RESULTS The aa-binding sites of arginine, isoleucine and tyrosine contain both their cognate triplets, anticodons and codons. We have noticed that these cases might be associated with palindrome-dinucleotides. For example, one-base shift to the left brings arginine codons CGN, with CG at 1-2 positions, to the respective anticodons NCG, with CG at 2-3 positions. Formally, the concomitant presence of codons and anticodons is also expected in the reverse situation, with codons containing palindrome-dinucleotides at their 2-3 positions, and anticodons exhibiting them at 1-2 positions. A closer analysis reveals that, surprisingly, RNA binding sites for Arg, Ile and Tyr "prefer" (exactly as in the actual genetic code) the anticodon(2-3)/codon(1-2) tetramers to their anticodon(1-2)/codon(2-3) counterparts, despite the seemingly perfect symmetry of the latter. However, since in vitro selection of aa-specific RNA aptamers apparently had nothing to do with translation, this striking preference provides a new strong support to the notion of the genetic code emerging before translation, in response to catalytic (and possibly other) needs of ancient RNA life. Consistently with the pre-translation origin of the code, we propose here a new model of tRNA origin by the gradual, Fibonacci process-like, elongation of a tRNA molecule from a primordial coding triplet and 5'DCCA3' quadruplet (D is a base-determinator) to the eventual 76 base-long cloverleaf-shaped molecule. CONCLUSION Taken together, our findings necessarily imply that primordial tRNAs, tRNA aminoacylating ribozymes, and (later) the translation machinery in general have been co-evolving to ''fit'' the (likely already defined) genetic code, rather than the opposite way around. Coding triplets in this primal pre-translational code were likely similar to the anticodons, with second and third nucleotides being more important than the less specific first one. Later, when the code was expanding in co-evolution with the translation apparatus, the importance of 2-3 nucleotides of coding triplets "transferred" to the 1-2 nucleotides of their complements, thus distinguishing anticodons from codons. This evolutionary primacy of anticodons in genetic coding makes the hypothesis of primal stereo-chemical affinity between amino acids and cognate triplets, the hypothesis of coding coenzyme handles for amino acids, the hypothesis of tRNA-like genomic 3' tags suggesting that tRNAs originated in replication, and the hypothesis of ancient ribozymes-mediated operational code of tRNA aminoacylation not mutually contradicting but rather co-existing in harmony.
Collapse
Affiliation(s)
- Andrei S Rodin
- Human Genetics Center, School of Public Health, University of Texas, Houston, TX 77225, USA
- Collegium Budapest (Institute for Advanced Study), Szentháromság u. 2, H-1014 Budapest, Hungary
| | - Eörs Szathmáry
- Collegium Budapest (Institute for Advanced Study), Szentháromság u. 2, H-1014 Budapest, Hungary
- Parmenides Center for the Study of Thinking, Kirchplatz 1, D-82049 Munich/Pullach, Germany
- Institute of Biology, Eötvös University, 1c Pázmány Péter sétány, H-1117 Budapest, Hungary
| | - Sergei N Rodin
- Collegium Budapest (Institute for Advanced Study), Szentháromság u. 2, H-1014 Budapest, Hungary
- Department of Molecular and Cellular Biology, Beckman Research Institute of the City of Hope, Duarte, CA 91010, USA
| |
Collapse
|
18
|
|
19
|
Koonin EV, Novozhilov AS. Origin and evolution of the genetic code: the universal enigma. IUBMB Life 2009; 61:99-111. [PMID: 19117371 DOI: 10.1002/iub.146] [Citation(s) in RCA: 213] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
The genetic code is nearly universal, and the arrangement of the codons in the standard codon table is highly nonrandom. The three main concepts on the origin and evolution of the code are the stereochemical theory, according to which codon assignments are dictated by physicochemical affinity between amino acids and the cognate codons (anticodons); the coevolution theory, which posits that the code structure coevolved with amino acid biosynthesis pathways; and the error minimization theory under which selection to minimize the adverse effect of point mutations and translation errors was the principal factor of the code's evolution. These theories are not mutually exclusive and are also compatible with the frozen accident hypothesis, that is, the notion that the standard code might have no special properties but was fixed simply because all extant life forms share a common ancestor, with subsequent changes to the code, mostly, precluded by the deleterious effect of codon reassignment. Mathematical analysis of the structure and possible evolutionary trajectories of the code shows that it is highly robust to translational misreading but there are numerous more robust codes, so the standard code potentially could evolve from a random code via a short sequence of codon series reassignments. Thus, much of the evolution that led to the standard code could be a combination of frozen accident with selection for error minimization although contributions from coevolution of the code with metabolic pathways and weak affinities between amino acids and nucleotide triplets cannot be ruled out. However, such scenarios for the code evolution are based on formal schemes whose relevance to the actual primordial evolution is uncertain. A real understanding of the code origin and evolution is likely to be attainable only in conjunction with a credible scenario for the evolution of the coding principle itself and the translation system.
Collapse
Affiliation(s)
- Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA.
| | | |
Collapse
|
20
|
Jakó É, Ittzés P, Szenes Á, Kun Á, Szathmáry E, Pál G. In silico detection of tRNA sequence features characteristic to aminoacyl-tRNA synthetase class membership. Nucleic Acids Res 2007; 35:5593-609. [PMID: 17704131 PMCID: PMC2018626 DOI: 10.1093/nar/gkm598] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Aminoacyl tRNA synthetases (aaRS) are grouped into Class I and II based on primary and tertiary structure and enzyme properties suggesting two independent phylogenetic lineages. Analogously, tRNA molecules can also form two respective classes, based on the class membership of their corresponding aaRS. Although some aaRS–tRNA interactions are not extremely specific and require editing mechanisms to avoid misaminoacylation, most aaRS–tRNA interactions are rather stereospecific. Thus, class-specific aaRS features could be mirrored by class-specific tRNA features. However, previous investigations failed to detect conserved class-specific nucleotides. Here we introduce a discrete mathematical approach that evaluates not only class-specific ‘strictly present’, but also ‘strictly absent’ nucleotides. The disjoint subsets of these elements compose a unique partition, named extended consensus partition (ECP). By analyzing the ECP for both Class I and II tDNA sets from 50 (13 archaeal, 30 bacterial and 7 eukaryotic) species, we could demonstrate that class-specific tRNA sequence features do exist, although not in terms of strictly conserved nucleotides as it had previously been anticipated. This finding demonstrates that important information was hidden in tRNA sequences inaccessible for traditional statistical methods. The ECP analysis might contribute to the understanding of tRNA evolution and could enrich the sequence analysis tool repertoire.
Collapse
Affiliation(s)
- Éena Jakó
- Theoretical Biology and Ecology Research Group of the Hungarian Academy of Sciences, Department of Plant Taxonomy and Ecology, eScience Regional Knowledge Center, at Eötvös Loránd University, Collegium Budapest, Institute for Advanced Study, Budapest, Hungary, Department of Biochemistry and Department of Plant Taxonomy and Ecology, Eötvös Loránd University, Budapest, Hungary
| | - Péter Ittzés
- Theoretical Biology and Ecology Research Group of the Hungarian Academy of Sciences, Department of Plant Taxonomy and Ecology, eScience Regional Knowledge Center, at Eötvös Loránd University, Collegium Budapest, Institute for Advanced Study, Budapest, Hungary, Department of Biochemistry and Department of Plant Taxonomy and Ecology, Eötvös Loránd University, Budapest, Hungary
| | - Áron Szenes
- Theoretical Biology and Ecology Research Group of the Hungarian Academy of Sciences, Department of Plant Taxonomy and Ecology, eScience Regional Knowledge Center, at Eötvös Loránd University, Collegium Budapest, Institute for Advanced Study, Budapest, Hungary, Department of Biochemistry and Department of Plant Taxonomy and Ecology, Eötvös Loránd University, Budapest, Hungary
| | - Ádám Kun
- Theoretical Biology and Ecology Research Group of the Hungarian Academy of Sciences, Department of Plant Taxonomy and Ecology, eScience Regional Knowledge Center, at Eötvös Loránd University, Collegium Budapest, Institute for Advanced Study, Budapest, Hungary, Department of Biochemistry and Department of Plant Taxonomy and Ecology, Eötvös Loránd University, Budapest, Hungary
| | - Eörs Szathmáry
- Theoretical Biology and Ecology Research Group of the Hungarian Academy of Sciences, Department of Plant Taxonomy and Ecology, eScience Regional Knowledge Center, at Eötvös Loránd University, Collegium Budapest, Institute for Advanced Study, Budapest, Hungary, Department of Biochemistry and Department of Plant Taxonomy and Ecology, Eötvös Loránd University, Budapest, Hungary
| | - Gábor Pál
- Theoretical Biology and Ecology Research Group of the Hungarian Academy of Sciences, Department of Plant Taxonomy and Ecology, eScience Regional Knowledge Center, at Eötvös Loránd University, Collegium Budapest, Institute for Advanced Study, Budapest, Hungary, Department of Biochemistry and Department of Plant Taxonomy and Ecology, Eötvös Loránd University, Budapest, Hungary
- *To whom correspondence should be addressed. +36 1 2090555/8577+36 1 3812172
| |
Collapse
|
21
|
Shimizu H, Yokobori SI, Ohkuri T, Yokogawa T, Nishikawa K, Yamagishi A. Extremely thermophilic translation system in the common ancestor commonote: ancestral mutants of Glycyl-tRNA synthetase from the extreme thermophile Thermus thermophilus. J Mol Biol 2007; 369:1060-9. [PMID: 17477933 DOI: 10.1016/j.jmb.2007.04.001] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2006] [Revised: 03/29/2007] [Accepted: 04/02/2007] [Indexed: 11/16/2022]
Abstract
Based on phylogenetic analysis of 16 S and 18 S rRNAs, the common ancestor of all organisms (Commonote) was proposed to be hyperthermophilic. We have previously tested this hypothesis using enzymes with ancestral residues that are inferred by molecular phylogenetic analysis. The ancestral mutant enzymes involved in metabolic systems show higher thermal stability than wild-type enzymes, consistent with the hyperthermophile common ancestor hypothesis. Here, we have extended the experiments to include an enzyme of the translation system, glycyl-tRNA synthetase (GlyRS). The translation system often shows a phylogenetic tree that is similar to the rRNA tree. Thus, it is likely that the tree represents the evolutionary route of the organisms. The maximum-likelihood tree of alpha(2) type GlyRS was constructed. From this analysis the ancestral sequence of GlyRS was deduced and individual or pairs of ancestral residues were introduced into Thermus thermophilus GlyRS. The ancestral mutants were expressed in Escherichia coli, purified and activity measured. The thermostability of eight mutated proteins was evaluated by CD (circular dichroism) measurements. Six mutants showed higher thermostability than wild-type enzyme and seven mutants showed higher activity than wild-type enzyme at 70 degrees C, suggesting an extremely thermophilic translation system in the common ancestor Commonote.
Collapse
Affiliation(s)
- Hideaki Shimizu
- Department of Molecular Biology, Tokyo University of Pharmacy and Life Science, 1432-1 Horinouchi, Hachioji, Tokyo 192-0392, Japan
| | | | | | | | | | | |
Collapse
|
22
|
Scott JH, O'Brien DM, Emerson D, Sun H, McDonald GD, Salgado A, Fogel ML. An examination of the carbon isotope effects associated with amino acid biosynthesis. ASTROBIOLOGY 2006; 6:867-80. [PMID: 17155886 DOI: 10.1089/ast.2006.6.867] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/12/2023]
Abstract
Stable carbon isotope ratios (delta(13)C) were determined for alanine, proline, phenylalanine, valine, leucine, isoleucine, aspartate (aspartic acid and asparagine), glutamate (glutamic acid and glutamine), lysine, serine, glycine, and threonine from metabolically diverse microorganisms. The microorganisms examined included fermenting bacteria, organotrophic, chemolithotrophic, phototrophic, methylotrophic, methanogenic, acetogenic, acetotrophic, and naturally occurring cryptoendolithic communities from the Dry Valleys of Antarctica. Here we demonstrated that reactions involved in amino acid biosynthesis can be used to distinguish amino acids formed by life from those formed by nonbiological processes. The unique patterns of delta(13)C imprinted by life on amino acids produced a biological bias. We also showed that, by applying discriminant function analysis to the delta(13)C value of a pool of amino acids formed by biological activity, it was possible to identify key aspects of intermediary carbon metabolism in the microbial world. In fact, microorganisms examined in this study could be placed within one of three metabolic groups: (1) heterotrophs that grow by oxidizing compounds containing three or more carbon-to-carbon bonds (fermenters and organotrophs), (2) autotrophs that grow by taking up carbon dioxide (chemolitotrophs and phototrophs), and (3) acetoclastic microbes that grow by assimilation of formaldehyde or acetate (methylotrophs, methanogens, acetogens, and acetotrophs). Furthermore, we demonstrated that cryptoendolithic communities from Antarctica grouped most closely with the autotrophs, which indicates that the dominant metabolic pathways in these communities are likely those utilized for CO(2 )fixation. We propose that this technique can be used to determine the dominant metabolic types in a community and reveal the overall flow of carbon in a complex ecosystem.
Collapse
Affiliation(s)
- James H Scott
- Department of Earth Sciences, Dartmouth College, Hanover, New Hampshire 03755, USA.
| | | | | | | | | | | | | |
Collapse
|
23
|
Patel A. The triplet genetic code had a doublet predecessor. J Theor Biol 2005; 233:527-32. [PMID: 15748913 DOI: 10.1016/j.jtbi.2004.10.029] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2004] [Revised: 09/20/2004] [Accepted: 10/27/2004] [Indexed: 11/30/2022]
Abstract
Information theoretic analysis of genetic languages indicates that the naturally occurring 20 amino acids and the triplet genetic code arose by duplication of 10 amino acids of class-II and a doublet genetic code having codons NNY and anticodons GNN. Evidence for this scenario is presented based on the properties of aminoacyl-tRNA synthetases, amino acids and nucleotide bases.
Collapse
Affiliation(s)
- Apoorva Patel
- Centre for High Energy Physics and Supercomputer Education and Research Centre, Indian Institute of Science, Bangalore 560012, India.
| |
Collapse
|
24
|
Rakocević MM. A harmonic structure of the genetic code. J Theor Biol 2004; 229:221-34. [PMID: 15207477 DOI: 10.1016/j.jtbi.2004.03.017] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2003] [Revised: 03/19/2004] [Accepted: 03/26/2004] [Indexed: 10/26/2022]
Abstract
In this paper is presented a new, very harmonic structure of the genetic code (GC) within a system of "4 x 5" (and/or of "5 x 4") of amino acids (AAs) in two variants. In first variant, the five rows within the system start with one polar charged amino acid (AA) each, making first column, consisting from five polar charged AAs (D, R, K, H, E). Five polar non-charged AAs (N, P, Y, W, Q) follow, then five non-polar AAs as last column (A, L, F, V, I) and, finally, five polar or non-polar AAs, in a combination, as first to last column (A as non-polar; S, T as polar, and G, P as ambivalent AAs). A second variant is subsequent to this one-"4 x 5" system with five nitrogen AAs (K, R, P, H, W), five oxygen (D, E, Y, S, T), five solely carbon (A, L, F, V, I) and five "combined" AAs (G with hydrogen as side chain; C and M with carbon and sulfur; N and Q with carbon, oxygen and nitrogen). A strict balance of atom and nucleon number as well as molecule mass follows the classification in both system variants.
Collapse
Affiliation(s)
- Miloje M Rakocević
- Department of Chemistry, Faculty of Science, University of Nis, Cirila i Metodija 2, Serbia & Montenegro, Yugoslavia.
| |
Collapse
|
25
|
Klipcan L, Safro M. Amino acid biogenesis, evolution of the genetic code and aminoacyl-tRNA synthetases. J Theor Biol 2004; 228:389-96. [PMID: 15135037 DOI: 10.1016/j.jtbi.2004.01.014] [Citation(s) in RCA: 51] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2003] [Revised: 01/19/2004] [Accepted: 01/27/2004] [Indexed: 12/01/2022]
Abstract
The aminoacyl-tRNA synthetases (aaRSs) ensure the fidelity of the translation of the genetic code, covalently attaching appropriate amino acids to the corresponding nucleic acid adaptor molecules-tRNA. The fundamental role of aminoacylation reaction catalysed by aaRSs implies that representatives of the family are thought to be among the earliest proteins to appear. Based on sequence analysis and catalytic domain structure, aaRSs have been partitioned into two classes of 10 enzymes each. However, based on the structural and sequence data only, it will not be easily understood that the present partitioning is not governed by chance. Our findings suggest that organization of amino acid biosynthetic pathways and clustering of aaRSs into different classes are intimately related to one another. A plausible explanation for such a relationship is dictated by early link between aaRSs and amino acids biosynthetic proteins. The aaRSs catalytic cores are highly relevant to the ancient metabolic reactions, namely, amino acids and cofactors biosynthesis. In particular we show that class II aaRSs mostly associated with the primordial amino acids, while class I aaRSs are usually related to amino acids evolved lately. Reasoning from this we propose a possible chronology of genetic code evolution.
Collapse
Affiliation(s)
- Liron Klipcan
- Department of Structural Biology, Weizmann Institute of Science, Rehovot 76100, Israel
| | | |
Collapse
|
26
|
Cavalcanti AR, Ferreira R. On the relative content of G,C bases in codons of amino acids corresponding to class I and II aminoacyl-tRNA synthetases. ORIGINS LIFE EVOL B 2001; 31:257-69. [PMID: 11434105 DOI: 10.1023/a:1010639521100] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
We have analyzed the relative G,C content from protein coding regions of 530 organisms and found that the ratio of the G,C content of the codons of the amino acids corresponding to Class II and Class I aminoacyl-tRNA synthetases decreases in a statistically significant way from prokaryotes to animals. This can be interpreted assuming that an initial asymmetry between the G,C content of codons of Class I and II amino acids existed and has decreased in the course of evolution.
Collapse
Affiliation(s)
- A R Cavalcanti
- Department of Ecology and Evolution, University of Chicago, 1101 East 57th Street, 60637, Chicago, IL, USA.
| | | |
Collapse
|
27
|
Cavalcanti AR, Neto BD, Ferreira R. On the classes of aminoacyl-tRNA synthetases and the error minimization in the genetic code. J Theor Biol 2000; 204:15-20. [PMID: 10772845 DOI: 10.1006/jtbi.2000.1082] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
As a consequence of the existence of two classes of aminoacyl-tRNA synthetases (aaRSs), we defined two types of mutations: g (mutations that do not change the class of the involved amino acids) and u (those which change the class). We have found that the mean chemical distance resulting from g mutations is smaller than that corresponding to u mutations, indicating that g mutations are responsible for most of the known minimization of the genetic code. This supports models for the origin and evolution of the code, in which new amino acids were added after duplications or modification of existing aaRSs.
Collapse
Affiliation(s)
- A R Cavalcanti
- Departamento de Química Fundamental, Universidade Federal de Pernambuco, Recife, PE, 50670-901, Brazil.
| | | | | |
Collapse
|
28
|
Abstract
The evolutionary forces that produced the canonical genetic code before the last universal ancestor remain obscure. One hypothesis is that the arrangement of amino acid/codon assignments results from selection to minimize the effects of errors (e.g., mistranslation and mutation) on resulting proteins. If amino acid similarity is measured as polarity, the canonical code does indeed outperform most theoretical alternatives. However, this finding does not hold for other amino acid properties, ignores plausible restrictions on possible code structure, and does not address the naturally occurring nonstandard genetic codes. Finally, other analyses have shown that significantly better code structures are possible. Here, we show that if theoretically possible code structures are limited to reflect plausible biological constraints, and amino acid similarity is quantified using empirical data of substitution frequencies, the canonical code is at or very close to a global optimum for error minimization across plausible parameter space. This result is robust to variation in the methods and assumptions of the analysis. Although significantly better codes do exist under some assumptions, they are extremely rare and thus consistent with reports of an adaptive code: previous analyses which suggest otherwise derive from a misleading metric. However, all extant, naturally occurring, secondarily derived, nonstandard genetic codes do appear less adaptive. The arrangement of amino acid assignments to the codons of the standard genetic code appears to be a direct product of natural selection for a system that minimizes the phenotypic impact of genetic error. Potential criticisms of previous analyses appear to be without substance. That known variants of the standard genetic code appear less adaptive suggests that different evolutionary factors predominated before and after fixation of the canonical code. While the evidence for an adaptive code is clear, the process by which the code achieved this optimization requires further attention.
Collapse
Affiliation(s)
- S J Freeland
- Department of Ecology, Princeton University, University of Bath, Bath, England
| | | | | | | |
Collapse
|
29
|
Abstract
The genes of glutamyl- and prolyl-tRNA synthetases (GluRS and ProRS) are organized differently in the three kingdoms of the tree of life. In bacteria and archaea, distinct genes encode the two proteins. In several organisms from the eukaryotic phylum of coelomate metazoans, the two polypeptides are carried by a single polypeptide chain to form a bifunctional protein. The linker region is made of imperfectly repeated units also recovered as singular or plural elements connected as N-terminal or C-terminal polypeptide extensions in various eukaryotic aminoacyl-tRNA synthetases. Phylogenetic analysis points to the monophyletic origin of this polypeptide motif appended to six different members of the synthetase family, belonging to either of the two classes of aminoacyl-tRNA synthetases. In particular, the monospecific GluRS and ProRS from Caenorhabditis elegans, an acoelomate metazoan, exhibit this recurrent motif as a C-terminal or N-terminal appendage, respectively. Our analysis of the extant motifs suggests a possible series of events responsible for a gene fusion that gave rise to the bifunctional glutamyl-prolyl-tRNA synthetase through recombination between genomic sequences encoding the repeated units.
Collapse
Affiliation(s)
- E Berthonneau
- Laboratoire d'Enzymologie et Biochimie Structurales, C.N.R.S., 1 Avenue de la Terrasse, 91190, Gif-sur-Yvette, France
| | | |
Collapse
|
30
|
Lehmann J. Physico-chemical constraints connected with the coding properties of the genetic system. J Theor Biol 2000; 202:129-44. [PMID: 10640433 DOI: 10.1006/jtbi.1999.1045] [Citation(s) in RCA: 76] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
New insights on the origin of the genetic code, based on the analysis of the physico-chemical properties of its molecular constituents (RNA and amino acids), are reported in this paper. We point out a symmetry in the genetic code table and show that it can be explained by the nature of the anticodon-codon interaction. The importance of the strength of this interaction is examined and a correlation is found between the free-energy change (DeltaG(0)) of anticodon-codon association and the volume of the corresponding amino acids. This correlation is investigated in conjunction with the well-known one linking the hydrophobicity of the anticodons with that of the amino acids. We show that they can be considerated separately and that the energy vs. volume correlation may be explained by the process implicating the peptide bond formation between two successive amino acids during translation. This interpretation is supported by a statistical pattern of bases (purines or pyrimidines), observed in present coding genes, and by considerations involving the availability of the different kinds of amino acids. Finally, we try to explain the hydrophobicity correlation when reconstructing the events at the time of the so-called "RNA World". The whole of our investigation shows that the genetic code might be sufficiently robust to exist without the participation of pre-existing proteins, and that this robustness is a consequence of the physico-chemical properties of the four bases of the genetic system.
Collapse
Affiliation(s)
- J Lehmann
- Institut de Physique de la Matière Condensée, Lausanne, 1015, Switzerland.
| |
Collapse
|
31
|
Das GK, Bhattacharyya D, Burma DP. A possible mechanism of peptide bond formation on ribosome without mediation of peptidyl transferase. J Theor Biol 1999; 200:193-205. [PMID: 10504285 DOI: 10.1006/jtbi.1999.0987] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Ribosome, the ubiquitous organelle, is the site for protein synthesis in all types of cells. The consecutive peptide bonds are formed by the transpeptidation reaction between carboxyl group of peptidyl moiety and the amino group of the aminoacyl moiety. Both the moieties are attached to the appropiate tRNAs positioned on the ribosome at P and A sites, respectively, through codon-anticodon recognition directed by messenger RNA. The reaction seems to proceed by the nucleophillic attack of the amino group of the aminoacyl tRNA at the A site and on the carboxyl of the ester group of the tRNA at P-site of ribosome. The configuration of the carbon atom of the tetrahedral intermediate may be R or S depending on the direction of the nucleophillic attack. After selecting the favorable conformation of this tetrahedral intermediate quantum mechanical calculations have been carried out to determine the energy needed for its formation. A cyclic intermediate where 2'-OH of the ribose sugar of the P-site tRNA is a member of the ring can be formed from the tetrahedral intermediate. This cyclic intermediate produces a free tRNA and a tRNA attached to a planar peptide unit. Analysis of the energetics using semiempirical method for the formation of a cyclic intermediate indicates that the peptide bond formation through the tetrahedral intermediate in S configuration may not need assistance from any outside agent like an enzyme
Collapse
Affiliation(s)
- G K Das
- Department of Chemistry, Visva-Bharati, Santiniketan, 731235, India
| | | | | |
Collapse
|
32
|
Abstract
For the first time it is shown that the genetic code, as a binary code, is determined by Golden mean through the unity of the binary-code tree and the Farey tree, corresponding to the Watson-Crick table and Gray code model of the genetic code at the same time. On the other hand, the Golden mean relations also correspond with the essentiality-nonessentiality of protein amino acids, as well as with their splitting into two classes handled by two classes of aminoacyl-tRNA synthetases.
Collapse
Affiliation(s)
- M M Rakocević
- Department of Chemistry, Faculty of Science, University of Nis, Yugoslavia
| |
Collapse
|
33
|
Henikoff S, Greene EA, Pietrokovski S, Bork P, Attwood TK, Hood L. Gene families: the taxonomy of protein paralogs and chimeras. Science 1997; 278:609-14. [PMID: 9381171 DOI: 10.1126/science.278.5338.609] [Citation(s) in RCA: 287] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Ancient duplications and rearrangements of protein-coding segments have resulted in complex gene family relationships. Duplications can be tandem or dispersed and can involve entire coding regions or modules that correspond to folded protein domains. As a result, gene products may acquire new specificities, altered recognition properties, or modified functions. Extreme proliferation of some families within an organism, perhaps at the expense of other families, may correspond to functional innovations during evolution. The underlying processes are still at work, and the large fraction of human and other genomes consisting of transposable elements may be a manifestation of the evolutionary benefits of genomic flexibility.
Collapse
Affiliation(s)
- S Henikoff
- Fred Hutchinson Cancer Research Center and Howard Hughes Medical Institute, Seattle, WA 98109-1024, USA
| | | | | | | | | | | |
Collapse
|
34
|
Two classes of the aminoacyl-tRNA synthetases in correspondence with the Codon path cube. Bull Math Biol 1997. [DOI: 10.1007/bf02458423] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
35
|
Delarue M. Partition of aminoacyl-tRNA synthetases in two different structural classes dating back to early metabolism: implications for the origin of the genetic code and the nature of protein sequences. J Mol Evol 1995; 41:703-11. [PMID: 8587114 DOI: 10.1007/bf00173148] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
We describe, on the molecular level, a possible fuzzy and primordial translation apparatus capable of synthesizing polypeptides from nucleic acids in a world containing a mixture of coevolving molecules of RNA and proteins already arranged in metabolic cycles (including cofactors). Close attention is paid to template-free systems because they are believed to be the immediate ancestors of this primordial translation apparatus. The two classes of aminoacyl-tRNA synthetases (aaRSs), as seen today, are considered as the remnants of such a simple imprecise translation apparatus and are used as guidelines for the construction of the model. Earlier theoretical work by Bedian on a related system is invoked to show how specificity and stability could have been achieved automatically and rather quickly, starting from such an imprecise system, i.e., how the encoded synthesis of proteins could have appeared. Because of the binary nature of the underlying proto-code, the first genetically encoded proteins would then have been alternating copolymers with a high degree of degeneracy, but not random. Indeed, a clear signal for alternating hydrophobic and hydrophilic residues in present-day protein sequences can be detected. Later evolution of the genetic code would have proceeded along lines already discussed by Crick. However, in the initial stages, the translation apparatus proposed here is in fact very similar to the one postulated by Woese, only here it is given a molecular framework. This hypothesis departs from the paradigm of the RNA world in that it supposes that the origin of the genetic code occurred after the apparition of some functional (statistical) proteins first. Implications for protein design are also discussed.
Collapse
Affiliation(s)
- M Delarue
- Unité d'Immunologie Structurale, Institut Pasteur, Paris, France
| |
Collapse
|