1
|
Salman A, Biziaev N, Shuvalova E, Alkalaeva E. mRNA context and translation factors determine decoding in alternative nuclear genetic codes. Bioessays 2024; 46:e2400058. [PMID: 38724251 DOI: 10.1002/bies.202400058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2024] [Revised: 04/19/2024] [Accepted: 04/23/2024] [Indexed: 06/27/2024]
Abstract
The genetic code is a set of instructions that determine how the information in our genetic material is translated into amino acids. In general, it is universal for all organisms, from viruses and bacteria to humans. However, in the last few decades, exceptions to this rule have been identified both in pro- and eukaryotes. In this review, we discuss the 16 described alternative eukaryotic nuclear genetic codes and observe theories of their appearance in evolution. We consider possible molecular mechanisms that allow codon reassignment. Most reassignments in nuclear genetic codes are observed for stop codons. Moreover, in several organisms, stop codons can simultaneously encode amino acids and serve as termination signals. In this case, the meaning of the codon is determined by the additional factors besides the triplets. A comprehensive review of various non-standard coding events in the nuclear genomes provides a new insight into the translation mechanism in eukaryotes.
Collapse
Affiliation(s)
- Ali Salman
- Engelhardt Institute of Molecular Biology, the Russian Academy of Sciences, Moscow, Russia
| | - Nikita Biziaev
- Engelhardt Institute of Molecular Biology, the Russian Academy of Sciences, Moscow, Russia
| | - Ekaterina Shuvalova
- Engelhardt Institute of Molecular Biology, the Russian Academy of Sciences, Moscow, Russia
| | - Elena Alkalaeva
- Engelhardt Institute of Molecular Biology, the Russian Academy of Sciences, Moscow, Russia
| |
Collapse
|
2
|
Di Giulio M. Theories of the origin of the genetic code: Strong corroboration for the coevolution theory. Biosystems 2024; 239:105217. [PMID: 38663520 DOI: 10.1016/j.biosystems.2024.105217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2024] [Revised: 04/16/2024] [Accepted: 04/18/2024] [Indexed: 04/29/2024]
Abstract
I analyzed all the theories and models of the origin of the genetic code, and over the years, I have considered the main suggestions that could explain this origin. The conclusion of this analysis is that the coevolution theory of the origin of the genetic code is the theory that best captures the majority of observations concerning the organization of the genetic code. In other words, the biosynthetic relationships between amino acids would have heavily influenced the origin of the organization of the genetic code, as supported by the coevolution theory. Instead, the presence in the genetic code of physicochemical properties of amino acids, which have also been linked to the physicochemical properties of anticodons or codons or bases by stereochemical and physicochemical theories, would simply be the result of natural selection. More explicitly, I maintain that these correlations between codons, anticodons or bases and amino acids are in fact the result not of a real correlation between amino acids and codons, for example, but are only the effect of the intervention of natural selection. Specifically, in the genetic code table we expect, for example, that the most similar codons - that is, those that differ by only one base - will have more similar physicochemical properties. Therefore, the 64 codons of the genetic code table ordered in a certain way would also represent an ordering of some of their physicochemical properties. Now, a study aimed at clarifying which physicochemical property of amino acids has influenced the allocation of amino acids in the genetic code has established that the partition energy of amino acids has played a role decisive in this. Indeed, under some conditions, the genetic code was found to be approximately 98% optimized on its columns. In this same work, it was shown that this was most likely the result of the action of natural selection. If natural selection had truly allocated the amino acids in the genetic code in such a way that similar amino acids also have similar codons - this, not through a mechanism of physicochemical interaction between, for example, codons and amino acids - then it might turn out that even different physicochemical properties of codons (or anticodons or bases) show some correlation with the physicochemical properties of amino acids, simply because the partition energy of amino acids is correlated with other physicochemical properties of amino acids. It is very likely that this would inevitably lead to a correlation between codons (or anticodons or bases) and amino acids. In other words, since the codons (anticodons or bases) are ordered in the genetic code, that is to say, some of their physicochemical properties should also be ordered by a similar order, and given that the amino acids would also appear to have been ordered in the genetic code by selection natural, then it should inevitably turn out that there is a correlation between, for example, the hydrophobicity of anticodons and that of amino acids. Instead, the intervention of natural selection in organizing the genetic code would appear to be highly compatible with the main mechanism of structuring the genetic code as supported by the coevolution theory. This would make the coevolution theory the only plausible explanation for the origin of the genetic code.
Collapse
Affiliation(s)
- Massimo Di Giulio
- The Ionian School, Early Evolution of Life Department, Genetic Code and tRNA Origin Laboratory, Via Roma 19, 67030, Alfedena, L'Aquila, Italy.
| |
Collapse
|
3
|
Girard C. The tri-flow adaptiveness of codes in major evolutionary transitions. Biosystems 2024; 237:105133. [PMID: 38336225 DOI: 10.1016/j.biosystems.2024.105133] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Revised: 01/26/2024] [Accepted: 01/27/2024] [Indexed: 02/12/2024]
Abstract
Life codes increase in both number and variety with biological complexity. Although our knowledge of codes is constantly expanding, the evolutionary progression of organic, neural, and cultural codes in response to selection pressure remains poorly understood. Greater clarification of the selective mechanisms is achieved by investigating how major evolutionary transitions reduce spatiotemporal and energetic constraints on transmitting heritable code to offspring. Evolution toward less constrained flows is integral to enduring flow architecture everywhere, in both engineered and natural flow systems. Beginning approximately 4 billion years ago, the most basic level for transmitting genetic material to offspring was initiated by protocell division. Evidence from ribosomes suggests that protocells transmitted comma-free or circular codes, preceding the evolution of standard genetic code. This rudimentary information flow within protocells is likely to have first emerged within the geo-energetic and geospatial constraints of hydrothermal vents. A broad-gauged hypothesis is that major evolutionary transitions overcame such constraints with tri-flow adaptations. The interconnected triple flows incorporated energy-converting, spatiotemporal, and code-based informational dynamics. Such tri-flow adaptations stacked sequence splicing code on top of protein-DNA recognition code in eukaryotes, prefiguring the transition to sexual reproduction. Sex overcame the spatiotemporal-energetic constraints of binary fission with further code stacking. Examples are tubulin code and transcription initiation code in vertebrates. In a later evolutionary transition, language reduced metabolic-spatiotemporal constraints on inheritance by stacking phonetic, phonological, and orthographic codes. In organisms that reproduce sexually, each major evolutionary transition is shown to be a tri-flow adaptation that adds new levels of code-based informational exchange. Evolving biological complexity is also shown to increase the nongenetic transmissibility of code.
Collapse
Affiliation(s)
- Chris Girard
- Department of Global and Sociocultural Studies, Florida International University, Miami, FL 33199, United States.
| |
Collapse
|
4
|
Spirov A. Evolution of the RNA world: From signals to codes. Biosystems 2023; 234:105043. [PMID: 37852409 DOI: 10.1016/j.biosystems.2023.105043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2023] [Revised: 09/25/2023] [Accepted: 09/25/2023] [Indexed: 10/20/2023]
Abstract
The accumulated material in evolutionary biology, greatly enhanced by the achievements of modern synthetic biology, allows us to envision certain key hypothetical stages of prebiotic (chemical) evolution. This is often understood as the further evolution in the RNA World towards the RNA-protein World. It is a path towards the emergence of translation and the genetic code (I), signaling pathways with signaling molecules (II), and the appearance of RNA-based components of future gene regulatory networks (III). We believe that these evolutionary paths can be constructively viewed from the perspective of the concept of biological codes (Barbieri, 2003). Crucial evolutionary events in these directions would involve the emergence of RNA-based adaptors. Such adaptors connect two families of functionally and chemically distinct molecules into one functional entity. The emergence of primitive translation processes is undoubtedly the major milestone in the evolutionary path towards modern life. The key aspect here is the appearance of adaptors between amino acids and their cognate triplet codons. The initial steps are believed to involve the emergence of proto-transfer RNAs capable of self-aminoacylation. The second significant evolutionary breakthrough is the development of biochemical regulatory networks based on signaling molecules of the RNA World (ribonucleotides and their derivatives), as well as receptors and effectors (riboswitches) for these messengers. Some authors refer to this as the "lost language of the RNA World." The third evolutionary step is the emergence of signal sequences for ribozymes on the molecules of their RNA targets. This level of regulation in the RNA World is comparable to the gene regulatory networks of modern organisms. We believe that the signal sequences on target molecules have been rediscovered and developed by evolution into the gene regulatory networks of modern cells. In conclusion, the immense diversity of modern biological codes, in some of its key characteristics, can be traced back to the achievements of prebiotic evolution.
Collapse
Affiliation(s)
- Alexander Spirov
- The Institute of Scientific Information for Social Sciences RAS, Moscow, Russia.
| |
Collapse
|
5
|
Kak S. Self-similarity and the maximum entropy principle in the genetic code. Theory Biosci 2023; 142:205-210. [PMID: 37402087 DOI: 10.1007/s12064-023-00396-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2022] [Accepted: 06/16/2023] [Indexed: 07/05/2023]
Abstract
This paper addresses the relationship between information and structure of the genetic code. The code has two puzzling anomalies: First, when viewed as 64 sub-cubes of a [Formula: see text] cube, the codons for serine (S) are not contiguous, and there are amino acid codons with zero redundancy, which goes counter to the objective of error correction. To make sense of this, the paper shows that the genetic code must be viewed not only on stereochemical, co-evolution, and error-correction considerations, but also on two additional factors of significance to natural systems, that of an information-theoretic dimensionality of the code data, and the principle of maximum entropy. One implication of non-integer dimensionality associated with data dimensions is self-similarity to different scales, and it is shown that the genetic code does satisfy this property, and it is further shown that the maximum entropy principle operates through the scrambling of the elements in the sense of maximum algorithmic information complexity, generated by an appropriate exponentiation mapping. It is shown that the new considerations and the use of maximum entropy transformation create new constraints that are likely the reasons for the non-uniform codon groups and codons with no redundancy.
Collapse
Affiliation(s)
- Subhash Kak
- Chapman University, Orange, CA, 92866, USA.
- Oklahoma State University, Stillwater, OK, 74078, USA.
| |
Collapse
|
6
|
Gaydukova SA, Moldovan MA, Vallesi A, Heaphy SM, Atkins JF, Gelfand MS, Baranov PV. Nontriplet feature of genetic code in Euplotes ciliates is a result of neutral evolution. Proc Natl Acad Sci U S A 2023; 120:e2221683120. [PMID: 37216548 PMCID: PMC10235951 DOI: 10.1073/pnas.2221683120] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Accepted: 04/12/2023] [Indexed: 05/24/2023] Open
Abstract
The triplet nature of the genetic code is considered a universal feature of known organisms. However, frequent stop codons at internal mRNA positions in Euplotes ciliates ultimately specify ribosomal frameshifting by one or two nucleotides depending on the context, thus posing a nontriplet feature of the genetic code of these organisms. Here, we sequenced transcriptomes of eight Euplotes species and assessed evolutionary patterns arising at frameshift sites. We show that frameshift sites are currently accumulating more rapidly by genetic drift than they are removed by weak selection. The time needed to reach the mutational equilibrium is several times longer than the age of Euplotes and is expected to occur after a several-fold increase in the frequency of frameshift sites. This suggests that Euplotes are at an early stage of the spread of frameshifting in expression of their genome. In addition, we find the net fitness burden of frameshift sites to be noncritical for the survival of Euplotes. Our results suggest that fundamental genome-wide changes such as a violation of the triplet character of genetic code can be introduced and maintained solely by neutral evolution.
Collapse
Affiliation(s)
- Sofya A. Gaydukova
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow199911, Russia
| | - Mikhail A. Moldovan
- A. A. Kharkevich Institute for Information Transmission Problems RAS, Moscow127051, Russia
| | - Adriana Vallesi
- Laboratory of Eukaryotic Microbiology and Animal Biology, School of Biosciences and Veterinary Medicine, University of Camerino, Camerino62032, Italy
| | - Stephen M. Heaphy
- School of Biochemistry and Cell Biology, University College Cork, CorkT12 XF62, Ireland
| | - John F. Atkins
- School of Biochemistry and Cell Biology, University College Cork, CorkT12 XF62, Ireland
- Department of Human Genetics, University of Utah, Salt Lake City, UT84112
| | - Mikhail S. Gelfand
- A. A. Kharkevich Institute for Information Transmission Problems RAS, Moscow127051, Russia
| | - Pavel V. Baranov
- School of Biochemistry and Cell Biology, University College Cork, CorkT12 XF62, Ireland
| |
Collapse
|
7
|
Zagrovic B, Adlhart M, Kapral TH. Coding From Binding? Molecular Interactions at the Heart of Translation. Annu Rev Biophys 2023; 52:69-89. [PMID: 36626765 DOI: 10.1146/annurev-biophys-090622-102329] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
The mechanism and the evolution of DNA replication and transcription, the key elements of the central dogma of biology, are fundamentally well explained by the physicochemical complementarity between strands of nucleic acids. However, the determinants that have shaped the third part of the dogma-the process of biological translation and the universal genetic code-remain unclear. We review and seek parallels between different proposals that view the evolution of translation through the prism of weak, noncovalent interactions between biological macromolecules. In particular, we focus on a recent proposal that there exists a hitherto unrecognized complementarity at the heart of biology, that between messenger RNA coding regions and the proteins that they encode, especially if the two are unstructured. Reflecting the idea that the genetic code evolved from intrinsic binding propensities between nucleotides and amino acids, this proposal promises to forge a link between the distant past and the present of biological systems.
Collapse
Affiliation(s)
- Bojan Zagrovic
- Department of Structural and Computational Biology, Max Perutz Labs & University of Vienna, Vienna, Austria;
| | - Marlene Adlhart
- Department of Structural and Computational Biology, Max Perutz Labs & University of Vienna, Vienna, Austria;
| | - Thomas H Kapral
- Department of Structural and Computational Biology, Max Perutz Labs & University of Vienna, Vienna, Austria;
- Vienna BioCenter PhD Program, Doctoral School of the University of Vienna and Medical University of Vienna, Vienna, Austria
| |
Collapse
|
8
|
Di Giulio M. The error minimization of the genetic code would have been determined by natural selection and not by a neutral evolution. Biosystems 2023; 224:104838. [PMID: 36657560 DOI: 10.1016/j.biosystems.2023.104838] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Revised: 01/10/2023] [Accepted: 01/15/2023] [Indexed: 01/18/2023]
Abstract
I discuss the mechanisms by which the error minimization observed in the genetic code would have been produced; that is, the ability of the genetic code to buffer, for example, the deleterious effects of translation errors. Here, I analyse whether the error minimization was produced by the intervention of natural selection or whether it is an emergent, that is, neutral property; in other words, whether it is a by-product of another mechanism that was structuring the genetic code. In particular, I criticize Massey's simulations (2008) - favouring the neutral hypothesis - which, containing elements of natural selection, would render his conclusions at least partly tautological. Furthermore, I criticize some of Koonin's (2017) interpretations regarding Massey's simulations. Finally, I criticize the opinion of Janzen et al. (2022) according to which their self-aminoacylating ribozyme system would have been capable of generating an error minimization of the genetic code as its emergent property. That is to say, I criticize, more generally, a neutral origin of error minimization. Indeed, any mechanism for structuring the genetic code would be capable of generating, in theory, such an emergent property. The problem is that to demonstrate this, it would be necessary to show that the level of optimization achieved by the genetic code would be that expected under the neutral hypothesis, the one that Janzen et al. (2022) instead they did not make. Therefore, their view is only a hypothesis and is very far from being corroborated by their results. Instead, in the literature there is a strong evidence that the level of optimization achieved by the genetic code is so high that it would imply, per se, an intervention of natural selection in the origin of error minimization of the genetic code. On the other hand, this level of optimization would be very far from what might have been produced by a neutral process.
Collapse
Affiliation(s)
- Massimo Di Giulio
- The Ionian School, Early Evolution of Life Department, Genetic Code and tRNA Origin Laboratory, Via Roma 19, 67030, Alfedena, L'Aquila, Italy.
| |
Collapse
|
9
|
Harrison SA, Palmeira RN, Halpern A, Lane N. A biophysical basis for the emergence of the genetic code in protocells. BIOCHIMICA ET BIOPHYSICA ACTA. BIOENERGETICS 2022; 1863:148597. [PMID: 35868450 DOI: 10.1016/j.bbabio.2022.148597] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Revised: 06/27/2022] [Accepted: 07/13/2022] [Indexed: 11/17/2022]
Abstract
The origin of the genetic code is an abiding mystery in biology. Hints of a 'code within the codons' suggest biophysical interactions, but these patterns have resisted interpretation. Here, we present a new framework, grounded in the autotrophic growth of protocells from CO2 and H2. Recent work suggests that the universal core of metabolism recapitulates a thermodynamically favoured protometabolism right up to nucleotide synthesis. Considering the genetic code in relation to an extended protometabolism allows us to predict most codon assignments. We show that the first letter of the codon corresponds to the distance from CO2 fixation, with amino acids encoded by the purines (G followed by A) being closest to CO2 fixation. These associations suggest a purine-rich early metabolism with a restricted pool of amino acids. The second position of the anticodon corresponds to the hydrophobicity of the amino acid encoded. We combine multiple measures of hydrophobicity to show that this correlation holds strongly for early amino acids but is weaker for later species. Finally, we demonstrate that redundancy at the third position is not randomly distributed around the code: non-redundant amino acids can be assigned based on size, specifically length. We attribute this to additional stereochemical interactions at the anticodon. These rules imply an iterative expansion of the genetic code over time with codon assignments depending on both distance from CO2 and biophysical interactions between nucleotide sequences and amino acids. In this way the earliest RNA polymers could produce non-random peptide sequences with selectable functions in autotrophic protocells.
Collapse
Affiliation(s)
- Stuart A Harrison
- Department of Genetics, Evolution and Environment, University College London, Darwin Building, Gower Street, London WC1E 6BT, United Kingdom of Great Britain and Northern Ireland
| | - Raquel Nunes Palmeira
- Department of Genetics, Evolution and Environment, University College London, Darwin Building, Gower Street, London WC1E 6BT, United Kingdom of Great Britain and Northern Ireland
| | - Aaron Halpern
- Department of Genetics, Evolution and Environment, University College London, Darwin Building, Gower Street, London WC1E 6BT, United Kingdom of Great Britain and Northern Ireland
| | - Nick Lane
- Department of Genetics, Evolution and Environment, University College London, Darwin Building, Gower Street, London WC1E 6BT, United Kingdom of Great Britain and Northern Ireland.
| |
Collapse
|
10
|
Wong ML, Bartlett S. Asymptotic burnout and homeostatic awakening: a possible solution to the Fermi paradox? J R Soc Interface 2022; 19:20220029. [PMID: 35506212 PMCID: PMC9065981 DOI: 10.1098/rsif.2022.0029] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Previous studies show that city metrics having to do with growth, productivity and overall energy consumption scale superlinearly, attributing this to the social nature of cities. Superlinear scaling results in crises called ‘singularities’, where population and energy demand tend to infinity in a finite amount of time, which must be avoided by ever more frequent ‘resets’ or innovations that postpone the system's collapse. Here, we place the emergence of cities and planetary civilizations in the context of major evolutionary transitions. With this perspective, we hypothesize that once a planetary civilization transitions into a state that can be described as one virtually connected global city, it will face an ‘asymptotic burnout’, an ultimate crisis where the singularity-interval time scale becomes smaller than the time scale of innovation. If a civilization develops the capability to understand its own trajectory, it will have a window of time to affect a fundamental change to prioritize long-term homeostasis and well-being over unyielding growth—a consciously induced trajectory change or ‘homeostatic awakening’. We propose a new resolution to the Fermi paradox: civilizations either collapse from burnout or redirect themselves to prioritizing homeostasis, a state where cosmic expansion is no longer a goal, making them difficult to detect remotely.
Collapse
Affiliation(s)
- Michael L Wong
- Earth and Planets Laboratory, Carnegie Institution for Science, Washington, DC 20015, USA
| | - Stuart Bartlett
- Division of Geological and Planetary Sciences, California Institute of Technology, Pasadena, CA 91125, USA
| |
Collapse
|
11
|
Trinucleotide k-circular codes II: Biology. Biosystems 2022; 217:104668. [DOI: 10.1016/j.biosystems.2022.104668] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Revised: 02/27/2022] [Accepted: 03/16/2022] [Indexed: 11/22/2022]
|
12
|
Kondratyeva LG, Dyachkova MS, Galchenko AV. The Origin of Genetic Code and Translation in the Framework of Current Concepts on the Origin of Life. BIOCHEMISTRY. BIOKHIMIIA 2022; 87:150-169. [PMID: 35508902 DOI: 10.1134/s0006297922020079] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
The origin of genetic code and translation system is probably the central and most difficult problem in the investigations on the origin of life and one of the most complex problems in the evolutionary biology in general. There are multiple hypotheses on the emergence and development of existing genetic systems that propose the mechanisms for the origin and early evolution of genetic code, as well as for the emergence of replication and translation. Here, we discuss the most well-known of these hypotheses, although none of them provides a description of the early evolution of genetic systems without gaps and assumptions. The RNA world hypothesis is a currently prevailing scientific idea on the early evolution of biological and pre-biological structures, the main advantage of which is the assumption that RNAs as the first living systems were self-sufficient, i.e., capable of functioning as both catalysts and templates. However, this hypothesis has also significant limitations. In particular, no ribozymes with processive polymerase activity have been yet discovered or synthesized. Taking into account the mutual need of proteins and nucleic acids in each other in the current world, many authors propose the early evolution scenarios based on the co-evolution of these two classes of organic molecules. They postulate that the emergence of translation was necessary for the replication of nucleic acids, in contrast to the RNA world hypothesis, according to which the emergence of translation was preceded by the era of self-replicating RNAs. Although such scenarios are less parsimonious from the evolutionary point of view, since they require simultaneous emergence and evolution of two classes of organic molecules, as well as the emergence of synchronized replication and translation, their major advantage is that they explain the development of processive and much more accurate protein-dependent replication.
Collapse
Affiliation(s)
- Liya G Kondratyeva
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, 117997, Russia
| | | | - Alexey V Galchenko
- Peoples' Friendship University of Russia (RUDN University), Moscow, 117198, Russia.
| |
Collapse
|
13
|
Model of Genetic Code Structure Evolution under Various Types of Codon Reading. Int J Mol Sci 2022; 23:ijms23031690. [PMID: 35163612 PMCID: PMC8835785 DOI: 10.3390/ijms23031690] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2021] [Revised: 01/23/2022] [Accepted: 01/25/2022] [Indexed: 11/28/2022] Open
Abstract
The standard genetic code (SGC) is a set of rules according to which 64 codons are assigned to 20 canonical amino acids and stop coding signal. As a consequence, the SGC is redundant because there is a greater number of codons than the number of encoded labels. This redundancy implies the existence of codons that encode the same genetic information. The size and organization of such synonymous codon blocks are important characteristics of the SGC structure whose evolution is still unclear. Therefore, we studied possible evolutionary mechanisms of the codon block structure. We conducted computer simulations assuming that coding systems at early stages of the SGC evolution were sets of ambiguous codon assignments with high entropy. We included three types of reading systems characterized by different inaccuracy and pattern of codon recognition. In contrast to the previous study, we allowed for evolution of the reading systems and their competition. The simulations performed under minimization of translational errors and reduction of coding ambiguity produced the coding system resistant to these errors. The reading system similar to that present in the SGC dominated the others very quickly. The survived system was also characterized by low entropy and possessed properties similar to that in the SGC. Our simulation show that the unambiguous SGC could emerged from a code with a lower level of ambiguity and the number of tRNAs increased during the evolution.
Collapse
|
14
|
Furukawa R, Yokobori SI, Sato R, Kumagawa T, Nakagawa M, Katoh K, Yamagishi A. Amino Acid Specificity of Ancestral Aminoacyl-tRNA Synthetase Prior to the Last Universal Common Ancestor Commonote commonote. J Mol Evol 2022; 90:73-94. [PMID: 35084522 PMCID: PMC8821087 DOI: 10.1007/s00239-021-10043-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2021] [Accepted: 12/16/2021] [Indexed: 11/24/2022]
Abstract
Extant organisms commonly use 20 amino acids in protein synthesis. In the translation system, aminoacyl-tRNA synthetase (ARS) selectively binds an amino acid and transfers it to the cognate tRNA. It is postulated that the amino acid repertoire of ARS expanded during the development of the translation system. In this study we generated composite phylogenetic trees for seven ARSs (SerRS, ProRS, ThrRS, GlyRS-1, HisRS, AspRS, and LysRS) which are thought to have diverged by gene duplication followed by mutation, before the evolution of the last universal common ancestor. The composite phylogenetic tree shows that the AspRS/LysRS branch diverged from the other five ARSs at the deepest node, with the GlyRS/HisRS branch and the other three ARSs (ThrRS, ProRS and SerRS) diverging at the second deepest node. ThrRS diverged next, and finally ProRS and SerRS diverged from each other. Based on the phylogenetic tree, sequences of the ancestral ARSs prior to the evolution of the last universal common ancestor were predicted. The amino acid specificity of each ancestral ARS was then postulated by comparison with amino acid recognition sites of ARSs of extant organisms. Our predictions demonstrate that ancestral ARSs had substantial specificity and that the number of amino acid types amino-acylated by proteinaceous ARSs was limited before the appearance of a fuller range of proteinaceous ARS species. From an assumption that 10 amino acid species are required for folding and function, proteinaceous ARS possibly evolved in a translation system composed of preexisting ribozyme ARSs, before the evolution of the last universal common ancestor.
Collapse
Affiliation(s)
- Ryutaro Furukawa
- Department of Applied Life Sciences, School of Life Sciences, Tokyo University of Pharmacy and Life Sciences, 1432-1 Horinouchi, Hachioji, Tokyo, Japan.,Faculty of Human Science, Waseda University, 2-579-15 Mikajima, Tokorozawa, Saitama, 359-1192, Japan
| | - Shin-Ichi Yokobori
- Department of Applied Life Sciences, School of Life Sciences, Tokyo University of Pharmacy and Life Sciences, 1432-1 Horinouchi, Hachioji, Tokyo, Japan
| | - Riku Sato
- Department of Applied Life Sciences, School of Life Sciences, Tokyo University of Pharmacy and Life Sciences, 1432-1 Horinouchi, Hachioji, Tokyo, Japan
| | - Taimu Kumagawa
- Department of Applied Life Sciences, School of Life Sciences, Tokyo University of Pharmacy and Life Sciences, 1432-1 Horinouchi, Hachioji, Tokyo, Japan
| | - Mizuho Nakagawa
- Department of Applied Life Sciences, School of Life Sciences, Tokyo University of Pharmacy and Life Sciences, 1432-1 Horinouchi, Hachioji, Tokyo, Japan
| | - Kazutaka Katoh
- Department of Genome Informatics, Genome Information Research Center, Research Institute for Microbial Diseases, Osaka University, 3-1 Yamadaoka, Suita, Osaka, 565-0871, Japan
| | - Akihiko Yamagishi
- Department of Applied Life Sciences, School of Life Sciences, Tokyo University of Pharmacy and Life Sciences, 1432-1 Horinouchi, Hachioji, Tokyo, Japan.
| |
Collapse
|
15
|
Caldararo F, Di Giulio M. The genetic code is very close to a global optimum in a model of its origin taking into account both the partition energy of amino acids and their biosynthetic relationships. Biosystems 2022; 214:104613. [DOI: 10.1016/j.biosystems.2022.104613] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2021] [Revised: 01/16/2022] [Accepted: 01/17/2022] [Indexed: 01/23/2023]
|
16
|
Average and Standard Deviation of the Error Function for Random Genetic Codes with Standard Stop Codons. Acta Biotheor 2021; 70:7. [PMID: 34919168 DOI: 10.1007/s10441-021-09427-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2020] [Accepted: 09/27/2021] [Indexed: 10/19/2022]
Abstract
The origin of the genetic code has been attributed in part to an accidental assignment of codons to amino acids. Although several lines of evidence indicate the subsequent expansion and improvement of the genetic code, the hypothesis of Francis Crick concerning a frozen accident occurring at the early stage of genetic code evolution is still widely accepted. Considering Crick's hypothesis, mathematical descriptions of hypothetical scenarios involving a huge number of possible coexisting random genetic codes could be very important to explain the origin and evolution of a selected genetic code. This work aims to contribute in this regard, that is, it provides a theoretical framework in which statistical parameters of error functions are calculated. Given a genetic code and an amino acid property, the functional code robustness is estimated by means of a known error function. In this work, using analytical calculations, general expressions for the average and standard deviation of the error function distributions of completely random codes with standard stop codons were obtained. As a possible biological application of these results, any set of amino acids and any pure or mixed amino acid properties can be used in the calculations, such that, in case of having to select a set of amino acids to create a genetic code, possible advantages of natural selection of the genetic codes could be discussed.
Collapse
|
17
|
The Combinatorial Fusion Cascade to Generate the Standard Genetic Code. Life (Basel) 2021; 11:life11090975. [PMID: 34575125 PMCID: PMC8467831 DOI: 10.3390/life11090975] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2021] [Revised: 09/14/2021] [Accepted: 09/14/2021] [Indexed: 11/17/2022] Open
Abstract
Combinatorial fusion cascade was proposed as a transition stage between prebiotic chemistry and early forms of life. The combinatorial fusion cascade consists of three stages: eight initial complimentary pairs of amino acids, four protocodes, and the standard genetic code. The initial complimentary pairs and the protocodes are divided into dominant and recessive entities. The transitions between these stages obey the same combinatorial fusion rules for all amino acids. The combinatorial fusion cascade mathematically describes the codon assignments in the standard genetic code. It explains the availability of amino acids with the even and odd numbers of codons, the appearance of stop codons, inclusion of novel canonical amino acids, exceptional high numbers of codons for amino acids arginine, leucine, and serine, and the temporal order of amino acid inclusion into the genetic code. The temporal order of amino acids within the cascade is congruent with the consensus temporal order previously derived from the similarities between the available hypotheses. The control over the combinatorial fusion cascades would open the road for a novel technology to develop artificial microorganisms.
Collapse
|
18
|
'Whole Organism', Systems Biology, and Top-Down Criteria for Evaluating Scenarios for the Origin of Life. Life (Basel) 2021; 11:life11070690. [PMID: 34357062 PMCID: PMC8306273 DOI: 10.3390/life11070690] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Revised: 07/12/2021] [Accepted: 07/13/2021] [Indexed: 12/22/2022] Open
Abstract
While most advances in the study of the origin of life on Earth (OoLoE) are piecemeal, tested against the laws of chemistry and physics, ultimately the goal is to develop an overall scenario for life's origin(s). However, the dimensionality of non-equilibrium chemical systems, from the range of possible boundary conditions and chemical interactions, renders the application of chemical and physical laws difficult. Here we outline a set of simple criteria for evaluating OoLoE scenarios. These include the need for containment, steady energy and material flows, and structured spatial heterogeneity from the outset. The Principle of Continuity, the fact that all life today was derived from first life, suggests favoring scenarios with fewer non-analog (not seen in life today) to analog (seen in life today) transitions in the inferred first biochemical pathways. Top-down data also indicate that a complex metabolism predated ribozymes and enzymes, and that full cellular autonomy and motility occurred post-LUCA. Using these criteria, we find the alkaline hydrothermal vent microchamber complex scenario with a late evolving exploitation of the natural occurring pH (or Na+ gradient) by ATP synthase the most compelling. However, there are as yet so many unknowns, we also advocate for the continued development of as many plausible scenarios as possible.
Collapse
|
19
|
Rosandić M, Paar V. The novel Ideal Symmetry Genetic Code table - Common purine-pyrimidine symmetry net for all RNA and DNA species. J Theor Biol 2021; 524:110748. [PMID: 33933479 DOI: 10.1016/j.jtbi.2021.110748] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2020] [Revised: 04/22/2021] [Accepted: 04/27/2021] [Indexed: 11/29/2022]
Abstract
Ever since Nirenberg's discovery in 1961 in which codons code individual amino acids, numerous scientists searched for symmetries within the genetic code. The standard genetic code (SGC) table is an alphabetic artificial construct based on the U-C-A-G ordering of nucleotides without natural symmetries. Up to the present, complete symmetry in the genetic code has not been found, leaving doubt as to whether the symmetrical nature as the protector of order even exists. Our novel Ideal Symmetry Genetic Code (ISyGC) table reflects a unique fundamental physicochemical purine-pyrimidine symmetry net for all more than thirty known variations of nuclear and mitochondrial genetic codes. The nuclear genetic code for RNA and DNA viruses also contains the same purine-pyrimidine symmetry net. We show that the ISyGC table leads to automatic transformation into a DNA sequence akin to the 5'3 codon and 3'5 anticodon patterns. As a result of purine-pyrimidine symmetries between codons in the ISyGC table, algorithms of the first two bases as well of the third base of codons show how tRNA cognate anticodons can recognize synonymous codons during mRNA decoding. We show that the ISyGC purine-pyrimidine net with its physicochemical properties represents an evolutionary common "frozen accident" at the onset of each genetic code creation and RNA to DNA evolution. As such, during all of evolution the unique fundamental purine-pyrimidine symmetry net of all genetic codes remains unchangeable. In this way, evolution is a road paved with symmetries.
Collapse
Affiliation(s)
- Marija Rosandić
- University Hospital Centre Zagreb (ret.), Zagreb, Croatia; Croatian Academy of Sciences and Arts, 10000 Zagreb, Croatia.
| | - Vladimir Paar
- Croatian Academy of Sciences and Arts, 10000 Zagreb, Croatia; Department of Physics, Faculty of Science, University of Zagreb, Croatia.
| |
Collapse
|
20
|
Nesterov-Mueller A, Popov R, Seligmann H. Combinatorial Fusion Rules to Describe Codon Assignment in the Standard Genetic Code. Life (Basel) 2020; 11:life11010004. [PMID: 33374866 PMCID: PMC7824455 DOI: 10.3390/life11010004] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Revised: 12/15/2020] [Accepted: 12/21/2020] [Indexed: 11/16/2022] Open
Abstract
We propose combinatorial fusion rules that describe the codon assignment in the standard genetic code simply and uniformly for all canonical amino acids. These rules become obvious if the origin of the standard genetic code is considered as a result of a fusion of four protocodes: Two dominant AU and GC protocodes and two recessive AU and GC protocodes. The biochemical meaning of the fusion rules consists of retaining the complementarity between cognate codons of the small hydrophobic amino acids and large charged or polar amino acids within the protocodes. The proto tRNAs were assembled in form of two kissing hairpins with 9-base and 10-base loops in the case of dominant protocodes and two 9-base loops in the case of recessive protocodes. The fusion rules reveal the connection between the stop codons, the non-canonical amino acids, pyrrolysine and selenocysteine, and deviations in the translation of mitochondria. Using fusion rules, we predicted the existence of additional amino acids that are essential for the development of the standard genetic code. The validity of the proposed partition of the genetic code into dominant and recessive protocodes is considered referring to state-of-the-art hypotheses. The formation of two aminoacyl-tRNA synthetase classes is compatible with four-protocode partition.
Collapse
Affiliation(s)
- Alexander Nesterov-Mueller
- Institute of Microstructure Technology, Karlsruhe Institute of Technology (KIT), 76344 Eggenstein-Leopoldshafen, Germany; (R.P.); (H.S.)
- Correspondence:
| | - Roman Popov
- Institute of Microstructure Technology, Karlsruhe Institute of Technology (KIT), 76344 Eggenstein-Leopoldshafen, Germany; (R.P.); (H.S.)
| | - Hervé Seligmann
- Institute of Microstructure Technology, Karlsruhe Institute of Technology (KIT), 76344 Eggenstein-Leopoldshafen, Germany; (R.P.); (H.S.)
- The National Natural History Collections, The Hebrew University of Jerusalem, Jerusalem 91904, Israel
- Laboratory AGEIS EA 7407, Team Tools for e-GnosisMedical & LabcomCNRS/UGA/OrangeLabs Telecoms4Health, Faculty of Medicine, Université Grenoble Alpes, F-38700 La Tronche, France
| |
Collapse
|
21
|
Kunnev D. Origin of Life: The Point of No Return. Life (Basel) 2020; 10:life10110269. [PMID: 33153087 PMCID: PMC7693465 DOI: 10.3390/life10110269] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2020] [Revised: 11/01/2020] [Accepted: 11/01/2020] [Indexed: 12/13/2022] Open
Abstract
Origin of life research is one of the greatest scientific frontiers of mankind. Many hypotheses have been proposed to explain how life began. Although different hypotheses emphasize different initial phenomena, all of them agree around one important concept: at some point, along with the chain of events toward life, Darwinian evolution emerged. There is no consensus, however, how this occurred. Frequently, the mechanism leading to Darwinian evolution is not addressed and it is assumed that this problem could be solved later, with experimental proof of the hypothesis. Here, the author first defines the minimum components required for Darwinian evolution and then from this standpoint, analyzes some of the hypotheses for the origin of life. Distinctive features of Darwinian evolution and life rooted in the interaction between information and its corresponding structure/function are then reviewed. Due to the obligatory dependency of the information and structure subject to Darwinian evolution, these components must be locked in their origin. One of the most distinctive characteristics of Darwinian evolution in comparison with all other processes is the establishment of a fundamentally new level of matter capable of evolving and adapting. Therefore, the initiation of Darwinian evolution is the "point of no return" after which life begins. In summary: a definition and a mechanism for Darwinian evolution are provided together with a critical analysis of some of the hypotheses for the origin of life.
Collapse
Affiliation(s)
- Dimiter Kunnev
- Department of Oral Biology, University at Buffalo, Buffalo, NY 14263, USA
| |
Collapse
|
22
|
Morales JM, Michaelian K. Photon Dissipation as the Origin of Information Encoding in RNA and DNA. ENTROPY (BASEL, SWITZERLAND) 2020; 22:E940. [PMID: 33286709 PMCID: PMC7597208 DOI: 10.3390/e22090940] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/03/2020] [Revised: 08/07/2020] [Accepted: 08/13/2020] [Indexed: 11/16/2022]
Abstract
Ultraviolet light incident on organic material can initiate its spontaneous dissipative structuring into chromophores which can catalyze their own replication. This may have been the case for one of the most ancient of all chromophores dissipating the Archean UVC photon flux, the nucleic acids. Oligos of nucleic acids with affinity to particular amino acids which foment UVC photon dissipation would most efficiently catalyze their own reproduction and thus would have been selected through non-equilibrium thermodynamic imperatives which favor dissipation. Indeed, we show here that those amino acids with characteristics most relevant to fomenting UVC photon dissipation are precisely those with greatest chemical affinity to their codons or anticodons. This could provide a thermodynamic basis for the specificity in the amino acid-nucleic acid interaction and an explanation for the accumulation of information in nucleic acids since this information is relevant to the optimization of dissipation of the externally imposed thermodynamic potentials. The accumulation of information in this manner provides a link between evolution and entropy production.
Collapse
Affiliation(s)
- Julián Mejía Morales
- Postgrado in Physical Sciences, Instituto de Física, Universidad Nacional Autónoma de México, Cto. de la Investigación Científica, Cuidad Universitaria, Mexico City C.P. 04510, Mexico;
| | - Karo Michaelian
- Department of Nuclear Physics and Application of Radiation, Instituto de Física, Universidad Nacional Autónoma de México, Cto. de la Investigación Científica, Cuidad Universitaria, Mexico City C.P. 04510, Mexico
| |
Collapse
|
23
|
Fimmel E, Michel CJ, Pirot F, Sereni JS, Starman M, Strüngmann L. The Relation Between k-Circularity and Circularity of Codes. Bull Math Biol 2020; 82:105. [PMID: 32754878 PMCID: PMC7402406 DOI: 10.1007/s11538-020-00770-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2020] [Accepted: 06/24/2020] [Indexed: 12/15/2022]
Abstract
A code X is k-circular if any concatenation of at most k words from X, when read on a circle, admits exactly one partition into words from X. It is circular if it is k-circular for every integer k. While it is not a priori clear from the definition, there exists, for every pair \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$(n,\ell )$$\end{document}(n,ℓ), an integer k such that every k-circular \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\ell $$\end{document}ℓ-letter code over an alphabet of cardinality n is circular, and we determine the least such integer k for all values of n and \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\ell $$\end{document}ℓ. The k-circular codes may represent an important evolutionary step between the circular codes, such as the comma-free codes, and the genetic code.
Collapse
Affiliation(s)
- Elena Fimmel
- Institute of Mathematical Biology, Faculty for Computer Sciences, Mannheim University of Applied Sciences, 68163 Mannheim, Germany
| | - Christian J. Michel
- Theoretical Bioinformatics, ICube, C.N.R.S., University of Strasbourg, 300 Boulevard Sébastien Brant, 67400 Illkirch, France
| | - François Pirot
- Theoretical Bioinformatics, ICube, C.N.R.S., University of Strasbourg, 300 Boulevard Sébastien Brant, 67400 Illkirch, France
- LORIA (Orpailleur), C.N.R.S., University of Lorraine, INRIA, Campus scientifique, 54506 Vandœuvre-lès-Nancy Cedex, France
| | - Jean-Sébastien Sereni
- Theoretical Bioinformatics, ICube, C.N.R.S., University of Strasbourg, 300 Boulevard Sébastien Brant, 67400 Illkirch, France
| | - Martin Starman
- Institute of Mathematical Biology, Faculty for Computer Sciences, Mannheim University of Applied Sciences, 68163 Mannheim, Germany
- Theoretical Bioinformatics, ICube, C.N.R.S., University of Strasbourg, 300 Boulevard Sébastien Brant, 67400 Illkirch, France
| | - Lutz Strüngmann
- Institute of Mathematical Biology, Faculty for Computer Sciences, Mannheim University of Applied Sciences, 68163 Mannheim, Germany
| |
Collapse
|
24
|
Muchowska KB, Varma SJ, Moran J. Nonenzymatic Metabolic Reactions and Life's Origins. Chem Rev 2020; 120:7708-7744. [PMID: 32687326 DOI: 10.1021/acs.chemrev.0c00191] [Citation(s) in RCA: 110] [Impact Index Per Article: 27.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Prebiotic chemistry aims to explain how the biochemistry of life as we know it came to be. Most efforts in this area have focused on provisioning compounds of importance to life by multistep synthetic routes that do not resemble biochemistry. However, gaining insight into why core metabolism uses the molecules, reactions, pathways, and overall organization that it does requires us to consider molecules not only as synthetic end goals. Equally important are the dynamic processes that build them up and break them down. This perspective has led many researchers to the hypothesis that the first stage of the origin of life began with the onset of a primitive nonenzymatic version of metabolism, initially catalyzed by naturally occurring minerals and metal ions. This view of life's origins has come to be known as "metabolism first". Continuity with modern metabolism would require a primitive version of metabolism to build and break down ketoacids, sugars, amino acids, and ribonucleotides in much the same way as the pathways that do it today. This review discusses metabolic pathways of relevance to the origin of life in a manner accessible to chemists, and summarizes experiments suggesting several pathways might have their roots in prebiotic chemistry. Finally, key remaining milestones for the protometabolic hypothesis are highlighted.
Collapse
Affiliation(s)
| | - Sreejith J Varma
- University of Strasbourg, CNRS, ISIS UMR 7006, 67000 Strasbourg, France
| | - Joseph Moran
- University of Strasbourg, CNRS, ISIS UMR 7006, 67000 Strasbourg, France
| |
Collapse
|
25
|
Footprints of a Singular 22-Nucleotide RNA Ring at the Origin of Life. BIOLOGY 2020; 9:biology9050088. [PMID: 32344921 PMCID: PMC7285048 DOI: 10.3390/biology9050088] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/17/2020] [Revised: 04/06/2020] [Accepted: 04/19/2020] [Indexed: 11/17/2022]
Abstract
(1) Background: Previous experimental observations and theoretical hypotheses have been providing insight into a hypothetical world where an RNA hairpin or ring may have debuted as the primary informational and functional molecule. We propose a model revisiting the architecture of RNA-peptide interactions at the origin of life through the evolutionary dynamics of RNA populations. (2) Methods: By performing a step-by-step computation of the smallest possible hairpin/ring RNA sequences compatible with building up a variety of peptides of the primitive network, we inferred the sequence of a singular docosameric RNA molecule, we call the ALPHA sequence. Then, we searched for any relics of the peptides made from ALPHA in sequences deposited in the different public databases. (3) Results: Sequence matching between ALPHA and sequences from organisms among the earliest forms of life on Earth were found at high statistical relevance. We hypothesize that the frequency of appearance of relics from ALPHA sequence in present genomes has a functional necessity. (4) Conclusions: Given the fitness of ALPHA as a supportive sequence of the framework of all existing theories, and the evolution of Archaea and giant viruses, it is anticipated that the unique properties of this singular archetypal ALPHA sequence should prove useful as a model matrix for future applications, ranging from synthetic biology to DNA computing.
Collapse
|
26
|
Piette BMAG, Heddle JG. A Peptide-Nucleic Acid Replicator Origin for Life. Trends Ecol Evol 2020; 35:397-406. [PMID: 32294421 DOI: 10.1016/j.tree.2020.01.001] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2019] [Revised: 12/19/2019] [Accepted: 01/15/2020] [Indexed: 12/20/2022]
Abstract
Evolution requires self-replication. But, what was the very first self-replicator directly ancestral to all life? The currently favoured RNA World theory assigns this role to RNA alone but suffers from a number of seemingly intractable problems. Instead, we suggest that the self-replicator consisted of both peptides and nucleic acid strands. Such a nucleopeptide replicator is more feasible both in the light of the replication machinery currently found in cells and the complexity of the evolutionary path required to reach them. Recent theoretical and mathematical work supports this idea and provide a blueprint for future investigations.
Collapse
Affiliation(s)
| | - Jonathan G Heddle
- Bionanoscience and Biochemistry Laboratory, Malopolska Centre of Biotechnology, Jagiellonian University, Krakow, Poland.
| |
Collapse
|
27
|
Evolution of Life on Earth: tRNA, Aminoacyl-tRNA Synthetases and the Genetic Code. Life (Basel) 2020; 10:life10030021. [PMID: 32131473 PMCID: PMC7151597 DOI: 10.3390/life10030021] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2020] [Revised: 02/13/2020] [Accepted: 02/27/2020] [Indexed: 02/07/2023] Open
Abstract
Life on Earth and the genetic code evolved around tRNA and the tRNA anticodon. We posit that the genetic code initially evolved to synthesize polyglycine as a cross-linking agent to stabilize protocells. We posit that the initial amino acids to enter the code occupied larger sectors of the code that were then invaded by incoming amino acids. Displacements of amino acids follow selection rules. The code sectored from a glycine code to a four amino acid code to an eight amino acid code to an ~16 amino acid code to the standard 20 amino acid code with stops. The proposed patterns of code sectoring are now most apparent from patterns of aminoacyl-tRNA synthetase evolution. The Elongation Factor-Tu GTPase anticodon-codon latch that checks the accuracy of translation appears to have evolved at about the eight amino acid to ~16 amino acid stage. Before evolution of the EF-Tu latch, we posit that both the 1st and 3rd anticodon positions were wobble positions. The genetic code evolved via tRNA charging errors and via enzymatic modifications of amino acids joined to tRNAs, followed by tRNA and aminoacyl-tRNA synthetase differentiation. Fidelity mechanisms froze the code by inhibiting further innovation.
Collapse
|
28
|
Michel CJ, Thompson JD. Identification of a circular code periodicity in the bacterial ribosome: origin of codon periodicity in genes? RNA Biol 2020; 17:571-583. [PMID: 31960748 DOI: 10.1080/15476286.2020.1719311] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/09/2023] Open
Abstract
Three-base periodicity (TBP), where nucleotides and higher order n-tuples are preferentially spaced by 3, 6, 9, etc. bases, is a well-known intrinsic property of protein-coding DNA sequences. However, its origins are still not fully understood. One hypothesis is that the periodicity reflects a primordial coding system that was used before the emergence of the modern standard genetic code (SGC). Recent evidence suggests that the X circular code, a set of 20 trinucleotides allowing the reading frames in genes to be retrieved locally, represents a possible ancestor of the SGC. Motifs from the X circular code have been found in the reading frame of protein-coding regions in extant organisms from bacteria to eukaryotes, in many transfer RNA (tRNA) genes and in important functional regions of the ribosomal RNA (rRNA), notably in the peptidyl transferase centre and the decoding centre. Here, we have used a powerful correlation function to search for periodicity patterns involving the 20 trinucleotides of the X circular code in a large set of bacterial protein-coding genes, as well as in the translation machinery, including rRNA and tRNA sequences. As might be expected, we found a strong circular code periodicity 0 modulo 3 in the protein-coding genes. More surprisingly, we also identified a similar circular code periodicity in a large region of the 16S rRNA. This region includes the 3' major domain corresponding to the primordial proto-ribosome decoding centre and containing numerous sites that interact with the tRNA and messenger RNA (mRNA) during translation. Furthermore, 3D structural analysis shows that the periodicity region surrounds the mRNA channel that lies between the head and the body of the SSU. Our results support the hypothesis that the X circular code may constitute an ancestral translation code involved in reading frame retrieval and maintenance, traces of which persist in modern mRNA, tRNA and rRNA despite their long evolution and adaptation to the SGC.
Collapse
Affiliation(s)
- Christian J Michel
- Department of Computer Science, ICube, CNRS, University of Strasbourg, Strasbourg, France
| | - Julie D Thompson
- Department of Computer Science, ICube, CNRS, University of Strasbourg, Strasbourg, France
| |
Collapse
|
29
|
Dila G, Ripp R, Mayer C, Poch O, Michel CJ, Thompson JD. Circular code motifs in the ribosome: a missing link in the evolution of translation? RNA (NEW YORK, N.Y.) 2019; 25:1714-1730. [PMID: 31506380 PMCID: PMC6859856 DOI: 10.1261/rna.072074.119] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/23/2019] [Accepted: 09/06/2019] [Indexed: 05/29/2023]
Abstract
The origin of the genetic code remains enigmatic five decades after it was elucidated, although there is growing evidence that the code coevolved progressively with the ribosome. A number of primordial codes were proposed as ancestors of the modern genetic code, including comma-free codes such as the RRY, RNY, or GNC codes (R = G or A, Y = C or T, N = any nucleotide), and the X circular code, an error-correcting code that also allows identification and maintenance of the reading frame. It was demonstrated previously that motifs of the X circular code are significantly enriched in the protein-coding genes of most organisms, from bacteria to eukaryotes. Here, we show that imprints of this code also exist in the ribosomal RNA (rRNA). In a large-scale study involving 133 organisms representative of the three domains of life, we identified 32 universal X motifs that are conserved in the rRNA of >90% of the organisms. Intriguingly, most of the universal X motifs are located in rRNA regions involved in important ribosome functions, notably in the peptidyl transferase center and the decoding center that form the original "proto-ribosome." Building on the existing accretion models for ribosome evolution, we propose that error-correcting circular codes represented an important step in the emergence of the modern genetic code. Thus, circular codes would have allowed the simultaneous coding of amino acids and synchronization of the reading frame in primitive translation systems, prior to the emergence of more sophisticated start codon recognition and translation initiation mechanisms.
Collapse
Affiliation(s)
- Gopal Dila
- Department of Computer Science, ICube, CNRS, University of Strasbourg, Strasbourg 67000, France
| | - Raymond Ripp
- Department of Computer Science, ICube, CNRS, University of Strasbourg, Strasbourg 67000, France
| | - Claudine Mayer
- Department of Computer Science, ICube, CNRS, University of Strasbourg, Strasbourg 67000, France
- Unité de Microbiologie Structurale, Institut Pasteur, CNRS, 75724 Paris Cedex 15, France
- Université Paris Diderot, Sorbonne Paris Cité, 75724 Paris Cedex 15, France
| | - Olivier Poch
- Department of Computer Science, ICube, CNRS, University of Strasbourg, Strasbourg 67000, France
| | - Christian J Michel
- Department of Computer Science, ICube, CNRS, University of Strasbourg, Strasbourg 67000, France
| | - Julie D Thompson
- Department of Computer Science, ICube, CNRS, University of Strasbourg, Strasbourg 67000, France
| |
Collapse
|
30
|
Genetic codes optimized as a traveling salesman problem. PLoS One 2019; 14:e0224552. [PMID: 31658301 PMCID: PMC6816573 DOI: 10.1371/journal.pone.0224552] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2019] [Accepted: 10/16/2019] [Indexed: 11/19/2022] Open
Abstract
The Standard Genetic Code (SGC) is robust to mutational errors such that frequently occurring mutations minimally alter the physio-chemistry of amino acids. The apparent correlation between the evolutionary distances among codons and the physio-chemical distances among their cognate amino acids suggests an early co-diversification between the codons and amino acids. Here we formulated the co-minimization of evolutionary distances between codons and physio-chemical distances between amino acids as a Traveling Salesman Problem (TSP) and solved it with a Hopfield neural network. In this unsupervised learning algorithm, macromolecules (e.g., tRNAs and aminoacyl-tRNA synthetases) associating codons with amino acids were considered biological analogs of Hopfield neurons associating "tour cities" with "tour positions". The Hopfield network efficiently yielded an abundance of genetic codes that were more error-minimizing than SGC and could thus be used to design artificial genetic codes. We further argue that as a self-optimization algorithm, the Hopfield neural network provides a model of origin of SGC and other adaptive molecular systems through evolutionary learning.
Collapse
|
31
|
A tRNA- and Anticodon-Centric View of the Evolution of Aminoacyl-tRNA Synthetases, tRNAomes, and the Genetic Code. Life (Basel) 2019; 9:life9020037. [PMID: 31060233 PMCID: PMC6616430 DOI: 10.3390/life9020037] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2019] [Revised: 04/24/2019] [Accepted: 05/01/2019] [Indexed: 11/25/2022] Open
Abstract
Pathways of standard genetic code evolution remain conserved and apparent, particularly upon analysis of aminoacyl-tRNA synthetase (aaRS) lineages. Despite having incompatible active site folds, class I and class II aaRS are homologs by sequence. Specifically, structural class IA aaRS enzymes derive from class IIA aaRS enzymes by in-frame extension of the protein N-terminus and by an alternate fold nucleated by the N-terminal extension. The divergence of aaRS enzymes in the class I and class II clades was analyzed using the Phyre2 protein fold recognition server. The class I aaRS radiated from the class IA enzymes, and the class II aaRS radiated from the class IIA enzymes. The radiations of aaRS enzymes bolster the coevolution theory for evolution of the amino acids, tRNAomes, the genetic code, and aaRS enzymes and support a tRNA anticodon-centric perspective. We posit that second- and third-position tRNA anticodon sequence preference (C>(U~G)>A) powerfully selected the sectoring pathway for the code. GlyRS-IIA appears to have been the primordial aaRS from which all aaRS enzymes evolved, and glycine appears to have been the primordial amino acid around which the genetic code evolved.
Collapse
|
32
|
Roura Frigolé H, Camacho N, Castellví Coma M, Fernández-Lozano C, García-Lema J, Rafels-Ybern À, Canals A, Coll M, Ribas de Pouplana L. tRNA deamination by ADAT requires substrate-specific recognition mechanisms and can be inhibited by tRFs. RNA (NEW YORK, N.Y.) 2019; 25:607-619. [PMID: 30737359 PMCID: PMC6467012 DOI: 10.1261/rna.068189.118] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/23/2018] [Accepted: 01/28/2019] [Indexed: 05/30/2023]
Abstract
Adenosine deaminase acting on transfer RNA (ADAT) is an essential eukaryotic enzyme that catalyzes the deamination of adenosine to inosine at the first position of tRNA anticodons. Mammalian ADATs modify eight different tRNAs, having increased their substrate range from a bacterial ancestor that likely deaminated exclusively tRNAArg Here we investigate the recognition mechanisms of tRNAArg and tRNAAla by human ADAT to shed light on the process of substrate expansion that took place during the evolution of the enzyme. We show that tRNA recognition by human ADAT does not depend on conserved identity elements, but on the overall structural features of tRNA. We find that ancestral-like interactions are conserved for tRNAArg, while eukaryote-specific substrates use alternative mechanisms. These recognition studies show that human ADAT can be inhibited by tRNA fragments in vitro, including naturally occurring fragments involved in important regulatory pathways.
Collapse
MESH Headings
- Adenosine/metabolism
- Adenosine Deaminase/genetics
- Adenosine Deaminase/metabolism
- Anticodon/chemistry
- Anticodon/genetics
- Anticodon/metabolism
- Base Sequence
- Deamination
- Evolution, Molecular
- Gene Expression
- Humans
- Inosine/metabolism
- Nucleic Acid Conformation
- RNA, Transfer, Ala/chemistry
- RNA, Transfer, Ala/genetics
- RNA, Transfer, Ala/metabolism
- RNA, Transfer, Arg/chemistry
- RNA, Transfer, Arg/genetics
- RNA, Transfer, Arg/metabolism
- Sequence Alignment
- Substrate Specificity
Collapse
Affiliation(s)
- Helena Roura Frigolé
- Institute for Research in Biomedicine (IRB Barcelona), Barcelona Institute of Science and Technology, 08028 Barcelona, Catalonia, Spain
| | - Noelia Camacho
- Institute for Research in Biomedicine (IRB Barcelona), Barcelona Institute of Science and Technology, 08028 Barcelona, Catalonia, Spain
| | - Maria Castellví Coma
- Institute for Research in Biomedicine (IRB Barcelona), Barcelona Institute of Science and Technology, 08028 Barcelona, Catalonia, Spain
| | - Carla Fernández-Lozano
- Institute for Research in Biomedicine (IRB Barcelona), Barcelona Institute of Science and Technology, 08028 Barcelona, Catalonia, Spain
| | - Jorge García-Lema
- Institute for Research in Biomedicine (IRB Barcelona), Barcelona Institute of Science and Technology, 08028 Barcelona, Catalonia, Spain
| | - Àlbert Rafels-Ybern
- Institute for Research in Biomedicine (IRB Barcelona), Barcelona Institute of Science and Technology, 08028 Barcelona, Catalonia, Spain
| | - Albert Canals
- Institute for Research in Biomedicine (IRB Barcelona), Barcelona Institute of Science and Technology, 08028 Barcelona, Catalonia, Spain
- Molecular Biology Institute of Barcelona, Consejo Superior de Investigaciones Científicas, 08028 Barcelona, Catalonia, Spain
| | - Miquel Coll
- Institute for Research in Biomedicine (IRB Barcelona), Barcelona Institute of Science and Technology, 08028 Barcelona, Catalonia, Spain
- Molecular Biology Institute of Barcelona, Consejo Superior de Investigaciones Científicas, 08028 Barcelona, Catalonia, Spain
| | - Lluís Ribas de Pouplana
- Institute for Research in Biomedicine (IRB Barcelona), Barcelona Institute of Science and Technology, 08028 Barcelona, Catalonia, Spain
- Catalan Institution for Research and Advanced Studies (ICREA), 08010 Barcelona, Catalonia, Spain
| |
Collapse
|
33
|
BłaŻej P, Wnetrzak M, Mackiewicz D, Mackiewicz P. The influence of different types of translational inaccuracies on the genetic code structure. BMC Bioinformatics 2019; 20:114. [PMID: 30841864 PMCID: PMC6404327 DOI: 10.1186/s12859-019-2661-4] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2018] [Accepted: 01/29/2019] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND The standard genetic code is a recipe for assigning unambiguously 21 labels, i.e. amino acids and stop translation signal, to 64 codons. However, at early stages of the translational machinery development, the codons did not have to be read unambiguously and the early genetic codes could have contained some ambiguous assignments of codons to amino acids. Therefore, the goal of this work was to obtain the genetic code structures which could have evolved assuming different types of inaccuracy of the translational machinery starting from unambiguous assignments of codons to amino acids. RESULTS We developed a theoretical model assuming that the level of uncertainty of codon assignments can gradually decrease during the simulations. Since it is postulated that the standard code has evolved to be robust against point mutations and mistranslations, we developed three simulation scenarios assuming that such errors can influence one, two or three codon positions. The simulated codes were selected using the evolutionary algorithm methodology to decrease coding ambiguity and increase their robustness against mistranslation. CONCLUSIONS The results indicate that the typical codon block structure of the genetic code could have evolved to decrease the ambiguity of amino acid to codon assignments and to increase the fidelity of reading the genetic information. However, the robustness to errors was not the decisive factor that influenced the genetic code evolution because it is possible to find theoretical codes that minimize the reading errors better than the standard genetic code.
Collapse
Affiliation(s)
- Paweł BłaŻej
- Department of Genomics, University of Wrocław, ul. Joliot-Curie 14a, Wrocław, 50-383 Poland
| | - Małgorzata Wnetrzak
- Department of Genomics, University of Wrocław, ul. Joliot-Curie 14a, Wrocław, 50-383 Poland
| | - Dorota Mackiewicz
- Department of Genomics, University of Wrocław, ul. Joliot-Curie 14a, Wrocław, 50-383 Poland
| | - Paweł Mackiewicz
- Department of Genomics, University of Wrocław, ul. Joliot-Curie 14a, Wrocław, 50-383 Poland
| |
Collapse
|
34
|
Abstract
Photosynthesis and nitrogen fixation became evolutionarily immutable as “frozen metabolic accidents” because multiple interactions between the proteins and protein complexes involved led to their co-evolution in modules. This has impeded their adaptation to an oxidizing atmosphere, and reconfiguration now requires modification or replacement of whole modules, using either natural modules from exotic species or non-natural proteins with similar interaction potential. Ultimately, the relevant complexes might be reconstructed (almost) from scratch, starting either from appropriate precursor processes or by designing alternative pathways. These approaches will require advances in synthetic biology, laboratory evolution, and a better understanding of module functions.
Collapse
Affiliation(s)
- Dario Leister
- Faculty of Biology, Ludwig-Maximilians-University Munich, Großhaderner Str. 2, 82152, Planegg-Martinsried, Germany.
| |
Collapse
|
35
|
Wnętrzak M, Błażej P, Mackiewicz D, Mackiewicz P. The optimality of the standard genetic code assessed by an eight-objective evolutionary algorithm. BMC Evol Biol 2018; 18:192. [PMID: 30545289 PMCID: PMC6293558 DOI: 10.1186/s12862-018-1304-0] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2017] [Accepted: 11/22/2018] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND The standard genetic code (SGC) is a unique set of rules which assign amino acids to codons. Similar amino acids tend to have similar codons indicating that the code evolved to minimize the costs of amino acid replacements in proteins, caused by mutations or translational errors. However, if such optimization in fact occurred, many different properties of amino acids must have been taken into account during the code evolution. Therefore, this problem can be reformulated as a multi-objective optimization task, in which the selection constraints are represented by measures based on various amino acid properties. RESULTS To study the optimality of the SGC we applied a multi-objective evolutionary algorithm and we used the representatives of eight clusters, which grouped over 500 indices describing various physicochemical properties of amino acids. Thanks to that we avoided an arbitrary choice of amino acid features as optimization criteria. As a consequence, we were able to conduct a more general study on the properties of the SGC than the ones presented so far in other papers on this topic. We considered two models of the genetic code, one preserving the characteristic codon blocks structure of the SGC and the other without this restriction. The results revealed that the SGC could be significantly improved in terms of error minimization, hereby it is not fully optimized. Its structure differs significantly from the structure of the codes optimized to minimize the costs of amino acid replacements. On the other hand, using newly defined quality measures that placed the SGC in the global space of theoretical genetic codes, we showed that the SGC is definitely closer to the codes that minimize the costs of amino acids replacements than those maximizing them. CONCLUSIONS The standard genetic code represents most likely only partially optimized systems, which emerged under the influence of many different factors. Our findings can be useful to researchers involved in modifying the genetic code of the living organisms and designing artificial ones.
Collapse
Affiliation(s)
- Małgorzata Wnętrzak
- Department of Genomics, Faculty of Biotechnology, University of Wrocław, ul. Joliot-Curie 14a, 50-383, Wrocław, Poland
| | - Paweł Błażej
- Department of Genomics, Faculty of Biotechnology, University of Wrocław, ul. Joliot-Curie 14a, 50-383, Wrocław, Poland
| | - Dorota Mackiewicz
- Department of Genomics, Faculty of Biotechnology, University of Wrocław, ul. Joliot-Curie 14a, 50-383, Wrocław, Poland
| | - Paweł Mackiewicz
- Department of Genomics, Faculty of Biotechnology, University of Wrocław, ul. Joliot-Curie 14a, 50-383, Wrocław, Poland.
| |
Collapse
|
36
|
Błażej P, Wnętrzak M, Mackiewicz D, Mackiewicz P. Optimization of the standard genetic code according to three codon positions using an evolutionary algorithm. PLoS One 2018; 13:e0201715. [PMID: 30092017 PMCID: PMC6084934 DOI: 10.1371/journal.pone.0201715] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2018] [Accepted: 07/21/2018] [Indexed: 12/28/2022] Open
Abstract
Many biological systems are typically examined from the point of view of adaptation to certain conditions or requirements. One such system is the standard genetic code (SGC), which generally minimizes the cost of amino acid replacements resulting from mutations or mistranslations. However, no full consensus has been reached on the factors that caused the evolution of this feature. One of the hypotheses suggests that code optimality was directly selected as an advantage to preserve information about encoded proteins. An important feature that should be considered when studying the SGC is the different roles of the three codon positions. Therefore, we investigated the robustness of this code regarding the cost of amino acid replacements resulting from substitutions in these positions separately and the sum of these costs. We applied a modified evolutionary algorithm and included four models of the genetic code assuming various restrictions on its structure. The SGC was compared both with the codes that minimize the objective function and those that maximize it. This approach allowed us to place the SGC in the global space of possible codes, which is a more appropriate and unbiased comparison than that with randomly generated codes because they are characterized by relatively uniform amino acid assignments to codons. The SGC appeared to be well optimized at the global scale, but its individual positions were not fully optimized because there were codes that were optimized for only one codon position and simultaneously outperformed the SGC at the other positions. We also found that different code structures may lead to the same optimality and that random codes can show a tendency to minimize costs under some of the genetic code models. Our results suggest that the optimality of SGC could be a by-product of other processes.
Collapse
Affiliation(s)
- Paweł Błażej
- Department of Genomics, Faculty of Biotechnology, University of Wrocław, Wrocław, Poland
| | - Małgorzata Wnętrzak
- Department of Genomics, Faculty of Biotechnology, University of Wrocław, Wrocław, Poland
| | - Dorota Mackiewicz
- Department of Genomics, Faculty of Biotechnology, University of Wrocław, Wrocław, Poland
| | - Paweł Mackiewicz
- Department of Genomics, Faculty of Biotechnology, University of Wrocław, Wrocław, Poland
- * E-mail:
| |
Collapse
|
37
|
Di Giulio M. On Earth, there would be a number of fundamental kinds of primary cells – cellular domains – greater than or equal to four. J Theor Biol 2018; 443:10-17. [DOI: 10.1016/j.jtbi.2018.01.025] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2017] [Revised: 01/10/2018] [Accepted: 01/19/2018] [Indexed: 11/15/2022]
|
38
|
Abstract
We advocate for a tRNA- rather than an mRNA-centric model for evolution of the genetic code. The mechanism for evolution of cloverleaf tRNA provides a root sequence for radiation of tRNAs and suggests a simplified understanding of code evolution. To analyze code sectoring, rooted tRNAomes were compared for several archaeal and one bacterial species. Rooting of tRNAome trees reveals conserved structures, indicating how the code was shaped during evolution and suggesting a model for evolution of a LUCA tRNAome tree. We propose the polyglycine hypothesis that the initial product of the genetic code may have been short chain polyglycine to stabilize protocells. In order to describe how anticodons were allotted in evolution, the sectoring-degeneracy hypothesis is proposed. Based on sectoring, a simple stepwise model is developed, in which the code sectors from a 1→4→8→∼16 letter code. At initial stages of code evolution, we posit strong positive selection for wobble base ambiguity, supporting convergence to 4-codon sectors and ∼16 letters. In a later stage, ∼5–6 letters, including stops, were added through innovating at the anticodon wobble position. In archaea and bacteria, tRNA wobble adenine is negatively selected, shrinking the maximum size of the primordial genetic code to 48 anticodons. Because 64 codons are recognized in mRNA, tRNA-mRNA coevolution requires tRNA wobble position ambiguity leading to degeneracy of the code.
Collapse
Affiliation(s)
- Daewoo Pak
- a Center for Statistical Training and Consulting , Michigan State University , E. Lansing , MI 48824 , USA
| | - Nan Du
- b Computer Science and Engineering , Michigan State University , E. Lansing , MI 48824
| | | | - Yanni Sun
- b Computer Science and Engineering , Michigan State University , E. Lansing , MI 48824
| | - Zachary F Burton
- d Department of Biochemistry and Molecular Biology , Michigan State University , E. Lansing , MI 48824-1319
| |
Collapse
|
39
|
The evolution of the genetic code: Impasses and challenges. Biosystems 2018; 164:217-225. [DOI: 10.1016/j.biosystems.2017.10.006] [Citation(s) in RCA: 38] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2017] [Revised: 10/06/2017] [Accepted: 10/09/2017] [Indexed: 01/17/2023]
|
40
|
Michel CJ, Ngoune VN, Poch O, Ripp R, Thompson JD. Enrichment of Circular Code Motifs in the Genes of the Yeast Saccharomyces cerevisiae. Life (Basel) 2017; 7:life7040052. [PMID: 29207500 PMCID: PMC5745565 DOI: 10.3390/life7040052] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2017] [Revised: 11/27/2017] [Accepted: 11/27/2017] [Indexed: 12/17/2022] Open
Abstract
A set X of 20 trinucleotides has been found to have the highest average occurrence in the reading frame, compared to the two shifted frames, of genes of bacteria, archaea, eukaryotes, plasmids and viruses. This set X has an interesting mathematical property, since X is a maximal C3 self-complementary trinucleotide circular code. Furthermore, any motif obtained from this circular code X has the capacity to retrieve, maintain and synchronize the original (reading) frame. Since 1996, the theory of circular codes in genes has mainly been developed by analysing the properties of the 20 trinucleotides of X, using combinatorics and statistical approaches. For the first time, we test this theory by analysing the X motifs, i.e., motifs from the circular code X, in the complete genome of the yeast Saccharomyces cerevisiae. Several properties of X motifs are identified by basic statistics (at the frequency level), and evaluated by comparison to R motifs, i.e., random motifs generated from 30 different random codes R. We first show that the frequency of X motifs is significantly greater than that of R motifs in the genome of S. cerevisiae. We then verify that no significant difference is observed between the frequencies of X and R motifs in the non-coding regions of S. cerevisiae, but that the occurrence number of X motifs is significantly higher than R motifs in the genes (protein-coding regions). This property is true for all cardinalities of X motifs (from 4 to 20) and for all 16 chromosomes. We further investigate the distribution of X motifs in the three frames of S. cerevisiae genes and show that they occur more frequently in the reading frame, regardless of their cardinality or their length. Finally, the ratio of X genes, i.e., genes with at least one X motif, to non-X genes, in the set of verified genes is significantly different to that observed in the set of putative or dubious genes with no experimental evidence. These results, taken together, represent the first evidence for a significant enrichment of X motifs in the genes of an extant organism. They raise two hypotheses: the X motifs may be evolutionary relics of the primitive codes used for translation, or they may continue to play a functional role in the complex processes of genome decoding and protein synthesis.
Collapse
Affiliation(s)
- Christian J Michel
- Complex Systems and Translational Bioinformatics, ICube, University of Strasbourg, CNRS, 300 Boulevard Sébastien Brant, 67400 Illkirch, France.
| | - Viviane Nguefack Ngoune
- Complex Systems and Translational Bioinformatics, ICube, University of Strasbourg, CNRS, 300 Boulevard Sébastien Brant, 67400 Illkirch, France.
| | - Olivier Poch
- Complex Systems and Translational Bioinformatics, ICube, University of Strasbourg, CNRS, 300 Boulevard Sébastien Brant, 67400 Illkirch, France.
| | - Raymond Ripp
- Complex Systems and Translational Bioinformatics, ICube, University of Strasbourg, CNRS, 300 Boulevard Sébastien Brant, 67400 Illkirch, France.
| | - Julie D Thompson
- Complex Systems and Translational Bioinformatics, ICube, University of Strasbourg, CNRS, 300 Boulevard Sébastien Brant, 67400 Illkirch, France.
| |
Collapse
|
41
|
Affiliation(s)
- Eugene V. Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
| | - Artem S. Novozhilov
- Department of Mathematics, North Dakota State University, Fargo, North Dakota 58108, USA
| |
Collapse
|