1
|
Salman A, Biziaev N, Shuvalova E, Alkalaeva E. mRNA context and translation factors determine decoding in alternative nuclear genetic codes. Bioessays 2024; 46:e2400058. [PMID: 38724251 DOI: 10.1002/bies.202400058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2024] [Revised: 04/19/2024] [Accepted: 04/23/2024] [Indexed: 06/27/2024]
Abstract
The genetic code is a set of instructions that determine how the information in our genetic material is translated into amino acids. In general, it is universal for all organisms, from viruses and bacteria to humans. However, in the last few decades, exceptions to this rule have been identified both in pro- and eukaryotes. In this review, we discuss the 16 described alternative eukaryotic nuclear genetic codes and observe theories of their appearance in evolution. We consider possible molecular mechanisms that allow codon reassignment. Most reassignments in nuclear genetic codes are observed for stop codons. Moreover, in several organisms, stop codons can simultaneously encode amino acids and serve as termination signals. In this case, the meaning of the codon is determined by the additional factors besides the triplets. A comprehensive review of various non-standard coding events in the nuclear genomes provides a new insight into the translation mechanism in eukaryotes.
Collapse
Affiliation(s)
- Ali Salman
- Engelhardt Institute of Molecular Biology, the Russian Academy of Sciences, Moscow, Russia
| | - Nikita Biziaev
- Engelhardt Institute of Molecular Biology, the Russian Academy of Sciences, Moscow, Russia
| | - Ekaterina Shuvalova
- Engelhardt Institute of Molecular Biology, the Russian Academy of Sciences, Moscow, Russia
| | - Elena Alkalaeva
- Engelhardt Institute of Molecular Biology, the Russian Academy of Sciences, Moscow, Russia
| |
Collapse
|
2
|
Wang X, Dong Q, Chen G, Zhang J, Liu Y, Cai Y. Frameshift and wild-type proteins are often highly similar because the genetic code and genomes were optimized for frameshift tolerance. BMC Genomics 2022; 23:416. [PMID: 35655139 PMCID: PMC9164415 DOI: 10.1186/s12864-022-08435-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2021] [Accepted: 03/02/2022] [Indexed: 11/10/2022] Open
Abstract
Frameshift mutations have been considered of significant importance for the molecular evolution of proteins and their coding genes, while frameshift protein sequences encoded in the alternative reading frames of coding genes have been considered to be meaningless. However, functional frameshifts have been found widely existing. It was puzzling how a frameshift protein kept its structure and functionality while substantial changes occurred in its primary amino-acid sequence. This study shows that the similarities among frameshifts and wild types are higher than random similarities and are determined at different levels. Frameshift substitutions are more conservative than random substitutions in the standard genetic code (SGC). The frameshift substitutions score of SGC ranks in the top 2.0-3.5% of alternative genetic codes, showing that SGC is nearly optimal for frameshift tolerance. In many genes and certain genomes, frameshift-resistant codons and codon pairs appear more frequently than expected, suggesting that frameshift tolerance is achieved through not only the optimality of the genetic code but, more importantly, the further optimization of a specific gene or genome through the usages of codons/codon pairs, which sheds light on the role of frameshift mutations in molecular and genomic evolution.
Collapse
|
3
|
Ferrer-I-Cancho R, Gómez-Rodríguez C, Esteban JL, Alemany-Puig L. Optimality of syntactic dependency distances. Phys Rev E 2022; 105:014308. [PMID: 35193296 DOI: 10.1103/physreve.105.014308] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2020] [Accepted: 11/10/2021] [Indexed: 06/14/2023]
Abstract
It is often stated that human languages, as other biological systems, are shaped by cost-cutting pressures but, to what extent? Attempts to quantify the degree of optimality of languages by means of an optimality score have been scarce and focused mostly on English. Here we recast the problem of the optimality of the word order of a sentence as an optimization problem on a spatial network where the vertices are words, arcs indicate syntactic dependencies, and the space is defined by the linear order of the words in the sentence. We introduce a score to quantify the cognitive pressure to reduce the distance between linked words in a sentence. The analysis of sentences from 93 languages representing 19 linguistic families reveals that half of languages are optimized to a 70% or more. The score indicates that distances are not significantly reduced in a few languages and confirms two theoretical predictions: that longer sentences are more optimized and that distances are more likely to be longer than expected by chance in short sentences. We present a hierarchical ranking of languages by their degree of optimization. The score has implications for various fields of language research (dependency linguistics, typology, historical linguistics, clinical linguistics, and cognitive science). Finally, the principles behind the design of the score have implications for network science.
Collapse
Affiliation(s)
- Ramon Ferrer-I-Cancho
- Complexity and Quantitative Linguistics Lab, LARCA Research Group, Departament de Ciències de la Computació, Universitat Politècnica de Catalunya, Campus Nord, Edifici Omega, Jordi Girona Salgado 1-3 08034 Barcelona, Catalonia, Spain
| | - Carlos Gómez-Rodríguez
- Universidade da Coruña, CITIC, FASTPARSE Lab, LyS Research Group, Departamento de Ciencias de la Computación y Tecnologías de la Información, Facultade de Informática, Elviña, 15071, A Coruña, Spain
| | - Juan Luis Esteban
- Departament de Ciències de la Computació, Universitat Politècnica de Catalunya (UPC), Campus Nord, Edifici Omega, Jordi Girona Salgado 1-3 08034 Barcelona, Catalonia, Spain
| | - Lluís Alemany-Puig
- Complexity and Quantitative Linguistics Lab, LARCA Research Group, Departament de Ciències de la Computació, Universitat Politècnica de Catalunya, Campus Nord, Edifici Omega, Jordi Girona Salgado 1-3 08034 Barcelona, Catalonia, Spain
| |
Collapse
|
4
|
Abstract
The standard genetic code (SGC) has been extensively analyzed for the biological ramifications of its nonrandom structure. For instance, mismatch errors due to point mutation or mistranslation have an overall smaller effect on the amino acid polar requirement under the SGC than under random genetic codes (RGCs). A similar observation was recently made for frameshift errors, prompting the assertion that the SGC has been shaped by natural selection for frameshift-robustness-conservation of certain amino acid properties upon a frameshift mutation or translational frameshift. However, frameshift-robustness confers no benefit because frameshifts usually create premature stop codons that cause nonsense-mediated mRNA decay or production of nonfunctional truncated proteins. We here propose that the frameshift-robustness of the SGC is a byproduct of its mismatch-robustness. Of 564 amino acid properties considered, the SGC exhibits mismatch-robustness in 93-133 properties and frameshift-robustness in 55 properties, respectively, and that the latter is largely a subset of the former. For each of the 564 real and 564 randomly constructed fake properties of amino acids, there is a positive correlation between mismatch-robustness and frameshift-robustness across one million RGCs; this correlation arises because most amino acid changes resulting from a frameshift are also achievable by a mismatch error. Importantly, the SGC does not show significantly higher frameshift-robustness in any of the 55 properties than RGCs of comparable mismatch-robustness. These findings support that the frameshift-robustness of the SGC need not originate through direct selection and can instead be a site effect of its mismatch-robustness.
Collapse
Affiliation(s)
- Haiqing Xu
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, USA
| | - Jianzhi Zhang
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, USA
| |
Collapse
|
5
|
Dila G, Michel CJ, Thompson JD. Optimality of circular codes versus the genetic code after frameshift errors. Biosystems 2020; 195:104134. [DOI: 10.1016/j.biosystems.2020.104134] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2020] [Revised: 03/23/2020] [Accepted: 03/25/2020] [Indexed: 12/24/2022]
|
6
|
A search for the physical basis of the genetic code. Biosystems 2020; 195:104148. [DOI: 10.1016/j.biosystems.2020.104148] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2020] [Revised: 04/09/2020] [Accepted: 04/09/2020] [Indexed: 01/01/2023]
|
7
|
Seligmann H. First arrived, first served: competition between codons for codon-amino acid stereochemical interactions determined early genetic code assignments. Naturwissenschaften 2020; 107:20. [PMID: 32367155 DOI: 10.1007/s00114-020-01676-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2019] [Revised: 03/10/2020] [Accepted: 04/05/2020] [Indexed: 12/12/2022]
Abstract
Stereochemical nucleotide-amino acid interactions, in the form of noncovalent nucleotide-amino acid interactions, potentially produced the genetic code's codon-amino acid assignments. Empirical estimates of single nucleotide-amino acid affinities on surfaces and in solution are used to test whether trinucleotide-amino acid affinities determined genetic code assignments pending the principle "first arrived, first served": presumed early amino acids have greater codon-amino acid affinities than ulterior ones. Here, these single nucleotide affinities are used to approximate all 64 × 20 trinucleotide-amino acid affinities. Analyses show that (1) on surfaces, genetic code codon-amino acid assignments tend to match high affinities for the amino acids that integrated earliest the genetic code (according to Wong's metabolic coevolution hypothesis between nucleotides and amino acids) and (2) in solution, the same principle holds for the anticodon-amino acid assignments. Affinity analyses match best genetic code assignments when assuming that trinucleotides competed for amino acids, rather than amino acids for trinucleotides. Codon-amino acid affinities stick better to genetic code assignments than anticodon-amino acid affinities. Presumably, two independent coding systems, on surfaces and in solution, converged, and formed the current translation system. Proto-translation on surfaces by direct codon-amino acid interactions without tRNA-like adaptors coadapted with a system emerging in solution by proto-tRNA anticodon-amino acid interactions. These systems assigned identical or similar cognates to codons on surfaces and to anticodons in solution. Results indicate that a prebiotic metabolism predated genetic code self-organization.
Collapse
Affiliation(s)
- Hervé Seligmann
- The National Natural History Collections, The Hebrew University of Jerusalem, 91904, Jerusalem, Israel. .,Faculty of Medicine, Université Grenoble Alpes, Laboratory AGEIS EA 7407, Team Tools for e-Gnosis Medical & Labcom CNRS/UGA/OrangeLabs Telecoms4Health, F-38700, La Tronche, France.
| |
Collapse
|
8
|
Demongeot J, Seligmann H. Why Is AUG the Start Codon?: Theoretical Minimal RNA Rings: Maximizing Coded Information Biases 1st Codon for the Universal Initiation Codon AUG. Bioessays 2020; 42:e1900201. [PMID: 32227358 DOI: 10.1002/bies.201900201] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2019] [Revised: 02/09/2020] [Indexed: 01/04/2023]
Abstract
The rational design of theoretical minimal RNA rings predetermines AUG as the universal start codon. This design maximizes coded amino acid diversity over minimal sequence length, defining in silico theoretical minimal RNA rings, candidate ancestral genes. RNA rings code for 21 amino acids and a stop codon after three consecutive translation rounds, and form a degradation-delaying stem-loop hairpin. Twenty-five RNA rings match these constraints, ten start with the universal initiation codon AUG. No first codon bias exists among remaining RNA rings. RNA ring design predetermines AUG as initiation codon. This is the only explanation yet for AUG as start codon. RNA ring design determines additional RNA ring gene- and tRNA-like properties described previously, because it presumably mimics constraints on life's primordial RNAs.
Collapse
Affiliation(s)
- Jacques Demongeot
- Laboratory AGEIS EA 7407, Team Tools for e-Gnosis Medical & Labcom CNRS/UGA/OrangeLabs Telecom4Health, Faculty of Medicine, Université Grenoble Alpes, La Tronche, F-38700, France
| | - Hervé Seligmann
- Laboratory AGEIS EA 7407, Team Tools for e-Gnosis Medical & Labcom CNRS/UGA/OrangeLabs Telecom4Health, Faculty of Medicine, Université Grenoble Alpes, La Tronche, F-38700, France.,The National Natural History Collections, The Hebrew University of Jerusalem, Jerusalem, 91404, Israel
| |
Collapse
|
9
|
Demongeot J, Seligmann H. Deamination gradients within codons after 1<->2 position swap predict amino acid hydrophobicity and parallel β-sheet conformational preference. Biosystems 2020; 191-192:104116. [PMID: 32081715 DOI: 10.1016/j.biosystems.2020.104116] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2019] [Revised: 12/04/2019] [Accepted: 02/10/2020] [Indexed: 12/16/2022]
Abstract
Deaminations C->T and A->G are frequent mutations producing nucleotide content gradients across genomes proportional to singlestrandedness during replication/transcription. Hence, within single codons, deamination risks increase from first to third codon positions, while second codon positions are functionally most crucial. Here genetic codes are analyzed assuming that after anticodons protected codons from deaminations, first and second codon positions swapped (N2N1N3->N1N2N3), with lowest deamination risks for N2 in presumed primitive N2N1N3 codons. N2N1N3, not standard N1N2N3, codon structure minimizes deaminations inversely proportionally to cognate amino acid hydrophobicity and parallel betasheet conformational preference. For N1N2N3, deamination minimization increases with genetic code integration order of cognate amino acids: during the presumed N2N1N3->N1N2N3 codon structure transition, protein synthesis combined direct codon-amino acid interactions for late amino acids and tRNA-based translation for early amino acids. Hence N2N1N3 codons would correspond to tRNA-free translation by spontaneous codon-amino acid affinities, and tRNA-mediated translation presumably caused N2N1N3->N1N2N3 swaps. Results show that rational, not arbitrary rules link codon and amino acid structures. Some analyses detect mitochondrial RNAs and peptides in public data corresponding to systematic position swaps, suggesting occasional swapping polymerase activity.
Collapse
Affiliation(s)
- Jacques Demongeot
- Université Grenoble Alpes, Faculty of Medicine, Laboratory AGEIS EA 7407, Team Tools for e-Gnosis Medical, F-38700, La Tronche, France.
| | - Hervé Seligmann
- Université Grenoble Alpes, Faculty of Medicine, Laboratory AGEIS EA 7407, Team Tools for e-Gnosis Medical, F-38700, La Tronche, France; The National Natural History Collections, The Hebrew University of Jerusalem, 91404, Jerusalem, Israel.
| |
Collapse
|
10
|
Lean OM. Chemical arbitrariness and the causal role of molecular adapters. STUDIES IN HISTORY AND PHILOSOPHY OF BIOLOGICAL AND BIOMEDICAL SCIENCES 2019; 78:101180. [PMID: 31281071 DOI: 10.1016/j.shpsc.2019.101180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/05/2018] [Revised: 04/08/2019] [Accepted: 06/23/2019] [Indexed: 06/09/2023]
Abstract
Jacques Monod (1971) argued that certain molecular processes rely critically on the property of chemical arbitrariness, which he claimed allows those processes to "transcend the laws of chemistry". It seems natural, as some philosophers have done, to interpret this in modal terms: a biological relationship is chemically arbitrary if it is possible, within the constraints of chemical "law", for that relationship to have been otherwise than it is. But while modality is certainly important for understanding chemical arbitrariness, understanding its biological role also requires an account of the concrete causal-functional features that distinguish arbitrary from non-arbitrary phenomena. In this paper I elaborate on this under-emphasised aspect by offering a general account of these features: arbitrary relations are instantiated by mechanisms that involve molecular adapters, which causally couple two properties or processes which would otherwise be uncorrelated. Additionally, adapters work by acting as intermediate rather than cooperating causes.
Collapse
Affiliation(s)
- Oliver M Lean
- Institut supérieur de philosophie, Université catholique de Louvain, 1348, Louvain-la-Neuve, Belgium.
| |
Collapse
|
11
|
Demongeot J, Seligmann H. Theoretical minimal RNA rings designed according to coding constraints mimic deamination gradients. THE SCIENCE OF NATURE - NATURWISSENSCHAFTEN 2019; 106:44. [DOI: 10.1007/s00114-019-1638-5] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/15/2018] [Revised: 06/18/2019] [Accepted: 06/19/2019] [Indexed: 11/27/2022]
|
12
|
Optimization of the standard genetic code in terms of two mutation types: Point mutations and frameshifts. Biosystems 2019; 181:44-50. [DOI: 10.1016/j.biosystems.2019.04.012] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2019] [Accepted: 04/27/2019] [Indexed: 02/08/2023]
|
13
|
Seligmann H. Localized Context-Dependent Effects of the "Ambush" Hypothesis: More Off-Frame Stop Codons Downstream of Shifty Codons. DNA Cell Biol 2019; 38:786-795. [PMID: 31157984 DOI: 10.1089/dna.2019.4725] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
The ambush hypothesis speculates that off-frame stop codons increase translational efficiency after ribosomal frameshifts by stopping early frameshifted translation. Some evidences fit this hypothesis: (1) synonymous codon usages increase with their potential contribution to off-frame stops; (2) the genetic code assigns frequent amino acids to codon families contributing to off-frame stops; (3) positive biases for off-frame stops (AT rich) occur despite adverse nucleotide (GC) biases; and (4) mitochondrial off-frame stop codon densities increase with ribosomal structural instability, potential proxy of frameshift frequencies. In this study, analyses of vertebrate mitogenes and tRNA synthetase genes from all superkingdoms and viruses test a new prediction of the ambush hypothesis: sequences immediately downstream of frameshift-inducing homopolymer codons (AAA, CCC, GGG, and TTT) are off-frame stop rich. Codons immediately downstream of homopolymer codons form more than average off-frame stops, biases are stronger than for corresponding upstream distances and for any other group of synonymous codons. Sequences downstream of that high-density region are off-frame stop depleted. This decrease suggests that off-frame stops, combined with suppressor tRNAs regulate translation of overlapping coding sequences. Results show the predictive power of the ambush hypothesis, from macroevolutionary (genetic code structure) to detailed gene sequence anatomy.
Collapse
Affiliation(s)
- Hervé Seligmann
- The National Natural History Collections, The Hebrew University of Jerusalem, Jerusalem, Israel
| |
Collapse
|
14
|
Demongeot J, Seligmann H. More Pieces of Ancient than Recent Theoretical Minimal Proto-tRNA-Like RNA Rings in Genes Coding for tRNA Synthetases. J Mol Evol 2019; 87:152-174. [DOI: 10.1007/s00239-019-09892-6] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2018] [Accepted: 03/22/2019] [Indexed: 12/19/2022]
|
15
|
Kasman A. The Duplexing of the Genetic Code and Sequence-Dependent DNA Geometry. Bull Math Biol 2018; 80:2734-2760. [PMID: 30097915 DOI: 10.1007/s11538-018-0486-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2017] [Accepted: 08/03/2018] [Indexed: 11/30/2022]
Abstract
It is well known that sequences of bases in DNA are translated into sequences of amino acids in cells via the genetic code. More recently, it has been discovered that the sequence of DNA bases also influences the geometry and deformability of the DNA. These two correspondences represent a naturally arising example of duplexed codes, providing two different ways of interpreting the same DNA sequence. This paper will set up the notation and basic results necessary to mathematically investigate the relationship between these two natural DNA codes. It then undertakes two very different such investigations: one graphical approach based only on expected values and another analytic approach incorporating the deformability of the DNA molecule and approximating the mutual information of the two codes. Special emphasis is paid to whether there is evidence that pressure to maximize the duplexing efficiency influenced the evolution of the genetic code. Disappointingly, the results fail to support the hypothesis that the genetic code was influenced in this way. In fact, applying both methods to samples of realistic alternative genetic codes shows that the duplexing of the genetic code found in nature is just slightly less efficient than average. The implications of this negative result are considered in the final section of the paper.
Collapse
|
16
|
The evolution of the genetic code: Impasses and challenges. Biosystems 2018; 164:217-225. [DOI: 10.1016/j.biosystems.2017.10.006] [Citation(s) in RCA: 38] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2017] [Revised: 10/06/2017] [Accepted: 10/09/2017] [Indexed: 01/17/2023]
|
17
|
de Oliveira LL, Freitas AA, Tinós R. Multi-objective genetic algorithms in the study of the genetic code’s adaptability. Inf Sci (N Y) 2018. [DOI: 10.1016/j.ins.2017.10.022] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
|