1
|
O'Connor PBF. The Evolutionary Transition of the RNA World to Obcells to Cellular-Based Life. J Mol Evol 2024; 92:278-285. [PMID: 38683368 DOI: 10.1007/s00239-024-10171-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/04/2023] [Accepted: 04/08/2024] [Indexed: 05/01/2024]
Abstract
The obcell hypothesis is a proposed route for the RNA world to develop into a primitive cellular one. It posits that this transition began with the emergence of the proto-ribosome, which enabled RNA to colonise the external surface of lipids by the synthesis of amphipathic peptidyl-RNAs. The obcell hypothesis also posits that the emergence of a predation-based ecosystem provided a selection mechanism for continued sophistication amongst early life forms. Here, I argue for this hypothesis owing to its significant explanatory power: it offers a rationale for why a ribosome that was initially capable only of producing short non-coded peptides was advantageous, and it avoids the issues related to maintaining a replicating RNA inside a lipid enclosure. I develop this model by proposing that the evolutionary selection for improved membrane anchors resulted in the emergence of primitive membrane pores, which enabled obcells to gradually evolve into a cellular morphology. Moreover, I introduce a model of obcell production which proposes that tRNAs developed from primers of the RNA world.
|
2
|
Russell MJ. A self-sustaining serpentinization mega-engine feeds the fougerite nanoengines implicated in the emergence of guided metabolism. Front Microbiol 2023; 14:1145915. [PMID: 37275164 PMCID: PMC10236563 DOI: 10.3389/fmicb.2023.1145915] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Received: 01/16/2023] [Accepted: 03/22/2023] [Indexed: 06/07/2023] Open
Abstract
The demonstration by Ivan Barnes et al. that the serpentinization of fresh Alpine-type ultramafic rocks results in the exhalation of hot alkaline fluids is foundational to the submarine alkaline vent theory (AVT) for life's emergence to its 'improbable' thermodynamic state. In AVT, such alkaline fluids ≤150°C, bearing H2 > CH4 > HS⁻ (generated and driven convectively by a serpentinizing exothermic mega-engine operating in the ultramafic crust), exhale into the iron-rich, CO2 >>> NO3⁻-bearing Hadean ocean to result in hydrothermal precipitate mounds comprising macromolecular ferroferric-carbonate oxyhydroxide and minor sulfide. As the nanocrystalline minerals fougerite/green rust and mackinawite (FeS), they compose the spontaneously precipitated inorganic membranes that keep the highly contrasting solutions apart, thereby maintaining redox and pH disequilibria. They do so in the form of fine chimneys and chemical gardens. The same disequilibria drive the reduction of CO2 to HCOO⁻ or CO, and the oxidation of CH4 to a methyl group, the two products reacting to form acetate in a sequence antedating the 'energy-producing' acetyl coenzyme-A pathway. Fougerite is a 2D-layered mineral in which the hydrous interlayers themselves harbor 2D solutions, in effect constricted to ~1D by preferentially directed electron hopping/tunneling, and proton Grotthuss 'bucket-brigading' when subject to charge. As a redox-driven nanoengine or peristaltic pump, fougerite forces the ordered reduction of nitrate to ammonium, the amination of pyruvate and oxalate to alanine and glycine, and their condensation to short peptides. In turn, these peptides have the flexibility to sequester the founding inorganic iron oxyhydroxide, sulfide, and pyrophosphate clusters, to produce metal- and phosphate-dosed organic films and cells. As the feed to the hydrothermal mound fails, the only equivalent sustenance on offer to the first autotrophs is the still mildly serpentinizing upper crust beneath.
While the conditions here are very much less bountiful, they do offer the similar feed and disequilibria the survivors are accustomed to. Sometime during this transition, a replicating non-ribosomal guidance system is discovered to provide the rules to take on the incrementally changing surroundings. The details of how these replicating apparatuses emerged are the hard problem, but by doing so the progenote archaea and bacteria could begin to colonize what would become the deep biosphere. Indeed, that the anaerobic nitrate-respiring methanotrophic archaea and the deep-branching Acetothermia presently comprise a portion of that microbiome occupying serpentinizing rocks offers circumstantial support for this notion. However, the inescapable, if jarring conclusion is drawn that, absent fougerite/green rust, there would be no structured channelway to life.
Affiliation(s)
- Michael J. Russell
  - Dipartimento di Chimica, Università degli Studi di Torino, Torino, Italy
|
3
|
An Optimal Lysis Time Maximizes Bacteriophage Fitness in Quasi-Continuous Culture. mBio 2022; 13:e0359321. [PMID: 35467417 PMCID: PMC9239172 DOI: 10.1128/mbio.03593-21] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/20/2022] Open
Abstract
Optimality models have a checkered history in evolutionary biology. While optimality models have been successful in providing valuable insight into the evolution of a wide variety of biological traits, a common objection is that optimality models are overly simplistic and ignore organismal genetics. We revisit evolutionary optimization in the context of a major bacteriophage life history trait, lysis time. Lysis time refers to the period spanning phage infection of a host cell and its lysis, whereupon phage progenies are released. Lysis time, therefore, directly determines phage fecundity assuming progeny assembly does not exhaust host resources prior to lysis. Noting that previous tests of lysis time optimality rely on batch culture, we implemented a quasi-continuous culture system to observe productivity of a panel of isogenic phage λ genotypes differing in lysis time. We report that under our experimental conditions, λ phage productivity is maximized around optimal lysis times ranging from 60 to 100 min, and λ wildtype strain falls within this range. It would appear that natural selection on phage λ lysis time uncovered a set of genetic solutions that optimized progeny production in its ecological milieu relative to alternative genotypes. We discuss this finding in light of recent results that lysis time variation is also minimized in the strains with lysis times closer to the λ wild-type strain. IMPORTANCE Optimality theory presents the idea that natural selection acts on organismal traits to produce genotypes that maximize organismal survival and reproduction. As such, optimality theory is a valuable tool in guiding our understanding of the genetic constraints and tradeoffs organisms experience as their genotypes are selected to produce optimal solutions to biological problems. However, optimality theory is often critiqued as being overly simplistic and ignoring the roles of chance and history in the evolution of organismal traits. 
We show here that the wild-type genotype of a popular laboratory model organism, the bacteriophage λ, produces a phenotype for a major life history trait, lysis time, that maximizes the reproductive success of bearers of that genotype relative to other possible genotypes. This result demonstrates, as is rarely shown experimentally, that natural selection can achieve optimal solutions to ecological challenges.
|
4
|
Snyder-Beattie AE, Bonsall MB. Catastrophe risk can accelerate unlikely evolutionary transitions. Proc Biol Sci 2022; 289:20212711. [PMID: 35350860 PMCID: PMC8965398 DOI: 10.1098/rspb.2021.2711] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/09/2023] Open
Abstract
Intelligent life has emerged late in Earth's habitable lifetime, and required a preceding series of key evolutionary transitions. A simple model (the Carter model) explains the late arrival of intelligent life by positing these evolutionary transitions were exceptionally unlikely 'critical steps'. An alternative model (the neocatastrophism hypothesis) proposes that intelligent life was delayed by frequent catastrophes that served to set back evolutionary innovation. Here, we generalize the Carter model and explore this hypothesis by including catastrophes that can 'undo' an evolutionary transition. Introducing catastrophes or evolutionary dead ends can create situations in which critical steps occur rapidly or in clusters, suggesting that past estimates of the number of critical steps could be underestimated. If catastrophes affect complex life more than simple life, the critical steps will also exhibit a pattern of acceleration towards the present, suggesting that the increase in biological complexity over the past 500 Myr could reflect previously overlooked evolutionary transitions. Furthermore, our results have implications for understanding the different explanations (critical steps versus neo-catastrophes) for the evolution of intelligent life and the so-called Fermi paradox, the observation that intelligent life appears rare in the observable Universe.
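The basic critical-steps idea can be illustrated with a small Monte Carlo sketch. It covers only the plain Carter-style setting (sequential, individually unlikely steps within a fixed habitable lifetime) and does not include the catastrophes that the abstract adds. The function name, the rate, the lifetime and the number of steps are all illustrative assumptions, not values from the paper.

```python
import random

def simulate_critical_steps(n_steps, rate, lifetime, trials, seed=0):
    """Return completion times of the runs in which every step finished in time.

    Assumption: each step waits an exponentially distributed time with the
    same rate, and the steps must occur strictly one after another.
    """
    rng = random.Random(seed)
    completed = []
    for _ in range(trials):
        t = 0.0
        for _ in range(n_steps):
            t += rng.expovariate(rate)
            if t > lifetime:
                break  # ran out of habitable time
        else:
            completed.append(t)  # all steps finished within the lifetime
    return completed

# Hypothetical parameters: 3 steps, each expected to take 5 lifetimes.
times = simulate_critical_steps(n_steps=3, rate=0.2, lifetime=1.0, trials=200_000)
mean_time = sum(times) / len(times)
print(f"{len(times)} successful runs, mean completion time {mean_time:.2f}")
```

Under these assumptions only a small fraction of runs succeed, and those that do tend to finish late, at roughly three quarters of the lifetime for three steps. That is the "late arrival" pattern the Carter model is meant to explain.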
Affiliation(s)
- Andrew E. Snyder-Beattie
  - Mathematical Ecology Research Group, Department of Zoology, University of Oxford, Oxford OX1 3PS, UK
- Michael B. Bonsall
  - Mathematical Ecology Research Group, Department of Zoology, University of Oxford, Oxford OX1 3PS, UK
|
5
|
Shelah S, Strüngmann L. Infinite combinatorics in mathematical biology. Biosystems 2021; 204:104392. [PMID: 33731280 DOI: 10.1016/j.biosystems.2021.104392] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 11/17/2020] [Revised: 02/11/2021] [Accepted: 02/18/2021] [Indexed: 12/12/2022]
Abstract
Is it possible to apply infinite combinatorics and (infinite) set theory in theoretical biology? We do not know the answer yet, but in this article we try to present some techniques from infinite combinatorics and set theory that have been used over the last decades to prove existence results and independence theorems in algebra and that might have the flexibility and generality to be used in theoretical biology as well. In particular, we will introduce the theory of forcing and an algebraic construction technique based on trees and forests using infinite binary sequences. We will also present an overview of the theory of circular codes. Such codes have been found in the genetic information and are assumed to play an important role in error-detecting and error-correcting mechanisms during the process of translation. Finally, examples and constructions of infinite mixed circular codes using binary sequences hopefully show some similarity between these theories, a starting point for future applications.
Affiliation(s)
- Saharon Shelah
  - Einstein Institute of Mathematics, The Hebrew University of Jerusalem, 9190401, Jerusalem, Israel; Department of Mathematics, Rutgers University, Piscataway, NJ, 08854-8019, USA.
- Lutz Strüngmann
  - Institute of Mathematical Biology, Faculty of Computer Sciences, Mannheim University of Applied Sciences, 68163, Mannheim, Germany.
|
6
|
Nesterov-Mueller A, Popov R, Seligmann H. Combinatorial Fusion Rules to Describe Codon Assignment in the Standard Genetic Code. Life (Basel) 2020; 11:life11010004. [PMID: 33374866 PMCID: PMC7824455 DOI: 10.3390/life11010004] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 12/01/2020] [Revised: 12/15/2020] [Accepted: 12/21/2020] [Indexed: 11/16/2022] Open
Abstract
We propose combinatorial fusion rules that describe the codon assignment in the standard genetic code simply and uniformly for all canonical amino acids. These rules become obvious if the origin of the standard genetic code is considered as the result of a fusion of four protocodes: two dominant AU and GC protocodes and two recessive AU and GC protocodes. The biochemical meaning of the fusion rules consists of retaining the complementarity between cognate codons of the small hydrophobic amino acids and large charged or polar amino acids within the protocodes. The proto tRNAs were assembled in the form of two kissing hairpins with 9-base and 10-base loops in the case of dominant protocodes and two 9-base loops in the case of recessive protocodes. The fusion rules reveal the connection between the stop codons, the non-canonical amino acids pyrrolysine and selenocysteine, and deviations in the translation of mitochondria. Using fusion rules, we predicted the existence of additional amino acids that are essential for the development of the standard genetic code. The validity of the proposed partition of the genetic code into dominant and recessive protocodes is considered with reference to state-of-the-art hypotheses. The formation of two aminoacyl-tRNA synthetase classes is compatible with the four-protocode partition.
Affiliation(s)
- Alexander Nesterov-Mueller
  - Institute of Microstructure Technology, Karlsruhe Institute of Technology (KIT), 76344 Eggenstein-Leopoldshafen, Germany
- Roman Popov
  - Institute of Microstructure Technology, Karlsruhe Institute of Technology (KIT), 76344 Eggenstein-Leopoldshafen, Germany
- Hervé Seligmann
  - Institute of Microstructure Technology, Karlsruhe Institute of Technology (KIT), 76344 Eggenstein-Leopoldshafen, Germany
  - The National Natural History Collections, The Hebrew University of Jerusalem, Jerusalem 91904, Israel
  - Laboratory AGEIS EA 7407, Team Tools for e-GnosisMedical & LabcomCNRS/UGA/OrangeLabs Telecoms4Health, Faculty of Medicine, Université Grenoble Alpes, F-38700 La Tronche, France
|
7
|
Seligmann H, Warthi G. Natural pyrrolysine-biased translation of stop codons in mitochondrial peptides entirely coded by expanded codons. Biosystems 2020; 196:104180. [PMID: 32534170 DOI: 10.1016/j.biosystems.2020.104180] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 04/14/2020] [Revised: 06/02/2020] [Accepted: 06/02/2020] [Indexed: 12/31/2022]
Abstract
During the noncanonical deletion transcription, k nucleotides are systematically skipped/deleted after each transcribed trinucleotide producing deletion-RNAs (delRNAs). Peptides matching delRNAs either result from (a) canonical translation of delRNAs; or (b) noncanonical translation of regular transcripts along expanded codons. Only along frame "0" (start site) (a) and (b) produce identical peptides. Here, mitochondrial mass spectrometry data analyses assume expanded codon/del-transcription with 3 + k (k from 0 to 12) nucleotides. Detected peptides map preferentially on previously identified delRNAs. More peptides were detected for k (1-12) when del-transcriptional and expanded codon translations start sites coincide (i.e. the 0th frame) than for frames +1 or +2. Hence, both (a) and (b) produced peptides identified here. Biases for frame 0 decrease for k > 2, reflecting codon/anticodon expansion limits. Further analyses find preferential pyrrolysine insertion at stop codons, suggesting Pyl-specific mitochondrial suppressor tRNAs loaded by Pyl-specific tRNA synthetases with unknown origins. Pyl biases at stops are stronger for regular than expanded codons suggesting that Pyl-tRNAs are less competitive with near-cognate tRNAs in expanded codon contexts. Statistical biases for these findings exclude that detected peptides are experimental and/or bioinformatic artefacts implying both del-transcription and expanded codons translation occur in human mitochondria.
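The two routes described in this abstract, (a) deletion transcription followed by regular translation and (b) reading a regular transcript along expanded codons, can be sketched in a few lines. The function names and the toy sequence are hypothetical, and the sketch assumes that the k extra nucleotides of an expanded codon are the trailing, silent ones.

```python
def del_transcribe(seq, k):
    """Keep each trinucleotide, then skip the next k nucleotides (a delRNA)."""
    if k < 0:
        raise ValueError("k must be non-negative")
    step = 3 + k
    return "".join(seq[i:i + 3] for i in range(0, len(seq), step))

def expanded_codons(seq, k):
    """Split seq, starting at frame 0, into complete codons of length 3 + k."""
    step = 3 + k
    return [seq[i:i + step] for i in range(0, len(seq) - step + 1, step)]

toy = "ATGCCGTTAGGC"  # illustrative sequence, not real mitochondrial data
print(del_transcribe(toy, 1))   # triplets kept after skipping 1 nucleotide each
print(expanded_codons(toy, 1))  # tetracodons read from frame 0
```

At frame 0 the first three nucleotides of each expanded codon are exactly the triplets of the delRNA, which is why the abstract notes that routes (a) and (b) give identical peptides only along that frame.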
Affiliation(s)
- Hervé Seligmann
  - The National Natural History Collections, The Hebrew University of Jerusalem, 91404, Jerusalem, Israel; Université Grenoble Alpes, Faculty of Medicine, Laboratory AGEIS EA 7407, Team Tools for e-Gnosis Medical, F-38700, La Tronche, France.
- Ganesh Warthi
  - Aix-Marseille University, IRD, VITROME, Institut Hospitalo-Universitaire Méditerranée-Infection, Marseille, France.
|
8
|
Abstract
The origin of the modern genetic code and the mechanisms that have contributed to its present form raise many questions. The main goal of this work is to test two hypotheses concerning the development of the genetic code for their compatibility and complementarity and to see whether they could benefit from each other. On the one hand, Gonzalez, Giannerini and Rosa developed a theory based on four-base codons, which they called tesserae. This theory can explain the degeneracy of the modern vertebrate mitochondrial code. On the other hand, in the 1990s, so-called circular codes were discovered in nature, which seem to ensure the maintenance of a correct reading frame during the translation process. It turns out that the two concepts not only do not contradict each other but, on the contrary, complement and enrich each other.
|
9
|
Gonzalez DL, Giannerini S, Rosa R. On the origin of degeneracy in the genetic code. Interface Focus 2019; 9:20190038. [PMID: 31641429 PMCID: PMC6802134 DOI: 10.1098/rsfs.2019.0038] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Received: 04/18/2019] [Accepted: 09/05/2019] [Indexed: 01/11/2023] Open
Abstract
The degeneracy of amino acid coding is one of the most crucial and enigmatic aspects of the genetic code. Different theories about the origin of the genetic code have been developed. However, to date, there is no comprehensive hypothesis on the mechanism that might have generated the degeneracy as we observe it. Here, we provide a new theory that explains the origin of the degeneracy based only on symmetry principles. The approach allows one to describe exactly the degeneracy of the early code (progenitor of the genetic code of LUCA, the last universal common ancestor) which is hypothesized to have the same degeneracy as the present vertebrate mitochondrial genetic code. The theory is based upon the tessera code, that fits as the progenitor of the early code. Moreover, we describe in detail the possible evolutionary transitions implied by our theory. The approach is supported by a unified mathematical framework that accounts for the degeneracy properties of both nuclear and mitochondrial genetic codes. Our work provides a new perspective to the understanding of the origin of the genetic code and the roles of symmetry principles in the organization of genetic information.
Affiliation(s)
- D L Gonzalez
  - CNR-IMM, UOS di Bologna, Via Gobetti 101, 40129 Bologna, Italy; Dipartimento di Scienze Statistiche, Università di Bologna, via delle Belle Arti 41, 40126 Bologna, Italy
- S Giannerini
  - Dipartimento di Scienze Statistiche, Università di Bologna, via delle Belle Arti 41, 40126 Bologna, Italy
- R Rosa
  - CNR-IMM, UOS di Bologna, Via Gobetti 101, 40129 Bologna, Italy
|
10
|
Rumer's transformation: A symmetry puzzle standing for half a century. Biosystems 2019; 187:104036. [PMID: 31589913 DOI: 10.1016/j.biosystems.2019.104036] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Received: 08/31/2019] [Revised: 09/18/2019] [Accepted: 09/19/2019] [Indexed: 01/09/2023]
Abstract
In 1966, only a few months after the complete elucidation of the standard nuclear genetic code (Kay, 2000), the Russian theoretical physicist Yury Borisovich Rumer uncovered the existence of a particular symmetry (Rumer, 1966): when the keto-amino transformation (also known as Rumer's transformation) is applied to the bases of a codon, the degeneracy of the transformed codon changes. In particular, if the amino acid associated with the starting codon has degeneracy 4, then the amino acid associated with the transformed codon has degeneracy 1, 2 or 3 (and vice versa). Half a century after this discovery, and despite the universality of Rumer's symmetry, little is known about its origin and its possible biological significance. In this article we show that Rumer's symmetry could have originated in an ancestral version of the genetic code, i.e., the pre-early code, and is a natural consequence of the stereo-chemical symmetries of the ancestral synthesis machinery working around such code (Gonzalez et al., 2019). Moreover, the conservation of Rumer's symmetry through evolutionary periods suggests a connection with key biological features. In this respect, intriguing possibilities include those of error detection/correction, control over the synthesis of proteins, and frame maintenance. To a certain extent, such ideas have been explored in the framework of a mathematical model of the genetic code, the non-power model (Gonzalez, 2004; Gonzalez, 2008; Gonzalez et al., 2016), whose definition of dichotomic classes naturally includes Rumer's symmetry (Gonzalez, 2008; Gonzalez et al., 2006, 2008), and in the theory of circular codes (Arquès and Michel, 1996; Gonzalez et al., 2011; Fimmel et al., 2015).
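The symmetry described here can be checked directly against the standard nuclear code. The sketch below uses the DNA alphabet and treats degeneracy at the level of codon boxes (the four codons sharing their first two bases), which is a simplifying reading of the abstract. The helper names are illustrative; the amino acid string is the standard table in TCAG order.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... ('*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Keto-amino exchange: T <-> G (keto) and A <-> C (amino).
RUMER = str.maketrans("TGAC", "GTCA")

def rumer(seq):
    """Apply Rumer's transformation to every base of seq."""
    return seq.translate(RUMER)

def is_fourfold_box(doublet):
    """True if all four codons starting with this doublet encode one amino acid."""
    return len({CODON_TABLE[doublet + b] for b in BASES}) == 1

boxes = [a + b for a in BASES for b in BASES]
for d in boxes:
    print(d, is_fourfold_box(d), "->", rumer(d), is_fourfold_box(rumer(d)))
```

In this box-level view, each of the eight fully degenerate doublets maps to a doublet whose box is split between several amino acids or stop signals, and vice versa.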
|
11
|
Seligmann H, Warthi G. Chimeric Translation for Mitochondrial Peptides: Regular and Expanded Codons. Comput Struct Biotechnol J 2019; 17:1195-1202. [PMID: 31534643 PMCID: PMC6742854 DOI: 10.1016/j.csbj.2019.08.006] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Received: 06/24/2019] [Revised: 08/19/2019] [Accepted: 08/21/2019] [Indexed: 02/07/2023] Open
Abstract
Frameshifting protein translation occasionally results from insertion of amino acids at isolated mono- or dinucleotide-expanded codons by tRNAs with expanded anticodons. Previous analyses of two different types of human mitochondrial MS proteomic data (Fisher and Waters technologies) detect peptides entirely corresponding to expanded codon translation. Here, these proteomic data are reanalyzed searching for peptides consisting of at least eight consecutive amino acids translated according to regular tricodons, and at least eight adjacent consecutive amino acids translated according to expanded codons. Both datasets include chimerically translated peptides (mono- and dinucleotide expansions, 42 and 37, respectively). The regular tricodon-encoded part of some chimeric peptides corresponds to standard human mitochondrial proteins (mono- and dinucleotide expansions, six (AT6, CytB, ND1, 2xND2, ND5) and one (ND1), respectively). Chimeric translation probably increases the diversity of mitogenome-encoded proteins, putatively producing functional proteins. These might result from translation by tRNAs with expanded anticodons, or from regular tricodon translation of RNAs where transcription/posttranscriptional edition systematically deleted mono- or dinucleotides after each trinucleotide. The pairwise matched combination of adjacent peptide parts translated from regular and expanded codons strengthens the hypothesis that translation of stretches of consecutive expanded codons occurs. Results indicate statistical translation producing distributions of alternative proteins. Genetic engineering should account for potential unexpected, unwanted secondary products.
Affiliation(s)
- Hervé Seligmann
  - The National Natural History Collections, The Hebrew University of Jerusalem, 91404 Jerusalem, Israel
- Ganesh Warthi
  - Aix-Marseille University, IRD, VITROME, Institut Hospitalo-Universitaire Méditerranée-Infection, Marseille, France
|
12
|
Fimmel E, Strüngmann L. Linear codes and the mitochondrial genetic code. Biosystems 2019; 184:103990. [PMID: 31326431 DOI: 10.1016/j.biosystems.2019.103990] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Received: 05/09/2019] [Revised: 07/09/2019] [Accepted: 07/10/2019] [Indexed: 11/29/2022]
Abstract
The origin of the genetic code can certainly be regarded as one of the most challenging problems in the theory of molecular evolution. Thus the known variants of the genetic code and a possible common ancestry of them have been studied extensively in the literature. Gonzalez et al. (2012) developed the theory of a primeval mitochondrial genetic code composed of four-base codons. These were called tesserae, and it was shown that the tessera code has some remarkable error detection capabilities. In our paper we will show that, using classical coding theory, we can construct the tessera code as a linear coding of the standard genetic code, and that at the same time it can be deduced from the code of all dinucleotides by Plotkin's construction. This shows that the tessera model of the mitochondrial code does not just have a biological explanation but also has a clear mathematical structure. This underlines the role that the tessera model might have played in evolution.
Affiliation(s)
- Elena Fimmel
  - Institute of Mathematical Biology, Faculty for Computer Sciences, and Competence Center for Algorithmic and Mathematical Methods in Biology, Biotechnology and Medicine, Mannheim University of Applied Sciences, 68163 Mannheim, Germany.
- Lutz Strüngmann
  - Institute of Mathematical Biology, Faculty for Computer Sciences, and Competence Center for Algorithmic and Mathematical Methods in Biology, Biotechnology and Medicine, Mannheim University of Applied Sciences, 68163 Mannheim, Germany.
|
13
|
Fimmel E, Michel CJ, Pirot F, Sereni JS, Strüngmann L. Mixed circular codes. Math Biosci 2019; 317:108231. [PMID: 31325443 DOI: 10.1016/j.mbs.2019.108231] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Received: 09/07/2018] [Revised: 07/16/2019] [Accepted: 07/17/2019] [Indexed: 12/11/2022]
Abstract
By an extensive statistical analysis in genes of bacteria, archaea, eukaryotes, plasmids and viruses, a maximal C3-self-complementary trinucleotide circular code has been found to have the highest average occurrence in the reading frame of the ribosome during translation. Circular codes may play an important role in maintaining the correct reading frame. On the other hand, as several evolutionary theories propose primeval codes based on dinucleotides, trinucleotides and tetranucleotides, mixed circular codes were investigated. By using a graph-theoretical approach of circular codes recently developed, we study mixed circular codes, which are the union of a dinucleotide circular code, a trinucleotide circular code and a tetranucleotide circular code. Maximal mixed circular codes of (di,tri)-nucleotides, (tri,tetra)-nucleotides and (di,tri,tetra)-nucleotides are constructed, respectively. In particular, we show that any maximal dinucleotide circular code of size 6 can be embedded into a maximal mixed (di,tri)-nucleotide circular code such that its trinucleotide component is a maximal C3-comma-free code. The growth function of self-complementary mixed circular codes of dinucleotides and trinucleotides is given. Self-complementary mixed circular codes could have been involved in primitive genetic processes.
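The graph-theoretical approach mentioned in this abstract can be sketched as follows, assuming the known criterion for trinucleotide codes: a code is circular exactly when the directed graph with an edge from every proper prefix of a codon to the matching suffix contains no cycle. The function names are illustrative. The same construction runs on words of mixed length, but whether acyclicity is then the right criterion should be checked against the paper rather than taken from this sketch.

```python
def code_graph(code):
    """Build edges prefix -> suffix for every split point of every word."""
    edges = {}
    for word in code:
        for i in range(1, len(word)):
            edges.setdefault(word[:i], set()).add(word[i:])
            edges.setdefault(word[i:], set())
    return edges

def is_acyclic(edges):
    """Depth-first search that reports False as soon as a back edge is found."""
    WHITE, GREY, BLACK = 0, 1, 2
    colour = {v: WHITE for v in edges}

    def visit(v):
        colour[v] = GREY
        for w in edges[v]:
            if colour[w] == GREY:
                return False  # back edge: the graph has a cycle
            if colour[w] == WHITE and not visit(w):
                return False
        colour[v] = BLACK
        return True

    return all(colour[v] != WHITE or visit(v) for v in list(edges))

def is_circular(code):
    """Acyclicity test for a trinucleotide code (assumed criterion, see above)."""
    return is_acyclic(code_graph(code))

print(is_circular({"AAC", "GTT"}))  # no cycle in the graph
print(is_circular({"AAC", "ACA"}))  # A -> AC -> A is a cycle
```

A code that contains both a codon and one of its cyclic shifts, such as AAC and ACA, always produces a cycle, which matches the intuition that such a pair cannot fix a unique reading frame.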
Affiliation(s)
- Elena Fimmel
  - Institute of Mathematical Biology, Faculty for Computer Sciences, Mannheim University of Applied Sciences, Mannheim 68163, Germany.
- Christian J Michel
  - Theoretical Bioinformatics, ICube, C.N.R.S., University of Strasbourg, 300 Boulevard Sébastien Brant, Illkirch 67400, France.
- François Pirot
  - Theoretical Bioinformatics, ICube, C.N.R.S., University of Strasbourg, 300 Boulevard Sébastien Brant, Illkirch 67400, France; LORIA (Orpailleur) and Dept. of Mathematics, University of Lorraine and Radboud University, Vandœuvre-lès-Nancy, France and Nijmegen, Netherlands.
- Jean-Sébastien Sereni
  - Theoretical Bioinformatics, ICube, C.N.R.S., University of Strasbourg, 300 Boulevard Sébastien Brant, Illkirch 67400, France.
- Lutz Strüngmann
  - Institute of Mathematical Biology, Faculty for Computer Sciences, Mannheim University of Applied Sciences, Mannheim 68163, Germany.
|
14
|
Alignment-based and alignment-free methods converge with experimental data on amino acids coded by stop codons at split between nuclear and mitochondrial genetic codes. Biosystems 2018; 167:33-46. [DOI: 10.1016/j.biosystems.2018.03.002] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Received: 01/15/2018] [Revised: 03/18/2018] [Accepted: 03/19/2018] [Indexed: 12/11/2022]
|
15
|
Reviewing evidence for systematic transcriptional deletions, nucleotide exchanges, and expanded codons, and peptide clusters in human mitochondria. Biosystems 2017; 160:10-24. [PMID: 28807694 DOI: 10.1016/j.biosystems.2017.08.002] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Received: 03/29/2017] [Revised: 07/26/2017] [Accepted: 08/04/2017] [Indexed: 12/12/2022]
Abstract
Polymerization sometimes transforms sequences by (a) systematic deletions of mono- or dinucleotides after trinucleotides, or (b) 23 systematic nucleotide exchanges (9 symmetric, X<>Y, e.g. G<>T; 14 asymmetric, X > Y > Z > X, e.g. A > G > T > A), producing del- and swinger RNAs. Some peptides correspond to del- and swinger RNA translations, also according to tetracodons, codons expanded by a silent nucleotide. Here new analyses assume different proteolytic patterns, partially alleviating false negative peptide detection biases and expanding noncanonical mitoproteome profiles. Mito-genomic, -transcriptomic and -proteomic evidence for noncanonical transcriptions and translations is reviewed, and clusters of del- and swinger peptides (also along tetracodons) are described. Noncanonical peptide clusters indicate regulated expression of cryptically encoded mitochondrial protein coding genes. These candidate noncanonical proteins don't resemble known proteins.
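The count of 23 exchanges can be reproduced by enumerating every non-identity permutation of the four nucleotides. The sketch below classifies them by cycle structure, which is one plausible reading of "symmetric" (only swaps) versus "asymmetric" (a cycle of three or four bases); the function and variable names are illustrative.

```python
from itertools import permutations

BASES = "ACGT"

def cycle_lengths(mapping):
    """Lengths of the cycles of a permutation given as a dict base -> base."""
    seen, lengths = set(), []
    for start in BASES:
        if start in seen:
            continue
        length, b = 0, start
        while b not in seen:
            seen.add(b)
            b = mapping[b]
            length += 1
        lengths.append(length)
    return lengths

# Every permutation of the four bases except the identity.
exchanges = [dict(zip(BASES, p)) for p in permutations(BASES) if "".join(p) != BASES]
symmetric = [m for m in exchanges if max(cycle_lengths(m)) == 2]
asymmetric = [m for m in exchanges if max(cycle_lengths(m)) > 2]

def swinger(seq, mapping):
    """Apply one systematic exchange to a whole sequence."""
    return "".join(mapping[b] for b in seq)

print(len(exchanges), len(symmetric), len(asymmetric))
print(swinger("GATTACA", {"A": "A", "C": "C", "G": "T", "T": "G"}))  # G<>T exchange
```

Under this reading the nine symmetric exchanges are the six single swaps plus the three double swaps, and the fourteen asymmetric ones are the eight 3-cycles plus the six 4-cycles.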
|
16
|
El Houmami N, Seligmann H. Evolution of Nucleotide Punctuation Marks: From Structural to Linear Signals. Front Genet 2017; 8:36. [PMID: 28396681 PMCID: PMC5366352 DOI: 10.3389/fgene.2017.00036] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Received: 10/14/2016] [Accepted: 03/13/2017] [Indexed: 01/13/2023] Open
Abstract
We present an evolutionary hypothesis assuming that signals marking nucleotide synthesis (DNA replication and RNA transcription) evolved from multi- to unidimensional structures, and were carried over from transcription to translation. This evolutionary scenario presumes that signals combining secondary and primary nucleotide structures are evolutionary transitions. Mitochondrial replication initiation fits this scenario. Some observations reported in the literature corroborate that several signals for nucleotide synthesis function in translation, and vice versa. (a) Polymerase-induced frameshift mutations occur preferentially at translational termination signals (nucleotide deletion is interpreted as termination of nucleotide polymerization, paralleling the role of stop codons in translation). (b) Stem-loop hairpin presence/absence modulates codon-amino acid assignments, showing that translational signals sometimes combine primary and secondary nucleotide structures (here codon and stem-loop). (c) Homopolymer nucleotide triplets (AAA, CCC, GGG, TTT) cause transcriptional and ribosomal frameshifts. Here we find in recently described human mitochondrial RNAs that systematically lack mono-, dinucleotides after each trinucleotide (delRNAs) that delRNA triplets include 2x more homopolymers than mitogenome regions not covered by delRNA. Further analyses of delRNAs show that the natural circular code X (a little-known group of 20 translational signals enabling ribosomal frame retrieval consisting of 20 codons {AAC, AAT, ACC, ATC, ATT, CAG, CTC, CTG, GAA, GAC, GAG, GAT, GCC, GGC, GGT, GTA, GTC, GTT, TAC, TTC} universally overrepresented in coding versus other frames of gene sequences), regulates frameshift in transcription and translation. This dual transcription and translation role confirms for X the hypothesis that translational signals were carried over from transcriptional signals.
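Two structural properties of the 20-codon set X listed in this abstract are easy to check by hand or in code: it is closed under reverse complementation, and it shares no codon with its own cyclic shifts. The sketch below tests both; the helper names are illustrative, and the disjointness check is only a necessary condition for a circular code, not a proof of circularity.

```python
# The 20 codons of the circular code X as listed in the abstract.
X = set(
    "AAC AAT ACC ATC ATT CAG CTC CTG GAA GAC "
    "GAG GAT GCC GGC GGT GTA GTC GTT TAC TTC".split()
)
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(codon):
    """Complement each base and reverse the order."""
    return codon.translate(COMPLEMENT)[::-1]

def shift(codon, n):
    """Cyclically shift a codon left by n positions (frame +n)."""
    return codon[n:] + codon[:n]

X1 = {shift(c, 1) for c in X}  # codons seen when reading X in frame +1
X2 = {shift(c, 2) for c in X}  # codons seen when reading X in frame +2
homopolymers = {b * 3 for b in "ACGT"}

print({reverse_complement(c) for c in X} == X)           # self-complementary
print(X.isdisjoint(X1), X.isdisjoint(X2))                # no shifted codon is in X
print(len(X | X1 | X2 | homopolymers))                   # codons covered in total
```

Together with the four homopolymer triplets, X and its two shifted copies account for all 64 codons, so a codon from X read in a shifted frame never looks like another codon from X.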
Affiliation(s)
- Nawal El Houmami
- URMITE, Aix Marseille Université UM63, CNRS 7278, IRD 198, INSERM 1095, IHU - Méditerranée Infection Marseille, France
- Hervé Seligmann
- URMITE, Aix Marseille Université UM63, CNRS 7278, IRD 198, INSERM 1095, IHU - Méditerranée Infection Marseille, France
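Illustrative sketch (not taken from the paper): the abstract above lists the 20 codons of the circular code X and compares homopolymer triplet content between sequence sets. Under those assumptions, the Python below checks two simple properties of the listed set (closure under reverse complement, and no two codons being cyclic shifts of each other) and counts homopolymer and X triplets in one reading frame. The helper names are hypothetical, and the cyclic-shift check is only a necessary condition for circularity, not a full proof.

```python
# The 20 codons of the circular code X, copied from the abstract above.
X = set(
    "AAC AAT ACC ATC ATT CAG CTC CTG GAA GAC "
    "GAG GAT GCC GGC GGT GTA GTC GTT TAC TTC".split()
)

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(codon):
    """Return the reverse complement of a DNA codon."""
    return codon.translate(COMPLEMENT)[::-1]


def cyclic_shifts(codon):
    """Return the two non-trivial cyclic shifts of a trinucleotide."""
    return {codon[1:] + codon[:1], codon[2:] + codon[:2]}


def triplet_stats(seq, frame=0):
    """Fractions of homopolymer triplets and X codons in one reading frame."""
    triplets = [seq[i:i + 3] for i in range(frame, len(seq) - 2, 3)]
    if not triplets:
        return {"homopolymer": 0.0, "in_X": 0.0}
    homopolymers = sum(len(set(t)) == 1 for t in triplets)
    x_codons = sum(t in X for t in triplets)
    return {
        "homopolymer": homopolymers / len(triplets),
        "in_X": x_codons / len(triplets),
    }
```

Calling `triplet_stats` on a delRNA-covered region and on an uncovered region would give the kind of homopolymer ratio the abstract reports, although the actual analysis in the paper may differ in its details.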
|
17
|
Nemzer LR. A binary representation of the genetic code. Biosystems 2017; 155:10-19. [PMID: 28300609 DOI: 10.1016/j.biosystems.2017.03.001] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Received: 10/16/2016] [Revised: 03/03/2017] [Accepted: 03/06/2017] [Indexed: 12/23/2022]
Abstract
This article introduces a novel binary representation of the canonical genetic code based on both the structural similarities of the nucleotides, as well as the physicochemical properties of the encoded amino acids. Each of the four mRNA bases is assigned a unique 2-bit identifier, so that the 64 triplet codons are each indexed by a 6-bit label. The ordering of the bits reflects the hierarchical organization manifested by the DNA replication/repair and tRNA translation systems. In this system, transition and transversion mutations are naturally expressed as binary operations, and the severities of the different point mutations can be analyzed. Using a principal component analysis, it is shown that the physicochemical properties of amino acids related to protein folding also correlate with certain bit positions of their respective labels. Thus, the likelihood for a point mutation to be conservative, and less likely to cause a change in protein functionality, can be estimated.
Affiliation(s)
- Louis R Nemzer
- Department of Chemistry and Physics, Halmos College of Natural Sciences and Oceanography, Nova Southeastern University, Davie, FL, USA.
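Illustrative sketch (not taken from the paper): the abstract describes 2-bit base identifiers, 6-bit codon labels, and point mutations expressed as binary operations. The Python below shows one way this could work. The bit assignment in `BASE_BITS` is a hypothetical choice made for this example and is not necessarily the one used by the author: the high bit separates pyrimidines from purines, so a transition flips only the low bit.

```python
# Hypothetical 2-bit assignment (an assumption, not the paper's table):
# high bit 0 = pyrimidine, 1 = purine; low bit picks the base within a class.
BASE_BITS = {"C": 0b00, "U": 0b01, "A": 0b10, "G": 0b11}


def codon_label(codon):
    """Pack three 2-bit base identifiers into one 6-bit label (0..63)."""
    label = 0
    for base in codon:
        label = (label << 2) | BASE_BITS[base]
    return label


def mutation_type(a, b):
    """Classify a base substitution from the XOR of the two identifiers."""
    diff = BASE_BITS[a] ^ BASE_BITS[b]
    if diff == 0:
        return "none"
    # Only the low bit differs: the base stays within its chemical class.
    return "transition" if diff == 0b01 else "transversion"
```

With this assignment, `mutation_type("A", "G")` is a transition and `mutation_type("A", "C")` is a transversion, and the 64 codons map one-to-one onto the integers 0 to 63.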
|
18
|
Seligmann H. Natural mitochondrial proteolysis confirms transcription systematically exchanging/deleting nucleotides, peptides coded by expanded codons. J Theor Biol 2016; 414:76-90. [PMID: 27899286 DOI: 10.1016/j.jtbi.2016.11.021] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Received: 04/13/2016] [Revised: 11/11/2016] [Accepted: 11/22/2016] [Indexed: 12/19/2022]
Abstract
Protein sequences have higher linguistic complexities than human languages. This indicates undeciphered multilayered, overprinted information/genetic codes. Some superimposed genetic information is revealed by detections of transcripts systematically (a) exchanging nucleotides (nine symmetric, e.g. A<->C, fourteen asymmetric, e.g. A->C->G->A, swinger RNAs) translated according to tri-, tetra- and pentacodons, and (b) deleting mono-, dinucleotides after each trinucleotide (delRNAs). Here analyses of two independent proteomic datasets considering natural proteolysis confirm independently translation of these non-canonical RNAs, also along tetra- and pentacodons, increasing coverage of putative, cryptically encoded proteins. Analyses assuming endoproteinase GluC and elastase digestions (cleavages after residues D, E, and A, L, I, V, respectively) detect additional peptides colocalizing with detected non-canonical RNAs. Analyses detect fewer peptides matching GluC-, elastase- than trypsin-digestions: artificial trypsin-digestion outweighs natural proteolysis. Results suggest occurrences of complete proteins entirely matching non-canonical, superimposed encoding(s). Protein-coding after bijective transformations could explain genetic code symmetries, such as along Rumer's transformation.
Affiliation(s)
- Hervé Seligmann
- Unité de Recherche sur les Maladies Infectieuses et Tropicales Émergentes, Faculté de Médecine, URMITE CNRS-IRD 198 UMER 6236, IHU (Institut Hospitalo-Universitaire), Aix-Marseille University, Marseille, France.
|
19
|
Unbiased Mitoproteome Analyses Confirm Non-canonical RNA, Expanded Codon Translations. Comput Struct Biotechnol J 2016; 14:391-403. [PMID: 27830053 PMCID: PMC5094600 DOI: 10.1016/j.csbj.2016.09.004] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Received: 07/19/2016] [Revised: 09/28/2016] [Accepted: 09/29/2016] [Indexed: 01/14/2023] Open
Abstract
Proteomic MS/MS mass spectrometry detections are usually biased towards peptides cleaved by experimentally added digestion enzyme(s). Hence peptides resulting from spontaneous degradation and natural proteolysis usually remain undetected. Previous analyses of tryptic human proteome data (cleavage after K, R) detected non-canonical tryptic peptides translated according to tetra- and pentacodons (codons expanded by silent mono- and dinucleotides), and from transcripts systematically (a) deleting mono-, dinucleotides after trinucleotides (delRNAs), (b) exchanging nucleotides according to 23 bijective transformations. Nine symmetric and fourteen asymmetric nucleotide exchanges (X ↔ Y, e.g. A ↔ C; and X → Y → Z → X, e.g. A → C → G → A) produce swinger RNAs. Here unbiased reanalyses of these proteomic data detect preferentially non-canonical tryptic peptides despite assuming random cleavage. Unbiased analyses couldn't reconstruct experimental tryptic digestion if most detected non-canonical peptides were false positives. Detected non-tryptic non-canonical peptides map preferentially on corresponding, previously described non-canonical transcripts, as for tryptic non-canonical peptides. Hence unbiased analyses independently confirm previous trypsin-biased analyses that showed translations of del- and swinger RNA and expanded codons. Accounting for natural proteolysis completes trypsin-biased mitopeptidome analyses, independently confirms non-canonical transcriptions and translations.
|
20
|
Seligmann H. Natural chymotrypsin-like-cleaved human mitochondrial peptides confirm tetra-, pentacodon, non-canonical RNA translations. Biosystems 2016; 147:78-93. [PMID: 27477600 DOI: 10.1016/j.biosystems.2016.07.010] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Received: 04/10/2016] [Revised: 07/15/2016] [Accepted: 07/26/2016] [Indexed: 12/22/2022]
Abstract
Mass spectra of human mitochondrial peptides match non-canonical transcripts systematically (a) deleting mono/dinucleotides after trinucleotides (delRNA), (b) exchanging nucleotides (swinger RNA), translated according to tri, (c) tetra- and pentacodons (codons expanded by a 4th (and 5th) silent nucleotide(s)). Swinger transcriptions are 23 bijective transformations, nine symmetric (X<->Y, e.g. A<->C) and fourteen asymmetric exchanges (X->Y->Z->X, e.g. A->C->G->A). Here, proteomic analyses assuming cleavage after W,Y, F (chymotrypsin-like, for trypsinized samples) detect fewer chymotrypsinized than trypsinized peptides. Detected non-canonical peptides map preferentially on detected non-canonical RNAs for chymotrypsinized peptides, as previously found for trypsinized peptides. This suggests residual natural chymotrypsin-like digestion detectable within experimentally trypsinized peptide data. Some trypsinized peptides are detected twice, by analyses assuming trypsin, and those assuming chymotrypsin cleavages. They have higher spectra counts than peptides detected only once, meaning that abundant peptides are more frequently detected, but detection certainties resemble those for peptides detected only once. Analyses assuming 'incorrect' digestions are inadequate negative controls for digestion enzymes naturally active in biological samples. Chymotrypsin-analyses confirm non-canonical transcriptions/translations independently of results obtained assuming trypsinization, increase non-canonical peptidome coverage, indicating mitogenome-encoding of yet undetected proteins.
Affiliation(s)
- Hervé Seligmann
- Unité de Recherche sur les Maladies Infectieuses et Tropicales Émergentes, Faculté de Médecine, Université d'Aix-Marseille, URMITE CNRS-IRD 198 UMER 6236, Marseille, France.
|
21
|
Atkins JF, Loughran G, Bhatt PR, Firth AE, Baranov PV. Ribosomal frameshifting and transcriptional slippage: From genetic steganography and cryptography to adventitious use. Nucleic Acids Res 2016; 44:7007-78. [PMID: 27436286 PMCID: PMC5009743 DOI: 10.1093/nar/gkw530] [Citation(s) in RCA: 161] [Impact Index Per Article: 20.1] [Received: 03/21/2016] [Accepted: 05/26/2016] [Indexed: 12/15/2022] Open
Abstract
Genetic decoding is not ‘frozen’ as was earlier thought, but dynamic. One facet of this is frameshifting that often results in synthesis of a C-terminal region encoded by a new frame. Ribosomal frameshifting is utilized for the synthesis of additional products, for regulatory purposes and for translational ‘correction’ of problem or ‘savior’ indels. Utilization for synthesis of additional products occurs prominently in the decoding of mobile chromosomal element and viral genomes. One class of regulatory frameshifting of stable chromosomal genes governs cellular polyamine levels from yeasts to humans. In many cases of productively utilized frameshifting, the proportion of ribosomes that frameshift at a shift-prone site is enhanced by specific nascent peptide or mRNA context features. Such mRNA signals, which can be 5′ or 3′ of the shift site or both, can act by pairing with ribosomal RNA or as stem loops or pseudoknots even with one component being 4 kb 3′ from the shift site. Transcriptional realignment at slippage-prone sequences also generates productively utilized products encoded trans-frame with respect to the genomic sequence. This too can be enhanced by nucleic acid structure. Together with dynamic codon redefinition, frameshifting is one of the forms of recoding that enriches gene expression.
Affiliation(s)
- John F Atkins
- School of Biochemistry and Cell Biology, University College Cork, Cork, Ireland; School of Microbiology, University College Cork, Cork, Ireland; Department of Human Genetics, University of Utah, Salt Lake City, UT 84112, USA
- Gary Loughran
- School of Biochemistry and Cell Biology, University College Cork, Cork, Ireland
- Pramod R Bhatt
- School of Biochemistry and Cell Biology, University College Cork, Cork, Ireland
- Andrew E Firth
- Division of Virology, Department of Pathology, University of Cambridge, Hills Road, Cambridge CB2 0QQ, UK
- Pavel V Baranov
- School of Biochemistry and Cell Biology, University College Cork, Cork, Ireland
|
22
|
Chimeric mitochondrial peptides from contiguous regular and swinger RNA. Comput Struct Biotechnol J 2016; 14:283-97. [PMID: 27453772 PMCID: PMC4942731 DOI: 10.1016/j.csbj.2016.06.005] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Received: 03/08/2016] [Revised: 06/19/2016] [Accepted: 06/23/2016] [Indexed: 12/20/2022] Open
Abstract
Previous mass spectrometry analyses described human mitochondrial peptides entirely translated from swinger RNAs, RNAs where polymerization systematically exchanged nucleotides. Exchanges follow one among 23 bijective transformation rules, nine symmetric exchanges (X ↔ Y, e.g. A ↔ C) and fourteen asymmetric exchanges (X → Y → Z → X, e.g. A → C → G → A), multiplying by 24 DNA's protein coding potential. Abrupt switches from regular to swinger polymerization produce chimeric RNAs. Here, human mitochondrial proteomic analyses assuming abrupt switches between regular and swinger transcriptions, detect chimeric peptides, encoded by part regular, part swinger RNA. Contiguous regular- and swinger-encoded residues within single peptides are stronger evidence for translation of swinger RNA than previously detected, entirely swinger-encoded peptides: regular parts are positive controls matched with contiguous swinger parts, increasing confidence in results. Chimeric peptides are 200 × rarer than swinger peptides (3/100,000 versus 6/1000). Among 186 peptides with > 8 residues for each regular and swinger parts, regular parts of eleven chimeric peptides correspond to six among the thirteen recognized, mitochondrial protein-coding genes. Chimeric peptides matching partly regular proteins are rarer and less expressed than chimeric peptides matching non-coding sequences, suggesting targeted degradation of misfolded proteins. Present results strengthen hypotheses that the short mitogenome encodes far more proteins than hitherto assumed. Entirely swinger-encoded proteins could exist. Chimeric peptides are translated from contiguous regular and swinger RNA. They are 200x rarer than mitochondrial swinger peptides. Chimeric peptides integrated in regular mitochondrial proteins are downregulated. Contiguous regular parts are matched positive controls for swinger parts. The last point validates results beyond other statistical tests for robustness.
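Illustrative sketch (not taken from the paper): the counts in the abstract above (23 exchange rules, nine symmetric and fourteen asymmetric, 24 codings including regular transcription) are consistent with counting the permutations of the four nucleotides, since 4! = 24 and one of them is the identity. The Python below enumerates them under that assumption. It treats a rule as symmetric when applying it twice restores every nucleotide; the function names are hypothetical.

```python
from itertools import permutations

BASES = "ACGT"


def swinger_rules():
    """All non-identity bijections of {A, C, G, T} onto itself."""
    rules = []
    for image in permutations(BASES):
        rule = dict(zip(BASES, image))
        if any(src != dst for src, dst in rule.items()):
            rules.append(rule)
    return rules


def is_symmetric(rule):
    """True when applying the rule twice restores every nucleotide (X <-> Y)."""
    return all(rule[rule[base]] == base for base in BASES)


def apply_rule(rule, seq):
    """Rewrite a sequence nucleotide by nucleotide under one exchange rule."""
    return "".join(rule[base] for base in seq)


rules = swinger_rules()
symmetric = [r for r in rules if is_symmetric(r)]
asymmetric = [r for r in rules if not is_symmetric(r)]
print(len(rules), len(symmetric), len(asymmetric))  # 23 9 14
```

Under this reading, the nine symmetric rules are the six single swaps plus the three double swaps, and the fourteen asymmetric rules are the eight 3-cycles plus the six 4-cycles.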
|
23
|
Systematically frameshifting by deletion of every 4th or 4th and 5th nucleotides during mitochondrial transcription: RNA self-hybridization regulates delRNA expression. Biosystems 2016; 142-143:43-51. [PMID: 27018206 DOI: 10.1016/j.biosystems.2016.03.009] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Received: 01/21/2016] [Revised: 03/11/2016] [Accepted: 03/23/2016] [Indexed: 02/05/2023]
Abstract
In mitochondria, secondary structures punctuate post-transcriptional RNA processing. Recently described transcripts match the human mitogenome after systematic deletions of every 4th, respectively every 4th and 5th nucleotides, called delRNAs. Here I explore predicted stem-loop hairpin formation by delRNAs, and their associations with delRNA transcription and detected peptides matching their translation. Despite missing 25, respectively 40% of the nucleotides in the original sequence, del-transformed sequences form significantly more secondary structures than corresponding randomly shuffled sequences, indicating biological function, independently of, and in combination with, previously detected delRNA and thereof translated peptides. Self-hybridization decreases delRNA abundances, indicating downregulation. Systematic deletions of the human mitogenome reveal new, unsuspected coding and structural informations.
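Illustrative sketch (not taken from the paper): the del-transformation described above keeps each trinucleotide and drops the next one or two nucleotides. The Python below implements that rule as stated in the abstract. The function names are hypothetical, and the sketch assumes the deletion pattern starts at the first nucleotide of the input.

```python
def del_transform(seq, keep=3, skip=1):
    """Keep `keep` nucleotides, drop the next `skip`, and repeat along seq.

    skip=1 deletes every 4th nucleotide; skip=2 deletes every 4th and 5th.
    """
    step = keep + skip
    return "".join(seq[i:i + keep] for i in range(0, len(seq), step))


def removed_fraction(seq, skip=1):
    """Fraction of the original nucleotides missing after the transformation."""
    return 1 - len(del_transform(seq, skip=skip)) / len(seq)
```

For inputs whose length is a multiple of the step, `removed_fraction` gives 0.25 for `skip=1` and 0.40 for `skip=2`, matching the 25% and 40% quoted in the abstract.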
|
24
|
Gonzalez DL, Giannerini S, Rosa R. The non-power model of the genetic code: a paradigm for interpreting genomic information. Philos Trans A Math Phys Eng Sci 2016; 374:rsta.2015.0062. [PMID: 26857679 DOI: 10.1098/rsta.2015.0062] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Accepted: 10/27/2015] [Indexed: 06/05/2023]
Abstract
In this article, we present a mathematical framework based on redundant (non-power) representations of integer numbers as a paradigm for the interpretation of genomic information. The core of the approach relies on modelling the degeneracy of the genetic code. The model allows one to explain many features and symmetries of the genetic code and to uncover hidden symmetries. Also, it provides us with new tools for the analysis of genomic sequences. We review briefly three main areas: (i) the Euplotid nuclear code, (ii) the vertebrate mitochondrial code, and (iii) the main coding/decoding strategies used in the three domains of life. In every case, we show how the non-power model is a natural unified framework for describing degeneracy and deriving sound biological hypotheses on protein coding. The approach is rooted on number theory and group theory; nevertheless, we have kept the technical level to a minimum by focusing on key concepts and on the biological implications.
Affiliation(s)
- Diego Luis Gonzalez
- Dipartimento di Scienze Statistiche, Università di Bologna, Via delle Belle Arti 41, 40126 Bologna, Italy; CNR-IMM, Sezione di Bologna, Via Gobetti 101, 40129 Bologna, Italy
- Simone Giannerini
- Dipartimento di Scienze Statistiche, Università di Bologna, Via delle Belle Arti 41, 40126 Bologna, Italy
- Rodolfo Rosa
- CNR-IMM, Sezione di Bologna, Via Gobetti 101, 40129 Bologna, Italy
|
25
|
Kumar B, Saini S. Analysis of the optimality of the standard genetic code. Mol Biosyst 2016; 12:2642-51. [DOI: 10.1039/c6mb00262e] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Indexed: 11/21/2022]
Abstract
Many theories have been proposed attempting to explain the origin of the genetic code. In this work, we compare performance of the standard genetic code against millions of randomly generated codes. The accompanying figure (left panel) shows the ability of genetic codes to encode additional information and their robustness to frameshift mutations.
Affiliation(s)
- Balaji Kumar
- Department of Chemical Engineering, Indian Institute of Technology Bombay, Mumbai 400 076, India
- Supreet Saini
- Department of Chemical Engineering, Indian Institute of Technology Bombay, Mumbai 400 076, India
|
26
|
Seligmann H. Translation of mitochondrial swinger RNAs according to tri-, tetra- and pentacodons. Biosystems 2015; 140:38-48. [PMID: 26723232 DOI: 10.1016/j.biosystems.2015.11.009] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Received: 08/13/2015] [Revised: 11/08/2015] [Accepted: 11/23/2015] [Indexed: 10/22/2022]
Abstract
Transcriptomes and proteomes include RNA and protein fragments not matching regular transcription/translation. Some 'non-canonical' mitochondrial transcripts match mitogenomes after assuming one among 23 systematic exchanges between nucleotides, producing swinger RNAs (nine symmetric, X↔Y, example C↔T; 14 asymmetric, X→Y→Z→X, example A→T→G→A) in GenBank's EST database. Here, reanalyzes of (a) public human mitochondrial transcriptome data (Illumina: RNA-seq) allowed to detect mitochondrial swinger RNAs for all 23 exchanges and (b) independent public human mitochondrial trypsinized proteomic mass spectrometry data allowed to detect peptides predicted from translation of parts of swinger-transformed mitogenomes covered by detected swinger reads. RNA-seq and previous EST swinger transcript data converge. Swinger RNA translation frequently inserts various amino acids at stop codons. Swinger RNA-peptide associations exist also for peptides matching systematically frameshifting translation, peptides entirely coded by tetra- and pentacodons (regular codons expanded by silent mononucleotides at 4th, and silent dinucleotides at 4th and 5th position(s), respectively). Swinger peptides differ from regular mitochondrial proteins: not membrane embedded, reflect warmer, anaerobic, low resource conditions, reminding a free-living ancestor. Tetra- and pentacoded peptides associate with low, high GC contents, respectively, suggesting expanded codon translations associate with thermic stresses. Results confirm experimentally predicted swinger, tetra- and pentacoded mitochondrial peptides, increasing mitogenomic coding density.
Affiliation(s)
- Hervé Seligmann
- Unité de Recherche sur les Maladies Infectieuses et Tropicales Émergentes, Faculté de Médecine, URMITE CNRS-IRD 198 UMER 6236, Université de la Méditerranée, Marseille, France.
|
27
|
Seligmann H. Codon expansion and systematic transcriptional deletions produce tetra-, pentacoded mitochondrial peptides. J Theor Biol 2015; 387:154-65. [PMID: 26456204 DOI: 10.1016/j.jtbi.2015.09.030] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.2] [Received: 08/27/2015] [Revised: 09/28/2015] [Accepted: 09/30/2015] [Indexed: 11/28/2022]
Abstract
Genes include occasionally isolated codons with a fourth (and fifth) silent nucleotide(s). Assuming tetracodons, translated hypothetical peptides align with regular GenBank proteins; predicted tetracodons coevolve with predicted tRNAs with expanded anticodons in each mammal, Drosophila and Lepidosauria mitogenomes, GC contents and with lepidosaurian body temperatures, suggesting that expanded codons are an adaptation of translation to high temperature. Hypothetically, continuous stretches of tetra- and pentacodons code for peptides. Both systematic nucleotide deletions during transcription, and translation by tRNAs with expanded anticodons could produce these peptides. Reanalyses of human nanoLc mass spectrometry peptidome data detect numerous tetra- and pentapeptides translated from the human mitogenome. These map preferentially on (BLAST-detected) human RNAs matching the human mitogenome, assuming systematic mono- and dinucleotide deletions after each third nucleotide (delRNAs). Translation by expanded anticodons is incompatible with silent nucleotides in the midst rather than at codon 3' extremity. More than 1/3 of detected tetra- and pentapeptides assume silent positions at codon extremity, suggesting that both mechanisms, regular translation of delRNAs and translation of regular RNAs by expanded anticodons, produce this peptide subgroup. Results show that systematically deleting polymerization occurs, and confirm serial translation of expanded codons. Non-canonical transcriptions and translations considerably expand the coding potential of DNA and RNA sequences.
Affiliation(s)
- Hervé Seligmann
- Unité de Recherche sur les Maladies Infectieuses et Tropicales Émergentes, Faculté de Médecine, URMITE CNRS-IRD 198 UMER 6236, Université de la Méditerranée, 13385 Marseille, France.
|
28
|
Limitations of the ‘ambush hypothesis’ at the single-gene scale: what codon biases are to blame? Mol Genet Genomics 2014; 290:493-504. [DOI: 10.1007/s00438-014-0937-y] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Received: 08/18/2014] [Accepted: 10/01/2014] [Indexed: 10/24/2022]
|
29
|
RNA editing and modifications of RNAs might have favoured the evolution of the triplet genetic code from an ennuplet code. J Theor Biol 2014; 359:1-5. [DOI: 10.1016/j.jtbi.2014.05.037] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Received: 01/22/2014] [Revised: 05/21/2014] [Accepted: 05/27/2014] [Indexed: 11/24/2022]
|
30
|
Seligmann H, Labra A. Tetracoding increases with body temperature in Lepidosauria. Biosystems 2013; 114:155-63. [DOI: 10.1016/j.biosystems.2013.09.002] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Received: 07/11/2013] [Revised: 09/04/2013] [Accepted: 09/05/2013] [Indexed: 10/26/2022]
|
31
|
The protein invasion: a broad review on the origin of the translational system. J Mol Evol 2013; 77:185-96. [PMID: 24145863 DOI: 10.1007/s00239-013-9592-x] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Received: 06/18/2013] [Accepted: 10/12/2013] [Indexed: 12/25/2022]
Abstract
Translation, coded peptide synthesis, arguably exists at the heart of modern cellular life. By orchestrating an incredibly complex interaction between tRNAs, mRNAs, aaRSs, the ribosome, and numerous other small molecules, the translational system allows the interpretation of data in the form of DNA to create massively complex proteins which control and enact almost every cellular function. A natural question then, is how did this system evolve? Here we present a broad review of the existing theories of the last two decades on the origin of the translational system. We attempt to synthesize the wide variety of ideas as well as organize them into modular components, addressing the evolution of the peptide-RNA interaction, tRNA, mRNA, the ribosome, and the first proteins separately. We hope to provide both a comprehensive overview of the literature as well as a framework for future discussions and novel theories.
|
32
|
Seligmann H. Systematic asymmetric nucleotide exchanges produce human mitochondrial RNAs cryptically encoding for overlapping protein coding genes. J Theor Biol 2013; 324:1-20. [PMID: 23416187 DOI: 10.1016/j.jtbi.2013.01.024] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.5] [Received: 11/08/2012] [Revised: 01/26/2013] [Accepted: 01/28/2013] [Indexed: 11/19/2022]
Abstract
GenBank's EST database includes RNAs matching exactly human mitochondrial sequences assuming systematic asymmetric nucleotide exchange-transcription along exchange rules: A→G→C→U/T→A (12 ESTs), A→U/T→C→G→A (4 ESTs), C→G→U/T→C (3 ESTs), and A→C→G→U/T→A (1 EST), no RNAs correspond to other potential asymmetric exchange rules. Hypothetical polypeptides translated from nucleotide-exchanged human mitochondrial protein coding genes align with numerous GenBank proteins, predicted secondary structures resemble their putative GenBank homologue's. Two independent methods designed to detect overlapping genes (one based on nucleotide contents analyses in relation to replicative deamination gradients at third codon positions, and circular code analyses of codon contents based on frame redundancy), confirm nucleotide-exchange-encrypted overlapping genes. Methods converge on which genes are most probably active, and which not, and this for the various exchange rules. Mean EST lengths produced by different nucleotide exchanges are proportional to (a) extents that various bioinformatics analyses confirm the protein coding status of putative overlapping genes; (b) known kinetic chemistry parameters of the corresponding nucleotide substitutions by the human mitochondrial DNA polymerase gamma (nucleotide DNA misinsertion rates); (c) stop codon densities in predicted overlapping genes (stop codon readthrough and exchanging polymerization regulate gene expression by counterbalancing each other). Numerous rarely expressed proteins seem encoded within regular mitochondrial genes through asymmetric nucleotide exchange, avoiding lengthening genomes. Intersecting evidence between several independent approaches confirms the working hypothesis status of gene encryption by systematic nucleotide exchanges.
Affiliation(s)
- Hervé Seligmann
- National Natural History Museum Collections, The Hebrew University of Jerusalem, 91904 Jerusalem, Israel.
|
33
|
Putative mitochondrial polypeptides coded by expanded quadruplet codons, decoded by antisense tRNAs with unusual anticodons. Biosystems 2012; 110:84-106. [DOI: 10.1016/j.biosystems.2012.09.002] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.3] [Received: 04/18/2012] [Revised: 09/20/2012] [Accepted: 09/26/2012] [Indexed: 11/19/2022]
|
34
|
Grazziotin FG, Zaher H, Murphy RW, Scrocchi G, Benavides MA, Zhang YP, Bonatto SL. Molecular phylogeny of the New World Dipsadidae (Serpentes: Colubroidea): a reappraisal. Cladistics 2012; 28:437-459. [DOI: 10.1111/j.1096-0031.2012.00393.x] [Citation(s) in RCA: 82] [Impact Index Per Article: 6.8] [Indexed: 11/26/2022] Open
|
35
|
Error compensation of tRNA misacylation by codon-anticodon mismatch prevents translational amino acid misinsertion. Comput Biol Chem 2011; 35:81-95. [PMID: 21470914 DOI: 10.1016/j.compbiolchem.2011.03.001] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.7] [Received: 04/15/2010] [Revised: 02/22/2011] [Accepted: 03/01/2011] [Indexed: 11/20/2022]
Abstract
Codon-anticodon mismatches and tRNA misloadings cause translational amino acid misinsertions, producing dysfunctional proteins. Here I explore the original hypothesis whether mismatches tend to compensate misacylation, so as to insert the amino acid coded by the codon. This error compensation is promoted by the fact that codon-anticodon mismatch stabilities increase with tRNA misacylation potentials (predicted by 'tfam') by non-cognate amino acids coded by the mismatched codons for most tRNAs examined. Error compensation is independent of preferential misacylation by non-cognate amino acids physico-chemically similar to cognate amino acids, a phenomenon that decreases misinsertion impacts. Error compensation correlates negatively with (a) codon/anticodon abundance (in human mitochondria and Escherichia coli); (b) developmental instability (estimated by fluctuating asymmetry in bilateral counts of subdigital lamellae, in each of two lizard genera, Anolis and Sceloporus); and (c) pathogenicity of human mitochondrial tRNA polymorphisms. Patterns described here suggest that tRNA misacylation is sometimes compensated by codon-anticodon mismatches. Hence translation inserts the amino acid coded by the mismatched codon, despite mismatch and misloading. Results suggest that this phenomenon is sufficiently important to affect whole organism phenotypes, as shown by correlations with pathologies and morphological estimates of developmental stability.
|
36
|
Ma W. The scenario on the origin of translation in the RNA world: in principle of replication parsimony. Biol Direct 2010; 5:65. [PMID: 21110883 PMCID: PMC3002371 DOI: 10.1186/1745-6150-5-65] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Received: 11/11/2010] [Accepted: 11/27/2010] [Indexed: 01/06/2023] Open
Abstract
Background: It is now believed that in the origin of life, proteins should have been "invented" in an RNA world. However, due to the complexity of a possible RNA-based proto-translation system, this evolving process seems quite complicated and the associated scenario remains very blurry. Considering that RNA can bind amino acids with specificity, it has been reasonably supposed that initial peptides might have been synthesized on "RNA templates" containing multiple amino acid binding sites. This "Direct RNA Template (DRT)" mechanism is attractive because it should be the simplest mechanism for RNA to synthesize peptides, thus very likely to have been adopted initially in the RNA world. Then, how this mechanism could develop into a proto-translation system mechanism is an interesting problem.
Presentation of the hypothesis: Here an explanation to this problem is shown considering the principle of "replication parsimony" --- genetic information tends to be utilized in a parsimonious way under selection pressure, due to its replication cost (e.g., in the RNA world, nucleotides and ribozymes for RNA replication). Because a DRT would be quite long even for a short peptide, its replication cost would be great. Thus the diversity and the length of functional peptides synthesized by the DRT mechanism would be seriously limited. Adaptors (proto-tRNAs) would arise to allow a DRT's complementary strand (called "C-DRT" here) to direct the synthesis of the same peptide synthesized by the DRT itself. Because the C-DRT is a necessary part in the DRT's replication, fewer turns of the DRT's replication would be needed to synthesize definite copies of the functional peptide, thus saving the replication cost. Acting through adaptors, C-DRTs could transform into much shorter templates (called "proto-mRNAs" here) and substitute the role of DRTs, thus significantly saving the replication cost. A proto-rRNA corresponding to the small subunit rRNA would then emerge to aid the binding of proto-tRNAs and proto-mRNAs, allowing the reduction of base pairs between them (ultimately resulting in the triplet anticodon/codon pair), thus further saving the replication cost. In this context, the replication cost saved would allow the appearance of more and longer functional peptides and, finally, proteins. The hypothesis could be called "DRT-RP" ("RP" for "replication parsimony").
Testing the hypothesis: The scenario described here is open for experimental work at some key scenes, including the compact DRT mechanism, the development of adaptors from aa-aptamers, the synthesis of peptides by proto-tRNAs and proto-mRNAs without the participation of proto-rRNAs, etc. Interestingly, a recent computer simulation study has demonstrated the plausibility of one of the evolving processes driven by replication parsimony in the scenario.
Implication of the hypothesis: An RNA-based proto-translation system could arise gradually from the DRT mechanism according to the principle of "replication parsimony" --- to save the replication cost of RNA templates for functional peptides. A surprising side deduction along the logic of the hypothesis is that complex, biosynthetic amino acids might have entered the genetic code earlier than simple, prebiotic amino acids, which is opposite to the common sense. Overall, the present discussion clarifies the blurry scenario concerning the origin of translation with a major clue, which shows vividly how life could "manage" to exploit potential chemical resources in nature, eventually in an efficient way over evolution.
Reviewers: This article was reviewed by Eugene V. Koonin, Juergen Brosius, and Arcady Mushegian.
Affiliation(s)
- Wentao Ma
- College of Life Sciences, Wuhan University, Wuhan 430072, PR China.
|
37
|
Tlusty T. A colorful origin for the genetic code: Information theory, statistical mechanics and the emergence of molecular codes. Phys Life Rev 2010; 7:362-76. [DOI: 10.1016/j.plrev.2010.06.002] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.2] [Received: 09/25/2009] [Revised: 01/25/2010] [Accepted: 02/06/2010] [Indexed: 10/19/2022]
|
38
|
Bernhardt HS, Tate WP. The transition from noncoded to coded protein synthesis: did coding mRNAs arise from stability-enhancing binding partners to tRNA? Biol Direct 2010; 5:16. [PMID: 20377916 PMCID: PMC2859854 DOI: 10.1186/1745-6150-5-16] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.2] [Received: 03/20/2010] [Accepted: 04/09/2010] [Indexed: 02/01/2023] Open
Abstract
BACKGROUND: Understanding the origin of protein synthesis has been notoriously difficult. We have taken as a starting premise Wolf and Koonin's view that "evolution of the translation system is envisaged to occur in a compartmentalized ensemble of replicating, co-selected RNA segments, i.e., in an RNA world containing ribozymes with versatile activities".

PRESENTATION OF THE HYPOTHESIS: We propose that coded protein synthesis arose from a noncoded process in an RNA world as a natural consequence of the accumulation of a range of early tRNAs and their serendipitous RNA binding partners. We propose that, initially, RNA molecules with 3' CCA termini that could be aminoacylated by ribozymes, together with an ancestral peptidyl transferase ribozyme, produced small peptides with random or repetitive sequences. Our concept is that the first tRNA arose in this context from the ligation of two RNA hairpins and could be similarly aminoacylated at its 3' end to become a substrate for peptidyl transfer catalyzed by the ancestral ribozyme. Within this RNA world we hypothesize that proto-mRNAs appeared first simply as serendipitous binding partners, forming complementary base pair interactions with the anticodon loops of tRNA pairs. Initially this may have enhanced stability of the paired tRNA molecules so they were held together in close proximity, better positioning the 3' CCA termini for peptidyl transfer and enhancing the rate of peptide synthesis. If there were a selective advantage for the ensemble through the peptide products synthesized, it would provide a natural pathway for the evolution of a coding system with the expansion of a cohort of different tRNAs and their binding partners. The whole process could have occurred quite unremarkably for such a profound acquisition.

TESTING THE HYPOTHESIS: It should be possible to test the different parts of our model using the isolated contemporary 50S ribosomal subunit initially, and then with RNAs transcribed in vitro together with a minimal set of ribosomal proteins that are required today to support protein synthesis.

IMPLICATIONS OF THE HYPOTHESIS: This model proposes that genetic coding arose de novo from complementary base pair interactions between tRNAs and single-stranded RNAs present in the immediate environment.

REVIEWERS: This article was reviewed by Eugene Koonin, Rob Knight and Berthold Kastner (nominated by Laura Landweber).
Affiliation(s)
- Harold Stephen Bernhardt
- Department of Biochemistry, Otago School of Medical Sciences, University of Otago, Dunedin, New Zealand.
|