1
|
Gornik SG, Flores V, Reinhardt F, Erber L, Salas-Leiva DE, Douvropoulou O, Lassadi I, Einarsson E, Mörl M, Git A, Stadler PF, Pain A, Waller RF. Mitochondrial Genomes in Perkinsus Decode Conserved Frameshifts in All Genes. Mol Biol Evol 2022; 39:6701636. [PMID: 36108082 PMCID: PMC9550989 DOI: 10.1093/molbev/msac191] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Mitochondrial genomes of apicomplexans, dinoflagellates, and chrompodellids that collectively make up the Myzozoa, encode only three proteins (Cytochrome b [COB], Cytochrome c oxidase subunit 1 [COX1], Cytochrome c oxidase subunit 3 [COX3]), contain fragmented ribosomal RNAs, and display extensive recombination, RNA trans-splicing, and RNA-editing. The early-diverging Perkinsozoa is the final major myzozoan lineage whose mitochondrial genomes remained poorly characterized. Previous reports of Perkinsus genes indicated independent acquisition of non-canonical features, namely the occurrence of multiple frameshifts. To determine both ancestral myzozoan and novel perkinsozoan mitochondrial genome features, we sequenced and assembled mitochondrial genomes of four Perkinsus species. These data show a simple ancestral genome with the common reduced coding capacity but disposition for rearrangement. We identified 75 frameshifts across the four species that occur as distinct types and that are highly conserved in gene location. A decoding mechanism apparently employs unused codons at the frameshift sites that advance translation either +1 or +2 frames to the next used codon. The locations of frameshifts are seemingly positioned to regulate protein folding of the nascent protein as it emerges from the ribosome. The cox3 gene is distinct in containing only one frameshift and showing strong selection against residues that are otherwise frequently encoded at the frameshift positions in cox1 and cob. All genes lack cysteine codons implying a reduction to 19 amino acids in these genomes. Furthermore, mitochondrion-encoded rRNA fragment complements are incomplete in Perkinsus spp. but some are found in the nuclear DNA suggesting import into the organelle. Perkinsus demonstrates further remarkable trajectories of organelle genome evolution including pervasive integration of frameshift translation into genome expression.
Collapse
Affiliation(s)
| | - Victor Flores
- Department of Biochemistry, University of Cambridge, Hopkins Building, Downing Site, Tennis Court Road, Cambridge CB2 1QW, United Kingdom
| | - Franziska Reinhardt
- Bioinformatics Group, Department of Computer Science and Interdisciplinary Center for Bioinformatics, Leipzig University, Härtelstraße 16-18, 04107 Leipzig, Germany
| | - Lieselotte Erber
- Institute for Biochemistry, Leipzig University, Brüderstr. 34, 04103 Leipzig, Germany
| | - Dayana E Salas-Leiva
- Department of Biochemistry, University of Cambridge, Hopkins Building, Downing Site, Tennis Court Road, Cambridge CB2 1QW, United Kingdom
| | - Olga Douvropoulou
- Pathogen Genomics Group, Biological and Environmental Science and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Kingdom of Saudi Arabia
| | - Imen Lassadi
- Department of Biochemistry, University of Cambridge, Hopkins Building, Downing Site, Tennis Court Road, Cambridge CB2 1QW, United Kingdom
| | - Elin Einarsson
- Department of Biochemistry, University of Cambridge, Hopkins Building, Downing Site, Tennis Court Road, Cambridge CB2 1QW, United Kingdom
| | - Mario Mörl
- Institute for Biochemistry, Leipzig University, Brüderstr. 34, 04103 Leipzig, Germany
| | - Anna Git
- Department of Biochemistry, University of Cambridge, Hopkins Building, Downing Site, Tennis Court Road, Cambridge CB2 1QW, United Kingdom
| | - Peter F Stadler
- Bioinformatics Group, Department of Computer Science and Interdisciplinary Center for Bioinformatics, Leipzig University, Härtelstraße 16-18, 04107 Leipzig, Germany,Discrete Biomathematics, Max Planck Institute for Mathematics in the Sciences, 04103 Leipzig, Germany,Theoretical Biochemistry Group, Institute for Theoretical Chemistry, University of Vienna, Währinger Str. 17, Alsergrund, Vienna 1090, Austria,Santa Fe Institute, 1399 Hyde Park Road, Santa Fe, NM 87501, USA
| | - Arnab Pain
- Pathogen Genomics Group, Biological and Environmental Science and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Kingdom of Saudi Arabia,International Institute for Zoonosis Control, Hokkaido University, 001-0020 North 20, West 10 Kita-ku, Sapporo 001-0020, Japan
| | | |
Collapse
|
2
|
Demongeot J, Moreira A, Seligmann H. Negative CG dinucleotide bias: An explanation based on feedback loops between Arginine codon assignments and theoretical minimal RNA rings. Bioessays 2020; 43:e2000071. [PMID: 33319381 DOI: 10.1002/bies.202000071] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2020] [Revised: 11/23/2020] [Accepted: 11/26/2020] [Indexed: 01/05/2023]
Abstract
Theoretical minimal RNA rings are candidate primordial genes evolved for non-redundant coding of the genetic code's 22 coding signals (one codon per biogenic amino acid, a start and a stop codon) over the shortest possible length: 29520 22-nucleotide-long RNA rings solve this min-max constraint. Numerous RNA ring properties are reminiscent of natural genes. Here we present analyses showing that all RNA rings lack dinucleotide CG (a mutable, chemically instable dinucleotide coding for Arginine), bearing a resemblance to known CG-depleted genomes. CG in "incomplete" RNA rings (not coding for all coding signals, with only 3-12 nucleotides) gradually decreases towards CG absence in complete, 22-nucleotide-long RNA rings. Presumably, feedback loops during RNA ring growth during evolution (when amino acid assignment fixed the genetic code) assigned Arg to codons lacking CG (AGR) to avoid CG. Hence, as a chemical property of base pairs, CG mutability restructured the genetic code, thereby establishing itself as genetically encoded biological information.
Collapse
Affiliation(s)
- Jacques Demongeot
- Laboratory AGEIS EA 7407, Team Tools for e-Gnosis Medical & Labcom CNRS/UGA/OrangeLabs Telecom4Health, Faculty of Medicine, Université Grenoble Alpes, La Tronche, France
| | - Andrés Moreira
- Departamento de Informática, Universidad Técnica Federico Santa María, Santiago, Chile
| | - Hervé Seligmann
- Laboratory AGEIS EA 7407, Team Tools for e-Gnosis Medical & Labcom CNRS/UGA/OrangeLabs Telecom4Health, Faculty of Medicine, Université Grenoble Alpes, La Tronche, France.,The National Natural History Collections, The Hebrew University of Jerusalem, Jerusalem, Israel.,Institute of Microstructure Technology, Karlsruhe Institute of Technology (KIT), Eggenstein-Leopoldshafen, Germany
| |
Collapse
|
3
|
Demongeot J, Seligmann H. Codon assignment evolvability in theoretical minimal RNA rings. Gene 2020; 769:145208. [PMID: 33031892 DOI: 10.1016/j.gene.2020.145208] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2020] [Revised: 09/28/2020] [Accepted: 09/29/2020] [Indexed: 12/28/2022]
Abstract
Genetic code codon-amino acid assignments evolve for 15 (AAA, AGA, AGG, ATA, CGG, CTA, CTG. CTC, CTT, TAA, TAG, TCA, TCG, TGA and TTA (GNN codons notably absent)) among 64 codons (23.4%) across the 31 genetic codes (NCBI list completed with recently suggested green algal mitochondrial genetic codes). Their usage in 25 theoretical minimal RNA rings is examined. RNA rings are designed in silico to code once over the shortest length for all 22 coding signals (start and stop codons and each amino acid according to the standard genetic code). Though designed along coding constraints, RNA rings resemble ancestral tRNA loops, assigning to each RNA ring a putative anticodon, a cognate amino acid and an evolutionary genetic code integration rank for that cognate amino acid. Analyses here show 1. biases against/for evolvable codons in the two first vs last thirds of RNA ring coding sequences, 2. RNA rings with evolvable codons have recent cognates, and 3. evolvable codon and cytosine numbers in RNA ring compositions are positively correlated. Applying alternative genetic codes to RNA rings designed for nonredundant coding according to the standard genetic code reveals unsuspected properties of the standard genetic code and of RNA rings, notably on codon assignment evolvability and the special role of cytosine in relation to codon assignment evolvability and of the genetic code's coding structure.
Collapse
Affiliation(s)
- Jacques Demongeot
- Université Grenoble Alpes, Faculty of Medicine, Laboratory AGEIS EA 7407, Team Tools for e-Gnosis Medical, F-38700 La Tronche, France
| | - Hervé Seligmann
- The National Natural History Collections, The Hebrew University of Jerusalem, 91404 Jerusalem, Israel.
| |
Collapse
|
4
|
Seligmann H, Warthi G. Natural pyrrolysine-biased translation of stop codons in mitochondrial peptides entirely coded by expanded codons. Biosystems 2020; 196:104180. [PMID: 32534170 DOI: 10.1016/j.biosystems.2020.104180] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2020] [Revised: 06/02/2020] [Accepted: 06/02/2020] [Indexed: 12/31/2022]
Abstract
During the noncanonical deletion transcription, k nucleotides are systematically skipped/deleted after each transcribed trinucleotide producing deletion-RNAs (delRNAs). Peptides matching delRNAs either result from (a) canonical translation of delRNAs; or (b) noncanonical translation of regular transcripts along expanded codons. Only along frame "0" (start site) (a) and (b) produce identical peptides. Here, mitochondrial mass spectrometry data analyses assume expanded codon/del-transcription with 3 + k (k from 0 to 12) nucleotides. Detected peptides map preferentially on previously identified delRNAs. More peptides were detected for k (1-12) when del-transcriptional and expanded codon translations start sites coincide (i.e. the 0th frame) than for frames +1 or +2. Hence, both (a) and (b) produced peptides identified here. Biases for frame 0 decrease for k > 2, reflecting codon/anticodon expansion limits. Further analyses find preferential pyrrolysine insertion at stop codons, suggesting Pyl-specific mitochondrial suppressor tRNAs loaded by Pyl-specific tRNA synthetases with unknown origins. Pyl biases at stops are stronger for regular than expanded codons suggesting that Pyl-tRNAs are less competitive with near-cognate tRNAs in expanded codon contexts. Statistical biases for these findings exclude that detected peptides are experimental and/or bioinformatic artefacts implying both del-transcription and expanded codons translation occur in human mitochondria.
Collapse
Affiliation(s)
- Hervé Seligmann
- The National Natural History Collections, The Hebrew University of Jerusalem, 91404, Jerusalem, Israel; Université Grenoble Alpes, Faculty of Medicine, Laboratory AGEIS EA 7407, Team Tools for e-Gnosis Medical, F-38700, La Tronche, France.
| | - Ganesh Warthi
- Aix-Marseille University, IRD, VITROME, Institut Hospitalo-Universitaire Méditerranée-Infection, Marseille, France.
| |
Collapse
|
5
|
Seligmann H. First arrived, first served: competition between codons for codon-amino acid stereochemical interactions determined early genetic code assignments. Naturwissenschaften 2020; 107:20. [PMID: 32367155 DOI: 10.1007/s00114-020-01676-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2019] [Revised: 03/10/2020] [Accepted: 04/05/2020] [Indexed: 12/12/2022]
Abstract
Stereochemical nucleotide-amino acid interactions, in the form of noncovalent nucleotide-amino acid interactions, potentially produced the genetic code's codon-amino acid assignments. Empirical estimates of single nucleotide-amino acid affinities on surfaces and in solution are used to test whether trinucleotide-amino acid affinities determined genetic code assignments pending the principle "first arrived, first served": presumed early amino acids have greater codon-amino acid affinities than ulterior ones. Here, these single nucleotide affinities are used to approximate all 64 × 20 trinucleotide-amino acid affinities. Analyses show that (1) on surfaces, genetic code codon-amino acid assignments tend to match high affinities for the amino acids that integrated earliest the genetic code (according to Wong's metabolic coevolution hypothesis between nucleotides and amino acids) and (2) in solution, the same principle holds for the anticodon-amino acid assignments. Affinity analyses match best genetic code assignments when assuming that trinucleotides competed for amino acids, rather than amino acids for trinucleotides. Codon-amino acid affinities stick better to genetic code assignments than anticodon-amino acid affinities. Presumably, two independent coding systems, on surfaces and in solution, converged, and formed the current translation system. Proto-translation on surfaces by direct codon-amino acid interactions without tRNA-like adaptors coadapted with a system emerging in solution by proto-tRNA anticodon-amino acid interactions. These systems assigned identical or similar cognates to codons on surfaces and to anticodons in solution. Results indicate that a prebiotic metabolism predated genetic code self-organization.
Collapse
Affiliation(s)
- Hervé Seligmann
- The National Natural History Collections, The Hebrew University of Jerusalem, 91904, Jerusalem, Israel. .,Faculty of Medicine, Université Grenoble Alpes, Laboratory AGEIS EA 7407, Team Tools for e-Gnosis Medical & Labcom CNRS/UGA/OrangeLabs Telecoms4Health, F-38700, La Tronche, France.
| |
Collapse
|
6
|
Demongeot J, Seligmann H. Deamination gradients within codons after 1<->2 position swap predict amino acid hydrophobicity and parallel β-sheet conformational preference. Biosystems 2020; 191-192:104116. [PMID: 32081715 DOI: 10.1016/j.biosystems.2020.104116] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2019] [Revised: 12/04/2019] [Accepted: 02/10/2020] [Indexed: 12/16/2022]
Abstract
Deaminations C->T and A->G are frequent mutations producing nucleotide content gradients across genomes proportional to singlestrandedness during replication/transcription. Hence, within single codons, deamination risks increase from first to third codon positions, while second codon positions are functionally most crucial. Here genetic codes are analyzed assuming that after anticodons protected codons from deaminations, first and second codon positions swapped (N2N1N3->N1N2N3), with lowest deamination risks for N2 in presumed primitive N2N1N3 codons. N2N1N3, not standard N1N2N3, codon structure minimizes deaminations inversely proportionally to cognate amino acid hydrophobicity and parallel betasheet conformational preference. For N1N2N3, deamination minimization increases with genetic code integration order of cognate amino acids: during the presumed N2N1N3->N1N2N3 codon structure transition, protein synthesis combined direct codon-amino acid interactions for late amino acids and tRNA-based translation for early amino acids. Hence N2N1N3 codons would correspond to tRNA-free translation by spontaneous codon-amino acid affinities, and tRNA-mediated translation presumably caused N2N1N3->N1N2N3 swaps. Results show that rational, not arbitrary rules link codon and amino acid structures. Some analyses detect mitochondrial RNAs and peptides in public data corresponding to systematic position swaps, suggesting occasional swapping polymerase activity.
Collapse
Affiliation(s)
- Jacques Demongeot
- Université Grenoble Alpes, Faculty of Medicine, Laboratory AGEIS EA 7407, Team Tools for e-Gnosis Medical, F-38700, La Tronche, France.
| | - Hervé Seligmann
- Université Grenoble Alpes, Faculty of Medicine, Laboratory AGEIS EA 7407, Team Tools for e-Gnosis Medical, F-38700, La Tronche, France; The National Natural History Collections, The Hebrew University of Jerusalem, 91404, Jerusalem, Israel.
| |
Collapse
|
7
|
Pentamers with Non-redundant Frames: Bias for Natural Circular Code Codons. J Mol Evol 2020; 88:194-201. [DOI: 10.1007/s00239-019-09925-0] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2019] [Accepted: 12/17/2019] [Indexed: 02/06/2023]
|
8
|
Seligmann H, Warthi G. Chimeric Translation for Mitochondrial Peptides: Regular and Expanded Codons. Comput Struct Biotechnol J 2019; 17:1195-1202. [PMID: 31534643 PMCID: PMC6742854 DOI: 10.1016/j.csbj.2019.08.006] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2019] [Revised: 08/19/2019] [Accepted: 08/21/2019] [Indexed: 02/07/2023] Open
Abstract
Frameshifting protein translation occasionally results from insertion of amino acids at isolated mono- or dinucleotide-expanded codons by tRNAs with expanded anticodons. Previous analyses of two different types of human mitochondrial MS proteomic data (Fisher and Waters technologies) detect peptides entirely corresponding to expanded codon translation. Here, these proteomic data are reanalyzed searching for peptides consisting of at least eight consecutive amino acids translated according to regular tricodons, and at least eight adjacent consecutive amino acids translated according to expanded codons. Both datasets include chimerically translated peptides (mono- and dinucleotide expansions, 42 and 37, respectively). The regular tricodon-encoded part of some chimeric peptides corresponds to standard human mitochondrial proteins (mono- and dinucleotide expansions, six (AT6, CytB, ND1, 2xND2, ND5) and one (ND1), respectively). Chimeric translation probably increases the diversity of mitogenome-encoded proteins, putatively producing functional proteins. These might result from translation by tRNAs with expanded anticodons, or from regular tricodon translation of RNAs where transcription/posttranscriptional edition systematically deleted mono- or dinucleotides after each trinucleotide. The pairwise matched combination of adjacent peptide parts translated from regular and expanded codons strengthens the hypothesis that translation of stretches of consecutive expanded codons occurs. Results indicate statistical translation producing distributions of alternative proteins. Genetic engineering should account for potential unexpected, unwanted secondary products.
Collapse
Affiliation(s)
- Hervé Seligmann
- The National Natural History Collections, The Hebrew University of Jerusalem, 91404 Jerusalem, Israel
| | - Ganesh Warthi
- Aix-Marseille University, IRD, VITROME, Institut Hospitalo-Universitaire Méditerranée-Infection, Marseille, France
| |
Collapse
|
9
|
Demongeot J, Seligmann H. Theoretical minimal RNA rings designed according to coding constraints mimic deamination gradients. THE SCIENCE OF NATURE - NATURWISSENSCHAFTEN 2019; 106:44. [DOI: 10.1007/s00114-019-1638-5] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/15/2018] [Revised: 06/18/2019] [Accepted: 06/19/2019] [Indexed: 11/27/2022]
|
10
|
Seligmann H. Localized Context-Dependent Effects of the "Ambush" Hypothesis: More Off-Frame Stop Codons Downstream of Shifty Codons. DNA Cell Biol 2019; 38:786-795. [PMID: 31157984 DOI: 10.1089/dna.2019.4725] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
The ambush hypothesis speculates that off-frame stop codons increase translational efficiency after ribosomal frameshifts by stopping early frameshifted translation. Some evidences fit this hypothesis: (1) synonymous codon usages increase with their potential contribution to off-frame stops; (2) the genetic code assigns frequent amino acids to codon families contributing to off-frame stops; (3) positive biases for off-frame stops (AT rich) occur despite adverse nucleotide (GC) biases; and (4) mitochondrial off-frame stop codon densities increase with ribosomal structural instability, potential proxy of frameshift frequencies. In this study, analyses of vertebrate mitogenes and tRNA synthetase genes from all superkingdoms and viruses test a new prediction of the ambush hypothesis: sequences immediately downstream of frameshift-inducing homopolymer codons (AAA, CCC, GGG, and TTT) are off-frame stop rich. Codons immediately downstream of homopolymer codons form more than average off-frame stops, biases are stronger than for corresponding upstream distances and for any other group of synonymous codons. Sequences downstream of that high-density region are off-frame stop depleted. This decrease suggests that off-frame stops, combined with suppressor tRNAs regulate translation of overlapping coding sequences. Results show the predictive power of the ambush hypothesis, from macroevolutionary (genetic code structure) to detailed gene sequence anatomy.
Collapse
Affiliation(s)
- Hervé Seligmann
- The National Natural History Collections, The Hebrew University of Jerusalem, Jerusalem, Israel
| |
Collapse
|
11
|
Demongeot J, Seligmann H. More Pieces of Ancient than Recent Theoretical Minimal Proto-tRNA-Like RNA Rings in Genes Coding for tRNA Synthetases. J Mol Evol 2019; 87:152-174. [DOI: 10.1007/s00239-019-09892-6] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2018] [Accepted: 03/22/2019] [Indexed: 12/19/2022]
|
12
|
Abstract
INTRODUCTION Small open reading frames (sORFs) with potential protein-coding capacity have been disclosed in various transcripts, including long noncoding RNAs (LncRNAs), mRNAs (5'-upstream, coding domain, and 3'-downstream), circular RNAs, pri-miRNAs, and ribosomal RNAs (rRNAs). Recent characterization of several sORF-encoded peptides (SEPs or micropeptides) revealed their important roles in many fundamental biological processes in a broad range of species from yeast to human. The success in the mining of micropeptides attributes to the advanced bioinformatics and high-throughput sequencing techniques. Areas covered: sORFs and SEPs were overlooked for their tiny size and the difficulty of identification by bioinformatics analyses. With more and more sORFs and SEPs have been identified, this field has attracted more attention. This review covers recent advances in the strategies for the detection and identification of sORFs and SEPs. Expert commentary: The advantages and drawbacks of the strategies for detection and identification of sORFs and SEPs are discussed, as well as the techniques that are used to decipher the roles of micropeptides in organisms are described.
Collapse
Affiliation(s)
- Xinqiang Yin
- a The Engineering Research Center of Synthetic Polypeptide Drug Discovery and Evaluation of Jiangsu Province , China Pharmaceutical University , Nanjing , China.,b The Basic Medical School , North Sichuan Medical College , Nanchong , China
| | - Yuanyuan Jing
- c Department of Preventive Medicine , North Sichuan Medical College , Nanchong , China
| | - Hanmei Xu
- a The Engineering Research Center of Synthetic Polypeptide Drug Discovery and Evaluation of Jiangsu Province , China Pharmaceutical University , Nanjing , China.,d State Key Laboratory of Natural Medicines, Ministry of Education , China Pharmaceutical University , Nanjing , China
| |
Collapse
|
13
|
Alignment-based and alignment-free methods converge with experimental data on amino acids coded by stop codons at split between nuclear and mitochondrial genetic codes. Biosystems 2018; 167:33-46. [DOI: 10.1016/j.biosystems.2018.03.002] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2018] [Revised: 03/18/2018] [Accepted: 03/19/2018] [Indexed: 12/11/2022]
|
14
|
Bijective codon transformations show genetic code symmetries centered on cytosine's coding properties. Theory Biosci 2017; 137:17-31. [PMID: 29147851 DOI: 10.1007/s12064-017-0258-x] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2017] [Accepted: 11/13/2017] [Indexed: 12/11/2022]
Abstract
Homology of some RNAs with template DNA requires systematic exchanges between nucleotides. Such exchanges produce 'swinger' RNA along 23 bijective transformations (nine symmetric, X ↔ Y; and 14 asymmetric, X → Y → Z → X, for example A ↔ C and A → C → G → A, respectively). Here, analyses compare amino acids coded by swinger-transformed codons to those coded by untransformed codons, defining coding invariance after transformations. Swinger transformations cluster according to coding invariance in four groups characterized by transformations into cytosine (C = C, T → C, A → C, and G → C). C's central mutational coding role shows that swinger transformations constrained genetic code genesis. Coding invariance post-transformations correlate positively/negatively with mitochondrial swinger transcription/lepidosaurian body temperature. Presumably, low/high temperatures stabilize/revert rare swinger polymerization modes, producing long swinger sequences/point mutations, respectively. Coding invariance after swinger transformations might compensate effects of swinger polymerizations in species with low body temperatures. Hypothetically, swinger transcription increased coding potential of RNA self-replicating protolife systems under heating/cooling cycles.
Collapse
|
15
|
Seligmann H, Warthi G. Genetic Code Optimization for Cotranslational Protein Folding: Codon Directional Asymmetry Correlates with Antiparallel Betasheets, tRNA Synthetase Classes. Comput Struct Biotechnol J 2017; 15:412-424. [PMID: 28924459 PMCID: PMC5591391 DOI: 10.1016/j.csbj.2017.08.001] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2017] [Revised: 07/20/2017] [Accepted: 08/05/2017] [Indexed: 12/14/2022] Open
Abstract
A new codon property, codon directional asymmetry in nucleotide content (CDA), reveals a biologically meaningful genetic code dimension: palindromic codons (first and last nucleotides identical, codon structure XZX) are symmetric (CDA = 0), codons with structures ZXX/XXZ are 5'/3' asymmetric (CDA = - 1/1; CDA = - 0.5/0.5 if Z and X are both purines or both pyrimidines, assigning negative/positive (-/+) signs is an arbitrary convention). Negative/positive CDAs associate with (a) Fujimoto's tetrahedral codon stereo-table; (b) tRNA synthetase class I/II (aminoacylate the 2'/3' hydroxyl group of the tRNA's last ribose, respectively); and (c) high/low antiparallel (not parallel) betasheet conformation parameters. Preliminary results suggest CDA-whole organism associations (body temperature, developmental stability, lifespan). Presumably, CDA impacts spatial kinetics of codon-anticodon interactions, affecting cotranslational protein folding. Some synonymous codons have opposite CDA sign (alanine, leucine, serine, and valine), putatively explaining how synonymous mutations sometimes affect protein function. Correlations between CDA and tRNA synthetase classes are weaker than between CDA and antiparallel betasheet conformation parameters. This effect is stronger for mitochondrial genetic codes, and potentially drives mitochondrial codon-amino acid reassignments. CDA reveals information ruling nucleotide-protein relations embedded in reversed (not reverse-complement) sequences (5'-ZXX-3'/5'-XXZ-3').
Collapse
Affiliation(s)
- Hervé Seligmann
- Aix-Marseille Univ, Unité de Recherche sur les Maladies Infectieuses et Tropicales Emergentes, UM 63, CNRS UMR7278, IRD 198, INSERM U1095, Institut Hospitalo-Universitaire Méditerranée-Infection, Marseille, Postal code 13385, France
- Dept. Ecol Evol Behav, Alexander Silberman Inst Life Sci, The Hebrew University of Jerusalem, IL-91904 Jerusalem, Israel
| | - Ganesh Warthi
- Aix-Marseille Univ, Unité de Recherche sur les Maladies Infectieuses et Tropicales Emergentes, UM 63, CNRS UMR7278, IRD 198, INSERM U1095, Institut Hospitalo-Universitaire Méditerranée-Infection, Marseille, Postal code 13385, France
| |
Collapse
|
16
|
Reviewing evidence for systematic transcriptional deletions, nucleotide exchanges, and expanded codons, and peptide clusters in human mitochondria. Biosystems 2017; 160:10-24. [PMID: 28807694 DOI: 10.1016/j.biosystems.2017.08.002] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2017] [Revised: 07/26/2017] [Accepted: 08/04/2017] [Indexed: 12/12/2022]
Abstract
Polymerization sometimes transforms sequences by (a) systematic deletions of mono-, dinucleotides after trinucleotides, or (b) 23 systematic nucleotide exchanges (9 symmetric, X<>Y, e.g. G<>T, 14 asymmetric, X > Y > Z > X, e.g. A > G > T > A), producing del- and swinger RNAs. Some peptides correspond to del- and swinger RNA translations, also according to tetracodons, codons expanded by a silent nucleotide. Here new analyzes assume different proteolytic patterns, partially alleviating false negative peptide detection biases, expanding noncanonical mitoproteome profiles. Mito-genomic, -transcriptomic and -proteomic evidence for noncanonical transcriptions and translations are reviewed and clusters of del- and swinger peptides (also along tetracodons) are described. Noncanonical peptide clusters indicate regulated expression of cryptically encoded mitochondrial protein coding genes. These candidate noncanonical proteins don't resemble known proteins.
Collapse
|
17
|
Seligmann H. Natural mitochondrial proteolysis confirms transcription systematically exchanging/deleting nucleotides, peptides coded by expanded codons. J Theor Biol 2016; 414:76-90. [PMID: 27899286 DOI: 10.1016/j.jtbi.2016.11.021] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2016] [Revised: 11/11/2016] [Accepted: 11/22/2016] [Indexed: 12/19/2022]
Abstract
Protein sequences have higher linguistic complexities than human languages. This indicates undeciphered multilayered, overprinted information/genetic codes. Some superimposed genetic information is revealed by detections of transcripts systematically (a) exchanging nucleotides (nine symmetric, e.g. A<->C, fourteen asymmetric, e.g. A->C->G->A, swinger RNAs) translated according to tri-, tetra- and pentacodons, and (b) deleting mono-, dinucleotides after each trinucleotide (delRNAs). Here analyses of two independent proteomic datasets considering natural proteolysis confirm independently translation of these non-canonical RNAs, also along tetra- and pentacodons, increasing coverage of putative, cryptically encoded proteins. Analyses assuming endoproteinase GluC and elastase digestions (cleavages after residues D, E, and A, L, I, V, respectively) detect additional peptides colocalizing with detected non-canonical RNAs. Analyses detect fewer peptides matching GluC-, elastase- than trypsin-digestions: artificial trypsin-digestion outweighs natural proteolysis. Results suggest occurrences of complete proteins entirely matching non-canonical, superimposed encoding(s). Protein-coding after bijective transformations could explain genetic code symmetries, such as along Rumer's transformation.
Collapse
Affiliation(s)
- Hervé Seligmann
- Unité de Recherche sur les Maladies Infectieuses et Tropicales Émergentes, Faculté de Médecine, URMITE CNRS-IRD 198 UMER 6236, IHU (Institut Hospitalo-Universitaire), Aix-Marseille University, Marseille, France.
| |
Collapse
|
18
|
Unbiased Mitoproteome Analyses Confirm Non-canonical RNA, Expanded Codon Translations. Comput Struct Biotechnol J 2016; 14:391-403. [PMID: 27830053 PMCID: PMC5094600 DOI: 10.1016/j.csbj.2016.09.004] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2016] [Revised: 09/28/2016] [Accepted: 09/29/2016] [Indexed: 01/14/2023] Open
Abstract
Proteomic MS/MS mass spectrometry detections are usually biased towards peptides cleaved by experimentally added digestion enzyme(s). Hence peptides resulting from spontaneous degradation and natural proteolysis usually remain undetected. Previous analyses of tryptic human proteome data (cleavage after K, R) detected non-canonical tryptic peptides translated according to tetra- and pentacodons (codons expanded by silent mono- and dinucleotides), and from transcripts systematically (a) deleting mono-, dinucleotides after trinucleotides (delRNAs), (b) exchanging nucleotides according to 23 bijective transformations. Nine symmetric and fourteen asymmetric nucleotide exchanges (X ↔ Y, e.g. A ↔ C; and X → Y → Z → X, e.g. A → C → G → A) produce swinger RNAs. Here unbiased reanalyses of these proteomic data detect preferentially non-canonical tryptic peptides despite assuming random cleavage. Unbiased analyses couldn't reconstruct experimental tryptic digestion if most detected non-canonical peptides were false positives. Detected non-tryptic non-canonical peptides map preferentially on corresponding, previously described non-canonical transcripts, as for tryptic non-canonical peptides. Hence unbiased analyses independently confirm previous trypsin-biased analyses that showed translations of del- and swinger RNA and expanded codons. Accounting for natural proteolysis completes trypsin-biased mitopeptidome analyses, independently confirms non-canonical transcriptions and translations.
Collapse
|
19
|
Seligmann H. Natural chymotrypsin-like-cleaved human mitochondrial peptides confirm tetra-, pentacodon, non-canonical RNA translations. Biosystems 2016; 147:78-93. [PMID: 27477600 DOI: 10.1016/j.biosystems.2016.07.010] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2016] [Revised: 07/15/2016] [Accepted: 07/26/2016] [Indexed: 12/22/2022]
Abstract
Mass spectra of human mitochondrial peptides match non-canonical transcripts systematically (a) deleting mono/dinucleotides after trinucleotides (delRNA), (b) exchanging nucleotides (swinger RNA), translated according to tri, (c) tetra- and pentacodons (codons expanded by a 4th (and 5th) silent nucleotide(s)). Swinger transcriptions are 23 bijective transformations, nine symmetric (X<->Y, e.g. A<->C) and fourteen asymmetric exchanges (X->Y->Z->X, e.g. A->C->G->A). Here, proteomic analyses assuming cleavage after W,Y, F (chymotrypsin-like, for trypsinized samples) detect fewer chymotrypsinized than trypsinized peptides. Detected non-canonical peptides map preferentially on detected non-canonical RNAs for chymotrypsinized peptides, as previously found for trypsinized peptides. This suggests residual natural chymotrypsin-like digestion detectable within experimentally trypsinized peptide data. Some trypsinized peptides are detected twice, by analyses assuming trypsin, and those assuming chymotrypsin cleavages. They have higher spectra counts than peptides detected only once, meaning that abundant peptides are more frequently detected, but detection certainties resemble those for peptides detected only once. Analyses assuming 'incorrect' digestions are inadequate negative controls for digestion enzymes naturally active in biological samples. Chymotrypsin-analyses confirm non-canonical transcriptions/translations independently of results obtained assuming trypsinization, increase non-canonical peptidome coverage, indicating mitogenome-encoding of yet undetected proteins.
Collapse
Affiliation(s)
- Hervé Seligmann
- Unité de Recherche sur les Maladies Infectieuses et Tropicales Émergentes, Faculté de Médecine, Université d'Aix-Marseille, URMITE CNRS-IRD 198 UMER 6236, Marseille, France.
| |
Collapse
|
20
|
Chimeric mitochondrial peptides from contiguous regular and swinger RNA. Comput Struct Biotechnol J 2016; 14:283-97. [PMID: 27453772 PMCID: PMC4942731 DOI: 10.1016/j.csbj.2016.06.005] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2016] [Revised: 06/19/2016] [Accepted: 06/23/2016] [Indexed: 12/20/2022] Open
Abstract
Previous mass spectrometry analyses described human mitochondrial peptides entirely translated from swinger RNAs, RNAs where polymerization systematically exchanged nucleotides. Exchanges follow one among 23 bijective transformation rules, nine symmetric exchanges (X ↔ Y, e.g. A ↔ C) and fourteen asymmetric exchanges (X → Y → Z → X, e.g. A → C → G → A), multiplying by 24 DNA's protein coding potential. Abrupt switches from regular to swinger polymerization produce chimeric RNAs. Here, human mitochondrial proteomic analyses assuming abrupt switches between regular and swinger transcriptions, detect chimeric peptides, encoded by part regular, part swinger RNA. Contiguous regular- and swinger-encoded residues within single peptides are stronger evidence for translation of swinger RNA than previously detected, entirely swinger-encoded peptides: regular parts are positive controls matched with contiguous swinger parts, increasing confidence in results. Chimeric peptides are 200 × rarer than swinger peptides (3/100,000 versus 6/1000). Among 186 peptides with > 8 residues for each regular and swinger parts, regular parts of eleven chimeric peptides correspond to six among the thirteen recognized, mitochondrial protein-coding genes. Chimeric peptides matching partly regular proteins are rarer and less expressed than chimeric peptides matching non-coding sequences, suggesting targeted degradation of misfolded proteins. Present results strengthen hypotheses that the short mitogenome encodes far more proteins than hitherto assumed. Entirely swinger-encoded proteins could exist. Chimeric peptides are translated from contiguous regular and swinger RNA They are 200x rarer than mitochondrial swinger peptides Chimeric peptides integrated in regular mitochondrial proteins are downregulated Contiguous regular parts are matched positive controls for swinger parts The last point validates results beyond other statistical tests for robustness
Collapse
|
21
|
Barthélémy RM, Seligmann H. Cryptic tRNAs in chaetognath mitochondrial genomes. Comput Biol Chem 2016; 62:119-32. [DOI: 10.1016/j.compbiolchem.2016.04.007] [Citation(s) in RCA: 35] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2015] [Revised: 04/11/2016] [Accepted: 04/14/2016] [Indexed: 12/14/2022]
|
22
|
Swinger RNA self-hybridization and mitochondrial non-canonical swinger transcription, transcription systematically exchanging nucleotides. J Theor Biol 2016; 399:84-91. [PMID: 27079465 DOI: 10.1016/j.jtbi.2016.04.007] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2016] [Revised: 03/02/2016] [Accepted: 04/05/2016] [Indexed: 11/22/2022]
Abstract
Stem-loop hairpins punctuate mitochondrial post-transcriptional processing. Regulation of mitochondrial swinger transcription, transcription producing RNAs matching the mitogenome only assuming systematic exchanges between nucleotides (23 bijective transformations along 9 symmetric exchanges X<>Y, e.g. A<>G, and 14 asymmetric exchanges X>Y>Z>X, e.g. A>G>C>A) remains unknown. Does swinger RNA self-hybridization regulate swinger, as regular, transcription? Groups of 8 swinger transformations share canonical self-hybridization properties within each group, group 0 includes identity (regular) transcription. The human mitogenome has more stem-loop hairpins than randomized sequences for all groups. Group 2 transformations reveal complementarity of the light strand replication origin (OL) loop and a neighboring tRNA gene, detecting the longtime presumed OL/tRNA homology. Non-canonical G=U pairings in hairpins increases with swinger RNA detection. These results confirm biological relevancy of swinger-transformed DNA/RNA, independently of, and in combination with, previously detected swinger DNA/RNA and swinger peptides. Swinger-transformed mitogenomes include unsuspected multilayered information.
Collapse
|
23
|
Systematically frameshifting by deletion of every 4th or 4th and 5th nucleotides during mitochondrial transcription: RNA self-hybridization regulates delRNA expression. Biosystems 2016; 142-143:43-51. [PMID: 27018206 DOI: 10.1016/j.biosystems.2016.03.009] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2016] [Revised: 03/11/2016] [Accepted: 03/23/2016] [Indexed: 02/05/2023]
Abstract
In mitochondria, secondary structures punctuate post-transcriptional RNA processing. Recently described transcripts match the human mitogenome after systematic deletions of every 4th, respectively every 4th and 5th nucleotides, called delRNAs. Here I explore predicted stem-loop hairpin formation by delRNAs, and their associations with delRNA transcription and detected peptides matching their translation. Despite missing 25, respectively 40% of the nucleotides in the original sequence, del-transformed sequences form significantly more secondary structures than corresponding randomly shuffled sequences, indicating biological function, independently of, and in combination with, previously detected delRNA and thereof translated peptides. Self-hybridization decreases delRNA abundances, indicating downregulation. Systematic deletions of the human mitogenome reveal new, unsuspected coding and structural informations.
Collapse
|
24
|
Seligmann H. Translation of mitochondrial swinger RNAs according to tri-, tetra- and pentacodons. Biosystems 2015; 140:38-48. [PMID: 26723232 DOI: 10.1016/j.biosystems.2015.11.009] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2015] [Revised: 11/08/2015] [Accepted: 11/23/2015] [Indexed: 10/22/2022]
Abstract
Transcriptomes and proteomes include RNA and protein fragments not matching regular transcription/translation. Some 'non-canonical' mitochondrial transcripts match mitogenomes after assuming one among 23 systematic exchanges between nucleotides, producing swinger RNAs (nine symmetric, X↔Y, example C↔T; 14 asymmetric, X→Y→Z→X, example A→T→G→A) in GenBank's EST database. Here, reanalyzes of (a) public human mitochondrial transcriptome data (Illumina: RNA-seq) allowed to detect mitochondrial swinger RNAs for all 23 exchanges and (b) independent public human mitochondrial trypsinized proteomic mass spectrometry data allowed to detect peptides predicted from translation of parts of swinger-transformed mitogenomes covered by detected swinger reads. RNA-seq and previous EST swinger transcript data converge. Swinger RNA translation frequently inserts various amino acids at stop codons. Swinger RNA-peptide associations exist also for peptides matching systematically frameshifting translation, peptides entirely coded by tetra- and pentacodons (regular codons expanded by silent mononucleotides at 4th, and silent dinucleotides at 4th and 5th position(s), respectively). Swinger peptides differ from regular mitochondrial proteins: not membrane embedded, reflect warmer, anaerobic, low resource conditions, reminding a free-living ancestor. Tetra- and pentacoded peptides associate with low, high GC contents, respectively, suggesting expanded codon translations associate with thermic stresses. Results confirm experimentally predicted swinger, tetra- and pentacoded mitochondrial peptides, increasing mitogenomic coding density.
Collapse
Affiliation(s)
- Hervé Seligmann
- Unité de Recherche sur les Maladies Infectieuses et Tropicales Émergentes, Faculté de Médecine, URMITE CNRS-IRD 198 UMER 6236, Université de la Méditerranée, Marseille, France.
| |
Collapse
|
25
|
Seligmann H. Codon expansion and systematic transcriptional deletions produce tetra-, pentacoded mitochondrial peptides. J Theor Biol 2015; 387:154-65. [PMID: 26456204 DOI: 10.1016/j.jtbi.2015.09.030] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2015] [Revised: 09/28/2015] [Accepted: 09/30/2015] [Indexed: 11/28/2022]
Abstract
Genes include occasionally isolated codons with a fourth (and fifth) silent nucleotide(s). Assuming tetracodons, translated hypothetical peptides align with regular GenBank proteins; predicted tetracodons coevolve with predicted tRNAs with expanded anticodons in each mammal, Drosophila and Lepidosauria mitogenomes, GC contents and with lepidosaurian body temperatures, suggesting that expanded codons are an adaptation of translation to high temperature. Hypothetically, continuous stretches of tetra- and pentacodons code for peptides. Both systematic nucleotide deletions during transcription, and translation by tRNAs with expanded anticodons could produce these peptides. Reanalyses of human nanoLc mass spectrometry peptidome data detect numerous tetra- and pentapeptides translated from the human mitogenome. These map preferentially on (BLAST-detected) human RNAs matching the human mitogenome, assuming systematic mono- and dinucleotide deletions after each third nucleotide (delRNAs). Translation by expanded anticodons is incompatible with silent nucleotides in the midst rather than at codon 3' extremity. More than 1/3 of detected tetra- and pentapeptides assume silent positions at codon extremity, suggesting that both mechanisms, regular translation of delRNAs and translation of regular RNAs by expanded anticodons, produce this peptide subgroup. Results show that systematically deleting polymerization occurs, and confirm serial translation of expanded codons. Non-canonical transcriptions and translations considerably expand the coding potential of DNA and RNA sequences.
Collapse
Affiliation(s)
- Hervé Seligmann
- Unité de Recherche sur les Maladies Infectieuses et Tropicales Émergentes, Faculté de Médecine, URMITE CNRS-IRD 198 UMER 6236, Université de la Méditerranée, 13385 Marseille, France.
| |
Collapse
|
26
|
Swinger RNAs with sharp switches between regular transcription and transcription systematically exchanging ribonucleotides: Case studies. Biosystems 2015; 135:1-8. [PMID: 26163926 DOI: 10.1016/j.biosystems.2015.07.003] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2015] [Revised: 07/02/2015] [Accepted: 07/06/2015] [Indexed: 11/20/2022]
Abstract
During RNA transcription, DNA nucleotides A,C,G, T are usually matched by ribonucleotides A, C, G and U. However occasionally, this rule does not apply: transcript-DNA homologies are detectable only assuming systematic exchanges between ribonucleotides. Nine symmetric (X ↔ Y, e.g. A ↔ C) and fourteen asymmetric (X ↔ Y ↔ Z, e.g. A ↔ C ↔ G) exchanges exist, called swinger transcriptions. Putatively, polymerases occasionally stabilize in unspecified swinger conformations, possibly similar to transient conformations causing punctual misinsertions. This predicts chimeric transcripts, part regular, part swinger-transformed, reflecting polymerases switching to swinger polymerization conformation(s). Four chimeric Genbank transcripts (three from human mitochondrion and one murine cytosolic) are described here: (a) the 5' and 3' extremities reflect regular polymerization, the intervening sequence exchanges systematically between ribonucleotides (swinger rule G ↔ U, transcript (1), with sharp switches between regular and swinger sequences; (b) the 5' half is 'normal', the 3' half systematically exchanges ribonucleotides (swinger rule C ↔ G, transcript (2), with an intercalated sequence lacking homology; (c) the 3' extremity fits A ↔ G exchanges (10% of transcript length), the 5' half follows regular transcription; the intervening region seems a mix of regular and A ↔ G transcriptions (transcript 3); (d) murine cytosolic transcript 4 switches to A ↔ U + C ↔ G, and is fused with A ↔ U + C ↔ G swinger transformed precursor rRNA. In (c), each concomitant transcript 5' and 3' extremities match opposite genome strands. Transcripts 3 and 4 combine transcript fusions with partial swinger transcriptions. Occasional (usually sharp) switches between regular and swinger transcriptions reveal greater coding potential than detected until now, suggest stable polymerase swinger conformations.
Collapse
|
27
|
Michel CJ. The maximal C(3) self-complementary trinucleotide circular code X in genes of bacteria, eukaryotes, plasmids and viruses. J Theor Biol 2015; 380:156-77. [PMID: 25934352 DOI: 10.1016/j.jtbi.2015.04.009] [Citation(s) in RCA: 47] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2014] [Revised: 02/28/2015] [Accepted: 04/09/2015] [Indexed: 11/28/2022]
Abstract
In 1996, a set X of 20 trinucleotides is identified in genes of both prokaryotes and eukaryotes which has in average the highest occurrence in reading frame compared to the two shifted frames (Arquès and Michel, 1996). Furthermore, this set X has an interesting mathematical property as X is a maximal C(3) self-complementary trinucleotide circular code (Arquès and Michel, 1996). In 2014, the number of trinucleotides in prokaryotic genes has been multiplied by a factor of 527. Furthermore, two new gene kingdoms of plasmids and viruses contain enough trinucleotide data to be analysed. The approach used in 1996 for identifying a preferential frame for a trinucleotide is quantified here with a new definition analysing the occurrence probability of a complementary/permutation (CP) trinucleotide set in a gene kingdom. Furthermore, in order to increase the statistical significance of results compared to those of 1996, the circular code X is studied on several gene taxonomic groups in a kingdom. Based on this new statistical approach, the circular code X is strengthened in genes of prokaryotes and eukaryotes, and now also identified in genes of plasmids. A subset of X with 18 or 16 trinucleotides is identified in genes of viruses. Furthermore, a simple probabilistic model based on the independent occurrence of trinucleotides in reading frame of genes explains the circular code frequencies and asymmetries observed in the shifted frames in all studied gene kingdoms. Finally, the developed approach allows to identify variant X codes in genes, i.e. trinucleotide codes which differ from X. In genes of bacteria, eukaryotes and plasmids, 14 among the 47 studied gene taxonomic groups (about 30%) have variant X codes. Seven variant X codes are identified with at least 16 trinucleotides of X. Two variant X codes XA in cyanobacteria and plasmids of cyanobacteria, and XD in birds are self-complementary, without permuted trinucleotides but non-circular. Five variant X codes XB in deinococcus, plasmids of chloroflexi and deinococcus, mammals and kinetoplasts, XC in elusimicrobia and apicomplexans, XE in fishes, XF in insects, and XG in basidiomycetes and plasmids of spirochaetes are C(3) self-complementary circular. In genes of viruses, no variant X code is found.
Collapse
Affiliation(s)
- Christian J Michel
- Theoretical Bioinformatics, ICube, University of Strasbourg, CNRS, 300 Boulevard Sébastien Brant, 67400 Illkirch, France.
| |
Collapse
|
28
|
Seligmann H. Sharp switches between regular and swinger mitochondrial replication: 16S rDNA systematically exchanging nucleotides A<->T+C<->G in the mitogenome of Kamimuria wangi. Mitochondrial DNA A DNA Mapp Seq Anal 2015; 27:2440-6. [PMID: 25865623 DOI: 10.3109/19401736.2015.1033691] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Swinger DNAs are sequences whose homology with known sequences is detected only by assuming systematic exchanges between nucleotides. Nine symmetric (X<->Y, i.e. A<->C) and fourteen asymmetric (X->Y->Z, i.e. A->C->G) exchanges exist. All swinger DNA previously detected in GenBank follow the A<->T+C<->G exchange, while mitochondrial swinger RNAs distribute among different swinger types. Here different alignment criteria detect 87 additional swinger mitochondrial DNAs (86 from insects), including the first swinger gene embedded within a complete genome, corresponding to the mitochondrial 16S rDNA of the stonefly Kamimuria wangi. Other Kamimuria mt genome regions are "regular", stressing unanswered questions on (a) swinger polymerization regulation; (b) swinger 16S rDNA functions; and (c) specificity to rDNA, in particular 16S rDNA. Sharp switches between regular and swinger replication, together with previous observations on swinger transcription, suggest that swinger replication might be due to a switch in polymerization mode of regular polymerases and the possibility of swinger-encoded information, predicted in primordial genes such as rDNA.
Collapse
Affiliation(s)
- Hervé Seligmann
- a Unité de Recherche sur les Maladies Infectieuses et Tropicales Émergentes, Faculté de Médecine, URMITE CNRS-IRD 198 UMR 6236, Université d'Aix-Marseille , Marseille , France
| |
Collapse
|
29
|
Phylogeny of genetic codes and punctuation codes within genetic codes. Biosystems 2015; 129:36-43. [DOI: 10.1016/j.biosystems.2015.01.003] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2014] [Revised: 01/02/2015] [Accepted: 01/14/2015] [Indexed: 11/23/2022]
|
30
|
Capt C, Passamonti M, Breton S. The human mitochondrial genome may code for more than 13 proteins. Mitochondrial DNA A DNA Mapp Seq Anal 2015; 27:3098-101. [PMID: 25630734 DOI: 10.3109/19401736.2014.1003924] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
The human mitochondrial (mt) DNA is commonly described as a small, maternally inherited molecule that encodes 13 protein components of the oxidative phosphorylation system and 24 structural RNAs required for their translation. However, recent studies indicate that the human mtDNA has a larger functional repertoire than previously believed. This paper briefly summarizes these studies, which suggest to reconsider our way to describe the human mitochondrial DNA as it may code for more than 13 proteins.
Collapse
Affiliation(s)
- Charlotte Capt
- a Département de Sciences Biologiques , Université de Montréal , Montréal , Québec , Canada and
| | - Marco Passamonti
- b Dipartimento di Scienze Biologiche , Geologiche ed Ambientali, University of Bologna , Bologna , Italy
| | - Sophie Breton
- a Département de Sciences Biologiques , Université de Montréal , Montréal , Québec , Canada and
| |
Collapse
|
31
|
Root-Bernstein M, Root-Bernstein R. The ribosome as a missing link in the evolution of life. J Theor Biol 2014; 367:130-158. [PMID: 25500179 DOI: 10.1016/j.jtbi.2014.11.025] [Citation(s) in RCA: 80] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2014] [Revised: 11/15/2014] [Accepted: 11/20/2014] [Indexed: 12/27/2022]
Abstract
Many steps in the evolution of cellular life are still mysterious. We suggest that the ribosome may represent one important missing link between compositional (or metabolism-first), RNA-world (or genes-first) and cellular (last universal common ancestor) approaches to the evolution of cells. We present evidence that the entire set of transfer RNAs for all twenty amino acids are encoded in both the 16S and 23S rRNAs of Escherichia coli K12; that nucleotide sequences that could encode key fragments of ribosomal proteins, polymerases, ligases, synthetases, and phosphatases are to be found in each of the six possible reading frames of the 16S and 23S rRNAs; and that every sequence of bases in rRNA has information encoding more than one of these functions in addition to acting as a structural component of the ribosome. Ribosomal RNA, in short, is not just a structural scaffold for proteins, but the vestigial remnant of a primordial genome that may have encoded a self-organizing, self-replicating, auto-catalytic intermediary between macromolecules and cellular life.
Collapse
MESH Headings
- Amino Acid Sequence
- Base Sequence
- Biological Evolution
- Escherichia coli K12/enzymology
- Escherichia coli K12/metabolism
- Escherichia coli Proteins/chemistry
- Escherichia coli Proteins/metabolism
- Molecular Sequence Data
- Nucleic Acid Conformation
- Probability
- RNA, Bacterial/chemistry
- RNA, Bacterial/genetics
- RNA, Ribosomal, 16S/chemistry
- RNA, Ribosomal, 16S/genetics
- RNA, Ribosomal, 23S/chemistry
- RNA, Ribosomal, 23S/genetics
- RNA, Transfer/chemistry
- RNA, Transfer/genetics
- Ribosomes/metabolism
- Transcription, Genetic
Collapse
Affiliation(s)
- Meredith Root-Bernstein
- School of Geography and the Environment, Oxford University, South Parks Road, Oxford, Oxfordshire OX1 3QY, United Kingdom
| | | |
Collapse
|
32
|
Seligmann H. Mitochondrial swinger replication: DNA replication systematically exchanging nucleotides and short 16S ribosomal DNA swinger inserts. Biosystems 2014; 125:22-31. [PMID: 25283331 DOI: 10.1016/j.biosystems.2014.09.012] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2014] [Revised: 09/08/2014] [Accepted: 09/29/2014] [Indexed: 11/27/2022]
Abstract
Assuming systematic exchanges between nucleotides (swinger RNAs) resolves genomic 'parenthood' of some orphan mitochondrial transcripts. Twenty-three different systematic nucleotide exchanges (bijective transformations) exist. Similarities between transcription and replication suggest occurrence of swinger DNA. GenBank searches for swinger DNA matching the 23 swinger versions of human and mouse mitogenomes detect only vertebrate mitochondrial swinger DNA for swinger type AT+CG (from five different studies, 149 sequences) matching three human and mouse mitochondrial genes: 12S and 16S ribosomal RNAs, and cytochrome oxidase subunit I. Exchange A<->T+C<->G conserves self-hybridization properties, putatively explaining swinger biases for rDNA, against protein coding genes. Twenty percent of the regular human mitochondrial 16S rDNA consists of short swinger repeats (from 13 exchanges). Swinger repeats could originate from recombinations between regular and swinger DNA: duplicated mitochondrial genes of the parthenogenetic gecko Heteronotia binoei include fewer short A<->T+C<->G swinger repeats than non-duplicated mitochondrial genomes of that species. Presumably, rare recombinations between female and male mitochondrial genes (and in parthenogenetic situations between duplicated genes), favors reverse-mutations of swinger repeat insertions, probably because most inserts affect negatively ribosomal function. Results show that swinger DNA exists, and indicate that swinger polymerization contributes to the genesis of genetic material and polymorphism.
Collapse
Affiliation(s)
- Hervé Seligmann
- Unité de Recherche sur les Maladies Infectieuses et Tropicales Émergentes, Faculté de Médecine, URMITE CNRS-IRD 198 UMER 6236, Université de la Méditerranée, Marseille, France.
| |
Collapse
|
33
|
Seligmann H. Species radiation by DNA replication that systematically exchanges nucleotides? J Theor Biol 2014; 363:216-22. [PMID: 25192628 DOI: 10.1016/j.jtbi.2014.08.036] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2014] [Revised: 08/14/2014] [Accepted: 08/19/2014] [Indexed: 11/28/2022]
Abstract
RNA and DNA syntheses share many properties. Therefore, the existence of 'swinger' RNAs, presumed 'orphan' transcripts matching genomic sequences only if transcription systematically exchanged nucleotides, suggests replication producing swinger DNA. Transcripts occur in many short-lived copies, the few cellular DNA molecules are long-lived. Hence pressures for functional swinger DNAs are greater than for swinger RNAs. Protein coding properties of swinger sequences differ from original sequences, suggesting rarity of corresponding swinger DNA. For genes producing structural RNAs, such as tRNAs and rRNAs, three exchanges (A<->T, C<->G and A<->T+C<->G) conserve self-hybridization properties. All nuclear eukaryote swinger DNA sequences detected in GenBank are for rRNA genes assuming A<->T+C<->G exchanges. In brachyuran crabs, 25 species had A<->T+C<->G swinger 18S rDNA, all matching the reverse-exchanged version of regular 18S rDNA of a related species. In this taxon, swinger replication of 18S rDNA apparently associated with, or even resulted in species radiation. A<->T+C<->G transformation doesn't invert sequence direction, differing from inverted repeats. Swinger repeats (detectable only assuming swinger transformations, A<->T+C<->G swinger repeats most frequent) within regular human rRNAs, independently confirm swinger polymerizations for most swinger types. Swinger replication might be an unsuspected molecular mechanism for ultrafast speciation.
Collapse
Affiliation(s)
- Hervé Seligmann
- Unité de Recherche sur les Maladies Infectieuses et Tropicales Émergentes, Faculté de Médecine, URMITE CNRS-IRD 198 UMER 6236, Université de la Méditerranée, Marseille, France.
| |
Collapse
|
34
|
The relation between hairpin formation by mitochondrial WANCY tRNAs and the occurrence of the light strand replication origin in Lepidosauria. Gene 2014; 542:248-57. [DOI: 10.1016/j.gene.2014.02.021] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2013] [Revised: 12/27/2013] [Accepted: 02/17/2014] [Indexed: 01/28/2023]
|
35
|
Seligmann H, Labra A. Tetracoding increases with body temperature in Lepidosauria. Biosystems 2013; 114:155-63. [DOI: 10.1016/j.biosystems.2013.09.002] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2013] [Revised: 09/04/2013] [Accepted: 09/05/2013] [Indexed: 10/26/2022]
|
36
|
Seligmann H. Putative anticodons in mitochondrial tRNA sidearm loops: Pocketknife tRNAs? J Theor Biol 2013; 340:155-63. [PMID: 24012463 DOI: 10.1016/j.jtbi.2013.08.030] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2013] [Revised: 08/15/2013] [Accepted: 08/26/2013] [Indexed: 10/26/2022]
Abstract
The hypothesis that tRNA sidearm loops bear anticodons assumes crossovers between anticodon and sidearms, or translation by expressed aminoacylated tRNA halves forming single stem-loops. Only the latter might require ribosomal adaptations. Drosophila mitochondrial codon usages coevolve with sidearm numbers bearing matching putative anticodons (comparing different codon families in one genome, macroevolution) and when comparing different genomes for single codon families (microevolution). Coevolution between Drosophila and yeast mitochondrial antisense tRNAs and codon usages partly confounds microevolutionary patterns for putative sidearm anticodons. Some tRNA sidearm loops have more than seven nucleotides, putative expanded anticodons potentially matching quadruplet codons (tetracodons, codons expanded by a fourth silent position, forming tetragenes (predicted by alignment analyses of Drosophila mitochondrial genomes)). Tetracodon numbers coevolve with expanded tRNA sidearm loops. Sidearm coevolution with amino acid usages and tetragenes occurs for putative anticodons in 5' and 3' sidearms loops (D and TΨC loops, respectively), are stronger for the D-loop. Results slightly favour isolated stem-loops upon crossover hypotheses. An alternative hypothesis, that patterns observed for sidearm 'anticodons' do not imply translational activity, but recognition signals for tRNA synthetases that aminoacylate tRNAs, is incompatible with tetracodon/tetra-anticodon coevolution. Hence analyses strengthen translational hypotheses for tRNA sidearm anticodons, tetragenes, and antisense tRNAs.
Collapse
Affiliation(s)
- Hervé Seligmann
- National Natural History Museum Collections, The Hebrew University of Jerusalem, 91904 Jerusalem, Israel; Department of Life Sciences, Ben Gurion University, 84105 Beer Sheva, Israel.
| |
Collapse
|
37
|
Seligmann H. Pocketknife tRNA hypothesis: Anticodons in mammal mitochondrial tRNA side-arm loops translate proteins? Biosystems 2013; 113:165-76. [DOI: 10.1016/j.biosystems.2013.07.004] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2013] [Revised: 07/02/2013] [Accepted: 07/03/2013] [Indexed: 12/11/2022]
|
38
|
Seligmann H. Polymerization of non-complementary RNA: systematic symmetric nucleotide exchanges mainly involving uracil produce mitochondrial RNA transcripts coding for cryptic overlapping genes. Biosystems 2013; 111:156-74. [PMID: 23410796 DOI: 10.1016/j.biosystems.2013.01.011] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2012] [Revised: 01/24/2013] [Accepted: 01/29/2013] [Indexed: 12/23/2022]
Abstract
Usual DNA→RNA transcription exchanges T→U. Assuming different systematic symmetric nucleotide exchanges during translation, some GenBank RNAs match exactly human mitochondrial sequences (exchange rules listed in decreasing transcript frequencies): C↔U, A↔U, A↔U+C↔G (two nucleotide pairs exchanged), G↔U, A↔G, C↔G, none for A↔C, A↔G+C↔U, and A↔C+G↔U. Most unusual transcripts involve exchanging uracil. Independent measures of rates of rare replicational enzymatic DNA nucleotide misinsertions predict frequencies of RNA transcripts systematically exchanging the corresponding misinserted nucleotides. Exchange transcripts self-hybridize less than other gene regions, self-hybridization increases with length, suggesting endoribonuclease-limited elongation. Blast detects stop codon depleted putative protein coding overlapping genes within exchange-transcribed mitochondrial genes. These align with existing GenBank proteins (mainly metazoan origins, prokaryotic and viral origins underrepresented). These GenBank proteins frequently interact with RNA/DNA, are membrane transporters, or are typical of mitochondrial metabolism. Nucleotide exchange transcript frequencies increase with overlapping gene densities and stop densities, indicating finely tuned counterbalancing regulation of expression of systematic symmetric nucleotide exchange-encrypted proteins. Such expression necessitates combined activities of suppressor tRNAs matching stops, and nucleotide exchange transcription. Two independent properties confirm predicted exchanged overlap coding genes: discrepancy of third codon nucleotide contents from replicational deamination gradients, and codon usage according to circular code predictions. Predictions from both properties converge, especially for frequent nucleotide exchange types. Nucleotide exchanging transcription apparently increases coding densities of protein coding genes without lengthening genomes, revealing unsuspected functional DNA coding potential.
Collapse
Affiliation(s)
- Hervé Seligmann
- National Natural History Museum Collections, The Hebrew University of Jerusalem, 91904 Jerusalem, Israel.
| |
Collapse
|
39
|
Seligmann H. Systematic asymmetric nucleotide exchanges produce human mitochondrial RNAs cryptically encoding for overlapping protein coding genes. J Theor Biol 2013; 324:1-20. [PMID: 23416187 DOI: 10.1016/j.jtbi.2013.01.024] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2012] [Revised: 01/26/2013] [Accepted: 01/28/2013] [Indexed: 11/19/2022]
Abstract
GenBank's EST database includes RNAs matching exactly human mitochondrial sequences assuming systematic asymmetric nucleotide exchange-transcription along exchange rules: A→G→C→U/T→A (12 ESTs), A→U/T→C→G→A (4 ESTs), C→G→U/T→C (3 ESTs), and A→C→G→U/T→A (1 EST), no RNAs correspond to other potential asymmetric exchange rules. Hypothetical polypeptides translated from nucleotide-exchanged human mitochondrial protein coding genes align with numerous GenBank proteins, predicted secondary structures resemble their putative GenBank homologue's. Two independent methods designed to detect overlapping genes (one based on nucleotide contents analyses in relation to replicative deamination gradients at third codon positions, and circular code analyses of codon contents based on frame redundancy), confirm nucleotide-exchange-encrypted overlapping genes. Methods converge on which genes are most probably active, and which not, and this for the various exchange rules. Mean EST lengths produced by different nucleotide exchanges are proportional to (a) extents that various bioinformatics analyses confirm the protein coding status of putative overlapping genes; (b) known kinetic chemistry parameters of the corresponding nucleotide substitutions by the human mitochondrial DNA polymerase gamma (nucleotide DNA misinsertion rates); (c) stop codon densities in predicted overlapping genes (stop codon readthrough and exchanging polymerization regulate gene expression by counterbalancing each other). Numerous rarely expressed proteins seem encoded within regular mitochondrial genes through asymmetric nucleotide exchange, avoiding lengthening genomes. Intersecting evidence between several independent approaches confirms the working hypothesis status of gene encryption by systematic nucleotide exchanges.
Collapse
Affiliation(s)
- Hervé Seligmann
- National Natural History Museum Collections, The Hebrew University of Jerusalem, 91904 Jerusalem, Israel.
| |
Collapse
|
40
|
Overlapping genes coded in the 3′-to-5′-direction in mitochondrial genes and 3′-to-5′ polymerization of non-complementary RNA by an ‘invertase’. J Theor Biol 2012; 315:38-52. [DOI: 10.1016/j.jtbi.2012.08.044] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2012] [Revised: 08/17/2012] [Accepted: 08/30/2012] [Indexed: 11/23/2022]
|