1
|
The Mutational Robustness of the Genetic Code and Codon Usage in Environmental Context: A Non-Extremophilic Preference? Life (Basel) 2021; 11:life11080773. [PMID: 34440517 PMCID: PMC8398314 DOI: 10.3390/life11080773] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2021] [Revised: 07/23/2021] [Accepted: 07/28/2021] [Indexed: 12/12/2022] Open
Abstract
The genetic code was evolved, to some extent, to minimize the effects of mutations. The effects of mutations depend on the amino acid repertoire, the structure of the genetic code and frequencies of amino acids in proteomes. The amino acid compositions of proteins and corresponding codon usages are still under selection, which allows us to ask what kind of environment the standard genetic code is adapted to. Using simple computational models and comprehensive datasets comprising genomic and environmental data from all three domains of Life, we estimate the expected severity of non-synonymous genomic mutations in proteins, measured by the change in amino acid physicochemical properties. We show that the fidelity in these physicochemical properties is expected to deteriorate with extremophilic codon usages, especially in thermophiles. These findings suggest that the genetic code performs better under non-extremophilic conditions, which not only explains the low substitution rates encountered in halophiles and thermophiles but the revealed relationship between the genetic code and habitat allows us to ponder on earlier phases in the history of Life.
Collapse
|
2
|
Phylogenetic analysis of mutational robustness based on codon usage supports that the standard genetic code does not prefer extreme environments. Sci Rep 2021; 11:10963. [PMID: 34040064 PMCID: PMC8154912 DOI: 10.1038/s41598-021-90440-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2021] [Accepted: 05/10/2021] [Indexed: 02/04/2023] Open
Abstract
The mutational robustness of the genetic code is rarely discussed in the context of biological diversity, such as codon usage and related factors, often considered as independent of the actual organism's proteome. Here we put the living beings back to picture and use distortion as a metric of mutational robustness. Distortion estimates the expected severities of non-synonymous mutations measuring it by amino acid physicochemical properties and weighting for codon usage. Using the biological variance of codon frequencies, we interpret the mutational robustness of the standard genetic code with regards to their corresponding environments and genomic compositions (GC-content). Employing phylogenetic analyses, we show that coding fidelity in physicochemical properties can deteriorate with codon usages adapted to extreme environments and these putative effects are not the artefacts of phylogenetic bias. High temperature environments select for codon usages with decreased mutational robustness of hydrophobic, volumetric, and isoelectric properties. Selection at high saline concentrations also leads to reduced fidelity in polar and isoelectric patterns. These show that the genetic code performs best with mesophilic codon usages, strengthening the view that LUCA or its ancestors preferred lower temperature environments. Taxonomic implications, such as rooting the tree of life, are also discussed.
Collapse
|
3
|
Rogers SO. Evolution of the genetic code based on conservative changes of codons, amino acids, and aminoacyl tRNA synthetases. J Theor Biol 2019; 466:1-10. [PMID: 30658052 DOI: 10.1016/j.jtbi.2019.01.022] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2018] [Revised: 01/10/2019] [Accepted: 01/14/2019] [Indexed: 11/30/2022]
Abstract
The genetic code, as arranged in the standard tabular form, displays a non-random structure relating to the characteristics of the amino acids. An alternative arrangement can be made by organizing the code according to aminoacyl-tRNA synthetases (aaRSs), codons, and reverse complement codons, which illuminates a coevolutionary process that led to the contemporary genetic code. As amino acids were added to the genetic code, they were recognized by aaRSs that interact with stereochemically similar amino acids. Single nucleotide changes in the codons and anticodons were favored over more extensive changes, such that there was a logical stepwise progression in the evolution of the genetic code. The model presented traces the evolution of the genetic code accounting for these steps. Amino acid frequencies in ancient proteins and the preponderance of GNN codons in mRNAs for ancient proteins indicate that the genetic code began with alanine, aspartate, glutamate, glycine, and valine, with alanine being in the highest proportions. In addition to being consistent in terms of conservative changes in codon nucleotides, the model also is consistent with respect to aaRS classes, aaRS attachment to the tRNA, amino acid stereochemistry, and to a large extent with amino acid physicochemistry, and biochemical pathways.
Collapse
Affiliation(s)
- Scott O Rogers
- Department of Biological Sciences, Bowling Green State University, Bowling Green, OH, United States.
| |
Collapse
|
4
|
de Farias ST, Antonino D, Rêgo TG, José MV. Structural evolution of Glycyl-tRNA synthetases alpha subunit and its implication in the initial organization of the decoding system. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2018; 142:43-50. [PMID: 30142371 DOI: 10.1016/j.pbiomolbio.2018.08.007] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/22/2018] [Revised: 07/13/2018] [Accepted: 08/14/2018] [Indexed: 11/27/2022]
Abstract
The origin and evolution of the genetic code is a fundamental challenge in modern biology. At the center of this problem is the correct interaction between amino acids and tRNAs. Aminoacyl-tRNA synthetase is the enzyme responsible for the correct binding between amino acids and tRNAs. Among the 20 canonical amino acid, glycine was the most abundant in prebiotic condition and it must have been one of the first to be incorporated into the genetic code. In this work, we derive the ancestral sequence of Glycyl-tRNA synthetase (GlyRS) and predict its 3D-structure. We show, via molecular docking experiments, the capacity of ancestral GlyRS to bind the tRNA anticodon stem loop, cofactors and substrates. These bindings exhibit high affinity and specificity. We propose that the primordial function of these interactions was to stabilize both compounds to make possible the catalysis. In this context, the anticodon stem loop did contribute to the encoding system and just with the emergence of the mRNA it was co-opted for codification. Thus, we present a model for the origin of the genetic code in which the operational and the anticodon codes did not evolve independently.
Collapse
Affiliation(s)
- Savio Torres de Farias
- Laboratório de Genética Evolutiva Paulo Leminsk, Departamento de Biologia Molecular, Universidade Federal da Paraíba, João Pessoa, Brazil.
| | - Daniel Antonino
- Laboratório de Genética Evolutiva Paulo Leminsk, Departamento de Biologia Molecular, Universidade Federal da Paraíba, João Pessoa, Brazil
| | - Thais Gaudêncio Rêgo
- Departamento de Informática, Universidade Federal da Paraíba, João Pessoa, Brazil
| | - Marco V José
- Theoretical Biology Group, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, Ciudad de México CDMX, C.P. 04510, Mexico.
| |
Collapse
|
5
|
Froese T, Campos JI, Fujishima K, Kiga D, Virgo N. Horizontal transfer of code fragments between protocells can explain the origins of the genetic code without vertical descent. Sci Rep 2018; 8:3532. [PMID: 29476089 PMCID: PMC5824800 DOI: 10.1038/s41598-018-21973-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2018] [Accepted: 02/14/2018] [Indexed: 11/09/2022] Open
Abstract
Theories of the origin of the genetic code typically appeal to natural selection and/or mutation of hereditable traits to explain its regularities and error robustness, yet the present translation system presupposes high-fidelity replication. Woese's solution to this bootstrapping problem was to assume that code optimization had played a key role in reducing the effect of errors caused by the early translation system. He further conjectured that initially evolution was dominated by horizontal exchange of cellular components among loosely organized protocells ("progenotes"), rather than by vertical transmission of genes. Here we simulated such communal evolution based on horizontal transfer of code fragments, possibly involving pairs of tRNAs and their cognate aminoacyl tRNA synthetases or a precursor tRNA ribozyme capable of catalysing its own aminoacylation, by using an iterated learning model. This is the first model to confirm Woese's conjecture that regularity, optimality, and (near) universality could have emerged via horizontal interactions alone.
Collapse
Affiliation(s)
- Tom Froese
- Institute for Applied Mathematics and Systems Research (IIMAS), National Autonomous University of Mexico (UNAM), Mexico City, 04510, Mexico. .,Center for the Sciences of Complexity (C3), National Autonomous University of Mexico (UNAM), Mexico City, 04510, Mexico.
| | - Jorge I Campos
- Center for the Sciences of Complexity (C3), National Autonomous University of Mexico (UNAM), Mexico City, 04510, Mexico.,Faculty of Higher Education Aragon, National Autonomous University of Mexico (UNAM), Nezahualcoyotl City, State of Mexico, 57130, Mexico
| | - Kosuke Fujishima
- Earth-Life Science Institute, Tokyo Institute of Technology, Meguro-ku, Tokyo, 152-8550, Japan.,Institute for Advanced Biosciences, Keio University, Tsuruoka, 9970035, Japan
| | - Daisuke Kiga
- Faculty of Science and Engineering, School of Advanced Science and Engineering, Waseda University, Shinjuku, Tokyo, 169-8555, Japan
| | - Nathaniel Virgo
- Earth-Life Science Institute, Tokyo Institute of Technology, Meguro-ku, Tokyo, 152-8550, Japan
| |
Collapse
|
6
|
The evolution of the genetic code: Impasses and challenges. Biosystems 2018; 164:217-225. [DOI: 10.1016/j.biosystems.2017.10.006] [Citation(s) in RCA: 38] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2017] [Revised: 10/06/2017] [Accepted: 10/09/2017] [Indexed: 01/17/2023]
|
7
|
Zamudio GS, José MV. Phenotypic Graphs and Evolution Unfold the Standard Genetic Code as the Optimal. ORIGINS LIFE EVOL B 2017; 48:83-91. [PMID: 29082465 DOI: 10.1007/s11084-017-9552-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2017] [Accepted: 10/16/2017] [Indexed: 10/18/2022]
Abstract
In this work, we explicitly consider the evolution of the Standard Genetic Code (SGC) by assuming two evolutionary stages, to wit, the primeval RNY code and two intermediate codes in between. We used network theory and graph theory to measure the connectivity of each phenotypic graph. The connectivity values are compared to the values of the codes under different randomization scenarios. An error-correcting optimal code is one in which the algebraic connectivity is minimized. We show that the SGC is optimal in regard to its robustness and error-tolerance when compared to all random codes under different assumptions.
Collapse
Affiliation(s)
- Gabriel S Zamudio
- Theoretical Biology Group, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, C.P. 04510, Ciudad de México CDMX, Mexico
| | - Marco V José
- Theoretical Biology Group, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, C.P. 04510, Ciudad de México CDMX, Mexico.
| |
Collapse
|
8
|
Seligmann H, Warthi G. Genetic Code Optimization for Cotranslational Protein Folding: Codon Directional Asymmetry Correlates with Antiparallel Betasheets, tRNA Synthetase Classes. Comput Struct Biotechnol J 2017; 15:412-424. [PMID: 28924459 PMCID: PMC5591391 DOI: 10.1016/j.csbj.2017.08.001] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2017] [Revised: 07/20/2017] [Accepted: 08/05/2017] [Indexed: 12/14/2022] Open
Abstract
A new codon property, codon directional asymmetry in nucleotide content (CDA), reveals a biologically meaningful genetic code dimension: palindromic codons (first and last nucleotides identical, codon structure XZX) are symmetric (CDA = 0), codons with structures ZXX/XXZ are 5'/3' asymmetric (CDA = - 1/1; CDA = - 0.5/0.5 if Z and X are both purines or both pyrimidines, assigning negative/positive (-/+) signs is an arbitrary convention). Negative/positive CDAs associate with (a) Fujimoto's tetrahedral codon stereo-table; (b) tRNA synthetase class I/II (aminoacylate the 2'/3' hydroxyl group of the tRNA's last ribose, respectively); and (c) high/low antiparallel (not parallel) betasheet conformation parameters. Preliminary results suggest CDA-whole organism associations (body temperature, developmental stability, lifespan). Presumably, CDA impacts spatial kinetics of codon-anticodon interactions, affecting cotranslational protein folding. Some synonymous codons have opposite CDA sign (alanine, leucine, serine, and valine), putatively explaining how synonymous mutations sometimes affect protein function. Correlations between CDA and tRNA synthetase classes are weaker than between CDA and antiparallel betasheet conformation parameters. This effect is stronger for mitochondrial genetic codes, and potentially drives mitochondrial codon-amino acid reassignments. CDA reveals information ruling nucleotide-protein relations embedded in reversed (not reverse-complement) sequences (5'-ZXX-3'/5'-XXZ-3').
Collapse
Affiliation(s)
- Hervé Seligmann
- Aix-Marseille Univ, Unité de Recherche sur les Maladies Infectieuses et Tropicales Emergentes, UM 63, CNRS UMR7278, IRD 198, INSERM U1095, Institut Hospitalo-Universitaire Méditerranée-Infection, Marseille, Postal code 13385, France
- Dept. Ecol Evol Behav, Alexander Silberman Inst Life Sci, The Hebrew University of Jerusalem, IL-91904 Jerusalem, Israel
| | - Ganesh Warthi
- Aix-Marseille Univ, Unité de Recherche sur les Maladies Infectieuses et Tropicales Emergentes, UM 63, CNRS UMR7278, IRD 198, INSERM U1095, Institut Hospitalo-Universitaire Méditerranée-Infection, Marseille, Postal code 13385, France
| |
Collapse
|
9
|
Zamudio GS, José MV. On the Uniqueness of the Standard Genetic Code. Life (Basel) 2017; 7:life7010007. [PMID: 28208827 PMCID: PMC5370407 DOI: 10.3390/life7010007] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2016] [Revised: 02/07/2017] [Accepted: 02/08/2017] [Indexed: 11/16/2022] Open
Abstract
In this work, we determine the biological and mathematical properties that are sufficient and necessary to uniquely determine both the primeval RNY (purine-any base-pyrimidine) code and the standard genetic code (SGC). These properties are: the evolution of the SGC from the RNY code; the degeneracy of both codes, and the non-degeneracy of the assignments of aminoacyl-tRNA synthetases (aaRSs) to amino acids; the wobbling property; the consideration that glycine was the first amino acid; the topological and symmetrical properties of both codes.
Collapse
Affiliation(s)
- Gabriel S Zamudio
- Theoretical Biology Group, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, México D.F. 04510, Mexico.
| | - Marco V José
- Theoretical Biology Group, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, México D.F. 04510, Mexico.
| |
Collapse
|
10
|
Aggarwal N, Bandhu AV, Sengupta S. Finite population analysis of the effect of horizontal gene transfer on the origin of an universal and optimal genetic code. Phys Biol 2016; 13:036007. [DOI: 10.1088/1478-3975/13/3/036007] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
11
|
Pathways of Genetic Code Evolution in Ancient and Modern Organisms. J Mol Evol 2015; 80:229-43. [DOI: 10.1007/s00239-015-9686-8] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2015] [Accepted: 06/03/2015] [Indexed: 10/23/2022]
|
12
|
Guimarães RC. The Self-Referential Genetic Code is Biologic and Includes the Error Minimization Property. ORIGINS LIFE EVOL B 2015; 45:69-75. [PMID: 25773583 DOI: 10.1007/s11084-015-9417-6] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2014] [Accepted: 11/24/2014] [Indexed: 11/26/2022]
Abstract
The distribution of the triplet to amino acid correspondences in the genetic code matrix contains blocks of similarity. There are (a) groups of similar triplets coding for the same amino acid, which is called code degeneracy, and (b) clusters of similar amino acids corresponding to similar triplets. Processes that led to this regionalization have been investigated through a variety of perspectives but no consensus has been reached and no model has been convincing enough to drive experimental tests. Most traditional has been the hypothesis that the code was derived from the standard evolutionary processes of testing variations in the correspondences through the fitness measure of reaching distributions in the matrix space in an optimal manner so that the effects of mutations on protein phenotypes would be minimized, that is, with reduction of the intensity or of the deviant quality of the functional alterations associated with variations. In contrast, the self-referential model for the formation of the code is based on an original regionalization of characters through the concerted superposition of the two components of the encodings: the four modules of dimers of tRNAs are occupied sequentially by sets of amino acids that are also sequentially devoted to fulfilling specific functions in the protein sites and motifs to which they preferentially belong. Therewith, part (b) of the error-minimizing property follows. Part (a) of the property, the code degeneracy, is derived from the synthetase character of developing specificities directed initially to the principal dinucleotides of the triplets, resulting in tetracodonic degeneracy. This was later partly modified during evolution according to the developments of codon usage and the introduction of new amino acids.
Collapse
Affiliation(s)
- Romeu Cardoso Guimarães
- Lab. Biodiversidade e Evolução Molecular, Dept. Biologia Geral, Inst. Ciências Biológicas, Univ. Federal de Minas Gerais, 31270.901, Belo Horizonte, MG, Brazil,
| |
Collapse
|
13
|
Sengupta S, Aggarwal N, Bandhu AV. Two perspectives on the origin of the standard genetic code. ORIGINS LIFE EVOL B 2014; 44:287-91. [PMID: 25585805 DOI: 10.1007/s11084-014-9394-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2014] [Accepted: 09/30/2014] [Indexed: 10/24/2022]
Abstract
The origin of a genetic code made it possible to create ordered sequences of amino acids. In this article we provide two perspectives on code origin by carrying out simulations of code-sequence coevolution in finite populations with the aim of examining how the standard genetic code may have evolved from more primitive code(s) encoding a small number of amino acids. We determine the efficacy of the physico-chemical hypothesis of code origin in the absence and presence of horizontal gene transfer (HGT) by allowing a diverse collection of code-sequence sets to compete with each other. We find that in the absence of horizontal gene transfer, natural selection between competing codes distinguished by differences in the degree of physico-chemical optimization is unable to explain the structure of the standard genetic code. However, for certain probabilities of the horizontal transfer events, a universal code emerges having a structure that is consistent with the standard genetic code.
Collapse
MESH Headings
- Amino Acids/chemistry
- Amino Acids/metabolism
- Amino Acyl-tRNA Synthetases/chemistry
- Amino Acyl-tRNA Synthetases/metabolism
- Codon/chemistry
- Codon/metabolism
- Evolution, Molecular
- Gene Transfer, Horizontal
- Genes
- Genetic Code
- Models, Genetic
- Origin of Life
- Probability
- Protein Biosynthesis
- RNA, Messenger/chemistry
- RNA, Messenger/metabolism
- RNA, Transfer, Amino Acid-Specific/chemistry
- RNA, Transfer, Amino Acid-Specific/metabolism
- Selection, Genetic
Collapse
Affiliation(s)
- Supratim Sengupta
- Department of Physical Sciences, Indian Institute of Science Education and Research Kolkata, Mohanpur, 741246, India,
| | | | | |
Collapse
|