1
|
Tang GQ, Hu H, Douglas J, Carter C. Primordial aminoacyl-tRNA synthetases preferred minihelices to full-length tRNA. Nucleic Acids Res 2024; 52:7096-7111. [PMID: 38783009 PMCID: PMC11229368 DOI: 10.1093/nar/gkae417] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Revised: 04/30/2024] [Accepted: 05/10/2024] [Indexed: 05/25/2024] Open
Abstract
Aminoacyl-tRNA synthetases (AARS) and tRNAs translate the genetic code in all living cells. Little is known about how their molecular ancestors began to enforce the coding rules for the expression of their own genes. Schimmel et al. proposed in 1993 that AARS catalytic domains began by reading an 'operational' code in the acceptor stems of tRNA minihelices. We show here that the enzymology of an AARS urzyme•TΨC-minihelix cognate pair is a rich in vitro realization of that idea. The TΨC-minihelixLeu is a very poor substrate for full-length Leucyl-tRNA synthetase. It is a superior RNA substrate for the corresponding urzyme, LeuAC. LeuAC active-site mutations shift the choice of both amino acid and RNA substrates. AARS urzyme•minihelix cognate pairs are thus small, pliant models for the ancestral decoding hardware. They are thus an ideal platform for detailed experimental study of the operational RNA code.
Collapse
Affiliation(s)
- Guo Qing Tang
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA
| | - Hao Hu
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA
| | - Jordan Douglas
- Department of Physics, The University of Auckland, New Zealand
- Centre for Computational Evolution, University of Auckland, New Zealand
- Department of Computer Science, The University of Auckland, New Zealand
| | - Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA
| |
Collapse
|
2
|
Carter CW. Base Pairing Promoted the Self-Organization of Genetic Coding, Catalysis, and Free-Energy Transduction. Life (Basel) 2024; 14:199. [PMID: 38398709 PMCID: PMC10890426 DOI: 10.3390/life14020199] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2024] [Revised: 01/21/2024] [Accepted: 01/25/2024] [Indexed: 02/25/2024] Open
Abstract
How Nature discovered genetic coding is a largely ignored question, yet the answer is key to explaining the transition from biochemical building blocks to life. Other, related puzzles also fall inside the aegis enclosing the codes themselves. The peptide bond is unstable with respect to hydrolysis. So, it requires some form of chemical free energy to drive it. Amino acid activation and acyl transfer are also slow and must be catalyzed. All living things must thus also convert free energy and synchronize cellular chemistry. Most importantly, functional proteins occupy only small, isolated regions of sequence space. Nature evolved heritable symbolic data processing to seek out and use those sequences. That system has three parts: a memory of how amino acids behave in solution and inside proteins, a set of code keys to access that memory, and a scoring function. The code keys themselves are the genes for cognate pairs of tRNA and aminoacyl-tRNA synthetases, AARSs. The scoring function is the enzymatic specificity constant, kcat/kM, which measures both catalysis and specificity. The work described here deepens the evidence for and understanding of an unexpected consequence of ancestral bidirectional coding. Secondary structures occur in approximately the same places within antiparallel alignments of their gene products. However, the polar amino acids that define the molecular surface of one are reflected into core-defining non-polar side chains on the other. Proteins translated from base-paired coding strands fold up inside out. Bidirectional genes thus project an inverted structural duality into the proteome. I review how experimental data root the scoring functions responsible for the origins of coding and catalyzed activation of unfavorable chemical reactions in that duality.
Collapse
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260, USA
| |
Collapse
|
3
|
Guo X, Geng L, Jiang C, Yao W, Jin J, Liu Z, Mu Y. Multiplexed genome engineering for porcine fetal fibroblasts with gRNA-tRNA arrays based on CRISPR/Cas9. Anim Biotechnol 2023; 34:4703-4712. [PMID: 36946758 DOI: 10.1080/10495398.2023.2187402] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/23/2023]
Abstract
Multiplex gene modifications are highly required for various fields of porcine research. In many species, the CRISPR/Cas9 system has been widely applied for genomic editing and provides a potential tool for introducing multiplex genome mutations simultaneously. Here, we present a CRISPR-Cas9 gRNA-tRNA array (GTR-CRISPR) for multiplexed engineering of porcine fetal fibroblasts (PFFs). We successfully produced multiple sgRNAs using only one Pol III promoter by taking advantage of the endogenous tRNA processing mechanism in porcine cells. Using an all-in-one construct carrying GTR and Cas9, we disrupted the IGFBP3, MSTN, MC4R, and SOCS2 genes in multiple codon regions in one PFF cell simultaneously. This technique allows the simultaneous disruption of four genes with 5.5% efficiency. As a result, this approach may effectively target multiple genes at the same time, making it a powerful tool for establishing multiple genes mutant cells in pigs.
Collapse
Affiliation(s)
- Xiaochen Guo
- Key Laboratory of Animal Cellular and Genetic Engineering of Heilongjiang Province, College of Life Science, Northeast Agricultural University, Harbin, China
| | - Lishuang Geng
- Key Laboratory of Animal Cellular and Genetic Engineering of Heilongjiang Province, College of Life Science, Northeast Agricultural University, Harbin, China
| | - Chaoqian Jiang
- Key Laboratory of Animal Cellular and Genetic Engineering of Heilongjiang Province, College of Life Science, Northeast Agricultural University, Harbin, China
| | - Wang Yao
- Key Laboratory of Animal Cellular and Genetic Engineering of Heilongjiang Province, College of Life Science, Northeast Agricultural University, Harbin, China
| | - Junxue Jin
- Key Laboratory of Animal Cellular and Genetic Engineering of Heilongjiang Province, College of Life Science, Northeast Agricultural University, Harbin, China
| | - Zhonghua Liu
- Key Laboratory of Animal Cellular and Genetic Engineering of Heilongjiang Province, College of Life Science, Northeast Agricultural University, Harbin, China
| | - Yanshuang Mu
- Key Laboratory of Animal Cellular and Genetic Engineering of Heilongjiang Province, College of Life Science, Northeast Agricultural University, Harbin, China
| |
Collapse
|
4
|
Štambuk N, Konjevoda P, Štambuk A. How ambiguity codes specify molecular descriptors and information flow in Code Biology. Biosystems 2023; 233:105034. [PMID: 37739308 DOI: 10.1016/j.biosystems.2023.105034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Revised: 09/12/2023] [Accepted: 09/12/2023] [Indexed: 09/24/2023]
Abstract
The article presents IUPAC ambiguity codes for incomplete nucleic acid specification, and their use in Code Biology. It is shown how to use this nomenclature in order to extract accurate information on different properties of the biological systems. We investigated the use of ambiguity codes, as mathematical and logical operators and truth table elements, for the encoding of amino acids by means of the Standard Genetic Code. It is explained how to use ambiguity codes and truth functions in order to obtain accurate information on different properties of the biological systems. Nucleotide ambiguity codes could be applied to: 1. encoding descriptive information of nucleotides, amino acids and proteins (e.g., of polarity, relative solvent accessibility, atom depth, etc.), and 2. system modelling ranging from standard bioinformatics tools to classic evolutionary models (i.e. from Miyazawa-Jernigan statistical potential to Kimura three-substitution-type model, respectively). It is shown that the algorithms based on IUPAC ambiguity codes, Boolean functions and truth table, Probabilistic Square of Opposition/Semiotic Square and Klein 4-groups-could be used for the bioinformatics analyses and Relational data modelling in natural science. Underlying mathematical, logical and semiotic concepts of interest are presented and addressed.
Collapse
Affiliation(s)
- Nikola Štambuk
- Centre for Nuclear Magnetic Resonance, Ruđer Bošković Institute, Bijenička cesta 54, HR-10000, Zagreb, Croatia.
| | - Paško Konjevoda
- Laboratory for Epigenomics, Division of Molecular Medicine, Ruđer Bošković Institute, Bijenička cesta 54, HR-10000, Zagreb, Croatia.
| | - Albert Štambuk
- Faculty of Kinesiology, University of Zagreb, Horvaćanski zavoj 15, HR-10000 Zagreb, Croatia
| |
Collapse
|
5
|
Schwander L, Brabender M, Mrnjavac N, Wimmer JLE, Preiner M, Martin WF. Serpentinization as the source of energy, electrons, organics, catalysts, nutrients and pH gradients for the origin of LUCA and life. Front Microbiol 2023; 14:1257597. [PMID: 37854333 PMCID: PMC10581274 DOI: 10.3389/fmicb.2023.1257597] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Accepted: 09/04/2023] [Indexed: 10/20/2023] Open
Abstract
Serpentinization in hydrothermal vents is central to some autotrophic theories for the origin of life because it generates compartments, reductants, catalysts and gradients. During the process of serpentinization, water circulates through hydrothermal systems in the crust where it oxidizes Fe (II) in ultramafic minerals to generate Fe (III) minerals and H2. Molecular hydrogen can, in turn, serve as a freely diffusible source of electrons for the reduction of CO2 to organic compounds, provided that suitable catalysts are present. Using catalysts that are naturally synthesized in hydrothermal vents during serpentinization H2 reduces CO2 to formate, acetate, pyruvate, and methane. These compounds represent the backbone of microbial carbon and energy metabolism in acetogens and methanogens, strictly anaerobic chemolithoautotrophs that use the acetyl-CoA pathway of CO2 fixation and that inhabit serpentinizing environments today. Serpentinization generates reduced carbon, nitrogen and - as newer findings suggest - reduced phosphorous compounds that were likely conducive to the origins process. In addition, it gives rise to inorganic microcompartments and proton gradients of the right polarity and of sufficient magnitude to support chemiosmotic ATP synthesis by the rotor-stator ATP synthase. This would help to explain why the principle of chemiosmotic energy harnessing is more conserved (older) than the machinery to generate ion gradients via pumping coupled to exergonic chemical reactions, which in the case of acetogens and methanogens involve H2-dependent CO2 reduction. Serpentinizing systems exist in terrestrial and deep ocean environments. On the early Earth they were probably more abundant than today. There is evidence that serpentinization once occurred on Mars and is likely still occurring on Saturn's icy moon Enceladus, providing a perspective on serpentinization as a source of reductants, catalysts and chemical disequilibrium for life on other worlds.
Collapse
Affiliation(s)
- Loraine Schwander
- Institute of Molecular Evolution, Biology Department, Math. -Nat. Faculty, Heinrich-Heine-Universität, Düsseldorf, Germany
| | - Max Brabender
- Institute of Molecular Evolution, Biology Department, Math. -Nat. Faculty, Heinrich-Heine-Universität, Düsseldorf, Germany
| | - Natalia Mrnjavac
- Institute of Molecular Evolution, Biology Department, Math. -Nat. Faculty, Heinrich-Heine-Universität, Düsseldorf, Germany
| | - Jessica L. E. Wimmer
- Institute of Molecular Evolution, Biology Department, Math. -Nat. Faculty, Heinrich-Heine-Universität, Düsseldorf, Germany
| | - Martina Preiner
- Microcosm Earth Center, Max Planck Institute for Terrestrial Microbiology and Philipps-Universität, Marburg, Germany
| | - William F. Martin
- Institute of Molecular Evolution, Biology Department, Math. -Nat. Faculty, Heinrich-Heine-Universität, Düsseldorf, Germany
| |
Collapse
|
6
|
Zolyan S. On the minimal elements of the genetic code and their semiotic functions (degeneracy, complementarity, wobbling). Biosystems 2023; 231:104962. [PMID: 37437772 DOI: 10.1016/j.biosystems.2023.104962] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Revised: 06/17/2023] [Accepted: 06/19/2023] [Indexed: 07/14/2023]
Abstract
We address semiotic features of genetic coding, primarily the mechanisms for distinguishing between triplets. It implies that the minimal elements that allow codon recognition and amino acid coding should be identified. Half a century ago, linguist Roman Jakobson and microbiologist François Jacob revealed functional similarities between nucleotides in the genetic code and phonemes in natural language. Developing this analogy, we introduce the concept of a semiotic nucleotide. Unlike "material" nucleotides, its characteristics are limited to the function of differentiation within the processing of genetic information. We demonstrate that, similarly to phonemes, nucleotides are also non-elementary entities and can be represented as a set of two differential features: a) the number of rings and b) the number of hydrogen bonds. This makes it possible to convert semiotic nucleotides into double-byte units of digital information. Proceeding from this assumption, we suggest a new vision of such phenomena as the heterogeneity of the genetic code in terms of coding types, to reveal the code-distinguishing potential of positions within triplets, and represent wobbling as a specific reading regime. All these phenomena relate to the peculiarity of the triplet's third position, where complete or partial neutralization of distinguishing features is possible.
Collapse
Affiliation(s)
- Suren Zolyan
- Immanuel Kant Baltic Federal University, Kaliningrad, Russia; Institute of Scientific Information on Social Sciences of the Russian Academy of Sciences, Moscow, Russia.
| |
Collapse
|
7
|
José MV, Bobadilla JR, Zamudio GS, de Farías ST. Symmetrical distributions of aminoacyl-tRNA synthetases during the evolution of the genetic code. Theory Biosci 2023; 142:211-219. [PMID: 37402895 PMCID: PMC10423125 DOI: 10.1007/s12064-023-00394-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Accepted: 06/10/2023] [Indexed: 07/06/2023]
Abstract
In this work, we formulate the following question: How the distribution of aminoacyl-tRNA synthetases (aaRSs) went from an ancestral bidirectional gene (mirror symmetry) to the symmetrical distribution of aaRSs in a six-dimensional hypercube of the Standard Genetic Code (SGC)? We assume a primeval RNY code, two Extended Genetic RNA codes type 1 and 2, and the SGC. We outline the types of symmetries of the distribution of aaRSs in each code. The symmetry groups of aaRSs in each code are described, until the symmetries of the SGC display a mirror symmetry. Considering both Extended RNA codes the 20 aaRSs were already present before the Last Universal Ancestor. These findings reveal intricacies in the diversification of aaRSs accompanied by the evolution of the genetic code.
Collapse
Affiliation(s)
- Marco V José
- Theoretical Biology Group, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, CP 04510, Mexico City, Mexico.
| | - Juan R Bobadilla
- Theoretical Biology Group, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, CP 04510, Mexico City, Mexico
| | - Gabriel S Zamudio
- Theoretical Biology Group, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, CP 04510, Mexico City, Mexico
| | - Sávio Torres de Farías
- Laboratório de Genética Evolutiva Paulo Leminsk, Departamento de Biologia Molecular, Universidade Federal da Paraíba, João Pessoa, Paraíba, Brazil
| |
Collapse
|
8
|
Wills PR. Origins of Genetic Coding: Self-Guided Molecular Self-Organisation. ENTROPY (BASEL, SWITZERLAND) 2023; 25:1281. [PMID: 37761580 PMCID: PMC10527755 DOI: 10.3390/e25091281] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Revised: 08/22/2023] [Accepted: 08/28/2023] [Indexed: 09/29/2023]
Abstract
The origin of genetic coding is characterised as an event of cosmic significance in which quantum mechanical causation was transcended by constructive computation. Computational causation entered the physico-chemical processes of the pre-biotic world by the incidental satisfaction of a condition of reflexivity between polymer sequence information and system elements able to facilitate their own production through translation of that information. This event, which has previously been modelled in the dynamics of Gene-Replication-Translation systems, is properly described as a process of self-guided self-organisation. The spontaneous emergence of a primordial genetic code between two-letter alphabets of nucleotide triplets and amino acids is easily possible, starting with random peptide synthesis that is RNA-sequence-dependent. The evident self-organising mechanism is the simultaneous quasi-species bifurcation of the populations of information-carrying genes and enzymes with aminoacyl-tRNA synthetase-like activities. This mechanism allowed the code to evolve very rapidly to the ~20 amino acid limit apparent for the reflexive differentiation of amino acid properties using protein catalysts. The self-organisation of semantics in this domain of physical chemistry conferred on emergent molecular biology exquisite computational control over the nanoscopic events needed for its self-construction.
Collapse
Affiliation(s)
- Peter R Wills
- Department of Physics, University of Auckland, Auckland PB 92019, New Zealand
| |
Collapse
|
9
|
Zagrovic B, Adlhart M, Kapral TH. Coding From Binding? Molecular Interactions at the Heart of Translation. Annu Rev Biophys 2023; 52:69-89. [PMID: 36626765 DOI: 10.1146/annurev-biophys-090622-102329] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
The mechanism and the evolution of DNA replication and transcription, the key elements of the central dogma of biology, are fundamentally well explained by the physicochemical complementarity between strands of nucleic acids. However, the determinants that have shaped the third part of the dogma-the process of biological translation and the universal genetic code-remain unclear. We review and seek parallels between different proposals that view the evolution of translation through the prism of weak, noncovalent interactions between biological macromolecules. In particular, we focus on a recent proposal that there exists a hitherto unrecognized complementarity at the heart of biology, that between messenger RNA coding regions and the proteins that they encode, especially if the two are unstructured. Reflecting the idea that the genetic code evolved from intrinsic binding propensities between nucleotides and amino acids, this proposal promises to forge a link between the distant past and the present of biological systems.
Collapse
Affiliation(s)
- Bojan Zagrovic
- Department of Structural and Computational Biology, Max Perutz Labs & University of Vienna, Vienna, Austria;
| | - Marlene Adlhart
- Department of Structural and Computational Biology, Max Perutz Labs & University of Vienna, Vienna, Austria;
| | - Thomas H Kapral
- Department of Structural and Computational Biology, Max Perutz Labs & University of Vienna, Vienna, Austria;
- Vienna BioCenter PhD Program, Doctoral School of the University of Vienna and Medical University of Vienna, Vienna, Austria
| |
Collapse
|
10
|
Guo X, Su M. The Origin of Translation: Bridging the Nucleotides and Peptides. Int J Mol Sci 2022; 24:ijms24010197. [PMID: 36613641 PMCID: PMC9820756 DOI: 10.3390/ijms24010197] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Revised: 12/09/2022] [Accepted: 12/15/2022] [Indexed: 12/24/2022] Open
Abstract
Extant biology uses RNA to record genetic information and proteins to execute biochemical functions. Nucleotides are translated into amino acids via transfer RNA in the central dogma. tRNA is essential in translation as it connects the codon and the cognate amino acid. To reveal how the translation emerged in the prebiotic context, we start with the structure and dissection of tRNA, followed by the theory and hypothesis of tRNA and amino acid recognition. Last, we review how amino acids assemble on the tRNA and further form peptides. Understanding the origin of life will also promote our knowledge of artificial living systems.
Collapse
Affiliation(s)
- Xuyuan Guo
- School of Genetics and Microbiology, Trinity College Dublin, The University of Dublin, College Green, Dublin 2, D02 PN40 Dublin, Ireland
| | - Meng Su
- MRC Laboratory of Molecular Biology, Cambridge CB2 0QH, UK
- Correspondence:
| |
Collapse
|
11
|
A Leucyl-tRNA Synthetase Urzyme: Authenticity of tRNA Synthetase Catalytic Activities and Promiscuous Phosphorylation of Leucyl-5'AMP. Int J Mol Sci 2022; 23:ijms23084229. [PMID: 35457045 PMCID: PMC9026127 DOI: 10.3390/ijms23084229] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2022] [Revised: 03/30/2022] [Accepted: 03/31/2022] [Indexed: 02/05/2023] Open
Abstract
Aminoacyl-tRNA synthetase (aaRS)/tRNA cognate pairs translate the genetic code by synthesizing specific aminoacyl-tRNAs that are assembled on messenger RNA by the ribosome. Deconstruction of the two distinct aaRS superfamilies (Classes) has provided conceptual and experimental models for their early evolution. Urzymes, containing ~120–130 amino acids excerpted from regions where genetic coding sequence complementarities have been identified, are key experimental models motivated by the proposal of a single bidirectional ancestral gene. Previous reports that Class I and Class II urzymes accelerate both amino acid activation and tRNA aminoacylation have not been extended to other synthetases. We describe a third urzyme (LeuAC) prepared from the Class IA Pyrococcus horikoshii leucyl-tRNA synthetase. We adduce multiple lines of evidence for the authenticity of its catalysis of both canonical reactions, amino acid activation and tRNALeu aminoacylation. Mutation of the three active-site lysine residues to alanine causes significant, but modest reduction in both amino acid activation and aminoacylation. LeuAC also catalyzes production of ADP, a non-canonical enzymatic function that has been overlooked since it first was described for several full-length aaRS in the 1970s. Structural data suggest that the LeuAC active site accommodates two ATP conformations that are prominent in water but rarely seen bound to proteins, accounting for successive, in situ phosphorylation of the bound leucyl-5′AMP phosphate, accounting for ADP production. This unusual ATP consumption regenerates the transition state for amino acid activation and suggests, in turn, that in the absence of the editing and anticodon-binding domains, LeuAC releases leu-5′AMP unusually slowly, relative to the two phosphorylation reactions.
Collapse
|
12
|
Konjevoda P, Štambuk N. Relational model of the standard genetic code. Biosystems 2021; 210:104529. [PMID: 34464669 DOI: 10.1016/j.biosystems.2021.104529] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2021] [Revised: 08/26/2021] [Accepted: 08/27/2021] [Indexed: 11/28/2022]
Abstract
The genetic code is a set of rules that establishes mapping between triplets in messenger RNA and amino acids in proteins. The most common way to display these rules is the Standard Genetic Code (SGC) table. This paper takes an alternative approach, based on the relational data model by Edgar F. Codd (Commun. ACM, 13:377-387, 1970). The relational model (RM) proposes a distributed storage of data into a collection of tables (called relations), that can be connected by shared communality. Basic elements of the table are rows (called records or tuples), and columns (called fields or attributes). The SGC table, according to the relational data model, represents the so called unnormalized form of a table. Using normalization rules it is possible to subdivide the SGC table into four tables. The rows and columns of single tables are defined by the first and second base and individual tables by the third codon base. The result of this model is an approach to managing genetic code data, represented in terms of tuples and grouped into relations, with table structure and language consistent with first-order (predicate) logic. The RM explains that the final step in the development of the SGC was the adoption of coding function by the third base, which makes an informational/functional unit with the first base, despite the different physical location in a triplet. This enabled the synthesis of specific proteins without ambiguity, in accordance with the concept of ambiguity reduction and five phases of the general model on the origin of biological codes by Marcello Barbieri (BioSystems 181:11-19, 2019).
Collapse
Affiliation(s)
- Paško Konjevoda
- Laboratory for Epigenomics, Division of Molecular Medicine, Ruđer Bošković Institute, Bijenička cesta 54, HR-10000 Zagreb, Croatia.
| | - Nikola Štambuk
- Center for Nuclear Magnetic Resonance, Ruđer Bošković Institute, Bijenička cesta 54, HR-10000 Zagreb, Croatia.
| |
Collapse
|
13
|
Abstract
Codon-dependent translation underlies genetics and phylogenetic inferences, but its origins pose two challenges. Prevailing narratives cannot account for the fact that aminoacyl-tRNA synthetases (aaRSs), which translate the genetic code, must collectively enforce the rules used to assemble themselves. Nor can they explain how specific assignments arose from rudimentary differentiation between ancestral aaRSs and corresponding transfer RNAs (tRNAs). Experimental deconstruction of the two aaRS superfamilies created new experimental tools with which to analyze the emergence of the code. Amino acid and tRNA substrate recognition are linked to phase transfer free energies of amino acids and arise largely from aaRS class-specific differences in secondary structure. Sensitivity to protein folding rules endowed ancestral aaRS-tRNA pairs with the feedback necessary to rapidly compare alternative genetic codes and coding sequences. These and other experimental data suggest that the aaRS bidirectional genetic ancestry stabilized the differentiation and interdependence required to initiate and elaborate the genetic coding table.
Collapse
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7260, USA;
| | - Peter R Wills
- Department of Physics, University of Auckland, Auckland 1142, New Zealand
| |
Collapse
|
14
|
Carter CW, Wills PR. Reciprocally-Coupled Gating: Strange Loops in Bioenergetics, Genetics, and Catalysis. Biomolecules 2021; 11:265. [PMID: 33670192 PMCID: PMC7916928 DOI: 10.3390/biom11020265] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2020] [Revised: 02/04/2021] [Accepted: 02/06/2021] [Indexed: 12/12/2022] Open
Abstract
Bioenergetics, genetic coding, and catalysis are all difficult to imagine emerging without pre-existing historical context. That context is often posed as a "Chicken and Egg" problem; its resolution is concisely described by de Grasse Tyson: "The egg was laid by a bird that was not a chicken". The concision and generality of that answer furnish no details-only an appropriate framework from which to examine detailed paradigms that might illuminate paradoxes underlying these three life-defining biomolecular processes. We examine experimental aspects here of five examples that all conform to the same paradigm. In each example, a paradox is resolved by coupling "if, and only if" conditions for reciprocal transitions between levels, such that the consequent of the first test is the antecedent for the second. Each condition thus restricts fluxes through, or "gates" the other. Reciprocally-coupled gating, in which two gated processes constrain one another, is self-referential, hence maps onto the formal structure of "strange loops". That mapping uncovers two different kinds of forces that may help unite the axioms underlying three phenomena that distinguish biology from chemistry. As a physical analog for Gödel's logic, biomolecular strange-loops provide a natural metaphor around which to organize a large body of experimental data, linking biology to information, free energy, and the second law of thermodynamics.
Collapse
Affiliation(s)
- Charles W. Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260, USA
| | - Peter R. Wills
- Department of Physics and Te Ao Marama Centre for Fundamental Inquiry, University of Auckland, PB 92019, Auckland 1142, New Zealand;
| |
Collapse
|
15
|
Abstract
What were the physico-chemical forces that drove the origins of life? We discuss four major prebiotic 'discoveries': persistent sampling of chemical reaction space; sequence-encodable foldable catalysts; assembly of functional pathways; and encapsulation and heritability. We describe how a 'proteins-first' world gives plausible mechanisms. We note the importance of hydrophobic and polar compositions of matter in these advances.
Collapse
Affiliation(s)
- K. A. Dill
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY, USA
- Department of Chemistry, Stony Brook University, Stony Brook, NY, USA
- Department Physics and Astronomy, Stony Brook University, Stony Brook, NY, USA
| | - L. Agozzino
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY, USA
| |
Collapse
|
16
|
Kaiser F, Krautwurst S, Salentin S, Haupt VJ, Leberecht C, Bittrich S, Labudde D, Schroeder M. The structural basis of the genetic code: amino acid recognition by aminoacyl-tRNA synthetases. Sci Rep 2020; 10:12647. [PMID: 32724042 PMCID: PMC7387524 DOI: 10.1038/s41598-020-69100-0] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2020] [Accepted: 07/06/2020] [Indexed: 12/29/2022] Open
Abstract
Storage and directed transfer of information is the key requirement for the development of life. Yet any information stored on our genes is useless without its correct interpretation. The genetic code defines the rule set to decode this information. Aminoacyl-tRNA synthetases are at the heart of this process. We extensively characterize how these enzymes distinguish all natural amino acids based on the computational analysis of crystallographic structure data. The results of this meta-analysis show that the correct read-out of genetic information is a delicate interplay between the composition of the binding site, non-covalent interactions, error correction mechanisms, and steric effects.
Collapse
Affiliation(s)
- Florian Kaiser
- Biotechnology Center (BIOTEC), TU Dresden, 01307, Dresden, Germany. .,PharmAI GmbH, Tatzberg 47, 01307, Dresden, Germany.
| | - Sarah Krautwurst
- University of Applied Sciences Mittweida, 09648, Mittweida, Germany
| | | | - V Joachim Haupt
- Biotechnology Center (BIOTEC), TU Dresden, 01307, Dresden, Germany.,PharmAI GmbH, Tatzberg 47, 01307, Dresden, Germany
| | | | | | - Dirk Labudde
- University of Applied Sciences Mittweida, 09648, Mittweida, Germany
| | | |
Collapse
|
17
|
Determining amino acid scores of the genetic code table: Complementarity, structure, function and evolution. Biosystems 2020; 187:104026. [DOI: 10.1016/j.biosystems.2019.104026] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2019] [Accepted: 08/28/2019] [Indexed: 11/22/2022]
|
18
|
Buckle AM, Buckle M. Ribosome Evolution and Structural Capacitance. Front Mol Biosci 2019; 6:123. [PMID: 31803754 PMCID: PMC6872460 DOI: 10.3389/fmolb.2019.00123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2019] [Accepted: 10/24/2019] [Indexed: 11/13/2022] Open
Abstract
In addition to the canonical loss-of-function mutations, mutations in proteins may additionally result in gain-of-function through the binary activation of cryptic "structural capacitance elements." Our previous bioinformatic analysis allowed us to propose a new mechanism of protein evolution - structural capacitance - that arises via the generation of new elements of microstructure upon mutations that cause a disorder-to-order (D→O) transition in previously disordered regions of proteins. Here we propose that the D→O transition is a necessary follow-on from expected early codon-anticodon and tRNA acceptor stem-amino acid usage, via the accumulation of structural capacitance elements - reservoirs of disorder in proteins. We develop this argument further to posit that structural capacitance is an inherent consequence of the evolution of the genetic code.
Collapse
Affiliation(s)
- Ashley M Buckle
- Department of Biochemistry and Molecular Biology, Biomedicine Discovery Institute, Monash University, Clayton, VIC, Australia
| | - Malcolm Buckle
- LBPA, ENS Paris-Saclay, CNRS, Université Paris-Saclay, Cachan, France
| |
Collapse
|
19
|
The Ancient Operational Code is Embedded in the Amino Acid Substitution Matrix and aaRS Phylogenies. J Mol Evol 2019; 88:136-150. [PMID: 31781936 DOI: 10.1007/s00239-019-09918-z] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2019] [Accepted: 11/14/2019] [Indexed: 10/25/2022]
Abstract
The underlying structure of the canonical amino acid substitution matrix (aaSM) is examined by considering stepwise improvements in the differential recognition of amino acids according to their chemical properties during the branching history of the two aminoacyl-tRNA synthetase (aaRS) superfamilies. The evolutionary expansion of the genetic code is described by a simple parameterization of the aaSM, in which (i) the number of distinguishable amino acid types, (ii) the matrix dimension and (iii) the number of parameters, each increases by one for each bifurcation in an aaRS phylogeny. Parameterized matrices corresponding to trees in which the size of an amino acid sidechain is the only discernible property behind its categorization as a substrate, exclusively for a Class I or II aaRS, provide a significantly better fit to empirically determined aaSM than trees with random bifurcation patterns. A second split between polar and nonpolar amino acids in each Class effects a vastly greater further improvement. The earliest Class-separated epochs in the phylogenies of the aaRS reflect these enzymes' capability to distinguish tRNAs through the recognition of acceptor stem identity elements via the minor (Class I) and major (Class II) helical grooves, which is how the ancient operational code functioned. The advent of tRNA recognition using the anticodon loop supports the evolution of the optimal map of amino acid chemistry found in the later genetic code, an essentially digital categorization, in which polarity is the major functional property, compensating for the unrefined, haphazard differentiation of amino acids achieved by the operational code.
Collapse
|
20
|
Reflexivity, coding and quantum biology. Biosystems 2019; 185:104027. [PMID: 31494127 DOI: 10.1016/j.biosystems.2019.104027] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2019] [Revised: 08/29/2019] [Accepted: 08/31/2019] [Indexed: 12/31/2022]
Abstract
Biological systems are fundamentally computational in that they process information in an apparently purposeful fashion rather than just transferring bits of it in a purely syntactical manner. Biological information, such has genetic information stored in DNA sequences, has semantic content. It carries meaning that is defined by the molecular context of its cellular environment. Information processing in biological systems displays an inherent reflexivity, a tendency for the computational information-processing to be "about" the behaviour of the molecules that participate in the computational process. This is most evident in the operation of the genetic code, where the specificity of the reactions catalysed by the aminoacyl-tRNA synthetase (aaRS) enzymes is required to be self-sustaining. A cell's suite of aaRS enzymes completes a reflexively autocatalytic set of molecular components capable of making themselves through the operation of the code. This set requires the existence of a body of reflexive information to be stored in an organism's genome. The genetic code is a reflexively self-organised mapping of the chemical properties of amino acid sidechains onto codon "tokens". It is a highly evolved symbolic system of chemical self-description. Although molecular biological coding is generally portrayed in terms of classical bit-transfer events, various biochemical events explicitly require quantum coherence for their occurrence. Whether the implicit transfer of quantum information, qbits, is indicative of wide-ranging quantum computation in living systems is currently the subject of extensive investigation and speculation in the field of Quantum Biology.
Collapse
|
21
|
Carter CW, Wills PR. Experimental solutions to problems defining the origin of codon-directed protein synthesis. Biosystems 2019; 183:103979. [PMID: 31176803 PMCID: PMC6693952 DOI: 10.1016/j.biosystems.2019.103979] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2019] [Revised: 05/27/2019] [Accepted: 05/29/2019] [Indexed: 12/13/2022]
Abstract
How genetic coding differentiated biology from chemistry is a long-standing challenge in Biology, for which there have been few experimental approaches, despite a wide-ranging speculative literature. We summarize five coordinated areas-experimental characterization of functional approximations to the minimal peptides (protozymes and urzymes) necessary to activate amino acids and acylate tRNA; showing that specificities of these experimental models match those expected from the synthetase Class division; population of disjoint regions of amino acid sequence space via bidirectional coding ancestry of the two synthetase Classes; showing that the phase transfer equilibria of amino acid side chains that form a two-dimensional basis set for protein folding are embedded in patterns of bases in the tRNA acceptor stem and anticodon; and identification of molecular signatures of ancestral synthetases and tRNAs necessary to define the earliest cognate synthetase:tRNA pairs-that now compose an extensive experimentally testable paradigm for progress toward understanding the coordinated emergence of the codon table and viable mRNA coding sequences. We briefly discuss recent progress toward identifying the remaining outstanding questions-the nature of the earliest amino acid alphabets and the origin of binding discrimination via distinct amino acid sequence-independent protein secondary structures-and how these, too, might be addressed experimentally.
Collapse
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260, United States
| | - Peter R Wills
- Department of Physics and Te Ao Marama Centre for Fundamental Inquiry, University of Auckland, PB 92019, Auckland 1142, New Zealand
| |
Collapse
|
22
|
Carter CW, Wills PR. Hierarchical groove discrimination by Class I and II aminoacyl-tRNA synthetases reveals a palimpsest of the operational RNA code in the tRNA acceptor-stem bases. Nucleic Acids Res 2019; 46:9667-9683. [PMID: 30016476 PMCID: PMC6182185 DOI: 10.1093/nar/gky600] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2018] [Accepted: 07/12/2018] [Indexed: 01/01/2023] Open
Abstract
Class I and II aaRS recognition of opposite grooves was likely among the earliest determinants fixed in the tRNA acceptor stem bases. A new regression model identifies those determinants in bacterial tRNAs. Integral coefficients relate digital dependent to independent variables with perfect agreement between observed and calculated grooves for all twenty isoaccepting tRNAs. Recognition is mediated by the Discriminator base 73, the first base pair, and base 2 of the acceptor stem. Subsets of these coefficients also identically compute grooves recognized by smaller numbers of aaRS. Thus, the model is hierarchical, suggesting that new rules were added to pre-existing ones as new amino acids joined the coding alphabet. A thermodynamic rationale for the simplest model implies that Class-dependent aaRS secondary structures exploited differential tendencies of the acceptor stem to form the hairpin observed in Class I aaRS•tRNA complexes, enabling the earliest groove discrimination. Curiously, groove recognition also depends explicitly on the identity of base 2 in a manner consistent with the middle bases of the codon table, confirming a hidden ancestry of codon-anticodon pairing in the acceptor stem. That, and the lack of correlation with anticodon bases support prior productive coding interaction of tRNA minihelices with proto-mRNA.
Collapse
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260, USA
| | - Peter R Wills
- Department of Physics, Centre for Computational Evolution, and Te Ao Marama Centre for Fundamental Enquiry, University of Auckland, PB 92109, Auckland 1142, New Zealand
| |
Collapse
|
23
|
Carter CW, Wills PR. Class I and II aminoacyl-tRNA synthetase tRNA groove discrimination created the first synthetase-tRNA cognate pairs and was therefore essential to the origin of genetic coding. IUBMB Life 2019; 71:1088-1098. [PMID: 31190358 DOI: 10.1002/iub.2094] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2019] [Revised: 04/14/2019] [Accepted: 04/15/2019] [Indexed: 12/20/2022]
Abstract
The genetic code likely arose when a bidirectional gene replicating as a quasi-species began to produce ancestral aminoacyl-tRNA synthetases (aaRS) capable of distinguishing between two distinct sets of amino acids. The synthetase class division therefore necessarily implies a mechanism by which the two ancestral synthetases could also discriminate between two different kinds of tRNA substrates. We used regression methods to uncover the possible patterns of base sequences capable of such discrimination and find that they appear to be related to thermodynamic differences in the relative stabilities of a hairpin necessary for recognition of tRNA substrates by Class I aaRS. The thermodynamic differences appear to be exploited by secondary structural differences between models for the ancestral aaRS called synthetase Urzymes and reinforced by packing of aromatic amino acid side chains against the nonpolar face of the ribose of A76 if and only if the tRNA CCA sequence forms a hairpin. The patterns of bases 1, 2, and 73 and stabilization of the hairpin by structural complementarity with Class I, but not Class II, aaRS Urzymes appear to be necessary and sufficient to have enabled the generation of the first two aaRS-tRNA cognate pairs, and the launch of a rudimentary binary genetic coding related recognizably to contemporary cognate pairs. As a consequence, it seems likely that nonrandom aminoacylation of tRNAs preceded the advent of the tRNA anticodon stem-loop. Consistent with this suggestion, coding rules in the acceptor-stem bases also reveal a palimpsest of the codon-anticodon interaction, as previously proposed. © 2019 IUBMB Life, 2019 © 2019 IUBMB Life, 71(8):1088-1098, 2019.
Collapse
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Peter R Wills
- Department of Physics and Te Ao Marama Centre for Fundamental Inquiry, University of Auckland, Auckland, New Zealand
| |
Collapse
|
24
|
Schaffer AE, Pinkard O, Coller JM. tRNA Metabolism and Neurodevelopmental Disorders. Annu Rev Genomics Hum Genet 2019; 20:359-387. [PMID: 31082281 DOI: 10.1146/annurev-genom-083118-015334] [Citation(s) in RCA: 57] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
tRNAs are short noncoding RNAs required for protein translation. The human genome includes more than 600 putative tRNA genes, many of which are considered redundant. tRNA transcripts are subject to tightly controlled, multistep maturation processes that lead to the removal of flanking sequences and the addition of nontemplated nucleotides. Furthermore, tRNAs are highly structured and posttranscriptionally modified. Together, these unique features have impeded the adoption of modern genomics and transcriptomics technologies for tRNA studies. Nevertheless, it has become apparent from human neurogenetic research that many tRNA biogenesis proteins cause brain abnormalities and other neurological disorders when mutated. The cerebral cortex, cerebellum, and peripheral nervous system show defects, impairment, and degeneration upon tRNA misregulation, suggesting that they are particularly sensitive to changes in tRNA expression or function. An integrated approach to identify tRNA species and contextually characterize tRNA function will be imperative to drive future tool development and novel therapeutic design for tRNA-associated disorders.
Collapse
Affiliation(s)
- Ashleigh E Schaffer
- Department of Genetics and Genome Sciences and Center for RNA Science and Therapeutics, Case Western Reserve University, Cleveland, Ohio 44106, USA;
| | - Otis Pinkard
- Department of Genetics and Genome Sciences and Center for RNA Science and Therapeutics, Case Western Reserve University, Cleveland, Ohio 44106, USA;
| | - Jeffery M Coller
- Department of Genetics and Genome Sciences and Center for RNA Science and Therapeutics, Case Western Reserve University, Cleveland, Ohio 44106, USA;
| |
Collapse
|
25
|
Bittrich S, Schroeder M, Labudde D. Characterizing the relation of functional and Early Folding Residues in protein structures using the example of aminoacyl-tRNA synthetases. PLoS One 2018; 13:e0206369. [PMID: 30376559 PMCID: PMC6207335 DOI: 10.1371/journal.pone.0206369] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2018] [Accepted: 10/11/2018] [Indexed: 01/10/2023] Open
Abstract
Proteins are chains of amino acids which adopt a three-dimensional structure and are then able to catalyze chemical reactions or propagate signals in organisms. Without external influence, many proteins fold into their native structure, and a small number of Early Folding Residues (EFR) have previously been shown to initiate the formation of secondary structure elements and guide their respective assembly. Using the two diverse superfamilies of aminoacyl-tRNA synthetases (aaRS), it is shown that the position of EFR is preserved over the course of evolution even when the corresponding sequence conservation is small. Folding initiation sites are positioned in the center of secondary structure elements, independent of aaRS class. In class I, the predicted position of EFR resembles an ancient structural packing motif present in many seemingly unrelated proteins. Furthermore, it is shown that EFR and functionally relevant residues in aaRS are almost entirely disjoint sets of residues. The Start2Fold database is used to investigate whether this separation of EFR and functional residues can be observed for other proteins. EFR are found to constitute crucial connectors of protein regions which are distant at sequence level. Especially, these residues exhibit a high number of non-covalent residue-residue contacts such as hydrogen bonds and hydrophobic interactions. This tendency also manifests as energetically stable local regions, as substantiated by a knowledge-based potential. Despite profound differences regarding how EFR and functional residues are embedded in protein structures, a strict separation of structurally and functionally relevant residues cannot be observed for a more general collection of proteins.
Collapse
Affiliation(s)
- Sebastian Bittrich
- Applied Computer Sciences & Biosciences, University of Applied Sciences Mittweida, Mittweida, Saxony, Germany
- Biotechnology Center (BIOTEC), Technische Universität Dresden, Dresden, Saxony, Germany
| | - Michael Schroeder
- Biotechnology Center (BIOTEC), Technische Universität Dresden, Dresden, Saxony, Germany
| | - Dirk Labudde
- Applied Computer Sciences & Biosciences, University of Applied Sciences Mittweida, Mittweida, Saxony, Germany
| |
Collapse
|
26
|
Vitas M, Dobovišek A. In the Beginning was a Mutualism - On the Origin of Translation. ORIGINS LIFE EVOL B 2018; 48:223-243. [PMID: 29713988 DOI: 10.1007/s11084-018-9557-6] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2017] [Accepted: 04/23/2018] [Indexed: 12/28/2022]
Abstract
The origin of translation is critical for understanding the evolution of life, including the origins of life. The canonical genetic code is one of the most dominant aspects of life on this planet, while the origin of heredity is one of the key evolutionary transitions in living world. Why the translation apparatus evolved is one of the enduring mysteries of molecular biology. Assuming the hypothesis, that during the emergence of life evolution had to first involve autocatalytic systems which only subsequently acquired the capacity of genetic heredity, we propose and discuss possible mechanisms, basic aspects of the emergence and subsequent molecular evolution of translation and ribosomes, as well as enzymes as we know them today. It is possible, in this sense, to view the ribosome as a digital-to-analogue information converter. The proposed mechanism is based on the abilities and tendencies of short RNA and polypeptides to fold and to catalyse biochemical reactions. The proposed mechanism is in concordance with the hypothesis of a possible chemical co-evolution of RNA and proteins in the origin of the genetic code or even more generally at the early evolution of life on Earth. The possible abundance and availability of monomers at prebiotic conditions are considered in the mechanism. The hypothesis that early polypeptides were folding on the RNA scaffold is also considered and mutualism in molecular evolutionary development of RNA and peptides is favoured.
Collapse
Affiliation(s)
- Marko Vitas
- , Laze pri Borovnici 38, Borovnica, Slovenia.
| | - Andrej Dobovišek
- Faculty of Natural Sciences and Mathematics, University of Maribor, Koroška cesta 160, 2000, Maribor, Slovenia
| |
Collapse
|
27
|
Kaiser F, Bittrich S, Salentin S, Leberecht C, Haupt VJ, Krautwurst S, Schroeder M, Labudde D. Backbone Brackets and Arginine Tweezers delineate Class I and Class II aminoacyl tRNA synthetases. PLoS Comput Biol 2018; 14:e1006101. [PMID: 29659563 PMCID: PMC5919687 DOI: 10.1371/journal.pcbi.1006101] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2017] [Revised: 04/26/2018] [Accepted: 03/20/2018] [Indexed: 12/22/2022] Open
Abstract
The origin of the machinery that realizes protein biosynthesis in all organisms is still unclear. One key component of this machinery are aminoacyl tRNA synthetases (aaRS), which ligate tRNAs to amino acids while consuming ATP. Sequence analyses revealed that these enzymes can be divided into two complementary classes. Both classes differ significantly on a sequence and structural level, feature different reaction mechanisms, and occur in diverse oligomerization states. The one unifying aspect of both classes is their function of binding ATP. We identified Backbone Brackets and Arginine Tweezers as most compact ATP binding motifs characteristic for each Class. Geometric analysis shows a structural rearrangement of the Backbone Brackets upon ATP binding, indicating a general mechanism of all Class I structures. Regarding the origin of aaRS, the Rodin-Ohno hypothesis states that the peculiar nature of the two aaRS classes is the result of their primordial forms, called Protozymes, being encoded on opposite strands of the same gene. Backbone Brackets and Arginine Tweezers were traced back to the proposed Protozymes and their more efficient successors, the Urzymes. Both structural motifs can be observed as pairs of residues in contemporary structures and it seems that the time of their addition, indicated by their placement in the ancient aaRS, coincides with the evolutionary trace of Proto- and Urzymes. Aminoacyl tRNA synthetases (aaRS) are primordial enzymes essential for interpretation and transfer of genetic information. Understanding the origin of the peculiarities observed with aaRS can explain what constituted the earliest life forms and how the genetic code was established. The increasing amount of experimentally determined three-dimensional structures of aaRS opens up new avenues for high-throughput analyses of molecular mechanisms. In this study, we present an exhaustive structural analysis of ATP binding motifs. We unveil an oppositional implementation of enzyme substrate binding in each aaRS Class. While Class I binds via interactions mediated by backbone hydrogen bonds, Class II uses a pair of arginine residues to establish salt bridges to its ATP ligand. We show how nature realized the binding of the same ligand species with completely different mechanisms. In addition, we demonstrate that sequence or even structure analysis for conserved residues may miss important functional aspects which can only be revealed by ligand interaction studies. Additionally, the placement of those key residues in the structure supports a popular hypothesis, which states that prototypic aaRS were once coded on complementary strands of the same gene.
Collapse
Affiliation(s)
- Florian Kaiser
- University of Applied Sciences Mittweida, Mittweida, Germany
- Biotechnology Center (BIOTEC), TU Dresden, Dresden, Germany
- * E-mail:
| | - Sebastian Bittrich
- University of Applied Sciences Mittweida, Mittweida, Germany
- Biotechnology Center (BIOTEC), TU Dresden, Dresden, Germany
| | | | - Christoph Leberecht
- University of Applied Sciences Mittweida, Mittweida, Germany
- Biotechnology Center (BIOTEC), TU Dresden, Dresden, Germany
| | | | | | | | - Dirk Labudde
- University of Applied Sciences Mittweida, Mittweida, Germany
| |
Collapse
|
28
|
Abstract
Inhibition of tRNA aminoacylation has proven to be an effective antimicrobial strategy, impeding an essential step of protein synthesis. Mupirocin, the well-known selective inhibitor of bacterial isoleucyl-tRNA synthetase, is one of three aminoacylation inhibitors now approved for human or animal use. However, design of novel aminoacylation inhibitors is complicated by the steadfast requirement to avoid off-target inhibition of protein synthesis in human cells. Here we review available data regarding known aminoacylation inhibitors as well as key amino-acid residues in aminoacyl-tRNA synthetases (aaRSs) and nucleotides in tRNA that determine the specificity and strength of the aaRS-tRNA interaction. Unlike most ligand-protein interactions, the aaRS-tRNA recognition interaction represents coevolution of both the tRNA and aaRS structures to conserve the specificity of aminoacylation. This property means that many determinants of tRNA recognition in pathogens have diverged from those of humans-a phenomenon that provides a valuable source of data for antimicrobial drug development.
Collapse
Affiliation(s)
- Joanne M Ho
- a Department of BioSciences , Rice University , Houston , TX , United States
| | | | - Dieter Söll
- c Departments of Molecular Biophysics & Biochemistry , Yale University , New Haven , CT , United States.,d Department of Chemistry , Yale University , New Haven , CT , United States
| | | |
Collapse
|
29
|
Carter CW, Wills PR. Interdependence, Reflexivity, Fidelity, Impedance Matching, and the Evolution of Genetic Coding. Mol Biol Evol 2018; 35:269-286. [PMID: 29077934 PMCID: PMC5850816 DOI: 10.1093/molbev/msx265] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Genetic coding is generally thought to have required ribozymes whose functions were taken over by polypeptide aminoacyl-tRNA synthetases (aaRS). Two discoveries about aaRS and their interactions with tRNA substrates now furnish a unifying rationale for the opposite conclusion: that the key processes of the Central Dogma of molecular biology emerged simultaneously and naturally from simple origins in a peptide•RNA partnership, eliminating the epistemological utility of a prior RNA world. First, the two aaRS classes likely arose from opposite strands of the same ancestral gene, implying a simple genetic alphabet. The resulting inversion symmetries in aaRS structural biology would have stabilized the initial and subsequent differentiation of coding specificities, rapidly promoting diversity in the proteome. Second, amino acid physical chemistry maps onto tRNA identity elements, establishing reflexive, nanoenvironmental sensing in protein aaRS. Bootstrapping of increasingly detailed coding is thus intrinsic to polypeptide aaRS, but impossible in an RNA world. These notions underline the following concepts that contradict gradual replacement of ribozymal aaRS by polypeptide aaRS: 1) aaRS enzymes must be interdependent; 2) reflexivity intrinsic to polypeptide aaRS production dynamics promotes bootstrapping; 3) takeover of RNA-catalyzed aminoacylation by enzymes will necessarily degrade specificity; and 4) the Central Dogma's emergence is most probable when replication and translation error rates remain comparable. These characteristics are necessary and sufficient for the essentially de novo emergence of a coupled gene-replicase-translatase system of genetic coding that would have continuously preserved the functional meaning of genetically encoded protein genes whose phylogenetic relationships match those observed today.
Collapse
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Peter R Wills
- Department of Physics, University of Auckland, Auckland, New Zealand
| |
Collapse
|
30
|
Genetic coding algorithm for sense and antisense peptide interactions. Biosystems 2018; 164:199-216. [DOI: 10.1016/j.biosystems.2017.10.009] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2017] [Revised: 10/13/2017] [Accepted: 10/16/2017] [Indexed: 12/31/2022]
|
31
|
Opuu V, Silvert M, Simonson T. Computational design of fully overlapping coding schemes for protein pairs and triplets. Sci Rep 2017; 7:15873. [PMID: 29158504 PMCID: PMC5696523 DOI: 10.1038/s41598-017-16221-8] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2017] [Accepted: 11/09/2017] [Indexed: 11/26/2022] Open
Abstract
Gene pairs that overlap in their coding regions are rare except in viruses. They may occur transiently in gene creation and are of biotechnological interest. We have examined the possibility to encode an arbitrary pair of protein domains as a dual gene, with the shorter coding sequence completely embedded in the longer one. For 500 × 500 domain pairs (X, Y), we computationally designed homologous pairs (X', Y') coded this way, using an algorithm that provably maximizes the sequence similarity between (X', Y') and (X, Y). Three schemes were considered, with X' and Y' coded on the same or complementary strands. For 16% of the pairs, an overlapping coding exists where the level of homology of X', Y' to the natural proteins represents an E-value of 10-10 or better. Thus, for an arbitrary domain pair, it is surprisingly easy to design homologous sequences that can be encoded as a fully-overlapping gene pair. The algorithm is general and was used to design 200 triple genes, with three proteins encoded by the same DNA segment. The ease of design suggests overlapping genes may have occurred frequently in evolution and could be readily used to compress or constrain artificial genomes.
Collapse
Affiliation(s)
- Vaitea Opuu
- Laboratoire de Biochimie (CNRS UMR7654), Ecole Polytechnique, Palaiseau, France
| | - Martin Silvert
- Laboratoire de Biochimie (CNRS UMR7654), Ecole Polytechnique, Palaiseau, France
| | - Thomas Simonson
- Laboratoire de Biochimie (CNRS UMR7654), Ecole Polytechnique, Palaiseau, France.
| |
Collapse
|
32
|
Zamudio GS, José MV. Phenotypic Graphs and Evolution Unfold the Standard Genetic Code as the Optimal. ORIGINS LIFE EVOL B 2017; 48:83-91. [PMID: 29082465 DOI: 10.1007/s11084-017-9552-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2017] [Accepted: 10/16/2017] [Indexed: 10/18/2022]
Abstract
In this work, we explicitly consider the evolution of the Standard Genetic Code (SGC) by assuming two evolutionary stages, to wit, the primeval RNY code and two intermediate codes in between. We used network theory and graph theory to measure the connectivity of each phenotypic graph. The connectivity values are compared to the values of the codes under different randomization scenarios. An error-correcting optimal code is one in which the algebraic connectivity is minimized. We show that the SGC is optimal in regard to its robustness and error-tolerance when compared to all random codes under different assumptions.
Collapse
Affiliation(s)
- Gabriel S Zamudio
- Theoretical Biology Group, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, C.P. 04510, Ciudad de México CDMX, Mexico
| | - Marco V José
- Theoretical Biology Group, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, C.P. 04510, Ciudad de México CDMX, Mexico.
| |
Collapse
|
33
|
Xu L, Zhao L, Gao Y, Xu J, Han R. Empower multiplex cell and tissue-specific CRISPR-mediated gene manipulation with self-cleaving ribozymes and tRNA. Nucleic Acids Res 2017; 45:e28. [PMID: 27799472 PMCID: PMC5389707 DOI: 10.1093/nar/gkw1048] [Citation(s) in RCA: 39] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2016] [Revised: 10/06/2016] [Accepted: 10/20/2016] [Indexed: 01/12/2023] Open
Abstract
Clustered regularly interspaced short palindromic repeat/Cas9 (CRISPR/Cas9) system has emerged in recent years as a highly efficient RNA-guided gene manipulation platform. Simultaneous editing or transcriptional activation/suppression of different genes becomes feasible with the co-delivery of multiple guide RNAs (gRNAs). Here, we report that multiple gRNAs linked with self-cleaving ribozymes and/or tRNA could be simultaneously expressed from a single U6 promoter to exert genome editing of dystrophin and myosin binding protein C3 in human and mouse cells. Moreover, this strategy allows the expression of multiple gRNAs for synergistic transcription activation of follistatin when used with catalytically inactive dCas9-VP64 or dCas9-p300core fusions. Finally, the gRNAs linked by the self-cleaving ribozymes and tRNA could be expressed from RNA polymerase type II (pol II) promoters such as generic CMV and muscle/heart-specific MHCK7. This is particularly useful for in vivo applications when the packaging capacity of recombinant adeno-associated virus is limited while tissue-specific delivery of gRNAs and Cas9 is desired. Taken together, this study provides a novel strategy to enable tissue-specific expression of more than one gRNAs for multiplex gene editing from a single pol II promoter.
Collapse
MESH Headings
- Animals
- CRISPR-Cas Systems
- Cell Line
- Cytomegalovirus/genetics
- Cytomegalovirus/metabolism
- DNA Polymerase II/genetics
- DNA Polymerase II/metabolism
- Dystrophin/genetics
- Dystrophin/metabolism
- Follistatin/genetics
- Follistatin/metabolism
- Gene Editing
- HEK293 Cells
- Humans
- Liver/metabolism
- Macrophages/cytology
- Macrophages/metabolism
- Mice
- Mice, Inbred C57BL
- Mice, Transgenic
- Muscle, Skeletal/metabolism
- Myoblasts/cytology
- Myoblasts/metabolism
- Myosins/genetics
- Myosins/metabolism
- Promoter Regions, Genetic
- RNA, Catalytic/genetics
- RNA, Catalytic/metabolism
- RNA, Guide, CRISPR-Cas Systems/genetics
- RNA, Guide, CRISPR-Cas Systems/metabolism
- RNA, Small Nuclear/genetics
- RNA, Small Nuclear/metabolism
- RNA, Transfer/genetics
- RNA, Transfer/metabolism
- Recombinant Fusion Proteins/genetics
- Recombinant Fusion Proteins/metabolism
- Transcriptional Activation
Collapse
Affiliation(s)
- Li Xu
- Department of Surgery, Davis Heart and Lung Research Institute, Biomedical Sciences Graduate Program, Biophysics Graduate Program, The Ohio State University Wexner Medical Center, Columbus, OH 43210, USA
| | - Lixia Zhao
- Department of Surgery, Davis Heart and Lung Research Institute, Biomedical Sciences Graduate Program, Biophysics Graduate Program, The Ohio State University Wexner Medical Center, Columbus, OH 43210, USA
| | - Yandi Gao
- Department of Surgery, Davis Heart and Lung Research Institute, Biomedical Sciences Graduate Program, Biophysics Graduate Program, The Ohio State University Wexner Medical Center, Columbus, OH 43210, USA
| | - Jing Xu
- Department of Surgery, Davis Heart and Lung Research Institute, Biomedical Sciences Graduate Program, Biophysics Graduate Program, The Ohio State University Wexner Medical Center, Columbus, OH 43210, USA
| | - Renzhi Han
- Department of Surgery, Davis Heart and Lung Research Institute, Biomedical Sciences Graduate Program, Biophysics Graduate Program, The Ohio State University Wexner Medical Center, Columbus, OH 43210, USA
| |
Collapse
|
34
|
Czapiński J, Kiełbus M, Kałafut J, Kos M, Stepulak A, Rivero-Müller A. How to Train a Cell-Cutting-Edge Molecular Tools. Front Chem 2017; 5:12. [PMID: 28344971 PMCID: PMC5344921 DOI: 10.3389/fchem.2017.00012] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2016] [Accepted: 02/20/2017] [Indexed: 12/28/2022] Open
Abstract
In biological systems, the formation of molecular complexes is the currency for all cellular processes. Traditionally, functional experimentation was targeted to single molecular players in order to understand its effects in a cell or animal phenotype. In the last few years, we have been experiencing rapid progress in the development of ground-breaking molecular biology tools that affect the metabolic, structural, morphological, and (epi)genetic instructions of cells by chemical, optical (optogenetic) and mechanical inputs. Such precise dissection of cellular processes is not only essential for a better understanding of biological systems, but will also allow us to better diagnose and fix common dysfunctions. Here, we present several of these emerging and innovative techniques by providing the reader with elegant examples on how these tools have been implemented in cells, and, in some cases, organisms, to unravel molecular processes in minute detail. We also discuss their advantages and disadvantages with particular focus on their translation to multicellular organisms for in vivo spatiotemporal regulation. We envision that further developments of these tools will not only help solve the processes of life, but will give rise to novel clinical and industrial applications.
Collapse
Affiliation(s)
- Jakub Czapiński
- Department of Biochemistry and Molecular Biology, Medical University of LublinLublin, Poland
- Postgraduate School of Molecular Medicine, Medical University of WarsawWarsaw, Poland
| | - Michał Kiełbus
- Department of Biochemistry and Molecular Biology, Medical University of LublinLublin, Poland
| | - Joanna Kałafut
- Department of Biochemistry and Molecular Biology, Medical University of LublinLublin, Poland
| | - Michał Kos
- Department of Biochemistry and Molecular Biology, Medical University of LublinLublin, Poland
| | - Andrzej Stepulak
- Department of Biochemistry and Molecular Biology, Medical University of LublinLublin, Poland
| | - Adolfo Rivero-Müller
- Department of Biochemistry and Molecular Biology, Medical University of LublinLublin, Poland
- Turku Centre for Biotechnology, University of Turku and Åbo Akademi UniversityTurku, Finland
- Department of Biosciences, Åbo Akademi UniversityTurku, Finland
| |
Collapse
|
35
|
José MV, Zamudio GS, Morgado ER. A unified model of the standard genetic code. ROYAL SOCIETY OPEN SCIENCE 2017; 4:160908. [PMID: 28405378 PMCID: PMC5383835 DOI: 10.1098/rsos.160908] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/14/2016] [Accepted: 01/30/2017] [Indexed: 06/07/2023]
Abstract
The Rodin-Ohno (RO) and the Delarue models divide the table of the genetic code into two classes of aminoacyl-tRNA synthetases (aaRSs I and II) with recognition from the minor or major groove sides of the tRNA acceptor stem, respectively. These models are asymmetric but they are biologically meaningful. On the other hand, the standard genetic code (SGC) can be derived from the primeval RNY code (R stands for purines, Y for pyrimidines and N any of them). In this work, the RO-model is derived by means of group actions, namely, symmetries represented by automorphisms, assuming that the SGC originated from a primeval RNY code. It turns out that the RO-model is symmetric in a six-dimensional (6D) hypercube. Conversely, using the same automorphisms, we show that the RO-model can lead to the SGC. In addition, the asymmetric Delarue model becomes symmetric by means of quotient group operations. We formulate isometric functions that convert the class aaRS I into the class aaRS II and vice versa. We show that the four polar requirement categories display a symmetrical arrangement in our 6D hypercube. Altogether these results cannot be attained, neither in two nor in three dimensions. We discuss the present unified 6D algebraic model, which is compatible with both the SGC (based upon the primeval RNY code) and the RO-model.
Collapse
Affiliation(s)
- Marco V. José
- Theoretical Biology Group, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, MexicoD.F. 04510, Mexico
| | - Gabriel S. Zamudio
- Theoretical Biology Group, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, MexicoD.F. 04510, Mexico
| | - Eberto R. Morgado
- Facultad de Matemática, Física y Computación, Universidad Central ‘Marta Abreu’ de Las Villas, Santa Clara, Cuba
| |
Collapse
|
36
|
Carter CW. Coding of Class I and II Aminoacyl-tRNA Synthetases. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2017; 966:103-148. [PMID: 28828732 PMCID: PMC5927602 DOI: 10.1007/5584_2017_93] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
The aminoacyl-tRNA synthetases and their cognate transfer RNAs translate the universal genetic code. The twenty canonical amino acids are sufficiently diverse to create a selective advantage for dividing amino acid activation between two distinct, apparently unrelated superfamilies of synthetases, Class I amino acids being generally larger and less polar, Class II amino acids smaller and more polar. Biochemical, bioinformatic, and protein engineering experiments support the hypothesis that the two Classes descended from opposite strands of the same ancestral gene. Parallel experimental deconstructions of Class I and II synthetases reveal parallel losses in catalytic proficiency at two novel modular levels-protozymes and Urzymes-associated with the evolution of catalytic activity. Bi-directional coding supports an important unification of the proteome; affords a genetic relatedness metric-middle base-pairing frequencies in sense/antisense alignments-that probes more deeply into the evolutionary history of translation than do single multiple sequence alignments; and has facilitated the analysis of hitherto unknown coding relationships in tRNA sequences. Reconstruction of native synthetases by modular thermodynamic cycles facilitated by domain engineering emphasizes the subtlety associated with achieving high specificity, shedding new light on allosteric relationships in contemporary synthetases. Synthetase Urzyme structural biology suggests that they are catalytically-active molten globules, broadening the potential manifold of polypeptide catalysts accessible to primitive genetic coding and motivating revisions of the origins of catalysis. Finally, bi-directional genetic coding of some of the oldest genes in the proteome places major limitations on the likelihood that any RNA World preceded the origins of coded proteins.
Collapse
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599-7260, USA.
| |
Collapse
|
37
|
Caetano-Anollés D, Caetano-Anollés G. Piecemeal Buildup of the Genetic Code, Ribosomes, and Genomes from Primordial tRNA Building Blocks. Life (Basel) 2016; 6:life6040043. [PMID: 27918435 PMCID: PMC5198078 DOI: 10.3390/life6040043] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2016] [Revised: 11/21/2016] [Accepted: 11/29/2016] [Indexed: 01/10/2023] Open
Abstract
The origin of biomolecular machinery likely centered around an ancient and central molecule capable of interacting with emergent macromolecular complexity. tRNA is the oldest and most central nucleic acid molecule of the cell. Its co-evolutionary interactions with aminoacyl-tRNA synthetase protein enzymes define the specificities of the genetic code and those with the ribosome their accurate biosynthetic interpretation. Phylogenetic approaches that focus on molecular structure allow reconstruction of evolutionary timelines that describe the history of RNA and protein structural domains. Here we review phylogenomic analyses that reconstruct the early history of the synthetase enzymes and the ribosome, their interactions with RNA, and the inception of amino acid charging and codon specificities in tRNA that are responsible for the genetic code. We also trace the age of domains and tRNA onto ancient tRNA homologies that were recently identified in rRNA. Our findings reveal a timeline of recruitment of tRNA building blocks for the formation of a functional ribosome, which holds both the biocatalytic functions of protein biosynthesis and the ability to store genetic memory in primordial RNA genomic templates.
Collapse
Affiliation(s)
- Derek Caetano-Anollés
- Department of Evolutionary Genetics, Max-Planck-Institut für Evolutionsbiologie, 24306 Plön, Germany.
| | - Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA.
| |
Collapse
|
38
|
Abstract
Aminoacyl-tRNA synthetases (aaRSs) are modular enzymes globally conserved in the three kingdoms of life. All catalyze the same two-step reaction, i.e., the attachment of a proteinogenic amino acid on their cognate tRNAs, thereby mediating the correct expression of the genetic code. In addition, some aaRSs acquired other functions beyond this key role in translation. Genomics and X-ray crystallography have revealed great structural diversity in aaRSs (e.g., in oligomery and modularity, in ranking into two distinct groups each subdivided in 3 subgroups, by additional domains appended on the catalytic modules). AaRSs show huge structural plasticity related to function and limited idiosyncrasies that are kingdom or even species specific (e.g., the presence in many Bacteria of non discriminating aaRSs compensating for the absence of one or two specific aaRSs, notably AsnRS and/or GlnRS). Diversity, as well, occurs in the mechanisms of aaRS gene regulation that are not conserved in evolution, notably between distant groups such as Gram-positive and Gram-negative Bacteria. The review focuses on bacterial aaRSs (and their paralogs) and covers their structure, function, regulation, and evolution. Structure/function relationships are emphasized, notably the enzymology of tRNA aminoacylation and the editing mechanisms for correction of activation and charging errors. The huge amount of genomic and structural data that accumulated in last two decades is reviewed, showing how the field moved from essentially reductionist biology towards more global and integrated approaches. Likewise, the alternative functions of aaRSs and those of aaRS paralogs (e.g., during cell wall biogenesis and other metabolic processes in or outside protein synthesis) are reviewed. Since aaRS phylogenies present promiscuous bacterial, archaeal, and eukaryal features, similarities and differences in the properties of aaRSs from the three kingdoms of life are pinpointed throughout the review and distinctive characteristics of bacterium-like synthetases from organelles are outlined.
Collapse
Affiliation(s)
- Richard Giegé
- Architecture et Réactivité de l'ARN, Université de Strasbourg, CNRS, IBMC, 67084 Strasbourg, France
| | - Mathias Springer
- Université Paris Diderot, Sorbonne Cité, UPR9073 CNRS, IBPC, 75005 Paris, France
| |
Collapse
|
39
|
Wills PR. The generation of meaningful information in molecular systems. PHILOSOPHICAL TRANSACTIONS. SERIES A, MATHEMATICAL, PHYSICAL, AND ENGINEERING SCIENCES 2016; 374:rsta.2015.0066. [PMID: 26857673 DOI: 10.1098/rsta.2015.0066] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 07/17/2015] [Indexed: 06/05/2023]
Abstract
The physico-chemical processes occurring inside cells are under the computational control of genetic (DNA) and epigenetic (internal structural) programming. The origin and evolution of genetic information (nucleic acid sequences) is reasonably well understood, but scant attention has been paid to the origin and evolution of the molecular biological interpreters that give phenotypic meaning to the sequence information that is quite faithfully replicated during cellular reproduction. The near universality and age of the mapping from nucleotide triplets to amino acids embedded in the functionality of the protein synthetic machinery speaks to the early development of a system of coding which is still extant in every living organism. We take the origin of genetic coding as a paradigm of the emergence of computation in natural systems, focusing on the requirement that the molecular components of an interpreter be synthesized autocatalytically. Within this context, it is seen that interpreters of increasing complexity are generated by series of transitions through stepped dynamic instabilities (non-equilibrium phase transitions). The early phylogeny of the amino acyl-tRNA synthetase enzymes is discussed in such terms, leading to the conclusion that the observed optimality of the genetic code is a natural outcome of the processes of self-organization that produced it.
Collapse
Affiliation(s)
- Peter R Wills
- Department of Physics, University of Auckland, PB 92019, Auckland 1142, Aotearoa, New Zealand
| |
Collapse
|
40
|
Wills PR. DNA as information. PHILOSOPHICAL TRANSACTIONS. SERIES A, MATHEMATICAL, PHYSICAL, AND ENGINEERING SCIENCES 2016; 374:rsta.2015.0417. [PMID: 26857666 DOI: 10.1098/rsta.2015.0417] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 12/14/2015] [Indexed: 06/05/2023]
Abstract
This article reviews contributions to this theme issue covering the topic 'DNA as information' in relation to the structure of DNA, the measure of its information content, the role and meaning of information in biology and the origin of genetic coding as a transition from uninformed to meaningful computational processes in physical systems.
Collapse
Affiliation(s)
- Peter R Wills
- Department of Physics, University of Auckland, PO Box 92019, Auckland 1142, New Zealand Institut für Biochemie und Molekularbiologie, Universität Hamburg, Martin-Luther-King-Platz 6, 20146 Hamburg, Germany Department of Biochemistry and Biophysics, University of North Carolina, 120 Mason Farm Road, Chapel Hill, NC 27599-7260, USA
| |
Collapse
|
41
|
Carter CW, Wolfenden R. tRNA acceptor-stem and anticodon bases embed separate features of amino acid chemistry. RNA Biol 2015; 13:145-51. [PMID: 26595350 DOI: 10.1080/15476286.2015.1112488] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022] Open
Abstract
The universal genetic code is a translation table by which nucleic acid sequences can be interpreted as polypeptides with a wide range of biological functions. That information is used by aminoacyl-tRNA synthetases to translate the code. Moreover, amino acid properties dictate protein folding. We recently reported that digital correlation techniques could identify patterns in tRNA identity elements that govern recognition by synthetases. Our analysis, and the functionality of truncated synthetases that cannot recognize the tRNA anticodon, support the conclusion that the tRNA acceptor stem houses an independent code for the same 20 amino acids that likely functioned earlier in the emergence of genetics. The acceptor-stem code, related to amino acid size, is distinct from a code in the anticodon that is related to amino acid polarity. Details of the acceptor-stem code suggest that it was useful in preserving key properties of stereochemically-encoded peptides that had developed the capacity to interact catalytically with RNA. The quantitative embedding of the chemical properties of amino acids into tRNA bases has implications for the origins of molecular biology.
Collapse
Affiliation(s)
- Charles W Carter
- a Department of Biochemistry and Biophysics , University of North Carolina at Chapel Hill , Chapel Hill , NC 27599-7260
| | - Richard Wolfenden
- a Department of Biochemistry and Biophysics , University of North Carolina at Chapel Hill , Chapel Hill , NC 27599-7260
| |
Collapse
|
42
|
Abstract
The hydrophobicities of the 20 common amino acids are reflected in their tendencies to appear in interior positions in globular proteins and in deeply buried positions of membrane proteins. To determine whether these relationships might also have been valid in the warm surroundings where life may have originated, we examined the effect of temperature on the hydrophobicities of the amino acids as measured by the equilibrium constants for transfer of their side-chains from neutral solution to cyclohexane (K(w > c)). The hydrophobicities of most amino acids were found to increase with increasing temperature. Because that effect is more pronounced for the more polar amino acids, the numerical range of K(w > c) values decreases with increasing temperature. There are also modest changes in the ordering of the more polar amino acids. However, those changes are such that they would have tended to minimize the otherwise disruptive effects of a changing thermal environment on the evolution of protein structure. Earlier, the genetic code was found to be organized in such a way that--with a single exception (threonine)--the side-chain dichotomy polar/nonpolar matches the nucleic acid base dichotomy purine/pyrimidine at the second position of each coding triplet at 25 °C. That dichotomy is preserved at 100 °C. The accessible surface areas of amino acid side-chains in folded proteins are moderately correlated with hydrophobicity, but when free energies of vapor-to-cyclohexane transfer (corresponding to size) are taken into consideration, a closer relationship becomes apparent.
Collapse
|
43
|
Carter CW. What RNA World? Why a Peptide/RNA Partnership Merits Renewed Experimental Attention. Life (Basel) 2015; 5:294-320. [PMID: 25625599 PMCID: PMC4390853 DOI: 10.3390/life5010294] [Citation(s) in RCA: 58] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2014] [Accepted: 01/12/2015] [Indexed: 12/16/2022] Open
Abstract
We review arguments that biology emerged from a reciprocal partnership in which small ancestral oligopeptides and oligonucleotides initially both contributed rudimentary information coding and catalytic rate accelerations, and that the superior information-bearing qualities of RNA and the superior catalytic potential of proteins emerged from such complexes only with the gradual invention of the genetic code. A coherent structural basis for that scenario was articulated nearly a decade before the demonstration of catalytic RNA. Parallel hierarchical catalytic repertoires for increasingly highly conserved sequences from the two synthetase classes now increase the likelihood that they arose as translation products from opposite strands of a single gene. Sense/antisense coding affords a new bioinformatic metric for phylogenetic relationships much more distant than can be reconstructed from multiple sequence alignments of a single superfamily. Evidence for distinct coding properties in tRNA acceptor stems and anticodons, and experimental demonstration that the two synthetase family ATP binding sites can indeed be coded by opposite strands of the same gene supplement these biochemical and bioinformatic data, establishing a solid basis for key intermediates on a path from simple, stereochemically coded, reciprocally catalytic peptide/RNA complexes through the earliest peptide catalysts to contemporary aminoacyl-tRNA synthetases. That scenario documents a path to increasing complexity that obviates the need for a single polymer to act both catalytically and as an informational molecule.
Collapse
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260, USA.
| |
Collapse
|