1
Wong TF. Triphasic Development of the Genetic Code. Chem Rev 2024; 124:9866-9872. [PMID: 39088192 DOI: 10.1021/acs.chemrev.3c00915] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 08/02/2024]
Abstract
The genetic code contains an alphabet of genetically encoded amino acids. The ten Phase 1 amino acids, including Gly, Ala, Ser, Asp, Glu, Val, Leu, Ile, Pro and Thr, were available from the prebiotic environment, whereas the ten Phase 2 amino acids, including Phe, Tyr, Arg, His, Trp, Asn, Gln, Lys, Cys, and Met, became available only later from amino acid biosyntheses. In the archaeon Methanopyrus kandleri, the oldest organism known, the standard alphabet of 20 amino acids was "frozen" and no additional amino acid was encoded in the subsequent 3 Gyrs. Four decades ago, it was discovered that the code was frozen because all the organisms were so well adapted to the standard amino acids that oligogenic barriers, consisting of genes that are thoroughly dependent on the standard code, would cause loss of viability upon the deletion of any one amino acid from the code. Once the reason for the freezing of the code was ascertained, procedures were devised by scientists worldwide to enable the encoding of novel noncanonical amino acids (ncAAs). These encoded Phase 3 ncAAs now surpass the 20 canonical Phase 2 amino acids in the code.
Affiliation(s)
- Tze-Fei Wong
  - Division of Life Science and Applied Genomics Center, Hong Kong University of Science & Technology, Hong Kong, China
2
Tang GQ, Hu H, Douglas J, Carter C. Primordial aminoacyl-tRNA synthetases preferred minihelices to full-length tRNA. Nucleic Acids Res 2024; 52:7096-7111. [PMID: 38783009 PMCID: PMC11229368 DOI: 10.1093/nar/gkae417] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/27/2024] [Revised: 04/30/2024] [Accepted: 05/10/2024] [Indexed: 05/25/2024] Open
Abstract
Aminoacyl-tRNA synthetases (AARS) and tRNAs translate the genetic code in all living cells. Little is known about how their molecular ancestors began to enforce the coding rules for the expression of their own genes. Schimmel et al. proposed in 1993 that AARS catalytic domains began by reading an 'operational' code in the acceptor stems of tRNA minihelices. We show here that the enzymology of an AARS urzyme•TΨC-minihelix cognate pair is a rich in vitro realization of that idea. The TΨC-minihelixLeu is a very poor substrate for full-length Leucyl-tRNA synthetase. It is a superior RNA substrate for the corresponding urzyme, LeuAC. LeuAC active-site mutations shift the choice of both amino acid and RNA substrates. AARS urzyme•minihelix cognate pairs are thus small, pliant models for the ancestral decoding hardware. They are thus an ideal platform for detailed experimental study of the operational RNA code.
Affiliation(s)
- Guo Qing Tang
  - Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA
- Hao Hu
  - Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA
- Jordan Douglas
  - Department of Physics, The University of Auckland, New Zealand
  - Centre for Computational Evolution, University of Auckland, New Zealand
  - Department of Computer Science, The University of Auckland, New Zealand
- Charles W Carter
  - Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA
3
Carter CW. Base Pairing Promoted the Self-Organization of Genetic Coding, Catalysis, and Free-Energy Transduction. Life (Basel) 2024; 14:199. [PMID: 38398709 PMCID: PMC10890426 DOI: 10.3390/life14020199] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/05/2024] [Revised: 01/21/2024] [Accepted: 01/25/2024] [Indexed: 02/25/2024] Open
Abstract
How Nature discovered genetic coding is a largely ignored question, yet the answer is key to explaining the transition from biochemical building blocks to life. Other, related puzzles also fall inside the aegis enclosing the codes themselves. The peptide bond is unstable with respect to hydrolysis. So, it requires some form of chemical free energy to drive it. Amino acid activation and acyl transfer are also slow and must be catalyzed. All living things must thus also convert free energy and synchronize cellular chemistry. Most importantly, functional proteins occupy only small, isolated regions of sequence space. Nature evolved heritable symbolic data processing to seek out and use those sequences. That system has three parts: a memory of how amino acids behave in solution and inside proteins, a set of code keys to access that memory, and a scoring function. The code keys themselves are the genes for cognate pairs of tRNA and aminoacyl-tRNA synthetases, AARSs. The scoring function is the enzymatic specificity constant, kcat/kM, which measures both catalysis and specificity. The work described here deepens the evidence for and understanding of an unexpected consequence of ancestral bidirectional coding. Secondary structures occur in approximately the same places within antiparallel alignments of their gene products. However, the polar amino acids that define the molecular surface of one are reflected into core-defining non-polar side chains on the other. Proteins translated from base-paired coding strands fold up inside out. Bidirectional genes thus project an inverted structural duality into the proteome. I review how experimental data root the scoring functions responsible for the origins of coding and catalyzed activation of unfavorable chemical reactions in that duality.
Affiliation(s)
- Charles W Carter
  - Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260, USA
4
Douglas J, Bouckaert R, Carter CW, Wills P. Enzymic recognition of amino acids drove the evolution of primordial genetic codes. Nucleic Acids Res 2024; 52:558-571. [PMID: 38048305 PMCID: PMC10810186 DOI: 10.1093/nar/gkad1160] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Received: 08/10/2023] [Revised: 10/28/2023] [Accepted: 11/20/2023] [Indexed: 12/06/2023] Open
Abstract
How genetic information gained its exquisite control over chemical processes needed to build living cells remains an enigma. Today, the aminoacyl-tRNA synthetases (AARS) execute the genetic codes in all living systems. But how did the AARS that emerged over three billion years ago as low-specificity, protozymic forms then spawn the full range of highly-specific enzymes that distinguish between 22 diverse amino acids? A phylogenetic reconstruction of extant AARS genes, enhanced by analysing modular acquisitions, reveals six AARS with distinct bacterial, archaeal, eukaryotic, or organellar clades, resulting in a total of 36 families of AARS catalytic domains. Small structural modules that differentiate one AARS family from another played pivotal roles in discriminating between amino acid side chains, thereby expanding the genetic code and refining its precision. The resulting model shows a tendency for less elaborate enzymes, with simpler catalytic domains, to activate amino acids that were not synthesised until later in the evolution of the code. The most probable evolutionary route for an emergent amino acid type to establish a place in the code was by recruiting older, less specific AARS, rather than adapting contemporary lineages. This process, retrofunctionalisation, differs from previously described mechanisms through which amino acids would enter the code.
Affiliation(s)
- Jordan Douglas
  - Department of Physics, The University of Auckland, New Zealand
  - Centre for Computational Evolution, The University of Auckland, New Zealand
- Remco Bouckaert
  - Centre for Computational Evolution, The University of Auckland, New Zealand
  - School of Computer Science, The University of Auckland, New Zealand
- Charles W Carter
  - Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, USA
- Peter R Wills
  - Department of Physics, The University of Auckland, New Zealand
  - Centre for Computational Evolution, The University of Auckland, New Zealand
5
Patra SK, Douglas J, Wills PR, Bouckaert R, Betts L, Qing TG, Carter CW. Genomic database furnishes a spontaneous example of a functional Class II glycyl-tRNA synthetase urzyme. bioRxiv 2024:2024.01.11.575260. [PMID: 38260702 PMCID: PMC10802616 DOI: 10.1101/2024.01.11.575260] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/24/2024]
Abstract
The chief barrier to studies of how genetic coding emerged is the lack of experimental models for ancestral aminoacyl-tRNA synthetases (AARS). We hypothesized that conserved core catalytic sites could represent such ancestors. That hypothesis enabled engineering functional "urzymes" from TrpRS, LeuRS, and HisRS. We describe here a fourth urzyme, GlyCA, detected in an open reading frame from the genomic record of the arctic fox, Vulpes lagopus. GlyCA is homologous to a bacterial heterotetrameric Class II GlyRS-B. Alphafold2 predicted that the N-terminal 81 amino acids would adopt a 3D structure nearly identical to the HisRS urzyme (HisCA1). We expressed and purified that N-terminal segment. Enzymatic characterization revealed a robust single-turnover burst size and a catalytic rate for ATP consumption well in excess of that previously published for HisCA1. Time-dependent aminoacylation of tRNAGly proceeds at a rate consistent with that observed for amino acid activation. In fact, GlyCA is actually 35 times more active in glycine activation by ATP than the full-length GlyRS-B α-subunit dimer. ATP-dependent activation of the 20 canonical amino acids favors Class II amino acids that complement those favored by HisCA and LeuAC. These properties reinforce the notion that urzymes represent the requisite ancestral catalytic activities to implement a reduced genetic coding alphabet.
Affiliation(s)
- Sourav Kumar Patra
  - Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260
- Jordan Douglas
  - Department of Physics, The University of Auckland, New Zealand
  - Centre for Computational Evolution, University of Auckland, New Zealand
- Peter R. Wills
  - Department of Physics, The University of Auckland, New Zealand
- Remco Bouckaert
  - Centre for Computational Evolution, University of Auckland, New Zealand
  - Department of Computer Science, The University of Auckland, New Zealand
- Laurie Betts
  - Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260
- Charles W. Carter
  - Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260
6
José MV, Bobadilla JR, Zamudio GS, de Farías ST. Symmetrical distributions of aminoacyl-tRNA synthetases during the evolution of the genetic code. Theory Biosci 2023; 142:211-219. [PMID: 37402895 PMCID: PMC10423125 DOI: 10.1007/s12064-023-00394-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/06/2022] [Accepted: 06/10/2023] [Indexed: 07/06/2023]
Abstract
In this work, we formulate the following question: How the distribution of aminoacyl-tRNA synthetases (aaRSs) went from an ancestral bidirectional gene (mirror symmetry) to the symmetrical distribution of aaRSs in a six-dimensional hypercube of the Standard Genetic Code (SGC)? We assume a primeval RNY code, two Extended Genetic RNA codes type 1 and 2, and the SGC. We outline the types of symmetries of the distribution of aaRSs in each code. The symmetry groups of aaRSs in each code are described, until the symmetries of the SGC display a mirror symmetry. Considering both Extended RNA codes the 20 aaRSs were already present before the Last Universal Ancestor. These findings reveal intricacies in the diversification of aaRSs accompanied by the evolution of the genetic code.
Affiliation(s)
- Marco V José
  - Theoretical Biology Group, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, CP 04510, Mexico City, Mexico
- Juan R Bobadilla
  - Theoretical Biology Group, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, CP 04510, Mexico City, Mexico
- Gabriel S Zamudio
  - Theoretical Biology Group, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, CP 04510, Mexico City, Mexico
- Sávio Torres de Farías
  - Laboratório de Genética Evolutiva Paulo Leminsk, Departamento de Biologia Molecular, Universidade Federal da Paraíba, João Pessoa, Paraíba, Brazil
7
Tang GQ, Elder JJH, Douglas J, Carter CW. Domain acquisition by class I aminoacyl-tRNA synthetase urzymes coordinated the catalytic functions of HVGH and KMSKS motifs. Nucleic Acids Res 2023; 51:8070-8084. [PMID: 37470821 PMCID: PMC10450160 DOI: 10.1093/nar/gkad590] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Received: 04/11/2023] [Revised: 06/23/2023] [Accepted: 07/11/2023] [Indexed: 07/21/2023] Open
Abstract
Leucyl-tRNA synthetase (LeuRS) is a Class I aminoacyl-tRNA synthetase (aaRS) that synthesizes leucyl-tRNAleu for codon-directed protein synthesis. Two signature sequences, HxGH and KMSKS help stabilize transition-states for amino acid activation and tRNA aminoacylation by all Class I aaRS. Separate alanine mutants of each signature, together with the double mutant, behave in opposite ways in Pyrococcus horikoshii LeuRS and the 129-residue urzyme ancestral model generated from it (LeuAC). Free energy coupling terms, Δ(ΔG‡), for both reactions are large and favourable for LeuRS, but unfavourable for LeuAC. Single turnover assays with 32Pα-ATP show correspondingly different internal products. These results implicate domain motion in catalysis by full-length LeuRS. The distributed thermodynamic cycle of mutational changes authenticates LeuAC urzyme catalysis far more convincingly than do single point mutations. Most importantly, the evolutionary gain of function induced by acquiring the anticodon-binding (ABD) and multiple insertion modules in the catalytic domain appears to be to coordinate the catalytic function of the HxGH and KMSKS signature sequences. The implication that backbone elements of secondary structures achieve a major portion of the overall transition-state stabilization by LeuAC is also consistent with coevolution of the genetic code and metabolic pathways necessary to produce histidine and lysine sidechains.
Affiliation(s)
- Guo Qing Tang
  - Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA
- Jessica J H Elder
  - Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA
- Jordan Douglas
  - Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA
  - Department of Physics, The University of Auckland, New Zealand
- Charles W Carter
  - Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA
8
Zagrovic B, Adlhart M, Kapral TH. Coding From Binding? Molecular Interactions at the Heart of Translation. Annu Rev Biophys 2023; 52:69-89. [PMID: 36626765 DOI: 10.1146/annurev-biophys-090622-102329] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/11/2023]
Abstract
The mechanism and the evolution of DNA replication and transcription, the key elements of the central dogma of biology, are fundamentally well explained by the physicochemical complementarity between strands of nucleic acids. However, the determinants that have shaped the third part of the dogma – the process of biological translation and the universal genetic code – remain unclear. We review and seek parallels between different proposals that view the evolution of translation through the prism of weak, noncovalent interactions between biological macromolecules. In particular, we focus on a recent proposal that there exists a hitherto unrecognized complementarity at the heart of biology, that between messenger RNA coding regions and the proteins that they encode, especially if the two are unstructured. Reflecting the idea that the genetic code evolved from intrinsic binding propensities between nucleotides and amino acids, this proposal promises to forge a link between the distant past and the present of biological systems.
Affiliation(s)
- Bojan Zagrovic
  - Department of Structural and Computational Biology, Max Perutz Labs & University of Vienna, Vienna, Austria
- Marlene Adlhart
  - Department of Structural and Computational Biology, Max Perutz Labs & University of Vienna, Vienna, Austria
- Thomas H Kapral
  - Department of Structural and Computational Biology, Max Perutz Labs & University of Vienna, Vienna, Austria
  - Vienna BioCenter PhD Program, Doctoral School of the University of Vienna and Medical University of Vienna, Vienna, Austria
9
Halpern A, Bartsch LR, Ibrahim K, Harrison SA, Ahn M, Christodoulou J, Lane N. Biophysical Interactions Underpin the Emergence of Information in the Genetic Code. Life (Basel) 2023; 13:1129. [PMID: 37240774 DOI: 10.3390/life13051129] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 04/04/2023] [Revised: 04/25/2023] [Accepted: 04/30/2023] [Indexed: 05/28/2023] Open
Abstract
The genetic code conceals a 'code within the codons', which hints at biophysical interactions between amino acids and their cognate nucleotides. Yet, research over decades has failed to corroborate systematic biophysical interactions across the code. Using molecular dynamics simulations and NMR, we have analysed interactions between the 20 standard proteinogenic amino acids and 4 RNA mononucleotides in 3 charge states. Our simulations show that 50% of amino acids bind best with their anticodonic middle base in the -1 charge state common to the backbone of RNA, while 95% of amino acids interact most strongly with at least 1 of their codonic or anticodonic bases. Preference for the cognate anticodonic middle base was greater than 99% of randomised assignments. We verify a selection of our results using NMR, and highlight challenges with both techniques for interrogating large numbers of weak interactions. Finally, we extend our simulations to a range of amino acids and dinucleotides, and corroborate similar preferences for cognate nucleotides. Despite some discrepancies between the predicted patterns and those observed in biology, the existence of weak stereochemical interactions means that random RNA sequences could template non-random peptides. This offers a compelling explanation for the emergence of genetic information in biology.
Affiliation(s)
- Aaron Halpern
  - UCL Centre for Life's Origins and Evolution (CLOE), Department of Genetics, Evolution and Environment, University College London, London WC1E 6BT, UK
- Lilly R Bartsch
  - UCL Centre for Life's Origins and Evolution (CLOE), Department of Genetics, Evolution and Environment, University College London, London WC1E 6BT, UK
- Kaan Ibrahim
  - UCL Centre for Life's Origins and Evolution (CLOE), Department of Genetics, Evolution and Environment, University College London, London WC1E 6BT, UK
- Stuart A Harrison
  - UCL Centre for Life's Origins and Evolution (CLOE), Department of Genetics, Evolution and Environment, University College London, London WC1E 6BT, UK
- Minkoo Ahn
  - Department of Structural and Molecular Biology, Institute of Structural and Molecular Biology (ISMB), University College London, London WC1E 6BT, UK
- John Christodoulou
  - Department of Structural and Molecular Biology, Institute of Structural and Molecular Biology (ISMB), University College London, London WC1E 6BT, UK
- Nick Lane
  - UCL Centre for Life's Origins and Evolution (CLOE), Department of Genetics, Evolution and Environment, University College London, London WC1E 6BT, UK
10
A Leucyl-tRNA Synthetase Urzyme: Authenticity of tRNA Synthetase Catalytic Activities and Promiscuous Phosphorylation of Leucyl-5'AMP. Int J Mol Sci 2022; 23:4229. [PMID: 35457045 PMCID: PMC9026127 DOI: 10.3390/ijms23084229] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 03/07/2022] [Revised: 03/30/2022] [Accepted: 03/31/2022] [Indexed: 02/05/2023] Open
Abstract
Aminoacyl-tRNA synthetase (aaRS)/tRNA cognate pairs translate the genetic code by synthesizing specific aminoacyl-tRNAs that are assembled on messenger RNA by the ribosome. Deconstruction of the two distinct aaRS superfamilies (Classes) has provided conceptual and experimental models for their early evolution. Urzymes, containing ~120–130 amino acids excerpted from regions where genetic coding sequence complementarities have been identified, are key experimental models motivated by the proposal of a single bidirectional ancestral gene. Previous reports that Class I and Class II urzymes accelerate both amino acid activation and tRNA aminoacylation have not been extended to other synthetases. We describe a third urzyme (LeuAC) prepared from the Class IA Pyrococcus horikoshii leucyl-tRNA synthetase. We adduce multiple lines of evidence for the authenticity of its catalysis of both canonical reactions, amino acid activation and tRNALeu aminoacylation. Mutation of the three active-site lysine residues to alanine causes significant, but modest reduction in both amino acid activation and aminoacylation. LeuAC also catalyzes production of ADP, a non-canonical enzymatic function that has been overlooked since it first was described for several full-length aaRS in the 1970s. Structural data suggest that the LeuAC active site accommodates two ATP conformations that are prominent in water but rarely seen bound to proteins, accounting for successive, in situ phosphorylation of the bound leucyl-5′AMP phosphate, accounting for ADP production. This unusual ATP consumption regenerates the transition state for amino acid activation and suggests, in turn, that in the absence of the editing and anticodon-binding domains, LeuAC releases leu-5′AMP unusually slowly, relative to the two phosphorylation reactions.
11
Kondratyeva LG, Dyachkova MS, Galchenko AV. The Origin of Genetic Code and Translation in the Framework of Current Concepts on the Origin of Life. Biochemistry (Biokhimiia) 2022; 87:150-169. [PMID: 35508902 DOI: 10.1134/s0006297922020079] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Indexed: 11/23/2022]
Abstract
The origin of genetic code and translation system is probably the central and most difficult problem in the investigations on the origin of life and one of the most complex problems in the evolutionary biology in general. There are multiple hypotheses on the emergence and development of existing genetic systems that propose the mechanisms for the origin and early evolution of genetic code, as well as for the emergence of replication and translation. Here, we discuss the most well-known of these hypotheses, although none of them provides a description of the early evolution of genetic systems without gaps and assumptions. The RNA world hypothesis is a currently prevailing scientific idea on the early evolution of biological and pre-biological structures, the main advantage of which is the assumption that RNAs as the first living systems were self-sufficient, i.e., capable of functioning as both catalysts and templates. However, this hypothesis has also significant limitations. In particular, no ribozymes with processive polymerase activity have been yet discovered or synthesized. Taking into account the mutual need of proteins and nucleic acids in each other in the current world, many authors propose the early evolution scenarios based on the co-evolution of these two classes of organic molecules. They postulate that the emergence of translation was necessary for the replication of nucleic acids, in contrast to the RNA world hypothesis, according to which the emergence of translation was preceded by the era of self-replicating RNAs. Although such scenarios are less parsimonious from the evolutionary point of view, since they require simultaneous emergence and evolution of two classes of organic molecules, as well as the emergence of synchronized replication and translation, their major advantage is that they explain the development of processive and much more accurate protein-dependent replication.
Affiliation(s)
- Liya G Kondratyeva
  - Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, 117997, Russia
- Alexey V Galchenko
  - Peoples' Friendship University of Russia (RUDN University), Moscow, 117198, Russia
12
Carter CW, Popinga A, Bouckaert R, Wills PR. Multidimensional Phylogenetic Metrics Identify Class I Aminoacyl-tRNA Synthetase Evolutionary Mosaicity and Inter-Modular Coupling. Int J Mol Sci 2022; 23:1520. [PMID: 35163448 PMCID: PMC8835825 DOI: 10.3390/ijms23031520] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 12/19/2021] [Revised: 01/17/2022] [Accepted: 01/17/2022] [Indexed: 02/01/2023] Open
Abstract
The role of aminoacyl-tRNA synthetases (aaRS) in the emergence and evolution of genetic coding poses challenging questions concerning their provenance. We seek evidence about their ancestry from curated structure-based multiple sequence alignments of a structurally invariant “scaffold” shared by all 10 canonical Class I aaRS. Three uncorrelated phylogenetic metrics—mutation frequency, its uniformity, and row-by-row cladistic congruence—imply that the Class I scaffold is a mosaic assembled from successive genetic sources. Metrics for different modules vary in accordance with their presumed functionality. Sequences derived from the ATP– and amino acid– binding sites exhibit specific two-way coupling to those derived from Connecting Peptide 1, a third module whose metrics suggest later acquisition. The data help validate: (i) experimental fragmentations of the canonical Class I structure into three partitions that retain catalytic activities in proportion to their length; and (ii) evidence that the ancestral Class I aaRS gene also encoded a Class II ancestor in frame on the opposite strand. A 46-residue Class I “protozyme” roots the Class I tree prior to the adaptive radiation of the Rossmann dinucleotide binding fold that refined substrate discrimination. Such rooting implies near simultaneous emergence of genetic coding and the origin of the proteome, resolving a conundrum posed by previous inferences that Class I aaRS evolved after the genetic code had been implemented in an RNA world. Further, pinpointing discontinuous enhancements of aaRS fidelity establishes a timeline for the growth of coding from a binary amino acid alphabet.
Affiliation(s)
- Charles W. Carter
  - Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260, USA
  - Correspondence: Tel.: +1-919-966-3263
- Alex Popinga
  - Centre for Computational Evolution, University of Auckland, PB 92019, Auckland 1142, New Zealand
- Remco Bouckaert
  - Centre for Computational Evolution, University of Auckland, PB 92019, Auckland 1142, New Zealand
- Peter R. Wills
  - Department of Physics and Te Ao Marama Centre for Fundamental Inquiry, University of Auckland, PB 92019, Auckland 1142, New Zealand
13
Pal S, Goswami S, Das D. Cross β amyloid assemblies as complex catalytic machinery. Chem Commun (Camb) 2021; 57:7597-7609. [PMID: 34278403 DOI: 10.1039/d1cc02880d] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Indexed: 01/21/2023]
Abstract
How modern enzymes evolved as complex catalytic machineries to facilitate diverse chemical transformations is an open question for the emerging field of systems chemistry. Inspired by Nature's ingenuity in creating complex catalytic structures for exotic functions, short peptide-based cross β amyloid sequences have been shown to access intricate binding surfaces demonstrating the traits of extant enzymes and proteins. Based on their catalytic proficiencies reported recently, these amyloid assemblies have been argued as the earliest protein folds. Herein, we map out the recent progress made by our laboratory and other research groups that demonstrate the catalytic diversity of cross β amyloid assemblies. The important role of morphology and specific mutations in peptide sequences has been underpinned in this review. We have divided the feature article into different sections where examples from biology have been covered demonstrating the mechanism of extant biocatalysts and compared with recent works on cross β amyloid folds showing covalent catalysis, aldolase, hydrolase, peroxidase-like activities and complex cascade catalysis. Beyond equilibrium, we have extended our discussion towards transient catalytic amyloid phases mimicking the energy driven cytoskeleton polymerization. Finally, a future outlook has been provided on the way ahead for short peptide-based systems chemistry approaches that can lead to the development of robust catalytic networks with improved enzyme-like proficiencies and higher complexities. The discussed examples along with the rationale behind selecting specific amino acids sequence will benefit readers to design systems for achieving catalytic reactivity similar to natural complex enzymes.
Affiliation(s)
- Sumit Pal
  - Department of Chemical Sciences and Centre for Advanced Functional Materials, Indian Institute of Science Education and Research (IISER) Kolkata, Mohanpur-741246, India
- Surashree Goswami
  - Department of Chemical Sciences and Centre for Advanced Functional Materials, Indian Institute of Science Education and Research (IISER) Kolkata, Mohanpur-741246, India
- Dibyendu Das
  - Department of Chemical Sciences and Centre for Advanced Functional Materials, Indian Institute of Science Education and Research (IISER) Kolkata, Mohanpur-741246, India
14
|
Amino acid activation analysis of primitive aminoacyl-tRNA synthetases encoded by both strands of a single gene using the malachite green assay. Biosystems 2021; 208:104481. [PMID: 34245865 DOI: 10.1016/j.biosystems.2021.104481] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Received: 04/07/2021] [Revised: 07/07/2021] [Accepted: 07/07/2021] [Indexed: 12/19/2022]
Abstract
The Rodin-Ohno hypothesis postulates that two classes of aminoacyl-tRNA synthetases were encoded complementary to double-stranded DNA. Particularly, Geobacillus stearothermophilus tryptophanyl-tRNA synthetase (TrpRS, belonging to class I) and Escherichia coli histidyl-tRNA synthetase (HisRS, belonging to class II) show high complementarity of the middle base of the codons in the mRNA sequence encoding each ATP binding site. Here, for the reported 46-residue peptides designed from the three-dimensional structures of TrpRS and HisRS, amino acid activation analysis was performed using the malachite green assay, which detects the pyrophosphate departing from ATP in the forward reaction of the first step of tRNA aminoacylation. A maltose-binding protein fusion with the 46 residues of TrpRS (TrpRS46mer) exhibited high activation capacity for several amino acids in the presence of ATP and amino acids, but the activity of an alanine substitution mutant of the first histidine in the HIGH motif (TrpRS46merH15A) was largely reduced. In contrast, pyrophosphate release by HisRS46mer in the histidine activation step was lower than that in the case of TrpRS46mer. Both HisRS46mer and the alanine mutant at the 113th arginine (HisRS46merR113A) showed slightly higher levels of pyrophosphate release than the maltose-binding protein alone. These results do not rule out the Rodin-Ohno hypothesis, but may suggest the necessity of establishing unique evolutionary models from different perspectives.
15
Abstract
Codon-dependent translation underlies genetics and phylogenetic inferences, but its origins pose two challenges. Prevailing narratives cannot account for the fact that aminoacyl-tRNA synthetases (aaRSs), which translate the genetic code, must collectively enforce the rules used to assemble themselves. Nor can they explain how specific assignments arose from rudimentary differentiation between ancestral aaRSs and corresponding transfer RNAs (tRNAs). Experimental deconstruction of the two aaRS superfamilies created new experimental tools with which to analyze the emergence of the code. Amino acid and tRNA substrate recognition are linked to phase transfer free energies of amino acids and arise largely from aaRS class-specific differences in secondary structure. Sensitivity to protein folding rules endowed ancestral aaRS-tRNA pairs with the feedback necessary to rapidly compare alternative genetic codes and coding sequences. These and other experimental data suggest that the aaRS bidirectional genetic ancestry stabilized the differentiation and interdependence required to initiate and elaborate the genetic coding table.
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7260, USA
- Peter R Wills
- Department of Physics, University of Auckland, Auckland 1142, New Zealand
16
17
Carter CW, Wills PR. Reciprocally-Coupled Gating: Strange Loops in Bioenergetics, Genetics, and Catalysis. Biomolecules 2021; 11:265. [PMID: 33670192 PMCID: PMC7916928 DOI: 10.3390/biom11020265] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 12/31/2020] [Revised: 02/04/2021] [Accepted: 02/06/2021] [Indexed: 12/12/2022] Open
Abstract
Bioenergetics, genetic coding, and catalysis are all difficult to imagine emerging without pre-existing historical context. That context is often posed as a "Chicken and Egg" problem; its resolution is concisely described by de Grasse Tyson: "The egg was laid by a bird that was not a chicken". The concision and generality of that answer furnish no details, only an appropriate framework from which to examine detailed paradigms that might illuminate paradoxes underlying these three life-defining biomolecular processes. We examine experimental aspects here of five examples that all conform to the same paradigm. In each example, a paradox is resolved by coupling "if, and only if" conditions for reciprocal transitions between levels, such that the consequent of the first test is the antecedent for the second. Each condition thus restricts fluxes through, or "gates" the other. Reciprocally-coupled gating, in which two gated processes constrain one another, is self-referential, hence maps onto the formal structure of "strange loops". That mapping uncovers two different kinds of forces that may help unite the axioms underlying three phenomena that distinguish biology from chemistry. As a physical analog for Gödel's logic, biomolecular strange-loops provide a natural metaphor around which to organize a large body of experimental data, linking biology to information, free energy, and the second law of thermodynamics.
Affiliation(s)
- Charles W. Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260, USA
- Peter R. Wills
- Department of Physics and Te Ao Marama Centre for Fundamental Inquiry, University of Auckland, PB 92019, Auckland 1142, New Zealand
18
Carter CW. Simultaneous codon usage, the origin of the proteome, and the emergence of de-novo proteins. Curr Opin Struct Biol 2021; 68:142-148. [PMID: 33529785 DOI: 10.1016/j.sbi.2021.01.004] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 11/04/2020] [Accepted: 01/05/2021] [Indexed: 12/21/2022]
Abstract
Genetic coding generally uses only one of a gene's two strands; its complement serving as template for replication. Aminoacyl-tRNA synthetases, aaRS, apparently first emerged as pairs on bidirectional genes, in which anticodons in the template strand served as codons for an entirely different protein. Interpreting both strands in frame constrained such genes sufficiently that it was rapidly superseded, leaving only traces in the elevated pairing between codon middle bases in antiparallel alignments. Codon assignments actually promote using information from both strands in multiple reading frames. Related phenomena, known as overprinting, are widely associated with viruses. In-frame bidirectional coding and overprinting nevertheless imply different structural and functional relationships, and different roles in generating folded proteins throughout the evolution of the proteome.
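The middle-base pairing described in this abstract follows from reading both strands in the same frame: the middle base of each codon sits opposite the middle base of the in-frame codon on the complementary strand. A minimal Python sketch of that geometric fact is given here; the example sequence and all function names are illustrative assumptions and are not taken from the cited paper.

```python
# Hypothetical illustration of in-frame bidirectional coding (RNA alphabet).
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna: str) -> str:
    """Return the antiparallel complementary strand, written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(rna))


def codons(rna: str) -> list[str]:
    """Split a sequence into consecutive triplets."""
    if len(rna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return [rna[i:i + 3] for i in range(0, len(rna), 3)]


def opposing_codons(sense: str) -> list[tuple[str, str]]:
    """Pair each sense codon with the in-frame codon lying opposite it.

    The antisense codons are read in the opposite direction, so their
    order is reversed to line them up with the sense codons.
    """
    antisense = codons(reverse_complement(sense))
    return list(zip(codons(sense), reversed(antisense)))


if __name__ == "__main__":
    sense = "GGCGAAUUC"  # assumed three-codon example, not a real gene
    for s, a in opposing_codons(sense):
        # The middle bases always form a Watson-Crick pair.
        assert COMPLEMENT[s[1]] == a[1]
        print(s, a)
```

The first and third codon positions swap roles on the opposite strand, so only the middle base is guaranteed to pair with its positional counterpart, which is why the abstract singles out codon middle bases.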
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260, United States
19
Wills PR, Carter CW. Impedance Matching and the Choice Between Alternative Pathways for the Origin of Genetic Coding. Int J Mol Sci 2020; 21:E7392. [PMID: 33036401 PMCID: PMC7582391 DOI: 10.3390/ijms21197392] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Received: 07/11/2020] [Revised: 09/28/2020] [Accepted: 09/30/2020] [Indexed: 01/07/2023] Open
Abstract
We recently observed that errors in gene replication and translation could be seen qualitatively to behave analogously to the impedances in acoustical and electronic energy transducing systems. We develop here quantitative relationships necessary to confirm that analogy and to place it into the context of the minimization of dissipative losses of both chemical free energy and information. The formal developments include expressions for the information transferred from a template to a new polymer, Iσ; an impedance parameter, Z; and an effective alphabet size, neff; all of which have non-linear dependences on the fidelity parameter, q, and the alphabet size, n. Surfaces of these functions over the {n,q} plane reveal key new insights into the origin of coding. Our conclusion is that the emergence and evolutionary refinement of information transfer in biology follow principles previously identified to govern physical energy flows, strengthening analogies (i) between chemical self-organization and biological natural selection, and (ii) between the course of evolutionary trajectories and the most probable pathways for time-dependent transitions in physics. Matching the informational impedance of translation to the four-letter alphabet of genes uncovers a pivotal role for the redundancy of triplet codons in preserving as much intrinsic genetic information as possible, especially in early stages when the coding alphabet size was small.
Affiliation(s)
- Peter R. Wills
- Department of Physics and Te Ao Marama Centre for Fundamental Inquiry, University of Auckland, PB 92019, Auckland 1142, New Zealand
- Charles W. Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260, USA
20
Frenkel-Pinter M, Samanta M, Ashkenasy G, Leman LJ. Prebiotic Peptides: Molecular Hubs in the Origin of Life. Chem Rev 2020; 120:4707-4765. [PMID: 32101414 DOI: 10.1021/acs.chemrev.9b00664] [Citation(s) in RCA: 148] [Impact Index Per Article: 37.0] [Indexed: 12/11/2022]
Abstract
The fundamental roles that peptides and proteins play in today's biology makes it almost indisputable that peptides were key players in the origin of life. Insofar as it is appropriate to extrapolate back from extant biology to the prebiotic world, one must acknowledge the critical importance that interconnected molecular networks, likely with peptides as key components, would have played in life's origin. In this review, we summarize chemical processes involving peptides that could have contributed to early chemical evolution, with an emphasis on molecular interactions between peptides and other classes of organic molecules. We first summarize mechanisms by which amino acids and similar building blocks could have been produced and elaborated into proto-peptides. Next, non-covalent interactions of peptides with other peptides as well as with nucleic acids, lipids, carbohydrates, metal ions, and aromatic molecules are discussed in relation to the possible roles of such interactions in chemical evolution of structure and function. Finally, we describe research involving structural alternatives to peptides and covalent adducts between amino acids/peptides and other classes of molecules. We propose that ample future breakthroughs in origin-of-life chemistry will stem from investigations of interconnected chemical systems in which synergistic interactions between different classes of molecules emerge.
Affiliation(s)
- Moran Frenkel-Pinter
- NSF/NASA Center for Chemical Evolution, https://centerforchemicalevolution.com/; School of Chemistry & Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
- Mousumi Samanta
- Department of Chemistry, Ben-Gurion University of the Negev, Beer Sheva 84105, Israel
- Gonen Ashkenasy
- Department of Chemistry, Ben-Gurion University of the Negev, Beer Sheva 84105, Israel
- Luke J Leman
- NSF/NASA Center for Chemical Evolution, https://centerforchemicalevolution.com/; Department of Chemistry, The Scripps Research Institute, La Jolla, California 92037, United States
21
Sarkhel B, Chatterjee A, Das D. Covalent Catalysis by Cross β Amyloid Nanotubes. J Am Chem Soc 2020; 142:4098-4103. [PMID: 32083482 DOI: 10.1021/jacs.9b13517] [Citation(s) in RCA: 60] [Impact Index Per Article: 15.0] [Indexed: 12/21/2022]
Abstract
The binding pockets of extant enzymes feature precise positioning of amino acid residues that facilitate multiple complex transformations exploiting covalent and non-covalent interactions. Reversible covalent anchoring is extensively used as an efficient tool by Nature for activating modern enzymes such as esterases and dehydratases and also for proteins like opsins for the complex process of visual phototransduction. Here we construct paracrystalline amyloid surfaces through the self-propagation of short peptides which offer binding pockets exposed with arrays of imidazoles and lysines. As covalent catalysis is utilized by modern-day enzymes, these homogeneous amyloid nanotubes exploit Schiff imine formation via the exposed lysines to efficiently hydrolyze both activated and inactivated esters. Controls where lysines were mutated with charged residues accessed similar morphologies but did not augment the rate. The designed amyloid microphases thus foreshadow the generation of binding pockets of advanced proteins and have the potential to contribute to the development of functional materials.
Affiliation(s)
- Baishakhi Sarkhel
- Department of Chemical Sciences, Indian Institute of Science Education and Research (IISER) Kolkata, Mohanpur 741246, India
- Ayan Chatterjee
- Department of Chemical Sciences, Indian Institute of Science Education and Research (IISER) Kolkata, Mohanpur 741246, India
- Dibyendu Das
- Department of Chemical Sciences, Indian Institute of Science Education and Research (IISER) Kolkata, Mohanpur 741246, India
22
Mayer C. Life in The Context of Order and Complexity. Life (Basel) 2020; 10:life10010005. [PMID: 31963637 PMCID: PMC7175320 DOI: 10.3390/life10010005] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Received: 12/13/2019] [Revised: 01/06/2020] [Accepted: 01/16/2020] [Indexed: 11/17/2022] Open
Abstract
It is generally accepted that life requires structural complexity. However, a chaotic mixture of organic compounds like the one formed by extensive reaction sequences over time may be extremely complex, but could just represent a static asphalt-like dead end situation. Likewise, it is accepted that life requires a certain degree of structural order. However, even extremely ordered structures like mineral crystals show no tendency to be alive. So neither complexity nor order alone can characterize a living organism. In order to come close to life, and in order for life to develop to higher organisms, both conditions have to be fulfilled and advanced simultaneously. Only a combination of the two requirements, complexity and structural order, can mark the difference between living and dead matter. It is essential for the development of prebiotic chemistry into life and characterizes the course and the result of Darwinian evolution. For this reason, it is worthwhile to define complexity and order as an essential pair of characteristics of life and to use them as fundamental parameters to evaluate early steps in prebiotic development. A combination of high order and high complexity also represents a universal type of biosignature which could be used to identify unknown forms of life or remnants thereof.
Affiliation(s)
- Christian Mayer
- Institute of Physical Chemistry, CENIDE, University of Duisburg-Essen, 45141 Essen, Germany
23
The Ancient Operational Code is Embedded in the Amino Acid Substitution Matrix and aaRS Phylogenies. J Mol Evol 2019; 88:136-150. [PMID: 31781936 DOI: 10.1007/s00239-019-09918-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Received: 08/28/2019] [Accepted: 11/14/2019] [Indexed: 10/25/2022]
Abstract
The underlying structure of the canonical amino acid substitution matrix (aaSM) is examined by considering stepwise improvements in the differential recognition of amino acids according to their chemical properties during the branching history of the two aminoacyl-tRNA synthetase (aaRS) superfamilies. The evolutionary expansion of the genetic code is described by a simple parameterization of the aaSM, in which (i) the number of distinguishable amino acid types, (ii) the matrix dimension and (iii) the number of parameters, each increases by one for each bifurcation in an aaRS phylogeny. Parameterized matrices corresponding to trees in which the size of an amino acid sidechain is the only discernible property behind its categorization as a substrate, exclusively for a Class I or II aaRS, provide a significantly better fit to empirically determined aaSM than trees with random bifurcation patterns. A second split between polar and nonpolar amino acids in each Class effects a vastly greater further improvement. The earliest Class-separated epochs in the phylogenies of the aaRS reflect these enzymes' capability to distinguish tRNAs through the recognition of acceptor stem identity elements via the minor (Class I) and major (Class II) helical grooves, which is how the ancient operational code functioned. The advent of tRNA recognition using the anticodon loop supports the evolution of the optimal map of amino acid chemistry found in the later genetic code, an essentially digital categorization, in which polarity is the major functional property, compensating for the unrefined, haphazard differentiation of amino acids achieved by the operational code.
24
Carter CW, Wills PR. Experimental solutions to problems defining the origin of codon-directed protein synthesis. Biosystems 2019; 183:103979. [PMID: 31176803 PMCID: PMC6693952 DOI: 10.1016/j.biosystems.2019.103979] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Received: 04/25/2019] [Revised: 05/27/2019] [Accepted: 05/29/2019] [Indexed: 12/13/2022]
Abstract
How genetic coding differentiated biology from chemistry is a long-standing challenge in Biology, for which there have been few experimental approaches, despite a wide-ranging speculative literature. We summarize five coordinated areas: experimental characterization of functional approximations to the minimal peptides (protozymes and urzymes) necessary to activate amino acids and acylate tRNA; showing that specificities of these experimental models match those expected from the synthetase Class division; population of disjoint regions of amino acid sequence space via bidirectional coding ancestry of the two synthetase Classes; showing that the phase transfer equilibria of amino acid side chains that form a two-dimensional basis set for protein folding are embedded in patterns of bases in the tRNA acceptor stem and anticodon; and identification of molecular signatures of ancestral synthetases and tRNAs necessary to define the earliest cognate synthetase:tRNA pairs. Together these now compose an extensive experimentally testable paradigm for progress toward understanding the coordinated emergence of the codon table and viable mRNA coding sequences. We briefly discuss recent progress toward identifying the remaining outstanding questions, namely the nature of the earliest amino acid alphabets and the origin of binding discrimination via distinct amino acid sequence-independent protein secondary structures, and how these, too, might be addressed experimentally.
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260, United States
- Peter R Wills
- Department of Physics and Te Ao Marama Centre for Fundamental Inquiry, University of Auckland, PB 92019, Auckland 1142, New Zealand
25
Carter CW, Wills PR. Hierarchical groove discrimination by Class I and II aminoacyl-tRNA synthetases reveals a palimpsest of the operational RNA code in the tRNA acceptor-stem bases. Nucleic Acids Res 2018; 46:9667-9683. [PMID: 30016476 PMCID: PMC6182185 DOI: 10.1093/nar/gky600] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Received: 03/06/2018] [Accepted: 07/12/2018] [Indexed: 01/01/2023] Open
Abstract
Class I and II aaRS recognition of opposite grooves was likely among the earliest determinants fixed in the tRNA acceptor stem bases. A new regression model identifies those determinants in bacterial tRNAs. Integral coefficients relate digital dependent to independent variables with perfect agreement between observed and calculated grooves for all twenty isoaccepting tRNAs. Recognition is mediated by the Discriminator base 73, the first base pair, and base 2 of the acceptor stem. Subsets of these coefficients also identically compute grooves recognized by smaller numbers of aaRS. Thus, the model is hierarchical, suggesting that new rules were added to pre-existing ones as new amino acids joined the coding alphabet. A thermodynamic rationale for the simplest model implies that Class-dependent aaRS secondary structures exploited differential tendencies of the acceptor stem to form the hairpin observed in Class I aaRS•tRNA complexes, enabling the earliest groove discrimination. Curiously, groove recognition also depends explicitly on the identity of base 2 in a manner consistent with the middle bases of the codon table, confirming a hidden ancestry of codon-anticodon pairing in the acceptor stem. That, and the lack of correlation with anticodon bases support prior productive coding interaction of tRNA minihelices with proto-mRNA.
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260, USA
- Peter R Wills
- Department of Physics, Centre for Computational Evolution, and Te Ao Marama Centre for Fundamental Enquiry, University of Auckland, PB 92109, Auckland 1142, New Zealand
26
Carter CW, Wills PR. Class I and II aminoacyl-tRNA synthetase tRNA groove discrimination created the first synthetase-tRNA cognate pairs and was therefore essential to the origin of genetic coding. IUBMB Life 2019; 71:1088-1098. [PMID: 31190358 DOI: 10.1002/iub.2094] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Received: 03/26/2019] [Revised: 04/14/2019] [Accepted: 04/15/2019] [Indexed: 12/20/2022]
Abstract
The genetic code likely arose when a bidirectional gene replicating as a quasi-species began to produce ancestral aminoacyl-tRNA synthetases (aaRS) capable of distinguishing between two distinct sets of amino acids. The synthetase class division therefore necessarily implies a mechanism by which the two ancestral synthetases could also discriminate between two different kinds of tRNA substrates. We used regression methods to uncover the possible patterns of base sequences capable of such discrimination and find that they appear to be related to thermodynamic differences in the relative stabilities of a hairpin necessary for recognition of tRNA substrates by Class I aaRS. The thermodynamic differences appear to be exploited by secondary structural differences between models for the ancestral aaRS called synthetase Urzymes and reinforced by packing of aromatic amino acid side chains against the nonpolar face of the ribose of A76 if and only if the tRNA CCA sequence forms a hairpin. The patterns of bases 1, 2, and 73 and stabilization of the hairpin by structural complementarity with Class I, but not Class II, aaRS Urzymes appear to be necessary and sufficient to have enabled the generation of the first two aaRS-tRNA cognate pairs, and the launch of a rudimentary binary genetic coding related recognizably to contemporary cognate pairs. As a consequence, it seems likely that nonrandom aminoacylation of tRNAs preceded the advent of the tRNA anticodon stem-loop. Consistent with this suggestion, coding rules in the acceptor-stem bases also reveal a palimpsest of the codon-anticodon interaction, as previously proposed.
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
- Peter R Wills
- Department of Physics and Te Ao Marama Centre for Fundamental Inquiry, University of Auckland, Auckland, New Zealand
27
Bilus M, Semanjski M, Mocibob M, Zivkovic I, Cvetesic N, Tawfik DS, Toth-Petroczy A, Macek B, Gruic-Sovulj I. On the Mechanism and Origin of Isoleucyl-tRNA Synthetase Editing against Norvaline. J Mol Biol 2019; 431:1284-1297. [PMID: 30711543 DOI: 10.1016/j.jmb.2019.01.029] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Received: 12/16/2018] [Revised: 01/20/2019] [Accepted: 01/22/2019] [Indexed: 11/17/2022]
Abstract
Aminoacyl-tRNA synthetases (aaRSs), the enzymes responsible for coupling tRNAs to their cognate amino acids, minimize translational errors by intrinsic hydrolytic editing. Here, we compared norvaline (Nva), a linear amino acid not coded for protein synthesis, to the proteinogenic, branched valine (Val) in their propensity to mistranslate isoleucine (Ile) in proteins. We show that in the synthetic site of isoleucyl-tRNA synthetase (IleRS), Nva and Val are activated and transferred to tRNA at similar rates. The efficiency of the synthetic site in pre-transfer editing of Nva and Val also appears to be similar. Post-transfer editing was, however, more rapid with Nva and consequently IleRS misaminoacylates Nva-tRNAIle at slower rate than Val-tRNAIle. Accordingly, an Escherichia coli strain lacking IleRS post-transfer editing misincorporated Nva and Val in the proteome to a similar extent and at the same Ile positions. However, Nva mistranslation inflicted higher toxicity than Val, in agreement with IleRS editing being optimized for hydrolysis of Nva-tRNAIle. Furthermore, we found that the evolutionary-related IleRS, leucyl- and valyl-tRNA synthetases (I/L/VRSs), all efficiently hydrolyze Nva-tRNAs even when editing of Nva seems redundant. We thus hypothesize that editing of Nva-tRNAs had already existed in the last common ancestor of I/L/VRSs, and that the editing domain of I/L/VRSs had primarily evolved to prevent infiltration of Nva into modern proteins.
Affiliation(s)
- Mirna Bilus
- Department of Chemistry, Faculty of Science, University of Zagreb, Zagreb 10000, Croatia
- Maja Semanjski
- Proteome Center Tuebingen, University of Tuebingen, Tuebingen 72076, Germany
- Marko Mocibob
- Department of Chemistry, Faculty of Science, University of Zagreb, Zagreb 10000, Croatia
- Igor Zivkovic
- Department of Chemistry, Faculty of Science, University of Zagreb, Zagreb 10000, Croatia
- Nevena Cvetesic
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, and the MRC London Institute of Medical Sciences, London, W12 0NN, United Kingdom
- Dan S Tawfik
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 76100, Israel
- Agnes Toth-Petroczy
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden 01307, Germany
- Boris Macek
- Proteome Center Tuebingen, University of Tuebingen, Tuebingen 72076, Germany
- Ita Gruic-Sovulj
- Department of Chemistry, Faculty of Science, University of Zagreb, Zagreb 10000, Croatia
28
Abstract
Abundant and essential motifs, such as phosphate-binding loops (P-loops), are presumed to be the seeds of modern enzymes. The Walker-A P-loop is absolutely essential in modern NTPase enzymes, in mediating binding, and transfer of the terminal phosphate groups of NTPs. However, NTPase function depends on many additional active-site residues placed throughout the protein's scaffold. Can motifs such as P-loops confer function in a simpler context? We applied a phylogenetic analysis that yielded a sequence logo of the putative ancestral Walker-A P-loop element: a β-strand connected to an α-helix via the P-loop. Computational design incorporated this element into de novo designed β-α repeat proteins with relatively few sequence modifications. We obtained soluble, stable proteins that unlike modern P-loop NTPases bound ATP in a magnesium-independent manner. Foremost, these simple P-loop proteins avidly bound polynucleotides, RNA, and single-strand DNA, and mutations in the P-loop's key residues abolished binding. Binding appears to be facilitated by the structural plasticity of these proteins, including quaternary structure polymorphism that promotes a combined action of multiple P-loops. Accordingly, oligomerization enabled a 55-aa protein carrying a single P-loop to confer avid polynucleotide binding. Overall, our results show that the P-loop Walker-A motif can be implemented in small and simple β-α repeat proteins, primarily as a polynucleotide binding motif.
29
Liu Y, Sumpter DJT. Mathematical modeling reveals spontaneous emergence of self-replication in chemical reaction systems. J Biol Chem 2018; 293:18854-18863. [PMID: 30282809 PMCID: PMC6295724 DOI: 10.1074/jbc.ra118.003795] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Received: 05/10/2018] [Revised: 09/29/2018] [Indexed: 01/20/2023] Open
Abstract
Explaining the origin of life requires us to elucidate how self-replication arises. To be specific, how can a self-replicating entity develop spontaneously from a chemical reaction system in which no reaction is self-replicating? Previously proposed mathematical models either supply an explicit framework for a minimal living system or consider only catalyzed reactions, and thus fail to provide a comprehensive theory. Here, we set up a general mathematical model for chemical reaction systems that properly accounts for energetics, kinetics, and the conservation law. We found that 1) some systems are collectively catalytic, a mode whereby reactants are transformed into end products with the assistance of intermediates (as in the citric acid cycle), whereas some others are self-replicating, that is, different parts replicate each other and the system self-replicates as a whole (as in the formose reaction, in which sugar is replicated from formaldehyde); 2) side reactions do not always inhibit such systems; 3) randomly chosen chemical universes (namely random artificial chemistries) often contain one or more such systems; 4) it is possible to construct a self-replicating system in which the entropy of some parts spontaneously decreases, in a manner similar to that discussed by Schrödinger; and 5) complex self-replicating molecules can emerge spontaneously and relatively easily from simple chemical reaction systems through a sequence of transitions. Together, these results start to explain the origins of prebiotic evolution.
Affiliation(s)
- Yu Liu
- From the Department of Mathematics, Uppsala University, 75105 Uppsala, Sweden
- David J T Sumpter
- From the Department of Mathematics, Uppsala University, 75105 Uppsala, Sweden
30
Noda-Garcia L, Liebermeister W, Tawfik DS. Metabolite–Enzyme Coevolution: From Single Enzymes to Metabolic Pathways and Networks. Annu Rev Biochem 2018; 87:187-216. [DOI: 10.1146/annurev-biochem-062917-012023] [Citation(s) in RCA: 75] [Impact Index Per Article: 12.5] [Indexed: 11/09/2022]
Abstract
How individual enzymes evolved is relatively well understood. However, individual enzymes rarely confer a physiological advantage on their own. Judging by its current state, the emergence of metabolism seemingly demanded the simultaneous emergence of many enzymes. Indeed, how multicomponent interlocked systems, like metabolic pathways, evolved is largely an open question. This complexity can be unlocked if we assume that survival of the fittest applies not only to genes and enzymes but also to the metabolites they produce. This review develops our current knowledge of enzyme evolution into a wider hypothesis of pathway and network evolution. We describe the current models for pathway evolution and offer an integrative metabolite–enzyme coevolution hypothesis. Our hypothesis addresses the origins of new metabolites and of new enzymes and the order of their recruitment. We aim to not only survey established knowledge but also present open questions and potential ways of addressing them.
Affiliation(s)
- Lianet Noda-Garcia
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 76100, Israel
- Wolfram Liebermeister
- INRA, Unité MaIAGE, 78352 Jouy en Josas, France
- Institute of Biochemistry, Charité Universitätsmedizin, Berlin, 10117 Berlin, Germany
- Dan S. Tawfik
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 76100, Israel
31
Carter CW, Wills PR. Interdependence, Reflexivity, Fidelity, Impedance Matching, and the Evolution of Genetic Coding. Mol Biol Evol 2018; 35:269-286. [PMID: 29077934 PMCID: PMC5850816 DOI: 10.1093/molbev/msx265] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Indexed: 02/07/2023] Open
Abstract
Genetic coding is generally thought to have required ribozymes whose functions were taken over by polypeptide aminoacyl-tRNA synthetases (aaRS). Two discoveries about aaRS and their interactions with tRNA substrates now furnish a unifying rationale for the opposite conclusion: that the key processes of the Central Dogma of molecular biology emerged simultaneously and naturally from simple origins in a peptide•RNA partnership, eliminating the epistemological utility of a prior RNA world. First, the two aaRS classes likely arose from opposite strands of the same ancestral gene, implying a simple genetic alphabet. The resulting inversion symmetries in aaRS structural biology would have stabilized the initial and subsequent differentiation of coding specificities, rapidly promoting diversity in the proteome. Second, amino acid physical chemistry maps onto tRNA identity elements, establishing reflexive, nanoenvironmental sensing in protein aaRS. Bootstrapping of increasingly detailed coding is thus intrinsic to polypeptide aaRS, but impossible in an RNA world. These notions underline the following concepts that contradict gradual replacement of ribozymal aaRS by polypeptide aaRS: 1) aaRS enzymes must be interdependent; 2) reflexivity intrinsic to polypeptide aaRS production dynamics promotes bootstrapping; 3) takeover of RNA-catalyzed aminoacylation by enzymes will necessarily degrade specificity; and 4) the Central Dogma's emergence is most probable when replication and translation error rates remain comparable. These characteristics are necessary and sufficient for the essentially de novo emergence of a coupled gene-replicase-translatase system of genetic coding that would have continuously preserved the functional meaning of genetically encoded protein genes whose phylogenetic relationships match those observed today.
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC
- Peter R Wills
- Department of Physics, University of Auckland, Auckland, New Zealand
32
Opuu V, Silvert M, Simonson T. Computational design of fully overlapping coding schemes for protein pairs and triplets. Sci Rep 2017; 7:15873. [PMID: 29158504 PMCID: PMC5696523 DOI: 10.1038/s41598-017-16221-8] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Received: 08/25/2017] [Accepted: 11/09/2017] [Indexed: 11/26/2022] Open
Abstract
Gene pairs that overlap in their coding regions are rare except in viruses. They may occur transiently in gene creation and are of biotechnological interest. We have examined the possibility of encoding an arbitrary pair of protein domains as a dual gene, with the shorter coding sequence completely embedded in the longer one. For 500 × 500 domain pairs (X, Y), we computationally designed homologous pairs (X', Y') coded this way, using an algorithm that provably maximizes the sequence similarity between (X', Y') and (X, Y). Three schemes were considered, with X' and Y' coded on the same or complementary strands. For 16% of the pairs, an overlapping coding exists where the level of homology of X', Y' to the natural proteins represents an E-value of 10⁻¹⁰ or better. Thus, for an arbitrary domain pair, it is surprisingly easy to design homologous sequences that can be encoded as a fully overlapping gene pair. The algorithm is general and was used to design 200 triple genes, with three proteins encoded by the same DNA segment. The ease of design suggests overlapping genes may have occurred frequently in evolution and could be readily used to compress or constrain artificial genomes.
Affiliation(s)
- Vaitea Opuu
- Laboratoire de Biochimie (CNRS UMR7654), Ecole Polytechnique, Palaiseau, France
- Martin Silvert
- Laboratoire de Biochimie (CNRS UMR7654), Ecole Polytechnique, Palaiseau, France
- Thomas Simonson
- Laboratoire de Biochimie (CNRS UMR7654), Ecole Polytechnique, Palaiseau, France
33
Mignon D, Panel N, Chen X, Fuentes EJ, Simonson T. Computational Design of the Tiam1 PDZ Domain and Its Ligand Binding. J Chem Theory Comput 2017; 13:2271-2289. [PMID: 28394603 DOI: 10.1021/acs.jctc.6b01255] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Indexed: 11/29/2022]
Abstract
PDZ domains direct protein-protein interactions and serve as models for protein design. Here, we optimized a protein design energy function for the Tiam1 and Cask PDZ domains that combines a molecular mechanics energy, Generalized Born solvent, and an empirical unfolded state model. Designed sequences were recognized as PDZ domains by the Superfamily fold recognition tool and had similarity scores comparable to natural PDZ sequences. The optimized model was used to redesign the two PDZ domains, by gradually varying the chemical potential of hydrophobic amino acids; the tendency of each position to lose or gain a hydrophobic character represents a novel hydrophobicity index. We also redesigned four positions in the Tiam1 PDZ domain involved in peptide binding specificity. The calculated affinity differences between designed variants reproduced experimental data and suggest substitutions with altered specificities.
Affiliation(s)
- David Mignon
- Laboratoire de Biochimie (CNRS UMR7654), Ecole Polytechnique, Palaiseau, France
- Nicolas Panel
- Laboratoire de Biochimie (CNRS UMR7654), Ecole Polytechnique, Palaiseau, France
- Xingyu Chen
- Laboratoire de Biochimie (CNRS UMR7654), Ecole Polytechnique, Palaiseau, France
- Ernesto J Fuentes
- Department of Biochemistry, Roy J. & Lucille A. Carver College of Medicine and Holden Comprehensive Cancer Center, University of Iowa, Iowa City, Iowa 52242-1109, United States
- Thomas Simonson
- Laboratoire de Biochimie (CNRS UMR7654), Ecole Polytechnique, Palaiseau, France
34
Wieczorek R, Adamala K, Gasperi T, Polticelli F, Stano P. Small and Random Peptides: An Unexplored Reservoir of Potentially Functional Primitive Organocatalysts. The Case of Seryl-Histidine. Life (Basel) 2017; 7:E19. [PMID: 28397774 PMCID: PMC5492141 DOI: 10.3390/life7020019] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Received: 02/05/2017] [Revised: 04/03/2017] [Accepted: 04/05/2017] [Indexed: 12/11/2022] Open
Abstract
Catalysis is an essential feature of living systems biochemistry, and probably, it played a key role in primordial times, helping to produce more complex molecules from simple ones. However, enzymes, the biocatalysts par excellence, were not available in such an ancient context, and so, instead, small molecule catalysis (organocatalysis) may have occurred. The best candidates for the role of primitive organocatalysts are amino acids and short random peptides, which are believed to have been available in an early period on Earth. In this review, we discuss the occurrence of primordial organocatalysts in the form of peptides, in particular commenting on reports about seryl-histidine dipeptide, which have recently been investigated. Starting from this specific case, we also mention a peptide fragment condensation scenario, as well as other potential roles of peptides in primordial times. The review actually aims to stimulate further investigation on an unexplored field of research, namely one that specifically looks at the catalytic activity of small random peptides with respect to reactions relevant to prebiotic chemistry and early chemical evolution.
Affiliation(s)
- Rafal Wieczorek
- Faculty of Chemistry, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland.
- Katarzyna Adamala
- Department of Genetics, Cell Biology, and Development, University of Minnesota, Minneapolis, MN 55455, USA.
- Tecla Gasperi
- Department of Science, Roma Tre University, Viale G. Marconi 446, 00146 Rome, Italy.
- Fabio Polticelli
- Department of Science, Roma Tre University, Viale G. Marconi 446, 00146 Rome, Italy.
- National Institute of Nuclear Physics, Roma Tre Section, Via della Vasca Navale 84, 00146 Rome, Italy.
- Pasquale Stano
- Department of Biological and Environmental Sciences and Technologies (DiSTeBA), University of Salento, Campus Ecotekne (S.P. 6 Lecce-Monteroni), 73100 Lecce, Italy.
35
Hordijk W. Autocatalytic Sets and RNA Secondary Structure. J Mol Evol 2017; 84:153-158. [PMID: 28378190 DOI: 10.1007/s00239-017-9787-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Received: 12/08/2016] [Accepted: 03/09/2017] [Indexed: 11/30/2022]
Abstract
The dominant paradigm in origin of life research is that of an RNA world. However, despite experimental progress towards the spontaneous formation of RNA, the RNA world hypothesis still has its problems. Here, we introduce a novel computational model of chemical reaction networks based on RNA secondary structure and analyze the existence of autocatalytic sub-networks in random instances of this model, by combining two well-established computational tools. Our main results are that (i) autocatalytic sets are highly likely to exist, even for very small reaction networks and short RNA sequences, and (ii) sequence diversity seems to be a more important factor in the formation of autocatalytic sets than sequence length. These findings could shed new light on the probability of the spontaneous emergence of an RNA world as a network of mutually collaborative ribozymes.
Affiliation(s)
- Wim Hordijk
- Konrad Lorenz Institute for Evolution and Cognition Research, Klosterneuburg, Austria.
36
Iqubal MA, Sharma R, Jheeta S, Kamaluddin. Thermal Condensation of Glycine and Alanine on Metal Ferrite Surface: Primitive Peptide Bond Formation Scenario. Life (Basel) 2017; 7:E15. [PMID: 28346388 PMCID: PMC5492137 DOI: 10.3390/life7020015] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Received: 11/23/2016] [Revised: 03/13/2017] [Accepted: 03/24/2017] [Indexed: 11/17/2022] Open
Abstract
The amino acid condensation reaction on a heterogeneous mineral surface has been regarded as one of the important pathways for peptide bond formation. Keeping this in view, we have studied the oligomerization of the simple amino acids, glycine and alanine, on nickel ferrite (NiFe₂O₄), cobalt ferrite (CoFe₂O₄), copper ferrite (CuFe₂O₄), zinc ferrite (ZnFe₂O₄), and manganese ferrite (MnFe₂O₄) nanoparticle surfaces, in the temperature range from 50-120 °C for 1-35 days, without applying any wetting/drying cycles. Among the metal ferrites tested for their catalytic activity, NiFe₂O₄ produced the highest yield of products by oligomerizing glycine to the trimer level and alanine to the dimer level, whereas MnFe₂O₄ was the least efficient catalyst, producing the lowest yield of products, as well as shorter oligomers of amino acids under the same set of experimental conditions. It produced primarily diketopiperazine (Ala) with a trace amount of alanine dimer from alanine condensation, while glycine was oligomerized to the dimer level. The trend in product formation is in accordance with the surface area of the minerals used. In the present study, a temperature as low as 50 °C can favor peptide bond formation, which is important in the sense that the condensation process is highly feasible without any sort of localized heat that may originate from volcanoes or hydrothermal vents. However, at a high temperature of 120 °C, formation of the anhydrides of glycine and alanine is favored, while the optimum temperature for the highest yield of product formation was found to be 90 °C.
Affiliation(s)
- Md Asif Iqubal
- Department of Chemistry, Indian Institute of Technology Roorkee, Roorkee 247 667, Uttarakhand, India.
- Rachana Sharma
- Department of Chemistry, Indian Institute of Technology Roorkee, Roorkee 247 667, Uttarakhand, India.
- Sohan Jheeta
- Network of Researchers on Horizontal Gene Transfer and Last Universal, Common Ancestor Leeds, Leeds LS7 3RB, UK.
- Kamaluddin
- Department of Chemistry, Indian Institute of Technology Roorkee, Roorkee 247 667, Uttarakhand, India.
37
Carter CW. High-Dimensional Mutant and Modular Thermodynamic Cycles, Molecular Switching, and Free Energy Transduction. Annu Rev Biophys 2017; 46:433-453. [PMID: 28375734 DOI: 10.1146/annurev-biophys-070816-033811] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Indexed: 12/23/2022]
Abstract
Understanding how distinct parts of proteins produce coordinated behavior has driven and continues to drive advances in protein science and enzymology. However, despite consensus about the conceptual basis for allostery, the idiosyncratic nature of allosteric mechanisms resists general approaches. Computational methods can identify conformational transition states from structural changes, revealing common switching mechanisms that impose multistate behavior. Thermodynamic cycles use factorial perturbations to measure coupling energies between side chains in molecular switches that mediate shear during domain motion. Such cycles have now been complemented by modular cycles that measure energetic coupling between separable domains. For one model system, energetic coupling between domains has been shown to be quantitatively equivalent to that between dynamic side chains. Linkages between domain motion, switching residues, and catalysis make nucleoside triphosphate hydrolysis conditional on domain movement, confirming an essential yet neglected aspect of free energy transduction and suggesting the potential generality of these studies.
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27514
38
Carter CW. Coding of Class I and II Aminoacyl-tRNA Synthetases. Advances in Experimental Medicine and Biology 2017; 966:103-148. [PMID: 28828732 PMCID: PMC5927602 DOI: 10.1007/5584_2017_93] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Indexed: 12/20/2022]
Abstract
The aminoacyl-tRNA synthetases and their cognate transfer RNAs translate the universal genetic code. The twenty canonical amino acids are sufficiently diverse to create a selective advantage for dividing amino acid activation between two distinct, apparently unrelated superfamilies of synthetases, Class I amino acids being generally larger and less polar, Class II amino acids smaller and more polar. Biochemical, bioinformatic, and protein engineering experiments support the hypothesis that the two Classes descended from opposite strands of the same ancestral gene. Parallel experimental deconstructions of Class I and II synthetases reveal parallel losses in catalytic proficiency at two novel modular levels (protozymes and Urzymes) associated with the evolution of catalytic activity. Bi-directional coding supports an important unification of the proteome; affords a genetic relatedness metric (middle base-pairing frequencies in sense/antisense alignments) that probes more deeply into the evolutionary history of translation than do single multiple sequence alignments; and has facilitated the analysis of hitherto unknown coding relationships in tRNA sequences. Reconstruction of native synthetases by modular thermodynamic cycles facilitated by domain engineering emphasizes the subtlety associated with achieving high specificity, shedding new light on allosteric relationships in contemporary synthetases. Synthetase Urzyme structural biology suggests that they are catalytically active molten globules, broadening the potential manifold of polypeptide catalysts accessible to primitive genetic coding and motivating revisions of the origins of catalysis. Finally, bi-directional genetic coding of some of the oldest genes in the proteome places major limitations on the likelihood that any RNA World preceded the origins of coded proteins.
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599-7260, USA.
39
Oprescu SN, Griffin LB, Beg AA, Antonellis A. Predicting the pathogenicity of aminoacyl-tRNA synthetase mutations. Methods 2016; 113:139-151. [PMID: 27876679 DOI: 10.1016/j.ymeth.2016.11.013] [Citation(s) in RCA: 42] [Impact Index Per Article: 5.3] [Received: 08/16/2016] [Revised: 11/12/2016] [Accepted: 11/18/2016] [Indexed: 10/24/2022] Open
Abstract
Aminoacyl-tRNA synthetases (ARSs) are ubiquitously expressed, essential enzymes responsible for charging tRNA with cognate amino acids, the first step in protein synthesis. ARSs are required for protein translation in the cytoplasm and mitochondria of all cells. Surprisingly, mutations in 28 of the 37 nuclear-encoded human ARS genes have been linked to a variety of recessive and dominant tissue-specific disorders. Current data indicate that impaired enzyme function is a robust predictor of the pathogenicity of ARS mutations. However, experimental model systems that distinguish between pathogenic and non-pathogenic ARS variants are required for implicating newly identified ARS mutations in disease. Here, we outline strategies to assist in predicting the pathogenicity of ARS variants and urge cautious evaluation of genetic and functional data prior to linking an ARS mutation to a human disease phenotype.
Affiliation(s)
- Stephanie N Oprescu
- Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI, United States
- Laurie B Griffin
- Cellular and Molecular Biology Program, University of Michigan Medical School, Ann Arbor, MI, United States; Medical Scientist Training Program, and University of Michigan Medical School, Ann Arbor, MI, United States
- Asim A Beg
- Department of Pharmacology, University of Michigan Medical School, Ann Arbor, MI, United States
- Anthony Antonellis
- Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI, United States; Cellular and Molecular Biology Program, University of Michigan Medical School, Ann Arbor, MI, United States.
40
Sapienza PJ, Li L, Williams T, Lee AL, Carter CW. An Ancestral Tryptophanyl-tRNA Synthetase Precursor Achieves High Catalytic Rate Enhancement without Ordered Ground-State Tertiary Structures. ACS Chem Biol 2016; 11:1661-8. [PMID: 27008438 PMCID: PMC5461432 DOI: 10.1021/acschembio.5b01011] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Indexed: 11/28/2022]
Abstract
Urzymes (short, active core modules derived from enzyme superfamilies) prepared from the two aminoacyl-tRNA synthetase (aaRS) classes contain only the modules shared by all related family members. They have been described as models for ancestral forms. Understanding them currently depends on inferences drawn from the crystal structures of the full-length enzymes. As aaRS Urzymes lack much of the mass of modern aaRS's, retaining only a small portion of the hydrophobic cores of the full-length enzymes, it is desirable to characterize their structures. We report preliminary characterization of ¹⁵N tryptophanyl-tRNA synthetase Urzyme by heteronuclear single quantum coherence (HSQC) NMR spectroscopy supplemented by circular dichroism, thermal melting, and induced fluorescence of bound dye. The limited dispersion of ¹H chemical shifts (0.5 ppm) is inconsistent with a narrow ensemble of well-packed structures in either free or substrate-bound forms, although the number of resonances from the bound state increases, indicating a modest, ligand-dependent gain in structure. Circular dichroism spectroscopy shows the presence of helices and evidence of cold denaturation, and all ligation states induce Sypro Orange fluorescence at ambient temperatures. Although the term "molten globule" is difficult to define precisely, these characteristics are consistent with most such definitions. Active-site titration shows that a majority of molecules retain ∼60% of the transition state stabilization free energy observed in modern synthetases. In contrast to the conventional view that enzymes require stable tertiary structures, we conclude that a highly flexible ground-state ensemble can nevertheless bind tightly to the transition state for amino acid activation.
Affiliation(s)
- Paul J. Sapienza
- Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy
- Li Li
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, 25799
- Tishan Williams
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, 25799
- Andrew L. Lee
- Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy
- Charles W. Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, 25799
41
Coevolution Theory of the Genetic Code at Age Forty: Pathway to Translation and Synthetic Life. Life (Basel) 2016; 6:life6010012. [PMID: 26999216 PMCID: PMC4810243 DOI: 10.3390/life6010012] [Citation(s) in RCA: 51] [Impact Index Per Article: 6.4] [Received: 01/08/2016] [Revised: 02/26/2016] [Accepted: 03/04/2016] [Indexed: 11/17/2022] Open
Abstract
The origins of the components of genetic coding are examined in the present study. Genetic information arose from replicator induction by metabolite in accordance with the metabolic expansion law. Messenger RNA and transfer RNA stemmed from a template for binding the aminoacyl-RNA synthetase ribozymes employed to synthesize peptide prosthetic groups on RNAs in the Peptidated RNA World. Coevolution of the genetic code with amino acid biosynthesis generated tRNA paralogs that identify a last universal common ancestor (LUCA) of extant life close to Methanopyrus, which in turn points to archaeal tRNA introns as the most primitive introns and the anticodon usage of Methanopyrus as an ancient mode of wobble. The prediction of the coevolution theory of the genetic code that the code should be a mutable code has led to the isolation of optional and mandatory synthetic life forms with altered protein alphabets.
42
Wills PR. The generation of meaningful information in molecular systems. Philosophical Transactions. Series A, Mathematical, Physical, and Engineering Sciences 2016; 374:rsta.2015.0066. [PMID: 26857673 DOI: 10.1098/rsta.2015.0066] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Accepted: 07/17/2015] [Indexed: 06/05/2023]
Abstract
The physico-chemical processes occurring inside cells are under the computational control of genetic (DNA) and epigenetic (internal structural) programming. The origin and evolution of genetic information (nucleic acid sequences) is reasonably well understood, but scant attention has been paid to the origin and evolution of the molecular biological interpreters that give phenotypic meaning to the sequence information that is quite faithfully replicated during cellular reproduction. The near universality and age of the mapping from nucleotide triplets to amino acids embedded in the functionality of the protein synthetic machinery speaks to the early development of a system of coding which is still extant in every living organism. We take the origin of genetic coding as a paradigm of the emergence of computation in natural systems, focusing on the requirement that the molecular components of an interpreter be synthesized autocatalytically. Within this context, it is seen that interpreters of increasing complexity are generated by series of transitions through stepped dynamic instabilities (non-equilibrium phase transitions). The early phylogeny of the amino acyl-tRNA synthetase enzymes is discussed in such terms, leading to the conclusion that the observed optimality of the genetic code is a natural outcome of the processes of self-organization that produced it.
Affiliation(s)
- Peter R Wills
- Department of Physics, University of Auckland, PB 92019, Auckland 1142, Aotearoa, New Zealand
43
Chandrasekaran SN, Das J, Dokholyan NV, Carter CW. A modified PATH algorithm rapidly generates transition states comparable to those found by other well established algorithms. Structural Dynamics (Melville, N.Y.) 2016; 3:012101. [PMID: 26958584 PMCID: PMC4769271 DOI: 10.1063/1.4941599] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Received: 12/01/2015] [Accepted: 01/22/2016] [Indexed: 06/05/2023]
Abstract
PATH rapidly computes a path and a transition state between crystal structures by minimizing the Onsager-Machlup action. It requires input parameters whose range of values can generate different transition-state structures that cannot be uniquely compared with those generated by other methods. We outline modifications to estimate these input parameters to circumvent these difficulties and validate the PATH transition states by showing consistency between transition-states derived by different algorithms for unrelated protein systems. Although functional protein conformational change trajectories are to a degree stochastic, they nonetheless pass through a well-defined transition state whose detailed structural properties can rapidly be identified using PATH.
Affiliation(s)
- Srinivas Niranj Chandrasekaran
- Department of Biochemistry and Biophysics, The University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7260, USA
- Jhuma Das
- Department of Biochemistry and Biophysics, The University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7260, USA
- Nikolay V Dokholyan
- Department of Biochemistry and Biophysics, The University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7260, USA
- Charles W Carter
- Department of Biochemistry and Biophysics, The University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7260, USA
44
Carter CW, Wolfenden R. tRNA acceptor-stem and anticodon bases embed separate features of amino acid chemistry. RNA Biol 2015; 13:145-51. [PMID: 26595350 DOI: 10.1080/15476286.2015.1112488] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Indexed: 10/22/2022] Open
Abstract
The universal genetic code is a translation table by which nucleic acid sequences can be interpreted as polypeptides with a wide range of biological functions. That information is used by aminoacyl-tRNA synthetases to translate the code. Moreover, amino acid properties dictate protein folding. We recently reported that digital correlation techniques could identify patterns in tRNA identity elements that govern recognition by synthetases. Our analysis, and the functionality of truncated synthetases that cannot recognize the tRNA anticodon, support the conclusion that the tRNA acceptor stem houses an independent code for the same 20 amino acids that likely functioned earlier in the emergence of genetics. The acceptor-stem code, related to amino acid size, is distinct from a code in the anticodon that is related to amino acid polarity. Details of the acceptor-stem code suggest that it was useful in preserving key properties of stereochemically-encoded peptides that had developed the capacity to interact catalytically with RNA. The quantitative embedding of the chemical properties of amino acids into tRNA bases has implications for the origins of molecular biology.
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260
- Richard Wolfenden
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260
45
Martinez-Rodriguez L, Erdogan O, Jimenez-Rodriguez M, Gonzalez-Rivera K, Williams T, Li L, Weinreb V, Collier M, Chandrasekaran SN, Ambroggio X, Kuhlman B, Carter CW. Functional Class I and II Amino Acid-activating Enzymes Can Be Coded by Opposite Strands of the Same Gene. J Biol Chem 2015; 290:19710-25. [PMID: 26088142 DOI: 10.1074/jbc.m115.642876] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.1] [Received: 02/02/2015] [Indexed: 01/11/2023] Open
Abstract
Aminoacyl-tRNA synthetases (aaRS) catalyze both chemical steps that translate the universal genetic code. Rodin and Ohno offered an explanation for the existence of two aaRS classes, observing that codons for the most highly conserved Class I active-site residues are anticodons for corresponding Class II active-site residues. They proposed that the two classes arose simultaneously, by translation of opposite strands from the same gene. We have characterized wild-type 46-residue peptides containing ATP-binding sites of Class I and II synthetases and those coded by a gene designed by Rosetta to encode the corresponding peptides on opposite strands. Catalysis by WT and designed peptides is saturable, and the designed peptides are sensitive to active-site residue mutation. All have comparable apparent second-order rate constants of 2.9-7.0 × 10⁻³ M⁻¹ s⁻¹, or ∼750,000-1,300,000 times the uncatalyzed rate. The activities of the two complementary peptides demonstrate that the unique information in a gene can have two functional interpretations, one from each complementary strand. The peptides contain phylogenetic signatures of longer, more sophisticated catalysts we call Urzymes and are short enough to bridge the gap between them and simpler uncoded peptides. Thus, they directly substantiate the sense/antisense coding ancestry of Class I and II aaRS. Furthermore, designed 46-mers achieve similar catalytic proficiency to wild-type 46-mers by significant increases in both kcat and Km values, supporting suggestions that the earliest peptide catalysts activated ATP for biosynthetic purposes.
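The sense/antisense reading described in this abstract can be illustrated with a minimal Python sketch that translates a DNA string and its reverse complement using the standard codon table. The function names are hypothetical, and the input in the usage note is an arbitrary toy sequence, not one of the designed genes from the paper.

```python
# Standard genetic code, laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the opposite strand, written 5' to 3'."""
    return dna.translate(COMPLEMENT)[::-1]


def translate(dna):
    """Translate complete codons in frame 0; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def both_strand_products(sense_dna):
    """Hypothetical helper: peptides read from the sense and antisense strands."""
    return translate(sense_dna), translate(reverse_complement(sense_dna))
```

For the toy input `"ATGGCC"`, `both_strand_products` returns `("MA", "GH")`: the sense strand reads ATG GCC, while the opposite strand reads GGC CAT. The same six bases therefore carry two different peptide readings, one per strand.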
Affiliation(s)
- Luis Martinez-Rodriguez
- From the Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7260
| | - Ozgün Erdogan
- From the Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7260
| | - Mariel Jimenez-Rodriguez
- From the Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7260
| | - Katiria Gonzalez-Rivera
- From the Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7260
| | - Tishan Williams
- From the Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7260
| | - Li Li
- From the Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7260
| | - Violetta Weinreb
- From the Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7260
| | - Martha Collier
- From the Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7260
| | - Srinivas Niranj Chandrasekaran
- From the Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7260
| | - Xavier Ambroggio
- From the Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7260
| | - Brian Kuhlman
- From the Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7260
| | - Charles W Carter
- From the Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7260
| |
|
46
|
tRNA acceptor stem and anticodon bases form independent codes related to protein folding. Proc Natl Acad Sci U S A 2015; 112:7489-94. [PMID: 26034281 DOI: 10.1073/pnas.1507569112] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.1] [Indexed: 11/18/2022] Open
Abstract
Aminoacyl-tRNA synthetases recognize tRNA anticodon and 3' acceptor stem bases. Synthetase Urzymes acylate cognate tRNAs even without anticodon-binding domains, in keeping with the possibility that acceptor stem recognition preceded anticodon recognition. Representing tRNA identity elements with two bits per base, we show that the anticodon encodes the hydrophobicity of each amino acid side-chain as represented by its water-to-cyclohexane distribution coefficient, and this relationship holds true over the entire temperature range of liquid water. The acceptor stem codes preferentially for the surface area or size of each side-chain, as represented by its vapor-to-cyclohexane distribution coefficient. These orthogonal experimental properties are both necessary to account satisfactorily for the exposed surface area of amino acids in folded proteins. Moreover, the acceptor stem codes correctly for β-branched and carboxylic acid side-chains, whereas the anticodon codes for a wider range of such properties, but not for size or β-branching. These and other results suggest that genetic coding of 3D protein structures evolved in distinct stages, based initially on the size of the amino acid and later on its compatibility with globular folding in water.
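The phrase "two bits per base" can be made concrete with a small Python sketch. The abstract does not state which bit pattern corresponds to which base, so the assignment below is an arbitrary assumption for illustration only, and `BASE_BITS` and `encode_bases` are hypothetical names.

```python
# Hypothetical two-bit assignment; the abstract does not specify the
# actual mapping, so this choice is an assumption for illustration.
BASE_BITS = {"A": (0, 0), "C": (0, 1), "G": (1, 0), "U": (1, 1)}


def encode_bases(bases):
    """Flatten a string of RNA bases into a list of bits, two per base."""
    bits = []
    for base in bases.upper():
        bits.extend(BASE_BITS[base])
    return bits
```

Under this assumed mapping, `encode_bases("GAA")` returns `[1, 0, 0, 0, 0, 0]`. A vector of this kind, built from anticodon or acceptor-stem bases, is the sort of numerical descriptor that could then be compared against a measured amino acid property such as a distribution coefficient.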
|
47
|
Nontemplate-driven polymers: clues to a minimal form of organization closure at the early stages of living systems. Theory Biosci 2015; 134:47-64. [DOI: 10.1007/s12064-015-0209-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Received: 12/12/2014] [Accepted: 04/16/2015] [Indexed: 12/27/2022]
|
48
|
Abstract
An RNA World that predated the modern world of polypeptide and polynucleotide is one of the most widely accepted models in origin of life research. In this model, the translation system shepherded the RNA World into the extant biology of DNA, RNA, and protein. Here, we examine the RNA World Hypothesis in the context of increasingly detailed information available about the origins, evolution, functions, and mechanisms of the translation system. We conclude that the translation system presents critical challenges to RNA World Hypotheses. Firstly, a timeline of the RNA World is problematic when the ribosome is incorporated. The mechanism of peptidyl transfer of the ribosome appears distinct from evolved enzymes, signaling origins in a chemical rather than biological milieu. Secondly, we have no evidence that the basic biochemical toolset of life is subject to substantive change by Darwinian evolution, as required for the transition from the RNA world to extant biology. Thirdly, we do not see specific evidence for biological takeover of ribozyme function by protein enzymes. Finally, we can find no basis for preservation of the ribosome as ribozyme or the universality of translation, if it were the case that other information transducing ribozymes, such as ribozyme polymerases, were replaced by protein analogs and erased from the phylogenetic record. We suggest that an updated model of the RNA World should address the current state of knowledge of the translation system.
|
49
|
Francis BR. The Hypothesis that the Genetic Code Originated in Coupled Synthesis of Proteins and the Evolutionary Predecessors of Nucleic Acids in Primitive Cells. Life (Basel) 2015; 5:467-505. [PMID: 25679748 PMCID: PMC4390864 DOI: 10.3390/life5010467] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Received: 12/08/2014] [Accepted: 02/02/2015] [Indexed: 12/22/2022] Open
Abstract
Although analysis of the genetic code has allowed explanations for its evolution to be proposed, little evidence exists in biochemistry and molecular biology to offer an explanation for the origin of the genetic code. In particular, two features of biology make the origin of the genetic code difficult to understand. First, nucleic acids are highly complicated polymers requiring numerous enzymes for biosynthesis. Secondly, proteins have a simple backbone with a set of 20 different amino acid side chains synthesized by a highly complicated ribosomal process in which mRNA sequences are read in triplets. Apparently, both nucleic acid and protein syntheses have extensive evolutionary histories. Supporting these processes is a complex metabolism and at the hub of metabolism are the carboxylic acid cycles. This paper advances the hypothesis that the earliest predecessor of the nucleic acids was a β-linked polyester made from malic acid, a highly conserved metabolite in the carboxylic acid cycles. In the β-linked polyester, the side chains are carboxylic acid groups capable of forming interstrand double hydrogen bonds. Evolution of the nucleic acids involved changes to the backbone and side chain of poly(β-D-malic acid). Conversion of the side chain carboxylic acid into a carboxamide, or into a longer side chain bearing a carboxamide group, allowed information polymers to form amide pairs between polyester chains. Aminoacylation of the hydroxyl groups of malic acid and its derivatives with simple amino acids such as glycine and alanine allowed coupling of polyester synthesis and protein synthesis. Use of polypeptides containing glycine and L-alanine for activation of two different monomers with either glycine or L-alanine allowed simple coded autocatalytic synthesis of polyesters and polypeptides and established the first genetic code.
A primitive cell capable of supporting electron transport, thioester synthesis, reduction reactions, and synthesis of polyesters and polypeptides is proposed. The cell consists of an iron-sulfide particle enclosed by tholin, a heterogeneous organic material that is produced by Miller-Urey type experiments that simulate conditions on the early Earth. As the synthesis of nucleic acids evolved from β-linked polyesters, the singlet coding system for replication evolved into a four nucleotide/four amino acid process (AMP = aspartic acid, GMP = glycine, UMP = valine, CMP = alanine) and then into the triplet ribosomal process that permitted multiple copies of protein to be synthesized independent of replication. This hypothesis reconciles the “genetics first” and “metabolism first” approaches to the origin of life and explains why there are four bases in the genetic alphabet.
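The singlet (one nucleotide, one amino acid) code proposed in this abstract can be written out as a lookup table. The following Python sketch uses only the four pairings stated above; the names `SINGLET_CODE` and `decode_singlet`, and the example sequence, are hypothetical.

```python
# Pairings as stated in the abstract: AMP = aspartic acid, GMP = glycine,
# UMP = valine, CMP = alanine. Three-letter amino acid abbreviations are used.
SINGLET_CODE = {"A": "Asp", "G": "Gly", "U": "Val", "C": "Ala"}


def decode_singlet(nucleotides):
    """Read a nucleotide string one base at a time under the singlet code."""
    return [SINGLET_CODE[base] for base in nucleotides.upper()]
```

For example, `decode_singlet("GCAU")` returns `["Gly", "Ala", "Asp", "Val"]`. In contrast to the triplet ribosomal process, each position here specifies exactly one residue, so such a code can address at most four amino acids.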
Affiliation(s)
- Brian R Francis
- Department of Molecular Biology, University of Wyoming, Laramie, WY 82071, USA.
| |
|
50
|
Carter CW. What RNA World? Why a Peptide/RNA Partnership Merits Renewed Experimental Attention. Life (Basel) 2015; 5:294-320. [PMID: 25625599 PMCID: PMC4390853 DOI: 10.3390/life5010294] [Citation(s) in RCA: 54] [Impact Index Per Article: 6.0] [Received: 11/24/2014] [Accepted: 01/12/2015] [Indexed: 12/16/2022] Open
Abstract
We review arguments that biology emerged from a reciprocal partnership in which small ancestral oligopeptides and oligonucleotides initially both contributed rudimentary information coding and catalytic rate accelerations, and that the superior information-bearing qualities of RNA and the superior catalytic potential of proteins emerged from such complexes only with the gradual invention of the genetic code. A coherent structural basis for that scenario was articulated nearly a decade before the demonstration of catalytic RNA. Parallel hierarchical catalytic repertoires for increasingly highly conserved sequences from the two synthetase classes now increase the likelihood that they arose as translation products from opposite strands of a single gene. Sense/antisense coding affords a new bioinformatic metric for phylogenetic relationships much more distant than can be reconstructed from multiple sequence alignments of a single superfamily. Evidence for distinct coding properties in tRNA acceptor stems and anticodons, and experimental demonstration that the two synthetase family ATP binding sites can indeed be coded by opposite strands of the same gene supplement these biochemical and bioinformatic data, establishing a solid basis for key intermediates on a path from simple, stereochemically coded, reciprocally catalytic peptide/RNA complexes through the earliest peptide catalysts to contemporary aminoacyl-tRNA synthetases. That scenario documents a path to increasing complexity that obviates the need for a single polymer to act both catalytically and as an informational molecule.
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260, USA.
| |
|