1
|
Koper K, Han SW, Kothadia R, Salamon H, Yoshikuni Y, Maeda HA. Multisubstrate specificity shaped the complex evolution of the aminotransferase family across the tree of life. Proc Natl Acad Sci U S A 2024; 121:e2405524121. [PMID: 38885378 PMCID: PMC11214133 DOI: 10.1073/pnas.2405524121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2024] [Accepted: 05/14/2024] [Indexed: 06/20/2024] Open
Abstract
Aminotransferases (ATs) are an ancient enzyme family that play central roles in core nitrogen metabolism, essential to all organisms. However, many of the AT enzyme functions remain poorly defined, limiting our fundamental understanding of the nitrogen metabolic networks that exist in different organisms. Here, we traced the deep evolutionary history of the AT family by analyzing AT enzymes from 90 species spanning the tree of life (ToL). We found that each organism has maintained a relatively small and constant number of ATs. Mapping the distribution of ATs across the ToL uncovered that many essential AT reactions are carried out by taxon-specific AT enzymes due to wide-spread nonorthologous gene displacements. This complex evolutionary history explains the difficulty of homology-based AT functional prediction. Biochemical characterization of diverse aromatic ATs further revealed their broad substrate specificity, unlike other core metabolic enzymes that evolved to catalyze specific reactions today. Interestingly, however, we found that these AT enzymes that diverged over billion years share common signatures of multisubstrate specificity by employing different nonconserved active site residues. These findings illustrate that AT family enzymes had leveraged their inherent substrate promiscuity to maintain a small yet distinct set of multifunctional AT enzymes in different taxa. This evolutionary history of versatile ATs likely contributed to the establishment of robust and diverse nitrogen metabolic networks that exist throughout the ToL. The study provides a critical foundation to systematically determine diverse AT functions and underlying nitrogen metabolic networks across the ToL.
Collapse
Affiliation(s)
- Kaan Koper
- Department of Botany, University of Wisconsin-Madison, Madison, WI53706
| | - Sang-Woo Han
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720
- Department of Biotechnology, Konkuk University, Chungju27478, South Korea
| | - Ramani Kothadia
- The US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA94720
| | - Hugh Salamon
- The US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA94720
| | - Yasuo Yoshikuni
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720
- The US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA94720
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA94720
- Center for Advanced Bioenergy and Bioproducts Innovation, Lawrence Berkeley National Laboratory, Berkeley, CA94720
- Global Center for Food, Land, and Water Resources, Research Faculty of Agriculture, Hokkaido University, Hokkaido, Japan 060-8589
- Institute of Global Innovation Research, Tokyo University of Agriculture and Technology, Tokyo183-8538, Japan
| | - Hiroshi A. Maeda
- Department of Botany, University of Wisconsin-Madison, Madison, WI53706
| |
Collapse
|
2
|
Wehbi S, Wheeler A, Morel B, Minh BQ, Lauretta DS, Masel J. Order of amino acid recruitment into the genetic code resolved by Last Universal Common Ancestor's protein domains. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.13.589375. [PMID: 38659899 PMCID: PMC11042313 DOI: 10.1101/2024.04.13.589375] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/26/2024]
Abstract
The current "consensus" order in which amino acids were added to the genetic code is based on potentially biased criteria such as absence of sulfur-containing amino acids from the Urey-Miller experiment which lacked sulfur. Even if inferred perfectly, abiotic abundance might not reflect abundance in the organisms in which the genetic code evolved. Here, we instead exploit the fact that proteins that emerged prior to the genetic code's completion are likely enriched in early amino acids and depleted in late amino acids. We identify the most ancient protein-coding sequences born prior to the archaeal-bacterial split. Amino acid usage in protein sequences whose ancestors date back to a single homolog in the Last Universal Common Ancestor (LUCA) largely matches the consensus order. However, our findings indicate that metal-binding (cysteine and histidine) and sulfur-containing (cysteine and methionine) amino acids were added to the genetic code much earlier than previously thought. Surprisingly, even more ancient protein sequences - those that had already diversified into multiple distinct copies in LUCA - show a different pattern to single copy LUCA sequences: significantly less depleted in the late amino acids tryptophan and tyrosine, and enriched rather than depleted in phenylalanine. This is compatible with at least some of these sequences predating the current genetic code. Their distinct enrichment patterns thus provide hints about earlier, alternative genetic codes.
Collapse
Affiliation(s)
- Sawsan Wehbi
- Genetics Graduate Interdisciplinary Program, University of Arizona, Tucson, Arizona, 85721, USA
| | - Andrew Wheeler
- Genetics Graduate Interdisciplinary Program, University of Arizona, Tucson, Arizona, 85721, USA
| | - Benoit Morel
- Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
| | - Bui Quang Minh
- School of Computing, Australian National University, Canberra, ACT, Australia
| | - Dante S Lauretta
- Lunar and Planetary Laboratory, University of Arizona, Tucson, AZ 85721, USA
| | - Joanna Masel
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ, 85721, USA
| |
Collapse
|
3
|
Williamson MP. Autocatalytic Selection as a Driver for the Origin of Life. Life (Basel) 2024; 14:590. [PMID: 38792611 PMCID: PMC11122578 DOI: 10.3390/life14050590] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2024] [Revised: 04/29/2024] [Accepted: 05/02/2024] [Indexed: 05/26/2024] Open
Abstract
Darwin's theory of evolution by natural selection was revolutionary because it provided a mechanism by which variation could be selected. This mechanism can only operate on living systems and thus cannot be applied to the origin of life. Here, we propose a viable alternative mechanism for prebiotic systems: autocatalytic selection, in which molecules catalyze reactions and processes that lead to increases in their concentration. Crucially, this provides a driver for increases in concentrations of molecules to a level that permits prebiotic metabolism. We show how this can produce high levels of amino acids, sugar phosphates, nucleotides and lipids and then lead on to polymers. Our outline is supported by a set of guidelines to support the identification of the most likely prebiotic routes. Most of the steps in this pathway are already supported by experimental results. These proposals generate a coherent and viable set of pathways that run from established Hadean geochemistry to the beginning of life.
Collapse
Affiliation(s)
- Mike P Williamson
- School of Biosciences, University of Sheffield, Sheffield S10 2TN, UK
| |
Collapse
|
4
|
Lefèvre-Morand RYL, Nikel PI, Acevedo-Rocha CG. How many Mutations are needed to Evolve the Chemical Makeup of a Synthetic Cell? Chembiochem 2024; 25:e202300829. [PMID: 38226957 DOI: 10.1002/cbic.202300829] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Revised: 01/13/2024] [Accepted: 01/16/2024] [Indexed: 01/17/2024]
Abstract
The chemical evolution of a synthetic cell endowed with a synthetic amino acid as building block, analog to tryptophan, required the emergence of key mutations in genes involved in, inter alia, the general stress response, amino acid metabolism, stringent response, and chemotaxis. Understanding adaptation mechanisms to non-canonical biomass components will inform strategies for engineering synthetic metabolic pathways and cells.
Collapse
Affiliation(s)
- Rodrigue Yves Louis Lefèvre-Morand
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet 220, 2800, Kongens Lyngby, Denmark
| | - Pablo Iván Nikel
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet 220, 2800, Kongens Lyngby, Denmark
| | - Carlos G Acevedo-Rocha
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet 220, 2800, Kongens Lyngby, Denmark
| |
Collapse
|
5
|
Douglas J, Bouckaert R, Carter CW, Wills P. Enzymic recognition of amino acids drove the evolution of primordial genetic codes. Nucleic Acids Res 2024; 52:558-571. [PMID: 38048305 PMCID: PMC10810186 DOI: 10.1093/nar/gkad1160] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Revised: 10/28/2023] [Accepted: 11/20/2023] [Indexed: 12/06/2023] Open
Abstract
How genetic information gained its exquisite control over chemical processes needed to build living cells remains an enigma. Today, the aminoacyl-tRNA synthetases (AARS) execute the genetic codes in all living systems. But how did the AARS that emerged over three billion years ago as low-specificity, protozymic forms then spawn the full range of highly-specific enzymes that distinguish between 22 diverse amino acids? A phylogenetic reconstruction of extant AARS genes, enhanced by analysing modular acquisitions, reveals six AARS with distinct bacterial, archaeal, eukaryotic, or organellar clades, resulting in a total of 36 families of AARS catalytic domains. Small structural modules that differentiate one AARS family from another played pivotal roles in discriminating between amino acid side chains, thereby expanding the genetic code and refining its precision. The resulting model shows a tendency for less elaborate enzymes, with simpler catalytic domains, to activate amino acids that were not synthesised until later in the evolution of the code. The most probable evolutionary route for an emergent amino acid type to establish a place in the code was by recruiting older, less specific AARS, rather than adapting contemporary lineages. This process, retrofunctionalisation, differs from previously described mechanisms through which amino acids would enter the code.
Collapse
Affiliation(s)
- Jordan Douglas
- Department of Physics, The University of Auckland, New Zealand
- Centre for Computational Evolution, The University of Auckland, New Zealand
| | - Remco Bouckaert
- Centre for Computational Evolution, The University of Auckland, New Zealand
- School of Computer Science, The University of Auckland, New Zealand
| | - Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, USA
| | - Peter R Wills
- Department of Physics, The University of Auckland, New Zealand
- Centre for Computational Evolution, The University of Auckland, New Zealand
| |
Collapse
|
6
|
Cuevas-Zuviría B, Adam ZR, Goldman AD, Kaçar B. Informatic Capabilities of Translation and Its Implications for the Origins of Life. J Mol Evol 2023; 91:567-569. [PMID: 37526692 DOI: 10.1007/s00239-023-10125-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Accepted: 06/22/2023] [Indexed: 08/02/2023]
Abstract
The ability to encode and convert heritable information into molecular function is a defining feature of life as we know it. The conversion of information into molecular function is performed by the translation process, in which triplets of nucleotides in a nucleic acid polymer (mRNA) encode specific amino acids in a protein polymer that folds into a three-dimensional structure. The folded protein then performs one or more molecular activities, often as one part of a complex and coordinated physiological network. Prebiotic systems, lacking the ability to explicitly translate information between genotype and phenotype, would have depended upon either chemosynthetic pathways to generate its components-constraining its complexity and evolvability- or on the ambivalence of RNA as both carrier of information and of catalytic functions-a possibility which is still supported by a very limited set of catalytic RNAs. Thus, the emergence of translation during early evolutionary history may have allowed life to unmoor from the setting of its origin. The origin of translation machinery also represents an entirely novel and distinct threshold of behavior for which there is no abiotic counterpart-it could be the only known example of computing that emerged naturally at the chemical level. Here we describe translation machinery's decoding system as the basis of cellular translation's information-processing capabilities, and the four operation types that find parallels in computer systems engineering that this biological machinery exhibits.
Collapse
Affiliation(s)
- Bruno Cuevas-Zuviría
- Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, USA.
- Centro de Biotecnología y Genómica de Plantas, Universidad Politécnica de Madrid, Madrid, Spain.
| | - Zachary R Adam
- Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, USA
- Department of Geosciences, University of Wisconsin-Madison, Madison, WI, USA
| | | | - Betül Kaçar
- Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, USA
| |
Collapse
|
7
|
Wills PR. Origins of Genetic Coding: Self-Guided Molecular Self-Organisation. ENTROPY (BASEL, SWITZERLAND) 2023; 25:1281. [PMID: 37761580 PMCID: PMC10527755 DOI: 10.3390/e25091281] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Revised: 08/22/2023] [Accepted: 08/28/2023] [Indexed: 09/29/2023]
Abstract
The origin of genetic coding is characterised as an event of cosmic significance in which quantum mechanical causation was transcended by constructive computation. Computational causation entered the physico-chemical processes of the pre-biotic world by the incidental satisfaction of a condition of reflexivity between polymer sequence information and system elements able to facilitate their own production through translation of that information. This event, which has previously been modelled in the dynamics of Gene-Replication-Translation systems, is properly described as a process of self-guided self-organisation. The spontaneous emergence of a primordial genetic code between two-letter alphabets of nucleotide triplets and amino acids is easily possible, starting with random peptide synthesis that is RNA-sequence-dependent. The evident self-organising mechanism is the simultaneous quasi-species bifurcation of the populations of information-carrying genes and enzymes with aminoacyl-tRNA synthetase-like activities. This mechanism allowed the code to evolve very rapidly to the ~20 amino acid limit apparent for the reflexive differentiation of amino acid properties using protein catalysts. The self-organisation of semantics in this domain of physical chemistry conferred on emergent molecular biology exquisite computational control over the nanoscopic events needed for its self-construction.
Collapse
Affiliation(s)
- Peter R Wills
- Department of Physics, University of Auckland, Auckland PB 92019, New Zealand
| |
Collapse
|
8
|
Guo X, Su M. The Origin of Translation: Bridging the Nucleotides and Peptides. Int J Mol Sci 2022; 24:ijms24010197. [PMID: 36613641 PMCID: PMC9820756 DOI: 10.3390/ijms24010197] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Revised: 12/09/2022] [Accepted: 12/15/2022] [Indexed: 12/24/2022] Open
Abstract
Extant biology uses RNA to record genetic information and proteins to execute biochemical functions. Nucleotides are translated into amino acids via transfer RNA in the central dogma. tRNA is essential in translation as it connects the codon and the cognate amino acid. To reveal how the translation emerged in the prebiotic context, we start with the structure and dissection of tRNA, followed by the theory and hypothesis of tRNA and amino acid recognition. Last, we review how amino acids assemble on the tRNA and further form peptides. Understanding the origin of life will also promote our knowledge of artificial living systems.
Collapse
Affiliation(s)
- Xuyuan Guo
- School of Genetics and Microbiology, Trinity College Dublin, The University of Dublin, College Green, Dublin 2, D02 PN40 Dublin, Ireland
| | - Meng Su
- MRC Laboratory of Molecular Biology, Cambridge CB2 0QH, UK
- Correspondence:
| |
Collapse
|
9
|
Carr CE. Resolving the History of Life on Earth by Seeking Life As We Know It on Mars. ASTROBIOLOGY 2022; 22:880-888. [PMID: 35467949 PMCID: PMC9298492 DOI: 10.1089/ast.2021.0043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]
Abstract
An origin of Earth life on Mars would resolve significant inconsistencies between the inferred history of life and Earth's geologic history. Life as we know it utilizes amino acids, nucleic acids, and lipids for the metabolic, informational, and compartment-forming subsystems of a cell. Such building blocks may have formed simultaneously from cyanosulfidic chemical precursors in a planetary surface scenario involving ultraviolet light, wet-dry cycling, and volcanism. On the inferred water world of early Earth, such an origin would have been limited to volcanic island hotspots. A cyanosulfidic origin of life could have taken place on Mars via photoredox chemistry, facilitated by orders-of-magnitude more sub-aerial crust than early Earth, and an earlier transition to oxidative conditions that could have been involved in final fixation of the genetic code. Meteoritic bombardment may have generated transient habitable environments and ejected and transferred life to Earth. Ongoing and future missions to Mars offer an unprecedented opportunity to confirm or refute evidence consistent with a cyanosulfidic origin of life on Mars, search for evidence of ancient life, and constrain the evolution of Mars' oxidation state over time. We should seek to prove or refute a martian origin for life on Earth alongside other possibilities.
Collapse
Affiliation(s)
- Christopher E. Carr
- Daniel Guggenheim School of Aerospace Engineering, Georgia Institute of Technology, Atlanta, Georgia, USA
- School of Earth and Atmospheric Sciences, Georgia Institute of Technology, Atlanta, Georgia, USA
- Address correspondence to: Christopher E. Carr, ESM Building, Room G10, 620 Cherry St NW, Atlanta, GA 30332, USA
| |
Collapse
|
10
|
Koper K, Han SW, Pastor DC, Yoshikuni Y, Maeda HA. Evolutionary Origin and Functional Diversification of Aminotransferases. J Biol Chem 2022; 298:102122. [PMID: 35697072 PMCID: PMC9309667 DOI: 10.1016/j.jbc.2022.102122] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2022] [Revised: 06/06/2022] [Accepted: 06/07/2022] [Indexed: 11/30/2022] Open
Abstract
Aminotransferases (ATs) are pyridoxal 5′-phosphate–dependent enzymes that catalyze the transamination reactions between amino acid donor and keto acid acceptor substrates. Modern AT enzymes constitute ∼2% of all classified enzymatic activities, play central roles in nitrogen metabolism, and generate multitude of primary and secondary metabolites. ATs likely diverged into four distinct AT classes before the appearance of the last universal common ancestor and further expanded to a large and diverse enzyme family. Although the AT family underwent an extensive functional specialization, many AT enzymes retained considerable substrate promiscuity and multifunctionality because of their inherent mechanistic, structural, and functional constraints. This review summarizes the evolutionary history, diverse metabolic roles, reaction mechanisms, and structure–function relationships of the AT family enzymes, with a special emphasis on their substrate promiscuity and multifunctionality. Comprehensive characterization of AT substrate specificity is still needed to reveal their true metabolic functions in interconnecting various branches of the nitrogen metabolic network in different organisms.
Collapse
Affiliation(s)
- Kaan Koper
- Department of Botany, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Sang-Woo Han
- The US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA; Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | | | - Yasuo Yoshikuni
- The US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA; Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA; Global Center for Food, Land, and Water Resources, Research Faculty of Agriculture, Hokkaido University, Hokkaido 060-8589, Japan
| | - Hiroshi A Maeda
- Department of Botany, University of Wisconsin-Madison, Madison, WI, 53706, USA
| |
Collapse
|
11
|
A Closer Look at Non-random Patterns Within Chemistry Space for a Smaller, Earlier Amino Acid Alphabet. J Mol Evol 2022; 90:307-323. [PMID: 35666290 DOI: 10.1007/s00239-022-10061-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2021] [Accepted: 05/11/2022] [Indexed: 10/18/2022]
Abstract
Recent findings, in vitro and in silico, are strengthening the idea of a simpler, earlier stage of genetically encoded proteins which used amino acids produced by prebiotic chemistry. These findings motivate a re-examination of prior work which has identified unusual properties of the set of twenty amino acids found within the full genetic code, while leaving it unclear whether similar patterns also characterize the subset of prebiotically plausible amino acids. We have suggested previously that this ambiguity may result from the low number of amino acids recognized by the definition of prebiotic plausibility used for the analysis. Here, we test this hypothesis using significantly updated data for organic material detected within meteorites, which contain several coded and non-coded amino acids absent from prior studies. In addition to confirming the well-established idea that "late" arriving amino acids expanded the chemistry space encoded by genetic material, we find that a prebiotically plausible subset of coded amino acids generally emulates the patterns found in the full set of 20, namely an exceptionally broad and even distribution of volumes and an exceptionally even distribution of hydrophobicities (quantified as logP) over a narrow range. However, the strength of this pattern varies depending on both the size and composition the library used to create a background (null model) for a random alphabet, and the precise definition of exactly which amino acids were present in a simpler, earlier code. Findings support the idea that a small sample size of amino acids caused previous ambiguous results, and further improvements in meteorite analysis, and/or prebiotic simulations will further clarify the nature and extent of unusual properties. We discuss the case of sulfur-containing amino acids as a specific and clear example and conclude by reviewing the potential impact of better understanding the chemical "logic" of a smaller forerunner to the standard amino acid alphabet.
Collapse
|
12
|
Crapitto AJ, Campbell A, Harris AJ, Goldman AD. A consensus view of the proteome of the last universal common ancestor. Ecol Evol 2022; 12:e8930. [PMID: 35784055 PMCID: PMC9165204 DOI: 10.1002/ece3.8930] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2022] [Revised: 04/11/2022] [Accepted: 04/14/2022] [Indexed: 12/30/2022] Open
Abstract
The availability of genomic and proteomic data from across the tree of life has made it possible to infer features of the genome and proteome of the last universal common ancestor (LUCA). A number of studies have done so, all using a unique set of methods and bioinformatics databases. Here, we compare predictions across eight such studies and measure both their agreement with one another and with the consensus predictions among them. We find that some LUCA genome studies show a strong agreement with the consensus predictions of the others, but that no individual study shares a high or even moderate degree of similarity with any other individual study. From these observations, we conclude that the consensus among studies provides a more accurate depiction of the core proteome of the LUCA and its functional repertoire. The set of consensus LUCA protein family predictions between all of these studies portrays a LUCA genome that, at minimum, encoded functions related to protein synthesis, amino acid metabolism, nucleotide metabolism, and the use of common, nucleotide‐derived organic cofactors.
Collapse
Affiliation(s)
| | - Amy Campbell
- Perelman School of Medicine University of Pennsylvania Philadelphia Pennsylvania USA
| | - AJ Harris
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization South China Botanical Garden Chinese Academy of Sciences Guangzhou China
| | - Aaron D. Goldman
- Department of Biology Oberlin College Oberlin Ohio USA
- Blue Marble Space Institute of Science Seattle Washington USA
| |
Collapse
|
13
|
Abstract
In the mid-twentieth century, multiple Nobel Prizes rewarded discoveries of a seemingly universal set of molecules and interactions that collectively defined the chemical basis for life. Twenty-first-century science knows that every detail of this Central Dogma of Molecular Biology can vary through either biological evolution, human engineering (synthetic biology) or both. Clearly the material, molecular basis of replicating, evolving entities can be different. There is far less clarity yet for what constitutes this set of possibilities. One approach to better understand the limits and scope of moving beyond life's central dogma comes from those who study life's origins. RNA, proteins and the genetic code that binds them each look like products of natural selection. This raises the question of what step(s) preceded these particular components? Answers here will clarify whether any discrete point in time or biochemical evolution will objectively merit the label of life's origin, or whether life unfolds seamlessly from the non-living universe.
Collapse
Affiliation(s)
- Stephen Freeland
- Department of Biological Sciences, University of Maryland Baltimore County, Baltimore, MD 21250, USA
| |
Collapse
|
14
|
Carter CW, Popinga A, Bouckaert R, Wills PR. Multidimensional Phylogenetic Metrics Identify Class I Aminoacyl-tRNA Synthetase Evolutionary Mosaicity and Inter-Modular Coupling. Int J Mol Sci 2022; 23:ijms23031520. [PMID: 35163448 PMCID: PMC8835825 DOI: 10.3390/ijms23031520] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2021] [Revised: 01/17/2022] [Accepted: 01/17/2022] [Indexed: 02/01/2023] Open
Abstract
The role of aminoacyl-tRNA synthetases (aaRS) in the emergence and evolution of genetic coding poses challenging questions concerning their provenance. We seek evidence about their ancestry from curated structure-based multiple sequence alignments of a structurally invariant “scaffold” shared by all 10 canonical Class I aaRS. Three uncorrelated phylogenetic metrics—mutation frequency, its uniformity, and row-by-row cladistic congruence—imply that the Class I scaffold is a mosaic assembled from successive genetic sources. Metrics for different modules vary in accordance with their presumed functionality. Sequences derived from the ATP– and amino acid– binding sites exhibit specific two-way coupling to those derived from Connecting Peptide 1, a third module whose metrics suggest later acquisition. The data help validate: (i) experimental fragmentations of the canonical Class I structure into three partitions that retain catalytic activities in proportion to their length; and (ii) evidence that the ancestral Class I aaRS gene also encoded a Class II ancestor in frame on the opposite strand. A 46-residue Class I “protozyme” roots the Class I tree prior to the adaptive radiation of the Rossmann dinucleotide binding fold that refined substrate discrimination. Such rooting implies near simultaneous emergence of genetic coding and the origin of the proteome, resolving a conundrum posed by previous inferences that Class I aaRS evolved after the genetic code had been implemented in an RNA world. Further, pinpointing discontinuous enhancements of aaRS fidelity establishes a timeline for the growth of coding from a binary amino acid alphabet.
Collapse
Affiliation(s)
- Charles W. Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260, USA
- Correspondence: ; Tel.: +1-919-966-3263
| | - Alex Popinga
- Centre for Computational Evolution, University of Auckland, PB 92019, Auckland 1142, New Zealand; (A.P.); (R.B.)
| | - Remco Bouckaert
- Centre for Computational Evolution, University of Auckland, PB 92019, Auckland 1142, New Zealand; (A.P.); (R.B.)
| | - Peter R. Wills
- Department of Physics and Te Ao Marama Centre for Fundamental Inquiry, University of Auckland, PB 92019, Auckland 1142, New Zealand;
| |
Collapse
|
15
|
Furukawa R, Yokobori SI, Sato R, Kumagawa T, Nakagawa M, Katoh K, Yamagishi A. Amino Acid Specificity of Ancestral Aminoacyl-tRNA Synthetase Prior to the Last Universal Common Ancestor Commonote commonote. J Mol Evol 2022; 90:73-94. [PMID: 35084522 PMCID: PMC8821087 DOI: 10.1007/s00239-021-10043-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2021] [Accepted: 12/16/2021] [Indexed: 11/24/2022]
Abstract
Extant organisms commonly use 20 amino acids in protein synthesis. In the translation system, aminoacyl-tRNA synthetase (ARS) selectively binds an amino acid and transfers it to the cognate tRNA. It is postulated that the amino acid repertoire of ARS expanded during the development of the translation system. In this study we generated composite phylogenetic trees for seven ARSs (SerRS, ProRS, ThrRS, GlyRS-1, HisRS, AspRS, and LysRS) which are thought to have diverged by gene duplication followed by mutation, before the evolution of the last universal common ancestor. The composite phylogenetic tree shows that the AspRS/LysRS branch diverged from the other five ARSs at the deepest node, with the GlyRS/HisRS branch and the other three ARSs (ThrRS, ProRS and SerRS) diverging at the second deepest node. ThrRS diverged next, and finally ProRS and SerRS diverged from each other. Based on the phylogenetic tree, sequences of the ancestral ARSs prior to the evolution of the last universal common ancestor were predicted. The amino acid specificity of each ancestral ARS was then postulated by comparison with amino acid recognition sites of ARSs of extant organisms. Our predictions demonstrate that ancestral ARSs had substantial specificity and that the number of amino acid types amino-acylated by proteinaceous ARSs was limited before the appearance of a fuller range of proteinaceous ARS species. From an assumption that 10 amino acid species are required for folding and function, proteinaceous ARS possibly evolved in a translation system composed of preexisting ribozyme ARSs, before the evolution of the last universal common ancestor.
Collapse
Affiliation(s)
- Ryutaro Furukawa
- Department of Applied Life Sciences, School of Life Sciences, Tokyo University of Pharmacy and Life Sciences, 1432-1 Horinouchi, Hachioji, Tokyo, Japan.,Faculty of Human Science, Waseda University, 2-579-15 Mikajima, Tokorozawa, Saitama, 359-1192, Japan
| | - Shin-Ichi Yokobori
- Department of Applied Life Sciences, School of Life Sciences, Tokyo University of Pharmacy and Life Sciences, 1432-1 Horinouchi, Hachioji, Tokyo, Japan
| | - Riku Sato
- Department of Applied Life Sciences, School of Life Sciences, Tokyo University of Pharmacy and Life Sciences, 1432-1 Horinouchi, Hachioji, Tokyo, Japan
| | - Taimu Kumagawa
- Department of Applied Life Sciences, School of Life Sciences, Tokyo University of Pharmacy and Life Sciences, 1432-1 Horinouchi, Hachioji, Tokyo, Japan
| | - Mizuho Nakagawa
- Department of Applied Life Sciences, School of Life Sciences, Tokyo University of Pharmacy and Life Sciences, 1432-1 Horinouchi, Hachioji, Tokyo, Japan
| | - Kazutaka Katoh
- Department of Genome Informatics, Genome Information Research Center, Research Institute for Microbial Diseases, Osaka University, 3-1 Yamadaoka, Suita, Osaka, 565-0871, Japan
| | - Akihiko Yamagishi
- Department of Applied Life Sciences, School of Life Sciences, Tokyo University of Pharmacy and Life Sciences, 1432-1 Horinouchi, Hachioji, Tokyo, Japan.
| |
Collapse
|
16
|
TwinCons: Conservation score for uncovering deep sequence similarity and divergence. PLoS Comput Biol 2021; 17:e1009541. [PMID: 34714829 PMCID: PMC8580257 DOI: 10.1371/journal.pcbi.1009541] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2021] [Revised: 11/10/2021] [Accepted: 10/06/2021] [Indexed: 11/19/2022] Open
Abstract
We have developed the program TwinCons, to detect noisy signals of deep ancestry of proteins or nucleic acids. As input, the program uses a composite alignment containing pre-defined groups, and mathematically determines a 'cost' of transforming one group to the other at each position of the alignment. The output distinguishes conserved, variable and signature positions. A signature is conserved within groups but differs between groups. The method automatically detects continuous characteristic stretches (segments) within alignments. TwinCons provides a convenient representation of conserved, variable and signature positions as a single score, enabling the structural mapping and visualization of these characteristics. Structure is more conserved than sequence. TwinCons highlights alternative sequences of conserved structures. Using TwinCons, we detected highly similar segments between proteins from the translation and transcription systems. TwinCons detects conserved residues within regions of high functional importance for the ribosomal RNA (rRNA) and demonstrates that signatures are not confined to specific regions but are distributed across the rRNA structure. The ability to evaluate both nucleic acid and protein alignments allows TwinCons to be used in combined sequence and structural analysis of signatures and conservation in rRNA and in ribosomal proteins (rProteins). TwinCons detects a strong sequence conservation signal between bacterial and archaeal rProteins related by circular permutation. This conserved sequence is structurally colocalized with conserved rRNA, indicated by TwinCons scores of rRNA alignments of bacterial and archaeal groups. This combined analysis revealed deep co-evolution of rRNA and rProtein buried within the deepest branching points in the tree of life.
Collapse
|
17
|
Abstract
Diverse models have been advanced for the evolution of the genetic code. Here, models for tRNA, aminoacyl-tRNA synthetase (aaRS) and genetic code evolution were combined with an understanding of EF-Tu suppression of tRNA 3rd anticodon position wobbling. The result is a highly detailed scheme that describes the placements of all amino acids in the standard genetic code. The model describes evolution of 6-, 4-, 3-, 2- and 1-codon sectors. Innovation in column 3 of the code is explained. Wobbling and code degeneracy are explained. Separate distribution of serine sectors between columns 2 and 4 of the code is described. We conclude that very little chaos contributed to evolution of the genetic code and that the pattern of evolution of aaRS enzymes describes a history of the evolution of the code. A model is proposed to describe the biological selection for the earliest evolution of the code and for protocell evolution.
Collapse
Affiliation(s)
- Lei Lei
- Department of Biology, University of New England, Biddeford, ME, USA
| | - Zachary Frome Burton
- Department of Biochemistry and Molecular Biology, Michigan State University, E. Lansing, MI, USA
| |
Collapse
|
18
|
Makarov M, Meng J, Tretyachenko V, Srb P, Březinová A, Giacobelli VG, Bednárová L, Vondrášek J, Dunker AK, Hlouchová K. Enzyme catalysis prior to aromatic residues: Reverse engineering of a dephospho-CoA kinase. Protein Sci 2021; 30:1022-1034. [PMID: 33739538 PMCID: PMC8040869 DOI: 10.1002/pro.4068] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2021] [Revised: 03/12/2021] [Accepted: 03/13/2021] [Indexed: 11/07/2022]
Abstract
The wide variety of protein structures and functions results from the diverse properties of the 20 canonical amino acids. The generally accepted hypothesis is that early protein evolution was associated with enrichment of a primordial alphabet, thereby enabling increased protein catalytic efficiencies and functional diversification. Aromatic amino acids were likely among the last additions to genetic code. The main objective of this study was to test whether enzyme catalysis can occur without the aromatic residues (aromatics) by studying the structure and function of dephospho-CoA kinase (DPCK) following aromatic residue depletion. We designed two variants of a putative DPCK from Aquifex aeolicus by substituting (a) Tyr, Phe and Trp or (b) all aromatics (including His). Their structural characterization indicates that substituting the aromatics does not markedly alter their secondary structures but does significantly loosen their side chain packing and increase their sizes. Both variants still possess ATPase activity, although with 150-300 times lower efficiency in comparison with the wild-type phosphotransferase activity. The transfer of the phosphate group to the dephospho-CoA substrate becomes heavily uncoupled and only the His-containing variant is still able to perform the phosphotransferase reaction. These data support the hypothesis that proteins in the early stages of life could support catalytic activities, albeit with low efficiencies. An observed significant contraction upon ligand binding is likely important for appropriate organization of the active site. Formation of firm hydrophobic cores, which enable the assembly of stably structured active sites, is suggested to provide a selective advantage for adding the aromatic residues.
Collapse
Affiliation(s)
- Mikhail Makarov
- Department of Cell Biology, Faculty of ScienceCharles University, BIOCEVPragueCzech Republic
- Department of Biochemistry, Faculty of ScienceCharles UniversityPragueCzech Republic
| | - Jingwei Meng
- Department of Biochemistry and Molecular Biology, Center for Computational Biology and BioinformaticsIndiana University School of MedicineIndianapolisIndianaUSA
| | - Vyacheslav Tretyachenko
- Department of Cell Biology, Faculty of ScienceCharles University, BIOCEVPragueCzech Republic
- Department of Biochemistry, Faculty of ScienceCharles UniversityPragueCzech Republic
| | - Pavel Srb
- Institute of Organic Chemistry and Biochemistry, IOCB Research Centre & Gilead Sciences, Academy of Sciences of the Czech RepublicPragueCzech Republic
| | - Anna Březinová
- Proteomics Core Facility, BIOCEV, Faculty of Science, Charles UniversityPragueCzech Republic
| | | | - Lucie Bednárová
- Institute of Organic Chemistry and Biochemistry, IOCB Research Centre & Gilead Sciences, Academy of Sciences of the Czech RepublicPragueCzech Republic
| | - Jiří Vondrášek
- Institute of Organic Chemistry and Biochemistry, IOCB Research Centre & Gilead Sciences, Academy of Sciences of the Czech RepublicPragueCzech Republic
| | - A. Keith Dunker
- Department of Biochemistry and Molecular Biology, Center for Computational Biology and BioinformaticsIndiana University School of MedicineIndianapolisIndianaUSA
| | - Klára Hlouchová
- Department of Cell Biology, Faculty of ScienceCharles University, BIOCEVPragueCzech Republic
- Institute of Organic Chemistry and Biochemistry, IOCB Research Centre & Gilead Sciences, Academy of Sciences of the Czech RepublicPragueCzech Republic
| |
Collapse
|
19
|
Jabłońska J, Tawfik DS. The evolution of oxygen-utilizing enzymes suggests early biosphere oxygenation. Nat Ecol Evol 2021; 5:442-448. [PMID: 33633374 DOI: 10.1038/s41559-020-01386-9] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2020] [Accepted: 12/16/2020] [Indexed: 01/31/2023]
Abstract
Production of molecular oxygen was a turning point in the Earth's history. The geological record indicates the Great Oxidation Event, which marked a permanent transition to an oxidizing atmosphere around 2.4 Ga. However, the degree to which oxygen was available to life before oxygenation of the atmosphere remains unknown. Here, phylogenetic analysis of all known oxygen-utilizing and -producing enzymes (O2-enzymes) indicates that oxygen became widely available to living organisms well before the Great Oxidation Event. About 60% of the O2-enzyme families whose birth can be dated appear to have emerged at the separation of terrestrial and marine bacteria (22 families, compared to two families assigned to the last universal common ancestor). This node, dubbed the last universal oxygen ancestor, coincides with a burst of emergence of both oxygenases and other oxidoreductases, thus suggesting a wider availability of oxygen around 3.1 Ga.
Collapse
Affiliation(s)
- Jagoda Jabłońska
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Dan S Tawfik
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel.
| |
Collapse
|
20
|
Harris AJ, Goldman AD. The very early evolution of protein translocation across membranes. PLoS Comput Biol 2021; 17:e1008623. [PMID: 33684113 PMCID: PMC7987157 DOI: 10.1371/journal.pcbi.1008623] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2020] [Revised: 03/23/2021] [Accepted: 12/10/2020] [Indexed: 11/18/2022] Open
Abstract
In this study, we used a computational approach to investigate the early evolutionary history of a system of proteins that, together, embed and translocate other proteins across cell membranes. Cell membranes comprise the basis for cellularity, which is an ancient, fundamental organizing principle shared by all organisms and a key innovation in the evolution of life on Earth. Two related requirements for cellularity are that organisms are able to both embed proteins into membranes and translocate proteins across membranes. One system that accomplishes these tasks is the signal recognition particle (SRP) system, in which the core protein components are the paralogs, FtsY and Ffh. Complementary to the SRP system is the Sec translocation channel, in which the primary channel-forming protein is SecY. We performed phylogenetic analyses that strongly supported prior inferences that FtsY, Ffh, and SecY were all present by the time of the last universal common ancestor of life, the LUCA, and that the ancestor of FtsY and Ffh existed before the LUCA. Further, we combined ancestral sequence reconstruction and protein structure and function prediction to show that the LUCA had an SRP system and Sec translocation channel that were similar to those of extant organisms. We also show that the ancestor of Ffh and FtsY that predated the LUCA was more similar to FtsY than Ffh but could still have comprised a rudimentary protein translocation system on its own. Duplication of the ancestor of FtsY and Ffh facilitated the specialization of FtsY as a membrane bound receptor and Ffh as a cytoplasmic protein that could bind nascent proteins with specific membrane-targeting signal sequences. Finally, we analyzed amino acid frequencies in our ancestral sequence reconstructions to infer that the ancestral Ffh/FtsY protein likely arose prior to or just after the completion of the canonical genetic code. Taken together, our results offer a window into the very early evolutionary history of cellularity.
Collapse
Affiliation(s)
- AJ Harris
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, China
- Department of Biology, Oberlin College and Conservatory, K123 Science Center, Oberlin, Ohio, United States of America
| | - Aaron David Goldman
- Department of Biology, Oberlin College and Conservatory, K123 Science Center, Oberlin, Ohio, United States of America
- Blue Marble Space Institute of Science, Seattle, Washington, United States of America
| |
Collapse
|
21
|
Kaiser F, Krautwurst S, Salentin S, Haupt VJ, Leberecht C, Bittrich S, Labudde D, Schroeder M. The structural basis of the genetic code: amino acid recognition by aminoacyl-tRNA synthetases. Sci Rep 2020; 10:12647. [PMID: 32724042 PMCID: PMC7387524 DOI: 10.1038/s41598-020-69100-0] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2020] [Accepted: 07/06/2020] [Indexed: 12/29/2022] Open
Abstract
Storage and directed transfer of information is the key requirement for the development of life. Yet any information stored on our genes is useless without its correct interpretation. The genetic code defines the rule set to decode this information. Aminoacyl-tRNA synthetases are at the heart of this process. We extensively characterize how these enzymes distinguish all natural amino acids based on the computational analysis of crystallographic structure data. The results of this meta-analysis show that the correct read-out of genetic information is a delicate interplay between the composition of the binding site, non-covalent interactions, error correction mechanisms, and steric effects.
Collapse
Affiliation(s)
- Florian Kaiser
- Biotechnology Center (BIOTEC), TU Dresden, 01307, Dresden, Germany. .,PharmAI GmbH, Tatzberg 47, 01307, Dresden, Germany.
| | - Sarah Krautwurst
- University of Applied Sciences Mittweida, 09648, Mittweida, Germany
| | | | - V Joachim Haupt
- Biotechnology Center (BIOTEC), TU Dresden, 01307, Dresden, Germany.,PharmAI GmbH, Tatzberg 47, 01307, Dresden, Germany
| | | | | | - Dirk Labudde
- University of Applied Sciences Mittweida, 09648, Mittweida, Germany
| | | |
Collapse
|
22
|
The phylogenetic distribution of the glutaminyl-tRNA synthetase and Glu-tRNA Gln amidotransferase in the fundamental lineages would imply that the ancestor of archaea, that of eukaryotes and LUCA were progenotes. Biosystems 2020; 196:104174. [PMID: 32535177 DOI: 10.1016/j.biosystems.2020.104174] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2020] [Revised: 05/25/2020] [Accepted: 05/25/2020] [Indexed: 12/21/2022]
Abstract
The function of the glutaminyl-tRNA synthetase and Glu-tRNAGln amidotransferase might be related to the origin of the genetic code because, for example, glutaminyl-tRNA synthetase catalyses the fundamental reaction that makes the genetic code. If the evolutionary stage of the origin of these two enzymes could be unambiguously identified, then the genetic code should still have been originating at that particular evolutionary stage because the fundamental reaction that makes the code itself was still evidently evolving. This would result in that particular evolutionary moment being attributed to the evolutionary stage of the progenote because it would have a relationship between the genotype and the phenotype not yet fully realized because the genetic code was precisely still originating. I then analyzed the distribution of the glutaminyl-tRNA synthetase and Glu-tRNAGln aminodotrasferase in the main phyletic lineages. Since in some cases the origin of these two enzymes can be related to the evolutionary stages of ancestors of archaea and eukaryotes, this would indicate these ancestors as progenotes because at that evolutionary moment the genetic code was evidently still evolving, thus realizing the definition of progenote. The conclusion that the ancestor of archaea and that of eukaryotes were progenotes would imply that even the last universal common ancestor (LUCA) was a progenote because it appeared, on the tree of life, temporally before these ancestors.
Collapse
|
23
|
Gospodinov A, Kunnev D. Universal Codons with Enrichment from GC to AU Nucleotide Composition Reveal a Chronological Assignment from Early to Late Along with LUCA Formation. Life (Basel) 2020; 10:life10060081. [PMID: 32516985 PMCID: PMC7345086 DOI: 10.3390/life10060081] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2020] [Revised: 05/30/2020] [Accepted: 06/03/2020] [Indexed: 12/14/2022] Open
Abstract
The emergence of a primitive genetic code should be considered the most essential event during the origin of life. Almost a complete set of codons (as we know them) should have been established relatively early during the evolution of the last universal common ancestor (LUCA) from which all known organisms descended. Many hypotheses have been proposed to explain the driving forces and chronology of the evolution of the genetic code; however, none is commonly accepted. In the current paper, we explore the features of the genetic code that, in our view, reflect the mechanism and the chronological order of the origin of the genetic code. Our hypothesis postulates that the primordial RNA was mostly GC-rich, and this bias was reflected in the order of amino acid codon assignment. If we arrange the codons and their corresponding amino acids from GC-rich to AU-rich, we find that: 1. The amino acids encoded by GC-rich codons (Ala, Gly, Arg, and Pro) are those that contribute the most to the interactions with RNA (if incorporated into short peptides). 2. This order correlates with the addition of novel functions necessary for the evolution from simple to longer folded peptides. 3. The overlay of aminoacyl-tRNA synthetases (aaRS) to the amino acid order produces a distinctive zonal distribution for class I and class II suggesting an interdependent origin. These correlations could be explained by the active role of the bridge peptide (BP), which we proposed earlier in the evolution of the genetic code.
Collapse
Affiliation(s)
- Anastas Gospodinov
- Roumen Tsanev Institute of Molecular Biology, Bulgarian Academy of Sciences, Acad. G. Bonchev Str. 21, Sofia 1113, Bulgaria;
| | - Dimiter Kunnev
- Department of Molecular & Cellular Biology, Roswell Park Cancer Institute, Buffalo, NY 14263, USA
- Correspondence:
| |
Collapse
|
24
|
Takénaka A, Moras D. Correlation between equi-partition of aminoacyl-tRNA synthetases and amino-acid biosynthesis pathways. Nucleic Acids Res 2020; 48:3277-3285. [PMID: 31965182 PMCID: PMC7102985 DOI: 10.1093/nar/gkaa013] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2019] [Revised: 12/31/2019] [Accepted: 01/07/2020] [Indexed: 12/11/2022] Open
Abstract
The partition of aminoacyl-tRNA synthetases (aaRSs) into two classes of equal size and the correlated amino acid distribution is a puzzling still unexplained observation. We propose that the time scale of the amino-acid synthesis, assumed to be proportional to the number of reaction steps (NE) involved in the biosynthesis pathway, is one of the parameters that controlled the timescale of aaRSs appearance. Because all pathways are branched at fructose-6-phosphate on the metabolic pathway, this product is defined as the common origin for the NE comparison. For each amino-acid, the NE value, counted from the origin to the final product, provides a timescale for the pathways to be established. An archeological approach based on NE reveals that aaRSs of the two classes are generated in pair along this timescale. The results support the coevolution theory for the origin of the genetic code with an earlier appearance of class II aaRSs.
Collapse
Affiliation(s)
- Akio Takénaka
- Research Institute, Chiba Institute of Technology, 2-17-1 Tsudanuma, Narashino, Chiba 275-0016, Japan.,Faculty of Pharmacy, Shenyang Pharmaceutical University, Benxi, Liaoning 117004, China
| | - Dino Moras
- Department of Integrated Structural Biology, Institut de Génétique et de Biologie Moléculaire et Cellulaire (IGBMC) 1 rue Laurent Fries, Illkirch 67404, France.,Centre National de Recherche Scientifique (CNRS) UMR 7104, France.,Institut National de Santé et de Recherche Médicale (INSERM) U1258, France.,Université de Strasbourg, Illkirch, France
| |
Collapse
|
25
|
Liberles DA, Chang B, Geiler-Samerotte K, Goldman A, Hey J, Kaçar B, Meyer M, Murphy W, Posada D, Storfer A. Emerging Frontiers in the Study of Molecular Evolution. J Mol Evol 2020; 88:211-226. [PMID: 32060574 PMCID: PMC7386396 DOI: 10.1007/s00239-020-09932-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
A collection of the editors of Journal of Molecular Evolution have gotten together to pose a set of key challenges and future directions for the field of molecular evolution. Topics include challenges and new directions in prebiotic chemistry and the RNA world, reconstruction of early cellular genomes and proteins, macromolecular and functional evolution, evolutionary cell biology, genome evolution, molecular evolutionary ecology, viral phylodynamics, theoretical population genomics, somatic cell molecular evolution, and directed evolution. While our effort is not meant to be exhaustive, it reflects research questions and problems in the field of molecular evolution that are exciting to our editors.
Collapse
Affiliation(s)
- David A Liberles
- Department of Biology and Center for Computational Genetics and Genomics, Temple University, Philadelphia, PA, 19122, USA.
| | - Belinda Chang
- Department of Ecology and Evolutionary Biology and Department of Cell and Systems Biology, University of Toronto, 25 Harbord Street, Toronto, ON, M5S 3G5, Canada
| | - Kerry Geiler-Samerotte
- Center for Mechanisms of Evolution, School of Life Sciences, Arizona State University, Tempe, AZ, 85287, USA
| | - Aaron Goldman
- Department of Biology, Oberlin College and Conservatory, K123 Science Center, 119 Woodland Street, Oberlin, OH, 44074, USA
| | - Jody Hey
- Department of Biology and Center for Computational Genetics and Genomics, Temple University, Philadelphia, PA, 19122, USA
| | - Betül Kaçar
- Department of Molecular and Cell Biology, University of Arizona, Tucson, AZ, 85721, USA
| | - Michelle Meyer
- Department of Biology, Boston College, Chestnut Hill, MA, 02467, USA
| | - William Murphy
- Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, TX, 77843, USA
| | - David Posada
- Biomedical Research Center (CINBIO), University of Vigo, Vigo, Spain
| | - Andrew Storfer
- School of Biological Sciences, Washington State University, Pullman, WA, 99164, USA
| |
Collapse
|
26
|
Evolution of Life on Earth: tRNA, Aminoacyl-tRNA Synthetases and the Genetic Code. Life (Basel) 2020; 10:life10030021. [PMID: 32131473 PMCID: PMC7151597 DOI: 10.3390/life10030021] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2020] [Revised: 02/13/2020] [Accepted: 02/27/2020] [Indexed: 02/07/2023] Open
Abstract
Life on Earth and the genetic code evolved around tRNA and the tRNA anticodon. We posit that the genetic code initially evolved to synthesize polyglycine as a cross-linking agent to stabilize protocells. We posit that the initial amino acids to enter the code occupied larger sectors of the code that were then invaded by incoming amino acids. Displacements of amino acids follow selection rules. The code sectored from a glycine code to a four amino acid code to an eight amino acid code to an ~16 amino acid code to the standard 20 amino acid code with stops. The proposed patterns of code sectoring are now most apparent from patterns of aminoacyl-tRNA synthetase evolution. The Elongation Factor-Tu GTPase anticodon-codon latch that checks the accuracy of translation appears to have evolved at about the eight amino acid to ~16 amino acid stage. Before evolution of the EF-Tu latch, we posit that both the 1st and 3rd anticodon positions were wobble positions. The genetic code evolved via tRNA charging errors and via enzymatic modifications of amino acids joined to tRNAs, followed by tRNA and aminoacyl-tRNA synthetase differentiation. Fidelity mechanisms froze the code by inhibiting further innovation.
Collapse
|
27
|
Pollack JD, Gerard D, Makhatadze GI, Pearl DK. Evolutionary conservation and structural localizations suggest a physical trace of metabolism’s progressive geochronological emergence. J Biomol Struct Dyn 2019; 38:3700-3719. [DOI: 10.1080/07391102.2019.1679666] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Affiliation(s)
- J. Dennis Pollack
- Department of Molecular Virology, Immunology and Medical Genetics, College of Medicine, The Ohio State University, Columbus, Ohio, USA
| | - David Gerard
- Department of Mathematics and Statistics, American University, Washington, DC, USA
| | - George I. Makhatadze
- Department of Biological Sciences, Rensselaer Polytechnic Institute, Troy, New York, USA
| | - Dennis K. Pearl
- Department of Statistics, Penn State University, University Park, Pennsylvania, USA
| |
Collapse
|
28
|
The Ancient Operational Code is Embedded in the Amino Acid Substitution Matrix and aaRS Phylogenies. J Mol Evol 2019; 88:136-150. [PMID: 31781936 DOI: 10.1007/s00239-019-09918-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2019] [Accepted: 11/14/2019] [Indexed: 10/25/2022]
Abstract
The underlying structure of the canonical amino acid substitution matrix (aaSM) is examined by considering stepwise improvements in the differential recognition of amino acids according to their chemical properties during the branching history of the two aminoacyl-tRNA synthetase (aaRS) superfamilies. The evolutionary expansion of the genetic code is described by a simple parameterization of the aaSM, in which (i) the number of distinguishable amino acid types, (ii) the matrix dimension and (iii) the number of parameters, each increases by one for each bifurcation in an aaRS phylogeny. Parameterized matrices corresponding to trees in which the size of an amino acid sidechain is the only discernible property behind its categorization as a substrate, exclusively for a Class I or II aaRS, provide a significantly better fit to empirically determined aaSM than trees with random bifurcation patterns. A second split between polar and nonpolar amino acids in each Class effects a vastly greater further improvement. The earliest Class-separated epochs in the phylogenies of the aaRS reflect these enzymes' capability to distinguish tRNAs through the recognition of acceptor stem identity elements via the minor (Class I) and major (Class II) helical grooves, which is how the ancient operational code functioned. The advent of tRNA recognition using the anticodon loop supports the evolution of the optimal map of amino acid chemistry found in the later genetic code, an essentially digital categorization, in which polarity is the major functional property, compensating for the unrefined, haphazard differentiation of amino acids achieved by the operational code.
Collapse
|
29
|
Adaptive Properties of the Genetically Encoded Amino Acid Alphabet Are Inherited from Its Subsets. Sci Rep 2019; 9:12468. [PMID: 31462646 PMCID: PMC6713743 DOI: 10.1038/s41598-019-47574-x] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2019] [Accepted: 07/08/2019] [Indexed: 01/11/2023] Open
Abstract
Life uses a common set of 20 coded amino acids (CAAs) to construct proteins. This set was likely canonicalized during early evolution; before this, smaller amino acid sets were gradually expanded as new synthetic, proofreading and coding mechanisms became biologically available. Many possible subsets of the modern CAAs or other presently uncoded amino acids could have comprised the earlier sets. We explore the hypothesis that the CAAs were selectively fixed due to their unique adaptive chemical properties, which facilitate folding, catalysis, and solubility of proteins, and gave adaptive value to organisms able to encode them. Specifically, we studied in silico hypothetical CAA sets of 3–19 amino acids comprised of 1913 structurally diverse α-amino acids, exploring the adaptive value of their combined physicochemical properties relative to those of the modern CAA set. We find that even hypothetical sets containing modern CAA members are especially adaptive; it is difficult to find sets even among a large choice of alternatives that cover the chemical property space more amply. These results suggest that each time a CAA was discovered and embedded during evolution, it provided an adaptive value unusual among many alternatives, and each selective step may have helped bootstrap the developing set to include still more CAAs.
Collapse
|
30
|
Carter CW, Wills PR. Interdependence, Reflexivity, Fidelity, Impedance Matching, and the Evolution of Genetic Coding. Mol Biol Evol 2018; 35:269-286. [PMID: 29077934 PMCID: PMC5850816 DOI: 10.1093/molbev/msx265] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Genetic coding is generally thought to have required ribozymes whose functions were taken over by polypeptide aminoacyl-tRNA synthetases (aaRS). Two discoveries about aaRS and their interactions with tRNA substrates now furnish a unifying rationale for the opposite conclusion: that the key processes of the Central Dogma of molecular biology emerged simultaneously and naturally from simple origins in a peptide•RNA partnership, eliminating the epistemological utility of a prior RNA world. First, the two aaRS classes likely arose from opposite strands of the same ancestral gene, implying a simple genetic alphabet. The resulting inversion symmetries in aaRS structural biology would have stabilized the initial and subsequent differentiation of coding specificities, rapidly promoting diversity in the proteome. Second, amino acid physical chemistry maps onto tRNA identity elements, establishing reflexive, nanoenvironmental sensing in protein aaRS. Bootstrapping of increasingly detailed coding is thus intrinsic to polypeptide aaRS, but impossible in an RNA world. These notions underline the following concepts that contradict gradual replacement of ribozymal aaRS by polypeptide aaRS: 1) aaRS enzymes must be interdependent; 2) reflexivity intrinsic to polypeptide aaRS production dynamics promotes bootstrapping; 3) takeover of RNA-catalyzed aminoacylation by enzymes will necessarily degrade specificity; and 4) the Central Dogma's emergence is most probable when replication and translation error rates remain comparable. These characteristics are necessary and sufficient for the essentially de novo emergence of a coupled gene-replicase-translatase system of genetic coding that would have continuously preserved the functional meaning of genetically encoded protein genes whose phylogenetic relationships match those observed today.
Collapse
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Peter R Wills
- Department of Physics, University of Auckland, Auckland, New Zealand
| |
Collapse
|
31
|
Fujishima K, Wang KM, Palmer JA, Abe N, Nakahigashi K, Endy D, Rothschild LJ. Reconstruction of cysteine biosynthesis using engineered cysteine-free enzymes. Sci Rep 2018; 8:1776. [PMID: 29379050 PMCID: PMC5788988 DOI: 10.1038/s41598-018-19920-y] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2016] [Accepted: 01/03/2018] [Indexed: 12/28/2022] Open
Abstract
Amino acid biosynthesis pathways observed in nature typically require enzymes that are made with the amino acids they produce. For example, Escherichia coli produces cysteine from serine via two enzymes that contain cysteine: serine acetyltransferase (CysE) and O-acetylserine sulfhydrylase (CysK/CysM). To solve this chicken-and-egg problem, we substituted alternate amino acids in CysE, CysK and CysM for cysteine and methionine, which are the only two sulfur-containing proteinogenic amino acids. Using a cysteine-dependent auxotrophic E. coli strain, CysE function was rescued by cysteine-free and methionine-deficient enzymes, and CysM function was rescued by cysteine-free enzymes. CysK function, however, was not rescued in either case. Enzymatic assays showed that the enzymes responsible for rescuing the function in CysE and CysM also retained their activities in vitro. Additionally, substitution of the two highly conserved methionines in CysM decreased but did not eliminate overall activity. Engineering amino acid biosynthetic enzymes to lack the so-produced amino acids can provide insights into, and perhaps eventually fully recapitulate via a synthetic approach, the biogenesis of biotic amino acids.
Collapse
Affiliation(s)
- Kosuke Fujishima
- Earth-Life Science Institute, Tokyo Institute of Technology, Tokyo, 1528550, Japan.,Universities Space Research Association, NASA Ames Research Center, Moffett Field, California, 94035, USA.,Institute for Advanced Biosciences, Keio University, Tsuruoka, 9970035, Japan
| | - Kendrick M Wang
- Stanford University Department of Bioengineering, Stanford, California, 94305, USA
| | - Jesse A Palmer
- Universities Space Research Association, NASA Ames Research Center, Moffett Field, California, 94035, USA
| | - Nozomi Abe
- Institute for Advanced Biosciences, Keio University, Tsuruoka, 9970035, Japan
| | - Kenji Nakahigashi
- Institute for Advanced Biosciences, Keio University, Tsuruoka, 9970035, Japan.,Spiber Inc. 234-1 Mizukami, Kakuganji, Tsuruoka, 9970052, Japan
| | - Drew Endy
- Stanford University Department of Bioengineering, Stanford, California, 94305, USA
| | | |
Collapse
|
32
|
Akanuma S. Characterization of Reconstructed Ancestral Proteins Suggests a Change in Temperature of the Ancient Biosphere. Life (Basel) 2017; 7:life7030033. [PMID: 28783077 PMCID: PMC5617958 DOI: 10.3390/life7030033] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2017] [Revised: 08/02/2017] [Accepted: 08/03/2017] [Indexed: 01/02/2023] Open
Abstract
Understanding the evolution of ancestral life, and especially the ability of some organisms to flourish in the variable environments experienced in Earth’s early biosphere, requires knowledge of the characteristics and the environment of these ancestral organisms. Information about early life and environmental conditions has been obtained from fossil records and geological surveys. Recent advances in phylogenetic analysis, and an increasing number of protein sequences available in public databases, have made it possible to infer ancestral protein sequences possessed by ancient organisms. However, the in silico studies that assess the ancestral base content of ribosomal RNAs, the frequency of each amino acid in ancestral proteins, and estimate the environmental temperatures of ancient organisms, show conflicting results. The characterization of ancestral proteins reconstructed in vitro suggests that ancient organisms had very thermally stable proteins, and therefore were thermophilic or hyperthermophilic. Experimental data supports the idea that only thermophilic ancestors survived the catastrophic increase in temperature of the biosphere that was likely associated with meteorite impacts during the early history of Earth. In addition, by expanding the timescale and including more ancestral proteins for reconstruction, it appears as though the Earth’s surface temperature gradually decreased over time, from Archean to present.
Collapse
Affiliation(s)
- Satoshi Akanuma
- Faculty of Human Sciences, Waseda University, 2-579-15 Mikajima, Tokorozawa, Saitama 359-1192, Japan.
| |
Collapse
|
33
|
Cantine MD, Fournier GP. Environmental Adaptation from the Origin of Life to the Last Universal Common Ancestor. ORIGINS LIFE EVOL B 2017; 48:35-54. [PMID: 28685374 DOI: 10.1007/s11084-017-9542-5] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2017] [Accepted: 06/15/2017] [Indexed: 01/03/2023]
Abstract
Extensive fundamental molecular and biological evolution took place between the prebiotic origins of life and the state of the Last Universal Common Ancestor (LUCA). Considering the evolutionary innovations between these two endpoints from the perspective of environmental adaptation, we explore the hypothesis that LUCA was temporally, spatially, and environmentally distinct from life's earliest origins in an RNA world. Using this lens, we interpret several molecular biological features as indicating an environmental transition between a cold, radiation-shielded origin of life and a mesophilic, surface-dwelling LUCA. Cellularity provides motility and permits Darwinian evolution by connecting genetic material and its products, and thus establishing heredity and lineage. Considering the importance of compartmentalization and motility, we propose that the early emergence of cellularity is required for environmental dispersal and diversification during these transitions. Early diversification and the emergence of ecology before LUCA could be an important pre-adaptation for life's persistence on a changing planet.
Collapse
Affiliation(s)
- Marjorie D Cantine
- Department of Earth, Atmospheric and Planetary Sciences, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA, 02139, USA
| | - Gregory P Fournier
- Department of Earth, Atmospheric and Planetary Sciences, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA, 02139, USA.
| |
Collapse
|
34
|
Mukai T, Reynolds NM, Crnković A, Söll D. Bioinformatic Analysis Reveals Archaeal tRNA Tyr and tRNA Trp Identities in Bacteria. Life (Basel) 2017; 7:life7010008. [PMID: 28230768 PMCID: PMC5370408 DOI: 10.3390/life7010008] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2017] [Revised: 02/15/2017] [Accepted: 02/17/2017] [Indexed: 12/01/2022] Open
Abstract
The tRNA identity elements for some amino acids are distinct between the bacterial and archaeal domains. Searching in recent genomic and metagenomic sequence data, we found some candidate phyla radiation (CPR) bacteria with archaeal tRNA identity for Tyr-tRNA and Trp-tRNA synthesis. These bacteria possess genes for tyrosyl-tRNA synthetase (TyrRS) and tryptophanyl-tRNA synthetase (TrpRS) predicted to be derived from DPANN superphylum archaea, while the cognate tRNATyr and tRNATrp genes reveal bacterial or archaeal origins. We identified a trace of domain fusion and swapping in the archaeal-type TyrRS gene of a bacterial lineage, suggesting that CPR bacteria may have used this mechanism to create diverse proteins. Archaeal-type TrpRS of bacteria and a few TrpRS species of DPANN archaea represent a new phylogenetic clade (named TrpRS-A). The TrpRS-A open reading frames (ORFs) are always associated with another ORF (named ORF1) encoding an unknown protein without global sequence identity to any known protein. However, our protein structure prediction identified a putative HIGH-motif and KMSKS-motif as well as many α-helices that are characteristic of class I aminoacyl-tRNA synthetase (aaRS) homologs. These results provide another example of the diversity of molecular components that implement the genetic code and provide a clue to the early evolution of life and the genetic code.
Collapse
Affiliation(s)
- Takahito Mukai
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA.
| | - Noah M Reynolds
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA.
| | - Ana Crnković
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA.
| | - Dieter Söll
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA.
- Department of Chemistry, Yale University, New Haven, CT 06520, USA.
| |
Collapse
|
35
|
Jakubowski H. Homocysteine Editing, Thioester Chemistry, Coenzyme A, and the Origin of Coded Peptide Synthesis †. Life (Basel) 2017; 7:life7010006. [PMID: 28208756 PMCID: PMC5370406 DOI: 10.3390/life7010006] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2017] [Accepted: 02/03/2017] [Indexed: 12/22/2022] Open
Abstract
Aminoacyl-tRNA synthetases (AARSs) have evolved “quality control” mechanisms which prevent tRNA aminoacylation with non-protein amino acids, such as homocysteine, homoserine, and ornithine, and thus their access to the Genetic Code. Of the ten AARSs that possess editing function, five edit homocysteine: Class I MetRS, ValRS, IleRS, LeuRS, and Class II LysRS. Studies of their editing function reveal that catalytic modules of these AARSs have a thiol-binding site that confers the ability to catalyze the aminoacylation of coenzyme A, pantetheine, and other thiols. Other AARSs also catalyze aminoacyl-thioester synthesis. Amino acid selectivity of AARSs in the aminoacyl thioesters formation reaction is relaxed, characteristic of primitive amino acid activation systems that may have originated in the Thioester World. With homocysteine and cysteine as thiol substrates, AARSs support peptide bond synthesis. Evolutionary origin of these activities is revealed by genomic comparisons, which show that AARSs are structurally related to proteins involved in coenzyme A/sulfur metabolism and non-coded peptide bond synthesis. These findings suggest that the extant AARSs descended from ancestral forms that were involved in non-coded Thioester-dependent peptide synthesis, functionally similar to the present-day non-ribosomal peptide synthetases.
Collapse
Affiliation(s)
- Hieronim Jakubowski
- Department of Microbiology, Biochemistry and Molecular Genetics, New Jersey Medical School, Rutgers University, Newark, NJ 07103, USA.
- Department of Biochemistry and Biotechnology, University of Life Sciences, Poznan 60-632, Poland.
| |
Collapse
|
36
|
Carter CW. Coding of Class I and II Aminoacyl-tRNA Synthetases. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2017; 966:103-148. [PMID: 28828732 PMCID: PMC5927602 DOI: 10.1007/5584_2017_93] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
The aminoacyl-tRNA synthetases and their cognate transfer RNAs translate the universal genetic code. The twenty canonical amino acids are sufficiently diverse to create a selective advantage for dividing amino acid activation between two distinct, apparently unrelated superfamilies of synthetases, Class I amino acids being generally larger and less polar, Class II amino acids smaller and more polar. Biochemical, bioinformatic, and protein engineering experiments support the hypothesis that the two Classes descended from opposite strands of the same ancestral gene. Parallel experimental deconstructions of Class I and II synthetases reveal parallel losses in catalytic proficiency at two novel modular levels-protozymes and Urzymes-associated with the evolution of catalytic activity. Bi-directional coding supports an important unification of the proteome; affords a genetic relatedness metric-middle base-pairing frequencies in sense/antisense alignments-that probes more deeply into the evolutionary history of translation than do single multiple sequence alignments; and has facilitated the analysis of hitherto unknown coding relationships in tRNA sequences. Reconstruction of native synthetases by modular thermodynamic cycles facilitated by domain engineering emphasizes the subtlety associated with achieving high specificity, shedding new light on allosteric relationships in contemporary synthetases. Synthetase Urzyme structural biology suggests that they are catalytically-active molten globules, broadening the potential manifold of polypeptide catalysts accessible to primitive genetic coding and motivating revisions of the origins of catalysis. Finally, bi-directional genetic coding of some of the oldest genes in the proteome places major limitations on the likelihood that any RNA World preceded the origins of coded proteins.
Collapse
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599-7260, USA.
| |
Collapse
|
37
|
Kwon I, Choi ES. Forced Ambiguity of the Leucine Codons for Multiple-Site-Specific Incorporation of a Noncanonical Amino Acid. PLoS One 2016; 11:e0152826. [PMID: 27028506 PMCID: PMC4814082 DOI: 10.1371/journal.pone.0152826] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2015] [Accepted: 03/03/2016] [Indexed: 11/24/2022] Open
Abstract
Multiple-site-specific incorporation of a noncanonical amino acid into a recombinant protein would be a very useful technique to generate multiple chemical handles for bioconjugation and multivalent binding sites for the enhanced interaction. Previously combination of a mutant yeast phenylalanyl-tRNA synthetase variant and the yeast phenylalanyl-tRNA containing the AAA anticodon was used to incorporate a noncanonical amino acid into multiple UUU phenylalanine (Phe) codons in a site-specific manner. However, due to the less selective codon recognition of the AAA anticodon, there was significant misincorporation of a noncanonical amino acid into unwanted UUC Phe codons. To enhance codon selectivity, we explored degenerate leucine (Leu) codons instead of Phe degenerate codons. Combined use of the mutant yeast phenylalanyl-tRNA containing the CAA anticodon and the yPheRS_naph variant allowed incorporation of a phenylalanine analog, 2-naphthylalanine, into murine dihydrofolate reductase in response to multiple UUG Leu codons, but not to other Leu codon sites. Despite the moderate UUG codon occupancy by 2-naphthylalaine, these results successfully demonstrated that the concept of forced ambiguity of the genetic code can be achieved for the Leu codons, available for multiple-site-specific incorporation.
Collapse
Affiliation(s)
- Inchan Kwon
- School of Materials Science and Engineering, Gwangju Institute of Science and Technology (GIST), Gwangju, Republic of Korea
- Department of Chemical Engineering, University of Virginia, Charlottesville, Virginia, United States of America
- * E-mail:
| | - Eun Sil Choi
- School of Materials Science and Engineering, Gwangju Institute of Science and Technology (GIST), Gwangju, Republic of Korea
- Department of Biological Sciences, College of Natural Sciences, Chonnam National University, Gwangju, Republic of Korea
| |
Collapse
|
38
|
Wills PR. The generation of meaningful information in molecular systems. PHILOSOPHICAL TRANSACTIONS. SERIES A, MATHEMATICAL, PHYSICAL, AND ENGINEERING SCIENCES 2016; 374:rsta.2015.0066. [PMID: 26857673 DOI: 10.1098/rsta.2015.0066] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 07/17/2015] [Indexed: 06/05/2023]
Abstract
The physico-chemical processes occurring inside cells are under the computational control of genetic (DNA) and epigenetic (internal structural) programming. The origin and evolution of genetic information (nucleic acid sequences) is reasonably well understood, but scant attention has been paid to the origin and evolution of the molecular biological interpreters that give phenotypic meaning to the sequence information that is quite faithfully replicated during cellular reproduction. The near universality and age of the mapping from nucleotide triplets to amino acids embedded in the functionality of the protein synthetic machinery speaks to the early development of a system of coding which is still extant in every living organism. We take the origin of genetic coding as a paradigm of the emergence of computation in natural systems, focusing on the requirement that the molecular components of an interpreter be synthesized autocatalytically. Within this context, it is seen that interpreters of increasing complexity are generated by series of transitions through stepped dynamic instabilities (non-equilibrium phase transitions). The early phylogeny of the amino acyl-tRNA synthetase enzymes is discussed in such terms, leading to the conclusion that the observed optimality of the genetic code is a natural outcome of the processes of self-organization that produced it.
Collapse
Affiliation(s)
- Peter R Wills
- Department of Physics, University of Auckland, PB 92019, Auckland 1142, Aotearoa, New Zealand
| |
Collapse
|
39
|
Akanuma S, Yokobori SI, Nakajima Y, Bessho M, Yamagishi A. Robustness of predictions of extremely thermally stable proteins in ancient organisms. Evolution 2015; 69:2954-62. [PMID: 26404857 DOI: 10.1111/evo.12779] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2015] [Revised: 09/08/2015] [Accepted: 09/10/2015] [Indexed: 01/06/2023]
Abstract
A number of studies have addressed the environmental temperatures experienced by ancient life. Computational studies using a nonhomogeneous evolution model have estimated ancestral G + C contents of ribosomal RNAs and the amino acid compositions of ancestral proteins, generating hypotheses regarding the mesophilic last universal common ancestor. In contrast, our previous study computationally reconstructed ancestral amino acid sequences of nucleoside diphosphate kinases using a homogeneous model and then empirically resurrected the ancestral proteins. The thermal stabilities of these ancestral proteins were equivalent to or greater than those of extant homologous thermophilic proteins, supporting the thermophilic universal ancestor theory. In this study, we reinferred ancestral sequences using a dataset from which hyperthermophilic sequences were excluded. We also reinferred ancestral sequences using a nonhomogeneous evolution model. The newly reconstructed ancestral proteins are still thermally stable, further supporting the hypothesis that the ancient organisms contained thermally stable proteins and therefore that they were thermophilic.
Collapse
Affiliation(s)
- Satoshi Akanuma
- Department of Applied Life Sciences, Tokyo University of Pharmacy and Life Sciences, Hachioji, Tokyo, 192-0392, Japan.,Faculty of Human Sciences, Waseda University, Tokorozawa, Saitama, 359-1192, Japan
| | - Shin-ichi Yokobori
- Department of Applied Life Sciences, Tokyo University of Pharmacy and Life Sciences, Hachioji, Tokyo, 192-0392, Japan
| | - Yoshiki Nakajima
- Department of Applied Life Sciences, Tokyo University of Pharmacy and Life Sciences, Hachioji, Tokyo, 192-0392, Japan
| | - Mizumo Bessho
- Department of Applied Life Sciences, Tokyo University of Pharmacy and Life Sciences, Hachioji, Tokyo, 192-0392, Japan
| | - Akihiko Yamagishi
- Department of Applied Life Sciences, Tokyo University of Pharmacy and Life Sciences, Hachioji, Tokyo, 192-0392, Japan.
| |
Collapse
|
40
|
Hoesl MG, Oehm S, Durkin P, Darmon E, Peil L, Aerni HR, Rappsilber J, Rinehart J, Leach D, Söll D, Budisa N. Chemical Evolution of a Bacterial Proteome. Angew Chem Int Ed Engl 2015; 54:10030-4. [PMID: 26136259 PMCID: PMC4782924 DOI: 10.1002/anie.201502868] [Citation(s) in RCA: 57] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2015] [Indexed: 11/09/2022]
Abstract
We have changed the amino acid set of the genetic code of Escherichia coli by evolving cultures capable of growing on the synthetic noncanonical amino acid L-β-(thieno[3,2-b]pyrrolyl)alanine ([3,2]Tpa) as a sole surrogate for the canonical amino acid L-tryptophan (Trp). A long-term cultivation experiment in defined synthetic media resulted in the evolution of cells capable of surviving Trp→[3,2]Tpa substitutions in their proteomes in response to the 20,899 TGG codons of the E. coli W3110 genome. These evolved bacteria with new-to-nature amino acid composition showed robust growth in the complete absence of Trp. Our experimental results illustrate an approach for the evolution of synthetic cells with alternative biochemical building blocks.
Collapse
Affiliation(s)
- Michael Georg Hoesl
- Institut für Chemie, Technische Universität Berlin, Müller-Breslau-Strasse 10, 10623 Berlin (Germany)
| | - Stefan Oehm
- Institut für Chemie, Technische Universität Berlin, Müller-Breslau-Strasse 10, 10623 Berlin (Germany)
| | - Patrick Durkin
- Institut für Chemie, Technische Universität Berlin, Müller-Breslau-Strasse 10, 10623 Berlin (Germany)
| | - Elise Darmon
- Institute of Cell Biology, School of Biological Sciences, University of Edinburgh (UK)
| | - Lauri Peil
- Wellcome Trust Centre for Cell Biology, School of Biological Sciences, University of Edinburgh, 4.17 Michael Swann Building, Edinburgh EH9 3BF (UK)
| | - Hans-Rudolf Aerni
- Systems Biology Institute, Yale University, West Haven, CT 06516 (USA)
| | - Juri Rappsilber
- Institut für Biotechnolgie, Technische Universität Berlin, Gustav-Meyer-Allee 25, 13355 Berlin (Germany)
- Wellcome Trust Centre for Cell Biology, School of Biological Sciences, University of Edinburgh, 4.17 Michael Swann Building, Edinburgh EH9 3BF (UK)
| | - Jesse Rinehart
- Systems Biology Institute, Yale University, West Haven, CT 06516 (USA)
| | - David Leach
- Institute of Cell Biology, School of Biological Sciences, University of Edinburgh (UK)
| | - Dieter Söll
- Department of Molecular Biophysics and Biochemistry and Department of Chemistry, Yale University, New Haven, CT 06520 (USA)
| | - Nediljko Budisa
- Institut für Chemie, Technische Universität Berlin, Müller-Breslau-Strasse 10, 10623 Berlin (Germany).
| |
Collapse
|
41
|
Hoesl MG, Oehm S, Durkin P, Darmon E, Peil L, Aerni HR, Rappsilber J, Rinehart J, Leach D, Söll D, Budisa N. Chemische Evolution eines bakteriellen Proteoms. Angew Chem Int Ed Engl 2015. [DOI: 10.1002/ange.201502868] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|