1
|
Wesp V, Theißen G, Schuster S. Statistical analysis of synonymous and stop codons in pseudo-random and real sequences as a function of GC content. Sci Rep 2023; 13:22996. [PMID: 38151539 PMCID: PMC10752896 DOI: 10.1038/s41598-023-49626-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Accepted: 12/10/2023] [Indexed: 12/29/2023] Open
Abstract
Knowledge of the frequencies of synonymous triplets in protein-coding and non-coding DNA stretches can be used in gene finding. These frequencies depend on the GC content of the genome or parts of it. An example of interest is provided by stop codons. This is relevant for the definition of Open Reading Frames. A generic case is provided by pseudo-random sequences, especially when they code for complex proteins or when they are non-coding and not subject to selection pressure. Here, we calculate, for such sequences and for all 25 known genetic codes, the frequency of each amino acid and stop codon based on their set of codons and as a function of GC content. The amino acids can be classified into five groups according to the GC content where their expected frequency reaches its maximum. We determine the overall Shannon information based on groups of synonymous codons and show that it becomes maximum at a percent GC of 43.3% (for the standard code). This is in line with the observation that in most fungi, plants, and animals, this genomic parameter is in the range from 35 to 50%. By analysing natural sequences, we show that there is a clear bias for triplets corresponding to stop codons near the 5'- and 3'-splice sites in the introns of various clades.
Collapse
Affiliation(s)
- Valentin Wesp
- Department of Bioinformatics, Matthias Schleiden Institute, Friedrich Schiller University Jena, Ernst-Abbe-Platz 2, 07743, Jena, Germany
| | - Günter Theißen
- Department of Genetics, Matthias Schleiden Institute, Friedrich Schiller University Jena, Philosophenweg 12, 07743, Jena, Germany
| | - Stefan Schuster
- Department of Bioinformatics, Matthias Schleiden Institute, Friedrich Schiller University Jena, Ernst-Abbe-Platz 2, 07743, Jena, Germany.
| |
Collapse
|
2
|
Brown SM, Mayer-Bacon C, Freeland S. Xeno Amino Acids: A Look into Biochemistry as We Do Not Know It. Life (Basel) 2023; 13:2281. [PMID: 38137883 PMCID: PMC10744825 DOI: 10.3390/life13122281] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Revised: 11/18/2023] [Accepted: 11/20/2023] [Indexed: 12/24/2023] Open
Abstract
Would another origin of life resemble Earth's biochemical use of amino acids? Here, we review current knowledge at three levels: (1) Could other classes of chemical structure serve as building blocks for biopolymer structure and catalysis? Amino acids now seem both readily available to, and a plausible chemical attractor for, life as we do not know it. Amino acids thus remain important and tractable targets for astrobiological research. (2) If amino acids are used, would we expect the same L-alpha-structural subclass used by life? Despite numerous ideas, it is not clear why life favors L-enantiomers. It seems clearer, however, why life on Earth uses the shortest possible (alpha-) amino acid backbone, and why each carries only one side chain. However, assertions that other backbones are physicochemically impossible have relaxed into arguments that they are disadvantageous. (3) Would we expect a similar set of side chains to those within the genetic code? Many plausible alternatives exist. Furthermore, evidence exists for both evolutionary advantage and physicochemical constraint as explanatory factors for those encoded by life. Overall, as focus shifts from amino acids as a chemical class to specific side chains used by post-LUCA biology, the probable role of physicochemical constraint diminishes relative to that of biological evolution. Exciting opportunities now present themselves for laboratory work and computing to explore how changing the amino acid alphabet alters the universe of protein folds. Near-term milestones include: (a) expanding evidence about amino acids as attractors within chemical evolution; (b) extending characterization of other backbones relative to biological proteins; and (c) merging computing and laboratory explorations of structures and functions unlocked by xeno peptides.
Collapse
|
3
|
Dörrenhaus R, Wagner PK, Kath-Schorr S. Two are not enough: synthetic strategies and applications of unnatural base pairs. Biol Chem 2023; 404:883-896. [PMID: 37354104 DOI: 10.1515/hsz-2023-0169] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Accepted: 06/05/2023] [Indexed: 06/26/2023]
Abstract
Nucleic acid chemistry is a rapidly evolving field, and the need for novel nucleotide modifications and artificial nucleotide building blocks for diagnostic and therapeutic use, material science or for studying cellular processes continues unabated. This review focusses on the development and application of unnatural base pairs as part of an expanded genetic alphabet. Not only recent developments in "nature-like" artificial base pairs are presented, but also current synthetic methods to get access to C-glycosidic nucleotides. Wide-ranging viability in synthesis is a prerequisite for the successful use of unnatural base pairs in a broader spectrum and will be discussed.
Collapse
|
4
|
Fried SD, Fujishima K, Makarov M, Cherepashuk I, Hlouchova K. Peptides before and during the nucleotide world: an origins story emphasizing cooperation between proteins and nucleic acids. J R Soc Interface 2022; 19:20210641. [PMID: 35135297 PMCID: PMC8833103 DOI: 10.1098/rsif.2021.0641] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Recent developments in Origins of Life research have focused on substantiating the narrative of an abiotic emergence of nucleic acids from organic molecules of low molecular weight, a paradigm that typically sidelines the roles of peptides. Nevertheless, the simple synthesis of amino acids, the facile nature of their activation and condensation, their ability to recognize metals and cofactors and their remarkable capacity to self-assemble make peptides (and their analogues) favourable candidates for one of the earliest functional polymers. In this mini-review, we explore the ramifications of this hypothesis. Diverse lines of research in molecular biology, bioinformatics, geochemistry, biophysics and astrobiology provide clues about the progression and early evolution of proteins, and lend credence to the idea that early peptides served many central prebiotic roles before they were encodable by a polynucleotide template, in a putative 'peptide-polynucleotide stage'. For example, early peptides and mini-proteins could have served as catalysts, compartments and structural hubs. In sum, we shed light on the role of early peptides and small proteins before and during the nucleotide world, in which nascent life fully grasped the potential of primordial proteins, and which has left an imprint on the idiosyncratic properties of extant proteins.
Collapse
Affiliation(s)
- Stephen D Fried
- Department of Chemistry, Johns Hopkins University, Baltimore, MD 21212, USA.,Department of Biophysics, Johns Hopkins University, Baltimore, MD 21212, USA
| | - Kosuke Fujishima
- Earth-Life Science Institute, Tokyo Institute of Technology, Tokyo 1528550, Japan.,Graduate School of Media and Governance, Keio University, Fujisawa 2520882, Japan
| | - Mikhail Makarov
- Department of Cell Biology, Faculty of Science, Charles University, BIOCEV, Prague 12800, Czech Republic
| | - Ivan Cherepashuk
- Department of Cell Biology, Faculty of Science, Charles University, BIOCEV, Prague 12800, Czech Republic
| | - Klara Hlouchova
- Department of Cell Biology, Faculty of Science, Charles University, BIOCEV, Prague 12800, Czech Republic.,Institute of Organic Chemistry and Biochemistry, Czech Academy of Sciences, Prague 16610, Czech Republic
| |
Collapse
|
5
|
Habermann FA, Kaltner H, Higuero AM, García Caballero G, Ludwig AK, C. Manning J, Abad-Rodríguez J, Gabius HJ. What Cyto- and Histochemistry Can Do to Crack the Sugar Code. Acta Histochem Cytochem 2021; 54:31-48. [PMID: 34012175 PMCID: PMC8116616 DOI: 10.1267/ahc.21-00017] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Accepted: 02/18/2021] [Indexed: 12/13/2022] Open
Abstract
As letters form the vocabulary of a language, biochemical 'symbols' (the building blocks of oligo- and polymers) make writing molecular messages possible. Compared to nucleotides and amino acids, sugars have chemical properties that facilitate to reach an unsurpassed level of oligomer diversity. These glycans are a part of the ubiquitous cellular glycoconjugates. Cyto- and histochemically, the glycans' structural complexity is mapped by glycophenotyping of cells and tissues using receptors ('readers', thus called lectins), hereby revealing its dynamic spatiotemporal regulation: these data support the concept of a sugar code. When proceeding from work with plant (haem)agglutinins as such tools to the discovery of endogenous (tissue) lectins, it became clear that a broad panel of biological meanings can indeed be derived from the sugar-based vocabulary (the natural glycome incl. post-synthetic modifications) by glycan-lectin recognition in situ. As consequence, the immunocyto- and histochemical analysis of lectin expression is building a solid basis for the steps toward tracking down functional correlations, for example in processes leading to cell adhesion, apoptosis, autophagy or growth regulation as well as targeted delivery of glycoproteins. Introduction of labeled tissue lectins to glycan profiling assists this endeavor by detecting counterreceptor(s) in situ. Combining these tools and their applications strategically will help to take the trip toward the following long-range aim: to compile a dictionary for the glycan vocabulary that translates each message (oligosaccharide) into its bioresponse(s), that is to crack the sugar code.
Collapse
Affiliation(s)
- Felix A. Habermann
- Institute of Anatomy, Histology and Embryology, Faculty of Veterinary Medicine, Ludwig-Maximilians-University Munich, Veterinärstr. 13, 80539 Munich, Germany
| | - Herbert Kaltner
- Institute of Physiological Chemistry, Faculty of Veterinary Medicine, Ludwig-Maximilians-University Munich, Veterinärstr. 13, 80539 Munich, Germany
| | - Alonso M. Higuero
- Membrane and Axonal Repair Laboratory, National Hospital for Paraplegics (SESCAM), Finca La Peraleda s/n, 45071 Toledo, Spain
| | - Gabriel García Caballero
- Institute of Physiological Chemistry, Faculty of Veterinary Medicine, Ludwig-Maximilians-University Munich, Veterinärstr. 13, 80539 Munich, Germany
| | - Anna-Kristin Ludwig
- Institute of Physiological Chemistry, Faculty of Veterinary Medicine, Ludwig-Maximilians-University Munich, Veterinärstr. 13, 80539 Munich, Germany
| | - Joachim C. Manning
- Institute of Physiological Chemistry, Faculty of Veterinary Medicine, Ludwig-Maximilians-University Munich, Veterinärstr. 13, 80539 Munich, Germany
| | - José Abad-Rodríguez
- Membrane and Axonal Repair Laboratory, National Hospital for Paraplegics (SESCAM), Finca La Peraleda s/n, 45071 Toledo, Spain
| | - Hans-Joachim Gabius
- Institute of Physiological Chemistry, Faculty of Veterinary Medicine, Ludwig-Maximilians-University Munich, Veterinärstr. 13, 80539 Munich, Germany
| |
Collapse
|
6
|
A search for the physical basis of the genetic code. Biosystems 2020; 195:104148. [DOI: 10.1016/j.biosystems.2020.104148] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2020] [Revised: 04/09/2020] [Accepted: 04/09/2020] [Indexed: 01/01/2023]
|
7
|
Subramanian H, Brown J, Gatenby R. Prebiotic competition and evolution in self-replicating polynucleotides can explain the properties of DNA/RNA in modern living systems. BMC Evol Biol 2020; 20:75. [PMID: 32590933 PMCID: PMC7318430 DOI: 10.1186/s12862-020-01641-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2019] [Accepted: 06/17/2020] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND We hypothesize prebiotic evolution of self-replicating macro-molecules (Alberts, Molecular biology of the cell, 2015; Orgel, Crit Rev Biochem Mol Biol 39:99-123, 2004; Hud, Nat Commun 9:5171) favoured the constituent nucleotides and biophysical properties observed in the RNA and DNA of modern organisms. Assumed initial conditions are a shallow tide pool, containing a racemic mix of diverse nucleotide monomers (Barks et al., Chembiochem 11:1240-1243, 2010; Krishnamurthy, Nat Commun 9:5175, 2018; Hirao, Curr Opin Chem Biol 10:622-627), subject to day/night thermal fluctuations (Piccirilli et al., Nature 343:33-37, 1990). Self-replication, like Polymerase Chain Reactions, followed as higher daytime thermal energy "melted" inter-strand hydrogen bonds causing strand separation while solar UV radiation increased prebiotic nucleobase formation (Szathmary, Proc Biol Sci 245:91-99, 1991; Materese et al., Astrobiology 17:761-770, 2017; Bera et al., Astrobiology 17:771-785, 2017). Lower night energies allowed free monomers to form hydrogen bonds with their template counterparts leading to daughter strand synthesis (Hirao, Biotechniques 40:711, 2006). RESULTS Evolutionary selection favoured increasing strand length to maximize auto-catalytic function in RNA and polymer stability in double stranded DNA (Krishnamurthy, Chemistry 24:16708-16715, 2018; Szathmary, Nat Rev Genet 4:995-1001, 2003). However, synthesis of the full daughter strand before daytime temperatures produced strand separation, longer polymer length required increased speed of self-replication. Computer simulations demonstrate optimal polynucleotide autocatalytic speed is achieved when the constituent nucleotides possess a left-right asymmetry that decreases the hydrogen bond kinetic barrier for the free nucleotide attachment to the template on one side and increases bond barrier on the other side preventing it from releasing prior to covalent bond formation. This phenomenon is similar to asymmetric kinetics observed during polymerization of the front and the back ends of linear cytoskeletal proteins such as actin and microtubules (Orgel, Nature 343:18-20, 1990; Henry, Curr Opin Chem Biol 7:727-733, 2003; Walker et al., J Cell Biol 108:931-937, 1989; Crevenna et al., J Biol Chem 288:12102-12113, 2013). Since rotation of the nucleotide would disrupt the asymmetry, the optimal nucleotides must form two or more hydrogen bonds with their counterpart on the template strand. All nucleotides in modern RNA and DNA have these predicted properties. Our models demonstrate these constraints on the properties of constituent monomers result in biophysical properties found in modern DNA and RNA including strand directionality, anti-parallel strand orientation, homochirality, quadruplet alphabet, and complementary base pairing. Furthermore, competition between RNA and DNA auto-replicators for 3 nucleotides in common permit states coexistence and possible cooperative interactions that could be incorporated into nascent living systems. CONCLUSION Our findings demonstrate the molecular properties of DNA/RNA could have emerged from Darwinian competition among macromolecular replicators that selected nucleotide monomers that maximized the speed of autocatalysis.
Collapse
Affiliation(s)
- Hemachander Subramanian
- Cancer Biology and Evolution Program, Tampa, FL, 33612, USA.,Integrated Mathematical Oncology Department, H. Lee Moffitt Cancer Center, 12902 Magnolia Dr, Tampa, FL, 33612, USA.,Present Address: Department of Physics, National Institute of Technology, Durgapur, West Bengal, India
| | - Joel Brown
- Cancer Biology and Evolution Program, Tampa, FL, 33612, USA.,Integrated Mathematical Oncology Department, H. Lee Moffitt Cancer Center, 12902 Magnolia Dr, Tampa, FL, 33612, USA
| | - Robert Gatenby
- Cancer Biology and Evolution Program, Tampa, FL, 33612, USA. .,Integrated Mathematical Oncology Department, H. Lee Moffitt Cancer Center, 12902 Magnolia Dr, Tampa, FL, 33612, USA.
| |
Collapse
|
8
|
Taylor AI, Houlihan G, Holliger P. Beyond DNA and RNA: The Expanding Toolbox of Synthetic Genetics. Cold Spring Harb Perspect Biol 2019; 11:11/6/a032490. [PMID: 31160351 DOI: 10.1101/cshperspect.a032490] [Citation(s) in RCA: 38] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
The remarkable physicochemical properties of the natural nucleic acids, DNA and RNA, define modern biology at the molecular level and are widely believed to have been central to life's origins. However, their ability to form repositories of information as well as functional structures such as ligands (aptamers) and catalysts (ribozymes/DNAzymes) is not unique. A range of nonnatural alternatives, collectively termed xeno nucleic acids (XNAs), are also capable of supporting genetic information storage and propagation as well as evolution. This gives rise to a new field of "synthetic genetics," which seeks to expand the nucleic acid chemical toolbox for applications in both biotechnology and molecular medicine. In this review, we outline XNA polymerase and reverse transcriptase engineering as a key enabling technology and summarize the application of "synthetic genetics" to the development of aptamers, enzymes, and nanostructures.
Collapse
Affiliation(s)
- Alexander I Taylor
- Medical Research Council Laboratory of Molecular Biology, Cambridge CB2 0QH, United Kingdom
| | - Gillian Houlihan
- Medical Research Council Laboratory of Molecular Biology, Cambridge CB2 0QH, United Kingdom
| | - Philipp Holliger
- Medical Research Council Laboratory of Molecular Biology, Cambridge CB2 0QH, United Kingdom
| |
Collapse
|
9
|
Matsuura MF, Kim HJ, Takahashi D, Abboud KA, Benner SA. Crystal structures of deprotonated nucleobases from an expanded DNA alphabet. ACTA CRYSTALLOGRAPHICA SECTION C-STRUCTURAL CHEMISTRY 2016; 72:952-959. [PMID: 27918296 DOI: 10.1107/s2053229616017071] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Received: 07/08/2016] [Accepted: 10/24/2016] [Indexed: 11/10/2022]
Abstract
Reported here is the crystal structure of a heterocycle that implements a donor-donor-acceptor hydrogen-bonding pattern, as found in the Z component [6-amino-5-nitropyridin-2(1H)-one] of an artificially expanded genetic information system (AEGIS). AEGIS is a new form of DNA from synthetic biology that has six replicable nucleotides, rather than the four found in natural DNA. Remarkably, Z crystallizes from water as a 1:1 complex of its neutral and deprotonated forms, and forms a `skinny' pyrimidine-pyrimidine pair in this structure. The pair resembles the known intercalated cytosine pair. The formation of the same pair in two different salts, namely poly[[aqua(μ6-2-amino-6-oxo-3-nitro-1,6-dihydropyridin-1-ido)sodium]-6-amino-5-nitropyridin-2(1H)-one-water (1/1/1)], denoted Z-Sod, {[Na(C5H4N3O3)(H2O)]·C5H5N3O3·H2O}n, and ammonium 2-amino-6-oxo-3-nitro-1,6-dihydropyridin-1-ide-6-amino-5-nitropyridin-2(1H)-one-water (1/1/1), denoted Z-Am, NH4+·C5H4N3O3-·C5H5N3O3·H2O, under two different crystallization conditions suggests that the pair is especially stable. Implications of this structure for the use of this heterocycle in artificial DNA are discussed.
Collapse
Affiliation(s)
- Mariko F Matsuura
- Department of Chemistry, University of Florida, Gainesville, FL 32611, USA
| | - Hyo Joong Kim
- Foundation for Applied Molecular Evolution (FfAME), 13709 Progress Blvd, Box 17, Alachua, FL 32615, USA
| | - Daisuke Takahashi
- Department of Chemistry, University of Florida, Gainesville, FL 32611, USA
| | - Khalil A Abboud
- Department of Chemistry, University of Florida, Gainesville, FL 32611, USA
| | - Steven A Benner
- Foundation for Applied Molecular Evolution (FfAME), 13709 Progress Blvd, Box 17, Alachua, FL 32615, USA
| |
Collapse
|
10
|
Ashwood B, Pollum M, Crespo-Hernández CE. Can a Six-Letter Alphabet Increase the Likelihood of Photochemical Assault to the Genetic Code? Chemistry 2016; 22:16648-16656. [PMID: 27723147 DOI: 10.1002/chem.201602160] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2016] [Indexed: 12/21/2022]
Abstract
In 2014, two unnatural nucleosides, d5SICS and dNaM, were shown to selectively base pair and replicate with high fidelity in a modified strain of E. coli, thus effectively expanding its genetic alphabet from four to six letters. More recently, a significant reduction in cell proliferation was reported in cells cultured with d5SICS, and putatively with dNaM, upon exposure to brief periods of near-visible radiation. The photosensitizing properties of the lowest-energy excited triplet state of both d5SICS and dNaM were implicated in their cytotoxicity. Importantly, however, the excited-state mechanisms by which near-visible excitation populates the triplet states of d5SICS and dNaM are currently unknown. In this study, steady-state and time-resolved spectroscopies are combined with quantum-chemical calculations in order to reveal the excited-state relaxation mechanisms leading to efficient population of the triplet states in these unnatural nucleosides in solution. It is shown that excitation of d5SICS or dNaM with near-visible light leads overwhelmingly to ultrafast population of their triplet states on the femtosecond time scale. The results presented in this work lend strong support to the proposal that photoexcitation of these unnatural nucleosides can accelerate oxidatively generated damage to DNA and other biomolecules within the cellular environment.
Collapse
Affiliation(s)
- Brennan Ashwood
- Department of Chemistry and Center for Chemical Dynamics, Case Western Reserve University, Cleveland, Ohio, 44106, USA
| | - Marvin Pollum
- Department of Chemistry and Center for Chemical Dynamics, Case Western Reserve University, Cleveland, Ohio, 44106, USA
| | - Carlos E Crespo-Hernández
- Department of Chemistry and Center for Chemical Dynamics, Case Western Reserve University, Cleveland, Ohio, 44106, USA.
| |
Collapse
|
11
|
Karalkar NB, Leal NA, Kim MS, Bradley KM, Benner SA. Synthesis and Enzymology of 2'-Deoxy-7-deazaisoguanosine Triphosphate and Its Complement: A Second Generation Pair in an Artificially Expanded Genetic Information System. ACS Synth Biol 2016; 5:672-8. [PMID: 26914388 DOI: 10.1021/acssynbio.5b00276] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
As with natural nucleic acids, pairing between artificial nucleotides can be influenced by tautomerism, with different placements of protons on the heterocyclic nucleobase changing patterns of hydrogen bonding that determine replication fidelity. For example, the major tautomer of isoguanine presents a hydrogen bonding donor-donor-acceptor pattern complementary to the acceptor-acceptor-donor pattern of 5-methylisocytosine. However, in its minor tautomer, isoguanine presents a hydrogen bond donor-acceptor-donor pattern complementary to thymine. Calculations, crystallography, and physical organic experiments suggest that this tautomeric ambiguity might be "fixed" by replacing the N-7 nitrogen of isoguanine by a CH unit. To test this hypothesis, we prepared the triphosphate of 2'-deoxy-7-deazaiso-guanosine and used it in PCR to estimate an effective tautomeric ratio "seen" by Taq DNA polymerase. With 7-deazaisoguanine, fidelity-per-round was ∼92%. The analogous PCR with isoguanine gave a lower fidelity-per-round of ∼86%. These results confirm the hypothesis with polymerases, and deepen our understanding of the role of minor groove hydrogen bonding and proton tautomerism in both natural and expanded genetic "alphabets", major targets in synthetic biology.
Collapse
Affiliation(s)
- Nilesh B. Karalkar
- Foundation for Applied Molecular Evolution (FfAME), 13709 Progress Boulevard, Box
7, Alachua, Florida 32615, United States
| | - Nicole A. Leal
- Foundation for Applied Molecular Evolution (FfAME), 13709 Progress Boulevard, Box
7, Alachua, Florida 32615, United States
- Firebird Biomolecular Sciences LLC, 13709 Progress Boulevard, Box 17, Alachua, Florida 32615, United States
| | - Myong-Sang Kim
- Firebird Biomolecular Sciences LLC, 13709 Progress Boulevard, Box 17, Alachua, Florida 32615, United States
| | - Kevin M. Bradley
- Foundation for Applied Molecular Evolution (FfAME), 13709 Progress Boulevard, Box
7, Alachua, Florida 32615, United States
| | - Steven A. Benner
- Foundation for Applied Molecular Evolution (FfAME), 13709 Progress Boulevard, Box
7, Alachua, Florida 32615, United States
- Firebird Biomolecular Sciences LLC, 13709 Progress Boulevard, Box 17, Alachua, Florida 32615, United States
| |
Collapse
|
12
|
|
13
|
Francis BR. The Hypothesis that the Genetic Code Originated in Coupled Synthesis of Proteins and the Evolutionary Predecessors of Nucleic Acids in Primitive Cells. Life (Basel) 2015; 5:467-505. [PMID: 25679748 PMCID: PMC4390864 DOI: 10.3390/life5010467] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2014] [Accepted: 02/02/2015] [Indexed: 12/22/2022] Open
Abstract
Although analysis of the genetic code has allowed explanations for its evolution to be proposed, little evidence exists in biochemistry and molecular biology to offer an explanation for the origin of the genetic code. In particular, two features of biology make the origin of the genetic code difficult to understand. First, nucleic acids are highly complicated polymers requiring numerous enzymes for biosynthesis. Secondly, proteins have a simple backbone with a set of 20 different amino acid side chains synthesized by a highly complicated ribosomal process in which mRNA sequences are read in triplets. Apparently, both nucleic acid and protein syntheses have extensive evolutionary histories. Supporting these processes is a complex metabolism and at the hub of metabolism are the carboxylic acid cycles. This paper advances the hypothesis that the earliest predecessor of the nucleic acids was a β-linked polyester made from malic acid, a highly conserved metabolite in the carboxylic acid cycles. In the β-linked polyester, the side chains are carboxylic acid groups capable of forming interstrand double hydrogen bonds. Evolution of the nucleic acids involved changes to the backbone and side chain of poly(β-d-malic acid). Conversion of the side chain carboxylic acid into a carboxamide or a longer side chain bearing a carboxamide group, allowed information polymers to form amide pairs between polyester chains. Aminoacylation of the hydroxyl groups of malic acid and its derivatives with simple amino acids such as glycine and alanine allowed coupling of polyester synthesis and protein synthesis. Use of polypeptides containing glycine and l-alanine for activation of two different monomers with either glycine or l-alanine allowed simple coded autocatalytic synthesis of polyesters and polypeptides and established the first genetic code. A primitive cell capable of supporting electron transport, thioester synthesis, reduction reactions, and synthesis of polyesters and polypeptides is proposed. The cell consists of an iron-sulfide particle enclosed by tholin, a heterogeneous organic material that is produced by Miller-Urey type experiments that simulate conditions on the early Earth. As the synthesis of nucleic acids evolved from β-linked polyesters, the singlet coding system for replication evolved into a four nucleotide/four amino acid process (AMP = aspartic acid, GMP = glycine, UMP = valine, CMP = alanine) and then into the triplet ribosomal process that permitted multiple copies of protein to be synthesized independent of replication. This hypothesis reconciles the “genetics first” and “metabolism first” approaches to the origin of life and explains why there are four bases in the genetic alphabet.
Collapse
Affiliation(s)
- Brian R Francis
- Department of Molecular Biology, University of Wyoming, Laramie, WY 82071, USA.
| |
Collapse
|
14
|
Abstract
Synthetic biology seeks to probe fundamental aspects of biological form and function by construction (resynthesis) rather than deconstruction (analysis). Here we discuss how such an approach could be applied to assemble synthetic quasibiological systems able to replicate and evolve, illuminating universal properties of life and the search for its origins.
Collapse
Affiliation(s)
- James Attwater
- Medical Research Council Laboratory of Molecular Biology, Cambridge, UK
| | - Philipp Holliger
- Medical Research Council Laboratory of Molecular Biology, Cambridge, UK
| |
Collapse
|
15
|
Rios AC, Yu HT, Tor Y. Hydrolytic Fitness of N-glycosyl Bonds: Comparing the Deglycosylation Kinetics of Modified, Alternative and Native Nucleosides. J PHYS ORG CHEM 2014; 28:173-180. [PMID: 25750482 DOI: 10.1002/poc.3318] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Nature's selection of the contemporary nucleobases in RNA and DNA continues to intrigue the origin of life community. While the prebiotic synthesis of the N-glycosyl bond has historically been a central area of investigation, variations in hydrolytic stabilities among the N-glycosyl bonds may have presented an additional selection pressure that contributed to nucleobase and nucleoside evolution. To experimentally probe this hypothesis, a systematic kinetic analysis of the hydrolytic deglycosylation reactions of modified, alternative and native nucleosides was undertaken. Rate constants were measured as a function of temperature (at pH 1) to produce Arrhenius and Eyring plots for extrapolation to 37°C and determination of thermodynamic activation parameters. Rate enhancements based on the differences in reaction rates of deoxyribo- and ribo-glycosidic bonds were found to vary under the same conditions. Rate constants of deoxynucleosides were also measured across the pH range of 1 - 3 (at 50°C), which highlighted how simple changes to the heterocycle alone can lead to significant variation in deglycosylation rates. The contemporary nucleosides exhibited the slowest deglycosylation rates in comparison to the non-native/alternative nucleosides, which we suggest as experimental support for nature's selection of the fittest N-glycosyl bonds.
Collapse
Affiliation(s)
- Andro C Rios
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, CA 92093-0358
| | - Hiu T Yu
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, CA 92093-0358
| | - Yitzhak Tor
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, CA 92093-0358
| |
Collapse
|
16
|
Solís D, Bovin NV, Davis AP, Jiménez-Barbero J, Romero A, Roy R, Smetana K, Gabius HJ. A guide into glycosciences: How chemistry, biochemistry and biology cooperate to crack the sugar code. Biochim Biophys Acta Gen Subj 2014; 1850:186-235. [PMID: 24685397 DOI: 10.1016/j.bbagen.2014.03.016] [Citation(s) in RCA: 172] [Impact Index Per Article: 17.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2013] [Revised: 03/13/2014] [Accepted: 03/18/2014] [Indexed: 01/17/2023]
Abstract
BACKGROUND The most demanding challenge in research on molecular aspects within the flow of biological information is posed by the complex carbohydrates (glycan part of cellular glycoconjugates). How the 'message' encoded in carbohydrate 'letters' is 'read' and 'translated' can only be unraveled by interdisciplinary efforts. SCOPE OF REVIEW This review provides a didactic step-by-step survey of the concept of the sugar code and the way strategic combination of experimental approaches characterizes structure-function relationships, with resources for teaching. MAJOR CONCLUSIONS The unsurpassed coding capacity of glycans is an ideal platform for generating a broad range of molecular 'messages'. Structural and functional analyses of complex carbohydrates have been made possible by advances in chemical synthesis, rendering production of oligosaccharides, glycoclusters and neoglycoconjugates possible. This availability facilitates to test the glycans as ligands for natural sugar receptors (lectins). Their interaction is a means to turn sugar-encoded information into cellular effects. Glycan/lectin structures and their spatial modes of presentation underlie the exquisite specificity of the endogenous lectins in counterreceptor selection, that is, to home in on certain cellular glycoproteins or glycolipids. GENERAL SIGNIFICANCE Understanding how sugar-encoded 'messages' are 'read' and 'translated' by lectins provides insights into fundamental mechanisms of life, with potential for medical applications.
Collapse
Affiliation(s)
- Dolores Solís
- Instituto de Química Física "Rocasolano", CSIC, Serrano 119, 28006 Madrid, Spain; Centro de Investigación Biomédica en Red de Enfermedades Respiratorias (CIBERES), 07110 Bunyola, Mallorca, Illes Baleares, Spain.
| | - Nicolai V Bovin
- Shemyakin & Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, ul Miklukho-Maklaya 16/10, 117871 GSP-7, V-437, Moscow, Russian Federation.
| | - Anthony P Davis
- School of Chemistry, University of Bristol, Cantock's Close, Bristol BS8 1TS, UK.
| | - Jesús Jiménez-Barbero
- Chemical and Physical Biology, Centro de Investigaciones Biológicas, CSIC, Ramiro de Maeztu, 9, 28040 Madrid, Spain.
| | - Antonio Romero
- Chemical and Physical Biology, Centro de Investigaciones Biológicas, CSIC, Ramiro de Maeztu, 9, 28040 Madrid, Spain.
| | - René Roy
- Department of Chemistry, Université du Québec à Montréal, P.O. Box 8888, Succ. Centre-Ville, Montréal, Québec H3C 3P8, Canada.
| | - Karel Smetana
- Charles University, 1st Faculty of Medicine, Institute of Anatomy, U nemocnice 3, 128 00 Prague 2, Czech Republic.
| | - Hans-Joachim Gabius
- Institute of Physiological Chemistry, Faculty of Veterinary Medicine, Ludwig-Maximilians-University Munich, Veterinärstr. 13, 80539 München, Germany.
| |
Collapse
|
17
|
Cornish-Bowden A, Peretó J, Cárdenas ML. Biochemistry and evolutionary biology: Two disciplines that need each other. J Biosci 2014; 39:13-27. [DOI: 10.1007/s12038-014-9414-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
18
|
Dunn IS. Are molecular alphabets universal enabling factors for the evolution of complex life? ORIGINS LIFE EVOL B 2013; 43:445-64. [PMID: 24510462 DOI: 10.1007/s11084-014-9354-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2013] [Accepted: 01/22/2014] [Indexed: 10/25/2022]
Abstract
Terrestrial biosystems depend on macromolecules, and this feature is often considered as a likely universal aspect of life. While opinions differ regarding the importance of small-molecule systems in abiogenesis, escalating biological functional demands are linked with increasing complexity in key molecules participating in biosystem operations, and many such requirements cannot be efficiently mediated by relatively small compounds. It has long been recognized that known life is associated with the evolution of two distinct molecular alphabets (nucleic acid and protein), specific sequence combinations of which serve as informational and functional polymers. In contrast, much less detailed focus has been directed towards the potential universal need for molecular alphabets in constituting complex chemically-based life, and the implications of such a requirement. To analyze this, emphasis here is placed on the generalizable replicative and functional characteristics of molecular alphabets and their concatenates. A primary replicative alphabet based on the simplest possible molecular complementarity can potentially enable evolutionary processes to occur, including the encoding of secondarily functional alphabets. Very large uniquely specified ('non-alphabetic') molecules cannot feasibly underlie systems capable of the replicative and evolutionary properties which characterize complex biosystems. Transitions in the molecular evolution of alphabets can be related to progressive bridging of barriers which enable higher levels of biosystem organization. It is thus highly probable that molecular alphabets are an obligatory requirement for complex chemically-based life anywhere in the universe. In turn, reference to molecular alphabets should be usefully applied in current definitions of life.
Collapse
Affiliation(s)
- Ian S Dunn
- CytoCure LLC, Suite 430C, 100 Cummings Center, Beverly, MA, 01915, USA,
| |
Collapse
|
19
|
Szilágyi A, Zachar I, Szathmáry E. Gause's principle and the effect of resource partitioning on the dynamical coexistence of replicating templates. PLoS Comput Biol 2013; 9:e1003193. [PMID: 23990769 PMCID: PMC3749944 DOI: 10.1371/journal.pcbi.1003193] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2013] [Accepted: 07/05/2013] [Indexed: 01/05/2023] Open
Abstract
Models of competitive template replication, although basic for replicator dynamics and primordial evolution, have not yet taken different sequences explicitly into account, neither have they analyzed the effect of resource partitioning (feeding on different resources) on coexistence. Here we show by analytical and numerical calculations that Gause's principle of competitive exclusion holds for template replicators if resources (nucleotides) affect growth linearly and coexistence is at fixed point attractors. Cases of complementary or homologous pairing between building blocks with parallel or antiparallel strands show no deviation from the rule that the nucleotide compositions of stably coexisting species must be different and there cannot be more coexisting replicator species than nucleotide types. Besides this overlooked mechanism of template coexistence we show also that interesting sequence effects prevail as parts of sequences that are copied earlier affect coexistence more strongly due to the higher concentration of the corresponding replication intermediates. Template and copy always count as one species due their constraint of strict stoichiometric coupling. Stability of fixed-point coexistence tends to decrease with the length of sequences, although this effect is unlikely to be detrimental for sequences below 100 nucleotides. In sum, resource partitioning (niche differentiation) is the default form of competitive coexistence for replicating templates feeding on a cocktail of different nucleotides, as it may have been the case in the RNA world. Our analysis of different pairing and strand orientation schemes is relevant for artificial and potentially astrobiological genetics.
Collapse
Affiliation(s)
- András Szilágyi
- Department of Plant Systematics, Ecology and Theoretical Biology, Institute of Biology, Eötvös University, Budapest, Hungary
- Department of Plant Systematics, Ecology and Theoretical Biology, Research Group of Ecology and Theoretical Biology, Eötvös University and The Hungarian Academy of Sciences, Budapest, Hungary
- Parmenides Center for the Conceptual Foundations of Science, Munich/Pullach, Germany
| | - István Zachar
- Parmenides Center for the Conceptual Foundations of Science, Munich/Pullach, Germany
| | - Eörs Szathmáry
- Department of Plant Systematics, Ecology and Theoretical Biology, Institute of Biology, Eötvös University, Budapest, Hungary
- Department of Plant Systematics, Ecology and Theoretical Biology, Research Group of Ecology and Theoretical Biology, Eötvös University and The Hungarian Academy of Sciences, Budapest, Hungary
- Parmenides Center for the Conceptual Foundations of Science, Munich/Pullach, Germany
- * E-mail:
| |
Collapse
|
20
|
Nechaev SK, Sobolevski AN, Valba OV. Planar diagrams from optimization for concave potentials. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2013; 87:012102. [PMID: 23410278 DOI: 10.1103/physreve.87.012102] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/29/2012] [Revised: 09/17/2012] [Indexed: 06/01/2023]
Abstract
We propose a toy model of a heteropolymer chain capable of forming planar secondary structures typical for RNA molecules. In this model, the sequential intervals between neighboring monomers along a chain are considered as quenched random variables, and energies of nonlocal bonds are assumed to be concave functions of those intervals. A few factors are neglected: the contribution of loop factors to the partition function, the variation in energies of different types of complementary nucleotides, the stacking interactions, and constraints on the minimal size of loops. However, the model captures well the formation of folded structures without pseudoknots in an arbitrary sequence of nucleotides. Using the optimization procedure for a special class of concave-type potentials, borrowed from optimal transport analysis, we derive the local difference equation for the ground state free energy of the chain with the planar (RNA-like) architecture of paired links. We consider various distribution functions of intervals between neighboring monomers (truncated Gaussian and scale free) and demonstrate the existence of a topological crossover from sequential to essentially nested configurations of paired links.
Collapse
Affiliation(s)
- S K Nechaev
- LPTMS, Université Paris Sud, 91405 Orsay Cedex, France
| | | | | |
Collapse
|
21
|
Pinheiro VB, Loakes D, Holliger P. Synthetic polymers and their potential as genetic materials. Bioessays 2012; 35:113-22. [PMID: 23281109 DOI: 10.1002/bies.201200135] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
DNA and RNA are the only known natural genetic materials. Systematic modification of each of their chemical building blocks (nucleobase, sugar, and phosphate) has enabled the study of the key properties that make those nucleic acids genetic materials. All three moieties contribute to replication and, significantly, all three moieties can be replaced by synthetic analogs without loss of function. Synthetic nucleic acid polymers capable of storing and propagating information not only expand the central dogma, but also highlight that DNA and RNA are not unique chemical solutions for genetic information storage. By considering replication as a question of information transfer, we propose that any polymer that can be replicated could serve as a genetic material.
Collapse
Affiliation(s)
- Vitor B Pinheiro
- Medical Research Council, Laboratory of Molecular Biology, Cambridge, UK.
| | | | | |
Collapse
|
22
|
Hirao I, Kimoto M, Yamashige R. Natural versus artificial creation of base pairs in DNA: origin of nucleobases from the perspectives of unnatural base pair studies. Acc Chem Res 2012; 45:2055-65. [PMID: 22263525 DOI: 10.1021/ar200257x] [Citation(s) in RCA: 109] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Since life began on Earth, the four types of bases (A, G, C, and T(U)) that form two sets of base pairs have remained unchanged as the components of nucleic acids that replicate and transfer genetic information. Throughout evolution, except for the U to T modification, the four base structures have not changed. This constancy within the genetic code raises the question of how these complicated nucleotides were generated from the molecules in a primordial soup on the early Earth. At some prebiotic stage, the complementarity of base pairs might have accelerated the generation and accumulation of nucleotides or oligonucleotides. We have no clues whether one pair of nucleobases initially appeared on the early Earth during this process or a set of two base pairs appeared simultaneously. Recently, researchers have developed new artificial pairs of nucleobases (unnatural base pairs) that function alongside the natural base pairs. Some unnatural base pairs in duplex DNA can be efficiently and faithfully amplified in a polymerase chain reaction (PCR) using thermostable DNA polymerases. The addition of unnatural base pair systems could expand the genetic alphabet of DNA, thus providing a new mechanism for the generation novel biopolymers by the site-specific incorporation of functional components into nucleic acids and proteins. Furthermore, the process of unnatural base pair development might provide clues to the origin of the natural base pairs in a primordial soup on the early Earth. In this Account, we describe the development of three representative types of unnatural base pairs that function as a third pair of nucleobases in PCR and reconsider the origin of the natural nucleic acids. As researchers developing unnatural base pairs, they use repeated "proof of concept" experiments. As researchers design new base pairs, they improve the structures that function in PCR and eliminate those that do not. We expect that this process is similar to the one functioning in the chemical evolution and selection of the natural nucleobases. Interestingly, the initial structures designed by each research group were quite similar to those of the latest successful unnatural base pairs. In this regard, it is tempting to form a hypothesis that the base pairs on the primordial Earth, in which the natural purine bases, A and G, and pyrimidine bases, C and T(U), originated from structurally similar compounds, such as hypoxanthine for a purine base predecessor. Subsequently, the initial base pair evolved to the present two sets of base pairs via a keto-enol tautomerization of the initial compounds.
Collapse
Affiliation(s)
- Ichiro Hirao
- RIKEN Systems and Structural Biology Center (SSBC), 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
- TagCyx Biotechnologies, 1-6-126 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
| | - Michiko Kimoto
- RIKEN Systems and Structural Biology Center (SSBC), 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
- TagCyx Biotechnologies, 1-6-126 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
| | - Rie Yamashige
- RIKEN Systems and Structural Biology Center (SSBC), 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
| |
Collapse
|
23
|
Deplazes-Zemp A. The conception of life in synthetic biology. SCIENCE AND ENGINEERING ETHICS 2012; 18:757-774. [PMID: 21484320 DOI: 10.1007/s11948-011-9269-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/23/2010] [Accepted: 03/28/2011] [Indexed: 05/30/2023]
Abstract
The phrase 'synthetic biology' is used to describe a set of different scientific and technological disciplines, which share the objective to design and produce new life forms. This essay addresses the following questions: What conception of life stands behind this ambitious objective? In what relation does this conception of life stand to that of traditional biology and biotechnology? And, could such a conception of life raise ethical concerns? Three different observations that provide useful indications for the conception of life in synthetic biology will be discussed in detail: 1. Synthetic biologists focus on different features of living organisms in order to design new life forms, 2. Synthetic biologists want to contribute to the understanding of life, and 3. Synthetic biologists want to modify life through a rational design, which implies the notions of utilising, minimising/optimising, varying and overcoming life. These observations indicate a tight connection between science and technology, a focus on selected aspects of life, a production-oriented approach to life, and a design-oriented understanding of life. It will be argued that through this conception of life synthetic biologists present life in a different light. This conception of life will be illustrated by the metaphor of a toolbox. According to the notion of life as a toolbox, the different features of living organisms are perceived as various rationally designed instruments that can be used for the production of the living organism itself or secondary products made by the organism. According to certain ethical positions this conception of life might raise ethical concerns related to the status of the organism, the motives of the scientists and the role of technology in our society.
Collapse
Affiliation(s)
- Anna Deplazes-Zemp
- University of Zurich, IBME, Pestalozzistr. 24, 8032, Zurich, Switzerland.
| |
Collapse
|
24
|
Rios AC, Tor Y. Refining the genetic alphabet: a late-period selection pressure? ASTROBIOLOGY 2012; 12:884-891. [PMID: 22984873 PMCID: PMC3444765 DOI: 10.1089/ast.2011.0789] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/23/2012] [Accepted: 08/04/2012] [Indexed: 06/01/2023]
Abstract
The transition from genomic ribonucleic acid (RNA) to deoxyribonucleic acid (DNA) in primitive cells may have created a selection pressure that refined the genetic alphabet, resulting from the global weakening of the N-glycosyl bonds. Hydrolytic rupture of these bonds, termed deglycosylation, leaves an abasic site that is the single greatest threat to the stability and integrity of genomic DNA. The rates of deglycosylation are highly dependent on the identity of the nucleobases. Modifications made to the bases, such as deamination, oxidation, and alkylation, can further increase deglycosylation reaction rates, suggesting that the native bases provide optimum N-glycosyl bond stability. To protect their genomes, cells have evolved highly specific enzymes called glycosylases, associated with DNA repair, that detect and remove these damaged bases. In RNA, however, the occurrence of many of these modified bases is deliberate. The dichotomous behavior that cells exhibit toward base modifications may have originated in the RNA world. Modified bases would have been advantageous for the functional and structural repertoire of catalytic RNAs. Yet in an early DNA world, the utility of these heterocycles was greatly diminished, and their presence posed a distinct liability to the stability of cells' genomes. A natural selection for bases exhibiting the greatest resistance to deglycosylation would have ensured the viability of early DNA life, along with the recruitment of DNA repair.
Collapse
Affiliation(s)
- Andro C Rios
- Department of Chemistry and Biochemistry, University of California, San Diego, San Diego, California 92093, USA.
| | | |
Collapse
|
25
|
Valba OV, Tamm MV, Nechaev SK. New alphabet-dependent morphological transition in random RNA alignment. PHYSICAL REVIEW LETTERS 2012; 109:018102. [PMID: 23031133 DOI: 10.1103/physrevlett.109.018102] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/24/2011] [Indexed: 06/01/2023]
Abstract
We study the fraction f of nucleotides involved in the formation of a cactuslike secondary structure of random heteropolymer RNA-like molecules. In the low-temperature limit, we study this fraction as a function of the number c of different nucleotide species. We show, that with changing c, the secondary structures of random RNAs undergo a morphological transition: f(c)→1 for c≤c(cr) as the chain length n goes to infinity, signaling the formation of a virtually perfect gapless secondary structure; while f(c)<1 for c>c(cr), which means that a nonperfect structure with gaps is formed. The strict upper and lower bounds 2≤c(cr)≤4 are proven, and the numerical evidence for c(cr) is presented. The relevance of the transition from the evolutional point of view is discussed.
Collapse
Affiliation(s)
- O V Valba
- Moscow Institute of Physics and Technology, 141700, Dolgoprudny, Russia
| | | | | |
Collapse
|
26
|
Merkel L, Hoesl MG, Albrecht M, Schmidt A, Budisa N. Blue Fluorescent Amino Acids as In Vivo Building Blocks for Proteins. Chembiochem 2010; 11:305-14. [DOI: 10.1002/cbic.200900651] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
|
27
|
Koonin EV, Novozhilov AS. Origin and evolution of the genetic code: the universal enigma. IUBMB Life 2009; 61:99-111. [PMID: 19117371 DOI: 10.1002/iub.146] [Citation(s) in RCA: 199] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
The genetic code is nearly universal, and the arrangement of the codons in the standard codon table is highly nonrandom. The three main concepts on the origin and evolution of the code are the stereochemical theory, according to which codon assignments are dictated by physicochemical affinity between amino acids and the cognate codons (anticodons); the coevolution theory, which posits that the code structure coevolved with amino acid biosynthesis pathways; and the error minimization theory under which selection to minimize the adverse effect of point mutations and translation errors was the principal factor of the code's evolution. These theories are not mutually exclusive and are also compatible with the frozen accident hypothesis, that is, the notion that the standard code might have no special properties but was fixed simply because all extant life forms share a common ancestor, with subsequent changes to the code, mostly, precluded by the deleterious effect of codon reassignment. Mathematical analysis of the structure and possible evolutionary trajectories of the code shows that it is highly robust to translational misreading but there are numerous more robust codes, so the standard code potentially could evolve from a random code via a short sequence of codon series reassignments. Thus, much of the evolution that led to the standard code could be a combination of frozen accident with selection for error minimization although contributions from coevolution of the code with metabolic pathways and weak affinities between amino acids and nucleotide triplets cannot be ruled out. However, such scenarios for the code evolution are based on formal schemes whose relevance to the actual primordial evolution is uncertain. A real understanding of the code origin and evolution is likely to be attainable only in conjunction with a credible scenario for the evolution of the coding principle itself and the translation system.
Collapse
Affiliation(s)
- Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA.
| | | |
Collapse
|
28
|
Abstract
Single-stranded DNA molecules have the capacity to adopt catalytically active structures known as DNAzymes, although the fundamental limits of this ability have not been determined. Starting with a parent DNAzyme composed of all four types of standard nucleotides, we conducted a search of the surrounding sequence space to identify functional derivatives with catalytic cores composed of only three, and subsequently only two types of nucleotides. We provide the first report of a DNAzyme that contains only guanosine and cytidine deoxyribonucleotides in its catalytic domain, which consists of just 13 nucleotides. This DNAzyme catalyzes the Mn(2+)-dependent cleavage of an RNA phosphodiester bond approximately 5300-fold faster than the corresponding uncatalyzed reaction, but approximately 10,000-fold slower than the parent. The demonstration of a catalytic DNA molecule made from a binary nucleotide alphabet broadens our understanding of the fundamental limits of nucleic-acid-mediated catalysis.
Collapse
Affiliation(s)
- Kenny Schlosser
- Department of Biochemistry and Biomedical Sciences and Department of Chemistry, McMaster University, Hamilton, Ontario, Canada L8N 3Z5
| | - Yingfu Li
- Department of Biochemistry and Biomedical Sciences and Department of Chemistry, McMaster University, Hamilton, Ontario, Canada L8N 3Z5
- *To whom correspondence should be addressed. Tel: +1 905 525 9140. Ext. 22462; Fax: +1 905 522 9033;
| |
Collapse
|
29
|
Zhang HY. Exploring the evolution of standard amino-acid alphabet: When genomics meets thermodynamics. Biochem Biophys Res Commun 2007; 359:403-5. [PMID: 17543275 DOI: 10.1016/j.bbrc.2007.05.115] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2007] [Accepted: 05/18/2007] [Indexed: 11/27/2022]
Abstract
One of the most intriguing aspects of life is that despite the diversified apparent shapes, similar building blocks and infrastructures, such as standard amino acids and canonical genetic codes, are shared by most life on Earth. Thus, it is challenging to explore: why nature just selects these building blocks and strategies from numerous candidates to construct life? Was this deterministic or fortuitous? Thanks to the rapid progress in genomics, bioinformatics and synthetic biology, more and more basic principles underlying life design and construction were disclosed in the past decade. However, since the origin of early life is substantially a chemical process, to understand the enigma of life origin, chemists' efforts can not be neglected. In this paper, we focus on the evolution of standard amino-acid alphabet and indicate that chemistry, especially thermodynamics, is indeed critical to understanding the forming mechanisms of amino-acid alphabet. It is revealed that nature prefers low free energy and thus ubiquitous (cheap) small amino acids when beginning to build life, which is compatible with many recent findings from genomics and bioinformatics.
Collapse
Affiliation(s)
- Hong-Yu Zhang
- Shandong Provincial Research Center for Bioinformatic Engineering and Technique, Center for Advanced Study, Shandong University of Technology, Zibo 255049, PR China.
| |
Collapse
|
30
|
Lepthien S, Wiltschi B, Bolic B, Budisa N. In vivo engineering of proteins with nitrogen-containing tryptophan analogs. Appl Microbiol Biotechnol 2007; 73:740-54. [PMID: 17269153 DOI: 10.1007/s00253-006-0665-2] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Recently, it has become possible to reprogram the protein synthesis machinery such that numerous noncanonical amino acids can be translated into target sequences yielding tailor-made proteins. The canonical amino acid tryptophan (Trp) encoded by a single nucleotide triplet (UGG) is a particularly interesting target for protein engineering and design. Trp-residues can be substituted with a variety of analogs and surrogates generated biosynthetically or by organic chemistry. Among them, nitrogen-containing tryptophan analogs occupy a central position, as they have distinct chemical properties in comparison with aliphatic amines and imines. They resemble purine bases of DNA and share their capacity for pH-sensitive intramolecular charge transfer. These special properties of the analogs can be directly transmitted into related protein structures via in vivo ribosome-mediated translation. Proteins expressed in this way are further endowed with unique properties like new spectral, altered redox and titration features or might serve as useful biomaterials. We present and discuss current works and future developments in protein engineering with nitrogen-containing tryptophan analogs and related compounds as well as their relevance for academic and applicative research.
Collapse
Affiliation(s)
- Sandra Lepthien
- Molecular Biotechnology, Max-Planck-Institut für Biochemie, BioFuture Independent Research Group, Am Klopferspitz 18, 82152 Martinsreid, Germany
| | | | | | | |
Collapse
|
31
|
Wiltschi B, Budisa N. Natural history and experimental evolution of the genetic code. Appl Microbiol Biotechnol 2007; 74:739-53. [PMID: 17268784 DOI: 10.1007/s00253-006-0823-6] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2006] [Revised: 12/19/2006] [Accepted: 12/20/2006] [Indexed: 10/23/2022]
Abstract
The standard genetic code is a set of rules that relates the 20 canonical amino acids in proteins to groups of three bases in the mRNA. It evolved from a more primitive form and the attempts to reconstruct its natural history are based on its present-day features. Genetic code engineering as a new research field was developed independently in a few laboratories during the last 15 years. The main intention is to re-program protein synthesis by expanding the coding capacities of the genetic code via re-assignment of specific codons to un-natural amino acids. This article focuses on the question as to which extent hypothetical scenarios that led to codon re-assignments during the evolution of the genetic code are relevant for its further evolution in the laboratory. Current attempts to engineer the genetic code are reviewed with reference to theoretical works on its natural history. Integration of the theoretical considerations into experimental concepts will bring us closer to designer cells with target-engineered genetic codes that should open not only tremendous possibilities for the biotechnology of the twenty-first century but will also provide a basis for the design of novel life forms.
Collapse
Affiliation(s)
- Birgit Wiltschi
- Max-Planck-Institut für Biochemie, Am Klopferspitz 18, Martinsried, Germany
| | | |
Collapse
|
32
|
Abstract
Although one standard amino-acid 'alphabet' is used by most organisms on Earth, the evolutionary cause(s) and significance of this alphabet remain elusive. Fresh insights into the origin of the alphabet are now emerging from disciplines as diverse as astrobiology, biochemical engineering and bioinformatics.
Collapse
Affiliation(s)
- Yi Lu
- Department of Biological Sciences, University of Maryland, Baltimore County, Baltimore, MD 21250, USA
| | - Stephen Freeland
- Department of Biological Sciences, University of Maryland, Baltimore County, Baltimore, MD 21250, USA
| |
Collapse
|
33
|
Abstract
Since the early days of the discovery of the genetic code nonrandom patterns have been searched for in the code in the hope of providing information about its origin and early evolution. Here we present a new classification scheme of the genetic code that is based on a binary representation of the purines and pyrimidines. This scheme reveals known patterns more clearly than the common one, for instance, the classification of strong, mixed, and weak codons as well as the ordering of codon families. Furthermore, new patterns have been found that have not been described before: Nearly all quantitative amino acid properties, such as Woese's polarity and the specific volume, show a perfect correlation to Lagerkvist's codon-anticodon binding strength. Our new scheme leads to new ideas about the evolution of the genetic code. It is hypothesized that it started with a binary doublet code and developed via a quaternary doublet code into the contemporary triplet code. Furthermore, arguments are presented against suggestions that a "simpler" code, where only the midbase was informational, was at the origin of the genetic code.
Collapse
Affiliation(s)
- Thomas Wilhelm
- Institute of Molecular Biotechnology, Beutenbergstr. 11, 07745 Jena, Germany.
| | | |
Collapse
|
34
|
Andras P, Andras C. The origins of life – the ‘protein interaction world’ hypothesis: protein interactions were the first form of self-reproducing life and nucleic acids evolved later as memory molecules. Med Hypotheses 2005; 64:678-88. [PMID: 15694682 DOI: 10.1016/j.mehy.2004.11.029] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2004] [Accepted: 11/26/2004] [Indexed: 11/15/2022]
Abstract
The 'protein interaction world' (PIW) hypothesis of the origins of life assumes that life emerged as a self-reproducing and expanding system of protein interactions. In mainstream molecular biology, 'replication' refers to the material copying of molecules such as nucleic acids. However, PIW is conceptualized as an abstract communication system constituted by the interactions between proteins, in which 'replication' happens at the level of self-reproduction of these interactions between proteins. Densely concentrated peptide interaction systems may have reproduced and expanded as 'protocell' vesicles surrounded by lipid bi-layer membranes. Protocells led to the emergence of proto-RNA molecules of greater chemical stability which served as chemically differentiated 'memories' of peptide interaction states, thereby facilitating the reproduction and expansion of protocells. Simplification-driven expansion led to the selection of biotic amino acids and the reduction of the typical RNA alphabet to the four usual bases (A, C, G and U). Dense interactions between RNA molecules led to the emergence of the RNA interaction subsystem of the cell, and to the emergence of 'memories' of RNA interactions in the form of DNA molecules with greater chemical stability. The expansion of DNA molecule interactions led to the dense clustering and encapsulation of DNA molecules within the cell nucleus. RNA molecules therefore serve as memories of protein interactions and DNA molecules are memories of RNA interactions. We believe that the PIW hypothesis is more evolutionarily plausible than the mainstream RNA world hypothesis, and has greater explanatory power.
Collapse
Affiliation(s)
- Peter Andras
- School of Computing Science, University of Newcastle Upon Tyne, Newcastle Upon Tyne NE1 7RU, UK.
| | | |
Collapse
|
35
|
Affiliation(s)
- Jay S Siegel
- Organisch-chemisches Institut, Universitat Zurich, Winterthurerstr. 190, Zurich, CH-8057, Switzerland
| | | |
Collapse
|