1
Gupta MN, Uversky VN. Biological importance of arginine: A comprehensive review of the roles in structure, disorder, and functionality of peptides and proteins. Int J Biol Macromol 2024; 257:128646. [PMID: 38061507 DOI: 10.1016/j.ijbiomac.2023.128646] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/07/2023] [Revised: 12/02/2023] [Accepted: 12/04/2023] [Indexed: 01/26/2024]
Abstract
Arginine shows Jekyll and Hyde behavior in several respects. It participates in protein folding via ionic and H-bonds and cation-pi interactions; the charge and hydrophobicity of its side chain make it a disorder-promoting amino acid. Its methylation in histones, RNA-binding proteins, and chaperones regulates several cellular processes. The arginine-centric modifications are important in oncogenesis and as biomarkers in several cardiovascular diseases. The cross-links involving arginine in collagen and cornea are involved in pathogenesis of tissues but have also been useful in tissue engineering and wound-dressing materials. Arginine is part of the active site of several enzymes such as GTPases, peroxidases, and sulfotransferases. Its metabolic importance is obvious as it is involved in production of urea, NO, ornithine and citrulline. It can form unusual functional structures such as molecular tweezers in vitro and sprockets which engage DNA chains as part of histones in vivo. It has been used in design of cell-penetrating peptides as drugs. Arginine has been used as an excipient in both solid and injectable drug formulations; its role in suppressing opalescence due to liquid-liquid phase separation is particularly promising. It has been known as a suppressor of protein aggregation during protein refolding. It has proved its usefulness in protein bioseparation processes like ion-exchange, hydrophobic and affinity chromatographies. Arginine is an amino acid whose importance in biological sciences and biotechnology continues to grow in diverse ways.
Affiliation(s)
- Munishwar Nath Gupta
- Department of Biochemical Engineering and Biotechnology, Indian Institute of Technology, Hauz Khas, New Delhi 110016, India
- Vladimir N Uversky
- Department of Molecular Medicine, USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA
2
Carter CW. Base Pairing Promoted the Self-Organization of Genetic Coding, Catalysis, and Free-Energy Transduction. Life (Basel) 2024; 14:199. [PMID: 38398709 PMCID: PMC10890426 DOI: 10.3390/life14020199] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/05/2024] [Revised: 01/21/2024] [Accepted: 01/25/2024] [Indexed: 02/25/2024]
Abstract
How Nature discovered genetic coding is a largely ignored question, yet the answer is key to explaining the transition from biochemical building blocks to life. Other, related puzzles also fall inside the aegis enclosing the codes themselves. The peptide bond is unstable with respect to hydrolysis. So, it requires some form of chemical free energy to drive it. Amino acid activation and acyl transfer are also slow and must be catalyzed. All living things must thus also convert free energy and synchronize cellular chemistry. Most importantly, functional proteins occupy only small, isolated regions of sequence space. Nature evolved heritable symbolic data processing to seek out and use those sequences. That system has three parts: a memory of how amino acids behave in solution and inside proteins, a set of code keys to access that memory, and a scoring function. The code keys themselves are the genes for cognate pairs of tRNA and aminoacyl-tRNA synthetases, AARSs. The scoring function is the enzymatic specificity constant, kcat/kM, which measures both catalysis and specificity. The work described here deepens the evidence for and understanding of an unexpected consequence of ancestral bidirectional coding. Secondary structures occur in approximately the same places within antiparallel alignments of their gene products. However, the polar amino acids that define the molecular surface of one are reflected into core-defining non-polar side chains on the other. Proteins translated from base-paired coding strands fold up inside out. Bidirectional genes thus project an inverted structural duality into the proteome. I review how experimental data root the scoring functions responsible for the origins of coding and catalyzed activation of unfavorable chemical reactions in that duality.
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260, USA
3
Patra SK, Douglas J, Wills PR, Bouckeart R, Betts L, Qing TG, Carter CW. Genomic database furnishes a spontaneous example of a functional Class II glycyl-tRNA synthetase urzyme. bioRxiv 2024:2024.01.11.575260. [PMID: 38260702 PMCID: PMC10802616 DOI: 10.1101/2024.01.11.575260] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/24/2024]
Abstract
The chief barrier to studies of how genetic coding emerged is the lack of experimental models for ancestral aminoacyl-tRNA synthetases (AARS). We hypothesized that conserved core catalytic sites could represent such ancestors. That hypothesis enabled engineering functional "urzymes" from TrpRS, LeuRS, and HisRS. We describe here a fourth urzyme, GlyCA, detected in an open reading frame from the genomic record of the arctic fox, Vulpes lagopus. GlyCA is homologous to a bacterial heterotetrameric Class II GlyRS-B. Alphafold2 predicted that the N-terminal 81 amino acids would adopt a 3D structure nearly identical to the HisRS urzyme (HisCA1). We expressed and purified that N-terminal segment. Enzymatic characterization revealed a robust single-turnover burst size and a catalytic rate for ATP consumption well in excess of that previously published for HisCA1. Time-dependent aminoacylation of tRNAGly proceeds at a rate consistent with that observed for amino acid activation. In fact, GlyCA is actually 35 times more active in glycine activation by ATP than the full-length GlyRS-B α-subunit dimer. ATP-dependent activation of the 20 canonical amino acids favors Class II amino acids that complement those favored by HisCA and LeuAC. These properties reinforce the notion that urzymes represent the requisite ancestral catalytic activities to implement a reduced genetic coding alphabet.
Affiliation(s)
- Sourav Kumar Patra
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260
- Jordan Douglas
- Department of Physics, The University of Auckland, New Zealand
- Centre for Computational Evolution, University of Auckland, New Zealand
- Peter R. Wills
- Department of Physics, The University of Auckland, New Zealand
- Remco Bouckeart
- Centre for Computational Evolution, University of Auckland, New Zealand
- Department of Computer Science, The University of Auckland, New Zealand
- Laurie Betts
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260
- Charles W. Carter
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260
4
Chen B, Mansour B, Zheng E, Liu Y, Gauld JW, Wang Q. Fundamentals behind the specificity of Cysteinyl-tRNA synthetase: MD and QM/MM joint investigations. Proteins 2023; 91:354-362. [PMID: 36196751 DOI: 10.1002/prot.26433] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/20/2022] [Revised: 09/04/2022] [Accepted: 09/28/2022] [Indexed: 11/05/2022]
Abstract
Cysteinyl-tRNA synthetase (CysRS) catalyzes the aminoacylation reaction of cysteine to its cognate tRNACys in the first step of protein translation. It is found that CysRS is different from other aaRSs as it transfers cysteine without the need for an editing reaction, which is not applicable in the case of serine despite the similarity in their structures. Surprisingly, the reasons why CysRS has high amino acid specificity are not clear yet. In this research, the binding configurations of Cys-AMP and its near-cognate amino acid Ser-AMP with CysRS are compared by Molecular Dynamics (MD). The results reveal that CysRS screens the substrate Cys-AMP to a certain extent during binding and recognition, thus helping to ensure the high selectivity of the subsequent reaction, whereas Ser-AMP adopts a folded state in CysRS. Meanwhile, the interaction between Cys-AMP and Zn963 in CysRS is much stronger than that of Ser-AMP. The substrate-assisted aminoacylation mechanism in CysRS is also explored by Quantum Mechanics/Molecular Mechanics (QM/MM) modeling. According to the QM/MM potential energies, the energy barrier of TSCys-AMP is 91.75 kJ/mol, while that of TSSer-AMP is close to 150 kJ/mol. Based on thermochemistry calculations, it is found that the product of Cys-AMP is more stable than the reactant. In contrast, Ser-AMP has a reactant that is more stable than its product. As a result, the specificity of CysRS originates from both the kinetic and thermodynamic aspects of the reaction. Our investigations show comprehensively how CysRS recognizes the substrate Cys-AMP and catalyzes its reaction, and may provide some guidance for researchers in this area.
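To give a feel for what the two quoted barriers imply kinetically, the following is a minimal sketch that converts a barrier difference into a relative rate with an Arrhenius/transition-state-style exponential. It rests on assumptions that are not in the abstract: equal pre-exponential factors for both substrates, a temperature of 298.15 K, and the QM/MM potential-energy barriers being used as stand-ins for activation free energies. The function name `relative_rate` is hypothetical, and the Ser-AMP barrier is only stated as "close to" 150 kJ/mol, so the result is an order-of-magnitude estimate at best.

```python
import math

R_KJ_PER_MOL_K = 8.314462618e-3  # molar gas constant in kJ/(mol*K)


def relative_rate(barrier_fast_kj: float, barrier_slow_kj: float,
                  temperature_k: float = 298.15) -> float:
    """Return k_fast / k_slow, assuming equal prefactors for both reactions.

    Both barriers are in kJ/mol; the temperature default is an assumption.
    """
    delta = barrier_slow_kj - barrier_fast_kj
    return math.exp(delta / (R_KJ_PER_MOL_K * temperature_k))


# Barriers quoted in the abstract; 150 kJ/mol is only given approximately.
ratio = relative_rate(91.75, 150.0)
print(f"Estimated Cys-AMP / Ser-AMP rate ratio: {ratio:.1e}")
```

Under these assumptions the barrier gap of roughly 58 kJ/mol corresponds to a rate ratio on the order of 10^10, which illustrates why a kinetic contribution to specificity is plausible without any editing step.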
Affiliation(s)
- Binbin Chen
- Department of Chemistry, Zhejiang University, Hangzhou, China; ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, China
- Basel Mansour
- Department of Chemistry and Biochemistry, University of Windsor, Windsor, Canada
- En Zheng
- Department of Chemistry, Zhejiang University, Hangzhou, China
- Yingchun Liu
- Department of Chemistry, Zhejiang University, Hangzhou, China
- James W Gauld
- Department of Chemistry and Biochemistry, University of Windsor, Windsor, Canada
- Qi Wang
- Department of Chemistry, Zhejiang University, Hangzhou, China
5
Furukawa R, Yokobori SI, Sato R, Kumagawa T, Nakagawa M, Katoh K, Yamagishi A. Amino Acid Specificity of Ancestral Aminoacyl-tRNA Synthetase Prior to the Last Universal Common Ancestor Commonote commonote. J Mol Evol 2022; 90:73-94. [PMID: 35084522 PMCID: PMC8821087 DOI: 10.1007/s00239-021-10043-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 02/25/2021] [Accepted: 12/16/2021] [Indexed: 11/24/2022]
Abstract
Extant organisms commonly use 20 amino acids in protein synthesis. In the translation system, aminoacyl-tRNA synthetase (ARS) selectively binds an amino acid and transfers it to the cognate tRNA. It is postulated that the amino acid repertoire of ARS expanded during the development of the translation system. In this study we generated composite phylogenetic trees for seven ARSs (SerRS, ProRS, ThrRS, GlyRS-1, HisRS, AspRS, and LysRS) which are thought to have diverged by gene duplication followed by mutation, before the evolution of the last universal common ancestor. The composite phylogenetic tree shows that the AspRS/LysRS branch diverged from the other five ARSs at the deepest node, with the GlyRS/HisRS branch and the other three ARSs (ThrRS, ProRS and SerRS) diverging at the second deepest node. ThrRS diverged next, and finally ProRS and SerRS diverged from each other. Based on the phylogenetic tree, sequences of the ancestral ARSs prior to the evolution of the last universal common ancestor were predicted. The amino acid specificity of each ancestral ARS was then postulated by comparison with amino acid recognition sites of ARSs of extant organisms. Our predictions demonstrate that ancestral ARSs had substantial specificity and that the number of amino acid types amino-acylated by proteinaceous ARSs was limited before the appearance of a fuller range of proteinaceous ARS species. From an assumption that 10 amino acid species are required for folding and function, proteinaceous ARS possibly evolved in a translation system composed of preexisting ribozyme ARSs, before the evolution of the last universal common ancestor.
Affiliation(s)
- Ryutaro Furukawa
- Department of Applied Life Sciences, School of Life Sciences, Tokyo University of Pharmacy and Life Sciences, 1432-1 Horinouchi, Hachioji, Tokyo, Japan; Faculty of Human Science, Waseda University, 2-579-15 Mikajima, Tokorozawa, Saitama, 359-1192, Japan
- Shin-Ichi Yokobori
- Department of Applied Life Sciences, School of Life Sciences, Tokyo University of Pharmacy and Life Sciences, 1432-1 Horinouchi, Hachioji, Tokyo, Japan
- Riku Sato
- Department of Applied Life Sciences, School of Life Sciences, Tokyo University of Pharmacy and Life Sciences, 1432-1 Horinouchi, Hachioji, Tokyo, Japan
- Taimu Kumagawa
- Department of Applied Life Sciences, School of Life Sciences, Tokyo University of Pharmacy and Life Sciences, 1432-1 Horinouchi, Hachioji, Tokyo, Japan
- Mizuho Nakagawa
- Department of Applied Life Sciences, School of Life Sciences, Tokyo University of Pharmacy and Life Sciences, 1432-1 Horinouchi, Hachioji, Tokyo, Japan
- Kazutaka Katoh
- Department of Genome Informatics, Genome Information Research Center, Research Institute for Microbial Diseases, Osaka University, 3-1 Yamadaoka, Suita, Osaka, 565-0871, Japan
- Akihiko Yamagishi
- Department of Applied Life Sciences, School of Life Sciences, Tokyo University of Pharmacy and Life Sciences, 1432-1 Horinouchi, Hachioji, Tokyo, Japan
6
Overview of tRNA Modifications in Chloroplasts. Microorganisms 2022; 10:microorganisms10020226. [PMID: 35208681 PMCID: PMC8877259 DOI: 10.3390/microorganisms10020226] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/11/2021] [Revised: 01/16/2022] [Accepted: 01/18/2022] [Indexed: 11/29/2022]
Abstract
The chloroplast is a promising platform for biotechnological innovation due to its compact translation machinery. Nucleotide modifications within a minimal set of tRNAs modulate codon–anticodon interactions that are crucial for translation efficiency. However, a comprehensive assessment of these modifications does not presently exist in chloroplasts. Here, we synthesize all available information concerning tRNA modifications in the chloroplast and assign translation efficiency for each modified anticodon–codon pair. In addition, we perform a bioinformatics analysis that links enzymes to tRNA modifications and aminoacylation in the chloroplast of Chlamydomonas reinhardtii. This work provides the first comprehensive analysis of codon and anticodon interactions of chloroplasts and its implication for translation efficiency.
7
Abstract
Codon-dependent translation underlies genetics and phylogenetic inferences, but its origins pose two challenges. Prevailing narratives cannot account for the fact that aminoacyl-tRNA synthetases (aaRSs), which translate the genetic code, must collectively enforce the rules used to assemble themselves. Nor can they explain how specific assignments arose from rudimentary differentiation between ancestral aaRSs and corresponding transfer RNAs (tRNAs). Experimental deconstruction of the two aaRS superfamilies created new experimental tools with which to analyze the emergence of the code. Amino acid and tRNA substrate recognition are linked to phase transfer free energies of amino acids and arise largely from aaRS class-specific differences in secondary structure. Sensitivity to protein folding rules endowed ancestral aaRS-tRNA pairs with the feedback necessary to rapidly compare alternative genetic codes and coding sequences. These and other experimental data suggest that the aaRS bidirectional genetic ancestry stabilized the differentiation and interdependence required to initiate and elaborate the genetic coding table.
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7260, USA
- Peter R Wills
- Department of Physics, University of Auckland, Auckland 1142, New Zealand
8
Then A, Mácha K, Ibrahim B, Schuster S. A novel method for achieving an optimal classification of the proteinogenic amino acids. Sci Rep 2020; 10:15321. [PMID: 32948819 PMCID: PMC7501307 DOI: 10.1038/s41598-020-72174-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 11/21/2019] [Accepted: 08/26/2020] [Indexed: 11/09/2022]
Abstract
The classification of proteinogenic amino acids is crucial for understanding their commonalities as well as their differences to provide a hint for why life settled on the usage of precisely those amino acids. It is also crucial for predicting electrostatic, hydrophobic, stacking and other interactions, for assessing conservation in multiple alignments and many other applications. While several methods have been proposed to find "the" optimal classification, they have several shortcomings, such as the lack of efficiency and interpretability or an unnecessarily high number of discriminating features. In this study, we propose a novel method involving a repeated binary separation via a minimum amount of five features (such as hydrophobicity or volume) expressed by numerical values for amino acid characteristics. The features are extracted from the AAindex database. By simple separation at the medians, we successfully derive the five properties volume, electron-ion-interaction potential, hydrophobicity, α-helix propensity, and π-helix propensity. We extend our analysis to separations other than by the median. We further score our combinations based on how natural the separations are.
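The idea of repeated binary separation at the median can be sketched in a few lines. The snippet below is an illustrative reading of the abstract, not the authors' implementation: each feature is cut at its median, every item receives one bit per feature, and the items count as fully separated when all bit patterns are distinct. Since k binary features can distinguish at most 2^k items, at least five are needed for 20 amino acids (2^4 = 16 < 20 ≤ 32 = 2^5). All names (`median_split`, `binary_codes`, `fully_separated`) and all numeric values are hypothetical toy inputs, not AAindex entries.

```python
from statistics import median


def median_split(values):
    """Map each item to 1 if its value lies above the feature median, else 0."""
    cut = median(values.values())
    return {item: int(v > cut) for item, v in values.items()}


def binary_codes(features):
    """Combine the per-feature median splits into one bit tuple per item."""
    splits = [median_split(vals) for vals in features.values()]
    items = next(iter(features.values())).keys()
    return {item: tuple(s[item] for s in splits) for item in items}


def fully_separated(codes):
    """True when no two items share the same bit pattern."""
    return len(set(codes.values())) == len(codes)


# Toy values for four placeholder items; these are NOT real amino acid data.
toy_features = {
    "feature_1": {"a": 1.0, "b": 2.0, "c": 3.0, "d": 4.0},
    "feature_2": {"a": 1.0, "b": 3.0, "c": 2.0, "d": 4.0},
}
codes = binary_codes(toy_features)
print(codes)
print(fully_separated(codes))
```

With real data, the same check would be run over candidate sets of five property scales to find combinations whose median cuts give each of the 20 amino acids a unique pattern.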
Affiliation(s)
- Andre Then
- Chair of Bioinformatics, Matthias Schleiden Institute, University of Jena, Ernst-Abbe-Platz 2, 07743, Jena, Germany
- Karel Mácha
- Chair of Bioinformatics, Matthias Schleiden Institute, University of Jena, Ernst-Abbe-Platz 2, 07743, Jena, Germany; Westernacher Solutions, Columbiadamm 37, 10965, Berlin, Germany
- Bashar Ibrahim
- Chair of Bioinformatics, Matthias Schleiden Institute, University of Jena, Ernst-Abbe-Platz 2, 07743, Jena, Germany; Department of Mathematics and Natural Sciences, Centre for Applied Mathematics and Bioinformatics, Gulf University for Science and Technology, 32093, Hawally, Kuwait
- Stefan Schuster
- Chair of Bioinformatics, Matthias Schleiden Institute, University of Jena, Ernst-Abbe-Platz 2, 07743, Jena, Germany
9
Kaiser F, Krautwurst S, Salentin S, Haupt VJ, Leberecht C, Bittrich S, Labudde D, Schroeder M. The structural basis of the genetic code: amino acid recognition by aminoacyl-tRNA synthetases. Sci Rep 2020; 10:12647. [PMID: 32724042 PMCID: PMC7387524 DOI: 10.1038/s41598-020-69100-0] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Received: 04/17/2020] [Accepted: 07/06/2020] [Indexed: 12/29/2022]
Abstract
Storage and directed transfer of information is the key requirement for the development of life. Yet any information stored on our genes is useless without its correct interpretation. The genetic code defines the rule set to decode this information. Aminoacyl-tRNA synthetases are at the heart of this process. We extensively characterize how these enzymes distinguish all natural amino acids based on the computational analysis of crystallographic structure data. The results of this meta-analysis show that the correct read-out of genetic information is a delicate interplay between the composition of the binding site, non-covalent interactions, error correction mechanisms, and steric effects.
Affiliation(s)
- Florian Kaiser
- Biotechnology Center (BIOTEC), TU Dresden, 01307, Dresden, Germany; PharmAI GmbH, Tatzberg 47, 01307, Dresden, Germany
- Sarah Krautwurst
- University of Applied Sciences Mittweida, 09648, Mittweida, Germany
- V Joachim Haupt
- Biotechnology Center (BIOTEC), TU Dresden, 01307, Dresden, Germany; PharmAI GmbH, Tatzberg 47, 01307, Dresden, Germany
- Dirk Labudde
- University of Applied Sciences Mittweida, 09648, Mittweida, Germany
10
Gospodinov A, Kunnev D. Universal Codons with Enrichment from GC to AU Nucleotide Composition Reveal a Chronological Assignment from Early to Late Along with LUCA Formation. Life (Basel) 2020; 10:life10060081. [PMID: 32516985 PMCID: PMC7345086 DOI: 10.3390/life10060081] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Received: 05/03/2020] [Revised: 05/30/2020] [Accepted: 06/03/2020] [Indexed: 12/14/2022]
Abstract
The emergence of a primitive genetic code should be considered the most essential event during the origin of life. Almost a complete set of codons (as we know them) should have been established relatively early during the evolution of the last universal common ancestor (LUCA) from which all known organisms descended. Many hypotheses have been proposed to explain the driving forces and chronology of the evolution of the genetic code; however, none is commonly accepted. In the current paper, we explore the features of the genetic code that, in our view, reflect the mechanism and the chronological order of the origin of the genetic code. Our hypothesis postulates that the primordial RNA was mostly GC-rich, and this bias was reflected in the order of amino acid codon assignment. If we arrange the codons and their corresponding amino acids from GC-rich to AU-rich, we find that: 1. The amino acids encoded by GC-rich codons (Ala, Gly, Arg, and Pro) are those that contribute the most to the interactions with RNA (if incorporated into short peptides). 2. This order correlates with the addition of novel functions necessary for the evolution from simple to longer folded peptides. 3. The overlay of aminoacyl-tRNA synthetases (aaRS) to the amino acid order produces a distinctive zonal distribution for class I and class II suggesting an interdependent origin. These correlations could be explained by the active role of the bridge peptide (BP), which we proposed earlier in the evolution of the genetic code.
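The GC-to-AU ordering described above is easy to reproduce for the standard genetic code. The sketch below is an illustration rather than the authors' analysis: it builds the 64-codon table from the conventional UCAG ordering, counts G and C bases per codon, and groups the encoded amino acids by that count. The helper names (`gc_count`, `amino_acids_by_gc`) are hypothetical; `*` marks stop codons.

```python
from itertools import product

BASES = "UCAG"
# Standard genetic code, one letter per codon in UCAG order (UUU, UUC, UUA, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def gc_count(codon):
    """Number of G or C bases in a codon (0 to 3)."""
    return sum(base in "GC" for base in codon)


def amino_acids_by_gc(table):
    """Group one-letter amino acid codes by the GC count of their codons."""
    groups = {n: set() for n in range(4)}
    for codon, aa in table.items():
        groups[gc_count(codon)].add(aa)
    return groups


groups = amino_acids_by_gc(CODON_TABLE)
for n in range(3, -1, -1):  # from GC-rich to AU-rich
    print(n, sorted(groups[n]))
```

The eight codons made up entirely of G and C encode only Ala, Gly, Pro, and Arg (A, G, P, R), which is the set the abstract singles out as GC-rich.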
Affiliation(s)
- Anastas Gospodinov
- Roumen Tsanev Institute of Molecular Biology, Bulgarian Academy of Sciences, Acad. G. Bonchev Str. 21, Sofia 1113, Bulgaria
- Dimiter Kunnev
- Department of Molecular & Cellular Biology, Roswell Park Cancer Institute, Buffalo, NY 14263, USA
11
Takénaka A, Moras D. Correlation between equi-partition of aminoacyl-tRNA synthetases and amino-acid biosynthesis pathways. Nucleic Acids Res 2020; 48:3277-3285. [PMID: 31965182 PMCID: PMC7102985 DOI: 10.1093/nar/gkaa013] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Received: 12/04/2019] [Revised: 12/31/2019] [Accepted: 01/07/2020] [Indexed: 12/11/2022]
Abstract
The partition of aminoacyl-tRNA synthetases (aaRSs) into two classes of equal size and the correlated amino acid distribution is a puzzling still unexplained observation. We propose that the time scale of the amino-acid synthesis, assumed to be proportional to the number of reaction steps (NE) involved in the biosynthesis pathway, is one of the parameters that controlled the timescale of aaRSs appearance. Because all pathways are branched at fructose-6-phosphate on the metabolic pathway, this product is defined as the common origin for the NE comparison. For each amino-acid, the NE value, counted from the origin to the final product, provides a timescale for the pathways to be established. An archeological approach based on NE reveals that aaRSs of the two classes are generated in pair along this timescale. The results support the coevolution theory for the origin of the genetic code with an earlier appearance of class II aaRSs.
Affiliation(s)
- Akio Takénaka
- Research Institute, Chiba Institute of Technology, 2-17-1 Tsudanuma, Narashino, Chiba 275-0016, Japan; Faculty of Pharmacy, Shenyang Pharmaceutical University, Benxi, Liaoning 117004, China
- Dino Moras
- Department of Integrated Structural Biology, Institut de Génétique et de Biologie Moléculaire et Cellulaire (IGBMC) 1 rue Laurent Fries, Illkirch 67404, France; Centre National de Recherche Scientifique (CNRS) UMR 7104, France; Institut National de Santé et de Recherche Médicale (INSERM) U1258, France; Université de Strasbourg, Illkirch, France
12
The Ancient Operational Code is Embedded in the Amino Acid Substitution Matrix and aaRS Phylogenies. J Mol Evol 2019; 88:136-150. [PMID: 31781936 DOI: 10.1007/s00239-019-09918-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Received: 08/28/2019] [Accepted: 11/14/2019] [Indexed: 10/25/2022]
Abstract
The underlying structure of the canonical amino acid substitution matrix (aaSM) is examined by considering stepwise improvements in the differential recognition of amino acids according to their chemical properties during the branching history of the two aminoacyl-tRNA synthetase (aaRS) superfamilies. The evolutionary expansion of the genetic code is described by a simple parameterization of the aaSM, in which (i) the number of distinguishable amino acid types, (ii) the matrix dimension and (iii) the number of parameters, each increases by one for each bifurcation in an aaRS phylogeny. Parameterized matrices corresponding to trees in which the size of an amino acid sidechain is the only discernible property behind its categorization as a substrate, exclusively for a Class I or II aaRS, provide a significantly better fit to empirically determined aaSM than trees with random bifurcation patterns. A second split between polar and nonpolar amino acids in each Class effects a vastly greater further improvement. The earliest Class-separated epochs in the phylogenies of the aaRS reflect these enzymes' capability to distinguish tRNAs through the recognition of acceptor stem identity elements via the minor (Class I) and major (Class II) helical grooves, which is how the ancient operational code functioned. The advent of tRNA recognition using the anticodon loop supports the evolution of the optimal map of amino acid chemistry found in the later genetic code, an essentially digital categorization, in which polarity is the major functional property, compensating for the unrefined, haphazard differentiation of amino acids achieved by the operational code.
13
Carter CW, Wills PR. Experimental solutions to problems defining the origin of codon-directed protein synthesis. Biosystems 2019; 183:103979. [PMID: 31176803 PMCID: PMC6693952 DOI: 10.1016/j.biosystems.2019.103979] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Received: 04/25/2019] [Revised: 05/27/2019] [Accepted: 05/29/2019] [Indexed: 12/13/2022]
Abstract
How genetic coding differentiated biology from chemistry is a long-standing challenge in Biology, for which there have been few experimental approaches, despite a wide-ranging speculative literature. We summarize five coordinated areas-experimental characterization of functional approximations to the minimal peptides (protozymes and urzymes) necessary to activate amino acids and acylate tRNA; showing that specificities of these experimental models match those expected from the synthetase Class division; population of disjoint regions of amino acid sequence space via bidirectional coding ancestry of the two synthetase Classes; showing that the phase transfer equilibria of amino acid side chains that form a two-dimensional basis set for protein folding are embedded in patterns of bases in the tRNA acceptor stem and anticodon; and identification of molecular signatures of ancestral synthetases and tRNAs necessary to define the earliest cognate synthetase:tRNA pairs-that now compose an extensive experimentally testable paradigm for progress toward understanding the coordinated emergence of the codon table and viable mRNA coding sequences. We briefly discuss recent progress toward identifying the remaining outstanding questions-the nature of the earliest amino acid alphabets and the origin of binding discrimination via distinct amino acid sequence-independent protein secondary structures-and how these, too, might be addressed experimentally.
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260, United States
- Peter R Wills
- Department of Physics and Te Ao Marama Centre for Fundamental Inquiry, University of Auckland, PB 92019, Auckland 1142, New Zealand
14
Carter CW, Wills PR. Hierarchical groove discrimination by Class I and II aminoacyl-tRNA synthetases reveals a palimpsest of the operational RNA code in the tRNA acceptor-stem bases. Nucleic Acids Res 2019; 46:9667-9683. [PMID: 30016476 PMCID: PMC6182185 DOI: 10.1093/nar/gky600] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Received: 03/06/2018] [Accepted: 07/12/2018] [Indexed: 01/01/2023]
Abstract
Class I and II aaRS recognition of opposite grooves was likely among the earliest determinants fixed in the tRNA acceptor stem bases. A new regression model identifies those determinants in bacterial tRNAs. Integral coefficients relate digital dependent to independent variables with perfect agreement between observed and calculated grooves for all twenty isoaccepting tRNAs. Recognition is mediated by the Discriminator base 73, the first base pair, and base 2 of the acceptor stem. Subsets of these coefficients also identically compute grooves recognized by smaller numbers of aaRS. Thus, the model is hierarchical, suggesting that new rules were added to pre-existing ones as new amino acids joined the coding alphabet. A thermodynamic rationale for the simplest model implies that Class-dependent aaRS secondary structures exploited differential tendencies of the acceptor stem to form the hairpin observed in Class I aaRS•tRNA complexes, enabling the earliest groove discrimination. Curiously, groove recognition also depends explicitly on the identity of base 2 in a manner consistent with the middle bases of the codon table, confirming a hidden ancestry of codon-anticodon pairing in the acceptor stem. That, and the lack of correlation with anticodon bases support prior productive coding interaction of tRNA minihelices with proto-mRNA.
Affiliation(s)
- Charles W Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260, USA
- Peter R Wills
- Department of Physics, Centre for Computational Evolution, and Te Ao Marama Centre for Fundamental Enquiry, University of Auckland, PB 92109, Auckland 1142, New Zealand
15
Bittrich S, Schroeder M, Labudde D. Characterizing the relation of functional and Early Folding Residues in protein structures using the example of aminoacyl-tRNA synthetases. PLoS One 2018; 13:e0206369. [PMID: 30376559 PMCID: PMC6207335 DOI: 10.1371/journal.pone.0206369] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Received: 07/31/2018] [Accepted: 10/11/2018] [Indexed: 01/10/2023]
Abstract
Proteins are chains of amino acids which adopt a three-dimensional structure and are then able to catalyze chemical reactions or propagate signals in organisms. Without external influence, many proteins fold into their native structure, and a small number of Early Folding Residues (EFR) have previously been shown to initiate the formation of secondary structure elements and guide their respective assembly. Using the two diverse superfamilies of aminoacyl-tRNA synthetases (aaRS), it is shown that the position of EFR is preserved over the course of evolution even when the corresponding sequence conservation is small. Folding initiation sites are positioned in the center of secondary structure elements, independent of aaRS class. In class I, the predicted position of EFR resembles an ancient structural packing motif present in many seemingly unrelated proteins. Furthermore, it is shown that EFR and functionally relevant residues in aaRS are almost entirely disjoint sets of residues. The Start2Fold database is used to investigate whether this separation of EFR and functional residues can be observed for other proteins. EFR are found to constitute crucial connectors of protein regions which are distant at sequence level. Especially, these residues exhibit a high number of non-covalent residue-residue contacts such as hydrogen bonds and hydrophobic interactions. This tendency also manifests as energetically stable local regions, as substantiated by a knowledge-based potential. Despite profound differences regarding how EFR and functional residues are embedded in protein structures, a strict separation of structurally and functionally relevant residues cannot be observed for a more general collection of proteins.
Affiliation(s)
- Sebastian Bittrich
- Applied Computer Sciences & Biosciences, University of Applied Sciences Mittweida, Mittweida, Saxony, Germany
- Biotechnology Center (BIOTEC), Technische Universität Dresden, Dresden, Saxony, Germany
- Michael Schroeder
- Biotechnology Center (BIOTEC), Technische Universität Dresden, Dresden, Saxony, Germany
- Dirk Labudde
- Applied Computer Sciences & Biosciences, University of Applied Sciences Mittweida, Mittweida, Saxony, Germany