51
|
Zhao VY, Rodrigues JV, Lozovsky ER, Hartl DL, Shakhnovich EI. Switching an active site helix in dihydrofolate reductase reveals limits to subdomain modularity. Biophys J 2021; 120:4738-4750. [PMID: 34571014 PMCID: PMC8595743 DOI: 10.1016/j.bpj.2021.09.032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2021] [Revised: 09/14/2021] [Accepted: 09/22/2021] [Indexed: 11/23/2022] Open
Abstract
To what degree are individual structural elements within proteins modular such that similar structures from unrelated proteins can be interchanged? We study subdomain modularity by creating 20 chimeras of an enzyme, Escherichia coli dihydrofolate reductase (DHFR), in which a catalytically important, 10-residue α-helical sequence is replaced by α-helical sequences from a diverse set of proteins. The chimeras stably fold but have a range of diminished thermal stabilities and catalytic activities. Evolutionary coupling analysis indicates that the residues of this α-helix are under selection pressure to maintain catalytic activity in DHFR. Reversion to phenylalanine at key position 31 was found to partially restore catalytic activity, which could be explained by evolutionary coupling values. We performed molecular dynamics simulations using replica exchange with solute tempering. Chimeras with low catalytic activity exhibit nonhelical conformations that block the binding site and disrupt the positioning of the catalytically essential residue D27. Simulation observables and in vitro measurements of thermal stability and substrate-binding affinity are strongly correlated. Several E. coli strains with chromosomally integrated chimeric DHFRs can grow, with growth rates that follow predictions from a kinetic flux model that depends on the intracellular abundance and catalytic activity of DHFR. Our findings show that although α-helices are not universally substitutable, the molecular and fitness effects of modular segments can be predicted by the biophysical compatibility of the replacement segment.
Collapse
Affiliation(s)
- Victor Y Zhao
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts
| | - João V Rodrigues
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts
| | - Elena R Lozovsky
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts
| | - Daniel L Hartl
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts
| | - Eugene I Shakhnovich
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts.
| |
Collapse
|
52
|
TwinCons: Conservation score for uncovering deep sequence similarity and divergence. PLoS Comput Biol 2021; 17:e1009541. [PMID: 34714829 PMCID: PMC8580257 DOI: 10.1371/journal.pcbi.1009541] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2021] [Revised: 11/10/2021] [Accepted: 10/06/2021] [Indexed: 11/19/2022] Open
Abstract
We have developed the program TwinCons, to detect noisy signals of deep ancestry of proteins or nucleic acids. As input, the program uses a composite alignment containing pre-defined groups, and mathematically determines a 'cost' of transforming one group to the other at each position of the alignment. The output distinguishes conserved, variable and signature positions. A signature is conserved within groups but differs between groups. The method automatically detects continuous characteristic stretches (segments) within alignments. TwinCons provides a convenient representation of conserved, variable and signature positions as a single score, enabling the structural mapping and visualization of these characteristics. Structure is more conserved than sequence. TwinCons highlights alternative sequences of conserved structures. Using TwinCons, we detected highly similar segments between proteins from the translation and transcription systems. TwinCons detects conserved residues within regions of high functional importance for the ribosomal RNA (rRNA) and demonstrates that signatures are not confined to specific regions but are distributed across the rRNA structure. The ability to evaluate both nucleic acid and protein alignments allows TwinCons to be used in combined sequence and structural analysis of signatures and conservation in rRNA and in ribosomal proteins (rProteins). TwinCons detects a strong sequence conservation signal between bacterial and archaeal rProteins related by circular permutation. This conserved sequence is structurally colocalized with conserved rRNA, indicated by TwinCons scores of rRNA alignments of bacterial and archaeal groups. This combined analysis revealed deep co-evolution of rRNA and rProtein buried within the deepest branching points in the tree of life.
Collapse
|
53
|
Alvarez-Carreño C, Penev PI, Petrov AS, Williams LD. Fold Evolution before LUCA: Common Ancestry of SH3 Domains and OB Domains. Mol Biol Evol 2021; 38:5134-5143. [PMID: 34383917 PMCID: PMC8557408 DOI: 10.1093/molbev/msab240] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
SH3 and OB are the simplest, oldest, and most common protein domains within the translation system. SH3 and OB domains are β-barrels that are structurally similar but are topologically distinct. To transform an OB domain to a SH3 domain, β-strands must be permuted in a multistep and evolutionarily implausible mechanism. Here, we explored relationships between SH3 and OB domains of ribosomal proteins, initiation, and elongation factors using a combined sequence- and structure-based approach. We detect a common core of SH3 and OB domains, as a region of significant structure and sequence similarity. The common core contains four β-strands and a loop, but omits the fifth β-strand, which is variable and is absent from some OB and SH3 domain proteins. The structure of the common core immediately suggests a simple permutation mechanism for interconversion between SH3 and OB domains, which appear to share an ancestor. The OB domain was formed by duplication and adaptation of the SH3 domain core, or vice versa, in a simple and probable transformation. By employing the folding algorithm AlphaFold2, we demonstrated that an ancestral reconstruction of a permuted SH3 sequence folds into an OB structure, and an ancestral reconstruction of a permuted OB sequence folds into a SH3 structure. The tandem SH3 and OB domains in the universal ribosomal protein uL2 share a common ancestor, suggesting that the divergence of these two domains occurred before the last universal common ancestor.
Collapse
Affiliation(s)
- Claudia Alvarez-Carreño
- NASA Center for the Origin of Life, Georgia Institute of Technology, Atlanta, GA, USA
- School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, GA, USA
| | - Petar I Penev
- NASA Center for the Origin of Life, Georgia Institute of Technology, Atlanta, GA, USA
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA
| | - Anton S Petrov
- NASA Center for the Origin of Life, Georgia Institute of Technology, Atlanta, GA, USA
- School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, GA, USA
| | - Loren Dean Williams
- NASA Center for the Origin of Life, Georgia Institute of Technology, Atlanta, GA, USA
- School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, GA, USA
| |
Collapse
|
54
|
Yagi S, Padhi AK, Vucinic J, Barbe S, Schiex T, Nakagawa R, Simoncini D, Zhang KYJ, Tagami S. Seven Amino Acid Types Suffice to Create the Core Fold of RNA Polymerase. J Am Chem Soc 2021; 143:15998-16006. [PMID: 34559526 DOI: 10.1021/jacs.1c05367] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
The extant complex proteins must have evolved from ancient short and simple ancestors. The double-ψ β-barrel (DPBB) is one of the oldest protein folds and conserved in various fundamental enzymes, such as the core domain of RNA polymerase. Here, by reverse engineering a modern DPBB domain, we reconstructed its plausible evolutionary pathway started by "interlacing homodimerization" of a half-size peptide, followed by gene duplication and fusion. Furthermore, by simplifying the amino acid repertoire of the peptide, we successfully created the DPBB fold with only seven amino acid types (Ala, Asp, Glu, Gly, Lys, Arg, and Val), which can be coded by only GNN and ARR (R = A or G) codons in the modern translation system. Thus, the DPBB fold could have been materialized by the early translation system and genetic code.
Collapse
Affiliation(s)
- Sota Yagi
- RIKEN Center for Biosystems Dynamics Research, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
| | - Aditya K Padhi
- RIKEN Center for Biosystems Dynamics Research, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
| | - Jelena Vucinic
- Université Fédérale de Toulouse, ANITI, INRAE-UR 875, 31000 Toulouse, France.,TBI, Université Fédérale de Toulouse, CNRS, INRAE, INSA, ANITI, 31000 Toulouse, France.,Université Fédérale de Toulouse, ANITI, IRIT-UMR 5505, 31000 Toulouse, France
| | - Sophie Barbe
- TBI, Université Fédérale de Toulouse, CNRS, INRAE, INSA, ANITI, 31000 Toulouse, France
| | - Thomas Schiex
- Université Fédérale de Toulouse, ANITI, INRAE-UR 875, 31000 Toulouse, France
| | - Reiko Nakagawa
- RIKEN Center for Biosystems Dynamics Research, 2-2-3 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan
| | - David Simoncini
- Université Fédérale de Toulouse, ANITI, IRIT-UMR 5505, 31000 Toulouse, France
| | - Kam Y J Zhang
- RIKEN Center for Biosystems Dynamics Research, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
| | - Shunsuke Tagami
- RIKEN Center for Biosystems Dynamics Research, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
| |
Collapse
|
55
|
Le Vay K, Song EY, Ghosh B, Tang TYD, Mutschler H. Enhanced Ribozyme-Catalyzed Recombination and Oligonucleotide Assembly in Peptide-RNA Condensates. Angew Chem Int Ed Engl 2021; 60:26096-26104. [PMID: 34569680 PMCID: PMC9299051 DOI: 10.1002/anie.202109267] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2021] [Indexed: 11/17/2022]
Abstract
The ability of RNA to catalyze RNA ligation is critical to its central role in many prebiotic model scenarios, in particular the copying of information during self‐replication. Prebiotically plausible ribozymes formed from short oligonucleotides can catalyze reversible RNA cleavage and ligation reactions, but harsh conditions or unusual scenarios are often required to promote folding and drive the reaction equilibrium towards ligation. Here, we demonstrate that ribozyme activity is greatly enhanced by charge‐mediated phase separation with poly‐L‐lysine, which shifts the reaction equilibrium from cleavage in solution to ligation in peptide‐RNA coaggregates and coacervates. This compartmentalization enables robust isothermal RNA assembly over a broad range of conditions, which can be leveraged to assemble long and complex RNAs from short fragments under mild conditions in the absence of exogenous activation chemistry, bridging the gap between pools of short oligomers and functional RNAs.
Collapse
Affiliation(s)
- Kristian Le Vay
- Biomimetic Systems, Max Planck Institute of Biochemistry, Am Klopferspitz 18, 82152, Martinsried, Germany.,Department of Chemistry and Chemical Biology, TU Dortmund University, Otto-Hahn-Str. 4a, 44227, Dortmund, Germany
| | - Emilie Yeonwha Song
- Biomimetic Systems, Max Planck Institute of Biochemistry, Am Klopferspitz 18, 82152, Martinsried, Germany.,Department of Chemistry and Chemical Biology, TU Dortmund University, Otto-Hahn-Str. 4a, 44227, Dortmund, Germany
| | - Basusree Ghosh
- Max-Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstraße 108, 01307, Dresden, Germany
| | - T-Y Dora Tang
- Max-Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstraße 108, 01307, Dresden, Germany
| | - Hannes Mutschler
- Department of Chemistry and Chemical Biology, TU Dortmund University, Otto-Hahn-Str. 4a, 44227, Dortmund, Germany
| |
Collapse
|
56
|
Kamiński K, Ludwiczak J, Jasiński M, Bukala A, Madaj R, Szczepaniak K, Dunin-Horkawicz S. Rossmann-toolbox: a deep learning-based protocol for the prediction and design of cofactor specificity in Rossmann fold proteins. Brief Bioinform 2021; 23:6375059. [PMID: 34571541 PMCID: PMC8769691 DOI: 10.1093/bib/bbab371] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2021] [Revised: 08/04/2021] [Accepted: 08/22/2021] [Indexed: 11/15/2022] Open
Abstract
The Rossmann fold enzymes are involved in essential biochemical pathways such as nucleotide and amino acid metabolism. Their functioning relies on interaction with cofactors, small nucleoside-based compounds specifically recognized by a conserved βαβ motif shared by all Rossmann fold proteins. While Rossmann methyltransferases recognize only a single cofactor type, the S-adenosylmethionine, the oxidoreductases, depending on the family, bind nicotinamide (nicotinamide adenine dinucleotide, nicotinamide adenine dinucleotide phosphate) or flavin-based (flavin adenine dinucleotide) cofactors. In this study, we showed that despite its short length, the βαβ motif unambiguously defines the specificity towards the cofactor. Following this observation, we trained two complementary deep learning models for the prediction of the cofactor specificity based on the sequence and structural features of the βαβ motif. A benchmark on two independent test sets, one containing βαβ motifs bearing no resemblance to those of the training set, and the other comprising 38 experimentally confirmed cases of rational design of the cofactor specificity, revealed the nearly perfect performance of the two methods. The Rossmann-toolbox protocols can be accessed via the webserver at https://lbs.cent.uw.edu.pl/rossmann-toolbox and are available as a Python package at https://github.com/labstructbioinf/rossmann-toolbox.
Collapse
Affiliation(s)
- Kamil Kamiński
- Laboratory of Structural Bioinformatics, Centre of New Technologies, University of Warsaw, 02-097 Warsaw, Poland
| | - Jan Ludwiczak
- Laboratory of Structural Bioinformatics, Centre of New Technologies, University of Warsaw, 02-097 Warsaw, Poland.,Laboratory of Bioinformatics, Nencki Institute of Experimental Biology, Pasteura 3, 02-093 Warsaw, Poland
| | - Maciej Jasiński
- Laboratory of Structural Bioinformatics, Centre of New Technologies, University of Warsaw, 02-097 Warsaw, Poland
| | - Adriana Bukala
- Laboratory of Structural Bioinformatics, Centre of New Technologies, University of Warsaw, 02-097 Warsaw, Poland
| | - Rafal Madaj
- Centre of Molecular and Macromolecular Studies, Polish Academy of Sciences, Sienkiewicza 112, 90-363, Lodz, Poland
| | - Krzysztof Szczepaniak
- Laboratory of Structural Bioinformatics, Centre of New Technologies, University of Warsaw, 02-097 Warsaw, Poland
| | - Stanisław Dunin-Horkawicz
- Laboratory of Structural Bioinformatics, Centre of New Technologies, University of Warsaw, 02-097 Warsaw, Poland
| |
Collapse
|
57
|
Freire MÁ. Short non-coded peptides interacting with cofactors facilitated the integration of early chemical networks. Biosystems 2021; 211:104547. [PMID: 34547425 DOI: 10.1016/j.biosystems.2021.104547] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Revised: 08/28/2021] [Accepted: 09/15/2021] [Indexed: 11/02/2022]
Abstract
Independently developed iron-sulphur/thioester- and phosphate-driven chemical reactions would have set up two distinct reaction networks prior to coupling in a proto-metabolic system supporting a minimal organisation closure. Each chemical system assisted initially by simple catalysts and then by more complex cofactors would have provided the precursors of the small metabolites and monomer units along with their respective polymers through dehydrating template-independent assemblies. For example, acylation reactions mediated by activated thioester groups produced peptides, fatty acids and polyhydroxyalkanoates, while phosphorylation reactions by phosphorylating agents allowed the synthesis of polysaccharides, polyribonucleotides and polyphosphates. Here, we address how these independent chemical systems might fit together and shaped a proto-metabolic system, focusing specifically on cofactors as molecular fossils of metabolism. As a result, the proposed overview suggests that non-coded peptides capable of binding a variety of ligands, but in particular with a redox active versatility and/or group transfer potential could have facilitated the chemical connections that led to a minimal closure with a proto-metabolism. Later developments would have made it possible to establish a cellular organisation with more complex and interdependent metabolic pathways.
Collapse
Affiliation(s)
- Miguel Ángel Freire
- Instituto Multidisciplinario de Biología Vegetal (IMBIV), CONICET, Universidad Nacional de Córdoba (UNC). Facultad de Ciencias Exactas, Físicas y Naturales. Av. Vélez Sarsfield 299, CC 495, 5000, Córdoba, Argentina.
| |
Collapse
|
58
|
Pinto GP, Corbella M, Demkiv AO, Kamerlin SCL. Exploiting enzyme evolution for computational protein design. Trends Biochem Sci 2021; 47:375-389. [PMID: 34544655 DOI: 10.1016/j.tibs.2021.08.008] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Revised: 08/18/2021] [Accepted: 08/24/2021] [Indexed: 11/15/2022]
Abstract
Recent years have seen an explosion of interest in understanding the physicochemical parameters that shape enzyme evolution, as well as substantial advances in computational enzyme design. This review discusses three areas where evolutionary information can be used as part of the design process: (i) using ancestral sequence reconstruction (ASR) to generate new starting points for enzyme design efforts; (ii) learning from how nature uses conformational dynamics in enzyme evolution to mimic this process in silico; and (iii) modular design of enzymes from smaller fragments, again mimicking the process by which nature appears to create new protein folds. Using showcase examples, we highlight the importance of incorporating evolutionary information to continue to push forward the boundaries of enzyme design studies.
Collapse
Affiliation(s)
- Gaspar P Pinto
- Department of Chemistry - BMC, Uppsala University, BMC Box 576, S-751 23 Uppsala, Sweden
| | - Marina Corbella
- Department of Chemistry - BMC, Uppsala University, BMC Box 576, S-751 23 Uppsala, Sweden
| | - Andrey O Demkiv
- Department of Chemistry - BMC, Uppsala University, BMC Box 576, S-751 23 Uppsala, Sweden
| | | |
Collapse
|
59
|
Ferruz N, Michel F, Lobos F, Schmidt S, Höcker B. Fuzzle 2.0: Ligand Binding in Natural Protein Building Blocks. Front Mol Biosci 2021; 8:715972. [PMID: 34485385 PMCID: PMC8416435 DOI: 10.3389/fmolb.2021.715972] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2021] [Accepted: 08/06/2021] [Indexed: 11/13/2022] Open
Abstract
Modern proteins have been shown to share evolutionary relationships via subdomain-sized fragments. The assembly of such fragments through duplication and recombination events led to the complex structures and functions we observe today. We previously implemented a pipeline that identified more than 1,000 of these fragments that are shared by different protein folds and developed a web interface to analyze and search for them. This resource named Fuzzle helps structural and evolutionary biologists to identify and analyze conserved parts of a protein but it also provides protein engineers with building blocks for example to design proteins by fragment combination. Here, we describe a new version of this web resource that was extended to include ligand information. This addition is a significant asset to the database since now protein fragments that bind specific ligands can be identified and analyzed. Often the mode of ligand binding is conserved in proteins thereby supporting a common evolutionary origin. The same can now be explored for subdomain-sized fragments within this database. This ligand binding information can also be used in protein engineering to graft binding pockets into other protein scaffolds or to transfer functional sites via recombination of a specific fragment. Fuzzle 2.0 is freely available at https://fuzzle.uni-bayreuth.de/2.0.
Collapse
Affiliation(s)
- Noelia Ferruz
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
| | - Florian Michel
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
| | - Francisco Lobos
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
| | - Steffen Schmidt
- Computational Biochemistry, University of Bayreuth, Bayreuth, Germany
| | - Birte Höcker
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
| |
Collapse
|
60
|
Nekrasov AN, Kozmin YP, Kozyrev SV, Ziganshin RH, de Brevern AG, Anashkina AA. Hierarchical Structure of Protein Sequence. Int J Mol Sci 2021; 22:8339. [PMID: 34361104 PMCID: PMC8348890 DOI: 10.3390/ijms22158339] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2021] [Revised: 07/22/2021] [Accepted: 07/27/2021] [Indexed: 11/28/2022] Open
Abstract
Most non-communicable diseases are associated with dysfunction of proteins or protein complexes. The relationship between sequence and structure has been analyzed for a long time, and the analysis of the sequences organization in domains and motifs remains an actual research area. Here, we propose a mathematical method for revealing the hierarchical organization of protein sequences. The method is based on the pentapeptide as a unit of protein sequences. Employing the frequency of occurrence of pentapeptides in sequences of natural proteins and a special mathematical approach, this method revealed a hierarchical structure in the protein sequence. The method was applied to 24,647 non-homologous protein sequences with sizes ranging from 50 to 400 residues from the NRDB90 database. Statistical analysis of the branching points of the graphs revealed 11 characteristic values of y (the width of the inscribed function), showing the relationship of these multiple fragments of the sequences. Several examples illustrate how fragments of the protein spatial structure correspond to the elements of the hierarchical structure of the protein sequence. This methodology provides a promising basis for a mathematically-based classification of the elements of the spatial organization of proteins. Elements of the hierarchical structure of different levels of the hierarchy can be used to solve biotechnological and medical problems.
Collapse
Affiliation(s)
- Alexei N. Nekrasov
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, The Russian Academy of Sciences, Miklukho-Maklaya St. 16/10, 117997 Moscow, Russia; (A.N.N.); (Y.P.K.); (R.H.Z.)
| | - Yuri P. Kozmin
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, The Russian Academy of Sciences, Miklukho-Maklaya St. 16/10, 117997 Moscow, Russia; (A.N.N.); (Y.P.K.); (R.H.Z.)
| | - Sergey V. Kozyrev
- Steklov Mathematical Institute and of Russian Academy of Sciences, 8 Gubkina St., 119991 Moscow, Russia;
| | - Rustam H. Ziganshin
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, The Russian Academy of Sciences, Miklukho-Maklaya St. 16/10, 117997 Moscow, Russia; (A.N.N.); (Y.P.K.); (R.H.Z.)
| | - Alexandre G. de Brevern
- INSERM UMR S-1134, DSIMB, Univ. Paris, INTS, Lab. of Excellence GR-Ex 6, rue Alexandre Cabanel, CEDEX 15, 75739 Paris, France;
| | - Anastasia A. Anashkina
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Vavilov St. 32, 119991 Moscow, Russia
| |
Collapse
|
61
|
Romero-Romero S, Kordes S, Michel F, Höcker B. Evolution, folding, and design of TIM barrels and related proteins. Curr Opin Struct Biol 2021; 68:94-104. [PMID: 33453500 PMCID: PMC8250049 DOI: 10.1016/j.sbi.2020.12.007] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2020] [Revised: 12/13/2020] [Accepted: 12/14/2020] [Indexed: 12/16/2022]
Abstract
Proteins are chief actors in life that perform a myriad of exquisite functions. This diversity has been enabled through the evolution and diversification of protein folds. Analysis of sequences and structures strongly suggest that numerous protein pieces have been reused as building blocks and propagated to many modern folds. This information can be traced to understand how the protein world has diversified. In this review, we discuss the latest advances in the analysis of protein evolutionary units, and we use as a model system one of the most abundant and versatile topologies, the TIM-barrel fold, to highlight the existing common principles that interconnect protein evolution, structure, folding, function, and design.
Collapse
Affiliation(s)
| | - Sina Kordes
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany
| | - Florian Michel
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany
| | - Birte Höcker
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany.
| |
Collapse
|
62
|
Kolodny R, Nepomnyachiy S, Tawfik DS, Ben-Tal N. Bridging Themes: Short Protein Segments Found in Different Architectures. Mol Biol Evol 2021; 38:2191-2208. [PMID: 33502503 PMCID: PMC8136508 DOI: 10.1093/molbev/msab017] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
The vast majority of theoretically possible polypeptide chains do not fold, let alone confer function. Hence, protein evolution from preexisting building blocks has clear potential advantages over ab initio emergence from random sequences. In support of this view, sequence similarities between different proteins is generally indicative of common ancestry, and we collectively refer to such homologous sequences as "themes." At the domain level, sequence homology is routinely detected. However, short themes which are segments, or fragments of intact domains, are particularly interesting because they may provide hints about the emergence of domains, as opposed to divergence of preexisting domains, or their mixing-and-matching to form multi-domain proteins. Here we identified 525 representative short themes, comprising 20-80 residues that are unexpectedly shared between domains considered to have emerged independently. Among these "bridging themes" are ones shared between the most ancient domains, for example, Rossmann, P-loop NTPase, TIM-barrel, flavodoxin, and ferredoxin-like. We elaborate on several particularly interesting cases, where the bridging themes mediate ligand binding. Ligand binding may have contributed to the stability and the plasticity of these building blocks, and to their ability to invade preexisting domains or serve as starting points for completely new domains.
Collapse
Affiliation(s)
- Rachel Kolodny
- Department of Computer Science, University of Haifa, Haifa, Israel
| | | | - Dan S Tawfik
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Nir Ben-Tal
- George S. Wise Faculty of Life Sciences, Department of Biochemistry and Molecular Biology, Tel Aviv University, Tel Aviv, Israel
| |
Collapse
|
63
|
Heizinger L, Merkl R. Evidence for the preferential reuse of sub-domain motifs in primordial protein folds. Proteins 2021; 89:1167-1179. [PMID: 33957009 DOI: 10.1002/prot.26089] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2021] [Revised: 04/15/2021] [Accepted: 04/28/2021] [Indexed: 11/06/2022]
Abstract
A comparison of protein backbones makes clear that not more than approximately 1400 different folds exist, each specifying the three-dimensional topology of a protein domain. Large proteins are composed of specific domain combinations and many domains can accommodate different functions. These findings confirm that the reuse of domains is key for the evolution of multi-domain proteins. If reuse was also the driving force for domain evolution, ancestral fragments of sub-domain size exist that are shared between domains possessing significantly different topologies. For the fully automated detection of putatively ancestral motifs, we developed the algorithm Fragstatt that compares proteins pairwise to identify fragments, that is, instantiations of the same motif. To reach maximal sensitivity, Fragstatt compares sequences by means of cascaded alignments of profile Hidden Markov Models. If the fragment sequences are sufficiently similar, the program determines and scores the structural concordance of the fragments. By analyzing a comprehensive set of proteins from the CATH database, Fragstatt identified 12 532 partially overlapping and structurally similar motifs that clustered to 134 unique motifs. The dissemination of these motifs is limited: We found only two domain topologies that contain two different motifs and generally, these motifs occur in not more than 18% of the CATH topologies. Interestingly, motifs are enriched in topologies that are considered ancestral. Thus, our findings suggest that the reuse of sub-domain sized fragments was relevant in early phases of protein evolution and became less important later on.
Collapse
Affiliation(s)
- Leonhard Heizinger
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Regensburg, Germany
| | - Rainer Merkl
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Regensburg, Germany
| |
Collapse
|
64
|
Makarov M, Meng J, Tretyachenko V, Srb P, Březinová A, Giacobelli VG, Bednárová L, Vondrášek J, Dunker AK, Hlouchová K. Enzyme catalysis prior to aromatic residues: Reverse engineering of a dephospho-CoA kinase. Protein Sci 2021; 30:1022-1034. [PMID: 33739538 PMCID: PMC8040869 DOI: 10.1002/pro.4068] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2021] [Revised: 03/12/2021] [Accepted: 03/13/2021] [Indexed: 11/07/2022]
Abstract
The wide variety of protein structures and functions results from the diverse properties of the 20 canonical amino acids. The generally accepted hypothesis is that early protein evolution was associated with enrichment of a primordial alphabet, thereby enabling increased protein catalytic efficiencies and functional diversification. Aromatic amino acids were likely among the last additions to genetic code. The main objective of this study was to test whether enzyme catalysis can occur without the aromatic residues (aromatics) by studying the structure and function of dephospho-CoA kinase (DPCK) following aromatic residue depletion. We designed two variants of a putative DPCK from Aquifex aeolicus by substituting (a) Tyr, Phe and Trp or (b) all aromatics (including His). Their structural characterization indicates that substituting the aromatics does not markedly alter their secondary structures but does significantly loosen their side chain packing and increase their sizes. Both variants still possess ATPase activity, although with 150-300 times lower efficiency in comparison with the wild-type phosphotransferase activity. The transfer of the phosphate group to the dephospho-CoA substrate becomes heavily uncoupled and only the His-containing variant is still able to perform the phosphotransferase reaction. These data support the hypothesis that proteins in the early stages of life could support catalytic activities, albeit with low efficiencies. An observed significant contraction upon ligand binding is likely important for appropriate organization of the active site. Formation of firm hydrophobic cores, which enable the assembly of stably structured active sites, is suggested to provide a selective advantage for adding the aromatic residues.
Collapse
Affiliation(s)
- Mikhail Makarov
- Department of Cell Biology, Faculty of ScienceCharles University, BIOCEVPragueCzech Republic
- Department of Biochemistry, Faculty of ScienceCharles UniversityPragueCzech Republic
| | - Jingwei Meng
- Department of Biochemistry and Molecular Biology, Center for Computational Biology and BioinformaticsIndiana University School of MedicineIndianapolisIndianaUSA
| | - Vyacheslav Tretyachenko
- Department of Cell Biology, Faculty of ScienceCharles University, BIOCEVPragueCzech Republic
- Department of Biochemistry, Faculty of ScienceCharles UniversityPragueCzech Republic
| | - Pavel Srb
- Institute of Organic Chemistry and Biochemistry, IOCB Research Centre & Gilead Sciences, Academy of Sciences of the Czech RepublicPragueCzech Republic
| | - Anna Březinová
- Proteomics Core Facility, BIOCEV, Faculty of Science, Charles UniversityPragueCzech Republic
| | | | - Lucie Bednárová
- Institute of Organic Chemistry and Biochemistry, IOCB Research Centre & Gilead Sciences, Academy of Sciences of the Czech RepublicPragueCzech Republic
| | - Jiří Vondrášek
- Institute of Organic Chemistry and Biochemistry, IOCB Research Centre & Gilead Sciences, Academy of Sciences of the Czech RepublicPragueCzech Republic
| | - A. Keith Dunker
- Department of Biochemistry and Molecular Biology, Center for Computational Biology and BioinformaticsIndiana University School of MedicineIndianapolisIndianaUSA
| | - Klára Hlouchová
- Department of Cell Biology, Faculty of ScienceCharles University, BIOCEVPragueCzech Republic
- Institute of Organic Chemistry and Biochemistry, IOCB Research Centre & Gilead Sciences, Academy of Sciences of the Czech RepublicPragueCzech Republic
| |
Collapse
|
65
|
Konagurthu AS, Subramanian R, Allison L, Abramson D, Stuckey PJ, Garcia de la Banda M, Lesk AM. Universal Architectural Concepts Underlying Protein Folding Patterns. Front Mol Biosci 2021; 7:612920. [PMID: 33996891 PMCID: PMC8120156 DOI: 10.3389/fmolb.2020.612920] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Accepted: 12/16/2020] [Indexed: 11/17/2022] Open
Abstract
What is the architectural “basis set” of the observed universe of protein structures? Using information-theoretic inference, we answer this question with a dictionary of 1,493 substructures—called concepts—typically at a subdomain level, based on an unbiased subset of known protein structures. Each concept represents a topologically conserved assembly of helices and strands that make contact. Any protein structure can be dissected into instances of concepts from this dictionary. We dissected the Protein Data Bank and completely inventoried all the concept instances. This yields many insights, including correlations between concepts and catalytic activities or binding sites, useful for rational drug design; local amino-acid sequence–structure correlations, useful for ab initio structure prediction methods; and information supporting the recognition and exploration of evolutionary relationships, useful for structural studies. An interactive site, Proçodic, at http://lcb.infotech.monash.edu.au/prosodic (click), provides access to and navigation of the entire dictionary of concepts and their usages, and all associated information. This report is part of a continuing programme with the goal of elucidating fundamental principles of protein architecture, in the spirit of the work of Cyrus Chothia.
Collapse
Affiliation(s)
- Arun S Konagurthu
- Department of Data Science and Artificial Intelligence, Faculty of Information Technology, Monash University, Clayton, VIC, Australia
| | - Ramanan Subramanian
- Department of Data Science and Artificial Intelligence, Faculty of Information Technology, Monash University, Clayton, VIC, Australia
| | - Lloyd Allison
- Department of Data Science and Artificial Intelligence, Faculty of Information Technology, Monash University, Clayton, VIC, Australia
| | - David Abramson
- Research Computing Center, University of Queensland, Brisbane, QLD, Australia
| | - Peter J Stuckey
- Department of Data Science and Artificial Intelligence, Faculty of Information Technology, Monash University, Clayton, VIC, Australia.,School of Computing and Information Systems, University of Melbourne, Melbourne, VIC, Australia
| | - Maria Garcia de la Banda
- Department of Data Science and Artificial Intelligence, Faculty of Information Technology, Monash University, Clayton, VIC, Australia
| | - Arthur M Lesk
- Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, PA, United States.,MRC Laboratory of Molecular Biology, Cambridge, United Kingdom
| |
Collapse
|
66
|
Ferruz N, Noske J, Höcker B. Protlego: A Python package for the analysis and design of chimeric proteins. Bioinformatics 2021; 37:3182-3189. [PMID: 33901273 PMCID: PMC8504633 DOI: 10.1093/bioinformatics/btab253] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2020] [Revised: 03/05/2021] [Accepted: 04/19/2021] [Indexed: 01/03/2023] Open
Abstract
Motivation Duplication and recombination of protein fragments have led to the highly diverse protein space that we observe today. By mimicking this natural process, the design of protein chimeras via fragment recombination has proven experimentally successful and has opened a new era for the design of customizable proteins. The in silico building of structural models for these chimeric proteins, however, remains a manual task that requires a considerable degree of expertise and is not amenable for high-throughput studies. Energetic and structural analysis of the designed proteins often require the use of several tools, each with their unique technical difficulties and available in different programming languages or web servers. Results We implemented a Python package that enables automated, high-throughput design of chimeras and their structural analysis. First, it fetches evolutionarily conserved fragments from a built-in database (also available at fuzzle.uni-bayreuth.de). These relationships can then be represented via networks or further selected for chimera construction via recombination. Designed chimeras or natural proteins are then scored and minimized with the Charmm and Amber forcefields and their diverse structural features can be analyzed at ease. Here, we showcase Protlego’s pipeline by exploring the relationships between the P-loop and Rossmann superfolds, building and characterizing their offspring chimeras. We believe that Protlego provides a powerful new tool for the protein design community. Availability and implementation Protlego runs on the Linux platform and is freely available at (https://hoecker-lab.github.io/protlego/) with tutorials and documentation. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Noelia Ferruz
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
| | - Jakob Noske
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
| | - Birte Höcker
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
| |
Collapse
|
67
|
Helicase-like functions in phosphate loop containing beta-alpha polypeptides. Proc Natl Acad Sci U S A 2021; 118:2016131118. [PMID: 33846247 DOI: 10.1073/pnas.2016131118] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The P-loop Walker A motif underlies hundreds of essential enzyme families that bind nucleotide triphosphates (NTPs) and mediate phosphoryl transfer (P-loop NTPases), including the earliest DNA/RNA helicases, translocases, and recombinases. What were the primordial precursors of these enzymes? Could these large and complex proteins emerge from simple polypeptides? Previously, we showed that P-loops embedded in simple βα repeat proteins bind NTPs but also, unexpectedly so, ssDNA and RNA. Here, we extend beyond the purely biophysical function of ligand binding to demonstrate rudimentary helicase-like activities. We further constructed simple 40-residue polypeptides comprising just one β-(P-loop)-α element. Despite their simplicity, these P-loop prototypes confer functions such as strand separation and exchange. Foremost, these polypeptides unwind dsDNA, and upon addition of NTPs, or inorganic polyphosphates, release the bound ssDNA strands to allow reformation of dsDNA. Binding kinetics and low-resolution structural analyses indicate that activity is mediated by oligomeric forms spanning from dimers to high-order assemblies. The latter are reminiscent of extant P-loop recombinases such as RecA. Overall, these P-loop prototypes compose a plausible description of the sequence, structure, and function of the earliest P-loop NTPases. They also indicate that multifunctionality and dynamic assembly were key in endowing short polypeptides with elaborate, evolutionarily relevant functions.
Collapse
|
68
|
Affiliation(s)
- Dragana Despotovic
- Department of Biomolecular Sciences Weizmann Institute of Science Rehovot 7610001 Israel
| | - Dan S. Tawfik
- Department of Biomolecular Sciences Weizmann Institute of Science Rehovot 7610001 Israel
| |
Collapse
|
69
|
Gullotto D. Fine tuned exploration of evolutionary relationships within the protein universe. Stat Appl Genet Mol Biol 2021; 20:17-36. [PMID: 33594839 DOI: 10.1515/sagmb-2019-0039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2019] [Accepted: 01/12/2021] [Indexed: 11/15/2022]
Abstract
In the regime of domain classifications, the protein universe unveils a discrete set of folds connected by hierarchical relationships. Instead, at sub-domain-size resolution and because of physical constraints not necessarily requiring evolution to shape polypeptide chains, networks of protein motifs depict a continuous view that lies beyond the extent of hierarchical classification schemes. A number of studies, however, suggest that universal sub-sequences could be the descendants of peptides emerged in an ancient pre-biotic world. Should this be the case, evolutionary signals retained by structurally conserved motifs, along with hierarchical features of ancient domains, could sew relationships among folds that diverged beyond the point where homology is discernable. In view of the aforementioned, this paper provides a rationale where a network with hierarchical and continuous levels of the protein space, together with sequence profiles that probe the extent of sequence similarity and contacting residues that capture the transition from pre-biotic to domain world, has been used to explore relationships between ancient folds. Statistics of detected signals have been reported. As a result, an example of an emergent sub-network that makes sense from an evolutionary perspective, where conserved signals retrieved from the assessed protein space have been co-opted, has been discussed.
Collapse
Affiliation(s)
- Danilo Gullotto
- Advanced Computational Biostructural Research Collaboratory, I-95019, Zafferana Etnea, Italy
| |
Collapse
|
70
|
Tong CL, Lee KH, Seelig B. De novo proteins from random sequences through in vitro evolution. Curr Opin Struct Biol 2021; 68:129-134. [PMID: 33517151 DOI: 10.1016/j.sbi.2020.12.014] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2020] [Accepted: 12/29/2020] [Indexed: 11/29/2022]
Abstract
Natural proteins are the result of billions of years of evolution. The earliest predecessors of today's proteins are believed to have emerged from random polypeptides. While we have no means to determine how this process exactly happened, there is great interest in understanding how it reasonably could have happened. We are reviewing how researchers have utilized in vitro selection and molecular evolution methods to investigate plausible scenarios for the emergence of early functional proteins. The studies range from analyzing general properties and structural features of unevolved random polypeptides to isolating de novo proteins with specific functions from synthetic randomized sequence libraries or generating novel proteins by combining evolution with rational design. While the results are exciting, more work is needed to fully unravel the mechanisms that seeded protein-dominated biology.
Collapse
Affiliation(s)
- Cher Ling Tong
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, Minneapolis, MN, USA; BioTechnology Institute, University of Minnesota, St. Paul, MN, USA
| | - Kun-Hwa Lee
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, Minneapolis, MN, USA; BioTechnology Institute, University of Minnesota, St. Paul, MN, USA
| | - Burckhard Seelig
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, Minneapolis, MN, USA; BioTechnology Institute, University of Minnesota, St. Paul, MN, USA.
| |
Collapse
|
71
|
Outer membrane protein evolution. Curr Opin Struct Biol 2021; 68:122-128. [PMID: 33493965 DOI: 10.1016/j.sbi.2021.01.002] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2020] [Revised: 12/16/2020] [Accepted: 01/02/2021] [Indexed: 01/31/2023]
Abstract
Outer membrane proteins have remarkably homogeneous structure. They are all up down β-barrels. Up down barrels themselves are composed of repeated sets of β-hairpins. The consistency of the usage of the β-hairpin throughout the outer membrane milieu allows for interrogation of the evolution of these repetitive structures. Here we describe recent investigations of outer membrane protein evolution and how evolutionary precepts have been used for novel outer membrane protein design.
Collapse
|
72
|
Bonfio C. The curious case of peptide-coordinated iron-sulfur clusters: prebiotic and biomimetic insights. Dalton Trans 2021; 50:801-807. [PMID: 33351009 DOI: 10.1039/d0dt03947k] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Iron-sulfur clusters are among the most ancient biological cofactors and are thought to have had an ancient role in mediating the chemical reactions that led to life. Two different, yet complementary approaches, based on bioinorganic chemistry and prebiotic chemistry, have already provided important clues for the formation and activity of biomimetic iron-sulfur analogues in aqueous solution. This frontier article discusses the efforts spent in the last 50 years in the context of peptide-coordinated iron-sulfur clusters, with a particular emphasis on insightful contributions from recent prebiotic chemistry research.
Collapse
Affiliation(s)
- Claudia Bonfio
- Medical Research Council Laboratory of Molecular Biology, Cambridge Biomedical Campus, Francis Crick Avenue, Cambridge CB2 0QH, UK.
| |
Collapse
|
73
|
Searching protein space for ancient sub-domain segments. Curr Opin Struct Biol 2021; 68:105-112. [PMID: 33476896 DOI: 10.1016/j.sbi.2020.11.006] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2020] [Accepted: 11/29/2020] [Indexed: 01/08/2023]
Abstract
Evolutionary processes that formed the current protein universe left their traces, among them homologous segments that recur, or are 'reused,' in multiple proteins. These reused segments, called 'themes,' can be found at various scales, the best known of which is the domain. Yet, recent studies have begun to focus on the evolutionary insights that can be derived from sub-domain-scale themes, which are candidates for traces of more ancient events. Characterizing these may provide clues to the emergence of domains. Particularly interesting are themes that are reused across dissimilar contexts, that is, where the rest of the protein domain differs. We survey computational studies identifying reused themes within different contexts at the sub-domain level.
Collapse
|
74
|
Abstract
To perform an accurate protein synthesis, ribosomes accomplish complex tasks involving the long-range communication between its functional centres such as the peptidyl transfer centre, the tRNA bindings sites and the peptide exit tunnel. How information is transmitted between these sites remains one of the major challenges in current ribosome research. Many experimental studies have revealed that some r-proteins play essential roles in remote communication and the possible involvement of r-protein networks in these processes have been recently proposed. Our phylogenetic, structural and mathematical study reveals that of the three kingdom's r-protein networks converged towards non-random graphs where r-proteins collectively coevolved to optimize interconnection between functional centres. The massive acquisition of conserved aromatic residues at the interfaces and along the extensions of the newly connected eukaryotic r-proteins also highlights that a strong selective pressure acts on their sequences probably for the formation of new allosteric pathways in the network.
Collapse
|
75
|
Pereira J, Lupas AN. The VCBS superfamily forms a third supercluster of β-propellers that includes tachylectin and integrins. Bioinformatics 2021; 36:5618-5622. [PMID: 33416871 PMCID: PMC8023676 DOI: 10.1093/bioinformatics/btaa1085] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2020] [Revised: 12/14/2020] [Accepted: 12/21/2020] [Indexed: 01/15/2023] Open
Abstract
MOTIVATION β-Propellers are found in great variety across all kingdoms of life. They assume many cellular roles, primarily as scaffolds for macromolecular interactions and catalysis. Despite their diversity, most β-propeller families clearly originated by amplification from the same ancient peptide-the "blade". In cluster analyses, β-propellers of the WD40 superfamily always formed the largest group, to which some important families, such as the α-integrin, Asp-box, and glycoside hydrolase β-propellers connected weakly. Motivated by the dramatic growth of sequence databases we revisited these connections, with a special focus on VCBS-like β-propellers, which have not been analysed for their evolutionary relationships so far. RESULTS We found that VCBS-like form a supercluster with integrin-like β-propellers and tachylectins, clearly delimited from the superclusters formed by WD40 and Asp-Box β-propellers. Connections between the three superclusters are made mainly through PQQ-like β-propeller. Our results present a new, greatly expanded view of the β-propeller classification landscape. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Joana Pereira
- Department of Protein Evolution, Max-Planck-Institute for Developmental Biology, Tübingen, 72076, Germany
| | - Andrei N Lupas
- Department of Protein Evolution, Max-Planck-Institute for Developmental Biology, Tübingen, 72076, Germany
| |
Collapse
|
76
|
Modi T, Campitelli P, Kazan IC, Ozkan SB. Protein folding stability and binding interactions through the lens of evolution: a dynamical perspective. Curr Opin Struct Biol 2020; 66:207-215. [PMID: 33388636 DOI: 10.1016/j.sbi.2020.11.007] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Revised: 11/02/2020] [Accepted: 11/26/2020] [Indexed: 01/06/2023]
Abstract
While the function of a protein depends heavily on its ability to fold into a correct 3D structure, billions of years of evolution have tailored proteins from highly stable objects to flexible molecules as they adapted to environmental changes. Nature maintains the fine balance of protein folding and stability while still evolving towards new function through generations of fine-tuning necessary interactions with other proteins and small molecules. Here we focus on recent computational and experimental studies that shed light onto how evolution molds protein folding and the functional landscape from a conformational dynamics' perspective. Particularly, we explore the importance of dynamic allostery throughout protein evolution and discuss how the protein anisotropic network can give rise to allosteric and epistatic interactions.
Collapse
Affiliation(s)
- Tushar Modi
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, AZ 85287-1504, USA
| | - Paul Campitelli
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, AZ 85287-1504, USA
| | - Ismail Can Kazan
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, AZ 85287-1504, USA
| | - Sefika Banu Ozkan
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, AZ 85287-1504, USA.
| |
Collapse
|
77
|
Mylemans B, Voet AR, Tame JR. The Taming of the Screw: the natural and artificial development of β-propeller proteins. Curr Opin Struct Biol 2020; 68:48-54. [PMID: 33373773 DOI: 10.1016/j.sbi.2020.11.009] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Revised: 11/09/2020] [Accepted: 11/27/2020] [Indexed: 12/17/2022]
Abstract
Many proteins are found to possess repeated structural elements, which hint at ancient evolutionary origins and ongoing evolutionary processes. β-propeller proteins are a large family of such proteins, and a popular focus of structural analysis. This review highlights recent work to understand how they arose, and how they have developed into one of the most successful of all protein folds.
Collapse
Affiliation(s)
- Bram Mylemans
- Laboraotry for biomolecular modelling and design, KU Leuven, Celestijnenlaan 200G, 3001 Leuven, Belgium
| | - Arnout Rd Voet
- Protein Design Laboratory, Graduate School of Medical Life Science, Yokohama City University, Suehiro 1-7-29, Tsurumi, Yokohama 230-0045, Japan
| | - Jeremy Rh Tame
- Protein Design Laboratory, Graduate School of Medical Life Science, Yokohama City University, Suehiro 1-7-29, Tsurumi, Yokohama 230-0045, Japan.
| |
Collapse
|
78
|
Longo LM, Jabłońska J, Vyas P, Kanade M, Kolodny R, Ben-Tal N, Tawfik DS. On the emergence of P-Loop NTPase and Rossmann enzymes from a Beta-Alpha-Beta ancestral fragment. eLife 2020; 9:e64415. [PMID: 33295875 PMCID: PMC7758060 DOI: 10.7554/elife.64415] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2020] [Accepted: 12/04/2020] [Indexed: 12/14/2022] Open
Abstract
This article is dedicated to the memory of Michael G. Rossmann. Dating back to the last universal common ancestor, P-loop NTPases and Rossmanns comprise the most ubiquitous and diverse enzyme lineages. Despite similarities in their overall architecture and phosphate binding motif, a lack of sequence identity and some fundamental structural differences currently designates them as independent emergences. We systematically searched for structure and sequence elements shared by both lineages. We detected homologous segments that span the first βαβ motif of both lineages, including the phosphate binding loop and a conserved aspartate at the tip of β2. The latter ligates the catalytic metal in P-loop NTPases, while in Rossmanns it binds the nucleotide's ribose moiety. Tubulin, a Rossmann GTPase, demonstrates the potential of the β2-Asp to take either one of these two roles. While convergence cannot be completely ruled out, we show that both lineages likely emerged from a common βαβ segment that comprises the core of these enzyme families to this very day.
Collapse
Affiliation(s)
- Liam M Longo
- Weizmann Institute of Science, Department of Biomolecular SciencesRehovotIsrael
| | - Jagoda Jabłońska
- Weizmann Institute of Science, Department of Biomolecular SciencesRehovotIsrael
| | - Pratik Vyas
- Weizmann Institute of Science, Department of Biomolecular SciencesRehovotIsrael
| | - Manil Kanade
- Weizmann Institute of Science, Department of Biomolecular SciencesRehovotIsrael
| | - Rachel Kolodny
- University of Haifa, Department of Computer ScienceHaifaIsrael
| | - Nir Ben-Tal
- Tel Aviv University, George S. Wise Faculty of Life Sciences, Department of Biochemistry and Molecular BiologyTel AvivIsrael
| | - Dan S Tawfik
- Weizmann Institute of Science, Department of Biomolecular SciencesRehovotIsrael
| |
Collapse
|
79
|
Despotović D, Longo LM, Aharon E, Kahana A, Scherf T, Gruic-Sovulj I, Tawfik DS. Polyamines Mediate Folding of Primordial Hyperacidic Helical Proteins. Biochemistry 2020; 59:4456-4462. [PMID: 33175508 PMCID: PMC7735664 DOI: 10.1021/acs.biochem.0c00800] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2020] [Revised: 10/29/2020] [Indexed: 12/11/2022]
Abstract
Polyamines are known to mediate diverse biological processes, and specifically to bind and stabilize compact conformations of nucleic acids, acting as chemical chaperones that promote folding by offsetting the repulsive negative charges of the phosphodiester backbone. However, whether and how polyamines modulate the structure and function of proteins remain unclear. In particular, early proteins are thought to have been highly acidic, like nucleic acids, due to a scarcity of basic amino acids in the prebiotic context. Perhaps polyamines, the abiotic synthesis of which is simple, could have served as chemical chaperones for such primordial proteins? We replaced all lysines of an ancestral 60-residue helix-bundle protein with glutamate, resulting in a disordered protein with 21 glutamates in total. Polyamines efficiently induce folding of this hyperacidic protein at submillimolar concentrations, and their potency scaled with the number of amine groups. Compared to cations, polyamines were several orders of magnitude more potent than Na+, while Mg2+ and Ca2+ had an effect similar to that of a diamine, inducing folding at approximately seawater concentrations. We propose that (i) polyamines and dications may have had a role in promoting folding of early proteins devoid of basic residues and (ii) coil-helix transitions could be the basis of polyamine regulation in contemporary proteins.
Collapse
Affiliation(s)
- Dragana Despotović
- Department
of Biomolecular Sciences, Weizmann Institute
of Science, 7610001 Rehovot, Israel
| | - Liam M. Longo
- Department
of Biomolecular Sciences, Weizmann Institute
of Science, 7610001 Rehovot, Israel
- Earth-Life
Science Institute, Tokyo Institute of Technology, 152-8550 Tokyo, Japan
- Blue
Marble Space Institute of Science, Seattle, Washington 98154, United States
| | - Einav Aharon
- Department
of Biomolecular Sciences, Weizmann Institute
of Science, 7610001 Rehovot, Israel
| | - Amit Kahana
- Department
of Biomolecular Sciences, Weizmann Institute
of Science, 7610001 Rehovot, Israel
- Department
of Molecular Genetics, Weizmann Institute
of Science, 7610001 Rehovot, Israel
| | - Tali Scherf
- Department
of Chemical Research Support, Weizmann Institute
of Science, 7610001 Rehovot, Israel
| | - Ita Gruic-Sovulj
- Department
of Chemistry, Faculty of Science, University
of Zagreb, 10000 Zagreb, Croatia
| | - Dan S. Tawfik
- Department
of Biomolecular Sciences, Weizmann Institute
of Science, 7610001 Rehovot, Israel
| |
Collapse
|
80
|
Krishnan A, Burroughs AM, Iyer LM, Aravind L. Comprehensive classification of ABC ATPases and their functional radiation in nucleoprotein dynamics and biological conflict systems. Nucleic Acids Res 2020; 48:10045-10075. [PMID: 32894288 DOI: 10.1093/nar/gkaa726] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2020] [Revised: 08/17/2020] [Accepted: 08/20/2020] [Indexed: 12/13/2022] Open
Abstract
ABC ATPases form one of the largest clades of P-loop NTPase fold enzymes that catalyze ATP-hydrolysis and utilize its free energy for a staggering range of functions from transport to nucleoprotein dynamics. Using sensitive sequence and structure analysis with comparative genomics, for the first time we provide a comprehensive classification of the ABC ATPase superfamily. ABC ATPases developed structural hallmarks that unambiguously distinguish them from other P-loop NTPases such as an alternative to arginine-finger-based catalysis. At least five and up to eight distinct clades of ABC ATPases are reconstructed as being present in the last universal common ancestor. They underwent distinct phases of structural innovation with the emergence of inserts constituting conserved binding interfaces for proteins or nucleic acids and the adoption of a unique dimeric toroidal configuration for DNA-threading. Specifically, several clades have also extensively radiated in counter-invader conflict systems where they serve as nodal nucleotide-dependent sensory and energetic components regulating a diversity of effectors (including some previously unrecognized) acting independently or together with restriction-modification systems. We present a unified mechanism for ABC ATPase function across disparate systems like RNA editing, translation, metabolism, DNA repair, and biological conflicts, and some unexpected recruitments, such as MutS ATPases in secondary metabolism.
Collapse
Affiliation(s)
- Arunkumar Krishnan
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - A Maxwell Burroughs
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Lakshminarayan M Iyer
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - L Aravind
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| |
Collapse
|
81
|
Lipsh-Sokolik R, Listov D, Fleishman SJ. The AbDesign computational pipeline for modular backbone assembly and design of binders and enzymes. Protein Sci 2020; 30:151-159. [PMID: 33040418 PMCID: PMC7737780 DOI: 10.1002/pro.3970] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Revised: 10/07/2020] [Accepted: 10/09/2020] [Indexed: 12/12/2022]
Abstract
The functional sites of many protein families are dominated by diverse backbone regions that lack secondary structure (loops) but fold stably into their functionally competent state. Nevertheless, the design of structured loop regions from scratch, especially in functional sites, has met with great difficulty. We therefore developed an approach, called AbDesign, to exploit the natural modularity of many protein families and computationally assemble a large number of new backbones by combining naturally occurring modular fragments. This strategy yielded large, atomically accurate, and highly efficient proteins, including antibodies and enzymes exhibiting dozens of mutations from any natural protein. The combinatorial backbone‐conformation space that can be accessed by AbDesign even for a modestly sized family of homologs may exceed the diversity in the entire PDB, providing the sub‐Ångstrom level of control over the positioning of active‐site groups that is necessary for obtaining highly active proteins. This manuscript describes how to implement the pipeline using code that is freely available at https://github.com/Fleishman‐Lab/AbDesign_for_enzymes.
Collapse
Affiliation(s)
| | - Dina Listov
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Sarel J Fleishman
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| |
Collapse
|
82
|
Ros E, Torres AG, Ribas de Pouplana L. Learning from Nature to Expand the Genetic Code. Trends Biotechnol 2020; 39:460-473. [PMID: 32896440 DOI: 10.1016/j.tibtech.2020.08.003] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2020] [Revised: 07/29/2020] [Accepted: 08/04/2020] [Indexed: 01/14/2023]
Abstract
The genetic code is the manual that cells use to incorporate amino acids into proteins. It is possible to artificially expand this manual through cellular, molecular, and chemical manipulations to improve protein functionality. Strategies for in vivo genetic code expansion are under the same functional constraints as natural protein synthesis. Here, we review the approaches used to incorporate noncanonical amino acids (ncAAs) into designer proteins through the manipulation of the translation machinery and draw parallels between these methods and natural adaptations that improve translation in extant organisms. Following this logic, we propose new nature-inspired tactics to improve genetic code expansion (GCE) in synthetic organisms.
Collapse
Affiliation(s)
- Enric Ros
- Institute for Research in Biomedicine, The Barcelona Institute of Science and Technology, Barcelona, Catalonia, 08028, Spain
| | - Adrian Gabriel Torres
- Institute for Research in Biomedicine, The Barcelona Institute of Science and Technology, Barcelona, Catalonia, 08028, Spain
| | - Lluís Ribas de Pouplana
- Institute for Research in Biomedicine, The Barcelona Institute of Science and Technology, Barcelona, Catalonia, 08028, Spain; Catalan Institution for Research and Advanced Studies, Barcelona, Catalonia, 08010, Spain.
| |
Collapse
|
83
|
Chu XY, Zhang HY. Cofactors as Molecular Fossils To Trace the Origin and Evolution of Proteins. Chembiochem 2020; 21:3161-3168. [PMID: 32515532 DOI: 10.1002/cbic.202000027] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2020] [Revised: 06/03/2020] [Indexed: 12/16/2022]
Abstract
Due to their early origin and extreme conservation, cofactors are valuable molecular fossils for tracing the origin and evolution of proteins. First, as the order of protein folds binding with cofactors roughly coincides with protein-fold chronology, cofactors are considered to have facilitated the origin of primitive proteins by selecting them from pools of random amino acid sequences. Second, in the subsequent evolution of proteins, cofactors still played an important role. More interestingly, as metallic cofactors evolved with geochemical variations, some geochemical events left imprints in the chronology of protein architecture; this provides further evidence supporting the coevolution of biochemistry and geochemistry. In this paper, we attempt to review the molecular fossils used in tracing the origin and evolution of proteins, with a special focus on cofactors.
Collapse
Affiliation(s)
- Xin-Yi Chu
- Hubei Key Laboratory of Agricultural Bioinformatics College of Informatics, Huazhong Agricultural University, Wuhan, 430070, China
| | - Hong-Yu Zhang
- Hubei Key Laboratory of Agricultural Bioinformatics College of Informatics, Huazhong Agricultural University, Wuhan, 430070, China
| |
Collapse
|
84
|
Longo LM, Despotović D, Weil-Ktorza O, Walker MJ, Jabłońska J, Fridmann-Sirkis Y, Varani G, Metanis N, Tawfik DS. Primordial emergence of a nucleic acid-binding protein via phase separation and statistical ornithine-to-arginine conversion. Proc Natl Acad Sci U S A 2020; 117:15731-15739. [PMID: 32561643 PMCID: PMC7355028 DOI: 10.1073/pnas.2001989117] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
De novo emergence demands a transition from disordered polypeptides into structured proteins with well-defined functions. However, can polypeptides confer functions of evolutionary relevance, and how might such polypeptides evolve into modern proteins? The earliest proteins present an even greater challenge, as they were likely based on abiotic, spontaneously synthesized amino acids. Here we asked whether a primordial function, such as nucleic acid binding, could emerge with ornithine, a basic amino acid that forms abiotically yet is absent in modern-day proteins. We combined ancestral sequence reconstruction and empiric deconstruction to unravel a gradual evolutionary trajectory leading from a polypeptide to a ubiquitous nucleic acid-binding protein. Intermediates along this trajectory comprise sequence-duplicated functional proteins built from 10 amino acid types, with ornithine as the only basic amino acid. Ornithine side chains were further modified into arginine by an abiotic chemical reaction, improving both structure and function. Along this trajectory, function evolved from phase separation with RNA (coacervates) to avid and specific double-stranded DNA binding. Our results suggest that phase-separating polypeptides may have been an evolutionary resource for the emergence of early proteins, and that ornithine, together with its postsynthesis modification to arginine, could have been the earliest basic amino acids.
Collapse
Affiliation(s)
- Liam M Longo
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Dragana Despotović
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Orit Weil-Ktorza
- Institute of Chemistry, Hebrew University of Jerusalem, Jerusalem 9190401, Israel
| | - Matthew J Walker
- Department of Chemistry, University of Washington, Seattle, WA 98195
| | - Jagoda Jabłońska
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Yael Fridmann-Sirkis
- Life Sciences Core Facility, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Gabriele Varani
- Department of Chemistry, University of Washington, Seattle, WA 98195
| | - Norman Metanis
- Institute of Chemistry, Hebrew University of Jerusalem, Jerusalem 9190401, Israel;
| | - Dan S Tawfik
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 7610001, Israel;
| |
Collapse
|
85
|
Carpentier M, Chomilier J. Protein multiple alignments: sequence-based versus structure-based programs. Bioinformatics 2020; 35:3970-3980. [PMID: 30942864 DOI: 10.1093/bioinformatics/btz236] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2018] [Revised: 03/05/2019] [Accepted: 04/02/2019] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Multiple sequence alignment programs have proved to be very useful and have already been evaluated in the literature yet not alignment programs based on structure or both sequence and structure. In the present article we wish to evaluate the added value provided through considering structures. RESULTS We compared the multiple alignments resulting from 25 programs either based on sequence, structure or both, to reference alignments deposited in five databases (BALIBASE 2 and 3, HOMSTRAD, OXBENCH and SISYPHUS). On the whole, the structure-based methods compute more reliable alignments than the sequence-based ones, and even than the sequence+structure-based programs whatever the databases. Two programs lead, MAMMOTH and MATRAS, nevertheless the performances of MUSTANG, MATT, 3DCOMB, TCOFFEE+TM_ALIGN and TCOFFEE+SAP are better for some alignments. The advantage of structure-based methods increases at low levels of sequence identity, or for residues in regular secondary structures or buried ones. Concerning gap management, sequence-based programs set less gaps than structure-based programs. Concerning the databases, the alignments of the manually built databases are more challenging for the programs. AVAILABILITY AND IMPLEMENTATION All data and results presented in this study are available at: http://wwwabi.snv.jussieu.fr/people/mathilde/download/AliMulComp/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Mathilde Carpentier
- Institut Systématique Evolution Biodiversité (ISYEB), Sorbonne Université, MNHN, CNRS, EPHE, Paris, France
| | - Jacques Chomilier
- Sorbonne Université, MNHN, CNRS, IRD, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie (IMPMC), BiBiP, Paris, France
| |
Collapse
|
86
|
Alva V, Lupas AN. Histones predate the split between bacteria and archaea. Bioinformatics 2020; 35:2349-2353. [PMID: 30520969 DOI: 10.1093/bioinformatics/bty1000] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2018] [Revised: 11/27/2018] [Accepted: 12/05/2018] [Indexed: 02/02/2023] Open
Abstract
MOTIVATION Histones form octameric complexes called nucleosomes, which organize the genomic DNA of eukaryotes into chromatin. Each nucleosome comprises two copies each of the histones H2A, H2B, H3 and H4, which share a common ancestry. Although histones were initially thought to be a eukaryotic innovation, the subsequent identification of archaeal homologs led to the notion that histones emerged before the divergence of archaea and eukaryotes. RESULTS Here, we report the detection and classification of two new groups of histone homologs, which are present in both archaea and bacteria. Proteins in one group consist of two histone subunits welded into single-chain pseudodimers, whereas in the other they resemble eukaryotic core histone subunits and show sequence patterns characteristic of DNA binding. The sequences come from a broad spectrum of deeply-branching lineages, excluding their genesis by horizontal gene transfer. Our results extend the origin of histones to the last universal common ancestor. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|
87
|
Gospodinov A, Kunnev D. Universal Codons with Enrichment from GC to AU Nucleotide Composition Reveal a Chronological Assignment from Early to Late Along with LUCA Formation. Life (Basel) 2020; 10:life10060081. [PMID: 32516985 PMCID: PMC7345086 DOI: 10.3390/life10060081] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2020] [Revised: 05/30/2020] [Accepted: 06/03/2020] [Indexed: 12/14/2022] Open
Abstract
The emergence of a primitive genetic code should be considered the most essential event during the origin of life. Almost a complete set of codons (as we know them) should have been established relatively early during the evolution of the last universal common ancestor (LUCA) from which all known organisms descended. Many hypotheses have been proposed to explain the driving forces and chronology of the evolution of the genetic code; however, none is commonly accepted. In the current paper, we explore the features of the genetic code that, in our view, reflect the mechanism and the chronological order of the origin of the genetic code. Our hypothesis postulates that the primordial RNA was mostly GC-rich, and this bias was reflected in the order of amino acid codon assignment. If we arrange the codons and their corresponding amino acids from GC-rich to AU-rich, we find that: 1. The amino acids encoded by GC-rich codons (Ala, Gly, Arg, and Pro) are those that contribute the most to the interactions with RNA (if incorporated into short peptides). 2. This order correlates with the addition of novel functions necessary for the evolution from simple to longer folded peptides. 3. The overlay of aminoacyl-tRNA synthetases (aaRS) to the amino acid order produces a distinctive zonal distribution for class I and class II suggesting an interdependent origin. These correlations could be explained by the active role of the bridge peptide (BP), which we proposed earlier in the evolution of the genetic code.
Collapse
Affiliation(s)
- Anastas Gospodinov
- Roumen Tsanev Institute of Molecular Biology, Bulgarian Academy of Sciences, Acad. G. Bonchev Str. 21, Sofia 1113, Bulgaria;
| | - Dimiter Kunnev
- Department of Molecular & Cellular Biology, Roswell Park Cancer Institute, Buffalo, NY 14263, USA
- Correspondence:
| |
Collapse
|
88
|
Lapenta F, Jerala R. Design of novel protein building modules and modular architectures. Curr Opin Struct Biol 2020; 63:90-96. [PMID: 32505942 DOI: 10.1016/j.sbi.2020.04.006] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2019] [Revised: 04/14/2020] [Accepted: 04/15/2020] [Indexed: 12/31/2022]
Abstract
Nature uses only a limited number of protein topologies and while several folds have evolved independently over time, there are clearly many possible topologies that have not been explored by evolution. With recent advances of protein design concepts, computational modeling tools, high resolution and high-throughput experimental methods it is now possible to design new protein architectures. The collection of building blocks and design principles widened both in size and complexity, offering an expanded toolset for building new modular folds and functional protein structures. Here we review and discuss recent achievements of protein design, focusing in particular on the use and prospects of modular approaches for assembling new protein folds.
Collapse
Affiliation(s)
- Fabio Lapenta
- Department of Synthetic Biology and Immunology, National Institute of Chemistry, Ljubljana, Slovenia
| | - Roman Jerala
- Department of Synthetic Biology and Immunology, National Institute of Chemistry, Ljubljana, Slovenia; EN-FIST Centre of Excellence, Ljubljana, Slovenia.
| |
Collapse
|
89
|
Bowman JC, Petrov AS, Frenkel-Pinter M, Penev PI, Williams LD. Root of the Tree: The Significance, Evolution, and Origins of the Ribosome. Chem Rev 2020; 120:4848-4878. [PMID: 32374986 DOI: 10.1021/acs.chemrev.9b00742] [Citation(s) in RCA: 86] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
The ribosome is an ancient molecular fossil that provides a telescope to the origins of life. Made from RNA and protein, the ribosome translates mRNA to coded protein in all living systems. Universality, economy, centrality and antiquity are ingrained in translation. The translation machinery dominates the set of genes that are shared as orthologues across the tree of life. The lineage of the translation system defines the universal tree of life. The function of a ribosome is to build ribosomes; to accomplish this task, ribosomes make ribosomal proteins, polymerases, enzymes, and signaling proteins. Every coded protein ever produced by life on Earth has passed through the exit tunnel, which is the birth canal of biology. During the root phase of the tree of life, before the last common ancestor of life (LUCA), exit tunnel evolution is dominant and unremitting. Protein folding coevolved with evolution of the exit tunnel. The ribosome shows that protein folding initiated with intrinsic disorder, supported through a short, primitive exit tunnel. Folding progressed to thermodynamically stable β-structures and then to kinetically trapped α-structures. The latter were enabled by a long, mature exit tunnel that partially offset the general thermodynamic tendency of all polypeptides to form β-sheets. RNA chaperoned the evolution of protein folding from the very beginning. The universal common core of the ribosome, with a mass of nearly 2 million Daltons, was finalized by LUCA. The ribosome entered stasis after LUCA and remained in that state for billions of years. Bacterial ribosomes never left stasis. Archaeal ribosomes have remained near stasis, except for the superphylum Asgard, which has accreted rRNA post LUCA. Eukaryotic ribosomes in some lineages appear to be logarithmically accreting rRNA over the last billion years. Ribosomal expansion in Asgard and Eukarya has been incremental and iterative, without substantial remodeling of pre-existing basal structures. The ribosome preserves information on its history.
Collapse
Affiliation(s)
- Jessica C Bowman
- Center for the Origins of Life, School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Anton S Petrov
- Center for the Origins of Life, School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Moran Frenkel-Pinter
- Center for the Origins of Life, School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Petar I Penev
- Center for the Origins of Life, School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Loren Dean Williams
- Center for the Origins of Life, School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| |
Collapse
|
90
|
Ferruz N, Lobos F, Lemm D, Toledo-Patino S, Farías-Rico JA, Schmidt S, Höcker B. Identification and Analysis of Natural Building Blocks for Evolution-Guided Fragment-Based Protein Design. J Mol Biol 2020; 432:3898-3914. [PMID: 32330481 PMCID: PMC7322520 DOI: 10.1016/j.jmb.2020.04.013] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2019] [Revised: 04/12/2020] [Accepted: 04/13/2020] [Indexed: 12/15/2022]
Abstract
Natural evolution has generated an impressively diverse protein universe via duplication and recombination from a set of protein fragments that served as building blocks. The application of these concepts to the design of new proteins using subdomain-sized fragments from different folds has proven to be experimentally successful. To better understand how evolution has shaped our protein universe, we performed an all-against-all comparison of protein domains representing all naturally existing folds and identified conserved homologous protein fragments. Overall, we found more than 1000 protein fragments of various lengths among different folds through similarity network analysis. These fragments are present in very different protein environments and represent versatile building blocks for protein design. These data are available in our web server called F(old P)uzzle (fuzzle.uni-bayreuth.de), which allows to individually filter the dataset and create customized networks for folds of interest. We believe that our results serve as an invaluable resource for structural and evolutionary biologists and as raw material for the design of custom-made proteins.
Collapse
Affiliation(s)
- Noelia Ferruz
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
| | - Francisco Lobos
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany; Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Dominik Lemm
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
| | - Saacnicteh Toledo-Patino
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany; Max Planck Institute for Developmental Biology, Tübingen, Germany
| | | | - Steffen Schmidt
- Max Planck Institute for Developmental Biology, Tübingen, Germany; Computational Biochemistry, University of Bayreuth, Bayreuth, Germany.
| | - Birte Höcker
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany; Max Planck Institute for Developmental Biology, Tübingen, Germany.
| |
Collapse
|
91
|
Pseudo-Symmetric Assembly of Protodomains as a Common Denominator in the Evolution of Polytopic Helical Membrane Proteins. J Mol Evol 2020; 88:319-344. [PMID: 32189026 PMCID: PMC7162841 DOI: 10.1007/s00239-020-09934-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2019] [Accepted: 02/16/2020] [Indexed: 11/05/2022]
Abstract
The polytopic helical membrane proteome is dominated by proteins containing seven transmembrane helices (7TMHs). They cannot be grouped under a monolithic fold or superfold. However, a parallel structural analysis of folds around that magic number of seven in distinct protein superfamilies (SWEET, PnuC, TRIC, FocA, Aquaporin, GPCRs) reveals a common homology, not in their structural fold, but in their systematic pseudo-symmetric construction during their evolution. Our analysis leads to guiding principles of intragenic duplication and pseudo-symmetric assembly of ancestral transmembrane helical protodomains, consisting of 3 (or 4) helices. A parallel deconstruction and reconstruction of these domains provides a structural and mechanistic framework for their evolutionary paths. It highlights the conformational plasticity inherent to fold formation itself, the role of structural as well as functional constraints in shaping that fold, and the usefulness of protodomains as a tool to probe convergent vs divergent evolution. In the case of FocA vs. Aquaporin, this protodomain analysis sheds new light on their potential divergent evolution at the protodomain level followed by duplication and parallel evolution of the two folds. GPCR domains, whose function does not seem to require symmetry, nevertheless exhibit structural pseudo-symmetry. Their construction follows the same protodomain assembly as any other pseudo-symmetric protein suggesting their potential evolutionary origins. Interestingly, all the 6/7/8TMH pseudo-symmetric folds in this study also assemble as oligomeric forms in the membrane, emphasizing the role of symmetry in evolution, revealing self-assembly and co-evolution not only at the protodomain level but also at the domain level.
Collapse
|
92
|
Abstract
Frameshifts in protein coding sequences are widely perceived as resulting in either nonfunctional or even deleterious protein products. Indeed, frameshifts typically lead to markedly altered protein sequences and premature stop codons. By analyzing complete proteomes from all three domains of life, we demonstrate that, in contrast, several key physicochemical properties of protein sequences exhibit significant robustness against +1 and -1 frameshifts. In particular, we show that hydrophobicity profiles of many protein sequences remain largely invariant upon frameshifting. For example, over 2,900 human proteins exhibit a Pearson's correlation coefficient R between the hydrophobicity profiles of the original and the +1-frameshifted variants greater than 0.7, despite an average sequence identity between the two of only 6.5% in this group. We observe a similar effect for protein sequence profiles of affinity for certain nucleobases as well as protein sequence profiles of intrinsic disorder. Finally, analysis of significance and optimality demonstrates that frameshift stability is embedded in the structure of the universal genetic code and may have contributed to shaping it. Our results suggest that frameshifting may be a powerful evolutionary mechanism for creating new proteins with vastly different sequences, yet similar physicochemical properties to the proteins from which they originate.
Collapse
|
93
|
Chojnowski G, Choudhury K, Heuser P, Sobolev E, Pereira J, Oezugurel U, Lamzin VS. The use of local structural similarity of distant homologues for crystallographic model building from a molecular-replacement solution. Acta Crystallogr D Struct Biol 2020; 76:248-260. [PMID: 32133989 PMCID: PMC7057216 DOI: 10.1107/s2059798320000455] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2019] [Accepted: 01/14/2020] [Indexed: 12/18/2022] Open
Abstract
The performance of automated protein model building usually decreases with resolution, mainly owing to the lower information content of the experimental data. This calls for a more elaborate use of the available structural information about macromolecules. Here, a new method is presented that uses structural homologues to improve the quality of protein models automatically constructed using ARP/wARP. The method uses local structural similarity between deposited models and the model being built, and results in longer main-chain fragments that in turn can be more reliably docked to the protein sequence. The application of the homology-based model extension method to the example of a CFA synthase at 2.7 Å resolution resulted in a more complete model with almost all of the residues correctly built and docked to the sequence. The method was also evaluated on 1493 molecular-replacement solutions at a resolution of 4.0 Å and better that were submitted to the ARP/wARP web service for model building. A significant improvement in the completeness and sequence coverage of the built models has been observed.
Collapse
Affiliation(s)
- Grzegorz Chojnowski
- European Molecular Biology Laboratory, c/o DESY, Notkestrasse 85, 22607 Hamburg, Germany
| | - Koushik Choudhury
- European Molecular Biology Laboratory, c/o DESY, Notkestrasse 85, 22607 Hamburg, Germany
| | - Philipp Heuser
- European Molecular Biology Laboratory, c/o DESY, Notkestrasse 85, 22607 Hamburg, Germany
| | - Egor Sobolev
- European Molecular Biology Laboratory, c/o DESY, Notkestrasse 85, 22607 Hamburg, Germany
| | - Joana Pereira
- European Molecular Biology Laboratory, c/o DESY, Notkestrasse 85, 22607 Hamburg, Germany
| | - Umut Oezugurel
- European Molecular Biology Laboratory, c/o DESY, Notkestrasse 85, 22607 Hamburg, Germany
| | - Victor S. Lamzin
- European Molecular Biology Laboratory, c/o DESY, Notkestrasse 85, 22607 Hamburg, Germany
| |
Collapse
|
94
|
Frenkel-Pinter M, Samanta M, Ashkenasy G, Leman LJ. Prebiotic Peptides: Molecular Hubs in the Origin of Life. Chem Rev 2020; 120:4707-4765. [PMID: 32101414 DOI: 10.1021/acs.chemrev.9b00664] [Citation(s) in RCA: 148] [Impact Index Per Article: 37.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
The fundamental roles that peptides and proteins play in today's biology makes it almost indisputable that peptides were key players in the origin of life. Insofar as it is appropriate to extrapolate back from extant biology to the prebiotic world, one must acknowledge the critical importance that interconnected molecular networks, likely with peptides as key components, would have played in life's origin. In this review, we summarize chemical processes involving peptides that could have contributed to early chemical evolution, with an emphasis on molecular interactions between peptides and other classes of organic molecules. We first summarize mechanisms by which amino acids and similar building blocks could have been produced and elaborated into proto-peptides. Next, non-covalent interactions of peptides with other peptides as well as with nucleic acids, lipids, carbohydrates, metal ions, and aromatic molecules are discussed in relation to the possible roles of such interactions in chemical evolution of structure and function. Finally, we describe research involving structural alternatives to peptides and covalent adducts between amino acids/peptides and other classes of molecules. We propose that ample future breakthroughs in origin-of-life chemistry will stem from investigations of interconnected chemical systems in which synergistic interactions between different classes of molecules emerge.
Collapse
Affiliation(s)
- Moran Frenkel-Pinter
- NSF/NASA Center for Chemical Evolution, https://centerforchemicalevolution.com/.,School of Chemistry & Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Mousumi Samanta
- Department of Chemistry, Ben-Gurion University of the Negev, Beer Sheva 84105, Israel
| | - Gonen Ashkenasy
- Department of Chemistry, Ben-Gurion University of the Negev, Beer Sheva 84105, Israel
| | - Luke J Leman
- NSF/NASA Center for Chemical Evolution, https://centerforchemicalevolution.com/.,Department of Chemistry, The Scripps Research Institute, La Jolla, California 92037, United States
| |
Collapse
|
95
|
Short and simple sequences favored the emergence of N-helix phospho-ligand binding sites in the first enzymes. Proc Natl Acad Sci U S A 2020; 117:5310-5318. [PMID: 32079722 DOI: 10.1073/pnas.1911742117] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The ubiquity of phospho-ligands suggests that phosphate binding emerged at the earliest stage of protein evolution. To evaluate this hypothesis and unravel its details, we identified all phosphate-binding protein lineages in the Evolutionary Classification of Protein Domains database. We found at least 250 independent evolutionary lineages that bind small molecule cofactors and metabolites with phosphate moieties. For many lineages, phosphate binding emerged later as a niche functionality, but for the oldest protein lineages, phosphate binding was the founding function. Across some 4 billion y of protein evolution, side-chain binding, in which the phosphate moiety does not interact with the backbone at all, emerged most frequently. However, in the oldest lineages, and most characteristically in αβα sandwich enzyme domains, N-helix binding sites dominate, where the phosphate moiety sits atop the N terminus of an α-helix. This discrepancy is explained by the observation that N-helix binding is uniquely realized by short, contiguous sequences with reduced amino acid diversity, foremost Gly, Ser, and Thr. The latter two amino acids preferentially interact with both the backbone amide and the side-chain hydroxyl (bidentate interaction) to promote binding by short sequences. We conclude that the first αβα sandwich domains emerged from shorter and simpler polypeptides that bound phospho-ligands via N-helix sites.
Collapse
|
96
|
Abstract
How proteins evolved to recognize and bind their ligands is a key mystery in protein function evolution. To explore this mystery, we study how proteins bind adenine, an ancient fragment. We characterize physicochemical patterns of protein–adenine interactions and link these to proteins’ evolutionary origins. In conflict with previous findings, we see that all of adenine’s hydrogen donors and acceptors have been used to bind proteins, and that adenine binding is likely to have emerged multiple times in evolution. To identify adenine-binding sites of shared origin, we use “themes”: short amino acid segments suggested to constitute evolutionary building blocks. We detect specific themes that are engaged in adenine binding; the detection of these in a protein’s sequence might reveal its function. Proteins’ interactions with ancient ligands may reveal how molecular recognition emerged and evolved. We explore how proteins recognize adenine: a planar rigid fragment found in the most common and ancient ligands. We have developed a computational pipeline that extracts protein–adenine complexes from the Protein Data Bank, structurally superimposes their adenine fragments, and detects the hydrogen bonds mediating the interaction. Our analysis extends the known motifs of protein–adenine interactions in the Watson–Crick edge of adenine and shows that all of adenine’s edges may contribute to molecular recognition. We further show that, on the proteins' side, binding is often mediated by specific amino acid segments (“themes”) that recur across different proteins, such that different proteins use the same themes when binding the same adenine-containing ligands. We identify numerous proteins that feature these themes and are thus likely to bind adenine-containing ligands. Our analysis suggests that adenine binding has emerged multiple times in evolution.
Collapse
|
97
|
Pereira J, Lupas AN. The ancestral KH peptide at the root of a domain family with three different folds. Bioinformatics 2019; 34:3961-3965. [PMID: 29912332 DOI: 10.1093/bioinformatics/bty480] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2018] [Accepted: 06/12/2018] [Indexed: 11/13/2022] Open
Abstract
Motivation The direct ancestor of the DNA-protein world of today is considered to have been an RNA-peptide world, in which peptides were co-factors of RNA-mediated catalysis and replication. Evidence for these ancestral peptides, from which folded proteins evolved, can be derived even today from regions of local sequence similarity within globally dissimilar folds. One of these is the 45-residue motif common to both folds of the hnRNP K homology (KH) domain. Results In a survey of KH domains, we found a third fold that contains the KH motif at its core. This corresponds to the Small Domain of bacterial Ribonucleases G/E and, like type I and type II KH domains, it cannot be related to the others by a single genetic event, providing further support for the KH motif as an ancestral peptide predating folded proteins. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Joana Pereira
- Department of Protein Evolution, Max-Planck-Institute for Developmental Biology, Tübingen, Germany
| | - Andrei N Lupas
- Department of Protein Evolution, Max-Planck-Institute for Developmental Biology, Tübingen, Germany
| |
Collapse
|
98
|
Jin AH, Muttenthaler M, Dutertre S, Himaya SWA, Kaas Q, Craik DJ, Lewis RJ, Alewood PF. Conotoxins: Chemistry and Biology. Chem Rev 2019; 119:11510-11549. [PMID: 31633928 DOI: 10.1021/acs.chemrev.9b00207] [Citation(s) in RCA: 161] [Impact Index Per Article: 32.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
The venom of the marine predatory cone snails (genus Conus) has evolved for prey capture and defense, providing the basis for survival and rapid diversification of the now estimated 750+ species. A typical Conus venom contains hundreds to thousands of bioactive peptides known as conotoxins. These mostly disulfide-rich and well-structured peptides act on a wide range of targets such as ion channels, G protein-coupled receptors, transporters, and enzymes. Conotoxins are of interest to neuroscientists as well as drug developers due to their exquisite potency and selectivity, not just against prey but also mammalian targets, thereby providing a rich source of molecular probes and therapeutic leads. The rise of integrated venomics has accelerated conotoxin discovery with now well over 10,000 conotoxin sequences published. However, their structural and pharmacological characterization lags considerably behind. In this review, we highlight the diversity of new conotoxins uncovered since 2014, their three-dimensional structures and folds, novel chemical approaches to their syntheses, and their value as pharmacological tools to unravel complex biology. Additionally, we discuss challenges and future directions for the field.
Collapse
Affiliation(s)
- Ai-Hua Jin
- Institute for Molecular Bioscience , The University of Queensland , Brisbane Queensland 4072 , Australia
| | - Markus Muttenthaler
- Institute for Molecular Bioscience , The University of Queensland , Brisbane Queensland 4072 , Australia.,Institute of Biological Chemistry, Faculty of Chemistry , University of Vienna , 1090 Vienna , Austria
| | - Sebastien Dutertre
- Département des Acides Amines, Peptides et Protéines, Unité Mixte de Recherche 5247, Université Montpellier 2-Centre Nationale de la Recherche Scientifique , Institut des Biomolécules Max Mousseron , Place Eugène Bataillon , 34095 Montpellier Cedex 5 , France
| | - S W A Himaya
- Institute for Molecular Bioscience , The University of Queensland , Brisbane Queensland 4072 , Australia
| | - Quentin Kaas
- Institute for Molecular Bioscience , The University of Queensland , Brisbane Queensland 4072 , Australia
| | - David J Craik
- Institute for Molecular Bioscience , The University of Queensland , Brisbane Queensland 4072 , Australia
| | - Richard J Lewis
- Institute for Molecular Bioscience , The University of Queensland , Brisbane Queensland 4072 , Australia
| | - Paul F Alewood
- Institute for Molecular Bioscience , The University of Queensland , Brisbane Queensland 4072 , Australia
| |
Collapse
|
99
|
Milner-White EJ. Protein three-dimensional structures at the origin of life. Interface Focus 2019; 9:20190057. [PMID: 31641431 DOI: 10.1098/rsfs.2019.0057] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/10/2019] [Indexed: 12/22/2022] Open
Abstract
Proteins are relatively easy to synthesize, compared to nucleic acids and it is likely that there existed a stage prior to the RNA world which can be called the protein world. Some of the three-dimensional (3D) peptide structures in these proteins have, we argue, been conserved since then and may constitute the oldest biological relics in existence. We focus on 3D peptide motifs consisting of up to eight or so amino acid residues. The best known of these is the 'nest', a three- to seven-residue protein motif, which has the function of binding anionic atoms or groups of atoms. Ten per cent of amino acids in typical proteins belong to a nest, so it is a common motif. A five-residue nest is found as part of the well-known P-loop that is a recurring feature of many ATP or GTP-binding proteins and it has the function of binding the phosphate part of these ligands. A synthetic hexapeptide, ser-gly-ala-gly-lys-thr, designed to resemble the P-loop, has been shown to bind inorganic phosphate. Another type of nest binds iron-sulfur centres. A range of other simple motifs occur with various intriguing 3D structures; others bind cations or form channels that transport potassium ions; other peptides form catalytically active haem-like or sheet structures with certain transition metals. Amyloid peptides are also discussed. It now seems that the earliest polypeptides were far from being functionless stretches, and had many of the properties, both binding and catalytic, that might be expected to encourage and stabilize simple life forms in the hydrothermal vents of ocean depths.
Collapse
Affiliation(s)
- E James Milner-White
- Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow G128QQ, UK
| |
Collapse
|
100
|
Afanasieva E, Chaudhuri I, Martin J, Hertle E, Ursinus A, Alva V, Hartmann MD, Lupas AN. Structural diversity of oligomeric β-propellers with different numbers of identical blades. eLife 2019; 8:49853. [PMID: 31613220 PMCID: PMC6805158 DOI: 10.7554/elife.49853] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2019] [Accepted: 09/25/2019] [Indexed: 12/29/2022] Open
Abstract
β-Propellers arise through the amplification of a supersecondary structure element called a blade. This process produces toroids of between four and twelve repeats, which are almost always arranged sequentially in a single polypeptide chain. We found that new propellers evolve continuously by amplification from single blades. We therefore investigated whether such nascent propellers can fold as homo-oligomers before they have been fully amplified within a single chain. One- to six-bladed building blocks derived from two seven-bladed WD40 propellers yielded stable homo-oligomers with six to nine blades, depending on the size of the building block. High-resolution structures for tetramers of two blades, trimers of three blades, and dimers of four and five blades, respectively, show structurally diverse propellers and include a novel fold, highlighting the inherent flexibility of the WD40 blade. Our data support the hypothesis that subdomain-sized fragments can provide structural versatility in the evolution of new proteins.
Collapse
Affiliation(s)
- Evgenia Afanasieva
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Indronil Chaudhuri
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Jörg Martin
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Eva Hertle
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Astrid Ursinus
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Vikram Alva
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Marcus D Hartmann
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Andrei N Lupas
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| |
Collapse
|