1
Toledo-Patiño S, Goetz SK, Shanmugaratnam S, Höcker B, Farías-Rico JA. Molecular handcraft of a well-folded protein chimera. FEBS Lett 2024; 598:1375-1386. [PMID: 38508768 DOI: 10.1002/1873-3468.14856] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/11/2023] [Revised: 02/11/2024] [Accepted: 02/12/2024] [Indexed: 03/22/2024]
Abstract
Modular assembly is a compelling pathway to create new proteins, a concept supported by protein engineering and millennia of evolution. Natural evolution provided a repository of building blocks, known as domains, which trace back to even shorter segments that underwent numerous 'copy-paste' processes culminating in the scaffolds we see today. Utilizing the subdomain-database Fuzzle, we constructed a fold-chimera by integrating a flavodoxin-like fragment into a periplasmic binding protein. This chimera is well-folded and a crystal structure reveals stable interfaces between the fragments. These findings demonstrate the adaptability of α/β-proteins and offer a stepping stone for optimization. By emphasizing the practicality of fragment databases, our work pioneers new pathways in protein engineering. Ultimately, the results substantiate the conjecture that periplasmic binding proteins originated from a flavodoxin-like ancestor.
Affiliation(s)
- Saacnicteh Toledo-Patiño
  - Max Planck Institute for Developmental Biology, Tübingen, Germany
  - Okinawa Institute of Science and Technology Graduate University, Japan
- Sooruban Shanmugaratnam
  - Max Planck Institute for Developmental Biology, Tübingen, Germany
  - Department of Biochemistry, University of Bayreuth, Germany
- Birte Höcker
  - Max Planck Institute for Developmental Biology, Tübingen, Germany
  - Department of Biochemistry, University of Bayreuth, Germany
- José Arcadio Farías-Rico
  - Max Planck Institute for Developmental Biology, Tübingen, Germany
  - Synthetic Biology Program, Center for Genome Sciences, National Autonomous University of Mexico, Cuernavaca, Mexico
2
Gaschignard G, Millet M, Bruley A, Benzerara K, Dezi M, Skouri-Panet F, Duprat E, Callebaut I. AlphaFold2-guided description of CoBaHMA, a novel family of bacterial domains within the heavy-metal-associated superfamily. Proteins 2024; 92:776-794. [PMID: 38258321 DOI: 10.1002/prot.26668] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/28/2023] [Revised: 12/22/2023] [Accepted: 01/01/2024] [Indexed: 01/24/2024]
Abstract
Three-dimensional (3D) structure information, now available at the proteome scale, may facilitate the detection of remote evolutionary relationships in protein superfamilies. Here, we illustrate this with the identification of a novel family of protein domains related to the ferredoxin-like superfold, by combining (i) transitive sequence similarity searches, (ii) clustering approaches, and (iii) the use of AlphaFold2 3D structure models. Domains of this family were initially identified in relation with the intracellular biomineralization of calcium carbonates by Cyanobacteria. They are part of the large heavy-metal-associated (HMA) superfamily, departing from the latter by specific sequence and structural features. In particular, most of them share conserved basic amino acids (hence their name CoBaHMA for Conserved Basic residues HMA), forming a positively charged surface, which is likely to interact with anionic partners. CoBaHMA domains are found in diverse modular organizations in bacteria, existing in the form of monodomain proteins or as part of larger proteins, some of which are membrane proteins involved in transport or lipid metabolism. This suggests that the CoBaHMA domains may exert a regulatory function, involving interactions with anionic lipids. This hypothesis might have a particular resonance in the context of the compartmentalization observed for cyanobacterial intracellular calcium carbonates.
Affiliation(s)
- Geoffroy Gaschignard
  - Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
- Maxime Millet
  - Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
- Apolline Bruley
  - Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
- Karim Benzerara
  - Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
- Manuela Dezi
  - Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
- Feriel Skouri-Panet
  - Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
- Elodie Duprat
  - Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
- Isabelle Callebaut
  - Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
3
Caetano-Anollés K, Aziz MF, Mughal F, Caetano-Anollés G. On Protein Loops, Prior Molecular States and Common Ancestors of Life. J Mol Evol 2024:10.1007/s00239-024-10167-y. [PMID: 38652291 DOI: 10.1007/s00239-024-10167-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/05/2024] [Accepted: 03/22/2024] [Indexed: 04/25/2024]
Abstract
The principle of continuity demands the existence of prior molecular states and common ancestors responsible for extant macromolecular structure. Here, we focus on the emergence and evolution of loop prototypes - the elemental architects of protein domain structure. Phylogenomic reconstruction spanning superkingdoms and viruses generated an evolutionary chronology of prototypes with six distinct evolutionary phases defining a most parsimonious evolutionary progression of cellular life. Each phase was marked by strategic prototype accumulation shaping the structures and functions of common ancestors. The last universal common ancestor (LUCA) of cells and viruses and the last universal cellular ancestor (LUCellA) defined stem lines that were structurally and functionally complex. The evolutionary saga highlighted transformative forces. LUCA lacked biosynthetic ribosomal machinery, while the pivotal LUCellA lacked essential DNA biosynthesis and modern transcription. Early proteins therefore relied on RNA for genetic information storage but appeared initially decoupled from it, hinting at transformative shifts of genetic processing. Urancestral loop types suggest advanced folding designs were present at an early evolutionary stage. An exploration of loop geometric properties revealed gradual replacement of prototypes with α-helix and β-strand bracing structures over time, paving the way for the dominance of other loop types. AlphaFold2-generated atomic models of prototype accretion described patterns of fold emergence. Our findings favor a 'processual' model of evolving stem lines aligned with Woese's vision of a communal world. This model prompts discussing the 'problem of ancestors' and the challenges that lie ahead for research in taxonomy, evolution and complexity.
Affiliation(s)
- Kelsey Caetano-Anollés
  - Evolutionary Bioinformatics Laboratory, Department of Crop Sciences and Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
  - Callout Biotech, Albuquerque, NM, 87112, USA
- M Fayez Aziz
  - Evolutionary Bioinformatics Laboratory, Department of Crop Sciences and Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
- Fizza Mughal
  - Evolutionary Bioinformatics Laboratory, Department of Crop Sciences and Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
- Gustavo Caetano-Anollés
  - Evolutionary Bioinformatics Laboratory, Department of Crop Sciences and Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
4
Zheng Z, Goncearenco A, Berezovsky IN. Back in time to the Gly-rich prototype of the phosphate binding elementary function. Curr Res Struct Biol 2024; 7:100142. [PMID: 38655428 PMCID: PMC11035071 DOI: 10.1016/j.crstbi.2024.100142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/30/2023] [Revised: 03/31/2024] [Accepted: 04/03/2024] [Indexed: 04/26/2024] Open
Abstract
Binding of nucleotides and their derivatives is one of the most ancient elementary functions dating back to the Origin of Life. We review here the works considering one of the key elements in binding of (di)nucleotide-containing ligands - phosphate binding. We start from a brief discussion of major participants, conditions, and events in prebiotic evolution that resulted in the Origin of Life. Tracing back to the basic functions, including metal and phosphate binding, and, potentially, formation of primitive protein-protein interactions, we focus here on the phosphate binding. Critically assessing works on the structural, functional, and evolutionary aspects of phosphate binding, we perform a simple computational experiment reconstructing its most ancient and generic sequence prototype. The profiles of the phosphate binding signatures have been derived in form of position-specific scoring matrices (PSSMs), their peculiarities depending on the type of the ligands have been analyzed, and evolutionary connections between them have been delineated. Then, the apparent prototype that gave rise to all relevant phosphate-binding signatures had also been reconstructed. We show that two major signatures of the phosphate binding that discriminate between the binding of dinucleotide- and nucleotide-containing ligands are GxGxxG and GxxGxG, respectively. It appears that the signature archetypal for dinucleotide-containing ligands is more generic, and it can frequently bind phosphate groups in nucleotide-containing ligands as well. The reconstructed prototype's key signature GxGGxG underlies the role of glycine residues in providing flexibility and interactions necessary for binding the phosphate groups. The prototype also contains other ancient amino acids, valine, and alanine, showing versatility towards evolutionary design and functional diversification.
Affiliation(s)
- Zejun Zheng
  - Bioinformatics Institute, Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore
- Igor N. Berezovsky
  - Bioinformatics Institute, Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore
  - Department of Biological Sciences (DBS), National University of Singapore (NUS), 8 Medical Drive, 117579, Singapore
5
Cuevas-Zuviría B, Garcia AK, Rivier AJ, Rucker HR, Carruthers BM, Kaçar B. Emergence of an Orphan Nitrogenase Protein Following Atmospheric Oxygenation. Mol Biol Evol 2024; 41:msae067. [PMID: 38526235 PMCID: PMC11018506 DOI: 10.1093/molbev/msae067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/08/2023] [Revised: 03/06/2024] [Accepted: 03/19/2024] [Indexed: 03/26/2024] Open
Abstract
Molecular innovations within key metabolisms can have profound impacts on element cycling and ecological distribution. Yet, much of the molecular foundations of early evolved enzymes and metabolisms are unknown. Here, we bring one such mystery to relief by probing the birth and evolution of the G-subunit protein, an integral component of certain members of the nitrogenase family, the only enzymes capable of biological nitrogen fixation. The G-subunit is a Paleoproterozoic-age orphan protein that appears more than 1 billion years after the origin of nitrogenases. We show that the G-subunit arose with novel nitrogenase metal dependence and the ecological expansion of nitrogen-fixing microbes following the transition in environmental metal availabilities and atmospheric oxygenation that began ∼2.5 billion years ago. We identify molecular features that suggest early G-subunit proteins mediated cofactor or protein interactions required for novel metal dependency, priming ancient nitrogenases and their hosts to exploit these newly diversified geochemical environments. We further examined the degree of functional specialization in G-subunit evolution with extant and ancestral homologs using laboratory reconstruction experiments. Our results indicate that permanent recruitment of the orphan protein depended on the prior establishment of conserved molecular features and showcase how contingent evolutionary novelties might shape ecologically important microbial innovations.
Affiliation(s)
- Amanda K Garcia
  - Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, USA
- Alex J Rivier
  - Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, USA
- Holly R Rucker
  - Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, USA
- Brooke M Carruthers
  - Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, USA
- Betül Kaçar
  - Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, USA
6
Mustieles-Del-Ser P, Ruano-Gallego D, Parro V. Immunoanalytical Detection of Conserved Peptides: Refining the Universe of Biomarker Targets in Planetary Exploration. Anal Chem 2024; 96:4764-4773. [PMID: 38484023 DOI: 10.1021/acs.analchem.3c04165] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/27/2024]
Abstract
Ancient peptides are remnants of early biochemistry that continue to play pivotal roles in current proteins. They are simple molecules yet complex enough to exhibit independent functions, being products of an evolved biochemistry at the interface of life and nonlife. Their adsorption to minerals may contribute to their stabilization and preservation over time. To investigate the feasibility of conserved peptide sequences and structures as target biomarkers for the search for life on Mars or other planetary bodies, we conducted a bioinformatics selection of well-conserved ancient peptides and produced polyclonal antibodies for their detection using fluorescence microarray immunoassays. Additionally, we explored how adsorbing peptides to Mars-representative minerals to form organomineral complexes could affect their immunological detection. The results demonstrated that the selected peptides exhibited autonomous folding, with some of them regaining their structure, even after denaturation. Furthermore, their cognate antibodies detected their conformational features regardless of amino acid sequences, thereby broadening the spectrum of target peptide sequences. While certain antibodies displayed unspecific binding to bare minerals, we validated that peptide-mineral complexes can be detected using sandwich immunoassays, as confirmed through desorption and competitive assays. Consequently, we conclude that the diversity of peptide sequences and structures suitable for use as target biomarkers in astrobiology can be constrained to a few well conserved sets, and they can be detected even if they are adsorbed in organomineral complexes.
Affiliation(s)
- Pedro Mustieles-Del-Ser
  - Centro de Astrobiología (CAB) INTA-CSIC, Torrejón de Ardoz 28850, Spain
  - Departments of Physics and Mathematics, and Automatics, Universidad de Alcalá (UAH), Alcalá de Henares 28805, Spain
- Víctor Parro
  - Centro de Astrobiología (CAB) INTA-CSIC, Torrejón de Ardoz 28850, Spain
7
Ye W, Krishna Behra PR, Dyrhage K, Seeger C, Joiner JD, Karlsson E, Andersson E, Chi CN, Andersson SGE, Jemth P. Folded Alpha Helical Putative New Proteins from Apilactobacillus kunkeei. J Mol Biol 2024; 436:168490. [PMID: 38355092 DOI: 10.1016/j.jmb.2024.168490] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/28/2023] [Revised: 02/07/2024] [Accepted: 02/08/2024] [Indexed: 02/16/2024]
Abstract
The emergence of new proteins is a central question in biology. Most tertiary protein folds known to date appear to have an ancient origin, but it is clear from bioinformatic analyses that new proteins continuously emerge in all organismal groups. However, there is a paucity of experimental data on new proteins regarding their structure and biophysical properties. We performed a detailed phylogenetic analysis and identified 48 putative open reading frames in the honeybee-associated bacterium Apilactobacillus kunkeei for which no or few homologs could be identified in closely-related species, suggesting that they could be relatively new on an evolutionary time scale and represent recently evolved proteins. Using circular dichroism-, fluorescence- and nuclear magnetic resonance (NMR) spectroscopy we investigated six of these proteins and show that they are not intrinsically disordered, but populate alpha-helical dominated folded states with relatively low thermodynamic stability (0-3 kcal/mol). The NMR and biophysical data demonstrate that small new proteins readily adopt simple folded conformations suggesting that more complex tertiary structures can be continuously re-invented during evolution by fusion of such simple secondary structure elements. These findings have implications for the general view on protein evolution, where de novo emergence of folded proteins may be a common event.
Affiliation(s)
- Weihua Ye
  - Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, 75123 Uppsala, Sweden
- Phani Rama Krishna Behra
  - Department of Molecular Evolution, Cell and Molecular Biology, Biomedical Centre, Science for Life Laboratory, Uppsala University, 75236 Uppsala, Sweden
- Karl Dyrhage
  - Department of Molecular Evolution, Cell and Molecular Biology, Biomedical Centre, Science for Life Laboratory, Uppsala University, 75236 Uppsala, Sweden
- Christian Seeger
  - Department of Molecular Evolution, Cell and Molecular Biology, Biomedical Centre, Science for Life Laboratory, Uppsala University, 75236 Uppsala, Sweden
- Joe D Joiner
  - Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, 75123 Uppsala, Sweden
- Elin Karlsson
  - Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, 75123 Uppsala, Sweden
- Eva Andersson
  - Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, 75123 Uppsala, Sweden
- Celestine N Chi
  - Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, 75123 Uppsala, Sweden
- Siv G E Andersson
  - Department of Molecular Evolution, Cell and Molecular Biology, Biomedical Centre, Science for Life Laboratory, Uppsala University, 75236 Uppsala, Sweden
- Per Jemth
  - Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, 75123 Uppsala, Sweden
8
McGuinness KN, Fehon N, Feehan R, Miller M, Mutter AC, Rybak LA, Nam J, AbuSalim JE, Atkinson JT, Heidari H, Losada N, Kim JD, Koder RL, Lu Y, Silberg JJ, Slusky JSG, Falkowski PG, Nanda V. The energetics and evolution of oxidoreductases in deep time. Proteins 2024; 92:52-59. [PMID: 37596815 DOI: 10.1002/prot.26563] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/16/2023] [Accepted: 07/06/2023] [Indexed: 08/20/2023]
Abstract
The core metabolic reactions of life drive electrons through a class of redox protein enzymes, the oxidoreductases. The energetics of electron flow is determined by the redox potentials of organic and inorganic cofactors as tuned by the protein environment. Understanding how protein structure affects oxidation-reduction energetics is crucial for studying metabolism, creating bioelectronic systems, and tracing the history of biological energy utilization on Earth. We constructed ProtReDox (https://protein-redox-potential.web.app), a manually curated database of experimentally determined redox potentials. With over 500 measurements, we can begin to identify how proteins modulate oxidation-reduction energetics across the tree of life. By mapping redox potentials onto networks of oxidoreductase fold evolution, we can infer the evolution of electron transfer energetics over deep time. ProtReDox is designed to include user-contributed submissions with the intention of making it a valuable resource for researchers in this field.
Affiliation(s)
- Kenneth N McGuinness
  - Department of Natural Sciences, Caldwell University, Caldwell, New Jersey, USA
  - Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, New Jersey, USA
- Nolan Fehon
  - Environmental Biophysics and Molecular Ecology Program, Department of Marine and Coastal Sciences, Rutgers University, New Brunswick, New Jersey, USA
- Ryan Feehan
  - Computational Biology Program, The University of Kansas, Lawrence, Kansas, USA
- Michelle Miller
  - Environmental Biophysics and Molecular Ecology Program, Department of Marine and Coastal Sciences, Rutgers University, New Brunswick, New Jersey, USA
- Andrew C Mutter
  - Department of Physics, The City College of New York, New York, New York, USA
- Laryssa A Rybak
  - Department of Physics, The City College of New York, New York, New York, USA
- Justin Nam
  - Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, New Jersey, USA
- Jenna E AbuSalim
  - Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, New Jersey, USA
- Joshua T Atkinson
  - Department of Chemical and Biomolecular Engineering, Rice University, Houston, Texas, USA
- Hirbod Heidari
  - Department of Chemistry, University of Texas at Austin, Austin, Texas, USA
- Natalie Losada
  - Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, New Jersey, USA
- J Dongun Kim
  - Environmental Biophysics and Molecular Ecology Program, Department of Marine and Coastal Sciences, Rutgers University, New Brunswick, New Jersey, USA
- Ronald L Koder
  - Department of Physics, The City College of New York, New York, New York, USA
- Yi Lu
  - Department of Chemistry, University of Texas at Austin, Austin, Texas, USA
- Jonathan J Silberg
  - Department of Chemical and Biomolecular Engineering, Rice University, Houston, Texas, USA
- Joanna S G Slusky
  - Computational Biology Program, The University of Kansas, Lawrence, Kansas, USA
  - Department of Molecular Biosciences, The University of Kansas, Lawrence, Kansas, USA
- Paul G Falkowski
  - Environmental Biophysics and Molecular Ecology Program, Department of Marine and Coastal Sciences, Rutgers University, New Brunswick, New Jersey, USA
  - Department of Earth and Planetary Sciences, Rutgers University, New Brunswick, New Jersey, USA
- Vikas Nanda
  - Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, New Jersey, USA
  - Department of Biochemistry and Molecular Biology, Robert Wood Johnson Medical School, Rutgers University, Piscataway, New Jersey, USA
9
Wright Z, Seymour M, Paszczak K, Truttmann T, Senn K, Stilp S, Jansen N, Gosz M, Goeden L, Anantharaman V, Aravind L, Waters LS. The small protein MntS evolved from a signal peptide and acquired a novel function regulating manganese homeostasis in Escherichia coli. Mol Microbiol 2024; 121:152-166. [PMID: 38104967 PMCID: PMC10842292 DOI: 10.1111/mmi.15206] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/15/2023] [Revised: 11/17/2023] [Accepted: 11/24/2023] [Indexed: 12/19/2023]
Abstract
Small proteins (<50 amino acids) are emerging as ubiquitous and important regulators in organisms ranging from bacteria to humans, where they commonly bind to and regulate larger proteins during stress responses. However, fundamental aspects of small proteins, such as their molecular mechanism of action, downregulation after they are no longer needed, and their evolutionary provenance, are poorly understood. Here, we show that the MntS small protein involved in manganese (Mn) homeostasis binds and inhibits the MntP Mn transporter. Mn is crucial for bacterial survival in stressful environments but is toxic in excess. Thus, Mn transport is tightly controlled at multiple levels to maintain optimal Mn levels. The small protein MntS adds a new level of regulation for Mn transporters, beyond the known transcriptional and post-transcriptional control. We also found that MntS binds to itself in the presence of Mn, providing a possible mechanism of downregulating MntS activity to terminate its inhibition of MntP Mn export. MntS is homologous to the signal peptide of SitA, the periplasmic metal-binding subunit of a Mn importer. Remarkably, the homologous signal peptide regions can substitute for MntS, demonstrating a functional relationship between MntS and these signal peptides. Conserved gene neighborhoods support that MntS evolved from the signal peptide of an ancestral SitA protein, acquiring a life of its own with a distinct function in Mn homeostasis.
Affiliation(s)
- Zachary Wright
  - Department of Chemistry, 800 Algoma Blvd, University of Wisconsin, Oshkosh, WI 54901, USA
- Mackenzie Seymour
  - Department of Chemistry, 800 Algoma Blvd, University of Wisconsin, Oshkosh, WI 54901, USA
- Kalista Paszczak
  - Department of Chemistry, 800 Algoma Blvd, University of Wisconsin, Oshkosh, WI 54901, USA
- Taylor Truttmann
  - Department of Chemistry, 800 Algoma Blvd, University of Wisconsin, Oshkosh, WI 54901, USA
- Katherine Senn
  - Department of Chemistry, 800 Algoma Blvd, University of Wisconsin, Oshkosh, WI 54901, USA
- Samuel Stilp
  - Department of Chemistry, 800 Algoma Blvd, University of Wisconsin, Oshkosh, WI 54901, USA
- Nickolas Jansen
  - Department of Chemistry, 800 Algoma Blvd, University of Wisconsin, Oshkosh, WI 54901, USA
- Magdalyn Gosz
  - Department of Chemistry, 800 Algoma Blvd, University of Wisconsin, Oshkosh, WI 54901, USA
- Lindsay Goeden
  - Department of Chemistry, 800 Algoma Blvd, University of Wisconsin, Oshkosh, WI 54901, USA
- Vivek Anantharaman
  - National Center for Biotechnology Information, National Library of Medicine, 8600 Rockville Pike, Bethesda, MD 20894, USA
- L. Aravind
  - National Center for Biotechnology Information, National Library of Medicine, 8600 Rockville Pike, Bethesda, MD 20894, USA
- Lauren S. Waters
  - Department of Chemistry, 800 Algoma Blvd, University of Wisconsin, Oshkosh, WI 54901, USA
10
Subramanian AM, Thomson M. Unexplored regions of the protein sequence-structure map revealed at scale by a library of foldtuned language models. bioRxiv 2023:2023.12.22.573145. [PMID: 38187750 PMCID: PMC10769378 DOI: 10.1101/2023.12.22.573145] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/09/2024]
Abstract
Nature has likely sampled only a fraction of all protein sequences and structures allowed by the laws of biophysics. However, the combinatorial scale of amino-acid sequence-space has traditionally precluded substantive study of the full protein sequence-structure map. In particular, it remains unknown how much of the vast uncharted landscape of far-from-natural sequences consists of alternate ways to encode the familiar ensemble of natural folds; proteins in this category also represent an opportunity to diversify candidates for downstream applications. Here, we characterize sequence-structure mapping in far-from-natural regions of sequence-space guided by the capacity of protein language models (pLMs) to explore sequences outside their natural training data through generation. We demonstrate that pretrained generative pLMs sample a limited structural snapshot of the natural protein universe, including >350 common (sub)domain elements. Incorporating pLM, structure prediction, and structure-based search techniques, we surpass this limitation by developing a novel "foldtuning" strategy that pushes a pretrained pLM into a generative regime that maintains structural similarity to a target protein fold (e.g. TIM barrel, thioredoxin, etc) while maximizing dissimilarity to natural amino-acid sequences. We apply "foldtuning" to build a library of pLMs for >700 naturally-abundant folds in the SCOP database, accessing swaths of proteins that take familiar structures yet lie far from known sequences, spanning targets that include enzymes, immune ligands, and signaling proteins. By revealing protein sequence-structure information at scale outside of the context of evolution, we anticipate that this work will enable future systematic searches for wholly novel folds and facilitate more immediate protein design goals in catalysis and medicine.
Affiliation(s)
- Arjuna M Subramanian
  - Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125
- Matt Thomson
  - Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125
11
Dhar R, Bowman AM, Hatungimana B, Slusky JSG. Evolutionary Engineering a Larger Porin Using a Loop-to-Hairpin Mechanism. J Mol Biol 2023; 435:168292. [PMID: 37769963 PMCID: PMC11215794 DOI: 10.1016/j.jmb.2023.168292] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/14/2023] [Revised: 09/20/2023] [Accepted: 09/21/2023] [Indexed: 10/03/2023]
Abstract
In protein evolution, diversification is generally driven by genetic duplication. The hallmarks of this mechanism are visible in the repeating topology of various proteins. In outer membrane β-barrels, duplication is visible with β-hairpins as the repeating unit of the barrel. In contrast to the overall use of duplication in diversification, a computational study hypothesized evolutionary mechanisms other than hairpin duplications leading to increases in the number of strands in outer membrane β-barrels. Specifically, the topology of some 16- and 18-stranded β-barrels appears to have evolved through a loop to β-hairpin transition. Here we test this novel evolutionary mechanism by creating a chimeric protein from an 18-stranded β-barrel and an evolutionarily related 16-stranded β-barrel. The chimeric combination of the two was created by replacing loop L3 of the 16-stranded barrel with the sequentially matched transmembrane β-hairpin region of the 18-stranded barrel. We find the resulting chimeric protein is stable and has characteristics of increased strand number. This study provides the first experimental evidence supporting the evolution through a loop to β-hairpin transition.
Collapse
Affiliation(s)
- Rik Dhar
- Department of Molecular Biosciences, The University of Kansas, Lawrence, KS 66045, USA. https://twitter.com/Rik_Skywalker
| | - Alexander M Bowman
- Department of Molecular Biosciences, The University of Kansas, Lawrence, KS 66045, USA
| | - Brunojoel Hatungimana
- Department of Molecular Biosciences, The University of Kansas, Lawrence, KS 66045, USA
| | - Joanna Sg Slusky
- Department of Molecular Biosciences, The University of Kansas, Lawrence, KS 66045, USA; Computational Biology Program, The University of Kansas, Lawrence, KS 66047, USA.
| |
|
12
|
Michel F, Romero-Romero S, Höcker B. Retracing the evolution of a modern periplasmic binding protein. Protein Sci 2023; 32:e4793. [PMID: 37788980 PMCID: PMC10601554 DOI: 10.1002/pro.4793] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/10/2023] [Revised: 09/20/2023] [Accepted: 09/22/2023] [Indexed: 10/05/2023]
Abstract
Investigating the evolution of structural features in modern multidomain proteins helps to understand their immense diversity and functional versatility. The class of periplasmic binding proteins (PBPs) offers an opportunity to interrogate one of the main processes driving diversification: the duplication and fusion of protein sequences to generate new architectures. The symmetry of their two-lobed topology, their mechanism of binding, and the organization of their operon structure led to the hypothesis that PBPs arose through a duplication and fusion event of a single common ancestor. To investigate this claim, we set out to reverse the evolutionary process and recreate the structural equivalent of a single-lobed progenitor using ribose-binding protein (RBP) as our model. We found that this modern PBP can be deconstructed into its lobes, producing two proteins that represent possible progenitor halves. The isolated halves of RBP are well folded and monomeric proteins, albeit with a lower thermostability, and do not retain the original binding function. However, the two entities readily form a heterodimer in vitro and in-cell. The x-ray structure of the heterodimer closely resembles the parental protein. Moreover, the binding function is fully regained upon formation of the heterodimer with a ligand affinity similar to that observed in the modern RBP. This highlights how a duplication event could have given rise to a stable and functional PBP-like fold and provides insights into how more complex functional structures can evolve from simpler molecular components.
Affiliation(s)
- Florian Michel
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
| | | | - Birte Höcker
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
| |
|
13
|
Kaminski K, Ludwiczak J, Pawlicki K, Alva V, Dunin-Horkawicz S. pLM-BLAST: distant homology detection based on direct comparison of sequence representations from protein language models. Bioinformatics 2023; 39:btad579. [PMID: 37725369 PMCID: PMC10576641 DOI: 10.1093/bioinformatics/btad579] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Received: 11/27/2022] [Revised: 07/09/2023] [Accepted: 09/15/2023] [Indexed: 09/21/2023] Open
Abstract
MOTIVATION The detection of homology through sequence comparison is a typical first step in the study of protein function and evolution. In this work, we explore the applicability of protein language models to this task. RESULTS We introduce pLM-BLAST, a tool inspired by BLAST, that detects distant homology by comparing single-sequence representations (embeddings) derived from a protein language model, ProtT5. Our benchmarks reveal that pLM-BLAST maintains a level of accuracy on par with HHsearch for both highly similar sequences (with >50% identity) and markedly divergent sequences (with <30% identity), while being significantly faster. Additionally, pLM-BLAST stands out among other embedding-based tools due to its ability to compute local alignments. We show that these local alignments, produced by pLM-BLAST, often connect highly divergent proteins, thereby highlighting its potential to uncover previously undiscovered homologous relationships and improve protein annotation. AVAILABILITY AND IMPLEMENTATION pLM-BLAST is accessible via the MPI Bioinformatics Toolkit as a web server for searching precomputed databases (https://toolkit.tuebingen.mpg.de/tools/plmblast). It is also available as a standalone tool for building custom databases and performing batch searches (https://github.com/labstructbioinf/pLM-BLAST).
Affiliation(s)
- Kamil Kaminski
- Institute of Evolutionary Biology, Faculty of Biology, Biological and Chemical Research Centre, University of Warsaw, Warsaw 02-089, Poland
- Laboratory of Structural Bioinformatics, Centre of New Technologies, University of Warsaw, Warsaw 02-097, Poland
| | - Jan Ludwiczak
- Institute of Evolutionary Biology, Faculty of Biology, Biological and Chemical Research Centre, University of Warsaw, Warsaw 02-089, Poland
| | - Kamil Pawlicki
- Institute of Evolutionary Biology, Faculty of Biology, Biological and Chemical Research Centre, University of Warsaw, Warsaw 02-089, Poland
| | - Vikram Alva
- Department of Protein Evolution, Max Planck Institute for Biology Tübingen, Tübingen 72076, Germany
| | - Stanislaw Dunin-Horkawicz
- Institute of Evolutionary Biology, Faculty of Biology, Biological and Chemical Research Centre, University of Warsaw, Warsaw 02-089, Poland
- Department of Protein Evolution, Max Planck Institute for Biology Tübingen, Tübingen 72076, Germany
| |
|
14
|
Dhar R, Bowman AM, Hatungimana B, Slusky JS. Evolutionary engineering a larger porin using a loop-to-hairpin mechanism. bioRxiv 2023:2023.06.14.544993. [PMID: 37398247 PMCID: PMC10312768 DOI: 10.1101/2023.06.14.544993] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 07/04/2023]
Abstract
In protein evolution, diversification is generally driven by genetic duplication. The hallmarks of this mechanism are visible in the repeating topology of various proteins. In outer membrane β-barrels, duplication is visible with β-hairpins as the repeating unit of the barrel. In contrast to the overall use of duplication in diversification, a computational study hypothesized evolutionary mechanisms other than hairpin duplications leading to increases in the number of strands in outer membrane β-barrels. Specifically, the topology of some 16- and 18-stranded β-barrels appears to have evolved through a loop to β-hairpin transition. Here we test this novel evolutionary mechanism by creating a chimeric protein from an 18-stranded β-barrel and an evolutionarily related 16-stranded β-barrel. The chimeric combination of the two was created by replacing loop L3 of the 16-stranded barrel with the sequentially matched transmembrane β-hairpin region of the 18-stranded barrel. We find the resulting chimeric protein is stable and has characteristics of increased strand number. This study provides the first experimental evidence supporting the evolution through a loop to β-hairpin transition. Highlights: We find evidence supporting a novel diversification mechanism in membrane β-barrels. The mechanism is the conversion of an extracellular loop to a transmembrane β-hairpin. A chimeric protein modeling this mechanism folds stably in the membrane. The chimera has more β-structure and a larger pore, consistent with a loop-to-hairpin transition.
Affiliation(s)
- Rik Dhar
- Department of Molecular Biosciences, The University of Kansas, Lawrence KS 66045
| | - Alexander M Bowman
- Department of Molecular Biosciences, The University of Kansas, Lawrence KS 66045
| | | | - Joanna Sg Slusky
- Department of Molecular Biosciences, The University of Kansas, Lawrence KS 66045
- Computational Biology Program, The University of Kansas, Lawrence KS 66047
| |
|
15
|
Aziz MF, Mughal F, Caetano-Anollés G. Tracing the birth of structural domains from loops during protein evolution. Sci Rep 2023; 13:14688. [PMID: 37673948 PMCID: PMC10482863 DOI: 10.1038/s41598-023-41556-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/25/2022] [Accepted: 08/28/2023] [Indexed: 09/08/2023] Open
Abstract
The structures and functions of proteins are embedded into the loop scaffolds of structural domains. Their origin and evolution remain mysterious. Here, we use a novel graph-theoretical approach to describe how modular and non-modular loop prototypes combine to form folded structures in protein domain evolution. Phylogenomic data-driven chronologies reoriented a bipartite network of loops and domains (and its projections) into 'waterfalls' depicting an evolving 'elementary functionome' (EF). Two primordial waves of functional innovation involving founder 'P-loop' and 'winged-helix' domains were accompanied by an ongoing emergence and reuse of structural and functional novelty. Metabolic pathways expanded before translation functionalities. A dual hourglass recruitment pattern transferred scale-free properties from loop to domain components of the EF network in generative cycles of hierarchical modularity. Modeling the evolutionary emergence of the oldest P-loop and winged-helix domains with AlphaFold2 uncovered rapid convergence towards folded structure, suggesting that a folding vocabulary exists in loops for protein fold repurposing and design.
Affiliation(s)
- M Fayez Aziz
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, IL, 61801, USA
| | - Fizza Mughal
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, IL, 61801, USA
| | - Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, IL, 61801, USA.
- C.R. Woese Institute for Genomic Biology, University of Illinois, Urbana, IL, 61801, USA.
| |
|
16
|
Porter LL. Fluid protein fold space and its implications. Bioessays 2023; 45:e2300057. [PMID: 37431685 PMCID: PMC10529699 DOI: 10.1002/bies.202300057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/30/2023] [Revised: 06/21/2023] [Accepted: 06/23/2023] [Indexed: 07/12/2023]
Abstract
Fold-switching proteins, which remodel their secondary and tertiary structures in response to cellular stimuli, suggest a new view of protein fold space. For decades, experimental evidence has indicated that protein fold space is discrete: dissimilar folds are encoded by dissimilar amino acid sequences. Challenging this assumption, fold-switching proteins interconnect discrete groups of dissimilar protein folds, making protein fold space fluid. Three recent observations support the concept of fluid fold space: (1) some amino acid sequences interconvert between folds with distinct secondary structures, (2) some naturally occurring sequences have switched folds by stepwise mutation, and (3) fold switching is evolutionarily selected and likely confers advantage. These observations indicate that minor amino acid sequence modifications can transform protein structure and function. Consequently, proteomic structural and functional diversity may be expanded by alternative splicing, small nucleotide polymorphisms, post-translational modifications, and modified translation rates.
Affiliation(s)
- Lauren L. Porter
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD
- National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD
| |
|
17
|
Wright Z, Seymour M, Paszczak K, Truttmann T, Senn K, Stilp S, Jansen N, Gosz M, Goeden L, Anantharaman V, Aravind L, Waters LS. The small protein MntS evolved from a signal peptide and acquired a novel function regulating manganese homeostasis in Escherichia coli. bioRxiv 2023:2023.06.02.543501. [PMID: 37398132 PMCID: PMC10312517 DOI: 10.1101/2023.06.02.543501] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 07/04/2023]
Abstract
Small proteins (< 50 amino acids) are emerging as ubiquitous and important regulators in organisms ranging from bacteria to humans, where they commonly bind to and regulate larger proteins during stress responses. However, fundamental aspects of small proteins, such as their molecular mechanism of action, downregulation after they are no longer needed, and their evolutionary provenance are poorly understood. Here we show that the MntS small protein involved in manganese (Mn) homeostasis binds and inhibits the MntP Mn transporter. Mn is crucial for bacterial survival in stressful environments, but is toxic in excess. Thus, Mn transport is tightly controlled at multiple levels to maintain optimal Mn levels. The small protein MntS adds a new level of regulation for Mn transporters, beyond the known transcriptional and post-transcriptional control. We also found that MntS binds to itself in the presence of Mn, providing a possible mechanism of downregulating MntS activity to terminate its inhibition of MntP Mn export. MntS is homologous to the signal peptide of SitA, the periplasmic metal-binding subunit of a Mn importer. Remarkably, the homologous signal peptide regions can substitute for MntS, demonstrating a functional relationship between MntS and these signal peptides. Conserved gene-neighborhoods support that MntS evolved from an ancestral SitA, acquiring a life of its own with a distinct function in Mn homeostasis. Significance This study demonstrates that the MntS small protein binds and inhibits the MntP Mn exporter, adding another layer to the complex regulation of Mn homeostasis. MntS also interacts with itself in cells with Mn, which could prevent it from regulating MntP. We propose that MntS and other small proteins might sense environmental signals and shut off their own regulation via binding to ligands (e.g., metals) or other proteins. We also provide evidence that MntS evolved from the signal peptide region of the Mn importer, SitA. 
Homologous SitA signal peptides can recapitulate MntS activities, showing that they have a second function beyond protein secretion. Overall, we establish that small proteins can emerge and develop novel functionalities from gene remnants.
|
18
|
Chakravarty D, Sreenivasan S, Swint-Kruse L, Porter LL. Identification of a covert evolutionary pathway between two protein folds. Nat Commun 2023; 14:3177. [PMID: 37264049 DOI: 10.1038/s41467-023-38519-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Received: 12/07/2022] [Accepted: 05/03/2023] [Indexed: 06/03/2023] Open
Abstract
Although homologous protein sequences are expected to adopt similar structures, some amino acid substitutions can interconvert α-helices and β-sheets. Such fold switching may have occurred over evolutionary history, but supporting evidence has been limited by the: (1) abundance and diversity of sequenced genes, (2) quantity of experimentally determined protein structures, and (3) assumptions underlying the statistical methods used to infer homology. Here, we overcome these barriers by applying multiple statistical methods to a family of ~600,000 bacterial response regulator proteins. We find that their homologous DNA-binding subunits assume divergent structures: helix-turn-helix versus α-helix + β-sheet (winged helix). Phylogenetic analyses, ancestral sequence reconstruction, and AlphaFold2 models indicate that amino acid substitutions facilitated a switch from helix-turn-helix into winged helix. This structural transformation likely expanded DNA-binding specificity. Our approach uncovers an evolutionary pathway between two protein folds and provides a methodology to identify secondary structure switching in other protein families.
Affiliation(s)
- Devlina Chakravarty
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA
| | - Shwetha Sreenivasan
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, KS, 66160, USA
| | - Liskin Swint-Kruse
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, KS, 66160, USA
| | - Lauren L Porter
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA.
- Biochemistry and Biophysics Center, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, 20892, USA.
| |
|
19
|
Zagrovic B, Adlhart M, Kapral TH. Coding From Binding? Molecular Interactions at the Heart of Translation. Annu Rev Biophys 2023; 52:69-89. [PMID: 36626765 DOI: 10.1146/annurev-biophys-090622-102329] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/11/2023]
Abstract
The mechanism and the evolution of DNA replication and transcription, the key elements of the central dogma of biology, are fundamentally well explained by the physicochemical complementarity between strands of nucleic acids. However, the determinants that have shaped the third part of the dogma, namely the process of biological translation and the universal genetic code, remain unclear. We review and seek parallels between different proposals that view the evolution of translation through the prism of weak, noncovalent interactions between biological macromolecules. In particular, we focus on a recent proposal that there exists a hitherto unrecognized complementarity at the heart of biology, that between messenger RNA coding regions and the proteins that they encode, especially if the two are unstructured. Reflecting the idea that the genetic code evolved from intrinsic binding propensities between nucleotides and amino acids, this proposal promises to forge a link between the distant past and the present of biological systems.
Affiliation(s)
- Bojan Zagrovic
- Department of Structural and Computational Biology, Max Perutz Labs & University of Vienna, Vienna, Austria
| | - Marlene Adlhart
- Department of Structural and Computational Biology, Max Perutz Labs & University of Vienna, Vienna, Austria
| | - Thomas H Kapral
- Department of Structural and Computational Biology, Max Perutz Labs & University of Vienna, Vienna, Austria
- Vienna BioCenter PhD Program, Doctoral School of the University of Vienna and Medical University of Vienna, Vienna, Austria
| |
|
20
|
Moreaud L, Viollet S, Urvoas A, Valerio-Lepiniec M, Mesneau A, Li de la Sierra-Gallay I, Miller J, Ouldali M, Marcelot C, Balor S, Soldan V, Meriadec C, Artzner F, Dujardin E, Minard P. Design, synthesis, and characterization of protein origami based on self-assembly of a brick and staple artificial protein pair. Proc Natl Acad Sci U S A 2023; 120:e2218428120. [PMID: 36893280 PMCID: PMC10089216 DOI: 10.1073/pnas.2218428120] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Received: 10/28/2022] [Accepted: 02/03/2023] [Indexed: 03/11/2023] Open
Abstract
A versatile strategy to create an inducible protein assembly with predefined geometry is demonstrated. The assembly is triggered by a binding protein that staples two identical protein bricks together in a predictable spatial conformation. The brick and staple proteins are designed for mutual directional affinity and engineered by directed evolution from a synthetic modular repeat protein library. As a proof of concept, this article reports on the spontaneous, extremely fast and quantitative self-assembly of two designed alpha-repeat (αRep) brick and staple proteins into macroscopic tubular superhelices at room temperature. Small-angle X-ray scattering (SAXS) and transmission electron microscopy (TEM with staining agent and cryoTEM) elucidate the resulting superhelical arrangement that precisely matches the a priori intended 3D assembly. The highly ordered, macroscopic biomolecular construction sustains temperatures as high as 75 °C thanks to the robust αRep building blocks. Since the α-helices of the brick and staple proteins are highly programmable, their design allows encoding the geometry and chemical surfaces of the final supramolecular protein architecture. This work opens routes toward the design and fabrication of multiscale protein origami with arbitrarily programmed shapes and chemical functions.
Affiliation(s)
- Laureen Moreaud
- Centre d’Elaboration des Matériaux et d’Etudes Structurales, CNRS UPR8011, F-31055 Toulouse, France
| | - Sébastien Viollet
- CEA, CNRS, Institute for Integrative Biology of the Cell, Université Paris-Saclay, 91198 Gif-sur-Yvette, France
| | - Agathe Urvoas
- CEA, CNRS, Institute for Integrative Biology of the Cell, Université Paris-Saclay, 91198 Gif-sur-Yvette, France
| | - Marie Valerio-Lepiniec
- CEA, CNRS, Institute for Integrative Biology of the Cell, Université Paris-Saclay, 91198 Gif-sur-Yvette, France
| | - Agnès Mesneau
- CEA, CNRS, Institute for Integrative Biology of the Cell, Université Paris-Saclay, 91198 Gif-sur-Yvette, France
| | - Inès Li de la Sierra-Gallay
- CEA, CNRS, Institute for Integrative Biology of the Cell, Université Paris-Saclay, 91198 Gif-sur-Yvette, France
| | - Jessalyn Miller
- CEA, CNRS, Institute for Integrative Biology of the Cell, Université Paris-Saclay, 91198 Gif-sur-Yvette, France
- Department of Chemistry, Emory University, Atlanta, GA 30322
| | - Malika Ouldali
- CEA, CNRS, Institute for Integrative Biology of the Cell, Université Paris-Saclay, 91198 Gif-sur-Yvette, France
| | - Cécile Marcelot
- Centre d’Elaboration des Matériaux et d’Etudes Structurales, CNRS UPR8011, F-31055 Toulouse, France
| | - Stéphanie Balor
- Microscopie Electronique Intégrative Toulouse, Centre de Biologie Intégrative, Université de Toulouse, CNRS, 31062, Toulouse, France
| | - Vanessa Soldan
- Microscopie Electronique Intégrative Toulouse, Centre de Biologie Intégrative, Université de Toulouse, CNRS, 31062, Toulouse, France
| | - Cristelle Meriadec
- Institut de Physique de Rennes, CNRS, UMR6251, Université de Rennes 1, F-35042 Rennes, France
| | - Franck Artzner
- Institut de Physique de Rennes, CNRS, UMR6251, Université de Rennes 1, F-35042 Rennes, France
| | - Erik Dujardin
- Centre d’Elaboration des Matériaux et d’Etudes Structurales, CNRS UPR8011, F-31055 Toulouse, France
- Laboratoire Interdisciplinaire Carnot de Bourgogne, CNRS, UMR6303, Université de Bourgogne Franche-Comté, 21000 Dijon, France
| | - Philippe Minard
- CEA, CNRS, Institute for Integrative Biology of the Cell, Université Paris-Saclay, 91198 Gif-sur-Yvette, France
| |
|
21
|
Benton R, Himmel NJ. Structural screens identify candidate human homologs of insect chemoreceptors and cryptic Drosophila gustatory receptor-like proteins. eLife 2023; 12:85537. [PMID: 36803935 PMCID: PMC9998090 DOI: 10.7554/elife.85537] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 12/15/2022] [Accepted: 02/16/2023] [Indexed: 02/22/2023] Open
Abstract
Insect odorant receptors and gustatory receptors define a superfamily of seven transmembrane domain ion channels (referred to here as 7TMICs), with homologs identified across Animalia except Chordata. Previously, we used sequence-based screening methods to reveal conservation of this family in unicellular eukaryotes and plants (DUF3537 proteins) (Benton et al., 2020). Here, we combine three-dimensional structure-based screening, ab initio protein folding predictions, phylogenetics, and expression analyses to characterize additional candidate homologs with tertiary but little or no primary structural similarity to known 7TMICs, including proteins in disease-causing Trypanosoma. Unexpectedly, we identify structural similarity between 7TMICs and PHTF proteins, a deeply conserved family of unknown function, whose human orthologs display enriched expression in testis, cerebellum, and muscle. We also discover divergent groups of 7TMICs in insects, which we term the gustatory receptor-like (Grl) proteins. Several Drosophila melanogaster Grls display selective expression in subsets of taste neurons, suggesting that they are previously unrecognized insect chemoreceptors. Although we cannot exclude the possibility of remarkable structural convergence, our findings support the origin of 7TMICs in a eukaryotic common ancestor, counter previous assumptions of complete loss of 7TMICs in Chordata, and highlight the extreme evolvability of this protein fold, which likely underlies its functional diversification in different cellular contexts.
Affiliation(s)
- Richard Benton
- Center for Integrative Genomics, Faculty of Biology and Medicine, University of Lausanne, Lausanne, Switzerland
| | - Nathaniel J Himmel
- Center for Integrative Genomics, Faculty of Biology and Medicine, University of Lausanne, Lausanne, Switzerland
| |
|
22
|
Sykes J, Holland BR, Charleston MA. A review of visualisations of protein fold networks and their relationship with sequence and function. Biol Rev Camb Philos Soc 2023; 98:243-262. [PMID: 36210328 PMCID: PMC10092621 DOI: 10.1111/brv.12905] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 12/30/2021] [Revised: 09/08/2022] [Accepted: 09/09/2022] [Indexed: 01/12/2023]
Abstract
Proteins form arguably the most significant link between genotype and phenotype. Understanding the relationship between protein sequence and structure, and applying this knowledge to predict function, is difficult. One way to investigate these relationships is by considering the space of protein folds and how one might move from fold to fold through similarity, or potential evolutionary relationships. The many individual characterisations of fold space presented in the literature can tell us a lot about how well the current Protein Data Bank represents protein fold space, how convergence and divergence may affect protein evolution, how proteins affect the whole of which they are part, and how proteins themselves function. A synthesis of these different approaches and viewpoints seems the most likely way to further our knowledge of protein structure evolution and thus, facilitate improved protein structure design and prediction.
Affiliation(s)
- Janan Sykes
- School of Natural Sciences, University of Tasmania, Private Bag 37, Hobart, Tasmania, 7001, Australia
| | - Barbara R Holland
- School of Natural Sciences, University of Tasmania, Private Bag 37, Hobart, Tasmania, 7001, Australia
| | - Michael A Charleston
- School of Natural Sciences, University of Tasmania, Private Bag 37, Hobart, Tasmania, 7001, Australia
| |
|
23
|
Evolutionary Conserved Short Linear Motifs Provide Insights into the Cellular Response to Stress. Antioxidants (Basel) 2022; 12:antiox12010096. [PMID: 36670957 PMCID: PMC9854524 DOI: 10.3390/antiox12010096] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/24/2022] [Revised: 11/22/2022] [Accepted: 12/22/2022] [Indexed: 01/03/2023] Open
Abstract
Short linear motifs (SLiMs) are evolutionarily conserved functional modules of proteins composed of 3 to 10 residues and involved in multiple cellular functions. Here, we performed a search for SLiMs that exhibit sequence similarity to two segments of alpha-fetoprotein (AFP), a major mammalian embryonic and cancer-associated protein. Biological activities of the peptides, LDSYQCT (AFP14-20) and EMTPVNPGV (GIP-9), have been previously confirmed under in vitro and in vivo conditions. In our study, we retrieved a vast array of proteins that contain SLiMs of interest from both prokaryotic and eukaryotic species, including viruses, bacteria, archaea, invertebrates, and vertebrates. Comprehensive Gene Ontology enrichment analysis showed that proteins from multiple functional classes, including enzymes, transcription factors, as well as those involved in signaling, cell cycle, and quality control, and ribosomal proteins were implicated in cellular adaptation to environmental stress conditions. These include response to oxidative and metabolic stress, hypoxia, DNA and RNA damage, protein degradation, as well as antimicrobial, antiviral, and immune response. Thus, our data enabled insights into the common functions of SLiMs evolutionarily conserved across all taxonomic categories. These SLiMs can serve as important players in cellular adaptation to stress, which is crucial for cell functioning.
|
24
|
Abstract
Mechanisms of emergence and divergence of protein folds pose central questions in biological sciences. Incremental mutation and stepwise adaptation explain relationships between topologically similar protein folds. However, the universe of folds is diverse and riotous, suggesting more potent and creative forces are at play. Sequence and structure similarity are observed between distinct folds, indicating that proteins with distinct folds may share common ancestry. We found evidence of common ancestry between three distinct β-barrel folds: Src kinase family homology (SH3), oligonucleotide/oligosaccharide-binding (OB), and cradle loop barrel (CLB). The data suggest a mechanism of fold evolution that interconverts SH3, OB, and CLB. This mechanism, which we call creative destruction, can be generalized to explain many examples of fold evolution including circular permutation. In creative destruction, an open reading frame duplicates or otherwise merges with another to produce a fused polypeptide. A merger forces two ancestral domains into a new sequence and spatial context. The fused polypeptide can explore folding landscapes that are inaccessible to either of the independent ancestral domains. However, the folding landscapes of the fused polypeptide are not fully independent of those of the ancestral domains. Creative destruction is thus partially conservative; a daughter fold inherits some motifs from ancestral folds. After merger and refolding, adaptive processes such as mutation and loss of extraneous segments optimize the new daughter fold. This model has application in disease states characterized by genetic instability. Fused proteins observed in cancer cells are likely to experience remodeled folding landscapes and realize altered folds, conferring new or altered functions.
|
25
|
Yu H, Kalutantirige FC, Yao L, Schroeder CM, Chen Q, Moore JS. Self-Assembly of Repetitive Segment and Random Segment Polymer Architectures. ACS Macro Lett 2022; 11:1366-1372. [PMID: 36413761 DOI: 10.1021/acsmacrolett.2c00495] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/23/2022]
Abstract
Recent advances in chemical synthesis have created new methodologies for synthesizing sequence-controlled synthetic polymers, but rational design of monomer sequence for desired properties remains challenging. In this work, we synthesize periodic polymers with repetitive segments using a sequence-controlled ring-opening metathesis polymerization (ROMP) method, which draws inspiration from proteins containing repetitive sequence motifs. The repetitive segment architecture is shown to dramatically affect the self-assembly behavior of these materials. Our results show that polymers with identical repetitive sequences assemble into uniform spherical nanoparticles after thermal annealing, whereas copolymers with random placement of segments with different sequences exhibit disordered assemblies without a well-defined morphology. Overall, these results bring a new understanding to the role of periodic repetitive sequences in polymer assembly.
Affiliation(s)
- Hao Yu
  - Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
- Falon C Kalutantirige
  - Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
- Lehan Yao
  - Department of Materials Science and Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
- Charles M Schroeder
  - Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
  - Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
  - Department of Materials Science and Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
  - Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
- Qian Chen
  - Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
  - Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
  - Department of Materials Science and Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
  - Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
- Jeffrey S Moore
  - Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
  - Department of Materials Science and Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
  - Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
26
Verma A, Åberg-Zingmark E, Sparrman T, Mushtaq AU, Rogne P, Grundström C, Berntsson R, Sauer UH, Backman L, Nam K, Sauer-Eriksson E, Wolf-Watz M. Insights into the evolution of enzymatic specificity and catalysis: From Asgard archaea to human adenylate kinases. Sci Adv 2022; 8:eabm4089. [PMID: 36332013 PMCID: PMC9635829 DOI: 10.1126/sciadv.abm4089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/10/2022] [Accepted: 09/15/2022] [Indexed: 06/16/2023]
Abstract
Enzymatic catalysis is critically dependent on selectivity, active site architecture, and dynamics. To contribute insights into the interplay of these properties, we established an approach with NMR, crystallography, and MD simulations focused on the ubiquitous phosphotransferase adenylate kinase (AK) isolated from Odinarchaeota (OdinAK). Odinarchaeota belongs to the Asgard archaeal phylum that is believed to be the closest known ancestor to eukaryotes. We show that OdinAK is a hyperthermophilic trimer that, contrary to other AK family members, can use all NTPs for its phosphorylation reaction. Crystallographic structures of OdinAK-NTP complexes revealed a universal NTP-binding motif, while 19F NMR experiments uncovered a conserved and rate-limiting dynamic signature. As a consequence of trimerization, the active site of OdinAK was found to be lacking a critical catalytic residue and is therefore considered to be "atypical." On the basis of discovered relationships with human monomeric homologs, our findings are discussed in terms of evolution of enzymatic substrate specificity and cold adaptation.
Affiliation(s)
- Apoorv Verma
  - Department of Chemistry, Umeå University, 901 87 Umeå, Sweden
- Tobias Sparrman
  - Department of Chemistry, Umeå University, 901 87 Umeå, Sweden
- Per Rogne
  - Department of Chemistry, Umeå University, 901 87 Umeå, Sweden
- Ronnie Berntsson
  - Department of Medical Biochemistry and Biophysics, Umeå University, 901 87 Umeå, Sweden
  - Wallenberg Centre for Molecular Medicine, Umeå University, 901 87 Umeå, Sweden
- Uwe H. Sauer
  - Department of Chemistry, Umeå University, 901 87 Umeå, Sweden
- Lars Backman
  - Department of Chemistry, Umeå University, 901 87 Umeå, Sweden
- Kwangho Nam
  - Department of Chemistry and Biochemistry, University of Texas at Arlington, Arlington, TX 76019, USA
27
Tong CL, Kanwar N, Morrone DJ, Seelig B. Nature-inspired engineering of an artificial ligase enzyme by domain fusion. Nucleic Acids Res 2022; 50:11175-11185. [PMID: 36243966 PMCID: PMC9638898 DOI: 10.1093/nar/gkac858] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 01/14/2022] [Revised: 08/30/2022] [Accepted: 09/26/2022] [Indexed: 11/20/2022]
Abstract
The function of most proteins is accomplished through the interplay of two or more protein domains and fine-tuned by natural evolution. In contrast, artificial enzymes have often been engineered from a single domain scaffold and frequently have lower catalytic activity than natural enzymes. We previously generated an artificial enzyme that accelerated an RNA ligation by >2 million-fold but was likely limited in its activity by low substrate affinity. Inspired by nature's concept of domain fusion, we fused the artificial enzyme to a series of protein domains known to bind nucleic acids with the goal of improving its catalytic activity. The effect of the fused domains on catalytic activity varied greatly, yielding severalfold increases but also reductions caused by domains that previously enhanced nucleic acid binding in other protein engineering projects. The combination of the two better performing binding domains improved the activity of the parental ligase by more than an order of magnitude. These results demonstrate for the first time that nature's successful evolutionary mechanism of domain fusion can also improve an unevolved primordial-like protein whose structure and function had just been created in the test tube. The generation of multi-domain proteins might therefore be an ancient evolutionary process.
Affiliation(s)
- Cher Ling Tong
  - Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA
  - BioTechnology Institute, University of Minnesota, St. Paul, MN 55108, USA
- Nisha Kanwar
  - Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA
  - BioTechnology Institute, University of Minnesota, St. Paul, MN 55108, USA
- Dana J Morrone
  - Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA
  - BioTechnology Institute, University of Minnesota, St. Paul, MN 55108, USA
- Burckhard Seelig
  - Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA
  - BioTechnology Institute, University of Minnesota, St. Paul, MN 55108, USA
28
A Short Tale of the Origin of Proteins and Ribosome Evolution. Microorganisms 2022; 10:2115. [DOI: 10.3390/microorganisms10112115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/26/2022] [Revised: 09/30/2022] [Accepted: 10/19/2022] [Indexed: 11/16/2022]
Abstract
Proteins are the workhorses of the cell and have been key players throughout the evolution of all organisms, from the origin of life to the present era. How might life have originated from the prebiotic chemistry of early Earth? This is one of the most intriguing unsolved questions in biology. Currently, however, it is generally accepted that amino acids, the building blocks of proteins, were abiotically available on primitive Earth, which would have made the formation of early peptides in a similar fashion possible. Peptides are likely to have coevolved with ancestral forms of RNA. The ribosome is the most evident product of this coevolution process, a sophisticated nanomachine that performs the synthesis of proteins codified in genomes. In this general review, we explore the evolution of proteins from their peptide origins to their folding and regulation based on the example of superoxide dismutase (SOD1), a key enzyme in oxygen metabolism on modern Earth.
29
Marquet C, Heinzinger M, Olenyi T, Dallago C, Erckert K, Bernhofer M, Nechaev D, Rost B. Embeddings from protein language models predict conservation and variant effects. Hum Genet 2022; 141:1629-1647. [PMID: 34967936 PMCID: PMC8716573 DOI: 10.1007/s00439-021-02411-y] [Citation(s) in RCA: 37] [Impact Index Per Article: 18.5] [Received: 06/01/2021] [Accepted: 12/06/2021] [Indexed: 12/13/2022]
Abstract
The emergence of SARS-CoV-2 variants stressed the demand for tools that allow interpreting the effect of single amino acid variants (SAVs) on protein function. While Deep Mutational Scanning (DMS) sets continue to expand our understanding of the mutational landscape of single proteins, the results continue to challenge analyses. Protein Language Models (pLMs) use the latest deep learning (DL) algorithms to leverage growing databases of protein sequences. These methods learn to predict missing or masked amino acids from the context of entire sequence regions. Here, we used pLM representations (embeddings) to predict sequence conservation and SAV effects without multiple sequence alignments (MSAs). Embeddings alone predicted residue conservation almost as accurately from single sequences as ConSeq using MSAs (two-state Matthews Correlation Coefficient, MCC, for ProtT5 embeddings of 0.596 ± 0.006 vs. 0.608 ± 0.006 for ConSeq). Inputting the conservation prediction along with BLOSUM62 substitution scores and pLM mask reconstruction probabilities into a simplistic logistic regression (LR) ensemble for Variant Effect Score Prediction without Alignments (VESPA) predicted SAV effect magnitude without any optimization on DMS data. Comparing predictions for a standard set of 39 DMS experiments to other methods (incl. ESM-1v, DeepSequence, and GEMME) revealed our approach as competitive with the state-of-the-art (SOTA) methods using MSA input. No method outperformed all others, neither consistently nor statistically significantly, independently of the performance measure applied (Spearman and Pearson correlation). Finally, we investigated binary effect predictions on DMS experiments for four human proteins. Overall, embedding-based methods have become competitive with methods relying on MSAs for SAV effect prediction at a fraction of the costs in computing/energy. Our method predicted SAV effects for the entire human proteome (~20 k proteins) within 40 min on one Nvidia Quadro RTX 8000. All methods and data sets are freely available for local and online execution through bioembeddings.com, https://github.com/Rostlab/VESPA, and PredictProtein.
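For readers unfamiliar with the two-state Matthews correlation coefficient quoted in this abstract, the following is a minimal sketch of the standard formula computed from confusion-matrix counts. The function name and the example counts are illustrative assumptions; they are not taken from the paper or from the VESPA code base.

```python
import math


def matthews_corrcoef(tp, tn, fp, fn):
    """Two-state MCC from confusion-matrix counts.

    tp/tn/fp/fn are true positives, true negatives, false positives
    and false negatives, e.g. for conserved vs. non-conserved residues.
    """
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denominator == 0:
        # Common convention: report 0 when a marginal total is empty.
        return 0.0
    return (tp * tn - fp * fn) / denominator


# Hypothetical counts, chosen only to show the call.
print(matthews_corrcoef(tp=40, tn=45, fp=5, fn=10))
```

The value ranges from -1 (total disagreement) through 0 (no better than chance) to +1 (perfect prediction), which is why it is often preferred over plain accuracy when the two classes are unbalanced.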
Affiliation(s)
- Céline Marquet
  - Department of Informatics, Bioinformatics and Computational Biology - i12, TUM-Technical University of Munich, Boltzmannstr. 3, Garching, 85748, Munich, Germany
  - TUM Graduate School, Center of Doctoral Studies in Informatics and its Applications (CeDoSIA), Boltzmannstr. 11, 85748, Garching, Germany
- Michael Heinzinger
  - Department of Informatics, Bioinformatics and Computational Biology - i12, TUM-Technical University of Munich, Boltzmannstr. 3, Garching, 85748, Munich, Germany
  - TUM Graduate School, Center of Doctoral Studies in Informatics and its Applications (CeDoSIA), Boltzmannstr. 11, 85748, Garching, Germany
- Tobias Olenyi
  - Department of Informatics, Bioinformatics and Computational Biology - i12, TUM-Technical University of Munich, Boltzmannstr. 3, Garching, 85748, Munich, Germany
  - TUM Graduate School, Center of Doctoral Studies in Informatics and its Applications (CeDoSIA), Boltzmannstr. 11, 85748, Garching, Germany
- Christian Dallago
  - Department of Informatics, Bioinformatics and Computational Biology - i12, TUM-Technical University of Munich, Boltzmannstr. 3, Garching, 85748, Munich, Germany
  - TUM Graduate School, Center of Doctoral Studies in Informatics and its Applications (CeDoSIA), Boltzmannstr. 11, 85748, Garching, Germany
- Kyra Erckert
  - Department of Informatics, Bioinformatics and Computational Biology - i12, TUM-Technical University of Munich, Boltzmannstr. 3, Garching, 85748, Munich, Germany
  - TUM Graduate School, Center of Doctoral Studies in Informatics and its Applications (CeDoSIA), Boltzmannstr. 11, 85748, Garching, Germany
- Michael Bernhofer
  - Department of Informatics, Bioinformatics and Computational Biology - i12, TUM-Technical University of Munich, Boltzmannstr. 3, Garching, 85748, Munich, Germany
  - TUM Graduate School, Center of Doctoral Studies in Informatics and its Applications (CeDoSIA), Boltzmannstr. 11, 85748, Garching, Germany
- Dmitrii Nechaev
  - Department of Informatics, Bioinformatics and Computational Biology - i12, TUM-Technical University of Munich, Boltzmannstr. 3, Garching, 85748, Munich, Germany
  - TUM Graduate School, Center of Doctoral Studies in Informatics and its Applications (CeDoSIA), Boltzmannstr. 11, 85748, Garching, Germany
- Burkhard Rost
  - Department of Informatics, Bioinformatics and Computational Biology - i12, TUM-Technical University of Munich, Boltzmannstr. 3, Garching, 85748, Munich, Germany
  - Institute for Advanced Study (TUM-IAS), Lichtenbergstr. 2a, Garching, 85748, Munich, Germany
  - TUM School of Life Sciences Weihenstephan (TUM-WZW), Alte Akademie 8, Freising, Germany
30
Kozlova MI, Shalaeva DN, Dibrova DV, Mulkidjanian AY. Common Patterns of Hydrolysis Initiation in P-loop Fold Nucleoside Triphosphatases. Biomolecules 2022; 12:1345. [PMID: 36291554 PMCID: PMC9599529 DOI: 10.3390/biom12101345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/09/2022] [Revised: 08/20/2022] [Accepted: 09/14/2022] [Indexed: 11/24/2022]
Abstract
The P-loop fold nucleoside triphosphate (NTP) hydrolases (also known as Walker NTPases) function as ATPases, GTPases, and ATP synthases, are often of medical importance, and represent one of the largest and evolutionarily oldest families of enzymes. There is still no consensus on their catalytic mechanism. To clarify this, we performed the first comparative structural analysis of more than 3100 structures of P-loop NTPases that contain bound substrate Mg-NTPs or their analogues. We proceeded on the assumption that structural features common to these P-loop NTPases may be essential for catalysis. Our results are presented in two articles. Here, in the first, we consider the structural elements that stimulate hydrolysis. Upon interaction of P-loop NTPases with their cognate activating partners (RNA/DNA/protein domains), specific stimulatory moieties, usually Arg or Lys residues, are inserted into the catalytic site and initiate the cleavage of gamma phosphate. By analyzing a plethora of structures, we found that the only shared feature was the mechanistic interaction of stimulators with the oxygen atoms of gamma-phosphate group, capable of causing its rotation. One of the oxygen atoms of gamma phosphate coordinates the cofactor Mg ion. The rotation must pull this oxygen atom away from the Mg ion. This rearrangement should affect the properties of the other Mg ligands and may initiate hydrolysis according to the mechanism elaborated in the second article.
Affiliation(s)
- Maria I. Kozlova
  - School of Physics, Osnabrueck University, D-49069 Osnabrueck, Germany
- Daria N. Shalaeva
  - School of Physics, Osnabrueck University, D-49069 Osnabrueck, Germany
- Daria V. Dibrova
  - School of Physics, Osnabrueck University, D-49069 Osnabrueck, Germany
- Armen Y. Mulkidjanian
  - School of Physics, Osnabrueck University, D-49069 Osnabrueck, Germany
  - Center of Cellular Nanoanalytics, Osnabrueck University, D-49069 Osnabrueck, Germany
  - Correspondence: Tel.: +49-541-969-2698
31
Qiu K, Ben-Tal N, Kolodny R. Similar protein segments shared between domains of different evolutionary lineages. Protein Sci 2022; 31:e4407. [PMID: 36040261 PMCID: PMC9387206 DOI: 10.1002/pro.4407] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/25/2022] [Revised: 07/01/2022] [Accepted: 07/25/2022] [Indexed: 11/21/2022]
Abstract
The emergence of novel proteins, beyond those that can be readily made by duplication and recombination of preexisting domains, is elusive. De novo emergence from random sequences is unlikely because the vast majority of random chains would not even fold, let alone function. An alternative explanation is that novel proteins emerge by duplication and fusion of pre-existing polypeptide segments. In this case, traces of such ancient events may remain within contemporary proteins in the form of reused segments. Together with the late Dan Tawfik, we detected such similar segments, far shorter than intact protein domains, which are found in different environments. The detection of these “bridging themes” was based on a unique search strategy in which, in addition to searching for similarity of shared fragments, so-called “themes,” we also explicitly searched for cases in which the sequence segments before and after the theme are dissimilar (both in sequence and structure). Here, using a similar strategy, we further expanded the search and discovered almost 500 additional “bridging themes,” linking domains that are often from ancient folds. The themes, of 20 residues or more (average 53), do not retain their structure despite sharing 37% sequence identity on average. Indeed, conformational flexibility may confer an evolutionary advantage, in that it allows a segment to fit in multiple environments. We elaborate on two interesting themes, shared between Rossmann/Trefoil-Plexin-like domains and a β-propeller-like domain.
Affiliation(s)
- Kaiyu Qiu
  - Department of Biochemistry and Molecular Biology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
- Nir Ben-Tal
  - Department of Biochemistry and Molecular Biology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
- Rachel Kolodny
  - Department of Computer Science, University of Haifa, Haifa, Israel
32
Seal M, Weil-Ktorza O, Despotović D, Tawfik DS, Levy Y, Metanis N, Longo LM, Goldfarb D. Peptide-RNA Coacervates as a Cradle for the Evolution of Folded Domains. J Am Chem Soc 2022; 144:14150-14160. [PMID: 35904499 PMCID: PMC9376946 DOI: 10.1021/jacs.2c03819] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Indexed: 12/13/2022]
Abstract
Peptide-RNA coacervates can result in the concentration and compartmentalization of simple biopolymers. Given their primordial relevance, peptide-RNA coacervates may have also been a key site of early protein evolution. However, the extent to which such coacervates might promote or suppress the exploration of novel peptide conformations is fundamentally unknown. To this end, we used electron paramagnetic resonance spectroscopy (EPR) to characterize the structure and dynamics of an ancient and ubiquitous nucleic acid binding element, the helix-hairpin-helix (HhH) motif, alone and in the presence of RNA, with which it forms coacervates. Double electron-electron resonance (DEER) spectroscopy applied to singly labeled peptides containing one HhH motif revealed the presence of dimers, even in the absence of RNA. Moreover, dimer formation is promoted upon RNA binding and was detectable within peptide-RNA coacervates. DEER measurements of spin-diluted, doubly labeled peptides in solution indicated transient α-helical character. The distance distributions between spin labels in the dimer and the signatures of α-helical folding are consistent with the symmetric (HhH)2-Fold, which is generated upon duplication and fusion of a single HhH motif and traditionally associated with dsDNA binding. These results support the hypothesis that coacervates are a unique testing ground for peptide oligomerization and that phase-separating peptides could have been a resource for the construction of complex protein structures via common evolutionary processes, such as duplication and fusion.
Affiliation(s)
- Manas Seal
  - Department of Chemical and Biological Physics, Weizmann Institute of Science, Rehovot 7610001, Israel
- Orit Weil-Ktorza
  - Institute of Chemistry, The Hebrew University of Jerusalem, Jerusalem 9190401, Israel
- Dragana Despotović
  - Department of Biomolecular Science, Weizmann Institute of Science, Rehovot 7610001, Israel
- Dan S Tawfik
  - Department of Biomolecular Science, Weizmann Institute of Science, Rehovot 7610001, Israel
- Yaakov Levy
  - Department of Chemical and Structural Biology, Weizmann Institute of Science, Rehovot 7610001, Israel
- Norman Metanis
  - Institute of Chemistry, The Hebrew University of Jerusalem, Jerusalem 9190401, Israel
  - Casali Center for Applied Chemistry, The Hebrew University of Jerusalem, Jerusalem 9190401, Israel
  - The Center for Nanoscience and Nanotechnology, The Hebrew University of Jerusalem, Jerusalem 9190401, Israel
- Liam M Longo
  - Earth-Life Science Institute, Tokyo Institute of Technology, Tokyo 152-8550, Japan
  - Blue Marble Space Institute of Science, Seattle, Washington 98104, United States
- Daniella Goldfarb
  - Department of Chemical and Biological Physics, Weizmann Institute of Science, Rehovot 7610001, Israel
33
Jayaraman V, Toledo-Patiño S, Noda-García L, Laurino P. Mechanisms of protein evolution. Protein Sci 2022; 31:e4362. [PMID: 35762715 PMCID: PMC9214755 DOI: 10.1002/pro.4362] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/28/2022] [Revised: 05/11/2022] [Accepted: 05/14/2022] [Indexed: 11/06/2022]
Abstract
How do proteins evolve? How do changes in sequence mediate changes in protein structure, and in turn in function? This question has multiple angles, ranging from biochemistry and biophysics to evolutionary biology. This review provides a brief integrated view of some key mechanistic aspects of protein evolution. First, we explain how protein evolution is primarily driven by randomly acquired genetic mutations and selection for function, and how these mutations can even give rise to completely new folds. Then, we also comment on how phenotypic protein variability, including promiscuity, transcriptional and translational errors, may also accelerate this process, possibly via "plasticity-first" mechanisms. Finally, we highlight open questions in the field of protein evolution, with respect to the emergence of more sophisticated protein systems such as protein complexes, pathways, and the emergence of pre-LUCA enzymes.
Affiliation(s)
- Vijay Jayaraman
  - Department of Molecular Cell Biology, Weizmann Institute of Science, Rehovot, Israel
- Saacnicteh Toledo-Patiño
  - Protein Engineering and Evolution Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
- Lianet Noda-García
  - Department of Plant Pathology and Microbiology, Institute of Environmental Sciences, Robert H. Smith Faculty of Agriculture, Food and Environment, Hebrew University of Jerusalem, Rehovot, Israel
- Paola Laurino
  - Protein Engineering and Evolution Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
34
Valer L, Rossetto D, Parkkila T, Sebastianelli L, Guella G, Hendricks AL, Cowan JA, Sang L, Mansy SS. Histidine Ligated Iron-Sulfur Peptides. Chembiochem 2022; 23:e202200202. [PMID: 35674331 PMCID: PMC9400863 DOI: 10.1002/cbic.202200202] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/08/2022] [Revised: 06/08/2022] [Indexed: 11/17/2022]
Abstract
Iron‐sulfur clusters are thought to be ancient cofactors that could have played a role in early protometabolic systems. Thus far, redox active, prebiotically plausible iron‐sulfur clusters have always contained cysteine ligands to the cluster. However, extant iron‐sulfur proteins can be found to exploit other modes of binding, including ligation by histidine residues, as seen with [2Fe‐2S] Rieske and MitoNEET proteins. Here, we investigated the ability of cysteine‐ and histidine‐containing peptides to coordinate a mononuclear Fe2+ center and a [2Fe‐2S] cluster and compare their properties with purified iron‐sulfur proteins. The iron‐sulfur peptides were characterized by UV‐vis, circular dichroism, and paramagnetic NMR spectroscopies and cyclic voltammetry. Small (≤6 amino acids) peptides can coordinate [2Fe‐2S] clusters through a combination of cysteine and histidine residues with similar reduction potentials as their corresponding proteins. Such complexes may have been important for early cell‐like systems.
Affiliation(s)
- Luca Valer
  - D-CIBIO, University of Trento, via Sommarive 9, 38123, Trento 28123, Italy
  - Department of Chemistry, University of Alberta, 11227 Saskatchewan Drive, Edmonton, T6G 2G2, Alberta, Canada
- Daniele Rossetto
  - D-CIBIO, University of Trento, via Sommarive 9, 38123, Trento 28123, Italy
  - Department of Chemistry, University of Alberta, 11227 Saskatchewan Drive, Edmonton, T6G 2G2, Alberta, Canada
- Taylor Parkkila
  - Department of Chemistry, University of Alberta, 11227 Saskatchewan Drive, Edmonton, T6G 2G2, Alberta, Canada
- Lorenzo Sebastianelli
  - Department of Chemistry, University of Alberta, 11227 Saskatchewan Drive, Edmonton, T6G 2G2, Alberta, Canada
- Graziano Guella
  - Department of Physics, University of Trento, Via Sommarive 14, Trento, 38123, Italy
- Amber L Hendricks
  - Department of Chemistry and Biochemistry, The Ohio State University, 100 West 18th Ave, Columbus, OH 43210, USA
- James A Cowan
  - Department of Chemistry and Biochemistry, The Ohio State University, 100 West 18th Ave, Columbus, OH 43210, USA
- Lingzi Sang
  - Department of Chemistry, University of Alberta, 11227 Saskatchewan Drive, Edmonton, T6G 2G2, Alberta, Canada
- Sheref S Mansy
  - D-CIBIO, University of Trento, via Sommarive 9, 38123, Trento 28123, Italy
  - Department of Chemistry, University of Alberta, 11227 Saskatchewan Drive, Edmonton, T6G 2G2, Alberta, Canada
35
Merski M, Macedo-Ribeiro S, Wieczorek RM, Górna MW. The Repeating, Modular Architecture of the HtrA Proteases. Biomolecules 2022; 12:793. [PMID: 35740918 PMCID: PMC9221053 DOI: 10.3390/biom12060793] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/24/2022] [Revised: 06/02/2022] [Accepted: 06/04/2022] [Indexed: 02/04/2023]
Abstract
A conserved, 26-residue sequence [AA(X2)[A/G][G/L](X2)GDV[I/L](X2)[V/L]NGE(X1)V(X6)] and corresponding structure repeating module were identified within the HtrA protease family using a non-redundant set (N = 20) of publicly available structures. While the repeats themselves were far from sequence perfect, they had notable conservation to a statistically significant level. Three or more repetitions were identified within each protein despite being statistically expected to randomly occur only once per 1031 residues. This sequence repeat was associated with a six stranded antiparallel β-barrel module, two of which are present in the core of the structures of the PA clan of serine proteases, while a modified version of this module could be identified in the PDZ-like domains. Automated structural alignment methods had difficulties in superimposing these β-barrels, but the use of a target human HtrA2 structure showed that these modules had an average RMSD across the set of structures of less than 2 Å (mean and median). Our findings support Dayhoff’s hypothesis that complex proteins arose through duplication of simpler peptide motifs and domains.
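The consensus notation in this abstract can be read position by position: a plain letter is a fixed residue, `[A/G]` is a choice between residues, and `(Xn)` is a run of n unconstrained positions. The sketch below expands the consensus into 26 positions and scores a sequence window by the fraction of constrained positions it matches. The scoring scheme, function names, and test sequence are illustrative assumptions; the abstract does not describe the authors' actual detection method, and it notes that real repeats are far from sequence perfect, so an exact-match search would miss most of them.

```python
import re

# Consensus string copied from the abstract.
CONSENSUS = "AA(X2)[A/G][G/L](X2)GDV[I/L](X2)[V/L]NGE(X1)V(X6)"


def consensus_to_positions(consensus):
    """Expand the consensus into one entry per position.

    Each entry is a set of allowed residues, or None for an X (any residue).
    """
    positions = []
    token_pattern = r"\(X(\d+)\)|\[([A-Z/]+)\]|([A-Z])"
    for wildcard_count, alternatives, literal in re.findall(token_pattern, consensus):
        if wildcard_count:
            positions.extend([None] * int(wildcard_count))
        elif alternatives:
            positions.append(set(alternatives.split("/")))
        else:
            positions.append({literal})
    return positions


def score_window(window, positions):
    """Fraction of constrained positions that the window satisfies."""
    constrained = [(res, allowed) for res, allowed in zip(window, positions) if allowed]
    matched = sum(1 for res, allowed in constrained if res in allowed)
    return matched / len(constrained)


def find_repeats(sequence, positions, min_score=0.6):
    """Slide over the sequence and keep windows scoring at least min_score."""
    width = len(positions)
    hits = []
    for start in range(len(sequence) - width + 1):
        score = score_window(sequence[start:start + width], positions)
        if score >= min_score:
            hits.append((start, score))
    return hits


positions = consensus_to_positions(CONSENSUS)
# Synthetic sequence (not a real HtrA protein) with one perfect repeat at offset 2.
synthetic = "MK" + "AAQQAGQQGDVIQQVNGEQVQQQQQQ" + "QQ"
print(len(positions), find_repeats(synthetic, positions, min_score=1.0))
```

Lowering `min_score` trades specificity for sensitivity; any threshold used on real sequences would need to be calibrated against the background match rate.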
Affiliation(s)
- Matthew Merski
  - Structural Biology Group, Biological and Chemical Research Centre, Faculty of Chemistry, University of Warsaw, Żwirki i Wigury 101, 02-089 Warsaw, Poland
  - Correspondence: (M.M.); (M.W.G.); Tel.: +48-225-526-642 (M.M.)
- Sandra Macedo-Ribeiro
  - Instituto de Investigação e Inovação em Saúde and Instituto de Biologia Molecular e Celular (IBMC), Universidade do Porto, 4200-135 Porto, Portugal
- Rafal M. Wieczorek
  - Faculty of Chemistry, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland
- Maria W. Górna
  - Structural Biology Group, Biological and Chemical Research Centre, Faculty of Chemistry, University of Warsaw, Żwirki i Wigury 101, 02-089 Warsaw, Poland
  - Correspondence: (M.M.); (M.W.G.); Tel.: +48-225-526-642 (M.M.)
36
León-González JA, Flatet P, Juárez-Ramírez MS, Farías-Rico JA. Folding and Evolution of a Repeat Protein on the Ribosome. Front Mol Biosci 2022; 9:851038. [PMID: 35707224 PMCID: PMC9189291 DOI: 10.3389/fmolb.2022.851038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/08/2022] [Accepted: 04/27/2022] [Indexed: 12/04/2022]
Abstract
Life on Earth is the result of the work of proteins, the cellular nanomachines that fold into elaborate 3D structures to perform their functions. The ribosome synthesizes all the proteins of the biosphere, and many of them begin to fold during translation in a process known as cotranslational folding. In this work we discuss current advances in this field and provide computational and experimental data that highlight the role of the ribosome in the evolution of protein structures. First, we used the sequence of the Ankyrin domain from the Drosophila Notch receptor to launch a deep sequence-based search. With this strategy, we found a conserved 33-residue motif shared by different protein folds. Then, to see how the vectorial addition of the motif would generate a full structure, we measured the folding of the Ankyrin repeat protein on the ribosome. Not only are the on-ribosome folding data in full agreement with classical in vitro biophysical measurements, but they also provide experimental evidence on how folded proteins could have evolved by duplication and fusion of smaller fragments in the RNA world. Overall, we discuss how the ribosomal exit tunnel could be conceptualized as an active site that is under evolutionary pressure to influence protein folding.
Affiliation(s)
- José Alberto León-González
- Synthetic Biology Program, Center for Genome Sciences, National Autonomous University of Mexico, Cuernavaca, Mexico
| | - Perline Flatet
- Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden
| | - María Soledad Juárez-Ramírez
- Synthetic Biology Program, Center for Genome Sciences, National Autonomous University of Mexico, Cuernavaca, Mexico
| | - José Arcadio Farías-Rico
- Synthetic Biology Program, Center for Genome Sciences, National Autonomous University of Mexico, Cuernavaca, Mexico
- *Correspondence: José Arcadio Farías-Rico
37
The Legend of ATP: From Origin of Life to Precision Medicine. Metabolites 2022; 12:metabo12050461. [PMID: 35629965 PMCID: PMC9148104 DOI: 10.3390/metabo12050461] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 05/10/2022] [Revised: 05/19/2022] [Accepted: 05/19/2022] [Indexed: 02/05/2023]
Abstract
Adenosine triphosphate (ATP) may be the most important biological small molecule. Since it was discovered in 1929, ATP has been regarded as life’s energy reservoir. However, this compound means more to life. Its legend starts at the dawn of life and lasts to this day. ATP must be the basic component of ancient ribozymes and may facilitate the origin of structured proteins. In the existing organisms, ATP continues to construct ribonucleic acid (RNA) and work as a protein cofactor. ATP also functions as a biological hydrotrope, which may keep macromolecules soluble in the primitive environment and can regulate phase separation in modern cells. These functions are involved in the pathogenesis of aging-related diseases and breast cancer, providing clues to discovering anti-aging agents and precision medicine tactics for breast cancer.
38
Tee WV, Wah Tan Z, Guarnera E, Berezovsky IN. Conservation and diversity in allosteric fingerprints of proteins for evolutionary-inspired engineering and design. J Mol Biol 2022; 434:167577. [PMID: 35395233 DOI: 10.1016/j.jmb.2022.167577] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Received: 02/05/2022] [Revised: 03/30/2022] [Accepted: 03/30/2022] [Indexed: 11/26/2022]
Abstract
The hand-in-hand work of physics and evolution delivered a protein universe with a diversity of forms, sizes, and functions. The pervasiveness and advantageous traits of allostery made it an important component of protein function regulation, calling for thorough investigation of its structural determinants and evolution. Learning directly from nature, we explored here allosteric communication in several major folds and repeat proteins, including α/β and β-barrels, β-propellers, Ig-like fold, ankyrin and α/β leucine-rich repeat proteins, which provide structural platforms for many different enzymatic and signalling functions. We obtained a picture of conserved allosteric communication characteristic in different fold types, modifications of the structure-driven signalling patterns via sequence-determined divergence to specific functions, as well as the emergence and potential diversification of allosteric regulation in multi-domain proteins and oligomeric assemblies. Our observations will be instrumental in facilitating the engineering and de novo design of proteins with allosterically regulated functions, including the development of therapeutic biologics. In particular, the results described here may guide the identification of the optimal structural platforms (e.g. fold type, size, and oligomerization states) and the types of diversifications/perturbations, such as mutations, effector binding, and order-disorder transition. The tunable allosteric linkage across distant regions can be used as a pivotal component in the design/engineering of modular biological systems beyond the traditional scaffolding function.
Affiliation(s)
- Wei-Ven Tee
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, Singapore 138671
| | - Zhen Wah Tan
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, Singapore 138671
| | - Enrico Guarnera
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, Singapore 138671
| | - Igor N Berezovsky
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, Singapore 138671; Department of Biological Sciences (DBS), National University of Singapore (NUS), 8 Medical Drive, Singapore 117597.
39
Fried SD, Fujishima K, Makarov M, Cherepashuk I, Hlouchova K. Peptides before and during the nucleotide world: an origins story emphasizing cooperation between proteins and nucleic acids. J R Soc Interface 2022; 19:20210641. [PMID: 35135297 PMCID: PMC8833103 DOI: 10.1098/rsif.2021.0641] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Indexed: 12/14/2022]
Abstract
Recent developments in Origins of Life research have focused on substantiating the narrative of an abiotic emergence of nucleic acids from organic molecules of low molecular weight, a paradigm that typically sidelines the roles of peptides. Nevertheless, the simple synthesis of amino acids, the facile nature of their activation and condensation, their ability to recognize metals and cofactors and their remarkable capacity to self-assemble make peptides (and their analogues) favourable candidates for one of the earliest functional polymers. In this mini-review, we explore the ramifications of this hypothesis. Diverse lines of research in molecular biology, bioinformatics, geochemistry, biophysics and astrobiology provide clues about the progression and early evolution of proteins, and lend credence to the idea that early peptides served many central prebiotic roles before they were encodable by a polynucleotide template, in a putative 'peptide-polynucleotide stage'. For example, early peptides and mini-proteins could have served as catalysts, compartments and structural hubs. In sum, we shed light on the role of early peptides and small proteins before and during the nucleotide world, in which nascent life fully grasped the potential of primordial proteins, and which has left an imprint on the idiosyncratic properties of extant proteins.
Affiliation(s)
- Stephen D Fried
- Department of Chemistry, Johns Hopkins University, Baltimore, MD 21212, USA; Department of Biophysics, Johns Hopkins University, Baltimore, MD 21212, USA
| | - Kosuke Fujishima
- Earth-Life Science Institute, Tokyo Institute of Technology, Tokyo 1528550, Japan; Graduate School of Media and Governance, Keio University, Fujisawa 2520882, Japan
| | - Mikhail Makarov
- Department of Cell Biology, Faculty of Science, Charles University, BIOCEV, Prague 12800, Czech Republic
| | - Ivan Cherepashuk
- Department of Cell Biology, Faculty of Science, Charles University, BIOCEV, Prague 12800, Czech Republic
| | - Klara Hlouchova
- Department of Cell Biology, Faculty of Science, Charles University, BIOCEV, Prague 12800, Czech Republic; Institute of Organic Chemistry and Biochemistry, Czech Academy of Sciences, Prague 16610, Czech Republic
40
Longo LM, Kolodny R, McGlynn SE. Evidence for the emergence of β-trefoils by 'Peptide Budding' from an IgG-like β-sandwich. PLoS Comput Biol 2022; 18:e1009833. [PMID: 35157697 PMCID: PMC8880906 DOI: 10.1371/journal.pcbi.1009833] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 01/07/2022] [Revised: 02/25/2022] [Accepted: 01/13/2022] [Indexed: 12/02/2022]
Abstract
As sequence and structure comparison algorithms gain sensitivity, the intrinsic interconnectedness of the protein universe has become increasingly apparent. Despite this general trend, β-trefoils have emerged as an uncommon counterexample: They are an isolated protein lineage for which few, if any, sequence or structure associations to other lineages have been identified. If β-trefoils are, in fact, remote islands in sequence-structure space, it implies that the oligomerizing peptide that founded the β-trefoil lineage itself arose de novo. To better understand β-trefoil evolution, and to probe the limits of fragment sharing across the protein universe, we identified both 'β-trefoil bridging themes' (evolutionarily-related sequence segments) and 'β-trefoil-like motifs' (structure motifs with a hallmark feature of the β-trefoil architecture) in multiple, ostensibly unrelated, protein lineages. The success of the present approach stems, in part, from considering β-trefoil sequence segments or structure motifs rather than the β-trefoil architecture as a whole, as has been done previously. The newly uncovered inter-lineage connections presented here suggest a novel hypothesis about the origins of the β-trefoil fold itself: namely, that it is a derived fold formed by 'budding' from an Immunoglobulin-like β-sandwich protein. These results demonstrate how the evolution of a folded domain from a peptide need not be a signature of antiquity and underpin an emerging truth: few protein lineages escape nature's sewing table.
Affiliation(s)
- Liam M. Longo
- Earth-Life Science Institute, Tokyo Institute of Technology, Tokyo, Japan
- Blue Marble Space Institute of Science, Seattle, Washington, United States of America
| | - Rachel Kolodny
- Department of Computer Science, University of Haifa, Haifa, Israel
| | - Shawn E. McGlynn
- Earth-Life Science Institute, Tokyo Institute of Technology, Tokyo, Japan
- Blue Marble Space Institute of Science, Seattle, Washington, United States of America
41
Carter CW, Popinga A, Bouckaert R, Wills PR. Multidimensional Phylogenetic Metrics Identify Class I Aminoacyl-tRNA Synthetase Evolutionary Mosaicity and Inter-Modular Coupling. Int J Mol Sci 2022; 23:ijms23031520. [PMID: 35163448 PMCID: PMC8835825 DOI: 10.3390/ijms23031520] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 12/19/2021] [Revised: 01/17/2022] [Accepted: 01/17/2022] [Indexed: 02/01/2023]
Abstract
The role of aminoacyl-tRNA synthetases (aaRS) in the emergence and evolution of genetic coding poses challenging questions concerning their provenance. We seek evidence about their ancestry from curated structure-based multiple sequence alignments of a structurally invariant “scaffold” shared by all 10 canonical Class I aaRS. Three uncorrelated phylogenetic metrics—mutation frequency, its uniformity, and row-by-row cladistic congruence—imply that the Class I scaffold is a mosaic assembled from successive genetic sources. Metrics for different modules vary in accordance with their presumed functionality. Sequences derived from the ATP– and amino acid– binding sites exhibit specific two-way coupling to those derived from Connecting Peptide 1, a third module whose metrics suggest later acquisition. The data help validate: (i) experimental fragmentations of the canonical Class I structure into three partitions that retain catalytic activities in proportion to their length; and (ii) evidence that the ancestral Class I aaRS gene also encoded a Class II ancestor in frame on the opposite strand. A 46-residue Class I “protozyme” roots the Class I tree prior to the adaptive radiation of the Rossmann dinucleotide binding fold that refined substrate discrimination. Such rooting implies near simultaneous emergence of genetic coding and the origin of the proteome, resolving a conundrum posed by previous inferences that Class I aaRS evolved after the genetic code had been implemented in an RNA world. Further, pinpointing discontinuous enhancements of aaRS fidelity establishes a timeline for the growth of coding from a binary amino acid alphabet.
Affiliation(s)
- Charles W. Carter
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599-7260, USA
- Correspondence: Tel.: +1-919-966-3263
| | - Alex Popinga
- Centre for Computational Evolution, University of Auckland, PB 92019, Auckland 1142, New Zealand
| | - Remco Bouckaert
- Centre for Computational Evolution, University of Auckland, PB 92019, Auckland 1142, New Zealand
| | - Peter R. Wills
- Department of Physics and Te Ao Marama Centre for Fundamental Inquiry, University of Auckland, PB 92019, Auckland 1142, New Zealand
42
Valer L, Rossetto D, Scintilla S, Hu YJ, Tomar A, Nader S, Betinol IO, Mansy S. Methods to identify and characterize iron-sulfur oligopeptides in water. Can J Chem 2022. [DOI: 10.1139/cjc-2021-0237] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 11/22/2022]
Abstract
Iron-sulfur clusters are ubiquitous cofactors that mediate central biological processes. However, despite their long history, these metallocofactors remain challenging to investigate when coordinated to small (≤ six amino acids) oligopeptides in aqueous solution. In addition to being often unstable in vitro, iron-sulfur clusters can be found in a wide variety of forms with varied characteristics, which makes it difficult to easily discern what is in solution. This difficulty is compounded by the dynamics of iron-sulfur peptides, which frequently coordinate multiple types of clusters simultaneously. To aid investigations of such complex samples, a summary of data from multiple techniques used to characterize both iron-sulfur proteins and peptides is provided. Although not all spectroscopic techniques are equally insightful, it is possible to use several, readily available methods to gain insight into the complex composition of aqueous solutions of iron-sulfur peptides.
Affiliation(s)
- Luca Valer
- University of Trento, 19034, Trento, Trentino-Alto Adige, Italy
| | | | | | - Yin Juan Hu
- University of Alberta, 3158, Chemistry, Edmonton, Alberta, Canada
| | - Anju Tomar
- University of Trento, 19034, Trento, Trentino-Alto Adige, Italy
| | - Serge Nader
- University of Alberta, 3158, Chemistry, Edmonton, Alberta, Canada
| | | | - Sheref Mansy
- University of Alberta, 3158, Chemistry, Edmonton, Alberta, Canada
43
Dong Y, Zhang S, Zhao L. Unraveling the Structural Development of Peptide‐Coordinated Iron‐Sulfur Clusters: Prebiotic Evolution and Biosynthetic Strategies. Chinese J Chem 2022. [DOI: 10.1002/cjoc.202100892] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 11/05/2022]
Affiliation(s)
- Yijun Dong
- School of Life Sciences, Tsinghua University Beijing 100084 China
| | - Siqi Zhang
- Key Laboratory of Bioorganic Phosphorus Chemistry & Chemical Biology, Department of Chemistry Tsinghua University Beijing 100084 China
| | - Liang Zhao
- Key Laboratory of Bioorganic Phosphorus Chemistry & Chemical Biology, Department of Chemistry Tsinghua University Beijing 100084 China
44
Chin AF, Wrabl JO, Hilser VJ. A thermodynamic atlas of proteomes reveals energetic innovation across the tree of life. Mol Biol Evol 2022; 39:6509521. [PMID: 35038744 PMCID: PMC8896757 DOI: 10.1093/molbev/msac010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/12/2022]
Abstract
Protein stability is a fundamental molecular property enabling organisms to adapt to their biological niches. How this is facilitated and whether there are kingdom-specific or more general universal strategies is not known. A principal obstacle to addressing this issue is that the vast majority of proteins lack annotation, specifically thermodynamic annotation, beyond the amino acid and chromosome information derived from genome sequencing. To address this gap and facilitate future investigation into large-scale patterns of protein stability and dynamics within and between organisms, we applied a unique ensemble-based thermodynamic characterization of protein folds to a substantial portion of extant sequenced genomes. Using this approach, we compiled a database resource focused on the position-specific variation in protein stability. Interrogation of the database reveals that 1) domains of life exhibit distinguishing thermodynamic features, with eukaryotes particularly different from both archaea and bacteria, 2) the optimal growth temperature of an organism is proportional to the average apolar enthalpy of its proteome, 3) intrinsic disorder content is also proportional to the apolar enthalpy (but unexpectedly not the predicted stability at 25 °C), and 4) secondary structure and global stability information of individual proteins is extractable. We hypothesize that wider access to residue-specific thermodynamic information of proteomes will result in deeper understanding of mechanisms driving functional adaptation and protein evolution. Our database is free for download at https://afc-science.github.io/thermo-env-atlas/.
Affiliation(s)
- Alexander F Chin
- Department of Biology, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD, 21218, USA
| | - James O Wrabl
- Department of Biology, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD, 21218, USA
| | - Vincent J Hilser
- Department of Biology, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD, 21218, USA; T.C. Jenkins Department of Biophysics, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD, 21218, USA
45
Bromberg Y, Aptekmann AA, Mahlich Y, Cook L, Senn S, Miller M, Nanda V, Ferreiro DU, Falkowski PG. Quantifying structural relationships of metal-binding sites suggests origins of biological electron transfer. Science Advances 2022; 8:eabj3984. [PMID: 35030025 PMCID: PMC8759750 DOI: 10.1126/sciadv.abj3984] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Received: 05/10/2021] [Accepted: 11/22/2021] [Indexed: 06/07/2023]
Abstract
Biological redox reactions drive planetary biogeochemical cycles. Using a novel, structure-guided sequence analysis of proteins, we explored the patterns of evolution of enzymes responsible for these reactions. Our analysis reveals that the folds that bind transition metal–containing ligands have similar structural geometry and amino acid sequences across the full diversity of proteins. Similarity across folds reflects the availability of key transition metals over geological time and strongly suggests that transition metal–ligand binding had a small number of common peptide origins. We observe that structures central to our similarity network come primarily from oxidoreductases, suggesting that ancestral peptides may have also facilitated electron transfer reactions. Last, our results reveal that the earliest biologically functional peptides were likely available before the assembly of fully functional protein domains over 3.8 billion years ago.
Thus, life is a special, very complex form of motion of matter, but this form did not always exist, and it is not separated from inorganic nature by an impassable abyss; rather, it arose from inorganic nature as a new property in the process of evolution of the world. We must study the history of this evolution if we want to solve the problem of the origin of life. [A. I. Oparin (1)]
Affiliation(s)
- Yana Bromberg
- Department of Biochemistry and Microbiology, Rutgers University, 76 Lipman Dr, New Brunswick, NJ 08873, USA
| | - Ariel A. Aptekmann
- Department of Biochemistry and Microbiology, Rutgers University, 76 Lipman Dr, New Brunswick, NJ 08873, USA
| | - Yannick Mahlich
- Department of Biochemistry and Microbiology, Rutgers University, 76 Lipman Dr, New Brunswick, NJ 08873, USA
| | - Linda Cook
- Program in Applied and Computational Math, Princeton University, Princeton, NJ 08540, USA
| | - Stefan Senn
- Department of Biochemistry and Microbiology, Rutgers University, 76 Lipman Dr, New Brunswick, NJ 08873, USA
| | - Maximillian Miller
- Department of Biochemistry and Microbiology, Rutgers University, 76 Lipman Dr, New Brunswick, NJ 08873, USA
| | - Vikas Nanda
- Department of Biochemistry and Molecular Biology, Robert Wood Johnson Medical School, and Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, NJ 08854, USA
| | - Diego U. Ferreiro
- Protein Physiology Lab, Departamento de Química Biológica, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN-CONICET), Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Paul G. Falkowski
- Environmental Biophysics and Molecular Ecology Program, Department of Marine and Coastal Sciences, Rutgers University, New Brunswick, NJ 08901, USA
46
Jackson C, Toth-Petroczy A, Kolodny R, Hollfelder F, Fuxreiter M, Caroline Lynn Kamerlin S, Tokuriki N. Adventures on the routes of protein evolution — in memoriam Dan Salah Tawfik (1955 - 2021). J Mol Biol 2022; 434:167462. [DOI: 10.1016/j.jmb.2022.167462] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 01/06/2022] [Accepted: 01/17/2022] [Indexed: 12/21/2022]
47
Coyote-Maestas W, Nedrud D, Suma A, He Y, Matreyek KA, Fowler DM, Carnevale V, Myers CL, Schmidt D. Probing ion channel functional architecture and domain recombination compatibility by massively parallel domain insertion profiling. Nat Commun 2021; 12:7114. [PMID: 34880224 PMCID: PMC8654947 DOI: 10.1038/s41467-021-27342-0] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Received: 05/28/2021] [Accepted: 11/16/2021] [Indexed: 11/10/2022]
Abstract
Protein domains are the basic units of protein structure and function. Comparative analysis of genomes and proteomes showed that domain recombination is a main driver of multidomain protein functional diversification and some of the constraining genomic mechanisms are known. Much less is known about biophysical mechanisms that determine whether protein domains can be combined into viable protein folds. Here, we use massively parallel insertional mutagenesis to determine compatibility of over 300,000 domain recombination variants of the Inward Rectifier K+ channel Kir2.1 with channel surface expression. Our data suggest that genomic and biophysical mechanisms acted in concert to favor the gain of large, structured domains at protein termini during ion channel evolution. We use machine learning to build a quantitative biophysical model of domain compatibility in Kir2.1 that allows us to derive rudimentary rules for designing domain insertion variants that fold and traffic to the cell surface. Positional Kir2.1 responses to motif insertion cluster into distinct groups that correspond to contiguous structural regions of the channel with distinct biophysical properties tuned towards providing either folding stability or gating transitions. This suggests that insertional profiling is a high-throughput method to annotate the function of ion channel structural regions.
Affiliation(s)
- Willow Coyote-Maestas
- Department of Biochemistry, Molecular Biology & Biophysics, University of Minnesota, Minneapolis, MN 55455 USA
| | - David Nedrud
- Department of Biochemistry, Molecular Biology & Biophysics, University of Minnesota, Minneapolis, MN 55455 USA
| | - Antonio Suma
- Department of Chemistry, Temple University, Philadelphia, PA 19122 USA
| | - Yungui He
- Department of Genetics, Cell Biology & Development, University of Minnesota, Minneapolis, MN 55455 USA
| | - Kenneth A. Matreyek
- Department of Pathology, Case Western Reserve University School of Medicine, Cleveland, OH 44106 USA
| | - Douglas M. Fowler
- Department of Genome Sciences, University of Washington, Seattle, WA 98115 USA; Department of Bioengineering, University of Washington, Seattle, WA 98115 USA
| | - Vincenzo Carnevale
- Department of Chemistry, Temple University, Philadelphia, PA 19122 USA
| | - Chad L. Myers
- Department of Computer Science and Engineering, University of Minnesota, Minneapolis, MN 55455 USA
| | - Daniel Schmidt
- Department of Genetics, Cell Biology & Development, University of Minnesota, Minneapolis, MN, 55455, USA.
48
Le Vay K, Song EY, Ghosh B, Tang TD, Mutschler H. Enhanced Ribozyme‐Catalyzed Recombination and Oligonucleotide Assembly in Peptide‐RNA Condensates. Angew Chem Int Ed Engl 2021. [DOI: 10.1002/ange.202109267] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 01/14/2023]
Affiliation(s)
- Kristian Le Vay
- Biomimetic Systems Max Planck Institute of Biochemistry Am Klopferspitz 18 82152 Martinsried Germany
- Department of Chemistry and Chemical Biology TU Dortmund University Otto-Hahn-Str. 4a 44227 Dortmund Germany
| | - Emilie Yeonwha Song
- Biomimetic Systems Max Planck Institute of Biochemistry Am Klopferspitz 18 82152 Martinsried Germany
- Department of Chemistry and Chemical Biology TU Dortmund University Otto-Hahn-Str. 4a 44227 Dortmund Germany
| | - Basusree Ghosh
- Max-Planck Institute of Molecular Cell Biology and Genetics Pfotenhauerstraße 108 01307 Dresden Germany
| | - T.‐Y. Dora Tang
- Max-Planck Institute of Molecular Cell Biology and Genetics Pfotenhauerstraße 108 01307 Dresden Germany
| | - Hannes Mutschler
- Department of Chemistry and Chemical Biology TU Dortmund University Otto-Hahn-Str. 4a 44227 Dortmund Germany
49
Papadopoulos C, Callebaut I, Gelly JC, Hatin I, Namy O, Renard M, Lespinet O, Lopes A. Intergenic ORFs as elementary structural modules of de novo gene birth and protein evolution. Genome Res 2021; 31:2303-2315. [PMID: 34810219 PMCID: PMC8647833 DOI: 10.1101/gr.275638.121] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 04/13/2021] [Accepted: 09/23/2021] [Indexed: 01/08/2023]
Abstract
The noncoding genome plays an important role in de novo gene birth and in the emergence of genetic novelty. Nevertheless, how noncoding sequences' properties could promote the birth of novel genes and shape the evolution and the structural diversity of proteins remains unclear. Therefore, by combining different bioinformatic approaches, we characterized the fold potential diversity of the amino acid sequences encoded by all intergenic open reading frames (ORFs) of S. cerevisiae with the aim of (1) exploring whether the structural states' diversity of proteomes is already present in noncoding sequences, and (2) estimating the potential of the noncoding genome to produce novel protein bricks that could either give rise to novel genes or be integrated into pre-existing proteins, thus participating in protein structure diversity and evolution. We showed that amino acid sequences encoded by most yeast intergenic ORFs contain the elementary building blocks of protein structures. Moreover, they encompass the large structural state diversity of canonical proteins, with the majority predicted as foldable. Then, we investigated the early stages of de novo gene birth by reconstructing the ancestral sequences of 70 yeast de novo genes and characterized the sequence and structural properties of intergenic ORFs with a strong translation signal. This enabled us to highlight sequence and structural factors determining de novo gene emergence. Finally, we showed a strong correlation between the fold potential of de novo proteins and one of their ancestral amino acid sequences, reflecting the relationship between the noncoding genome and the protein structure universe.
Affiliation(s)
- Chris Papadopoulos
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
| | - Isabelle Callebaut
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, 75005 Paris, France
| | - Jean-Christophe Gelly
- Université de Paris, Biologie Intégrée du Globule Rouge, UMR_S1134, BIGR, INSERM, F-75015 Paris, France
- Laboratoire d'Excellence GR-Ex, 75015 Paris, France
- Institut National de la Transfusion Sanguine, F-75015 Paris, France
| | - Isabelle Hatin
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
| | - Olivier Namy
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
| | - Maxime Renard
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
| | - Olivier Lespinet
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
| | - Anne Lopes
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
50
Caetano-Anollés G, Aziz MF, Mughal F, Caetano-Anollés D. Tracing protein and proteome history with chronologies and networks: folding recapitulates evolution. Expert Rev Proteomics 2021; 18:863-880. [PMID: 34628994 DOI: 10.1080/14789450.2021.1992277] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 01/24/2023]
Abstract
INTRODUCTION: While the origin and evolution of proteins remain mysterious, advances in evolutionary genomics and systems biology are facilitating the historical exploration of the structure, function and organization of proteins and proteomes. Molecular chronologies are series of time events describing the history of biological systems and subsystems and the rise of biological innovations. Together with time-varying networks, these chronologies provide a window into the past. AREAS COVERED: Here, we review molecular chronologies and networks built with modern methods of phylogeny reconstruction. We discuss how chronologies of structural domain families uncover the explosive emergence of metabolism, the late rise of translation, the co-evolution of ribosomal proteins and rRNA, and the late development of the ribosomal exit tunnel; events that coincided with a tendency to shorten folding time. Evolving networks described the early emergence of domains and a late 'big bang' of domain combinations. EXPERT OPINION: Two processes, folding and recruitment, appear central to the evolutionary progression. The former increases protein persistence. The latter fosters diversity. Chronologically, protein evolution mirrors folding by combining supersecondary structures into domains, developing translation machinery to facilitate folding speed and stability, and enhancing structural complexity by establishing long-distance interactions in novel structural and architectural designs.
Affiliation(s)
- Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, Illinois, USA; C. R. Woese Institute for Genomic Biology, University of Illinois, Urbana, Illinois, USA
| | - M Fayez Aziz
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, Illinois, USA
| | - Fizza Mughal
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, Illinois, USA
| | - Derek Caetano-Anollés
- Data Science Platform, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA