1
Yagi S, Tagami S. An ancestral fold reveals the evolutionary link between RNA polymerase and ribosomal proteins. Nat Commun 2024; 15:5938. [PMID: 39025855 PMCID: PMC11258233 DOI: 10.1038/s41467-024-50013-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/16/2023] [Accepted: 06/25/2024] [Indexed: 07/20/2024] Open
Abstract
Numerous molecular machines are required to drive the central dogma of molecular biology. However, the means by which these numerous proteins emerged in the early evolutionary stage of life remains enigmatic. Many of them possess small β-barrel folds with different topologies, represented by double-psi β-barrels (DPBBs) conserved in DNA and RNA polymerases, and similar but topologically distinct six-stranded β-barrel RIFT or five-stranded β-barrel folds such as OB and SH3 in ribosomal proteins. Here, we discover that the previously reconstructed ancient DPBB sequence could also adopt a β-barrel fold named Double-Zeta β-barrel (DZBB), as a metamorphic protein. The DZBB fold is not found in any modern protein, although its structure shares similarities with RIFT and OB. Indeed, DZBB could be transformed into them through simple engineering experiments. Furthermore, the OB designs could be further converted into SH3 by circular-permutation as previously predicted. These results indicate that these β-barrels diversified quickly from a common ancestor at the beginning of the central dogma evolution.
Affiliation(s)
- Sota Yagi
- RIKEN Center for Biosystems Dynamics Research, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa, 230-0045, Japan.
- Faculty of Human Sciences, Waseda University, 2-579-15, Mikajima, Tokorozawa, Saitama, 359-1192, Japan.
- Shunsuke Tagami
- RIKEN Center for Biosystems Dynamics Research, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa, 230-0045, Japan.
- Graduate School of Medicine, Science and Technology, Shinshu University, 3-1-1 Asahi, Matsumoto City, Nagano, 390-8621, Japan.
- International Institute for Sustainability with Knotted Chiral Meta Matter (WPI-SKCM²), Hiroshima University, 1-3-1 Kagamiyama, Higashi-Hiroshima, Hiroshima, 739-8526, Japan.
2
Toledo-Patiño S, Goetz SK, Shanmugaratnam S, Höcker B, Farías-Rico JA. Molecular handcraft of a well-folded protein chimera. FEBS Lett 2024; 598:1375-1386. [PMID: 38508768 DOI: 10.1002/1873-3468.14856] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/11/2023] [Revised: 02/11/2024] [Accepted: 02/12/2024] [Indexed: 03/22/2024]
Abstract
Modular assembly is a compelling pathway to create new proteins, a concept supported by protein engineering and millennia of evolution. Natural evolution provided a repository of building blocks, known as domains, which trace back to even shorter segments that underwent numerous 'copy-paste' processes culminating in the scaffolds we see today. Utilizing the subdomain-database Fuzzle, we constructed a fold-chimera by integrating a flavodoxin-like fragment into a periplasmic binding protein. This chimera is well-folded and a crystal structure reveals stable interfaces between the fragments. These findings demonstrate the adaptability of α/β-proteins and offer a stepping stone for optimization. By emphasizing the practicality of fragment databases, our work pioneers new pathways in protein engineering. Ultimately, the results substantiate the conjecture that periplasmic binding proteins originated from a flavodoxin-like ancestor.
Affiliation(s)
- Saacnicteh Toledo-Patiño
- Max Planck Institute for Developmental Biology, Tübingen, Germany
- Okinawa Institute of Science and Technology Graduate University, Japan
- Sooruban Shanmugaratnam
- Max Planck Institute for Developmental Biology, Tübingen, Germany
- Department of Biochemistry, University of Bayreuth, Germany
- Birte Höcker
- Max Planck Institute for Developmental Biology, Tübingen, Germany
- Department of Biochemistry, University of Bayreuth, Germany
- José Arcadio Farías-Rico
- Max Planck Institute for Developmental Biology, Tübingen, Germany
- Synthetic Biology Program, Center for Genome Sciences, National Autonomous University of Mexico, Cuernavaca, Mexico
3
Zheng Z, Goncearenco A, Berezovsky IN. Back in time to the Gly-rich prototype of the phosphate binding elementary function. Curr Res Struct Biol 2024; 7:100142. [PMID: 38655428 PMCID: PMC11035071 DOI: 10.1016/j.crstbi.2024.100142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/30/2023] [Revised: 03/31/2024] [Accepted: 04/03/2024] [Indexed: 04/26/2024] Open
Abstract
Binding of nucleotides and their derivatives is one of the most ancient elementary functions dating back to the Origin of Life. We review here the works considering one of the key elements in binding of (di)nucleotide-containing ligands - phosphate binding. We start from a brief discussion of major participants, conditions, and events in prebiotic evolution that resulted in the Origin of Life. Tracing back to the basic functions, including metal and phosphate binding, and, potentially, formation of primitive protein-protein interactions, we focus here on the phosphate binding. Critically assessing works on the structural, functional, and evolutionary aspects of phosphate binding, we perform a simple computational experiment reconstructing its most ancient and generic sequence prototype. The profiles of the phosphate binding signatures have been derived in form of position-specific scoring matrices (PSSMs), their peculiarities depending on the type of the ligands have been analyzed, and evolutionary connections between them have been delineated. Then, the apparent prototype that gave rise to all relevant phosphate-binding signatures had also been reconstructed. We show that two major signatures of the phosphate binding that discriminate between the binding of dinucleotide- and nucleotide-containing ligands are GxGxxG and GxxGxG, respectively. It appears that the signature archetypal for dinucleotide-containing ligands is more generic, and it can frequently bind phosphate groups in nucleotide-containing ligands as well. The reconstructed prototype's key signature GxGGxG underlies the role of glycine residues in providing flexibility and interactions necessary for binding the phosphate groups. The prototype also contains other ancient amino acids, valine, and alanine, showing versatility towards evolutionary design and functional diversification.
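As a rough illustration of the two signatures named in the abstract, the sketch below treats each `x` as a single arbitrary residue and scans a sequence with a regular expression. This is a simplification and not the authors' method, which derives position-specific scoring matrices; the function name `find_signature` and the toy sequence are hypothetical.

```python
import re

def find_signature(sequence, signature):
    """Return 0-based start positions where `signature` occurs in `sequence`.

    In `signature`, the letter 'x' stands for any single residue (assumption:
    the usual motif shorthand); every other letter must match exactly.
    """
    # Translate the motif shorthand into a regular expression, e.g. GxGxxG -> G.G..G
    pattern = "".join("." if ch == "x" else re.escape(ch) for ch in signature)
    # A lookahead lets overlapping occurrences be reported as well.
    return [m.start() for m in re.finditer(f"(?={pattern})", sequence)]

# Toy sequence invented for illustration; it is not taken from any real protein.
toy = "AAGAGAAGAA"
print(find_signature(toy, "GxGxxG"))  # signature associated with dinucleotide-containing ligands
print(find_signature(toy, "GxxGxG"))  # signature associated with nucleotide-containing ligands
```

A regular expression only reports exact matches; a scoring matrix such as a PSSM would instead rank near matches, which is why the abstract's approach is more sensitive than this sketch.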
Affiliation(s)
- Zejun Zheng
- Bioinformatics Institute, Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore
- Igor N. Berezovsky
- Bioinformatics Institute, Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore
- Department of Biological Sciences (DBS), National University of Singapore (NUS), 8 Medical Drive, 117579, Singapore
4
Ye W, Krishna Behra PR, Dyrhage K, Seeger C, Joiner JD, Karlsson E, Andersson E, Chi CN, Andersson SGE, Jemth P. Folded Alpha Helical Putative New Proteins from Apilactobacillus kunkeei. J Mol Biol 2024; 436:168490. [PMID: 38355092 DOI: 10.1016/j.jmb.2024.168490] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/28/2023] [Revised: 02/07/2024] [Accepted: 02/08/2024] [Indexed: 02/16/2024]
Abstract
The emergence of new proteins is a central question in biology. Most tertiary protein folds known to date appear to have an ancient origin, but it is clear from bioinformatic analyses that new proteins continuously emerge in all organismal groups. However, there is a paucity of experimental data on new proteins regarding their structure and biophysical properties. We performed a detailed phylogenetic analysis and identified 48 putative open reading frames in the honeybee-associated bacterium Apilactobacillus kunkeei for which no or few homologs could be identified in closely-related species, suggesting that they could be relatively new on an evolutionary time scale and represent recently evolved proteins. Using circular dichroism-, fluorescence- and nuclear magnetic resonance (NMR) spectroscopy we investigated six of these proteins and show that they are not intrinsically disordered, but populate alpha-helical dominated folded states with relatively low thermodynamic stability (0-3 kcal/mol). The NMR and biophysical data demonstrate that small new proteins readily adopt simple folded conformations suggesting that more complex tertiary structures can be continuously re-invented during evolution by fusion of such simple secondary structure elements. These findings have implications for the general view on protein evolution, where de novo emergence of folded proteins may be a common event.
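To put the reported stability range in perspective, a simple two-state folding model (an assumption here, not something the abstract states) relates the unfolding free energy ΔG to the equilibrium fraction of folded molecules, f = 1 / (1 + exp(-ΔG / RT)). The sketch below evaluates this at several points in the 0-3 kcal/mol range; the temperature of 298 K and the function name `fraction_folded` are illustrative choices.

```python
import math

R_KCAL = 0.0019872  # gas constant in kcal/(mol*K)

def fraction_folded(delta_g_unfold, temperature=298.0):
    """Fraction of folded molecules under a two-state model.

    delta_g_unfold is the unfolding free energy in kcal/mol
    (positive values favor the folded state).
    """
    return 1.0 / (1.0 + math.exp(-delta_g_unfold / (R_KCAL * temperature)))

for dg in (0.0, 1.0, 2.0, 3.0):
    print(f"dG = {dg:.1f} kcal/mol -> fraction folded = {fraction_folded(dg):.3f}")
```

Under these assumptions, 0 kcal/mol corresponds to an even split between folded and unfolded states, while 3 kcal/mol corresponds to a population that is more than 99% folded.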
Affiliation(s)
- Weihua Ye
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, 75123 Uppsala, Sweden
- Phani Rama Krishna Behra
- Department of Molecular Evolution, Cell and Molecular Biology, Biomedical Centre, Science for Life Laboratory, Uppsala University, 75236 Uppsala, Sweden
- Karl Dyrhage
- Department of Molecular Evolution, Cell and Molecular Biology, Biomedical Centre, Science for Life Laboratory, Uppsala University, 75236 Uppsala, Sweden
- Christian Seeger
- Department of Molecular Evolution, Cell and Molecular Biology, Biomedical Centre, Science for Life Laboratory, Uppsala University, 75236 Uppsala, Sweden
- Joe D Joiner
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, 75123 Uppsala, Sweden
- Elin Karlsson
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, 75123 Uppsala, Sweden
- Eva Andersson
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, 75123 Uppsala, Sweden
- Celestine N Chi
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, 75123 Uppsala, Sweden.
- Siv G E Andersson
- Department of Molecular Evolution, Cell and Molecular Biology, Biomedical Centre, Science for Life Laboratory, Uppsala University, 75236 Uppsala, Sweden.
- Per Jemth
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, 75123 Uppsala, Sweden.
5
Schierholz L, Brown CR, Helena-Bueno K, Uversky VN, Hirt RP, Barandun J, Melnikov SV. A Conserved Ribosomal Protein Has Entirely Dissimilar Structures in Different Organisms. Mol Biol Evol 2024; 41:msad254. [PMID: 37987564 PMCID: PMC10764239 DOI: 10.1093/molbev/msad254] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/07/2023] [Revised: 10/23/2023] [Accepted: 11/16/2023] [Indexed: 11/22/2023] Open
Abstract
Ribosomes from different species can markedly differ in their composition by including dozens of ribosomal proteins that are unique to specific lineages but absent in others. However, it remains unknown how ribosomes acquire new proteins throughout evolution. Here, to help answer this question, we describe the evolution of the ribosomal protein msL1/msL2 that was recently found in ribosomes from the parasitic microorganism clade, microsporidia. We show that this protein has a conserved location in the ribosome but entirely dissimilar structures in different organisms: in each of the analyzed species, msL1/msL2 exhibits an altered secondary structure, an inverted orientation of the N-termini and C-termini on the ribosomal binding surface, and a completely transformed 3D fold. We then show that this fold switching is likely caused by changes in the ribosomal msL1/msL2-binding site, specifically, by variations in rRNA. These observations allow us to infer an evolutionary scenario in which a small, positively charged, de novo-born unfolded protein was first captured by rRNA to become part of the ribosome and subsequently underwent complete fold switching to optimize its binding to its evolving ribosomal binding site. Overall, our work provides a striking example of how a protein can switch its fold in the context of a complex biological assembly, while retaining its specificity for its molecular partner. This finding will help us better understand the origin and evolution of new protein components of complex molecular assemblies-thereby enhancing our ability to engineer biological molecules, identify protein homologs, and peer into the history of life on Earth.
Affiliation(s)
- Léon Schierholz
- Department of Molecular Biology, Laboratory for Molecular Infection Medicine Sweden, Umeå Centre for Microbial Research, Science for Life Laboratory, Umeå University, Umeå 901 87, Sweden
- Charlotte R Brown
- Biosciences Institute, Newcastle University School of Medicine, Newcastle upon Tyne NE2 4HH, UK
- Karla Helena-Bueno
- Biosciences Institute, Newcastle University School of Medicine, Newcastle upon Tyne NE2 4HH, UK
- Vladimir N Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA
- Robert P Hirt
- Biosciences Institute, Newcastle University School of Medicine, Newcastle upon Tyne NE2 4HH, UK
- Jonas Barandun
- Department of Molecular Biology, Laboratory for Molecular Infection Medicine Sweden, Umeå Centre for Microbial Research, Science for Life Laboratory, Umeå University, Umeå 901 87, Sweden
- Sergey V Melnikov
- Biosciences Institute, Newcastle University School of Medicine, Newcastle upon Tyne NE2 4HH, UK
6
Benjdia A, Berteau O. B12-dependent radical SAM enzymes: Ever expanding structural and mechanistic diversity. Curr Opin Struct Biol 2023; 83:102725. [PMID: 37931378 DOI: 10.1016/j.sbi.2023.102725] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/27/2023] [Revised: 09/26/2023] [Accepted: 10/03/2023] [Indexed: 11/08/2023]
Abstract
In the last decade, B12-dependent radical SAM enzymes have emerged as central biocatalysts in the biosynthesis of a myriad of natural products. Notably, these enzymes have been shown to catalyze carbon-carbon bond formation on unactivated carbon atoms leading to unusual methylations. Recently, structural studies have revealed unprecedented insights into the complex chemistry catalyzed by these enzymes. In this review, we cover recent advances in our understanding of B12-dependent radical SAM enzymes from a mechanistic and structural perspective. We discuss the unanticipated diversity of these enzymes which suggests evolutionary links between various biosynthetic and metabolic pathways from antibiotic to RiPP and methane biosynthesis.
Affiliation(s)
- Alhosna Benjdia
- Université Paris-Saclay, INRAE, AgroParisTech, Micalis Institute, ChemSyBio, 78350, Jouy-en-Josas, France.
- Olivier Berteau
- Université Paris-Saclay, INRAE, AgroParisTech, Micalis Institute, ChemSyBio, 78350, Jouy-en-Josas, France.
7
Michel F, Romero-Romero S, Höcker B. Retracing the evolution of a modern periplasmic binding protein. Protein Sci 2023; 32:e4793. [PMID: 37788980 PMCID: PMC10601554 DOI: 10.1002/pro.4793] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/10/2023] [Revised: 09/20/2023] [Accepted: 09/22/2023] [Indexed: 10/05/2023]
Abstract
Investigating the evolution of structural features in modern multidomain proteins helps to understand their immense diversity and functional versatility. The class of periplasmic binding proteins (PBPs) offers an opportunity to interrogate one of the main processes driving diversification: the duplication and fusion of protein sequences to generate new architectures. The symmetry of their two-lobed topology, their mechanism of binding, and the organization of their operon structure led to the hypothesis that PBPs arose through a duplication and fusion event of a single common ancestor. To investigate this claim, we set out to reverse the evolutionary process and recreate the structural equivalent of a single-lobed progenitor using ribose-binding protein (RBP) as our model. We found that this modern PBP can be deconstructed into its lobes, producing two proteins that represent possible progenitor halves. The isolated halves of RBP are well folded and monomeric proteins, albeit with a lower thermostability, and do not retain the original binding function. However, the two entities readily form a heterodimer in vitro and in-cell. The x-ray structure of the heterodimer closely resembles the parental protein. Moreover, the binding function is fully regained upon formation of the heterodimer with a ligand affinity similar to that observed in the modern RBP. This highlights how a duplication event could have given rise to a stable and functional PBP-like fold and provides insights into how more complex functional structures can evolve from simpler molecular components.
Affiliation(s)
- Florian Michel
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
- Birte Höcker
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
8
Porter LL. Fluid protein fold space and its implications. Bioessays 2023; 45:e2300057. [PMID: 37431685 PMCID: PMC10529699 DOI: 10.1002/bies.202300057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/30/2023] [Revised: 06/21/2023] [Accepted: 06/23/2023] [Indexed: 07/12/2023]
Abstract
Fold-switching proteins, which remodel their secondary and tertiary structures in response to cellular stimuli, suggest a new view of protein fold space. For decades, experimental evidence has indicated that protein fold space is discrete: dissimilar folds are encoded by dissimilar amino acid sequences. Challenging this assumption, fold-switching proteins interconnect discrete groups of dissimilar protein folds, making protein fold space fluid. Three recent observations support the concept of fluid fold space: (1) some amino acid sequences interconvert between folds with distinct secondary structures, (2) some naturally occurring sequences have switched folds by stepwise mutation, and (3) fold switching is evolutionarily selected and likely confers advantage. These observations indicate that minor amino acid sequence modifications can transform protein structure and function. Consequently, proteomic structural and functional diversity may be expanded by alternative splicing, small nucleotide polymorphisms, post-translational modifications, and modified translation rates.
Affiliation(s)
- Lauren L. Porter
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD
- National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD
9
Chakravarty D, Sreenivasan S, Swint-Kruse L, Porter LL. Identification of a covert evolutionary pathway between two protein folds. Nat Commun 2023; 14:3177. [PMID: 37264049 DOI: 10.1038/s41467-023-38519-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Received: 12/07/2022] [Accepted: 05/03/2023] [Indexed: 06/03/2023] Open
Abstract
Although homologous protein sequences are expected to adopt similar structures, some amino acid substitutions can interconvert α-helices and β-sheets. Such fold switching may have occurred over evolutionary history, but supporting evidence has been limited by the: (1) abundance and diversity of sequenced genes, (2) quantity of experimentally determined protein structures, and (3) assumptions underlying the statistical methods used to infer homology. Here, we overcome these barriers by applying multiple statistical methods to a family of ~600,000 bacterial response regulator proteins. We find that their homologous DNA-binding subunits assume divergent structures: helix-turn-helix versus α-helix + β-sheet (winged helix). Phylogenetic analyses, ancestral sequence reconstruction, and AlphaFold2 models indicate that amino acid substitutions facilitated a switch from helix-turn-helix into winged helix. This structural transformation likely expanded DNA-binding specificity. Our approach uncovers an evolutionary pathway between two protein folds and provides a methodology to identify secondary structure switching in other protein families.
Affiliation(s)
- Devlina Chakravarty
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA
- Shwetha Sreenivasan
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, KS, 66160, USA
- Liskin Swint-Kruse
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, KS, 66160, USA
- Lauren L Porter
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA.
- Biochemistry and Biophysics Center, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, 20892, USA.
10
Aina A, Hsueh SCC, Plotkin SS. PROTHON: A Local Order Parameter-Based Method for Efficient Comparison of Protein Ensembles. J Chem Inf Model 2023. [PMID: 37178169 DOI: 10.1021/acs.jcim.3c00145] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Indexed: 05/15/2023]
Abstract
The comparison of protein conformational ensembles is of central importance in structural biology. However, there are few computational methods for ensemble comparison, and those that are readily available, such as ENCORE, utilize methods that are sufficiently computationally expensive to be prohibitive for large ensembles. Here, a new method is presented for efficient representation and comparison of protein conformational ensembles. The method is based on the representation of a protein ensemble as a vector of probability distribution functions (pdfs), with each pdf representing the distribution of a local structural property such as the number of contacts between Cβ atoms. Dissimilarity between two conformational ensembles is quantified by the Jensen-Shannon distance between the corresponding set of probability distribution functions. The method is validated for conformational ensembles generated by molecular dynamics simulations of ubiquitin, as well as experimentally derived conformational ensembles of a 130 amino acid truncated form of human tau protein. In the ubiquitin ensemble data set, the method was up to 88 times faster than the existing ENCORE software, while simultaneously utilizing 48 times fewer computing cores. We make the method available as a Python package, called PROTHON, and provide a GitHub page with the Python source code at https://github.com/PlotkinLab/Prothon.
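The comparison described in the abstract can be sketched in a few lines: each ensemble is reduced to one histogram per residue of some local property, and the histograms are compared with the Jensen-Shannon distance. The code below is a minimal standard-library illustration, not the PROTHON API; the function names, the binning parameters, the toy data, and the choice to average the distance over residues are all assumptions made for this sketch.

```python
import math

def histogram(values, bins, lo, hi):
    """Normalized histogram of `values` over `bins` equal-width bins spanning [lo, hi]."""
    counts = [0] * bins
    width = (hi - lo) / bins
    for v in values:
        # Clamp out-of-range values into the first or last bin.
        idx = min(max(int((v - lo) / width), 0), bins - 1)
        counts[idx] += 1
    total = sum(counts)
    return [c / total for c in counts]

def js_distance(p, q):
    """Jensen-Shannon distance: square root of the JS divergence (base-2 logs, range 0..1)."""
    m = [(a + b) / 2 for a, b in zip(p, q)]

    def kl(x, y):
        # Terms with x_i == 0 contribute nothing; y_i > 0 whenever x_i > 0.
        return sum(a * math.log2(a / b) for a, b in zip(x, y) if a > 0)

    divergence = 0.5 * kl(p, m) + 0.5 * kl(q, m)
    return math.sqrt(max(divergence, 0.0))  # guard against tiny negative rounding

def ensemble_dissimilarity(ens_a, ens_b, bins=10, lo=0.0, hi=20.0):
    """Mean per-residue JS distance between two ensembles.

    Each ensemble is a list of conformations; each conformation is a list with
    one local property value (e.g. a contact count) per residue. Averaging over
    residues is an assumption of this sketch.
    """
    n_res = len(ens_a[0])
    distances = []
    for i in range(n_res):
        p = histogram([conf[i] for conf in ens_a], bins, lo, hi)
        q = histogram([conf[i] for conf in ens_b], bins, lo, hi)
        distances.append(js_distance(p, q))
    return sum(distances) / n_res

# Toy data invented for illustration: 3 conformations x 2 residues per ensemble.
ens_a = [[4, 10], [5, 11], [4, 12]]
ens_b = [[4, 2], [5, 3], [4, 1]]
print(ensemble_dissimilarity(ens_a, ens_a))  # identical ensembles
print(ensemble_dissimilarity(ens_a, ens_b))  # residue 0 agrees, residue 1 does not overlap
```

Because each residue is summarized independently by a small histogram, the cost grows linearly with the number of conformations, which is consistent with the efficiency argument made in the abstract.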
Affiliation(s)
- Adekunle Aina
- Department of Physics and Astronomy, The University of British Columbia, Vancouver, BC V6T 1Z1, Canada
- Shawn C C Hsueh
- Department of Physics and Astronomy, The University of British Columbia, Vancouver, BC V6T 1Z1, Canada
- Steven S Plotkin
- Department of Physics and Astronomy, The University of British Columbia, Vancouver, BC V6T 1Z1, Canada
- Genome Science and Technology Program, The University of British Columbia, Vancouver, BC V6T 1Z1, Canada
11
Alrouji M, Majrashi TA, Alhumaydhi FA, Zari A, Zari TA, Al Abdulmonem W, Sharaf SE, Shahwan M, Anwar S, Shamsi A, Atiya A. Unveiling Phytoconstituents with Inhibitory Potential Against Tyrosine-Protein Kinase Fyn: A Comprehensive Virtual Screening Approach Targeting Alzheimer's Disease. J Alzheimers Dis 2023; 96:827-844. [PMID: 37899058 DOI: 10.3233/jad-230828] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/31/2023]
Abstract
BACKGROUND: Tyrosine-protein kinase Fyn (Fyn) is a critical signaling molecule involved in various cellular processes, including neuronal development, synaptic plasticity, and disease pathogenesis. Dysregulation of Fyn kinase has been implicated in various complex diseases, including neurodegenerative disorders such as Alzheimer's and Parkinson's diseases, as well as different cancer types. Therefore, identifying small molecule inhibitors that can inhibit Fyn activity holds substantial significance in drug discovery. OBJECTIVE: The aim of this study was to identify potential small-molecule inhibitors among bioactive phytoconstituents against tyrosine-protein kinase Fyn. METHODS: Through a comprehensive approach involving molecular docking, drug likeliness filters, and molecular dynamics (MD) simulations, we performed a virtual screening of a natural compounds library. This methodology aimed to pinpoint compounds potentially interacting with Fyn kinase and inhibiting its activity. RESULTS: This study finds two potential natural compounds: Dehydromillettone and Tanshinone B. These compounds demonstrated substantial affinity and specific interactions towards the Fyn binding pocket. Their conformations exhibited compatibility and stability, indicating the formation of robust protein-ligand complexes. A significant array of non-covalent interactions supported the structural integrity of these complexes. CONCLUSION: Dehydromillettone and Tanshinone B emerge as promising candidates, poised for further optimization as Fyn kinase inhibitors with therapeutic applications. In a broader context, this study demonstrates the potential of computational drug discovery, underscoring its utility in identifying compounds with clinical significance. The identified inhibitors hold promise in addressing a spectrum of cancer and neurodegenerative disorders. However, their efficacy and safety necessitate validation through subsequent experimental studies.
Affiliation(s)
- Mohammed Alrouji
- Department of Medical Laboratories, College of Applied Medical Sciences, Shaqra University, Shaqra, Saudi Arabia
- Taghreed A Majrashi
- Department of Pharmacognosy, College of Pharmacy, King Khalid University (KKU), Guraiger, Abha, Saudi Arabia
- Fahad A Alhumaydhi
- Department of Medical Laboratories, College of Applied Medical Sciences, Qassim University, Buraydah, Saudi Arabia
- Ali Zari
- Department of Biological Sciences, Faculty of Science, King Abdulaziz University, Jeddah, Saudi Arabia
- Princess Dr. Najla Bint Saud Al-Saud Center for Excellence Research in Biotechnology, King Abdulaziz University, Jeddah, Saudi Arabia
- Talal A Zari
- Department of Biological Sciences, Faculty of Science, King Abdulaziz University, Jeddah, Saudi Arabia
- Waleed Al Abdulmonem
- Department of Pathology, College of Medicine, Qassim University, Buraydah, Saudi Arabia
- Sharaf E Sharaf
- Pharmaceutical Chemistry Department, College of Pharmacy Umm Al-Qura University Makkah, Saudi Arabia
- Moyad Shahwan
- Center for Medical and Bio-Allied Health Sciences, Ajman University, Ajman, UAE
- Saleha Anwar
- Centre for Interdisciplinary Research in Basic Sciences, Jamia Millia Islamia, Jamia Nagar, New Delhi, India
- Anas Shamsi
- Center for Medical and Bio-Allied Health Sciences, Ajman University, Ajman, UAE
- Akhtar Atiya
- Department of Pharmacognosy, College of Pharmacy, King Khalid University (KKU), Guraiger, Abha, Saudi Arabia
12
Pillai AS, Hochberg GK, Thornton JW. Simple mechanisms for the evolution of protein complexity. Protein Sci 2022; 31:e4449. [PMID: 36107026 PMCID: PMC9601886 DOI: 10.1002/pro.4449] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 06/23/2022] [Revised: 09/01/2022] [Accepted: 09/10/2022] [Indexed: 01/26/2023]
Abstract
Proteins are tiny models of biological complexity: specific interactions among their many amino acids cause proteins to fold into elaborate structures, assemble with other proteins into higher-order complexes, and change their functions and structures upon binding other molecules. These complex features are classically thought to evolve via long and gradual trajectories driven by persistent natural selection. But a growing body of evidence from biochemistry, protein engineering, and molecular evolution shows that naturally occurring proteins often exist at or near the genetic edge of multimerization, allostery, and even new folds, so just one or a few mutations can trigger acquisition of these properties. These sudden transitions can occur because many of the physical properties that underlie these features are present in simpler proteins as fortuitous by-products of their architecture. Moreover, complex features of proteins can be encoded by huge arrays of sequences, so they are accessible from many different starting points via many possible paths. Because the bridges to these features are both short and numerous, random chance can join selection as a key factor in explaining the evolution of molecular complexity.
Affiliation(s)
- Arvind S. Pillai
- Department of Ecology and Evolution, University of Chicago, Chicago, Illinois, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Georg K.A. Hochberg
- Max Planck Institute for Terrestrial Microbiology, Marburg, Germany
- Department of Chemistry, Center for Synthetic Microbiology, Philipps University Marburg, Marburg, Germany
- Joseph W. Thornton
- Department of Ecology and Evolution, University of Chicago, Chicago, Illinois, USA
- Departments of Human Genetics and Ecology and Evolution, University of Chicago, Chicago, Illinois, USA
13
A Short Tale of the Origin of Proteins and Ribosome Evolution. Microorganisms 2022; 10:microorganisms10112115. [DOI: 10.3390/microorganisms10112115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/26/2022] [Revised: 09/30/2022] [Accepted: 10/19/2022] [Indexed: 11/16/2022] Open
Abstract
Proteins are the workhorses of the cell and have been key players throughout the evolution of all organisms, from the origin of life to the present era. How might life have originated from the prebiotic chemistry of early Earth? This is one of the most intriguing unsolved questions in biology. Currently, however, it is generally accepted that amino acids, the building blocks of proteins, were abiotically available on primitive Earth, which would have made the formation of early peptides in a similar fashion possible. Peptides are likely to have coevolved with ancestral forms of RNA. The ribosome is the most evident product of this coevolution process, a sophisticated nanomachine that performs the synthesis of proteins codified in genomes. In this general review, we explore the evolution of proteins from their peptide origins to their folding and regulation based on the example of superoxide dismutase (SOD1), a key enzyme in oxygen metabolism on modern Earth.
|
14
|
Jayaraman V, Toledo‐Patiño S, Noda‐García L, Laurino P. Mechanisms of protein evolution. Protein Sci 2022; 31:e4362. [PMID: 35762715 PMCID: PMC9214755 DOI: 10.1002/pro.4362] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/28/2022] [Revised: 05/11/2022] [Accepted: 05/14/2022] [Indexed: 11/06/2022]
Abstract
How do proteins evolve? How do changes in sequence mediate changes in protein structure, and in turn in function? This question has multiple angles, ranging from biochemistry and biophysics to evolutionary biology. This review provides a brief integrated view of some key mechanistic aspects of protein evolution. First, we explain how protein evolution is primarily driven by randomly acquired genetic mutations and selection for function, and how these mutations can even give rise to completely new folds. Then, we also comment on how phenotypic protein variability, including promiscuity, transcriptional and translational errors, may also accelerate this process, possibly via "plasticity-first" mechanisms. Finally, we highlight open questions in the field of protein evolution, with respect to the emergence of more sophisticated protein systems such as protein complexes, pathways, and the emergence of pre-LUCA enzymes.
Affiliation(s)
- Vijay Jayaraman
- Department of Molecular Cell Biology, Weizmann Institute of Science, Rehovot, Israel
| | - Saacnicteh Toledo‐Patiño
- Protein Engineering and Evolution Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
| | - Lianet Noda‐García
- Department of Plant Pathology and Microbiology, Institute of Environmental Sciences, Robert H. Smith Faculty of Agriculture, Food and Environment, Hebrew University of Jerusalem, Rehovot, Israel
| | - Paola Laurino
- Protein Engineering and Evolution Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
| |
|
15
|
León-González JA, Flatet P, Juárez-Ramírez MS, Farías-Rico JA. Folding and Evolution of a Repeat Protein on the Ribosome. Front Mol Biosci 2022; 9:851038. [PMID: 35707224 PMCID: PMC9189291 DOI: 10.3389/fmolb.2022.851038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/08/2022] [Accepted: 04/27/2022] [Indexed: 12/04/2022] Open
Abstract
Life on earth is the result of the work of proteins, the cellular nanomachines that fold into elaborate 3D structures to perform their functions. The ribosome synthesizes all the proteins of the biosphere, and many of them begin to fold during translation in a process known as cotranslational folding. In this work we discuss current advances of this field and provide computational and experimental data that highlight the role of the ribosome in the evolution of protein structures. First, we used the sequence of the Ankyrin domain from the Drosophila Notch receptor to launch a deep sequence-based search. With this strategy, we found a conserved 33-residue motif shared by different protein folds. Then, to see how the vectorial addition of the motif would generate a full structure, we measured the folding of the Ankyrin repeat protein on the ribosome. Not only are the on-ribosome folding data in full agreement with classical in vitro biophysical measurements, but they also provide experimental evidence on how folded proteins could have evolved by duplication and fusion of smaller fragments in the RNA world. Overall, we discuss how the ribosomal exit tunnel could be conceptualized as an active site that is under evolutionary pressure to influence protein folding.
Affiliation(s)
- José Alberto León-González
- Synthetic Biology Program, Center for Genome Sciences, National Autonomous University of Mexico, Cuernavaca, Mexico
| | - Perline Flatet
- Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden
| | - María Soledad Juárez-Ramírez
- Synthetic Biology Program, Center for Genome Sciences, National Autonomous University of Mexico, Cuernavaca, Mexico
| | - José Arcadio Farías-Rico
- Synthetic Biology Program, Center for Genome Sciences, National Autonomous University of Mexico, Cuernavaca, Mexico
- *Correspondence: José Arcadio Farías-Rico,
| |
|
16
|
Longo LM, Kolodny R, McGlynn SE. Evidence for the emergence of β-trefoils by 'Peptide Budding' from an IgG-like β-sandwich. PLoS Comput Biol 2022; 18:e1009833. [PMID: 35157697 PMCID: PMC8880906 DOI: 10.1371/journal.pcbi.1009833] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 01/07/2022] [Revised: 02/25/2022] [Accepted: 01/13/2022] [Indexed: 12/02/2022] Open
Abstract
As sequence and structure comparison algorithms gain sensitivity, the intrinsic interconnectedness of the protein universe has become increasingly apparent. Despite this general trend, β-trefoils have emerged as an uncommon counterexample: They are an isolated protein lineage for which few, if any, sequence or structure associations to other lineages have been identified. If β-trefoils are, in fact, remote islands in sequence-structure space, it implies that the oligomerizing peptide that founded the β-trefoil lineage itself arose de novo. To better understand β-trefoil evolution, and to probe the limits of fragment sharing across the protein universe, we identified both 'β-trefoil bridging themes' (evolutionarily-related sequence segments) and 'β-trefoil-like motifs' (structure motifs with a hallmark feature of the β-trefoil architecture) in multiple, ostensibly unrelated, protein lineages. The success of the present approach stems, in part, from considering β-trefoil sequence segments or structure motifs rather than the β-trefoil architecture as a whole, as has been done previously. The newly uncovered inter-lineage connections presented here suggest a novel hypothesis about the origins of the β-trefoil fold itself, namely, that it is a derived fold formed by 'budding' from an Immunoglobulin-like β-sandwich protein. These results demonstrate how the evolution of a folded domain from a peptide need not be a signature of antiquity and underpin an emerging truth: few protein lineages escape nature's sewing table.
Affiliation(s)
- Liam M. Longo
- Earth-Life Science Institute, Tokyo Institute of Technology, Tokyo, Japan
- Blue Marble Space Institute of Science, Seattle, Washington, United States of America
| | - Rachel Kolodny
- Department of Computer Science, University of Haifa, Haifa, Israel
| | - Shawn E. McGlynn
- Earth-Life Science Institute, Tokyo Institute of Technology, Tokyo, Japan
- Blue Marble Space Institute of Science, Seattle, Washington, United States of America
| |
|
17
|
Battu A, Purushotham R, Kaur R. An Assay to Determine NAD(P)H: Quinone Oxidoreductase Activity in Cell Extracts from Candida glabrata. Bio Protoc 2021; 11:e4210. [PMID: 34859125 DOI: 10.21769/bioprotoc.4210] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/11/2021] [Revised: 08/18/2021] [Accepted: 08/20/2021] [Indexed: 11/02/2022] Open
Abstract
Flavodoxin-like proteins (Fld-LPs) are an important constituent of the oxidative stress defense system in several organisms and highly conserved from bacteria to humans. These proteins possess NAD(P)H:quinone oxidoreductase activity and convert quinones to hydroquinones through two-electron reduction, using NAD(P)H and quinone as electron donor and acceptor, respectively. Purified yeast and bacterial Fld-LPs exhibit NAD(P)H:quinone oxidoreductase activity in vitro. Here, we describe a protocol to measure oxidoreductase activity of Fld-LPs that are present in extracts of whole cells. We have recently shown that the assembly and activity of a Fld-LP, CgPst2, is regulated by an aspartyl protease-mediated cleavage of its C-terminus in the pathogenic yeast Candida glabrata. Mutant yeast where the CgPST2 gene was deleted lacked cellular NAD(P)H:quinone oxidoreductase activity and displayed elevated susceptibility to menadione stress. The protocol described herein is based on the measurement of NADH oxidation (conversion of NADH to NAD+) by endogenous Fld-LPs in the presence of the quinone menadione. This assay can be performed with whole cell lysates prepared by the mechanical lysis of C. glabrata cells and does not require expression and purification of Fld-LPs from a heterologous system, thereby allowing researchers to study the effect of different posttranslational modifications and varied structural states of Fld-LPs on their enzymatic activities. Since many Fld-LPs are known to exist in dimeric and tetrameric states possessing differential activities, our efficient and easy-to-use assay can reliably detect and validate their quinone reductase activities. Although we have used menadione with CgPst2 enzyme in our study, the protocol can easily be modified to examine the presence of Fld-LPs with specificity for other quinones. As this assay does not require many expensive chemicals, it can readily be scaled up and adapted for other medically important fungi and potentially be a useful tool to characterize fungal oxidative stress response systems and screen inhibitors specific for fungal Fld-LPs, thereby contributing to our understanding of fungal pathogenesis mechanisms.
Affiliation(s)
- Anamika Battu
- Laboratory of Fungal Pathogenesis, Centre for DNA Fingerprinting and Diagnostics, Hyderabad-500039, India
| | - Rajaram Purushotham
- Laboratory of Fungal Pathogenesis, Centre for DNA Fingerprinting and Diagnostics, Hyderabad-500039, India
| | - Rupinder Kaur
- Laboratory of Fungal Pathogenesis, Centre for DNA Fingerprinting and Diagnostics, Hyderabad-500039, India
| |
|
18
|
Freire MÁ. Short non-coded peptides interacting with cofactors facilitated the integration of early chemical networks. Biosystems 2021; 211:104547. [PMID: 34547425 DOI: 10.1016/j.biosystems.2021.104547] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/31/2021] [Revised: 08/28/2021] [Accepted: 09/15/2021] [Indexed: 11/02/2022]
Abstract
Independently developed iron-sulphur/thioester- and phosphate-driven chemical reactions would have set up two distinct reaction networks prior to coupling in a proto-metabolic system supporting a minimal organisation closure. Each chemical system assisted initially by simple catalysts and then by more complex cofactors would have provided the precursors of the small metabolites and monomer units along with their respective polymers through dehydrating template-independent assemblies. For example, acylation reactions mediated by activated thioester groups produced peptides, fatty acids and polyhydroxyalkanoates, while phosphorylation reactions by phosphorylating agents allowed the synthesis of polysaccharides, polyribonucleotides and polyphosphates. Here, we address how these independent chemical systems might have fit together and shaped a proto-metabolic system, focusing specifically on cofactors as molecular fossils of metabolism. As a result, the proposed overview suggests that non-coded peptides capable of binding a variety of ligands, in particular those with redox-active versatility and/or group transfer potential, could have facilitated the chemical connections that led to a minimal closure with a proto-metabolism. Later developments would have made it possible to establish a cellular organisation with more complex and interdependent metabolic pathways.
Affiliation(s)
- Miguel Ángel Freire
- Instituto Multidisciplinario de Biología Vegetal (IMBIV), CONICET, Universidad Nacional de Córdoba (UNC). Facultad de Ciencias Exactas, Físicas y Naturales. Av. Vélez Sarsfield 299, CC 495, 5000, Córdoba, Argentina.
| |
|
19
|
Ferruz N, Michel F, Lobos F, Schmidt S, Höcker B. Fuzzle 2.0: Ligand Binding in Natural Protein Building Blocks. Front Mol Biosci 2021; 8:715972. [PMID: 34485385 PMCID: PMC8416435 DOI: 10.3389/fmolb.2021.715972] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 05/27/2021] [Accepted: 08/06/2021] [Indexed: 11/13/2022] Open
Abstract
Modern proteins have been shown to share evolutionary relationships via subdomain-sized fragments. The assembly of such fragments through duplication and recombination events led to the complex structures and functions we observe today. We previously implemented a pipeline that identified more than 1,000 of these fragments that are shared by different protein folds and developed a web interface to analyze and search for them. This resource, named Fuzzle, helps structural and evolutionary biologists to identify and analyze conserved parts of a protein, but it also provides protein engineers with building blocks, for example, to design proteins by fragment combination. Here, we describe a new version of this web resource that was extended to include ligand information. This addition is a significant asset to the database since now protein fragments that bind specific ligands can be identified and analyzed. Often the mode of ligand binding is conserved in proteins, thereby supporting a common evolutionary origin. The same can now be explored for subdomain-sized fragments within this database. This ligand binding information can also be used in protein engineering to graft binding pockets into other protein scaffolds or to transfer functional sites via recombination of a specific fragment. Fuzzle 2.0 is freely available at https://fuzzle.uni-bayreuth.de/2.0.
Affiliation(s)
- Noelia Ferruz
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
| | - Florian Michel
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
| | - Francisco Lobos
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
| | - Steffen Schmidt
- Computational Biochemistry, University of Bayreuth, Bayreuth, Germany
| | - Birte Höcker
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
| |
|
20
|
Romero-Romero S, Kordes S, Michel F, Höcker B. Evolution, folding, and design of TIM barrels and related proteins. Curr Opin Struct Biol 2021; 68:94-104. [PMID: 33453500 PMCID: PMC8250049 DOI: 10.1016/j.sbi.2020.12.007] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Received: 10/26/2020] [Revised: 12/13/2020] [Accepted: 12/14/2020] [Indexed: 12/16/2022]
Abstract
Proteins are chief actors in life that perform a myriad of exquisite functions. This diversity has been enabled through the evolution and diversification of protein folds. Analysis of sequences and structures strongly suggests that numerous protein pieces have been reused as building blocks and propagated to many modern folds. This information can be traced to understand how the protein world has diversified. In this review, we discuss the latest advances in the analysis of protein evolutionary units, and we use as a model system one of the most abundant and versatile topologies, the TIM-barrel fold, to highlight the existing common principles that interconnect protein evolution, structure, folding, function, and design.
Affiliation(s)
| | - Sina Kordes
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany
| | - Florian Michel
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany
| | - Birte Höcker
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany.
| |
|
21
|
Kolodny R, Nepomnyachiy S, Tawfik DS, Ben-Tal N. Bridging Themes: Short Protein Segments Found in Different Architectures. Mol Biol Evol 2021; 38:2191-2208. [PMID: 33502503 PMCID: PMC8136508 DOI: 10.1093/molbev/msab017] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Indexed: 12/20/2022] Open
Abstract
The vast majority of theoretically possible polypeptide chains do not fold, let alone confer function. Hence, protein evolution from preexisting building blocks has clear potential advantages over ab initio emergence from random sequences. In support of this view, sequence similarities between different proteins are generally indicative of common ancestry, and we collectively refer to such homologous sequences as "themes." At the domain level, sequence homology is routinely detected. However, short themes which are segments, or fragments of intact domains, are particularly interesting because they may provide hints about the emergence of domains, as opposed to divergence of preexisting domains, or their mixing-and-matching to form multi-domain proteins. Here we identified 525 representative short themes, comprising 20-80 residues that are unexpectedly shared between domains considered to have emerged independently. Among these "bridging themes" are ones shared between the most ancient domains, for example, Rossmann, P-loop NTPase, TIM-barrel, flavodoxin, and ferredoxin-like. We elaborate on several particularly interesting cases, where the bridging themes mediate ligand binding. Ligand binding may have contributed to the stability and the plasticity of these building blocks, and to their ability to invade preexisting domains or serve as starting points for completely new domains.
Affiliation(s)
- Rachel Kolodny
- Department of Computer Science, University of Haifa, Haifa, Israel
| | | | - Dan S Tawfik
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Nir Ben-Tal
- George S. Wise Faculty of Life Sciences, Department of Biochemistry and Molecular Biology, Tel Aviv University, Tel Aviv, Israel
| |
|
22
|
Heizinger L, Merkl R. Evidence for the preferential reuse of sub-domain motifs in primordial protein folds. Proteins 2021; 89:1167-1179. [PMID: 33957009 DOI: 10.1002/prot.26089] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 01/04/2021] [Revised: 04/15/2021] [Accepted: 04/28/2021] [Indexed: 11/06/2022]
Abstract
A comparison of protein backbones makes clear that not more than approximately 1400 different folds exist, each specifying the three-dimensional topology of a protein domain. Large proteins are composed of specific domain combinations and many domains can accommodate different functions. These findings confirm that the reuse of domains is key for the evolution of multi-domain proteins. If reuse was also the driving force for domain evolution, ancestral fragments of sub-domain size exist that are shared between domains possessing significantly different topologies. For the fully automated detection of putatively ancestral motifs, we developed the algorithm Fragstatt that compares proteins pairwise to identify fragments, that is, instantiations of the same motif. To reach maximal sensitivity, Fragstatt compares sequences by means of cascaded alignments of profile Hidden Markov Models. If the fragment sequences are sufficiently similar, the program determines and scores the structural concordance of the fragments. By analyzing a comprehensive set of proteins from the CATH database, Fragstatt identified 12 532 partially overlapping and structurally similar motifs that clustered to 134 unique motifs. The dissemination of these motifs is limited: We found only two domain topologies that contain two different motifs and generally, these motifs occur in not more than 18% of the CATH topologies. Interestingly, motifs are enriched in topologies that are considered ancestral. Thus, our findings suggest that the reuse of sub-domain sized fragments was relevant in early phases of protein evolution and became less important later on.
Affiliation(s)
- Leonhard Heizinger
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Regensburg, Germany
| | - Rainer Merkl
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Regensburg, Germany
| |
|
23
|
Ferruz N, Noske J, Höcker B. Protlego: A Python package for the analysis and design of chimeric proteins. Bioinformatics 2021; 37:3182-3189. [PMID: 33901273 PMCID: PMC8504633 DOI: 10.1093/bioinformatics/btab253] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Received: 10/25/2020] [Revised: 03/05/2021] [Accepted: 04/19/2021] [Indexed: 01/03/2023] Open
Abstract
Motivation: Duplication and recombination of protein fragments have led to the highly diverse protein space that we observe today. By mimicking this natural process, the design of protein chimeras via fragment recombination has proven experimentally successful and has opened a new era for the design of customizable proteins. The in silico building of structural models for these chimeric proteins, however, remains a manual task that requires a considerable degree of expertise and is not amenable for high-throughput studies. Energetic and structural analysis of the designed proteins often require the use of several tools, each with their unique technical difficulties and available in different programming languages or web servers. Results: We implemented a Python package that enables automated, high-throughput design of chimeras and their structural analysis. First, it fetches evolutionarily conserved fragments from a built-in database (also available at fuzzle.uni-bayreuth.de). These relationships can then be represented via networks or further selected for chimera construction via recombination. Designed chimeras or natural proteins are then scored and minimized with the Charmm and Amber forcefields and their diverse structural features can be analyzed at ease. Here, we showcase Protlego’s pipeline by exploring the relationships between the P-loop and Rossmann superfolds, building and characterizing their offspring chimeras. We believe that Protlego provides a powerful new tool for the protein design community. Availability and implementation: Protlego runs on the Linux platform and is freely available at (https://hoecker-lab.github.io/protlego/) with tutorials and documentation. Supplementary information: Supplementary data are available at Bioinformatics online.
Affiliation(s)
- Noelia Ferruz
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
| | - Jakob Noske
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
| | - Birte Höcker
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
| |
|
24
|
Janaki C, Gowri VS, Srinivasan N. Master Blaster: an approach to sensitive identification of remotely related proteins. Sci Rep 2021; 11:8746. [PMID: 33888741 PMCID: PMC8062480 DOI: 10.1038/s41598-021-87833-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/02/2019] [Accepted: 04/06/2021] [Indexed: 11/11/2022] Open
Abstract
Genome sequencing projects unearth the sequences of all the proteins encoded in a genome. As the first step, homology detection is employed to obtain clues to structure and function of these proteins. However, high evolutionary divergence between homologous proteins challenges our ability to detect distant relationships. In the past, an approach involving multiple Position Specific Scoring Matrices (PSSMs) was found to be more effective than traditional single PSSMs. Cascaded search is another successful approach where hits of a search are queried to detect more homologues. We propose a protocol, ‘Master Blaster’, which combines the principles adopted in these two approaches to enhance our ability to detect remote homologues even further. Assessment of the approach was performed using known relationships available in the SCOP70 database, and the results were compared against those of PSI-BLAST and HHblits, a hidden Markov model-based method. Compared to PSI-BLAST, Master Blaster resulted in a 10% improvement with respect to detection of cross superfamily connections, nearly 35% improvement in cross family and more than 80% improvement in intra family connections. From the results it was observed that HHblits is more sensitive in detecting remote homologues compared to Master Blaster. However, there are true hits from 46 folds for which Master Blaster reported homologs that are not reported by HHblits even using the optimal parameters, indicating that, for detecting remote homologues, the use of multiple methods employing a combination of different approaches can be more effective. Master Blaster stand-alone code is available for download in the supplementary archive.
Affiliation(s)
- Chintalapati Janaki
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, 560012, India; Centre for Development of Advanced Computing, Knowledge Park, Byappanahalli, Bangalore, 560038, India
| | - Venkatraman S Gowri
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, 560012, India; Department of Chemistry, Auxilium College, Gandhinagar, Vellore, 632006, India
| | | |
|
25
|
Battu A, Purushotham R, Dey P, Vamshi SS, Kaur R. An aspartyl protease-mediated cleavage regulates structure and function of a flavodoxin-like protein and aids oxidative stress survival. PLoS Pathog 2021; 17:e1009355. [PMID: 33630938 PMCID: PMC7943015 DOI: 10.1371/journal.ppat.1009355] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 09/16/2020] [Revised: 03/09/2021] [Accepted: 02/02/2021] [Indexed: 11/30/2022] Open
Abstract
A family of eleven glycosylphosphatidylinositol-anchored aspartyl proteases, commonly referred to as CgYapsins, regulate a myriad of cellular processes in the pathogenic yeast Candida glabrata, but their protein targets are largely unknown. Here, using the immunoprecipitation-mass spectrometry approach, we identify the flavodoxin-like protein (Fld-LP), CgPst2, to be an interactor of one of the aspartyl proteases, CgYps1. We also report the presence of four Fld-LPs in C. glabrata, which are required for survival in kidneys in the murine model of systemic candidiasis. We further demonstrated that, of the four Fld-LPs, CgPst2 was solely required for menadione detoxification. CgPst2 was found to form homo-oligomers, and contribute to cellular NADH:quinone oxidoreductase activity. CgYps1 cleaved CgPst2 at the C-terminus, and this cleavage was pivotal to oligomerization, activity and function of CgPst2. The arginine-174 residue in CgPst2 was essential for CgYps1-mediated cleavage, with alanine substitution of the arginine-174 residue also leading to elevated activity and oligomerization of CgPst2. Finally, we demonstrate that menadione treatment led to increased CgPst2 and CgYps1 protein levels, diminished CgYps1-CgPst2 interaction, and enhanced CgPst2 cleavage and activity, thereby implicating CgYps1 in activating CgPst2. Altogether, our findings of proteolytic cleavage as a key regulatory determinant of CgPst2, which belongs to the family of highly conserved, electron-carrier flavodoxin-fold-containing proteins, constituting cellular oxidative stress defense system in diverse organisms, unveil a hidden regulatory layer of environmental stress response mechanisms.
Affiliation(s)
- Anamika Battu
- Laboratory of Fungal Pathogenesis, Centre for DNA Fingerprinting and Diagnostics, Hyderabad, India
- Graduate studies, Manipal Academy of Higher Education, Manipal, Karnataka, India
| | - Rajaram Purushotham
- Laboratory of Fungal Pathogenesis, Centre for DNA Fingerprinting and Diagnostics, Hyderabad, India
| | - Partha Dey
- Laboratory of Fungal Pathogenesis, Centre for DNA Fingerprinting and Diagnostics, Hyderabad, India
| | - S. Surya Vamshi
- Laboratory of Fungal Pathogenesis, Centre for DNA Fingerprinting and Diagnostics, Hyderabad, India
| | - Rupinder Kaur
- Laboratory of Fungal Pathogenesis, Centre for DNA Fingerprinting and Diagnostics, Hyderabad, India
| |
|
26
|
Searching protein space for ancient sub-domain segments. Curr Opin Struct Biol 2021; 68:105-112. [PMID: 33476896 DOI: 10.1016/j.sbi.2020.11.006] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Received: 10/11/2020] [Accepted: 11/29/2020] [Indexed: 01/08/2023]
Abstract
Evolutionary processes that formed the current protein universe left their traces, among them homologous segments that recur, or are 'reused,' in multiple proteins. These reused segments, called 'themes,' can be found at various scales, the best known of which is the domain. Yet, recent studies have begun to focus on the evolutionary insights that can be derived from sub-domain-scale themes, which are candidates for traces of more ancient events. Characterizing these may provide clues to the emergence of domains. Particularly interesting are themes that are reused across dissimilar contexts, that is, where the rest of the protein domain differs. We survey computational studies identifying reused themes within different contexts at the sub-domain level.
|
27
|
Ferruz N, Lobos F, Lemm D, Toledo-Patino S, Farías-Rico JA, Schmidt S, Höcker B. Identification and Analysis of Natural Building Blocks for Evolution-Guided Fragment-Based Protein Design. J Mol Biol 2020; 432:3898-3914. [PMID: 32330481 PMCID: PMC7322520 DOI: 10.1016/j.jmb.2020.04.013] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Received: 12/23/2019] [Revised: 04/12/2020] [Accepted: 04/13/2020] [Indexed: 12/15/2022]
Abstract
Natural evolution has generated an impressively diverse protein universe via duplication and recombination from a set of protein fragments that served as building blocks. The application of these concepts to the design of new proteins using subdomain-sized fragments from different folds has proven to be experimentally successful. To better understand how evolution has shaped our protein universe, we performed an all-against-all comparison of protein domains representing all naturally existing folds and identified conserved homologous protein fragments. Overall, we found more than 1000 protein fragments of various lengths among different folds through similarity network analysis. These fragments are present in very different protein environments and represent versatile building blocks for protein design. These data are available in our web server called F(old P)uzzle (fuzzle.uni-bayreuth.de), which allows users to individually filter the dataset and create customized networks for folds of interest. We believe that our results serve as an invaluable resource for structural and evolutionary biologists and as raw material for the design of custom-made proteins.
Affiliation(s)
- Noelia Ferruz
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
| | - Francisco Lobos
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany; Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Dominik Lemm
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
| | - Saacnicteh Toledo-Patino
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany; Max Planck Institute for Developmental Biology, Tübingen, Germany
| | | | - Steffen Schmidt
- Max Planck Institute for Developmental Biology, Tübingen, Germany; Computational Biochemistry, University of Bayreuth, Bayreuth, Germany.
| | - Birte Höcker
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany; Max Planck Institute for Developmental Biology, Tübingen, Germany.
| |
|
28
|
Abstract
Life on Earth is driven by electron transfer reactions catalyzed by a suite of enzymes that comprise the superfamily of oxidoreductases (Enzyme Classification EC1). Most modern oxidoreductases are complex in their structure and chemistry and must have evolved from a small set of ancient folds. Ancient oxidoreductases from the Archean Eon between ca. 3.5 and 2.5 billion years ago have been long extinct, making it challenging to retrace evolution by sequence-based phylogeny or ancestral sequence reconstruction. However, three-dimensional topologies of proteins change more slowly than sequences. Using comparative structure and sequence profile-profile alignments, we quantify the similarity between proximal cofactor-binding folds and show that they are derived from a common ancestor. We discovered that two recurring folds were central to the origin of metabolism: ferredoxin and Rossmann-like folds. In turn, these two folds likely shared a common ancestor that, through duplication, recruitment, and diversification, evolved to facilitate electron transfer and catalysis at a very early stage in the origin of metabolism.
|
29
|
Short and simple sequences favored the emergence of N-helix phospho-ligand binding sites in the first enzymes. Proc Natl Acad Sci U S A 2020; 117:5310-5318. [PMID: 32079722 DOI: 10.1073/pnas.1911742117] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Indexed: 11/18/2022] Open
Abstract
The ubiquity of phospho-ligands suggests that phosphate binding emerged at the earliest stage of protein evolution. To evaluate this hypothesis and unravel its details, we identified all phosphate-binding protein lineages in the Evolutionary Classification of Protein Domains database. We found at least 250 independent evolutionary lineages that bind small molecule cofactors and metabolites with phosphate moieties. For many lineages, phosphate binding emerged later as a niche functionality, but for the oldest protein lineages, phosphate binding was the founding function. Across some 4 billion y of protein evolution, side-chain binding, in which the phosphate moiety does not interact with the backbone at all, emerged most frequently. However, in the oldest lineages, and most characteristically in αβα sandwich enzyme domains, N-helix binding sites dominate, where the phosphate moiety sits atop the N terminus of an α-helix. This discrepancy is explained by the observation that N-helix binding is uniquely realized by short, contiguous sequences with reduced amino acid diversity, foremost Gly, Ser, and Thr. The latter two amino acids preferentially interact with both the backbone amide and the side-chain hydroxyl (bidentate interaction) to promote binding by short sequences. We conclude that the first αβα sandwich domains emerged from shorter and simpler polypeptides that bound phospho-ligands via N-helix sites.
|
30
|
Michalska K, Kowiel M, Bigelow L, Endres M, Gilski M, Jaskolski M, Joachimiak A. 3D domain swapping in the TIM barrel of the α subunit of Streptococcus pneumoniae tryptophan synthase. Acta Crystallogr D Struct Biol 2020; 76:166-175. [PMID: 32038047 PMCID: PMC7008512 DOI: 10.1107/s2059798320000212] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 04/08/2019] [Accepted: 01/08/2020] [Indexed: 02/10/2023] Open
Abstract
Tryptophan synthase catalyzes the last two steps of tryptophan biosynthesis in plants, fungi and bacteria. It consists of two protein chains, designated α and β, encoded by trpA and trpB genes, that function as an αββα complex. Structural and functional features of tryptophan synthase have been extensively studied, explaining the roles of individual residues in the two active sites in catalysis and allosteric regulation. TrpA serves as a model for protein-folding studies. In 1969, Jackson and Yanofsky observed that the typically monomeric TrpA forms a small population of dimers. Dimerization was postulated to take place through an exchange of structural elements of the monomeric chains, a phenomenon later termed 3D domain swapping. The structural details of the TrpA dimer have remained unknown. Here, the crystal structure of the Streptococcus pneumoniae TrpA homodimer is reported, demonstrating 3D domain swapping in a TIM-barrel fold for the first time. The N-terminal domain comprising the H0-S1-H1-S2 elements is exchanged, while the hinge region corresponds to loop L2 linking strand S2 to helix H2'. The structural elements S2 and L2 carry the catalytic residues Glu52 and Asp63. As the S2 element is part of the swapped domain, the architecture of the catalytic apparatus in the dimer is recreated from two protein chains. The homodimer interface overlaps with the α-β interface of the tryptophan synthase αββα heterotetramer, suggesting that the 3D domain-swapped dimer cannot form a complex with the β subunit. In the crystal, the dimers assemble into a decamer comprising two pentameric rings.
Affiliation(s)
- Karolina Michalska
- Midwest Center for Structural Genomics, X-ray Science Division, Argonne National Laboratory, Argonne, IL 60439, USA
- Center for Structural Genomics of Infectious Diseases, Consortium for Advanced Science and Engineering, University of Chicago, Chicago, IL 60637, USA
- Structural Biology Center, X-ray Science Division, Argonne National Laboratory, Argonne, IL 60439, USA
| | - Marcin Kowiel
- Center for Biocrystallographic Research, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
| | - Lance Bigelow
- Midwest Center for Structural Genomics, X-ray Science Division, Argonne National Laboratory, Argonne, IL 60439, USA
| | - Michael Endres
- Midwest Center for Structural Genomics, X-ray Science Division, Argonne National Laboratory, Argonne, IL 60439, USA
| | - Miroslaw Gilski
- Center for Biocrystallographic Research, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
- Department of Crystallography, Faculty of Chemistry, A. Mickiewicz University, Poznan, Poland
| | - Mariusz Jaskolski
- Center for Biocrystallographic Research, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
- Department of Crystallography, Faculty of Chemistry, A. Mickiewicz University, Poznan, Poland
| | - Andrzej Joachimiak
- Midwest Center for Structural Genomics, X-ray Science Division, Argonne National Laboratory, Argonne, IL 60439, USA
- Center for Structural Genomics of Infectious Diseases, Consortium for Advanced Science and Engineering, University of Chicago, Chicago, IL 60637, USA
- Structural Biology Center, X-ray Science Division, Argonne National Laboratory, Argonne, IL 60439, USA
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, IL 60637, USA
| |
|
31
|
Toledo-Patiño S, Chaubey M, Coles M, Höcker B. Reconstructing the Remote Origins of a Fold Singleton from a Flavodoxin-Like Ancestor. Biochemistry 2019; 58:4790-4793. [PMID: 31724394 PMCID: PMC6968885 DOI: 10.1021/acs.biochem.9b00900] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Indexed: 02/04/2023]
Abstract
Evolutionary processes that led to the emergence of structured protein domains left footprints in the sequences of modern proteins. We searched for such hints employing state-of-the-art sequence analysis and found evidence that the HemD-like fold emerged from the flavodoxin-like fold through segment swap and gene duplication. To verify this hypothesis, we reverted these evolutionary steps experimentally, constructing a HemD-half that resulted in a protein with the canonical flavodoxin-like architecture. These results of fold reconstruction from the sequence of a different fold strongly support our hypothesis of common ancestry. It further illustrates the plasticity of modern proteins to form new folded proteins.
Affiliation(s)
- Saacnicteh Toledo-Patiño
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany; Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
| | - Manish Chaubey
- Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
| | - Murray Coles
- Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
| | - Birte Höcker
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany; Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
| |
|
32
|
Berezovsky IN. Towards descriptor of elementary functions for protein design. Curr Opin Struct Biol 2019; 58:159-165. [PMID: 31352188 DOI: 10.1016/j.sbi.2019.06.010] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Received: 04/19/2019] [Accepted: 06/18/2019] [Indexed: 11/18/2022]
Abstract
We review studies of protein evolution that help to formulate rules for protein design. Acknowledging the fundamental importance of Dayhoff's provision on the emergence of functional proteins from short peptides, we discuss multiple lines of evidence for the omnipresent partitioning of protein globules into structural/functional units, the use of which greatly facilitates engineering and design efforts. Closed loops and elementary functional loops, which are descendants of ancient ring-like peptides that formed the first protein domains in agreement with Dayhoff's hypothesis, can be considered as basic units of protein structure and function. We argue that future developments in protein design approaches should consider descriptors of the elementary functions, which will help to complement designed scaffolds with functional signatures and flexibility necessary for their functions.
Affiliation(s)
- Igor N Berezovsky
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore; Department of Biological Sciences (DBS), National University of Singapore (NUS), 8 Medical Drive, 117579, Singapore.
| |
|
33
|
Krishnakumar P, Riemer S, Perera R, Lingner T, Goloborodko A, Khalifa H, Bontems F, Kaufholz F, El-Brolosy MA, Dosch R. Functional equivalence of germ plasm organizers. PLoS Genet 2018; 14:e1007696. [PMID: 30399145 PMCID: PMC6219760 DOI: 10.1371/journal.pgen.1007696] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Received: 03/14/2018] [Accepted: 09/16/2018] [Indexed: 11/18/2022] Open
Abstract
The proteins Oskar (Osk) in Drosophila and Bucky ball (Buc) in zebrafish act as germ plasm organizers. Both proteins recapitulate germ plasm activities but seem to be unique to their animal groups. Here, we discover that Osk and Buc show similar activities during germ cell specification. Drosophila Osk induces additional PGCs in zebrafish. Surprisingly, Osk and Buc do not show homologous protein motifs that would explain their related function. Nonetheless, we detect that both proteins contain stretches of intrinsically disordered regions (IDRs), which seem to be involved in protein aggregation. IDRs are known to rapidly change their sequence during evolution, which might obscure biochemical interaction motifs. Indeed, we show that Buc binds to the known Oskar interactors Vasa protein and nanos mRNA indicating conserved biochemical activities. These data provide a molecular framework for two proteins with unrelated sequence but with equivalent function to assemble a conserved core-complex nucleating germ plasm. Multicellular organisms use gametes for their propagation. Gametes are formed from germ cells, which are specified during embryogenesis in some animals by the inheritance of RNP granules known as germ plasm. Transplantation of germ plasm induces extra germ cells, whereas germ plasm ablation leads to the loss of gametes and sterility. Therefore, germ plasm is key for germ cell formation and reproduction. However, the molecular mechanisms of germ cell specification by germ plasm in the vertebrate embryo remain an unsolved question. Proteins, which assemble the germ plasm, are known as germ plasm organizers. Here, we show that the two germ plasm organizers Oskar from the fly and Bucky ball from the fish show similar functions by using a cross species approach. Both are intrinsically disordered proteins, which rapidly changed their sequence during evolution. Moreover, both proteins still interact with conserved components of the germ cell specification pathway. 
These data might provide a first example of two proteins with the same biological role, but distinct sequence.
Affiliation(s)
- Pritesh Krishnakumar
- Institute for Developmental Biochemistry, University Medical Center, Göttingen, Germany
| | - Stephan Riemer
- Institute for Developmental Biochemistry, University Medical Center, Göttingen, Germany
| | - Roshan Perera
- Institute for Developmental Biochemistry, University Medical Center, Göttingen, Germany
| | - Thomas Lingner
- Institute for Developmental Biochemistry, University Medical Center, Göttingen, Germany
| | - Alexander Goloborodko
- Institute for Developmental Biochemistry, University Medical Center, Göttingen, Germany
| | - Hazem Khalifa
- Institute for Developmental Biochemistry, University Medical Center, Göttingen, Germany
| | - Franck Bontems
- Laboratory of Metabolism, Department of Internal Medicine Specialties, Faculty of Medicine, University of Geneva, Switzerland
| | - Felix Kaufholz
- Institute for Developmental Biochemistry, University Medical Center, Göttingen, Germany
| | - Mohamed A. El-Brolosy
- Institute for Developmental Biochemistry, University Medical Center, Göttingen, Germany
| | - Roland Dosch
- Institute for Developmental Biochemistry, University Medical Center, Göttingen, Germany
- Institute of Human Genetics, University Medical Center, Göttingen, Germany
| |
|
34
|
Navigating Among Known Structures in Protein Space. Methods Mol Biol 2018. [PMID: 30298400 DOI: 10.1007/978-1-4939-8736-8_12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0]
Abstract
Present-day protein space is the result of 3.7 billion years of evolution, constrained by the underlying physicochemical qualities of the proteins. It is difficult to differentiate between evolutionary traces and effects of physicochemical constraints. Nonetheless, as a rule of thumb, instances of structural reuse, or focusing on structural similarity, are likely attributable to physicochemical constraints, whereas sequence reuse, or focusing on sequence similarity, may be more indicative of evolutionary relationships. Both types of relationships have been studied and can provide meaningful insights to protein biophysics and evolution, which in turn can lead to better algorithms for protein search, annotation, and maybe even design. In broad strokes, studies of protein space vary in the entities they represent, the similarity measure comparing these entities, and the representation used. The entities can be, for example, protein chains, domains, supra-domains, or smaller protein sub-parts denoted themes. The measures of similarity between the entities can be based on sequence, structure, function, or any combination of these. The representation can be global, encompassing the whole space, or local, focusing on a particular region surrounding protein(s) of interest. Global representations include lists of grouped proteins, protein networks, and maps. Networks are the abstraction that is derived most directly from the similarity data: each node is the protein entity (e.g., a domain), and edges connect similar domains. Selecting the entities, the similarity measure, and the abstraction are three intertwined decisions: the similarity measures allow us to identify the entities, and the selection of entities influences what is a meaningful similarity measure. Similarly, we seek entities that are related to each other in a way for which a simple representation describes their relationships succinctly and accurately.
This chapter will cover studies that rely on different entities, similarity measures, and a range of representations to better understand protein structure space. Scholars may use publicly available navigators offering a global representation, and in particular the hierarchical classifications SCOP, CATH, and ECOD, or a local representation, which encompass structural alignment algorithms. Alternatively, scholars can configure their own navigator using existing tools. To demonstrate this DIY (do it yourself) approach for navigating in protein space, we investigate substrate-binding proteins. By presenting sequence similarities among this large and diverse protein family as a network, we can infer that one member (pdb ID 4ntl; of yet unknown function) may bind methionine and suggest a putative binding mechanism.
|
35
|
Lechner H, Ferruz N, Höcker B. Strategies for designing non-natural enzymes and binders. Curr Opin Chem Biol 2018; 47:67-76. [PMID: 30248579 DOI: 10.1016/j.cbpa.2018.07.022] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Received: 05/06/2018] [Revised: 07/16/2018] [Accepted: 07/17/2018] [Indexed: 12/20/2022]
Abstract
The design of tailor-made enzymes is a major goal in biochemical research that can result in wide-range applications and will lead to a better understanding of how proteins fold and function. In this review we highlight recent advances in enzyme and small molecule binder design. A focus is placed on novel strategies for the design of scaffolds, developments in computational methods, and recent applications of these techniques on receptors, sensors, and enzymes. Further, the integration of computational and experimental methodologies is discussed. The outlined examples of designed enzymes and binders for various purposes highlight the importance of this topic and underline the need for tailor-made proteins.
Affiliation(s)
- Horst Lechner
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany
| | - Noelia Ferruz
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany
| | - Birte Höcker
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany.
| |
|
36
|
|
37
|
Erb TJ, Zarzycki J. A short history of RubisCO: the rise and fall (?) of Nature's predominant CO2 fixing enzyme. Curr Opin Biotechnol 2017; 49:100-107. [PMID: 28843191 DOI: 10.1016/j.copbio.2017.07.017] [Citation(s) in RCA: 146] [Impact Index Per Article: 20.9] [Received: 06/26/2017] [Revised: 07/26/2017] [Accepted: 07/26/2017] [Indexed: 11/18/2022]
Abstract
Ribulose-1,5-bisphosphate carboxylase/oxygenase (RubisCO) is arguably one of the most abundant proteins in the biosphere and a key enzyme in the global carbon cycle. Although RubisCO has been intensively studied, its evolutionary origins and rise as Nature's most dominant carbon dioxide (CO2)-fixing enzyme still remain in the dark. In this review we will bring together biochemical, structural, physiological, microbiological, as well as phylogenetic data to speculate on the evolutionary roots of the CO2-fixation reaction of RubisCO, the emergence of RubisCO-based autotrophic CO2-fixation in the context of the Calvin-Benson-Bassham cycle, and the further evolution of RubisCO into the 'RubisCOsome', a complex of various proteins assembling and interacting with the enzyme to improve its operational capacity (functionality) under different biological and environmental conditions.
Affiliation(s)
- Tobias J Erb
- Max Planck Institute for Terrestrial Microbiology, Department of Biochemistry and Synthetic Metabolism, Karl-von-Frisch-Str. 10, 35043 Marburg, Germany.
| | - Jan Zarzycki
- Max Planck Institute for Terrestrial Microbiology, Department of Biochemistry and Synthetic Metabolism, Karl-von-Frisch-Str. 10, 35043 Marburg, Germany
| |
|
38
|
Houwman JA, van Mierlo CPM. Folding of proteins with a flavodoxin-like architecture. FEBS J 2017; 284:3145-3167. [PMID: 28380286 DOI: 10.1111/febs.14077] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Received: 01/31/2017] [Revised: 03/13/2017] [Accepted: 04/03/2017] [Indexed: 12/21/2022]
Abstract
The flavodoxin-like fold is a protein architecture that can be traced back to the universal ancestor of the three kingdoms of life. Many proteins share this α-β parallel topology and hence it is highly relevant to illuminate how they fold. Here, we review experiments and simulations concerning the folding of flavodoxins and CheY-like proteins, which share the flavodoxin-like fold. These polypeptides tend to temporarily misfold during unassisted folding to their functionally active forms. This susceptibility to frustration is caused by the more rapid formation of an α-helix compared to a β-sheet, particularly when a parallel β-sheet is involved. As a result, flavodoxin-like proteins form intermediates that are off-pathway to native protein and several of these species are molten globules (MGs). Experiments suggest that the off-pathway species are of helical nature and that flavodoxin-like proteins have a nonconserved transition state that determines the rate of productive folding. Folding of flavodoxin from Azotobacter vinelandii has been investigated extensively, enabling a schematic construction of its folding energy landscape. It is the only flavodoxin-like protein of which cotranslational folding has been probed. New insights that emphasize differences between in vivo and in vitro folding energy landscapes are emerging: the ribosome modulates MG formation in nascent apoflavodoxin and forces this polypeptide toward the native state.
Affiliation(s)
- Joseline A Houwman
- Laboratory of Biochemistry, Wageningen University and Research, The Netherlands
| | | |
|
39
|
Dybas JM, Fiser A. Development of a motif-based topology-independent structure comparison method to identify evolutionarily related folds. Proteins 2016; 84:1859-1874. [PMID: 27671894 PMCID: PMC5118133 DOI: 10.1002/prot.25169] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Received: 03/31/2016] [Revised: 08/17/2016] [Accepted: 08/25/2016] [Indexed: 11/09/2022]
Abstract
Structure conservation, functional similarities, and homologous relationships that exist across diverse protein topologies suggest that some regions of the protein fold universe are continuous. However, the current structure classification systems are based on hierarchical organizations, which cannot accommodate structural relationships that span fold definitions. Here, we describe a novel, super-secondary-structure motif-based, topology-independent structure comparison method (SmotifCOMP) that is able to quantitatively identify structural relationships between disparate topologies. The basis of SmotifCOMP is a systematically defined super-secondary-structure motif library whose representative geometries are shown to be saturated in the Protein Data Bank and exhibit a unique distribution within the known folds. SmotifCOMP offers a robust and quantitative technique to compare domains that adopt different topologies since the method does not rely on a global superposition. SmotifCOMP is used to perform an exhaustive comparison of the known folds and the identified relationships are used to produce a nonhierarchical representation of the fold space that reflects the notion of a continuous and connected fold universe. The current work offers insight into previously hypothesized evolutionary relationships between disparate folds and provides a resource for exploring novel ones.
Affiliation(s)
- Joseph M. Dybas
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue Bronx, NY 10461, USA
- Department of Biochemistry, Albert Einstein College of Medicine, 1300 Morris Park Avenue Bronx, NY 10461, USA
| | - Andras Fiser
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue Bronx, NY 10461, USA
- Department of Biochemistry, Albert Einstein College of Medicine, 1300 Morris Park Avenue Bronx, NY 10461, USA
| |
|
40
|
Baier F, Copp JN, Tokuriki N. Evolution of Enzyme Superfamilies: Comprehensive Exploration of Sequence–Function Relationships. Biochemistry 2016; 55:6375-6388. [DOI: 10.1021/acs.biochem.6b00723] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.6] [Indexed: 11/28/2022]
Affiliation(s)
- F. Baier
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC V6T 1Z4, Canada
| | - J. N. Copp
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC V6T 1Z4, Canada
| | - N. Tokuriki
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC V6T 1Z4, Canada
| |
|
41
|
Recurring sequence-structure motifs in (βα)8-barrel proteins and experimental optimization of a chimeric protein designed based on such motifs. Biochim Biophys Acta Proteins Proteom 2016; 1865:165-175. [PMID: 27836620 DOI: 10.1016/j.bbapap.2016.11.001] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Received: 08/01/2016] [Revised: 11/04/2016] [Accepted: 11/06/2016] [Indexed: 11/22/2022]
Abstract
An interesting way of generating novel artificial proteins is to combine sequence motifs from natural proteins, mimicking the evolutionary path suggested by natural proteins comprising recurring motifs. We analyzed the βα and αβ modules of TIM barrel proteins by structure alignment-based sequence clustering. A number of preferred motifs were identified. A chimeric TIM was designed by using recurring elements as mutually compatible interfaces. The foldability of the designed TIM protein was then significantly improved by six rounds of directed evolution. The melting temperature has been improved by more than 20°C. A variety of characteristics suggested that the resulting protein is well-folded. Our analysis provided a library of peptide motifs that is potentially useful for different protein engineering studies. The protein engineering strategy of using recurring motifs as interfaces to connect partial natural proteins may be applied to other protein folds.
|
42
|
Berezovsky IN, Guarnera E, Zheng Z. Basic units of protein structure, folding, and function. Prog Biophys Mol Biol 2016; 128:85-99. [PMID: 27697476 DOI: 10.1016/j.pbiomolbio.2016.09.009] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Received: 07/29/2016] [Revised: 09/05/2016] [Accepted: 09/26/2016] [Indexed: 10/20/2022]
Abstract
Study of the hierarchy of domain structure with alternative sets of domains and analysis of discontinuous domains, consisting of remote segments of the polypeptide chain, raised a question about the minimal structural unit of the protein domain. The hypothesis on the decisive role of the polypeptide backbone in determining the elementary units of globular proteins has led to the discovery of closed loops. It is reviewed here how closed loops form the loop-n-lock structure of proteins, providing the foundation for stability and designability of protein folds/domains and underlying their co-translational folding. Simplified protein sequences are considered here with the aim to explore the basic principles that presumably dominated the folding and stability of proteins in the early stages of structural evolution. Elementary functional loops (EFLs), closed loops with one or few catalytic residues, are, in turn, units of the protein function. They are apparent descendants of the prebiotic ring-like peptides, which gave rise to the first functional folds/domains being fused in the beginning of the evolution of protein structure. It is also shown how evolutionary relations between protein functional superfamilies and folds delineated with the help of EFLs can contribute to establishing the rules for design of desired enzymatic functions. Generalized descriptors of the elementary functions are proposed to be used as basic units in the future computational design.
Affiliation(s)
- Igor N Berezovsky
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore; Department of Biological Sciences (DBS), National University of Singapore (NUS), 8 Medical Drive, 117579, Singapore.
| | - Enrico Guarnera
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore
| | - Zejun Zheng
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore
| |
|
43
|
Abstract
Proteins are the workhorses of the cell and, over billions of years, they have evolved an amazing plethora of extremely diverse and versatile structures with equally diverse functions. Evolutionary emergence of new proteins and transitions between existing ones are believed to be rare or even impossible. However, recent advances in comparative genomics have repeatedly called some 10%-30% of all genes without any detectable similarity to existing proteins. Even after careful scrutiny, some of those orphan genes contain protein coding reading frames with detectable transcription and translation. Thus some proteins seem to have emerged from previously non-coding 'dark genomic matter'. These 'de novo' proteins tend to be disordered, fast evolving, weakly expressed but also rapidly assuming novel and physiologically important functions. Here we review mechanisms by which 'de novo' proteins might be created, under which circumstances they may become fixed and why they are elusive. We propose a 'grow slow and moult' model in which first a reading frame is extended, coding for an initially disordered and non-globular appendage which, over time, becomes more structured and may also become associated with other proteins.
|
44
|
Figueroa M, Sleutel M, Vandevenne M, Parvizi G, Attout S, Jacquin O, Vandenameele J, Fischer AW, Damblon C, Goormaghtigh E, Valerio-Lepiniec M, Urvoas A, Durand D, Pardon E, Steyaert J, Minard P, Maes D, Meiler J, Matagne A, Martial JA, Van de Weerdt C. The unexpected structure of the designed protein Octarellin V.1 forms a challenge for protein structure prediction tools. J Struct Biol 2016; 195:19-30. [PMID: 27181418 DOI: 10.1016/j.jsb.2016.05.004] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Received: 02/15/2016] [Revised: 04/19/2016] [Accepted: 05/12/2016] [Indexed: 12/26/2022]
Abstract
Despite impressive successes in protein design, designing a well-folded protein of more than 100 amino acids de novo remains a formidable challenge. Exploiting the promising biophysical features of the artificial protein Octarellin V, we improved this protein by directed evolution, thus creating a more stable and soluble protein: Octarellin V.1. Next, we obtained crystals of Octarellin V.1 in complex with crystallization chaperones and determined the tertiary structure. The experimental structure of Octarellin V.1 differs from its in silico design: the (αβα) sandwich architecture bears some resemblance to a Rossmann-like fold instead of the intended TIM-barrel fold. This surprising result gave us a unique and attractive opportunity to test the state of the art in protein structure prediction, using this artificial protein free of any natural selection. We tested 13 automated webservers for protein structure prediction and found none of them to predict the actual structure. More than 50% of them predicted a TIM-barrel fold, i.e. the structure we set out to design more than 10 years ago. In addition, local software runs that are human operated can sample a structure similar to the experimental one but fail in selecting it, suggesting that the scoring and ranking functions should be improved. We propose that artificial proteins could be used as tools to test the accuracy of protein structure prediction algorithms, because of their lack of evolutionary pressure and unique sequence features.
Affiliation(s)
- Maximiliano Figueroa
- GIGA-Research, Molecular Biomimetics and Protein Engineering, University of Liège, Liège, Belgium.
| | - Mike Sleutel
- Structural Biology Brussels, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium
| | - Marylene Vandevenne
- GIGA-Research, Molecular Biomimetics and Protein Engineering, University of Liège, Liège, Belgium
| | - Gregory Parvizi
- GIGA-Research, Molecular Biomimetics and Protein Engineering, University of Liège, Liège, Belgium
| | - Sophie Attout
- GIGA-Research, Molecular Biomimetics and Protein Engineering, University of Liège, Liège, Belgium
| | - Olivier Jacquin
- GIGA-Research, Molecular Biomimetics and Protein Engineering, University of Liège, Liège, Belgium
| | - Julie Vandenameele
- Laboratoire d'Enzymologie et Repliement des Protéines, Centre for Protein Engineering, University of Liège, Liège, Belgium
| | - Axel W Fischer
- Department of Chemistry, Center for Structural Biology, Vanderbilt University, Nashville, TN, United States
| | | | - Erik Goormaghtigh
- Laboratory for the Structure and Function of Biological Membranes, Center for Structural Biology and Bioinformatics, Université Libre de Bruxelles, Brussels, Belgium
| | - Marie Valerio-Lepiniec
- Institute for Integrative Biology of the Cell (I2BC), UMR 9198, CEA, CNRS, Université Paris-Sud, Orsay, France
| | - Agathe Urvoas
- Institute for Integrative Biology of the Cell (I2BC), UMR 9198, CEA, CNRS, Université Paris-Sud, Orsay, France
| | - Dominique Durand
- Institute for Integrative Biology of the Cell (I2BC), UMR 9198, CEA, CNRS, Université Paris-Sud, Orsay, France
| | - Els Pardon
- Structural Biology Brussels, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium; Structural Biology Research Center, VIB, Pleinlaan 2, 1050 Brussels, Belgium
| | - Jan Steyaert
- Structural Biology Brussels, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium; Structural Biology Research Center, VIB, Pleinlaan 2, 1050 Brussels, Belgium
| | - Philippe Minard
- Institute for Integrative Biology of the Cell (I2BC), UMR 9198, CEA, CNRS, Université Paris-Sud, Orsay, France
| | - Dominique Maes
- Structural Biology Brussels, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium
| | - Jens Meiler
- Department of Chemistry, Center for Structural Biology, Vanderbilt University, Nashville, TN, United States
| | - André Matagne
- Laboratoire d'Enzymologie et Repliement des Protéines, Centre for Protein Engineering, University of Liège, Liège, Belgium
| | - Joseph A Martial
- GIGA-Research, Molecular Biomimetics and Protein Engineering, University of Liège, Liège, Belgium
| | - Cécile Van de Weerdt
- GIGA-Research, Molecular Biomimetics and Protein Engineering, University of Liège, Liège, Belgium.
| |
|
45
|
Undheim EAB, Mobli M, King GF. Toxin structures as evolutionary tools: Using conserved 3D folds to study the evolution of rapidly evolving peptides. Bioessays 2016; 38:539-48. [DOI: 10.1002/bies.201500165] [Citation(s) in RCA: 59] [Impact Index Per Article: 7.4] [Indexed: 12/21/2022]
Affiliation(s)
- Eivind A. B. Undheim
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Queensland, Australia
| | - Mehdi Mobli
- Centre for Advanced Imaging, University of Queensland, St Lucia, Queensland, Australia
| | - Glenn F. King
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Queensland, Australia
| |
|
46
|
Laurino P, Tóth-Petróczy Á, Meana-Pañeda R, Lin W, Truhlar DG, Tawfik DS. An Ancient Fingerprint Indicates the Common Ancestry of Rossmann-Fold Enzymes Utilizing Different Ribose-Based Cofactors. PLoS Biol 2016; 14:e1002396. [PMID: 26938925 PMCID: PMC4777477 DOI: 10.1371/journal.pbio.1002396] [Citation(s) in RCA: 66] [Impact Index Per Article: 8.3] [Received: 10/11/2015] [Accepted: 01/29/2016] [Indexed: 01/30/2023] Open
Abstract
Nucleoside-based cofactors are presumed to have preceded proteins. The Rossmann fold is one of the most ancient and functionally diverse protein folds, and most Rossmann enzymes utilize nucleoside-based cofactors. We analyzed an omnipresent Rossmann ribose-binding interaction: a carboxylate side chain at the tip of the second β-strand (β2-Asp/Glu). We identified a canonical motif, defined by the β2-topology and unique geometry. The latter relates to the interaction being bidentate (both ribose hydroxyls interacting with the carboxylate oxygens), to the angle between the carboxylate and the ribose, and to the ribose's ring configuration. We found that this canonical motif exhibits hallmarks of divergence rather than convergence. It is uniquely found in Rossmann enzymes that use different cofactors, primarily SAM (S-adenosyl methionine), NAD (nicotinamide adenine dinucleotide), and FAD (flavin adenine dinucleotide). Ribose-carboxylate bidentate interactions in other folds are not only rare but also have a different topology and geometry. We further show that the canonical geometry is not dictated by a physical constraint: geometries found in noncanonical interactions have similar calculated bond energies. Overall, these data indicate the divergence of several major Rossmann-fold enzyme classes, with different cofactors and catalytic chemistries, from a common pre-LUCA (last universal common ancestor) ancestor that possessed the β2-Asp/Glu motif.
Affiliation(s)
- Paola Laurino
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot, Israel
| | - Ágnes Tóth-Petróczy
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot, Israel
| | - Rubén Meana-Pañeda
- Department of Chemistry, Chemical Theory Center, and Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Wei Lin
- Department of Chemistry, Chemical Theory Center, and Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Donald G. Truhlar
- Department of Chemistry, Chemical Theory Center, and Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Dan S. Tawfik
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot, Israel
| |
|
47
|
Khersonsky O, Fleishman SJ. Why reinvent the wheel? Building new proteins based on ready-made parts. Protein Sci 2016; 25:1179-87. [PMID: 26821641 DOI: 10.1002/pro.2892] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Received: 11/06/2015] [Revised: 01/20/2016] [Accepted: 01/27/2016] [Indexed: 12/12/2022]
Abstract
We protein engineers are ambivalent about evolution: on the one hand, evolution inspires us with myriad examples of biomolecular binders, sensors, and catalysts; on the other hand, these examples are seldom well-adapted to the engineering tasks we have in mind. Protein engineers have therefore modified natural proteins by point substitutions and fragment exchanges in an effort to generate new functions. A counterpoint to such design efforts, which is being pursued now with greater success, is to completely eschew the starting materials provided by nature and to design new protein functions from scratch by using de novo molecular modeling and design. While important progress has been made in both directions, some areas of protein design are still beyond reach. To this end, we advocate a synthesis of these two strategies: by using design calculations to both recombine and optimize fragments from natural proteins, we can build stable and as of yet un-sampled structures, thereby granting access to an expanded repertoire of conformations and desired functions. We propose that future methods that combine phylogenetic analysis, structure and sequence bioinformatics, and atomistic modeling may well succeed where any one of these approaches has failed on its own.
Affiliation(s)
- Olga Khersonsky
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, 76100, Israel
| | - Sarel J Fleishman
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, 76100, Israel
| |
|
48
|
Kries H, Niquille DL, Hilvert D. A subdomain swap strategy for reengineering nonribosomal peptides. Chem Biol 2015; 22:640-8. [PMID: 26000750 DOI: 10.1016/j.chembiol.2015.04.015] [Citation(s) in RCA: 68] [Impact Index Per Article: 8.5] [Received: 01/27/2015] [Revised: 03/31/2015] [Accepted: 04/15/2015] [Indexed: 11/24/2022]
Abstract
Nonribosomal peptide synthetases (NRPSs) protect microorganisms from environmental threats by producing diverse siderophores, antibiotics, and other peptide natural products. Their modular molecular structure is also attractive from the standpoint of biosynthetic engineering. Here we evaluate a methodology for swapping module specificities of these mega-enzymes that takes advantage of flavodoxin-like subdomains involved in substrate recognition. Nine subdomains encoding diverse specificities were transplanted into the Phe-specific GrsA initiation module of gramicidin S synthetase. All chimeras could be purified as soluble protein. One construct based on a Val-specific subdomain showed sizable adenylation activity and functioned as a Val-Pro diketopiperazine synthetase upon addition of the proline-specific GrsB1 module. These results suggest that subdomain swapping could be a viable alternative to previous NRPS design approaches targeting binding pockets, domains, or entire modules. The short length of the swapped sequence stretch may facilitate straightforward exploitation of the wealth of existing NRPS modules for combinatorial biosynthesis.
Affiliation(s)
- Hajo Kries
- Laboratory of Organic Chemistry, ETH Zurich, 8093 Zürich, Switzerland
| | - David L Niquille
- Laboratory of Organic Chemistry, ETH Zurich, 8093 Zürich, Switzerland
| | - Donald Hilvert
- Laboratory of Organic Chemistry, ETH Zurich, 8093 Zürich, Switzerland.
| |
|
49
|
|
50
|
Alva V, Söding J, Lupas AN. A vocabulary of ancient peptides at the origin of folded proteins. eLife 2015; 4:e09410. [PMID: 26653858 PMCID: PMC4739770 DOI: 10.7554/elife.09410] [Citation(s) in RCA: 150] [Impact Index Per Article: 16.7] [Received: 06/13/2015] [Accepted: 12/13/2015] [Indexed: 01/01/2023] Open
Abstract
The seemingly limitless diversity of proteins in nature arose from only a few thousand domain prototypes, but the origin of these themselves has remained unclear. We are pursuing the hypothesis that they arose by fusion and accretion from an ancestral set of peptides active as co-factors in RNA-dependent replication and catalysis. Should this be true, contemporary domains may still contain vestiges of such peptides, which could be reconstructed by a comparative approach in the same way in which ancient vocabularies have been reconstructed by the comparative study of modern languages. To test this, we compared domains representative of known folds and identified 40 fragments whose similarity is indicative of common descent, yet which occur in domains currently not thought to be homologous. These fragments are widespread in the most ancient folds and enriched for iron-sulfur- and nucleic acid-binding. We propose that they represent the observable remnants of a primordial RNA-peptide world.
Affiliation(s)
- Vikram Alva
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Johannes Söding
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Andrei N Lupas
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| |
|