1
|
Zhang Q, Chen Y, Duan L, Dong L, Wang S. Design Glutamate Dehydrogenase for Nonaqueous System by Motifs Reassembly and Interaction Network Analysis. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2024; 72:19931-19939. [PMID: 39222309 DOI: 10.1021/acs.jafc.4c02995] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/04/2024]
Abstract
Glutamate dehydrogenases (GDH) serve as the key regulated enzyme that links protein and carbohydrate metabolism. Combined with motif reassembly and mutation, novel GDHs were designed. Motif reassembly of thermophilic GDH and malate dehydrogenase aims to overcome stability and activity tradeoff in nonaqueous systems. Structural compatibility and dynamic cooperation of the designed AaDHs were studied by molecular dynamics simulation. Furthermore, multipoint mutations improved its catalytic activity for unnatural substrates. Amino acid interaction network analysis indicated that the high density of hydrogen-bonded salt bridges is beneficial to the stability. Finally, the experimental verification determines the kinetics of AaDHs in a nonaqueous system. The activity of Aa05 was increased by 1.78-fold with ionic liquid [EMIM]BF4. This study presents the strategy of a combination of rigid motif assembly and mutations of active sites for robust dehydrogenases with high activity in the nonaqueous system, which overcomes the activity-stability tradeoff effect.
Collapse
Affiliation(s)
- Qian Zhang
- Department of Chemical and Biochemical Engineering, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
| | - Yuxin Chen
- Department of Chemical and Biochemical Engineering, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
| | - Lingxuan Duan
- Department of Chemical and Biochemical Engineering, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
| | - Lingling Dong
- Department of Chemical and Biochemical Engineering, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
| | - Shizhen Wang
- Department of Chemical and Biochemical Engineering, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
- Xiamen Key Laboratory of Synthetic Biotechnology, Xiamen University, Xiamen, Fujian 361005, P. R. China
| |
Collapse
|
2
|
Zheng Z, Goncearenco A, Berezovsky IN. Back in time to the Gly-rich prototype of the phosphate binding elementary function. Curr Res Struct Biol 2024; 7:100142. [PMID: 38655428 PMCID: PMC11035071 DOI: 10.1016/j.crstbi.2024.100142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2023] [Revised: 03/31/2024] [Accepted: 04/03/2024] [Indexed: 04/26/2024] Open
Abstract
Binding of nucleotides and their derivatives is one of the most ancient elementary functions dating back to the Origin of Life. We review here the works considering one of the key elements in binding of (di)nucleotide-containing ligands - phosphate binding. We start from a brief discussion of major participants, conditions, and events in prebiotic evolution that resulted in the Origin of Life. Tracing back to the basic functions, including metal and phosphate binding, and, potentially, formation of primitive protein-protein interactions, we focus here on the phosphate binding. Critically assessing works on the structural, functional, and evolutionary aspects of phosphate binding, we perform a simple computational experiment reconstructing its most ancient and generic sequence prototype. The profiles of the phosphate binding signatures have been derived in form of position-specific scoring matrices (PSSMs), their peculiarities depending on the type of the ligands have been analyzed, and evolutionary connections between them have been delineated. Then, the apparent prototype that gave rise to all relevant phosphate-binding signatures had also been reconstructed. We show that two major signatures of the phosphate binding that discriminate between the binding of dinucleotide- and nucleotide-containing ligands are GxGxxG and GxxGxG, respectively. It appears that the signature archetypal for dinucleotide-containing ligands is more generic, and it can frequently bind phosphate groups in nucleotide-containing ligands as well. The reconstructed prototype's key signature GxGGxG underlies the role of glycine residues in providing flexibility and interactions necessary for binding the phosphate groups. The prototype also contains other ancient amino acids, valine, and alanine, showing versatility towards evolutionary design and functional diversification.
Collapse
Affiliation(s)
- Zejun Zheng
- Bioinformatics Institute, Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore
| | | | - Igor N. Berezovsky
- Bioinformatics Institute, Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore
- Department of Biological Sciences (DBS), National University of Singapore (NUS), 8 Medical Drive, 117579, Singapore
| |
Collapse
|
3
|
Ye W, Krishna Behra PR, Dyrhage K, Seeger C, Joiner JD, Karlsson E, Andersson E, Chi CN, Andersson SGE, Jemth P. Folded Alpha Helical Putative New Proteins from Apilactobacillus kunkeei. J Mol Biol 2024; 436:168490. [PMID: 38355092 DOI: 10.1016/j.jmb.2024.168490] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Revised: 02/07/2024] [Accepted: 02/08/2024] [Indexed: 02/16/2024]
Abstract
The emergence of new proteins is a central question in biology. Most tertiary protein folds known to date appear to have an ancient origin, but it is clear from bioinformatic analyses that new proteins continuously emerge in all organismal groups. However, there is a paucity of experimental data on new proteins regarding their structure and biophysical properties. We performed a detailed phylogenetic analysis and identified 48 putative open reading frames in the honeybee-associated bacterium Apilactobacillus kunkeei for which no or few homologs could be identified in closely-related species, suggesting that they could be relatively new on an evolutionary time scale and represent recently evolved proteins. Using circular dichroism-, fluorescence- and nuclear magnetic resonance (NMR) spectroscopy we investigated six of these proteins and show that they are not intrinsically disordered, but populate alpha-helical dominated folded states with relatively low thermodynamic stability (0-3 kcal/mol). The NMR and biophysical data demonstrate that small new proteins readily adopt simple folded conformations suggesting that more complex tertiary structures can be continuously re-invented during evolution by fusion of such simple secondary structure elements. These findings have implications for the general view on protein evolution, where de novo emergence of folded proteins may be a common event.
Collapse
Affiliation(s)
- Weihua Ye
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, 75123 Uppsala, Sweden
| | - Phani Rama Krishna Behra
- Department of Molecular Evolution, Cell and Molecular Biology, Biomedical Centre, Science for Life Laboratory, Uppsala University, 75236 Uppsala, Sweden
| | - Karl Dyrhage
- Department of Molecular Evolution, Cell and Molecular Biology, Biomedical Centre, Science for Life Laboratory, Uppsala University, 75236 Uppsala, Sweden
| | - Christian Seeger
- Department of Molecular Evolution, Cell and Molecular Biology, Biomedical Centre, Science for Life Laboratory, Uppsala University, 75236 Uppsala, Sweden
| | - Joe D Joiner
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, 75123 Uppsala, Sweden
| | - Elin Karlsson
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, 75123 Uppsala, Sweden
| | - Eva Andersson
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, 75123 Uppsala, Sweden
| | - Celestine N Chi
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, 75123 Uppsala, Sweden.
| | - Siv G E Andersson
- Department of Molecular Evolution, Cell and Molecular Biology, Biomedical Centre, Science for Life Laboratory, Uppsala University, 75236 Uppsala, Sweden.
| | - Per Jemth
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, 75123 Uppsala, Sweden.
| |
Collapse
|
4
|
Amangeldina A, Tan ZW, Berezovsky IN. Living in trinity of extremes: Genomic and proteomic signatures of halophilic, thermophilic, and pH adaptation. Curr Res Struct Biol 2024; 7:100129. [PMID: 38327713 PMCID: PMC10847869 DOI: 10.1016/j.crstbi.2024.100129] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 01/16/2024] [Accepted: 01/16/2024] [Indexed: 02/09/2024] Open
Abstract
Since nucleic acids and proteins of unicellular prokaryotes are directly exposed to extreme environmental conditions, it is possible to explore the genomic-proteomic compositional determinants of molecular mechanisms of adaptation developed by them in response to harsh environmental conditions. Using a wealth of currently available complete genomes/proteomes we were able to explore signatures of adaptation to three environmental factors, pH, salinity, and temperature, observing major trends in compositions of their nucleic acids and proteins. We derived predictors of thermostability, halophilic, and pH adaptations and complemented them by the principal components analysis. We observed a clear difference between thermophilic and salinity/pH adaptations, whereas latter invoke seemingly overlapping mechanisms. The genome-proteome compositional trade-off reveals an intricate balance between the work of base paring and base stacking in stabilization of coding DNA and r/tRNAs, and, at the same time, universal requirements for the stability and foldability of proteins regardless of the nucleotide biases. Nevertheless, we still found hidden fingerprints of ancient evolutionary connections between the nucleotide and amino acid compositions indicating their emergence, mutual evolution, and adjustment. The evolutionary perspective on the adaptation mechanisms is further studied here by means of the comparative analysis of genomic/proteomic traits of archaeal and bacterial species. The overall picture of genomic/proteomic signals of adaptation obtained here provides a foundation for future engineering and design of functional biomolecules resistant to harsh environments.
Collapse
Affiliation(s)
- Aidana Amangeldina
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore
- Department of Biological Sciences (DBS), National University of Singapore (NUS), 8 Medical Drive, 117579, Singapore
| | - Zhen Wah Tan
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore
| | - Igor N. Berezovsky
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore
- Department of Biological Sciences (DBS), National University of Singapore (NUS), 8 Medical Drive, 117579, Singapore
| |
Collapse
|
5
|
Mughal F, Caetano-Anollés G. Evolution of Intrinsic Disorder in Protein Loops. Life (Basel) 2023; 13:2055. [PMID: 37895436 PMCID: PMC10608553 DOI: 10.3390/life13102055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Revised: 10/08/2023] [Accepted: 10/10/2023] [Indexed: 10/29/2023] Open
Abstract
Intrinsic disorder accounts for the flexibility of protein loops, molecular building blocks that are largely responsible for the processes and molecular functions of the living world. While loops likely represent early structural forms that served as intermediates in the emergence of protein structural domains, their origin and evolution remain poorly understood. Here, we conduct a phylogenomic survey of disorder in loop prototypes sourced from the ArchDB classification. Tracing prototypes associated with protein fold families along an evolutionary chronology revealed that ancient prototypes tended to be more disordered than their derived counterparts, with ordered prototypes developing later in evolution. This highlights the central evolutionary role of disorder and flexibility. While mean disorder increased with time, a minority of ordered prototypes exist that emerged early in evolutionary history, possibly driven by the need to preserve specific molecular functions. We also revealed the percolation of evolutionary constraints from higher to lower levels of organization. Percolation resulted in trade-offs between flexibility and rigidity that impacted prototype structure and geometry. Our findings provide a deep evolutionary view of the link between structure, disorder, flexibility, and function, as well as insights into the evolutionary role of intrinsic disorder in loops and their contribution to protein structure and function.
Collapse
Affiliation(s)
- Fizza Mughal
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, IL 61801, USA
| | - Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, IL 61801, USA
- C.R. Woese Institute for Genomic Biology, University of Illinois, Urbana, IL 61801, USA
| |
Collapse
|
6
|
Tan ZW, Tee WV, Guarnera E, Berezovsky IN. AlloMAPS 2: allosteric fingerprints of the AlphaFold and Pfam-trRosetta predicted structures for engineering and design. Nucleic Acids Res 2022; 51:D345-D351. [PMID: 36169226 PMCID: PMC9825619 DOI: 10.1093/nar/gkac828] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Revised: 08/26/2022] [Accepted: 09/15/2022] [Indexed: 01/29/2023] Open
Abstract
AlloMAPS 2 is an update of the Allosteric Mutation Analysis and Polymorphism of Signalling database, which contains data on allosteric communication obtained for predicted structures in the AlphaFold database (AFDB) and trRosetta-predicted Pfam domains. The data update contains Allosteric Signalling Maps (ASMs) and Allosteric Probing Maps (APMs) quantifying allosteric effects of mutations and of small probe binding, respectively. To ensure quality of the ASMs and APMs, we performed careful and accurate selection of protein sets containing high-quality predicted structures in both databases for each organism/structure, and the data is available for browsing and download. The data for remaining structures are available for download and should be used at user's discretion and responsibility. We believe these massive data can facilitate both diagnostics and drug design within the precision medicine paradigm. Specifically, it can be instrumental in the analysis of allosteric effects of pathological and rescue mutations, providing starting points for fragment-based design of allosteric effectors. The exhaustive character of allosteric signalling and probing fingerprints will be also useful in future developments of corresponding machine learning applications. The database is freely available at: http://allomaps.bii.a-star.edu.sg.
Collapse
Affiliation(s)
- Zhen Wah Tan
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore
| | - Wei-Ven Tee
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore
| | - Enrico Guarnera
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore
| | - Igor N Berezovsky
- To whom correspondence should be addressed. Tel: +65 6478 8269; Fax: +65 6478 9047;
| |
Collapse
|
7
|
Berezovsky IN, Nussinov R. Multiscale Allostery: Basic Mechanisms and Versatility in Diagnostics and Drug Design. J Mol Biol 2022; 434:167751. [PMID: 35863488 DOI: 10.1016/j.jmb.2022.167751] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Affiliation(s)
- Igor N Berezovsky
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, Singapore 138671, Singapore; Department of Biological Sciences (DBS), National University of Singapore (NUS), 8 Medical Drive, 117579, Singapore.
| | - Ruth Nussinov
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research in the Cancer Innovation Laboraory, National Cancer Institute, Frederick, MD 21702, USA; Department of Human Molecular Genetics and Biochemistry, Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel.
| |
Collapse
|
8
|
Wah Tan Z, Tee WV, Berezovsky IN. Learning about allosteric drugs and ways to design them. J Mol Biol 2022; 434:167692. [PMID: 35738428 DOI: 10.1016/j.jmb.2022.167692] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2022] [Revised: 05/23/2022] [Accepted: 06/15/2022] [Indexed: 11/16/2022]
Abstract
While the accelerating quest for precision medicine requires new individually targeting and selective drugs, and the ability to work with so-called undruggable targets, the realm of allosteric drugs meeting this need remains largely uncharted. Generalizing the observations on two major drug targets with widely observed inherent allostery, GPCRs and kinases, we describe and discuss basic allosteric modes of action that are universally applicable in all types of structures and functions. Using examples of Class A GPCRs and CMGC protein kinases, we show how Allosteric Signalling and Probing Fingerprints can be used to identify potential allosteric sites and reveal effector-leads that may serve as a starting point for the development of allosteric drugs targeting these regulatory sites. A set of distinct characteristics of allosteric ligands was established, which highlights the versatility of their design and make them advantageous before their orthosteric counterparts in personalized medicine. We argue that rational design of allosteric drugs should begin with the search for latent sites or design of non-natural binding sites followed by fragment-based design of allosteric ligands and by the mutual adjustment of the site-ligand pair in order to achieve required effects. On the basis of the perturbative nature and reversibility of allosteric communication, we propose a generic protocol for computational design of allosteric effectors, enabling also the allosteric tuning of biologics, in obtaining allosteric control over protein functions.
Collapse
Affiliation(s)
- Zhen Wah Tan
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, Singapore 138671
| | - Wei-Ven Tee
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, Singapore 138671
| | - Igor N Berezovsky
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, Singapore 138671; Department of Biological Sciences (DBS), National University of Singapore (NUS), 8 Medical Drive, 117579, Singapore.
| |
Collapse
|
9
|
Tee WV, Wah Tan Z, Guarnera E, Berezovsky IN. Conservation and diversity in allosteric fingerprints of proteins for evolutionary-inspired engineering and design. J Mol Biol 2022; 434:167577. [PMID: 35395233 DOI: 10.1016/j.jmb.2022.167577] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2022] [Revised: 03/30/2022] [Accepted: 03/30/2022] [Indexed: 11/26/2022]
Abstract
Hand-in-hand work of physics and evolution delivered protein universe with diversity of forms, sizes, and functions. Pervasiveness and advantageous traits of allostery made it an important component of the protein function regulation, calling for thorough investigation of its structural determinants and evolution. Learning directly from nature, we explored here allosteric communication in several major folds and repeat proteins, including α/β and β-barrels, β-propellers, Ig-like fold, ankyrin and α/β leucine-rich repeat proteins, which provide structural platforms for many different enzymatic and signalling functions. We obtained a picture of conserved allosteric communication characteristic in different fold types, modifications of the structure-driven signalling patterns via sequence-determined divergence to specific functions, as well as emergence and potential diversification of allosteric regulation in multi-domain proteins and oligomeric assemblies. Our observations will be instrumental in facilitating the engineering and de novo design of proteins with allosterically regulated functions, including development of therapeutic biologics. In particular, results described here may guide the identification of the optimal structural platforms (e.g. fold type, size, and oligomerization states) and the types of diversifications/perturbations, such as mutations, effector binding, and order-disorder transition. The tunable allosteric linkage across distant regions can be used as a pivotal component in the design/engineering of modular biological systems beyond the traditional scaffolding function.
Collapse
Affiliation(s)
- Wei-Ven Tee
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, Singapore 138671
| | - Zhen Wah Tan
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, Singapore 138671
| | - Enrico Guarnera
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, Singapore 138671
| | - Igor N Berezovsky
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, Singapore 138671; Department of Biological Sciences (DBS), National University of Singapore (NUS), 8 Medical Drive, Singapore 117597.
| |
Collapse
|
10
|
Longo LM, Kolodny R, McGlynn SE. Evidence for the emergence of β-trefoils by 'Peptide Budding' from an IgG-like β-sandwich. PLoS Comput Biol 2022; 18:e1009833. [PMID: 35157697 PMCID: PMC8880906 DOI: 10.1371/journal.pcbi.1009833] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2022] [Revised: 02/25/2022] [Accepted: 01/13/2022] [Indexed: 12/02/2022] Open
Abstract
As sequence and structure comparison algorithms gain sensitivity, the intrinsic interconnectedness of the protein universe has become increasingly apparent. Despite this general trend, β-trefoils have emerged as an uncommon counterexample: They are an isolated protein lineage for which few, if any, sequence or structure associations to other lineages have been identified. If β-trefoils are, in fact, remote islands in sequence-structure space, it implies that the oligomerizing peptide that founded the β-trefoil lineage itself arose de novo. To better understand β-trefoil evolution, and to probe the limits of fragment sharing across the protein universe, we identified both 'β-trefoil bridging themes' (evolutionarily-related sequence segments) and 'β-trefoil-like motifs' (structure motifs with a hallmark feature of the β-trefoil architecture) in multiple, ostensibly unrelated, protein lineages. The success of the present approach stems, in part, from considering β-trefoil sequence segments or structure motifs rather than the β-trefoil architecture as a whole, as has been done previously. The newly uncovered inter-lineage connections presented here suggest a novel hypothesis about the origins of the β-trefoil fold itself-namely, that it is a derived fold formed by 'budding' from an Immunoglobulin-like β-sandwich protein. These results demonstrate how the evolution of a folded domain from a peptide need not be a signature of antiquity and underpin an emerging truth: few protein lineages escape nature's sewing table.
Collapse
Affiliation(s)
- Liam M. Longo
- Earth-Life Science Institute, Tokyo Institute of Technology, Tokyo, Japan
- Blue Marble Space Institute of Science, Seattle, Washington, United States of America
| | - Rachel Kolodny
- Department of Computer Science, University of Haifa, Haifa, Israel
| | - Shawn E. McGlynn
- Earth-Life Science Institute, Tokyo Institute of Technology, Tokyo, Japan
- Blue Marble Space Institute of Science, Seattle, Washington, United States of America
| |
Collapse
|
11
|
Papadopoulos C, Callebaut I, Gelly JC, Hatin I, Namy O, Renard M, Lespinet O, Lopes A. Intergenic ORFs as elementary structural modules of de novo gene birth and protein evolution. Genome Res 2021; 31:2303-2315. [PMID: 34810219 PMCID: PMC8647833 DOI: 10.1101/gr.275638.121] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2021] [Accepted: 09/23/2021] [Indexed: 01/08/2023]
Abstract
The noncoding genome plays an important role in de novo gene birth and in the emergence of genetic novelty. Nevertheless, how noncoding sequences' properties could promote the birth of novel genes and shape the evolution and the structural diversity of proteins remains unclear. Therefore, by combining different bioinformatic approaches, we characterized the fold potential diversity of the amino acid sequences encoded by all intergenic open reading frames (ORFs) of S. cerevisiae with the aim of (1) exploring whether the structural states' diversity of proteomes is already present in noncoding sequences, and (2) estimating the potential of the noncoding genome to produce novel protein bricks that could either give rise to novel genes or be integrated into pre-existing proteins, thus participating in protein structure diversity and evolution. We showed that amino acid sequences encoded by most yeast intergenic ORFs contain the elementary building blocks of protein structures. Moreover, they encompass the large structural state diversity of canonical proteins, with the majority predicted as foldable. Then, we investigated the early stages of de novo gene birth by reconstructing the ancestral sequences of 70 yeast de novo genes and characterized the sequence and structural properties of intergenic ORFs with a strong translation signal. This enabled us to highlight sequence and structural factors determining de novo gene emergence. Finally, we showed a strong correlation between the fold potential of de novo proteins and one of their ancestral amino acid sequences, reflecting the relationship between the noncoding genome and the protein structure universe.
Collapse
Affiliation(s)
- Chris Papadopoulos
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
| | - Isabelle Callebaut
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, 75005 Paris, France
| | - Jean-Christophe Gelly
- Université de Paris, Biologie Intégrée du Globule Rouge, UMR_S1134, BIGR, INSERM, F-75015 Paris, France
- Laboratoire d'Excellence GR-Ex, 75015 Paris, France
- Institut National de la Transfusion Sanguine, F-75015 Paris, France
| | - Isabelle Hatin
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
| | - Olivier Namy
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
| | - Maxime Renard
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
| | - Olivier Lespinet
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
| | - Anne Lopes
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
| |
Collapse
|
12
|
Romero-Romero S, Kordes S, Michel F, Höcker B. Evolution, folding, and design of TIM barrels and related proteins. Curr Opin Struct Biol 2021; 68:94-104. [PMID: 33453500 PMCID: PMC8250049 DOI: 10.1016/j.sbi.2020.12.007] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2020] [Revised: 12/13/2020] [Accepted: 12/14/2020] [Indexed: 12/16/2022]
Abstract
Proteins are chief actors in life that perform a myriad of exquisite functions. This diversity has been enabled through the evolution and diversification of protein folds. Analysis of sequences and structures strongly suggest that numerous protein pieces have been reused as building blocks and propagated to many modern folds. This information can be traced to understand how the protein world has diversified. In this review, we discuss the latest advances in the analysis of protein evolutionary units, and we use as a model system one of the most abundant and versatile topologies, the TIM-barrel fold, to highlight the existing common principles that interconnect protein evolution, structure, folding, function, and design.
Collapse
Affiliation(s)
| | - Sina Kordes
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany
| | - Florian Michel
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany
| | - Birte Höcker
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany.
| |
Collapse
|
13
|
Yin M, Goncearenco A, Berezovsky IN. Deriving and Using Descriptors of Elementary Functions in Rational Protein Design. FRONTIERS IN BIOINFORMATICS 2021; 1:657529. [PMID: 36303771 PMCID: PMC9581014 DOI: 10.3389/fbinf.2021.657529] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2021] [Accepted: 03/15/2021] [Indexed: 06/26/2024] Open
Abstract
The rational design of proteins with desired functions requires a comprehensive description of the functional building blocks. The evolutionary conserved functional units constitute nature's toolbox; however, they are not readily available to protein designers. This study focuses on protein units of subdomain size that possess structural properties and amino acid residues sufficient to carry out elementary reactions in the catalytic mechanisms. The interactions within such elementary functional loops (ELFs) and the interactions with the surrounding protein scaffolds constitute the descriptor of elementary function. The computational approach to deriving descriptors directly from protein sequences and structures and applying them in rational design was implemented in a proof-of-concept DEFINED-PROTEINS software package. Once the descriptor is obtained, the ELF can be fitted into existing or novel scaffolds to obtain the desired function. For instance, the descriptor may be used to determine the necessary spatial restraints in a fragment-based grafting protocol. We illustrated the approach by applying it to well-known cases of ELFs, including phosphate-binding P-loop, diphosphate-binding glycine-rich motif, and calcium-binding EF-hand motif, which could be used to jumpstart templates for user applications. The DEFINED-PROTEINS package is available for free at https://github.com/MelvinYin/Defined_Proteins.
Collapse
Affiliation(s)
- Melvin Yin
- Bioinformatics Institute, Agency for Science, Technology, and Research (ASTAR), Singapore, Singapore
| | - Alexander Goncearenco
- National Center for Biotechnology Information, National Institute of Health (NIH), Bethesda, MD, United States
| | - Igor N. Berezovsky
- Bioinformatics Institute, Agency for Science, Technology, and Research (ASTAR), Singapore, Singapore
- Department of Biological Sciences (DBS), National University of Singapore (NUS), Singapore, Singapore
| |
Collapse
|
14
|
Helicase-like functions in phosphate loop containing beta-alpha polypeptides. Proc Natl Acad Sci U S A 2021; 118:2016131118. [PMID: 33846247 DOI: 10.1073/pnas.2016131118] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The P-loop Walker A motif underlies hundreds of essential enzyme families that bind nucleotide triphosphates (NTPs) and mediate phosphoryl transfer (P-loop NTPases), including the earliest DNA/RNA helicases, translocases, and recombinases. What were the primordial precursors of these enzymes? Could these large and complex proteins emerge from simple polypeptides? Previously, we showed that P-loops embedded in simple βα repeat proteins bind NTPs but also, unexpectedly so, ssDNA and RNA. Here, we extend beyond the purely biophysical function of ligand binding to demonstrate rudimentary helicase-like activities. We further constructed simple 40-residue polypeptides comprising just one β-(P-loop)-α element. Despite their simplicity, these P-loop prototypes confer functions such as strand separation and exchange. Foremost, these polypeptides unwind dsDNA, and upon addition of NTPs, or inorganic polyphosphates, release the bound ssDNA strands to allow reformation of dsDNA. Binding kinetics and low-resolution structural analyses indicate that activity is mediated by oligomeric forms spanning from dimers to high-order assemblies. The latter are reminiscent of extant P-loop recombinases such as RecA. Overall, these P-loop prototypes compose a plausible description of the sequence, structure, and function of the earliest P-loop NTPases. They also indicate that multifunctionality and dynamic assembly were key in endowing short polypeptides with elaborate, evolutionarily relevant functions.
Collapse
|
15
|
Tee WV, Tan ZW, Lee K, Guarnera E, Berezovsky IN. Exploring the Allosteric Territory of Protein Function. J Phys Chem B 2021; 125:3763-3780. [PMID: 33844527 DOI: 10.1021/acs.jpcb.1c00540] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
While the pervasiveness of allostery in proteins is commonly accepted, we further show the generic nature of allosteric mechanisms by analyzing here transmembrane ion-channel viroporin 3a and RNA-dependent RNA polymerase (RdRp) from SARS-CoV-2 along with metabolic enzymes isocitrate dehydrogenase 1 (IDH1) and fumarate hydratase (FH) implicated in cancers. Using the previously developed structure-based statistical mechanical model of allostery (SBSMMA), we share our experience in analyzing the allosteric signaling, predicting latent allosteric sites, inducing and tuning targeted allosteric response, and exploring the allosteric effects of mutations. This, yet incomplete list of phenomenology, forms a complex and unique allosteric territory of protein function, which should be thoroughly explored. We propose a generic computational framework, which not only allows one to obtain a comprehensive allosteric control over proteins but also provides an opportunity to approach the fragment-based design of allosteric effectors and drug candidates. The advantages of allosteric drugs over traditional orthosteric compounds, complemented by the emerging role of the allosteric effects of mutations in the expansion of the cancer mutational landscape and in the increased mutability of viral proteins, leave no choice besides further extensive studies of allosteric mechanisms and their biomedical implications.
Collapse
Affiliation(s)
- Wei-Ven Tee
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore.,Department of Biological Sciences (DBS), National University of Singapore (NUS), 8 Medical Drive, 117597, Singapore
| | - Zhen Wah Tan
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore
| | - Keene Lee
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore
| | - Enrico Guarnera
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore
| | - Igor N Berezovsky
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore.,Department of Biological Sciences (DBS), National University of Singapore (NUS), 8 Medical Drive, 117597, Singapore
| |
Collapse
|
16
|
Liu M, Yu J, Lv B, Hou Y, Liu X, Feng X, Li C. Improving the activity and thermostability of GH2 β-glucuronidases via domain reassembly. Biotechnol Bioeng 2021; 118:1962-1972. [PMID: 33559890 DOI: 10.1002/bit.27710] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2021] [Accepted: 01/31/2021] [Indexed: 11/07/2022]
Abstract
Glycoside hydrolase family 2 (GH2) enzymes are generally composed of three domains: TIM-barrel domain (TIM), immunoglobulin-like β-sandwich domain (ISD), and sugar-binding domain (SBD). The combination of these three domains yields multiple structural combinations with different properties. Theoretically, the drawbacks of a given GH2 fold may be circumvented by efficiently reassembling the three domains. However, very few successful cases have been reported. In this study, we used six GH2 β-glucuronidases (GUSs) from bacteria, fungi, or humans as model enzymes and constructed a series of mutants by reassembling the domains from different GUSs. The mutants PGUS-At, GUS-PAA, and GUS-PAP, with reassembled domains from fungal GUSs, showed improved expression levels, activity, and thermostability, respectively. Specifically, compared to the parental enzyme, the mutant PGUS-At displayed 3.8 times higher expression, the mutant GUS-PAA displayed 1.0 time higher catalytic efficiency (kcat /Km ), and the mutant GUS-PAP displayed 7.5 times higher thermostability at 65°C. Furthermore, two-hybrid mutants, GUS-AEA and GUS-PEP, were constructed with the ISD from a bacterial GUS and SBD and TIM domain from fungal GUSs. GUS-AEA and GUS-PEP showed 30.4% and 23.0% higher thermostability than GUS-PAP, respectively. Finally, molecular dynamics simulations were conducted to uncover the molecular reasons for the increased thermostability of the mutant.
Collapse
Affiliation(s)
- Mingzhu Liu
- Key Laboratory of Medical Molecule Science and Pharmaceutics Engineering, Ministry of Industry and Information Technology, Institute of Biochemical Engineering, School of Chemistry and Chemical Engineering, Beijing Institute of Technology, Beijing, PR China
| | - Jing Yu
- Key Laboratory of Medical Molecule Science and Pharmaceutics Engineering, Ministry of Industry and Information Technology, Institute of Biochemical Engineering, School of Chemistry and Chemical Engineering, Beijing Institute of Technology, Beijing, PR China
| | - Bo Lv
- Key Laboratory of Medical Molecule Science and Pharmaceutics Engineering, Ministry of Industry and Information Technology, Institute of Biochemical Engineering, School of Chemistry and Chemical Engineering, Beijing Institute of Technology, Beijing, PR China
| | - Yuhui Hou
- Key Laboratory of Medical Molecule Science and Pharmaceutics Engineering, Ministry of Industry and Information Technology, Institute of Biochemical Engineering, School of Chemistry and Chemical Engineering, Beijing Institute of Technology, Beijing, PR China
| | - Xinhe Liu
- Key Laboratory of Medical Molecule Science and Pharmaceutics Engineering, Ministry of Industry and Information Technology, Institute of Biochemical Engineering, School of Chemistry and Chemical Engineering, Beijing Institute of Technology, Beijing, PR China
| | - Xudong Feng
- Key Laboratory of Medical Molecule Science and Pharmaceutics Engineering, Ministry of Industry and Information Technology, Institute of Biochemical Engineering, School of Chemistry and Chemical Engineering, Beijing Institute of Technology, Beijing, PR China
| | - Chun Li
- Key Laboratory of Medical Molecule Science and Pharmaceutics Engineering, Ministry of Industry and Information Technology, Institute of Biochemical Engineering, School of Chemistry and Chemical Engineering, Beijing Institute of Technology, Beijing, PR China.,Key Laboratory for Industrial Biocatalysis, Ministry of Education, Department of Chemical Engineering, Tsinghua University, Beijing, PR China.,Center for Synthetic & Systems Biology, Tsinghua University, Beijing, PR China
| |
Collapse
|
17
|
Tian P, Best RB. Exploring the sequence fitness landscape of a bridge between protein folds. PLoS Comput Biol 2020; 16:e1008285. [PMID: 33048928 PMCID: PMC7553338 DOI: 10.1371/journal.pcbi.1008285] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2020] [Accepted: 08/24/2020] [Indexed: 12/15/2022] Open
Abstract
Most foldable protein sequences adopt only a single native fold. Recent protein design studies have, however, created protein sequences which fold into different structures apon changes of environment, or single point mutation, the best characterized example being the switch between the folds of the GA and GB binding domains of streptococcal protein G. To obtain further insight into the design of sequences which can switch folds, we have used a computational model for the fitness landscape of a single fold, built from the observed sequence variation of protein homologues. We have recently shown that such coevolutionary models can be used to design novel foldable sequences. By appropriately combining two of these models to describe the joint fitness landscape of GA and GB, we are able to describe the propensity of a given sequence for each of the two folds. We have successfully tested the combined model against the known series of designed GA/GB hybrids. Using Monte Carlo simulations on this landscape, we are able to identify pathways of mutations connecting the two folds. In the absence of a requirement for domain stability, the most frequent paths go via sequences in which neither domain is stably folded, reminiscent of the propensity for certain intrinsically disordered proteins to fold into different structures according to context. Even if the folded state is required to be stable, we find that there is nonetheless still a wide range of sequences which are close to the transition region and therefore likely fold switches, consistent with recent estimates that fold switching may be more widespread than had been thought.
Collapse
Affiliation(s)
- Pengfei Tian
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland, U.S.A
| | - Robert B. Best
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland, U.S.A
| |
Collapse
|
18
|
Allosteric drugs and mutations: chances, challenges, and necessity. Curr Opin Struct Biol 2020; 62:149-157. [DOI: 10.1016/j.sbi.2020.01.010] [Citation(s) in RCA: 51] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2019] [Accepted: 01/16/2020] [Indexed: 12/22/2022]
|