1
|
Anderson NT, Xie JS, Chacko AN, Liu VL, Fan KC, Mukherjee A. Rational Design of a Circularly Permuted Flavin-Based Fluorescent Protein. Chembiochem 2024; 25:e202300814. [PMID: 38356332 PMCID: PMC11065581 DOI: 10.1002/cbic.202300814] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Revised: 02/12/2024] [Accepted: 02/14/2024] [Indexed: 02/16/2024]
Abstract
Flavin-based fluorescent proteins are oxygen-independent reporters that hold great promise for imaging anaerobic and hypoxic biological systems. In this study, we explored the feasibility of applying circular permutation, a valuable method for the creation of fluorescent sensors, to flavin-based fluorescent proteins. We used rational design and structural data to identify a suitable location for circular permutation in iLOV, a flavin-based reporter derived from A. thaliana. However, relocating the N- and C-termini to this position resulted in a significant reduction in fluorescence. This loss of fluorescence was reversible, however, by fusing dimerizing coiled coils at the new N- and C-termini to compensate for the increase in local chain entropy. Additionally, by inserting protease cleavage sites in circularly permuted iLOV, we developed two protease sensors and demonstrated their application in mammalian cells. In summary, our work establishes the first approach to engineer circularly permuted FbFPs optimized for high fluorescence and further showcases the utility of circularly permuted FbFPs to serve as a scaffold for sensor engineering.
Collapse
Affiliation(s)
| | - Jason S. Xie
- Department of Molecular, Cellular, and Developmental Biology
| | | | - Vannie L. Liu
- Department of Molecular, Cellular, and Developmental Biology
| | | | | |
Collapse
|
2
|
SeqCP: A sequence-based algorithm for searching circularly permuted proteins. Comput Struct Biotechnol J 2022; 21:185-201. [PMID: 36582435 PMCID: PMC9763678 DOI: 10.1016/j.csbj.2022.11.024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2022] [Revised: 11/10/2022] [Accepted: 11/10/2022] [Indexed: 11/16/2022] Open
Abstract
Circular permutation (CP) is a protein sequence rearrangement in which the amino- and carboxyl-termini of a protein can be created in different positions along the imaginary circularized sequence. Circularly permutated proteins usually exhibit conserved three-dimensional structures and functions. By comparing the structures of circular permutants (CPMs), protein research and bioengineering applications can be approached in ways that are difficult to achieve by traditional mutagenesis. Most current CP detection algorithms depend on structural information. Because there is a vast number of proteins with unknown structures, many CP pairs may remain unidentified. An efficient sequence-based CP detector will help identify more CP pairs and advance many protein studies. For instance, some hypothetical proteins may have CPMs with known functions and structures that are informative for functional annotation, but existing structure-based CP search methods cannot be applied when those hypothetical proteins lack structural information. Despite the considerable potential for applications, sequence-based CP search methods have not been well developed. We present a sequence-based method, SeqCP, which analyzes normal and duplicated sequence alignments to identify CPMs and determine candidate CP sites for proteins. SeqCP was trained by data obtained from the Circular Permutation Database and tested with nonredundant datasets from the Protein Data Bank. It shows high reliability in CP identification and achieves an AUC of 0.9. SeqCP has been implemented into a web server available at: http://pcnas.life.nthu.edu.tw/SeqCP/.
Collapse
Key Words
- AUC, area under the ROC curve
- CE, combinatorial extension
- CE-CP, CE with Circular Permutations
- CP, circular permutation
- CPDB, Circular Permutation Database
- CPMs, circular permutants
- CPSARST, Circular Permutation Search Aided by Ramachandran Sequential Transformation
- Circular permutants
- Circular permutation
- MCC, Matthews correlation coefficient
- Protein sequence analysis
- Protein structure modeling
- RMSD, root-mean-square distance
- ROC, receiver operating characteristic
Collapse
|
3
|
Blaber M. Variable and Conserved Regions of Secondary Structure in the β-Trefoil Fold: Structure Versus Function. Front Mol Biosci 2022; 9:889943. [PMID: 35517858 PMCID: PMC9062101 DOI: 10.3389/fmolb.2022.889943] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Accepted: 04/01/2022] [Indexed: 11/13/2022] Open
Abstract
β-trefoil proteins exhibit an approximate C3 rotational symmetry. An analysis of the secondary structure for members of this diverse superfamily of proteins indicates that it is comprised of remarkably conserved β-strands and highly-divergent turn regions. A fundamental “minimal” architecture can be identified that is devoid of heterogenous and extended turn regions, and is conserved among all family members. Conversely, the different functional families of β-trefoils can potentially be identified by their unique turn patterns (or turn “signature”). Such analyses provide clues as to the evolution of the β-trefoil family, suggesting a folding/stability role for the β-strands and a functional role for turn regions. This viewpoint can also guide de novo protein design of β-trefoil proteins having novel functionality.
Collapse
Affiliation(s)
- Michael Blaber
- Department of Biomedical Sciences, College of Medicine, Florida State University, Tallahassee, FL, United States
| |
Collapse
|
4
|
Khersonsky O, Fleishman SJ. What Have We Learned from Design of Function in Large Proteins? BIODESIGN RESEARCH 2022; 2022:9787581. [PMID: 37850148 PMCID: PMC10521758 DOI: 10.34133/2022/9787581] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Accepted: 02/21/2022] [Indexed: 10/19/2023] Open
Abstract
The overarching goal of computational protein design is to gain complete control over protein structure and function. The majority of sophisticated binders and enzymes, however, are large and exhibit diverse and complex folds that defy atomistic design calculations. Encouragingly, recent strategies that combine evolutionary constraints from natural homologs with atomistic calculations have significantly improved design accuracy. In these approaches, evolutionary constraints mitigate the risk from misfolding and aggregation, focusing atomistic design calculations on a small but highly enriched sequence subspace. Such methods have dramatically optimized diverse proteins, including vaccine immunogens, enzymes for sustainable chemistry, and proteins with therapeutic potential. The new generation of deep learning-based ab initio structure predictors can be combined with these methods to extend the scope of protein design, in principle, to any natural protein of known sequence. We envision that protein engineering will come to rely on completely computational methods to efficiently discover and optimize biomolecular activities.
Collapse
Affiliation(s)
- Olga Khersonsky
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Sarel J. Fleishman
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 7610001, Israel
| |
Collapse
|
5
|
Structural dynamics in the evolution of a bilobed protein scaffold. Proc Natl Acad Sci U S A 2021; 118:2026165118. [PMID: 34845009 PMCID: PMC8694067 DOI: 10.1073/pnas.2026165118] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/20/2021] [Indexed: 11/18/2022] Open
Abstract
Proteins conduct numerous complex biological functions by use of tailored structural dynamics. The molecular details of how these emerged from ancestral peptides remains mysterious. How does nature utilize the same repertoire of folds to diversify function? To shed light on this, we analyzed bilobed proteins with a common structural core, which is spread throughout the tree of life and is involved in diverse biological functions such as transcription, enzymatic catalysis, membrane transport, and signaling. We show here that the structural dynamics of the structural core differentiate predominantly via terminal additions during a long-period evolution. This diversifies substrate specificity and, ultimately, biological function. Novel biophysical tools allow the structural dynamics of proteins and the regulation of such dynamics by binding partners to be explored in unprecedented detail. Although this has provided critical insights into protein function, the means by which structural dynamics direct protein evolution remain poorly understood. Here, we investigated how proteins with a bilobed structure, composed of two related domains from the periplasmic-binding protein–like II domain family, have undergone divergent evolution, leading to adaptation of their structural dynamics. We performed a structural analysis on ∼600 bilobed proteins with a common primordial structural core, which we complemented with biophysical studies to explore the structural dynamics of selected examples by single-molecule Förster resonance energy transfer and Hydrogen–Deuterium exchange mass spectrometry. We show that evolutionary modifications of the structural core, largely at its termini, enable distinct structural dynamics, allowing the diversification of these proteins into transcription factors, enzymes, and extracytoplasmic transport-related proteins. Structural embellishments of the core created interdomain interactions that stabilized structural states, reshaping the active site geometry, and ultimately altered substrate specificity. Our findings reveal an as-yet-unrecognized mechanism for the emergence of functional promiscuity during long periods of evolution and are applicable to a large number of domain architectures.
Collapse
|
6
|
Zhao VY, Rodrigues JV, Lozovsky ER, Hartl DL, Shakhnovich EI. Switching an active site helix in dihydrofolate reductase reveals limits to subdomain modularity. Biophys J 2021; 120:4738-4750. [PMID: 34571014 PMCID: PMC8595743 DOI: 10.1016/j.bpj.2021.09.032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2021] [Revised: 09/14/2021] [Accepted: 09/22/2021] [Indexed: 11/23/2022] Open
Abstract
To what degree are individual structural elements within proteins modular such that similar structures from unrelated proteins can be interchanged? We study subdomain modularity by creating 20 chimeras of an enzyme, Escherichia coli dihydrofolate reductase (DHFR), in which a catalytically important, 10-residue α-helical sequence is replaced by α-helical sequences from a diverse set of proteins. The chimeras stably fold but have a range of diminished thermal stabilities and catalytic activities. Evolutionary coupling analysis indicates that the residues of this α-helix are under selection pressure to maintain catalytic activity in DHFR. Reversion to phenylalanine at key position 31 was found to partially restore catalytic activity, which could be explained by evolutionary coupling values. We performed molecular dynamics simulations using replica exchange with solute tempering. Chimeras with low catalytic activity exhibit nonhelical conformations that block the binding site and disrupt the positioning of the catalytically essential residue D27. Simulation observables and in vitro measurements of thermal stability and substrate-binding affinity are strongly correlated. Several E. coli strains with chromosomally integrated chimeric DHFRs can grow, with growth rates that follow predictions from a kinetic flux model that depends on the intracellular abundance and catalytic activity of DHFR. Our findings show that although α-helices are not universally substitutable, the molecular and fitness effects of modular segments can be predicted by the biophysical compatibility of the replacement segment.
Collapse
Affiliation(s)
- Victor Y Zhao
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts
| | - João V Rodrigues
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts
| | - Elena R Lozovsky
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts
| | - Daniel L Hartl
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts
| | - Eugene I Shakhnovich
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts.
| |
Collapse
|
7
|
Dishman AF, Tyler RC, Fox JC, Kleist AB, Prehoda KE, Babu MM, Peterson FC, Volkman BF. Evolution of fold switching in a metamorphic protein. Science 2021; 371:86-90. [PMID: 33384377 PMCID: PMC8017559 DOI: 10.1126/science.abd8700] [Citation(s) in RCA: 41] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2020] [Accepted: 11/11/2020] [Indexed: 12/14/2022]
Abstract
Metamorphic proteins switch between different folds, defying the protein folding paradigm. It is unclear how fold switching arises during evolution. With ancestral reconstruction and nuclear magnetic resonance, we studied the evolution of the metamorphic human protein XCL1, which has two distinct folds with different functions, making it an unusual member of the chemokine family, whose members generally adopt one conserved fold. XCL1 evolved from an ancestor with the chemokine fold. Evolution of a dimer interface, changes in structural constraints and molecular strain, and alteration of intramolecular protein contacts drove the evolution of metamorphosis. Then, XCL1 likely evolved to preferentially populate the noncanonical fold before reaching its modern-day near-equal population of folds. These discoveries illuminate how one sequence has evolved to encode multiple structures, revealing principles for protein design and engineering.
Collapse
Affiliation(s)
- Acacia F Dishman
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA
- Medical Scientist Training Program, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Robert C Tyler
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Jamie C Fox
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Andrew B Kleist
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA
- Medical Scientist Training Program, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Kenneth E Prehoda
- Institute of Molecular Biology, Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR, USA
| | - M Madan Babu
- MRC Laboratory of Molecular Biology, Cambridge, UK
- Department of Structural Biology and Center for Data Driven Discovery, St. Jude Children's Research Hospital, Memphis, TN, USA
| | - Francis C Peterson
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Brian F Volkman
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA.
| |
Collapse
|
8
|
Nagata Y, Kato H, Ohtsubo Y, Tsuda M. Lessons from the genomes of lindane-degrading sphingomonads. ENVIRONMENTAL MICROBIOLOGY REPORTS 2019; 11:630-644. [PMID: 31063253 DOI: 10.1111/1758-2229.12762] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/31/2019] [Revised: 04/29/2019] [Accepted: 05/02/2019] [Indexed: 05/27/2023]
Abstract
Bacterial strains capable of degrading man-made xenobiotic compounds are good materials to study bacterial evolution towards new metabolic functions. Lindane (γ-hexachlorocyclohexane, γ-HCH, or γ-BHC) is an especially good target compound for the purpose, because it is relatively recalcitrant but can be degraded by a limited range of bacterial strains. A comparison of the complete genome sequences of lindane-degrading sphingomonad strains clearly demonstrated that (i) lindane-degrading strains emerged from a number of different ancestral hosts that have recruited lin genes encoding enzymes that are able to channel lindane to central metabolites, (ii) in sphingomonads lin genes have been acquired by horizontal gene transfer mediated by different plasmids and in which IS6100 plays a role in recruitment and distribution of genes, and (iii) IS6100 plays a role in dynamic genome rearrangements providing genetic diversity to different strains and ability to evolve to other states. Lindane-degrading bacteria whose genomes change so easily and quickly are also fascinating starting materials for tracing the bacterial evolution process experimentally in a relatively short time period. As the origin of the specific lin genes remains a mystery, such genes will be useful probes for exploring the cryptic 'gene pool' available to bacteria.
Collapse
Affiliation(s)
- Yuji Nagata
- Graduate School of Life Sciences, Tohoku University, 2-1-1 Katahira, Sendai, 980-8577, Japan
| | - Hiromi Kato
- Graduate School of Life Sciences, Tohoku University, 2-1-1 Katahira, Sendai, 980-8577, Japan
| | - Yoshiyuki Ohtsubo
- Graduate School of Life Sciences, Tohoku University, 2-1-1 Katahira, Sendai, 980-8577, Japan
| | - Masataka Tsuda
- Graduate School of Life Sciences, Tohoku University, 2-1-1 Katahira, Sendai, 980-8577, Japan
| |
Collapse
|
9
|
Atkinson JT, Jones AM, Zhou Q, Silberg JJ. Circular permutation profiling by deep sequencing libraries created using transposon mutagenesis. Nucleic Acids Res 2019; 46:e76. [PMID: 29912470 PMCID: PMC6061844 DOI: 10.1093/nar/gky255] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2017] [Accepted: 03/28/2018] [Indexed: 12/17/2022] Open
Abstract
Deep mutational scanning has been used to create high-resolution DNA sequence maps that illustrate the functional consequences of large numbers of point mutations. However, this approach has not yet been applied to libraries of genes created by random circular permutation, an engineering strategy that is used to create open reading frames that express proteins with altered contact order. We describe a new method, termed circular permutation profiling with DNA sequencing (CPP-seq), which combines a one-step transposon mutagenesis protocol for creating libraries with a functional selection, deep sequencing and computational analysis to obtain unbiased insight into a protein's tolerance to circular permutation. Application of this method to an adenylate kinase revealed that CPP-seq creates two types of vectors encoding each circularly permuted gene, which differ in their ability to express proteins. Functional selection of this library revealed that >65% of the sampled vectors that express proteins are enriched relative to those that cannot translate proteins. Mapping enriched sequences onto structure revealed that the mobile AMP binding and rigid core domains display greater tolerance to backbone fragmentation than the mobile lid domain, illustrating how CPP-seq can be used to relate a protein's biophysical characteristics to the retention of activity upon permutation.
Collapse
Affiliation(s)
- Joshua T Atkinson
- Systems, Synthetic, and Physical Biology Graduate Program, Rice University, 6100 Main MS-180, Houston, TX 77005, USA
| | - Alicia M Jones
- Department of BioSciences, Rice University, MS-140, 6100 Main Street, Houston, TX 77005, USA
| | - Quan Zhou
- Department of Statistics, Rice University, 6100 Main Street, Houston, TX 77005, USA
| | - Jonathan J Silberg
- Department of BioSciences, Rice University, MS-140, 6100 Main Street, Houston, TX 77005, USA.,Department of Bioengineering, Rice University, 6100 Main Street, Houston, TX 77005, USA
| |
Collapse
|
10
|
Abstract
The combination of modern biotechnologies such as DNA synthesis, λ red recombineering, CRISPR-based editing and next-generation high-throughput sequencing increasingly enables precise manipulation of genes and genomes. Beyond rational design, these technologies also enable the targeted, and potentially continuous, introduction of multiple mutations. While this might seem to be merely a return to natural selection, the ability to target evolution greatly reduces fitness burdens and focuses mutation and selection on those genes and traits that best contribute to a desired phenotype, ultimately throwing evolution into fast forward.
Collapse
|
11
|
Zhong Z, Liu CC. Probing pathways of adaptation with continuous evolution. CURRENT OPINION IN SYSTEMS BIOLOGY 2019; 14:18-24. [PMID: 31608311 PMCID: PMC6788780 DOI: 10.1016/j.coisb.2019.02.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Affiliation(s)
- Ziwei Zhong
- Department of Biomedical Engineering, University of California, Irvine, Irvine, CA 92697, USA
| | - Chang C. Liu
- Department of Biomedical Engineering, University of California, Irvine, Irvine, CA 92697, USA
- Department of Chemistry, University of California, Irvine, Irvine, CA 92697, USA
- Department of Molecular Biology and Biochemistry, University of California, Irvine, Irvine, CA 92697, USA
- Lead Contact
| |
Collapse
|
12
|
Bandyopadhyay B, Peleg Y. Facilitating circular permutation using Restriction Free (RF) cloning. Protein Eng Des Sel 2019; 31:65-68. [PMID: 29319799 DOI: 10.1093/protein/gzx061] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2017] [Accepted: 11/14/2017] [Indexed: 02/02/2023] Open
Abstract
Circular permutation is a powerful tool to test the role of topology in protein folding and function. Previous methods for generating circular permutants were based on rearranging gene elements using restriction enzymes-based cloning. Here, we present a Restriction Free (RF) approach to achieve circular permutation which is faster and more cost-effective.
Collapse
Affiliation(s)
| | - Yoav Peleg
- The Israel Structural Proteomics Center (ISPC), Weizmann Institute of Science, Rehovot 7610001, Israel
| |
Collapse
|
13
|
Albert P, Varga B, Zsibrita N, Kiss A. Circularly permuted variants of two CG-specific prokaryotic DNA methyltransferases. PLoS One 2018; 13:e0197232. [PMID: 29746549 PMCID: PMC5944983 DOI: 10.1371/journal.pone.0197232] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2018] [Accepted: 04/27/2018] [Indexed: 01/06/2023] Open
Abstract
The highly similar prokaryotic DNA (cytosine-5) methyltransferases (C5-MTases) M.MpeI and M.SssI share the specificity of eukaryotic C5-MTases (5'-CG), and can be useful research tools in the study of eukaryotic DNA methylation and epigenetic regulation. In an effort to improve the stability and solubility of complementing fragments of the two MTases, genes encoding circularly permuted (CP) variants of M.MpeI and M.SssI were created, and cloned in a plasmid vector downstream of an arabinose-inducible promoter. MTase activity of the CP variants was tested by digestion of the plasmids with methylation-sensitive restriction enzymes. Eleven of the fourteen M.MpeI permutants and six of the seven M.SssI permutants had detectable MTase activity as indicated by the full or partial protection of the plasmid carrying the cpMTase gene. Permutants cp62M.MpeI and cp58M.SssI, in which the new N-termini are located between conserved motifs II and III, had by far the highest activity. The activity of cp62M.MpeI was comparable to the activity of wild-type M.MpeI. Based on the location of the split sites, the permutants possessing MTase activity can be classified in ten types. Although most permutation sites were designed to fall outside of conserved motifs, and the MTase activity of the permutants measured in cell extracts was in most cases substantially lower than that of the wild-type enzyme, the high proportion of circular permutation topologies compatible with MTase activity is remarkable, and is a new evidence for the structural plasticity of C5-MTases. A computer search of the REBASE database identified putative C5-MTases with CP arrangement. Interestingly, all natural circularly permuted C5-MTases appear to represent only one of the ten types of permutation topology created in this work.
Collapse
Affiliation(s)
- Pál Albert
- Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
- Doctoral School in Biology, Faculty of Science and Informatics, University of Szeged, Szeged, Hungary
| | - Bence Varga
- Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Nikolett Zsibrita
- Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
- Doctoral School in Biology, Faculty of Science and Informatics, University of Szeged, Szeged, Hungary
| | - Antal Kiss
- Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| |
Collapse
|
14
|
Molecular and Functional Study of a Branching Sucrase-Like Glucansucrase Reveals an Evolutionary Intermediate between Two Subfamilies of the GH70 Enzymes. Appl Environ Microbiol 2018; 84:AEM.02810-17. [PMID: 29453261 DOI: 10.1128/aem.02810-17] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2017] [Accepted: 01/21/2018] [Indexed: 11/20/2022] Open
Abstract
Glucansucrases (GSs) in glycoside hydrolase family 70 (GH70) catalyze the synthesis of α-glucans from sucrose, a reaction that is widely seen in lactic acid bacteria (LAB). These enzymes have been implicated in many aspects of microbial life. Products of GSs have great commercial value as food supplements and medical materials; therefore, these enzymes have attracted much attention from both science and industry. Certain issues concerning the origin and evolution of GSs are still to be addressed, although an increasing number of GH70 enzymes have been characterized. This study describes a GS enzyme with the appearance of a branching sucrase (BrS). Structural analysis indicated that this GS enzyme produced a type of glucan composed of an α-(1→6) glucosidic backbone and α-(1→4) branches, as well as a considerable amount of α-(1→3) branches, distinguishing it from the GSs identified so far. Moreover, sequence-based analysis of the catalytic core of this enzyme suggested that it might be an evolutionary intermediate between the BrS and GS subgroups. These results provide an evolutionary link between these subgroups of GH70 enzymes and shed new light on the origination of GSs.IMPORTANCE GH70 GSs catalyze the synthesis of α-glucans from sucrose, a reaction that is widely seen in LAB. Products of these enzymes have great commercial value as food supplements and medical materials. Moreover, these enzymes have attracted much attention from scientists because they have potential in tailored synthesis of α-glucans with desired structures and properties. Although more and more GSs have been characterized, the origin and evolution of these enzymes have not been well addressed. This study describes a GS with the appearance of a BrS (i.e., high levels of similarity to BrSs in sequence analysis). Further analysis indicated that this enzyme synthesized a type of insoluble glucan composed of an α-(1→6) glucosidic backbone and many α-(1→4)- and α-(1→3)-linked branches, the linkage composition of which has rarely been reported in the literature. This BrS-like GS enzyme might be an evolutionary intermediate between BrS and GS enzymes.
Collapse
|
15
|
Natan E, Endoh T, Haim-Vilmovsky L, Flock T, Chalancon G, Hopper JTS, Kintses B, Horvath P, Daruka L, Fekete G, Pál C, Papp B, Oszi E, Magyar Z, Marsh JA, Elcock AH, Babu MM, Robinson CV, Sugimoto N, Teichmann SA. Cotranslational protein assembly imposes evolutionary constraints on homomeric proteins. Nat Struct Mol Biol 2018; 25:279-288. [PMID: 29434345 PMCID: PMC5995306 DOI: 10.1038/s41594-018-0029-5] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2017] [Accepted: 01/10/2018] [Indexed: 01/11/2023]
Abstract
Cotranslational protein folding can facilitate rapid formation of functional structures. However, it can also cause premature assembly of protein complexes, if two interacting nascent chains are in close proximity. By analyzing known protein structures, we show that homomeric protein contacts are enriched toward the C termini of polypeptide chains across diverse proteomes. We hypothesize that this is the result of evolutionary constraints for folding to occur before assembly. Using high-throughput imaging of protein homomers in Escherichia coli and engineered protein constructs with N- and C-terminal oligomerization domains, we show that, indeed, proteins with C-terminal homomeric interface residues consistently assemble more efficiently than those with N-terminal interface residues. Using in vivo, in vitro and in silico experiments, we identify features that govern successful assembly of homomers, which have implications for protein design and expression optimization.
Collapse
Affiliation(s)
| | - Tamaki Endoh
- Frontier Institute for Biomolecular Engineering Research (FIBER), Konan University, Kobe, Japan
| | - Liora Haim-Vilmovsky
- EMBL-European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge, UK
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Cambridge, UK
| | - Tilman Flock
- MRC Laboratory of Molecular Biology, Cambridge, UK
| | | | | | - Bálint Kintses
- Synthetic and System Biology Unit, Biological Research Center of the Hungarian Academia of Sciences, Szeged, Hungary
| | - Peter Horvath
- Synthetic and System Biology Unit, Biological Research Center of the Hungarian Academia of Sciences, Szeged, Hungary
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
| | - Lejla Daruka
- Synthetic and System Biology Unit, Biological Research Center of the Hungarian Academia of Sciences, Szeged, Hungary
| | - Gergely Fekete
- Synthetic and System Biology Unit, Biological Research Center of the Hungarian Academia of Sciences, Szeged, Hungary
| | - Csaba Pál
- Synthetic and System Biology Unit, Biological Research Center of the Hungarian Academia of Sciences, Szeged, Hungary
| | - Balázs Papp
- Synthetic and System Biology Unit, Biological Research Center of the Hungarian Academia of Sciences, Szeged, Hungary
| | - Erika Oszi
- Institute of Plant Biology, Biological Research Center of the Hungarian Academia of Sciences, Szeged, Hungary
| | - Zoltán Magyar
- Institute of Plant Biology, Biological Research Center of the Hungarian Academia of Sciences, Szeged, Hungary
| | - Joseph A Marsh
- MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, UK
| | - Adrian H Elcock
- Department of Biochemistry, University of Iowa, Iowa City, IA, USA
| | - M Madan Babu
- MRC Laboratory of Molecular Biology, Cambridge, UK
| | | | - Naoki Sugimoto
- Frontier Institute for Biomolecular Engineering Research (FIBER), Konan University, Kobe, Japan
- Graduate School of Frontiers of Innovative Research in Science and Technology (FIRST), Konan University, Kobe, Japan
| | - Sarah A Teichmann
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Cambridge, UK.
- Cavendish Laboratory, University of Cambridge, Cambridge, UK.
| |
Collapse
|
16
|
The Role of Evolutionary Selection in the Dynamics of Protein Structure Evolution. Biophys J 2017; 112:1350-1365. [PMID: 28402878 DOI: 10.1016/j.bpj.2017.02.029] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2016] [Revised: 02/16/2017] [Accepted: 02/22/2017] [Indexed: 02/05/2023] Open
Abstract
Homology modeling is a powerful tool for predicting a protein's structure. This approach is successful because proteins whose sequences are only 30% identical still adopt the same structure, while structure similarity rapidly deteriorates beyond the 30% threshold. By studying the divergence of protein structure as sequence evolves in real proteins and in evolutionary simulations, we show that this nonlinear sequence-structure relationship emerges as a result of selection for protein folding stability in divergent evolution. Fitness constraints prevent the emergence of unstable protein evolutionary intermediates, thereby enforcing evolutionary paths that preserve protein structure despite broad sequence divergence. However, on longer timescales, evolution is punctuated by rare events where the fitness barriers obstructing structure evolution are overcome and discovery of new structures occurs. We outline biophysical and evolutionary rationale for broad variation in protein family sizes, prevalence of compact structures among ancient proteins, and more rapid structure evolution of proteins with lower packing density.
Collapse
|
17
|
Yachnin BJ, Khare SD. Engineering carboxypeptidase G2 circular permutations for the design of an autoinhibited enzyme. Protein Eng Des Sel 2017; 30:321-331. [PMID: 28160000 PMCID: PMC6283397 DOI: 10.1093/protein/gzx005] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2016] [Revised: 01/11/2017] [Accepted: 01/18/2017] [Indexed: 11/14/2022] Open
Abstract
Carboxypeptidase G2 (CPG2) is an Food and Drug Administration (FDA)-approved enzyme drug used to treat methotrexate (MTX) toxicity in cancer patients receiving MTX treatment. It has also been used in directed enzyme-prodrug chemotherapy, but this strategy has been hampered by off-site activation of the prodrug by the circulating enzyme. The development of a tumor protease activatable CPG2, which could be achieved using a circular permutation of CPG2 fused to an inactivating 'prodomain', would aid in these applications. We report the development of a protease accessibility-based screen to identify candidate sites for circular permutation in proximity of the CPG2 active site. The resulting six circular permutants showed similar expression, structure, thermal stability, and, in four cases, activity levels compared to the wild-type enzyme. We rationalize these results based on structural models of the permutants obtained using the Rosetta software. We developed a cell growth-based selection system, and demonstrated that when fused to periplasm-directing signal peptides, one of our circular permutants confers MTX resistance in Escherichia coli with equal efficiency as the wild-type enzyme. As the permutants have similar properties to wild-type CPG2, these enzymes are promising starting points for the development of autoinhibited, protease-activatable zymogen forms of CPG2 for use in therapeutic contexts.
Collapse
Affiliation(s)
- Brahm J. Yachnin
- Department of Chemistry & Chemical Biology and the Center for Integrative Proteomics, Rutgers University, Piscataway, NJ 08854, USA
| | - Sagar D. Khare
- Department of Chemistry & Chemical Biology and the Center for Integrative Proteomics, Rutgers University, Piscataway, NJ 08854, USA
| |
Collapse
|
18
|
Meng X, Gangoiti J, Bai Y, Pijning T, Van Leeuwen SS, Dijkhuizen L. Structure-function relationships of family GH70 glucansucrase and 4,6-α-glucanotransferase enzymes, and their evolutionary relationships with family GH13 enzymes. Cell Mol Life Sci 2016; 73:2681-706. [PMID: 27155661 PMCID: PMC4919382 DOI: 10.1007/s00018-016-2245-7] [Citation(s) in RCA: 63] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2016] [Accepted: 04/22/2016] [Indexed: 12/13/2022]
Abstract
Lactic acid bacteria (LAB) are known to produce large amounts of α-glucan exopolysaccharides. Family GH70 glucansucrase (GS) enzymes catalyze the synthesis of these α-glucans from sucrose. The elucidation of the crystal structures of representative GS enzymes has advanced our understanding of their reaction mechanism, especially structural features determining their linkage specificity. In addition, with the increase of genome sequencing, more and more GS enzymes are identified and characterized. Together, such knowledge may promote the synthesis of α-glucans with desired structures and properties from sucrose. In the meantime, two new GH70 subfamilies (GTFB- and GTFC-like) have been identified as 4,6-α-glucanotransferases (4,6-α-GTs) that represent novel evolutionary intermediates between the family GH13 and "classical GH70 enzymes". These enzymes are not active on sucrose; instead, they use (α1 → 4) glucans (i.e. malto-oligosaccharides and starch) as substrates to synthesize novel α-glucans by introducing linear chains of (α1 → 6) linkages. All these GH70 enzymes are very interesting biocatalysts and hold strong potential for applications in the food, medicine and cosmetic industries. In this review, we summarize the microbiological distribution and the structure-function relationships of family GH70 enzymes, introduce the two newly identified GH70 subfamilies, and discuss evolutionary relationships between family GH70 and GH13 enzymes.
Collapse
Affiliation(s)
- Xiangfeng Meng
- Microbial Physiology, Groningen Biomolecular Sciences and Biotechnology Institute (GBB), University of Groningen, Nijenborgh 7, 9747, AG, Groningen, The Netherlands
| | - Joana Gangoiti
- Microbial Physiology, Groningen Biomolecular Sciences and Biotechnology Institute (GBB), University of Groningen, Nijenborgh 7, 9747, AG, Groningen, The Netherlands
| | - Yuxiang Bai
- Microbial Physiology, Groningen Biomolecular Sciences and Biotechnology Institute (GBB), University of Groningen, Nijenborgh 7, 9747, AG, Groningen, The Netherlands
| | - Tjaard Pijning
- Biophysical Chemistry, Groningen Biomolecular Sciences and Biotechnology Institute (GBB), University of Groningen, Nijenborgh 7, 9747, AG, Groningen, The Netherlands
| | - Sander S Van Leeuwen
- Microbial Physiology, Groningen Biomolecular Sciences and Biotechnology Institute (GBB), University of Groningen, Nijenborgh 7, 9747, AG, Groningen, The Netherlands
| | - Lubbert Dijkhuizen
- Microbial Physiology, Groningen Biomolecular Sciences and Biotechnology Institute (GBB), University of Groningen, Nijenborgh 7, 9747, AG, Groningen, The Netherlands.
| |
Collapse
|
19
|
Pandey N, Kuypers BE, Nassif B, Thomas EE, Alnahhas RN, Segatori L, Silberg JJ. Tolerance of a Knotted Near-Infrared Fluorescent Protein to Random Circular Permutation. Biochemistry 2016; 55:3763-73. [PMID: 27304983 DOI: 10.1021/acs.biochem.6b00258] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Bacteriophytochrome photoreceptors (BphP) are knotted proteins that have been developed as near-infrared fluorescent protein (iRFP) reporters of gene expression. To explore how rearrangements in the peptides that interlace into the knot within the BphP photosensory core affect folding, we subjected iRFPs to random circular permutation using an improved transposase mutagenesis strategy and screened for variants that fluoresce. We identified 27 circularly permuted iRFPs that display biliverdin-dependent fluorescence in Escherichia coli. The variants with the brightest whole cell fluorescence initiated translation at residues near the domain linker and knot tails, although fluorescent variants that initiated translation within the PAS and GAF domains were discovered. Circularly permuted iRFPs retained sufficient cofactor affinity to fluoresce in tissue culture without the addition of biliverdin, and one variant displayed enhanced fluorescence when expressed in bacteria and tissue culture. This variant displayed a quantum yield similar to that of iRFPs but exhibited increased resistance to chemical denaturation, suggesting that the observed increase in the magnitude of the signal arose from more efficient protein maturation. These results show how the contact order of a knotted BphP can be altered without disrupting chromophore binding and fluorescence, an important step toward the creation of near-infrared biosensors with expanded chemical sensing functions for in vivo imaging.
Collapse
Affiliation(s)
- Naresh Pandey
- Department of Biosciences, Rice University , Houston, Texas 77005, United States.,Biochemistry and Cell Biology Graduate Program, Rice University , Houston, Texas 77005, United States
| | - Brianna E Kuypers
- Systems, Synthetic, and Physical Biology Graduate Program, Rice University , Houston, Texas 77005, United States.,Department of Chemical and Biomolecular Engineering, Rice University , Houston, Texas 77005, United States
| | - Barbara Nassif
- Department of Biosciences, Rice University , Houston, Texas 77005, United States
| | - Emily E Thomas
- Department of Biosciences, Rice University , Houston, Texas 77005, United States.,Biochemistry and Cell Biology Graduate Program, Rice University , Houston, Texas 77005, United States
| | - Razan N Alnahhas
- Department of Biosciences, Rice University , Houston, Texas 77005, United States.,Biochemistry and Cell Biology Graduate Program, Rice University , Houston, Texas 77005, United States
| | - Laura Segatori
- Department of Biosciences, Rice University , Houston, Texas 77005, United States.,Department of Chemical and Biomolecular Engineering, Rice University , Houston, Texas 77005, United States.,Department of Bioengineering, Rice University , Houston, Texas 77005, United States
| | - Jonathan J Silberg
- Department of Biosciences, Rice University , Houston, Texas 77005, United States.,Department of Bioengineering, Rice University , Houston, Texas 77005, United States
| |
Collapse
|
20
|
Weigele P, Raleigh EA. Biosynthesis and Function of Modified Bases in Bacteria and Their Viruses. Chem Rev 2016; 116:12655-12687. [PMID: 27319741 DOI: 10.1021/acs.chemrev.6b00114] [Citation(s) in RCA: 120] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Abstract
Naturally occurring modification of the canonical A, G, C, and T bases can be found in the DNA of cellular organisms and viruses from all domains of life. Bacterial viruses (bacteriophages) are a particularly rich but still underexploited source of such modified variant nucleotides. The modifications conserve the coding and base-pairing functions of DNA, but add regulatory and protective functions. In prokaryotes, modified bases appear primarily to be part of an arms race between bacteriophages (and other genomic parasites) and their hosts, although, as in eukaryotes, some modifications have been adapted to convey epigenetic information. The first half of this review catalogs the identification and diversity of DNA modifications found in bacteria and bacteriophages. What is known about the biogenesis, context, and function of these modifications are also described. The second part of the review places these DNA modifications in the context of the arms race between bacteria and bacteriophages. It focuses particularly on the defense and counter-defense strategies that turn on direct recognition of the presence of a modified base. Where modification has been shown to affect other DNA transactions, such as expression and chromosome segregation, that is summarized, with reference to recent reviews.
Collapse
Affiliation(s)
- Peter Weigele
- Chemical Biology, New England Biolabs , Ipswich, Massachusetts 01938, United States
| | | |
Collapse
|
21
|
Jones AM, Mehta MM, Thomas EE, Atkinson JT, Segall-Shapiro TH, Liu S, Silberg JJ. The Structure of a Thermophilic Kinase Shapes Fitness upon Random Circular Permutation. ACS Synth Biol 2016; 5:415-25. [PMID: 26976658 DOI: 10.1021/acssynbio.5b00305] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Proteins can be engineered for synthetic biology through circular permutation, a sequence rearrangement in which native protein termini become linked and new termini are created elsewhere through backbone fission. However, it remains challenging to anticipate a protein's functional tolerance to circular permutation. Here, we describe new transposons for creating libraries of randomly circularly permuted proteins that minimize peptide additions at their termini, and we use transposase mutagenesis to study the tolerance of a thermophilic adenylate kinase (AK) to circular permutation. We find that libraries expressing permuted AKs with either short or long peptides amended to their N-terminus yield distinct sets of active variants and present evidence that this trend arises because permuted protein expression varies across libraries. Mapping all sites that tolerate backbone cleavage onto AK structure reveals that the largest contiguous regions of sequence that lack cleavage sites are proximal to the phosphotransfer site. A comparison of our results with a range of structure-derived parameters further showed that retention of function correlates to the strongest extent with the distance to the phosphotransfer site, amino acid variability in an AK family sequence alignment, and residue-level deviations in superimposed AK structures. Our work illustrates how permuted protein libraries can be created with minimal peptide additions using transposase mutagenesis, and it reveals a challenge of maintaining consistent expression across permuted variants in a library that minimizes peptide additions. Furthermore, these findings provide a basis for interpreting responses of thermophilic phosphotransferases to circular permutation by calibrating how different structure-derived parameters relate to retention of function in a cellular selection.
Collapse
Affiliation(s)
- Alicia M. Jones
- Department
of Biosciences, Rice University, MS-140, 6100 Main Street, Houston, Texas 77005, United States
| | - Manan M. Mehta
- Medical
Scientist Training Program, Northwestern University, 303 East
Chicago Avenue, Morton 1-670, Chicago, Illinois 60611, United States
| | - Emily E. Thomas
- Department
of Biosciences, Rice University, MS-140, 6100 Main Street, Houston, Texas 77005, United States
| | - Joshua T. Atkinson
- Systems,
Synthetic, and Physical Biology Graduate Program, Rice University, 6100
Main MS-180, Houston, Texas 77005, United States
| | - Thomas H. Segall-Shapiro
- Department
of Biological Engineering, Synthetic Biology Center, Massachusetts Institute of Technology, 500 Technology Square, NE47-257, Cambridge, Massachusetts 02139, United States
| | - Shirley Liu
- Department
of Biosciences, Rice University, MS-140, 6100 Main Street, Houston, Texas 77005, United States
| | - Jonathan J. Silberg
- Department
of Biosciences, Rice University, MS-140, 6100 Main Street, Houston, Texas 77005, United States
| |
Collapse
|
22
|
Smock RG, Yadid I, Dym O, Clarke J, Tawfik DS. De Novo Evolutionary Emergence of a Symmetrical Protein Is Shaped by Folding Constraints. Cell 2016; 164:476-86. [PMID: 26806127 PMCID: PMC4735018 DOI: 10.1016/j.cell.2015.12.024] [Citation(s) in RCA: 74] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2015] [Revised: 10/05/2015] [Accepted: 12/07/2015] [Indexed: 01/02/2023]
Abstract
Molecular evolution has focused on the divergence of molecular functions, yet we know little about how structurally distinct protein folds emerge de novo. We characterized the evolutionary trajectories and selection forces underlying emergence of β-propeller proteins, a globular and symmetric fold group with diverse functions. The identification of short propeller-like motifs (<50 amino acids) in natural genomes indicated that they expanded via tandem duplications to form extant propellers. We phylogenetically reconstructed 47-residue ancestral motifs that form five-bladed lectin propellers via oligomeric assembly. We demonstrate a functional trajectory of tandem duplications of these motifs leading to monomeric lectins. Foldability, i.e., higher efficiency of folding, was the main parameter leading to improved functionality along the entire evolutionary trajectory. However, folding constraints changed along the trajectory: initially, conflicts between monomer folding and oligomer assembly dominated, whereas subsequently, upon tandem duplication, tradeoffs between monomer stability and foldability took precedence. Inferred 47-aminoacid ancestral motifs fold into functional β-propeller assemblies Motif duplication, fusion, and diversification yield functional monomeric propellers Folding efficiency was the key parameter optimized throughout propeller emergence Single-motif precursors in extant genomes support the reconstructed emergence pathway
Collapse
Affiliation(s)
- Robert G Smock
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Itamar Yadid
- Metabolic Pathways and Enzyme Evolution Laboratory, Migal Galilee Research Institute, Kiryat Shmona 11016, Israel
| | - Orly Dym
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Jane Clarke
- Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, UK
| | - Dan S Tawfik
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot 76100, Israel.
| |
Collapse
|
23
|
The Exiguobacterium sibiricum 255-15 GtfC Enzyme Represents a Novel Glycoside Hydrolase 70 Subfamily of 4,6-α-Glucanotransferase Enzymes. Appl Environ Microbiol 2015; 82:756-66. [PMID: 26590275 DOI: 10.1128/aem.03420-15] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2015] [Accepted: 11/13/2015] [Indexed: 11/20/2022] Open
Abstract
The glycoside hydrolase 70 (GH70) family originally was established for glucansucrase enzymes found solely in lactic acid bacteria synthesizing α-glucan polysaccharides from sucrose (e.g., GtfA). In recent years, we have characterized GtfB and related Lactobacillus enzymes as 4,6-α-glucanotransferase enzymes. These GtfB-type enzymes constitute the first GH70 subfamily of enzymes that are unable to act on sucrose as a substrate but are active with maltodextrins and starch, cleave α1→4 linkages, and synthesize linear α1→6-glucan chains. The GtfB disproportionating type of activity results in the conversion of malto-oligosaccharides into isomalto/malto-polysaccharides with a relatively high percentage of α1→6 linkages. This paper reports the identification of the members of a second GH70 subfamily (designated GtfC enzymes) and the characterization of the Exiguobacterium sibiricum 255-15 GtfC enzyme, which is also inactive with sucrose and displays 4,6-α-glucanotransferase activity with malto-oligosaccharides. GtfC differs from GtfB in synthesizing isomalto/malto-oligosaccharides. Biochemically, the GtfB- and GtfC-type enzymes are related, but phylogenetically, they clearly constitute different GH70 subfamilies, displaying only 30% sequence identity. Whereas the GtfB-type enzyme largely has the same domain order as glucansucrases (with α-amylase domains A, B, and C plus domains IV and V), this GtfC-type enzyme differs in the order of these domains and completely lacks domain V. In GtfC, the sequence of conserved regions I to IV of clan GH-H is identical to that in GH13 (I-II-III-IV) but different from that in GH70 (II-III-IV-I because of a circular permutation of the (β/α)8 barrel. The GtfC 4,6-α-glucanotransferase enzymes thus represent structurally and functionally very interesting evolutionary intermediates between α-amylase and glucansucrase enzymes.
Collapse
|
24
|
Abstract
First discovered in bacteriophage HK97, biological chainmail is a highly stable system formed by concatenated protein rings. Each subunit of the ring contains the HK97-like fold, which is characterized by its submarine-like shape with a 5-stranded β sheet in the axial (A) domain, spine helix in the peripheral (P) domain, and an extended (E) loop. HK97 capsid consists of covalently-linked copies of just one HK97-like fold protein and represents the most effective strategy to form highly stable chainmail needed for dsDNA genome encapsidation. Recently, near-atomic resolution structures enabled by cryo electron microscopy (cryoEM) have revealed a range of other, more complex variants of this strategy for constructing dsDNA viruses. The first strategy, exemplified by P22-like phages, is the attachment of an insertional (I) domain to the core 5-stranded β sheet of the HK97-like fold. The atomic models of the Bordetella phage BPP-1 showcases an alternative topology of the classic HK97 topology of the HK97-like fold, as well as the second strategy for constructing stable capsids, where an auxiliary jellyroll protein dimer serves to cement the non-covalent chainmail formed by capsid protein subunits. The third strategy, found in lambda-like phages, uses auxiliary protein trimers to stabilize the underlying non-covalent chainmail near the 3-fold axis. Herpesviruses represent highly complex viruses that use a combination of these strategies, resulting in four-level hierarchical organization including a non-covalent chainmail formed by the HK97-like fold domain found in the floor region. A thorough understanding of these structures should help unlock the enigma of the emergence and evolution of dsDNA viruses and inform bioengineering efforts based on these viruses.
Collapse
Affiliation(s)
- Z Hong Zhou
- Department of Microbiology, Immunology and Molecular Genetics, University of California, Los Angeles, California 90095, USA.,California NanoSystems Institute (CNSI), University of California, Los Angeles (UCLA), Los Angeles, CA, 90095, USA
| | - Joshua Chiou
- Department of Microbiology, Immunology and Molecular Genetics, University of California, Los Angeles, California 90095, USA
| |
Collapse
|
25
|
Currin A, Swainston N, Day PJ, Kell DB. Synthetic biology for the directed evolution of protein biocatalysts: navigating sequence space intelligently. Chem Soc Rev 2015; 44:1172-239. [PMID: 25503938 PMCID: PMC4349129 DOI: 10.1039/c4cs00351a] [Citation(s) in RCA: 256] [Impact Index Per Article: 28.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2014] [Indexed: 12/21/2022]
Abstract
The amino acid sequence of a protein affects both its structure and its function. Thus, the ability to modify the sequence, and hence the structure and activity, of individual proteins in a systematic way, opens up many opportunities, both scientifically and (as we focus on here) for exploitation in biocatalysis. Modern methods of synthetic biology, whereby increasingly large sequences of DNA can be synthesised de novo, allow an unprecedented ability to engineer proteins with novel functions. However, the number of possible proteins is far too large to test individually, so we need means for navigating the 'search space' of possible protein sequences efficiently and reliably in order to find desirable activities and other properties. Enzymologists distinguish binding (Kd) and catalytic (kcat) steps. In a similar way, judicious strategies have blended design (for binding, specificity and active site modelling) with the more empirical methods of classical directed evolution (DE) for improving kcat (where natural evolution rarely seeks the highest values), especially with regard to residues distant from the active site and where the functional linkages underpinning enzyme dynamics are both unknown and hard to predict. Epistasis (where the 'best' amino acid at one site depends on that or those at others) is a notable feature of directed evolution. The aim of this review is to highlight some of the approaches that are being developed to allow us to use directed evolution to improve enzyme properties, often dramatically. We note that directed evolution differs in a number of ways from natural evolution, including in particular the available mechanisms and the likely selection pressures. Thus, we stress the opportunities afforded by techniques that enable one to map sequence to (structure and) activity in silico, as an effective means of modelling and exploring protein landscapes. Because known landscapes may be assessed and reasoned about as a whole, simultaneously, this offers opportunities for protein improvement not readily available to natural evolution on rapid timescales. Intelligent landscape navigation, informed by sequence-activity relationships and coupled to the emerging methods of synthetic biology, offers scope for the development of novel biocatalysts that are both highly active and robust.
Collapse
Affiliation(s)
- Andrew Currin
- Manchester Institute of Biotechnology , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK . ; http://dbkgroup.org/; @dbkell ; Tel: +44 (0)161 306 4492
- School of Chemistry , The University of Manchester , Manchester M13 9PL , UK
- Centre for Synthetic Biology of Fine and Speciality Chemicals (SYNBIOCHEM) , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK
| | - Neil Swainston
- Manchester Institute of Biotechnology , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK . ; http://dbkgroup.org/; @dbkell ; Tel: +44 (0)161 306 4492
- Centre for Synthetic Biology of Fine and Speciality Chemicals (SYNBIOCHEM) , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK
- School of Computer Science , The University of Manchester , Manchester M13 9PL , UK
| | - Philip J. Day
- Manchester Institute of Biotechnology , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK . ; http://dbkgroup.org/; @dbkell ; Tel: +44 (0)161 306 4492
- Centre for Synthetic Biology of Fine and Speciality Chemicals (SYNBIOCHEM) , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK
- Faculty of Medical and Human Sciences , The University of Manchester , Manchester M13 9PT , UK
| | - Douglas B. Kell
- Manchester Institute of Biotechnology , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK . ; http://dbkgroup.org/; @dbkell ; Tel: +44 (0)161 306 4492
- School of Chemistry , The University of Manchester , Manchester M13 9PL , UK
- Centre for Synthetic Biology of Fine and Speciality Chemicals (SYNBIOCHEM) , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK
| |
Collapse
|
26
|
Zhang D, Iyer LM, Burroughs AM, Aravind L. Resilience of biochemical activity in protein domains in the face of structural divergence. Curr Opin Struct Biol 2014; 26:92-103. [PMID: 24952217 DOI: 10.1016/j.sbi.2014.05.008] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2014] [Accepted: 05/20/2014] [Indexed: 01/07/2023]
Abstract
Recent studies point to the prevalence of the evolutionary phenomenon of drastic structural transformation of protein domains while continuing to preserve their basic biochemical function. These transformations span a wide spectrum, including simple domains incorporated into larger structural scaffolds, changes in the structural core, major active site shifts, topological rewiring and extensive structural transmogrifications. Proteins from biological conflict systems, such as toxin-antitoxin, restriction-modification, CRISPR/Cas, polymorphic toxin and secondary metabolism systems commonly display such transformations. These include endoDNases, metal-independent RNases, deaminases, ADP ribosyltransferases, immunity proteins, kinases and E1-like enzymes. In eukaryotes such transformations are seen in domains involved in chromatin-related peptide recognition and protein/DNA-modification. Intense selective pressures from 'arms-race'-like situations in conflict and macromolecular modification systems could favor drastic structural divergence while preserving function.
Collapse
Affiliation(s)
- Dapeng Zhang
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Lakshminarayan M Iyer
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - A Maxwell Burroughs
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - L Aravind
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
| |
Collapse
|
27
|
Skorupka K, Han SK, Nam HJ, Kim S, Faham S. Protein design by fusion: implications for protein structure prediction and evolution. ACTA CRYSTALLOGRAPHICA SECTION D: BIOLOGICAL CRYSTALLOGRAPHY 2013; 69:2451-60. [PMID: 24311586 DOI: 10.1107/s0907444913022701] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/05/2013] [Accepted: 08/12/2013] [Indexed: 01/21/2023]
Abstract
Domain fusion is a useful tool in protein design. Here, the structure of a fusion of the heterodimeric flagella-assembly proteins FliS and FliC is reported. Although the ability of the fusion protein to maintain the structure of the heterodimer may be apparent, threading-based structural predictions do not properly fuse the heterodimer. Additional examples of naturally occurring heterodimers that are homologous to full-length proteins were identified. These examples highlight that the designed protein was engineered by the same tools as used in the natural evolution of proteins and that heterodimeric structures contain a wealth of information, currently unused, that can improve structural predictions.
Collapse
Affiliation(s)
- Katarzyna Skorupka
- Department of Molecular Physiology and Biological Physics, University of Virginia School of Medicine, Charlottesville, VA 22093, USA
| | | | | | | | | |
Collapse
|
28
|
Longo L, Lee J, Tenorio C, Blaber M. Alternative Folding Nuclei Definitions Facilitate the Evolution of a Symmetric Protein Fold from a Smaller Peptide Motif. Structure 2013; 21:2042-50. [DOI: 10.1016/j.str.2013.09.003] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2013] [Revised: 09/09/2013] [Accepted: 09/11/2013] [Indexed: 11/25/2022]
|
29
|
Establishing catalytic activity on an artificial (βα)8-barrel protein designed from identical half-barrels. FEBS Lett 2013; 587:2798-805. [PMID: 23806364 DOI: 10.1016/j.febslet.2013.06.022] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2013] [Revised: 05/27/2013] [Accepted: 06/16/2013] [Indexed: 01/28/2023]
Abstract
It has been postulated that the ubiquitous (βα)8-barrel enzyme fold has evolved by duplication and fusion of an ancestral (βα)4-half-barrel. We have previously reconstructed this process in the laboratory by fusing two copies of the C-terminal half-barrel HisF-C of imidazole glycerol phosphate synthase (HisF). The resulting construct HisF-CC was stepwise stabilized to Sym1 and Sym2, which are extremely robust but catalytically inert proteins. Here, we report on the generation of a circular permutant of Sym2 and the establishment of a sugar isomerization reaction on its scaffold. Our results demonstrate that duplication and mutagenesis of (βα)4-half-barrels can readily lead to a stable and catalytically active (βα)8-barrel enzyme.
Collapse
|
30
|
Yanagawa H. Exploration of the Origin and Evolution of Globular Proteins by mRNA Display. Biochemistry 2013; 52:3841-51. [DOI: 10.1021/bi301704x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Affiliation(s)
- Hiroshi Yanagawa
- Department of Biosciences and Informatics,
Faculty
of Sciences and Technology, Keio University, 3-14-1, Hiyoshi, Kohoku-ku, Yokohama 223-8522, Japan
| |
Collapse
|
31
|
Burak E, Yogev O, Sheffer S, Schueler-Furman O, Pines O. Evolving dual targeting of a prokaryotic protein in yeast. Mol Biol Evol 2013; 30:1563-73. [PMID: 23462316 DOI: 10.1093/molbev/mst039] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Dual targeting is an important and abundant phenomenon. Indeed, we estimate that more than a third of the yeast mitochondrial proteome is dual localized. The enzyme fumarase is a highly conserved protein in all organisms with respect to its sequence, structure, and enzymatic activity. In eukaryotes, it is dual localized to the cytosol and mitochondria. In Saccharomyces cerevisiae, the dual localization of fumarase is achieved by the reverse translocation mechanism; all fumarase molecules harbor a mitochondrial targeting sequence (MTS), are targeted to mitochondria, begin their translocation, and are processed by mitochondrial processing peptidase in the matrix. A subset of these processed fumarase molecules in transit is then fully imported into the matrix, whereas the majority moves back into the cytosol by reverse translocation. The proposed driving force for fumarase distribution is protein folding during import. Here, we asked how reverse translocation could have evolved on a prokaryotic protein that had already acquired expression from the nuclear genome and a targeting sequence. To address this question, we used, as a model, the Escherichia coli FumC Class II fumarase, which is homologous to eukaryotic fumarases (∼58% identity and ∼74% similarity to the yeast Fum1). Starting with an exclusively mitochondrial targeted FumC (attached to a strong MTS), we show that two randomly acquired mutations within the prokaryotic FumC sequence are sufficient to cause substantial dual targeting by reverse translocation. In fact, the unmutated MTS-FumC also has some ability to be dual targeted but only at low temperatures. Our results suggest that in this case, evolution of dual targeting by reverse translocation is based on naturally occurring and fortuitously conserved features of fumarase folding.
Collapse
Affiliation(s)
- Efrat Burak
- Department of Microbiology Molecular Genetics, IMRIC, Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | | | | | | | | |
Collapse
|
32
|
Mazor Y, Nataf D, Toporik H, Nelson N. Crystal structures of virus-like photosystem I complexes from the mesophilic cyanobacterium Synechocystis PCC 6803. eLife 2013; 3:e01496. [PMID: 24473073 PMCID: PMC3903132 DOI: 10.7554/elife.01496] [Citation(s) in RCA: 59] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
Oxygenic photosynthesis supports virtually all life forms on earth. Light energy is converted by two photosystems—photosystem I (PSI) and photosystem II (PSII). Globally, nearly 50% of photosynthesis takes place in the Ocean, where single cell cyanobacteria and algae reside together with their viruses. An operon encoding PSI was identified in cyanobacterial marine viruses. We generated a PSI that mimics the salient features of the viral complex, named PSIPsaJF. PSIPsaJF is promiscuous for its electron donors and can accept electrons from respiratory cytochromes. We solved the structure of PSIPsaJF and a monomeric PSI, with subunit composition similar to the viral PSI, providing for the first time a detailed description of the reaction center and antenna system from mesophilic cyanobacteria, including red chlorophylls and cofactors of the electron transport chain. Our finding extends the understanding of PSI structure, function and evolution and suggests a unique function for the viral PSI. DOI:http://dx.doi.org/10.7554/eLife.01496.001 Photosynthesis—the process by which plants and other organisms harness the energy in sunlight—is the source of almost all oxygen, food and fuel on earth. Oxygenic photosynthesis in living cells involves a series of reactions catalyzed by large protein complexes, various other soluble chemicals, and the transfer of electrons from so-called donors to acceptors. The energy in the sunlight is captured by two membrane-embedded protein complexes—photosystem I, which is the most powerful electron donor in nature, and photosystem II—and converted into chemical energy. Almost half of the world’s photosynthesis occurs in the oceans, and is performed by single-celled cyanobacteria and algae. Interestingly, some of the genes that encode photosynthetic enzymes in cyanobacteria are also found in the genomes of viruses that infect these bacteria. It is thought that these viruses can alter photosynthetic pathways in their hosts, but the interactions between these viruses and their hosts are not fully understood. Now, Mazor et al. have created a photosystem I complex that mimics the viral version of this complex, and have gone on to solve its three-dimensional structure. This mimetic virus-encoded complex was shown to be a ‘promiscuous’ electron acceptor: this means that, unlike most electron acceptors, it can accept electrons from more than one electron donor. Further, Mazor et al. solved the structure of photosystem I from Synechocystis, a cyanobacterium that lives in fresh water; and found some surprising differences between it and the only other published structure for photosystem I from a cyanobacterium (which was from a species that lives in hot water springs). These included differences in components involved in the electron transfer chain—a series of chemical reactions in which electrons are passed from donor to acceptor molecules—that were thought to be highly conserved. Other differences in the structures allowed Mazor et al. to identify the location of a unique chlorophyll pigment group in the Synechocystis photosystem I. Since Synechocystis is commonly used as a model to study photosynthesis, an improved understanding of its photosystem I should lead to further improvements in our knowledge of photosynthesis. DOI:http://dx.doi.org/10.7554/eLife.01496.002
Collapse
Affiliation(s)
- Yuval Mazor
- Department of Biochemistry and Molecular Biology, The George S Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | | | | | | |
Collapse
|
33
|
Blaber M, Lee J, Longo L. Emergence of symmetric protein architecture from a simple peptide motif: evolutionary models. Cell Mol Life Sci 2012; 69:3999-4006. [PMID: 22790181 PMCID: PMC11115074 DOI: 10.1007/s00018-012-1077-3] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2012] [Revised: 06/22/2012] [Accepted: 06/26/2012] [Indexed: 10/28/2022]
Abstract
Structural symmetry is observed in the majority of fundamental protein folds and gene duplication and fusion evolutionary processes are postulated to be responsible. However, convergent evolution leading to structural symmetry has also been proposed; additionally, there is debate regarding the extent to which exact primary structure symmetry is compatible with efficient protein folding. Issues of symmetry in protein evolution directly impact strategies for de novo protein design as symmetry can substantially simplify the design process. Additionally, when considering gene duplication and fusion in protein evolution, there are two competing models: "emergent architecture" and "conserved architecture". Recent experimental work has shed light on both the evolutionary process leading to symmetric protein folds as well as the ability of symmetric primary structure to efficiently fold. Such studies largely support a "conserved architecture" evolutionary model, suggesting that complex protein architecture was an early evolutionary achievement involving oligomerization of smaller polypeptides.
Collapse
Affiliation(s)
- Michael Blaber
- Department of Biomedical Sciences, College of Medicine, Florida State University, 1115 West Call St., Tallahassee, FL, 32306-4300, USA,
| | | | | |
Collapse
|
34
|
Wang Z, Zarlenga D, Martin J, Abubucker S, Mitreva M. Exploring metazoan evolution through dynamic and holistic changes in protein families and domains. BMC Evol Biol 2012; 12:138. [PMID: 22862991 PMCID: PMC3483195 DOI: 10.1186/1471-2148-12-138] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2012] [Accepted: 07/19/2012] [Indexed: 11/18/2022] Open
Abstract
Background Proteins convey the majority of biochemical and cellular activities in organisms. Over the course of evolution, proteins undergo normal sequence mutations as well as large scale mutations involving domain duplication and/or domain shuffling. These events result in the generation of new proteins and protein families. Processes that affect proteome evolution drive species diversity and adaptation. Herein, change over the course of metazoan evolution, as defined by birth/death and duplication/deletion events within protein families and domains, was examined using the proteomes of 9 metazoan and two outgroup species. Results In studying members of the three major metazoan groups, the vertebrates, arthropods, and nematodes, we found that the number of protein families increased at the majority of lineages over the course of metazoan evolution where the magnitude of these increases was greatest at the lineages leading to mammals. In contrast, the number of protein domains decreased at most lineages and at all terminal lineages. This resulted in a weak correlation between protein family birth and domain birth; however, the correlation between domain birth and domain member duplication was quite strong. These data suggest that domain birth and protein family birth occur via different mechanisms, and that domain shuffling plays a role in the formation of protein families. The ratio of protein family birth to protein domain birth (domain shuffling index) suggests that shuffling had a more demonstrable effect on protein families in nematodes and arthropods than in vertebrates. Through the contrast of high and low domain shuffling indices at the lineages of Trichinella spiralis and Gallus gallus, we propose a link between protein redundancy and evolutionary changes controlled by domain shuffling; however, the speed of adaptation among the different lineages was relatively invariant. Evaluating the functions of protein families that appeared or disappeared at the last common ancestors (LCAs) of the three metazoan clades supports a correlation with organism adaptation. Furthermore, bursts of new protein families and domains in the LCAs of metazoans and vertebrates are consistent with whole genome duplications. Conclusion Metazoan speciation and adaptation were explored by birth/death and duplication/deletion events among protein families and domains. Our results provide insights into protein evolution and its bearing on metazoan evolution.
Collapse
Affiliation(s)
- Zhengyuan Wang
- The Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA
| | | | | | | | | |
Collapse
|
35
|
Wagner GP. Next Gen Devo-Evo. JOURNAL OF EXPERIMENTAL ZOOLOGY PART B-MOLECULAR AND DEVELOPMENTAL EVOLUTION 2012; 318:519-20. [PMID: 22791647 DOI: 10.1002/jez.b.22463] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/08/2012] [Accepted: 06/12/2012] [Indexed: 11/10/2022]
Affiliation(s)
- Gunter P Wagner
- Systems Biology Institute, Department of Ecology and Evolutionary Biology, Yale University, West Haven, Connecticut 06516, USA
| |
Collapse
|
36
|
Abstract
Signaling networks process vast amounts of environmental information to generate specific cellular responses. As cellular environments change, signaling networks adapt accordingly. Here, I will discuss how the integration of synthetic biology and directed evolution approaches is shedding light on the molecular mechanisms that guide the evolution of signaling networks. In particular, I will review studies that demonstrate how different types of mutations, from the replacement of individual amino acids to the shuffling of modular domains, lead to markedly different evolutionary trajectories and consequently to diverse network rewiring. Moreover, I will argue that intrinsic evolutionary properties of signaling proteins, such as the robustness of wild type functions, the promiscuous nature of evolutionary intermediates, and the modular decoupling between binding and catalysis, play important roles in the evolution of signaling networks. Finally, I will argue that rapid advances in our ability to synthesize DNA will radically alter how we study signaling network evolution at the genome-wide level.
Collapse
Affiliation(s)
- Sergio G. Peisajovich
- Department
of Cell and Systems Biology, University of Toronto, Toronto, M5S 3G5 Canada
| |
Collapse
|
37
|
Kipnis Y, Dellus-Gur E, Tawfik DS. TRINS: a method for gene modification by randomized tandem repeat insertions. Protein Eng Des Sel 2012; 25:437-44. [DOI: 10.1093/protein/gzs023] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
|
38
|
Guo B, Zou M, Wagner A. Pervasive indels and their evolutionary dynamics after the fish-specific genome duplication. Mol Biol Evol 2012; 29:3005-22. [PMID: 22490820 DOI: 10.1093/molbev/mss108] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
Insertions and deletions (indels) in protein-coding genes are important sources of genetic variation. Their role in creating new proteins may be especially important after gene duplication. However, little is known about how indels affect the divergence of duplicate genes. We here study thousands of duplicate genes in five fish (teleost) species with completely sequenced genomes. The ancestor of these species has been subject to a fish-specific genome duplication (FSGD) event that occurred approximately 350 Ma. We find that duplicate genes contain at least 25% more indels than single-copy genes. These indels accumulated preferentially in the first 40 my after the FSGD. A lack of widespread asymmetric indel accumulation indicates that both members of a duplicate gene pair typically experience relaxed selection. Strikingly, we observe a 30-80% excess of deletions over insertions that is consistent for indels of various lengths and across the five genomes. We also find that indels preferentially accumulate inside loop regions of protein secondary structure and in regions where amino acids are exposed to solvent. We show that duplicate genes with high indel density also show high DNA sequence divergence. Indel density, but not amino acid divergence, can explain a large proportion of the tertiary structure divergence between proteins encoded by duplicate genes. Our observations are consistent across all five fish species. Taken together, they suggest a general pattern of duplicate gene evolution in which indels are important driving forces of evolutionary change.
Collapse
Affiliation(s)
- Baocheng Guo
- Institute of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland
| | | | | |
Collapse
|
39
|
Gupta R, Capalash N, Sharma P. Restriction endonucleases: natural and directed evolution. Appl Microbiol Biotechnol 2012; 94:583-99. [PMID: 22398859 DOI: 10.1007/s00253-012-3961-z] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2011] [Revised: 02/08/2012] [Accepted: 02/09/2012] [Indexed: 10/28/2022]
Abstract
Type II restriction endonucleases (REs) are highly sequence-specific compared with other classes of nucleases. PD-(D/E)XK nucleases, initially represented by only type II REs, now comprise a large and extremely diverse superfamily of proteins and, although sharing a structurally conserved core, typically display little or no detectable sequence similarity except for the active site motifs. Sequence similarity can only be observed in methylases and few isoschizomers. As a consequence, REs are classified according to combinations of functional properties rather than on the basis of genetic relatedness. New alignment matrices and classification systems based on structural core connectivity and cleavage mechanisms have been developed to characterize new REs and related proteins. REs recognizing more than 300 distinct specificities have been identified in RE database (REBASE: http://rebase.neb.com/cgi-bin/statlist ) but still the need for newer specificities is increasing due to the advancement in molecular biology and applications. The enzymes have undergone constant evolution through structural changes in protein scaffolds which include random mutations, homologous recombinations, insertions, and deletions of coding DNA sequences but rational mutagenesis or directed evolution delivers protein variants with new functions in accordance with defined biochemical or environmental pressures. Redesigning through random mutation, addition or deletion of amino acids, methylation-based selection, synthetic molecules, combining recognition and cleavage domains from different enzymes, or combination with domains of additional functions change the cleavage specificity or substrate preference and stability. There is a growing number of patents awarded for the creation of engineered REs with new and enhanced properties.
Collapse
Affiliation(s)
- Richa Gupta
- Department of Biotechnology, Panjab University, Chandigarh, India 160014
| | | | | |
Collapse
|
40
|
Deciphering the preference and predicting the viability of circular permutations in proteins. PLoS One 2012; 7:e31791. [PMID: 22359629 PMCID: PMC3281007 DOI: 10.1371/journal.pone.0031791] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2011] [Accepted: 01/19/2012] [Indexed: 01/21/2023] Open
Abstract
Circular permutation (CP) refers to situations in which the termini of a protein are relocated to other positions in the structure. CP occurs naturally and has been artificially created to study protein function, stability and folding. Recently CP is increasingly applied to engineer enzyme structure and function, and to create bifunctional fusion proteins unachievable by tandem fusion. CP is a complicated and expensive technique. An intrinsic difficulty in its application lies in the fact that not every position in a protein is amenable for creating a viable permutant. To examine the preferences of CP and develop CP viability prediction methods, we carried out comprehensive analyses of the sequence, structural, and dynamical properties of known CP sites using a variety of statistics and simulation methods, such as the bootstrap aggregating, permutation test and molecular dynamics simulations. CP particularly favors Gly, Pro, Asp and Asn. Positions preferred by CP lie within coils, loops, turns, and at residues that are exposed to solvent, weakly hydrogen-bonded, environmentally unpacked, or flexible. Disfavored positions include Cys, bulky hydrophobic residues, and residues located within helices or near the protein's core. These results fostered the development of an effective viable CP site prediction system, which combined four machine learning methods, e.g., artificial neural networks, the support vector machine, a random forest, and a hierarchical feature integration procedure developed in this work. As assessed by using the hydrofolate reductase dataset as the independent evaluation dataset, this prediction system achieved an AUC of 0.9. Large-scale predictions have been performed for nine thousand representative protein structures; several new potential applications of CP were thus identified. Many unreported preferences of CP are revealed in this study. The developed system is the best CP viability prediction method currently available. This work will facilitate the application of CP in research and biotechnology.
Collapse
|
41
|
Sharma PK, Kumar R, Kumar R, Mohammad O, Singh R, Kaur J. Engineering of a metagenome derived lipase toward thermal tolerance: effect of asparagine to lysine mutation on the protein surface. Gene 2011; 491:264-71. [PMID: 22001407 DOI: 10.1016/j.gene.2011.09.028] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2011] [Revised: 09/15/2011] [Accepted: 09/24/2011] [Indexed: 11/16/2022]
Abstract
A highly thermostable mutant lipase was generated and characterized. Mutant enzyme demonstrated 144 fold enhanced thermostability over the wild type enzyme at 60°C. Interestingly, the overall catalytic efficiency (k(cat/)K(m)) of mutant was also enhanced (~20 folds). Circular dichroism spectroscopy, studied as function of temperature, demonstrated that the mutant lipase retained its secondary structure up to 70-80°C, whereas wild type protein structure was completely distorted above 35°C. Additionally, the intrinsic tryptophan fluorescence (a probe for the tertiary structure) also displayed difference in the conformation of two enzymes during temperature dependent unfolding. Furthermore, mutation N355K resulted in extensive H-bonding (Lys355 HZ1OE2 Glu284) with a distance 2.44 Å. In contrast to this, Wt enzyme has not shown such H-bonding interaction.
Collapse
|
42
|
Hollup SM, Sadowski MI, Jonassen I, Taylor WR. Exploring the limits of fold discrimination by structural alignment: a large scale benchmark using decoys of known fold. Comput Biol Chem 2011; 35:174-88. [PMID: 21704264 PMCID: PMC3145973 DOI: 10.1016/j.compbiolchem.2011.04.008] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2011] [Accepted: 04/23/2011] [Indexed: 11/10/2022]
Abstract
Protein structure comparison by pairwise alignment is commonly used to identify highly similar substructures in pairs of proteins and provide a measure of structural similarity based on the size and geometric similarity of the match. These scores are routinely applied in analyses of protein fold space under the assumption that high statistical significance is equivalent to a meaningful relationship, however the truth of this assumption has previously been difficult to test since there is a lack of automated methods which do not rely on the same underlying principles. As a resolution to this we present a method based on the use of topological descriptions of global protein structure, providing an independent means to assess the ability of structural alignment to maintain meaningful structural correspondances on a large scale. Using a large set of decoys of specified global fold we benchmark three widely used methods for structure comparison, SAP, TM-align and DALI, and test the degree to which this assumption is justified for these methods. Application of a topological edit distance measure to provide a scale of the degree of fold change shows that while there is a broad correlation between high structural alignment scores and low edit distances there remain many pairs of highly significant score which differ by core strand swaps and therefore are structurally different on a global level. Possible causes of this problem and its meaning for present assessments of protein fold space are discussed.
Collapse
|
43
|
Yu Y, Lutz S. Circular permutation: a different way to engineer enzyme structure and function. Trends Biotechnol 2011; 29:18-25. [DOI: 10.1016/j.tibtech.2010.10.004] [Citation(s) in RCA: 116] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2010] [Revised: 10/11/2010] [Accepted: 10/18/2010] [Indexed: 12/15/2022]
|
44
|
Crystal structure of a 117 kDa glucansucrase fragment provides insight into evolution and product specificity of GH70 enzymes. Proc Natl Acad Sci U S A 2010; 107:21406-11. [PMID: 21118988 DOI: 10.1073/pnas.1007531107] [Citation(s) in RCA: 127] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Glucansucrases are large enzymes belonging to glycoside hydrolase family 70, which catalyze the cleavage of sucrose into fructose and glucose, with the concomitant transfer of the glucose residue to a growing α-glucan polymer. Among others, plaque-forming oral bacteria secrete these enzymes to produce α-glucans, which facilitate the adhesion of the bacteria to the tooth enamel. We determined the crystal structure of a fully active, 1,031-residue fragment encompassing the catalytic and C-terminal domains of GTF180 from Lactobacillus reuteri 180, both in the native state, and in complexes with sucrose and maltose. These structures show that the enzyme has an α-amylase-like (β/α)(8)-barrel catalytic domain that is circularly permuted compared to the catalytic domains of members of glycoside hydrolase families 13 and 77, which belong to the same GH-H superfamily. In contrast to previous suggestions, the enzyme has only one active site and one nucleophilic residue. Surprisingly, in GTF180 the peptide chain follows a "U"-path, such that four of the five domains are made up from discontiguous N- and C-terminal stretches of the peptide chain. Finally, the structures give insight into the factors that determine the different linkage types in the polymeric product.
Collapse
|
45
|
Eisenbeis S, Höcker B. Evolutionary mechanism as a template for protein engineering. J Pept Sci 2010; 16:538-44. [DOI: 10.1002/psc.1233] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
46
|
Yadid I, Tawfik DS. Functional β-propeller lectins by tandem duplications of repetitive units. Protein Eng Des Sel 2010; 24:185-95. [DOI: 10.1093/protein/gzq053] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
|
47
|
How do new proteins arise? Curr Opin Struct Biol 2010; 20:390-6. [PMID: 20347587 DOI: 10.1016/j.sbi.2010.02.005] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2010] [Revised: 02/24/2010] [Accepted: 02/25/2010] [Indexed: 11/23/2022]
|
48
|
Metamorphic proteins mediate evolutionary transitions of structure. Proc Natl Acad Sci U S A 2010; 107:7287-92. [PMID: 20368465 DOI: 10.1073/pnas.0912616107] [Citation(s) in RCA: 81] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
The primary sequence of proteins usually dictates a single tertiary and quaternary structure. However, certain proteins undergo reversible backbone rearrangements. Such metamorphic proteins provide a means of facilitating the evolution of new folds and architectures. However, because natural folds emerged at the early stages of evolution, the potential role of metamorphic intermediates in mediating evolutionary transitions of structure remains largely unexplored. We evolved a set of new proteins based on approximately 100 amino acid fragments derived from tachylectin-2--a monomeric, 236 amino acids, five-bladed beta-propeller. Their structures reveal a unique pentameric assembly and novel beta-propeller structures. Although identical in sequence, the oligomeric subunits adopt two, or even three, different structures that together enable the pentameric assembly of two propellers connected via a small linker. Most of the subunits adopt a wild-type-like structure within individual five-bladed propellers. However, the bridging subunits exhibit domain swaps and asymmetric strand exchanges that allow them to complete the two propellers and connect them. Thus, the modular and metamorphic nature of these subunits enabled dramatic changes in tertiary and quaternary structure, while maintaining the lectin function. These oligomers therefore comprise putative intermediates via which beta-propellers can evolve from smaller elements. Our data also suggest that the ability of one sequence to equilibrate between different structures can be evolutionary optimized, thus facilitating the emergence of new structures.
Collapse
|
49
|
Tsuji T, Onimaru M, Doi N, Miyamoto-Sato E, Takashima H, Yanagawa H. In vitro selection of GTP-binding proteins by block shuffling of estrogen-receptor fragments. Biochem Biophys Res Commun 2009; 390:689-93. [DOI: 10.1016/j.bbrc.2009.10.029] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2009] [Accepted: 10/07/2009] [Indexed: 11/26/2022]
|
50
|
Hegyi H, Buday L, Tompa P. Intrinsic structural disorder confers cellular viability on oncogenic fusion proteins. PLoS Comput Biol 2009; 5:e1000552. [PMID: 19888473 PMCID: PMC2768585 DOI: 10.1371/journal.pcbi.1000552] [Citation(s) in RCA: 67] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2009] [Accepted: 09/30/2009] [Indexed: 12/22/2022] Open
Abstract
Chromosomal translocations, which often generate chimeric proteins by fusing segments of two distinct genes, represent the single major genetic aberration leading to cancer. We suggest that the unifying theme of these events is a high level of intrinsic structural disorder, enabling fusion proteins to evade cellular surveillance mechanisms that eliminate misfolded proteins. Predictions in 406 translocation-related human proteins show that they are significantly enriched in disorder (43.3% vs. 20.7% in all human proteins), they have fewer Pfam domains, and their translocation breakpoints tend to avoid domain splitting. The vicinity of the breakpoint is significantly more disordered than the rest of these already highly disordered fusion proteins. In the unlikely event of domain splitting in fusion it usually spares much of the domain or splits at locations where the newly exposed hydrophobic surface area approximates that of an intact domain. The mechanisms of action of fusion proteins suggest that in most cases their structural disorder is also essential to the acquired oncogenic function, enabling the long-range structural communication of remote binding and/or catalytic elements. In this respect, there are three major mechanisms that contribute to generating an oncogenic signal: (i) a phosphorylation site and a tyrosine-kinase domain are fused, and structural disorder of the intervening region enables intramolecular phosphorylation (e.g., BCR-ABL); (ii) a dimerisation domain fuses with a tyrosine kinase domain and disorder enables the two subunits within the homodimer to engage in permanent intermolecular phosphorylations (e.g., TFG-ALK); (iii) the fusion of a DNA-binding element to a transactivator domain results in an aberrant transcription factor that causes severe misregulation of transcription (e.g. EWS-ATF). Our findings also suggest novel strategies of intervention against the ensuing neoplastic transformations. Chromosomal translocations generate chimeric proteins by fusing segments of two distinct genes and are frequently associated with cancer. The proteins involved are large and fairly heterogeneous in sequence and typically have only a few dispersed structural domains connected by long uncharacterized regions. It has never been studied from a structural perspective how these chimeras survive losing significant portions of the original proteins and acquire new oncogenic functions. By analyzing a collection of 406 human translocation proteins we show here that the answer to both questions lies to a large extent in the high level of structural disorder in the fusion partner proteins (on average, they are twice as disordered as all human proteins). The translocation breakpoints usually avoid globular domains. In rare cases when a globular domain is truncated by the fusion, it happens at a location in the domain where the hydrophobicity exposed by the split is favorable (i.e., not too high). Disorder on average is significantly higher in the vicinity of the breakpoint than in the rest of the fusion proteins. Disorder also plays a pivotal role in the acquired oncogenic function by bringing distant/disparate fusion segments together that enables novel intra- and/or intermolecular interactions.
Collapse
Affiliation(s)
- Hedi Hegyi
- Institute of Enzymology, Biological Research Center, Hungarian Academy of Sciences, Budapest, Hungary
| | - László Buday
- Institute of Enzymology, Biological Research Center, Hungarian Academy of Sciences, Budapest, Hungary
- Department of Medical Chemistry, Semmelweis University Medical School, Budapest, Hungary
| | - Peter Tompa
- Institute of Enzymology, Biological Research Center, Hungarian Academy of Sciences, Budapest, Hungary
- * E-mail:
| |
Collapse
|