1
|
Iqbal Z, Sadaf S. Forty Years of Directed Evolution and its Continuously Evolving Technology Toolbox - A Review of the Patent Landscape. Biotechnol Bioeng 2021; 119:693-724. [PMID: 34923625 DOI: 10.1002/bit.28009] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2021] [Revised: 11/28/2021] [Accepted: 11/29/2021] [Indexed: 11/10/2022]
Abstract
Generating functional protein variants with novel or improved characteristics has been a goal of the biotechnology industry and life sciences, for decades. Rational design and directed evolution are two major pathways to achieve the desired ends. Whilst rational protein design approach has made substantial progress, the idea of using a method based on cycles of mutagenesis and natural selection to develop novel binding proteins, enzymes and structures has attracted great attention. Laboratory evolution of proteins/enzymes requires new tools and analytical approaches to create genetic diversity and identifying variants with desired traits. In this pursuit, construction of sufficiently large libraries of target molecules to search for improved variants and the need for new protocols to alter the properties of target molecules has been a continuing challenge in the directed evolution experiments. This review will discuss the in vivo and in vitro gene diversification tools, library screening or selection approaches, and artificial intelligence/machine-learning-based strategies to mutagenesis developed in the last forty years to accelerate the natural process of evolution in creating new functional protein variants, optimization of microbial strains and transformation of enzymes into industrial machines. Analyzing patent position over these techniques and mechanisms also constitutes an integral and distinctive part of this review. The aim is to provide an up-to-date resource/technology toolbox for research-based and pharmaceutical companies to discover the boundaries of competitor's intellectual property (IP) portfolio, their freedom-to-operate in the relevant IP landscape, and the need for patent due diligence analysis to rule out whether use of a particular patented mutagenesis method, library screening/selection technique falls outside the safe harbor of experimental use exemption. While so doing, we have referred to some recent cases that emphasize the significance of selecting a suitable gene diversification strategy in directed evolution experiments. This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Zarina Iqbal
- PakPat World Intellectual Property Protection Services, Lahore, 54000, Pakistan
| | - Saima Sadaf
- School of Biochemistry and Biotechnology, University of the Punjab, Lahore, 54590, Pakistan
| |
Collapse
|
2
|
Grisewood MJ, Hernández-Lozada NJ, Thoden JB, Gifford NP, Mendez-Perez D, Schoenberger HA, Allan MF, Floy ME, Lai RY, Holden HM, Pfleger BF, Maranas CD. Computational Redesign of Acyl-ACP Thioesterase with Improved Selectivity toward Medium-Chain-Length Fatty Acids. ACS Catal 2017; 7:3837-3849. [PMID: 29375928 DOI: 10.1021/acscatal.7b00408] [Citation(s) in RCA: 63] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Enzyme and metabolic engineering offer the potential to develop biocatalysts for converting natural resources into a wide range of chemicals. To broaden the scope of potential products beyond natural metabolites, methods of engineering enzymes to accept alternative substrates and/or perform novel chemistries must be developed. DNA synthesis can create large libraries of enzyme-coding sequences, but most biochemistries lack a simple assay to screen for promising enzyme variants. Our solution to this challenge is structure-guided mutagenesis in which optimization algorithms select the best sequences from libraries based on specified criteria (i.e. binding selectivity). Here, we demonstrate this approach by identifying medium-chain (C6-C12) acyl-ACP thioesterases through structure-guided mutagenesis. Medium-chain fatty acids, products of thioesterase-catalyzed hydrolysis, are limited in natural abundance compared to long-chain fatty acids; the limited supply leads to high costs of C6-C10 oleochemicals such as fatty alcohols, amines, and esters. Here, we applied computational tools to tune substrate binding to the highly-active 'TesA thioesterase in Escherichia coli. We used the IPRO algorithm to design thioesterase variants with enhanced C12- or C8-specificity while maintaining high activity. After four rounds of structure-guided mutagenesis, we identified three thioesterases with enhanced production of dodecanoic acid (C12) and twenty-seven thioesterases with enhanced production of octanoic acid (C8). The top variants reached up to 49% C12 and 50% C8 while exceeding native levels of total free fatty acids. A comparably sized library created by random mutagenesis failed to identify promising mutants. The chain length-preference of 'TesA and the best mutant were confirmed in vitro using acyl-CoA substrates. Molecular dynamics simulations, confirmed by resolved crystal structures, of 'TesA variants suggest that hydrophobic forces govern 'TesA substrate specificity. We expect that the design rules we uncovered and the thioesterase variants identified will be useful to metabolic engineering projects aimed at sustainable production of medium-chain oleochemicals.
Collapse
Affiliation(s)
- Matthew J. Grisewood
- Department
of Chemical Engineering, Pennsylvania State University, 158 Fenske Laboratory, University Park, Pennsylvania 16802, United States
| | - Néstor J. Hernández-Lozada
- Department
of Chemical and Biological Engineering, University of Wisconsin−Madison, 1415 Engineering Drive, Madison, Wisconsin 53706, United States
| | - James B. Thoden
- Department
of Biochemistry, University of Wisconsin−Madison, 440 Henry Mall, Madison, Wisconsin 53706, United States
| | - Nathanael P. Gifford
- Department
of Chemical Engineering, Pennsylvania State University, 158 Fenske Laboratory, University Park, Pennsylvania 16802, United States
| | - Daniel Mendez-Perez
- Department
of Chemical and Biological Engineering, University of Wisconsin−Madison, 1415 Engineering Drive, Madison, Wisconsin 53706, United States
| | - Haley A. Schoenberger
- Department
of Chemical and Biological Engineering, University of Wisconsin−Madison, 1415 Engineering Drive, Madison, Wisconsin 53706, United States
| | - Matthew F. Allan
- Department
of Chemical Engineering, Pennsylvania State University, 158 Fenske Laboratory, University Park, Pennsylvania 16802, United States
| | - Martha E. Floy
- Department
of Chemical and Biological Engineering, University of Wisconsin−Madison, 1415 Engineering Drive, Madison, Wisconsin 53706, United States
| | - Rung-Yi Lai
- Department
of Chemical and Biological Engineering, University of Wisconsin−Madison, 1415 Engineering Drive, Madison, Wisconsin 53706, United States
| | - Hazel M. Holden
- Department
of Biochemistry, University of Wisconsin−Madison, 440 Henry Mall, Madison, Wisconsin 53706, United States
| | - Brian F. Pfleger
- Department
of Chemical and Biological Engineering, University of Wisconsin−Madison, 1415 Engineering Drive, Madison, Wisconsin 53706, United States
| | - Costas D. Maranas
- Department
of Chemical Engineering, Pennsylvania State University, 158 Fenske Laboratory, University Park, Pennsylvania 16802, United States
| |
Collapse
|
3
|
Abstract
Faced with a protein engineering challenge, a contemporary researcher can choose from myriad design strategies. Library-scale computational protein design (LCPD) is a hybrid method suitable for the engineering of improved protein variants with diverse sequences. This chapter discusses the background and merits of several practical LCPD techniques. First, LCPD methods suitable for delocalized protein design are presented in the context of example design calculations for cellobiohydrolase II. Second, localized design methods are discussed in the context of an example design calculation intended to shift the substrate specificity of a ketol-acid reductoisomerase Rossmann domain from NADPH to NADH.
Collapse
|
4
|
Pantazes RJ, Grisewood MJ, Li T, Gifford NP, Maranas CD. The Iterative Protein Redesign and Optimization (IPRO) suite of programs. J Comput Chem 2014; 36:251-63. [PMID: 25448866 DOI: 10.1002/jcc.23796] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2014] [Revised: 10/30/2014] [Accepted: 11/08/2014] [Indexed: 11/10/2022]
Abstract
Proteins are an important class of biomolecules with applications spanning across biotechnology and medicine. In many cases, native proteins must be redesigned to improve various performance metrics by changing their amino acid sequences. Algorithms can help sharpen protein library design by focusing the library on sequences that optimize computationally accessible proxies. The Iterative Protein Redesign and Optimization (IPRO) suite of programs offers an integrated environment for (1) altering protein binding affinity and specificity, (2) grafting a binding pocket into an existing protein scaffold, (3) predicting an antibody's tertiary structure based on its sequence, (4) enhancing enzymatic activity, and (5) assessing the structure and binding energetics for a specific mutant. This manuscript provides an overview of the methods involved in IPRO, input language terminology, algorithmic details, software implementation specifics and application highlights. IPRO can be downloaded at http://maranas.che.psu.edu.
Collapse
Affiliation(s)
- Robert J Pantazes
- Chemical Engineering Department, University of California, Santa Barbara, 3357 Engineering II, Santa Barbara, California, 93106
| | | | | | | | | |
Collapse
|
5
|
Grisewood MJ, Gifford NP, Pantazes RJ, Li Y, Cirino PC, Janik MJ, Maranas CD. OptZyme: computational enzyme redesign using transition state analogues. PLoS One 2013; 8:e75358. [PMID: 24116038 PMCID: PMC3792102 DOI: 10.1371/journal.pone.0075358] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2013] [Accepted: 08/11/2013] [Indexed: 11/18/2022] Open
Abstract
OptZyme is a new computational procedure for designing improved enzymatic activity (i.e., kcat or kcat/KM) with a novel substrate. The key concept is to use transition state analogue compounds, which are known for many reactions, as proxies for the typically unknown transition state structures. Mutations that minimize the interaction energy of the enzyme with its transition state analogue, rather than with its substrate, are identified that lower the transition state formation energy barrier. Using Escherichia coli β-glucuronidase as a benchmark system, we confirm that KM correlates (R(2) = 0.960) with the computed interaction energy between the enzyme and the para-nitrophenyl- β, D-glucuronide substrate, kcat/KM correlates (R(2) = 0.864) with the interaction energy of the transition state analogue, 1,5-glucarolactone, and kcat correlates (R(2) = 0.854) with a weighted combination of interaction energies with the substrate and transition state analogue. OptZyme is subsequently used to identify mutants with improved KM, kcat, and kcat/KM for a new substrate, para-nitrophenyl- β, D-galactoside. Differences between the three libraries reveal structural differences that underpin improving KM, kcat, or kcat/KM. Mutants predicted to enhance the activity for para-nitrophenyl- β, D-galactoside directly or indirectly create hydrogen bonds with the altered sugar ring conformation or its substituents, namely H162S, L361G, W549R, and N550S.
Collapse
Affiliation(s)
- Matthew J. Grisewood
- Department of Chemical Engineering, The Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Nathanael P. Gifford
- Department of Chemical Engineering, The Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Robert J. Pantazes
- Department of Chemical Engineering, The Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Ye Li
- Department of Chemical and Biomolecular Engineering, The University of Houston, Houston, Texas, United States of America
| | - Patrick C. Cirino
- Department of Chemical and Biomolecular Engineering, The University of Houston, Houston, Texas, United States of America
| | - Michael J. Janik
- Department of Chemical Engineering, The Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Costas D. Maranas
- Department of Chemical Engineering, The Pennsylvania State University, University Park, Pennsylvania, United States of America
- * E-mail:
| |
Collapse
|
6
|
Barrozo A, Borstnar R, Marloie G, Kamerlin SCL. Computational protein engineering: bridging the gap between rational design and laboratory evolution. Int J Mol Sci 2012. [PMID: 23202907 PMCID: PMC3497281 DOI: 10.3390/ijms131012428] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Enzymes are tremendously proficient catalysts, which can be used as extracellular catalysts for a whole host of processes, from chemical synthesis to the generation of novel biofuels. For them to be more amenable to the needs of biotechnology, however, it is often necessary to be able to manipulate their physico-chemical properties in an efficient and streamlined manner, and, ideally, to be able to train them to catalyze completely new reactions. Recent years have seen an explosion of interest in different approaches to achieve this, both in the laboratory, and in silico. There remains, however, a gap between current approaches to computational enzyme design, which have primarily focused on the early stages of the design process, and laboratory evolution, which is an extremely powerful tool for enzyme redesign, but will always be limited by the vastness of sequence space combined with the low frequency for desirable mutations. This review discusses different approaches towards computational enzyme design and demonstrates how combining newly developed screening approaches that can rapidly predict potential mutation “hotspots” with approaches that can quantitatively and reliably dissect the catalytic step can bridge the gap that currently exists between computational enzyme design and laboratory evolution studies.
Collapse
Affiliation(s)
- Alexandre Barrozo
- Department of Cell and Molecular Biology, Uppsala Biomedical Center (BMC), Uppsala University, Box 596, S-751 24 Uppsala, Sweden; E-Mails: (A.B.); (R.B.); (G.M.)
| | - Rok Borstnar
- Department of Cell and Molecular Biology, Uppsala Biomedical Center (BMC), Uppsala University, Box 596, S-751 24 Uppsala, Sweden; E-Mails: (A.B.); (R.B.); (G.M.)
- Laboratory for Biocomputing and Bioinformatics, National Institute of Chemistry, Hajdrihova 19, SI-1000 Ljubljana, Slovenia
| | - Gaël Marloie
- Department of Cell and Molecular Biology, Uppsala Biomedical Center (BMC), Uppsala University, Box 596, S-751 24 Uppsala, Sweden; E-Mails: (A.B.); (R.B.); (G.M.)
| | - Shina Caroline Lynn Kamerlin
- Department of Cell and Molecular Biology, Uppsala Biomedical Center (BMC), Uppsala University, Box 596, S-751 24 Uppsala, Sweden; E-Mails: (A.B.); (R.B.); (G.M.)
- Author to whom correspondence should be addressed; E-Mail: ; Tel.: +46-18-471-4423; Fax: +46-18-530-396
| |
Collapse
|
7
|
Feng X, Sanchis J, Reetz MT, Rabitz H. Enhancing the efficiency of directed evolution in focused enzyme libraries by the adaptive substituent reordering algorithm. Chemistry 2012; 18:5646-54. [PMID: 22434591 DOI: 10.1002/chem.201103811] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2011] [Indexed: 11/11/2022]
Abstract
Directed evolution is a broadly successful strategy for protein engineering in the quest to enhance the stereoselectivity, activity, and thermostability of enzymes. To increase the efficiency of directed evolution based on iterative saturation mutagenesis, the adaptive substituent reordering algorithm (ASRA) is introduced here as an alternative to traditional quantitative structure-activity relationship (QSAR) methods for identifying potential protein mutants with desired properties from minimal sampling of focused libraries. The operation of ASRA depends on identifying the underlying regularity of the protein property landscape, allowing it to make predictions without explicit knowledge of the structure-property relationships. In a proof-of-principle study, ASRA identified all or most of the best enantioselective mutants among the synthesized epoxide hydrolase from Aspergillus niger, in the absence of peptide seeds with high E-values. ASRA even revealed a laboratory error from irregularities of the reordered E-value landscape alone.
Collapse
Affiliation(s)
- Xiaojiang Feng
- Department of Chemistry, Princeton University, New Jersey 08544, USA
| | | | | | | |
Collapse
|
8
|
He L, Friedman AM, Bailey-Kellogg C. A divide-and-conquer approach to determine the Pareto frontier for optimization of protein engineering experiments. Proteins 2012; 80:790-806. [PMID: 22180081 PMCID: PMC4939273 DOI: 10.1002/prot.23237] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2011] [Revised: 10/06/2011] [Accepted: 10/21/2011] [Indexed: 01/07/2023]
Abstract
In developing improved protein variants by site-directed mutagenesis or recombination, there are often competing objectives that must be considered in designing an experiment (selecting mutations or breakpoints): stability versus novelty, affinity versus specificity, activity versus immunogenicity, and so forth. Pareto optimal experimental designs make the best trade-offs between competing objectives. Such designs are not "dominated"; that is, no other design is better than a Pareto optimal design for one objective without being worse for another objective. Our goal is to produce all the Pareto optimal designs (the Pareto frontier), to characterize the trade-offs and suggest designs most worth considering, but to avoid explicitly considering the large number of dominated designs. To do so, we develop a divide-and-conquer algorithm, Protein Engineering Pareto FRontier (PEPFR), that hierarchically subdivides the objective space, using appropriate dynamic programming or integer programming methods to optimize designs in different regions. This divide-and-conquer approach is efficient in that the number of divisions (and thus calls to the optimizer) is directly proportional to the number of Pareto optimal designs. We demonstrate PEPFR with three protein engineering case studies: site-directed recombination for stability and diversity via dynamic programming, site-directed mutagenesis of interacting proteins for affinity and specificity via integer programming, and site-directed mutagenesis of a therapeutic protein for activity and immunogenicity via integer programming. We show that PEPFR is able to effectively produce all the Pareto optimal designs, discovering many more designs than previous methods. The characterization of the Pareto frontier provides additional insights into the local stability of design choices as well as global trends leading to trade-offs between competing criteria.
Collapse
Affiliation(s)
- Lu He
- Department of Computer Science, Dartmouth College, Hanover NH 03755
| | - Alan M. Friedman
- Department of Biological Sciences, Markey Center for Structural Biology, Purdue Cancer Center, and Bindley Bioscience Center, Purdue University
| | | |
Collapse
|
9
|
Parker AS, Griswold KE, Bailey-Kellogg C. Optimization of combinatorial mutagenesis. J Comput Biol 2011; 18:1743-56. [PMID: 21923411 DOI: 10.1089/cmb.2011.0152] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Protein engineering by combinatorial site-directed mutagenesis evaluates a portion of the sequence space near a target protein, seeking variants with improved properties (e.g., stability, activity, immunogenicity). In order to improve the hit-rate of beneficial variants in such mutagenesis libraries, we develop methods to select optimal positions and corresponding sets of the mutations that will be used, in all combinations, in constructing a library for experimental evaluation. Our approach, OCoM (Optimization of Combinatorial Mutagenesis), encompasses both degenerate oligonucleotides and specified point mutations, and can be directed accordingly by requirements of experimental cost and library size. It evaluates the quality of the resulting library by one- and two-body sequence potentials, averaged over the variants. To ensure that it is not simply recapitulating extant sequences, it balances the quality of a library with an explicit evaluation of the novelty of its members. We show that, despite dealing with a combinatorial set of variants, in our approach the resulting library optimization problem is actually isomorphic to single-variant optimization. By the same token, this means that the two-body sequence potential results in an NP-hard optimization problem. We present an efficient dynamic programming algorithm for the one-body case and a practically-efficient integer programming approach for the general two-body case. We demonstrate the effectiveness of our approach in designing libraries for three different case study proteins targeted by previous combinatorial libraries--a green fluorescent protein, a cytochrome P450, and a beta lactamase. We found that OCoM worked quite efficiently in practice, requiring only 1 hour even for the massive design problem of selecting 18 mutations to generate 10⁷ variants of a 443-residue P450. We demonstrate the general ability of OCoM in enabling the protein engineer to explore and evaluate trade-offs between quality and novelty as well as library construction technique, and identify optimal libraries for experimental evaluation.
Collapse
Affiliation(s)
- Andrew S Parker
- Department of Computer Science, Dartmouth College, Hanover, New Hampshire, USA
| | | | | |
Collapse
|
10
|
Zheng W, Griswold KE, Bailey-Kellogg C. Protein fragment swapping: a method for asymmetric, selective site-directed recombination. J Comput Biol 2010; 17:459-75. [PMID: 20377457 DOI: 10.1089/cmb.2009.0189] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
This article presents a new approach to site-directed recombination, swapping combinations of selected discontiguous fragments from a source protein in place of corresponding fragments of a target protein. By being both asymmetric (differentiating source and target) and selective (swapping discontiguous fragments), our method focuses experimental effort on a more restricted portion of sequence space, constructing hybrids that are more likely to have the properties that are the objective of the experiment. Furthermore, since the source and target need to be structurally homologous only locally (rather than overall), our method supports swapping fragments from functionally important regions of a source into a target "scaffold" (for example, to humanize an exogenous therapeutic protein). A protein fragment swapping plan is defined by the residue position boundaries of the fragments to be swapped; it is assessed by an average potential score over the resulting hybrid library, with singleton and pairwise terms evaluating the importance and fit of the swapped residues. While we prove that it is NP-hard to choose an optimal set of fragments under such a potential score, we develop an integer programming approach, which we call Swagmer, that works very well in practice. We demonstrate the effectiveness of our method in three swapping problems: selective recombination between beta-lactamases, activity swapping between glutathione transferases, and activity swapping between carboxylases and mutases in the purE family. We show that the selective recombination approach generates better plan (in terms of resulting potential score) than traditional site-directed recombination approaches. We also show that in all cases the optimized experiments are significantly better than ones that would result from stochastic methods.
Collapse
Affiliation(s)
- Wei Zheng
- Department of Computer Science, Dartmouth College, Hanover, New Hampshire 03755, USA
| | | | | |
Collapse
|
11
|
Schmidt M, Böttcher D, Bornscheuer UT. Directed Evolution of Industrial Biocatalysts. Ind Biotechnol (New Rochelle N Y) 2010. [DOI: 10.1002/9783527630233.ch4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
|
12
|
Fromer M, Yanover C, Linial M. Design of multispecific protein sequences using probabilistic graphical modeling. Proteins 2010; 78:530-47. [PMID: 19842166 DOI: 10.1002/prot.22575] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
In nature, proteins partake in numerous protein- protein interactions that mediate their functions. Moreover, proteins have been shown to be physically stable in multiple structures, induced by cellular conditions, small ligands, or covalent modifications. Understanding how protein sequences achieve this structural promiscuity at the atomic level is a fundamental step in the drug design pipeline and a critical question in protein physics. One way to investigate this subject is to computationally predict protein sequences that are compatible with multiple states, i.e., multiple target structures or binding to distinct partners. The goal of engineering such proteins has been termed multispecific protein design. We develop a novel computational framework to efficiently and accurately perform multispecific protein design. This framework utilizes recent advances in probabilistic graphical modeling to predict sequences with low energies in multiple target states. Furthermore, it is also geared to specifically yield positional amino acid probability profiles compatible with these target states. Such profiles can be used as input to randomly bias high-throughput experimental sequence screening techniques, such as phage display, thus providing an alternative avenue for elucidating the multispecificity of natural proteins and the synthesis of novel proteins with specific functionalities. We prove the utility of such multispecific design techniques in better recovering amino acid sequence diversities similar to those resulting from millions of years of evolution. We then compare the approaches of prediction of low energy ensembles and of amino acid profiles and demonstrate their complementarity in providing more robust predictions for protein design.
Collapse
Affiliation(s)
- Menachem Fromer
- School of Computer Science and Engineering, The Hebrew University of Jerusalem, Israel.
| | | | | |
Collapse
|
13
|
Chakrabarti S, Panchenko AR. Structural and functional roles of coevolved sites in proteins. PLoS One 2010; 5:e8591. [PMID: 20066038 PMCID: PMC2797611 DOI: 10.1371/journal.pone.0008591] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2009] [Accepted: 10/19/2009] [Indexed: 01/09/2023] Open
Abstract
BACKGROUND Understanding the residue covariations between multiple positions in protein families is very crucial and can be helpful for designing protein engineering experiments. These simultaneous changes or residue coevolution allow protein to maintain its overall structural-functional integrity while enabling it to acquire specific functional modifications. Despite the significant efforts in the field there is still controversy in terms of the preferable locations of coevolved residues on different regions of protein molecules, the strength of coevolutionary signal and role of coevolution in functional diversification. METHODOLOGY In this paper we study the scale and nature of residue coevolution in maintaining the overall functionality and structural integrity of proteins. We employed a large scale study to investigate the structural and functional aspects of coevolved residues. We found that the networks representing the coevolutionary residue connections within our dataset are in general of 'small-world' type as they have clustering coefficient values higher than random networks and also show smaller mean shortest path lengths similar and/or lower than random and regular networks. We also found that altogether 11% of functionally important sites are coevolved with any other sites. Active sites are found more frequently to coevolve with any other sites (15%) compared to protein (11%) and ligand (9%) binding sites. Metal binding and active sites are also found to be more frequently coevolved with other metal binding and active sites, respectively. Analysis of the coupling between coevolutionary processes and the spatial distribution of coevolved sites reveals that a high fraction of coevolved sites are located close to each other. Moreover, approximately 80% of charge compensatory substitutions within coevolved sites are found at very close spatial proximity (<or= 5A), pointing to the possible preservation of salt bridges in evolution. CONCLUSION Our findings show that a noticeable fraction of functionally important sites undergo coevolution and also point towards compensatory substitutions as a probable coevolutionary mechanism within spatially proximal coevolved functional sites.
Collapse
Affiliation(s)
- Saikat Chakrabarti
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America
- * E-mail: (SC); (ARP)
| | - Anna R. Panchenko
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America
- * E-mail: (SC); (ARP)
| |
Collapse
|
14
|
Zheng W, Friedman AM, Bailey-Kellogg C. Algorithms for joint optimization of stability and diversity in planning combinatorial libraries of chimeric proteins. J Comput Biol 2009; 16:1151-68. [PMID: 19645597 DOI: 10.1089/cmb.2009.0090] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
In engineering protein variants by constructing and screening combinatorial libraries of chimeric proteins, two complementary and competing goals are desired: the new proteins must be similar enough to the evolutionarily-selected wild-type proteins to be stably folded, and they must be different enough to display functional variation. We present here the first method, Staversity, to simultaneously optimize stability and diversity in selecting sets of breakpoint locations for site-directed recombination. Our goal is to uncover all "undominated" breakpoint sets, for which no other breakpoint set is better in both factors. Our first algorithm finds the undominated sets serving as the vertices of the lower envelope of the two-dimensional (stability and diversity) convex hull containing all possible breakpoint sets. Our second algorithm identifies additional breakpoint sets in the concavities that are either undominated or dominated only by undiscovered breakpoint sets within a distance bound computed by the algorithm. Both algorithms are efficient, requiring only time polynomial in the numbers of residues and breakpoints, while characterizing a space defined by an exponential number of possible breakpoint sets. We applied Staversity to identify 2-10 breakpoint plans for different sets of parent proteins taken from the purE family, as well as for parent proteins TEM-1 and PSE-4 from the beta-lactamase family. The average normalized distance between our plans and the lower bound for optimal plans is around 2%. Our plans dominate most (60-90% on average for each parent set) of the plans found by other possible approaches, random sampling or explicit optimization for stability with implicit optimization for diversity. The identified breakpoint sets provide a compact representation of good plans, enabling a protein engineer to understand and account for the trade-offs between two key considerations in combinatorial chimeragenesis.
Collapse
Affiliation(s)
- Wei Zheng
- Department of Computer Science, Dartmouth College , Hanover, New Hampshire, USA
| | | | | |
Collapse
|
15
|
Bhattacherjee A, Biswas P. Combinatorial design of protein sequences with applications to lattice and real proteins. J Chem Phys 2009; 131:125101. [DOI: 10.1063/1.3236519] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
16
|
Bhattacherjee A, Biswas P. Statistical Theory of Protein Sequence Design by Random Mutation. J Phys Chem B 2009; 113:5520-7. [DOI: 10.1021/jp810515s] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
| | - Parbati Biswas
- Department of Chemistry, University of Delhi, Delhi-110007
| |
Collapse
|
17
|
Sciretti D, Bruscolini P, Pelizzola A, Pretti M, Jaramillo A. Computational protein design with side-chain conformational entropy. Proteins 2009; 74:176-91. [PMID: 18618711 DOI: 10.1002/prot.22145] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Recent advances in modeling protein structures at the atomic level have made it possible to tackle "de novo" computational protein design. Most procedures are based on combinatorial optimization using a scoring function that estimates the folding free energy of a protein sequence on a given main-chain structure. However, the computation of the conformational entropy in the folded state is generally an intractable problem, and its contribution to the free energy is not properly evaluated. In this article, we propose a new automated protein design methodology that incorporates such conformational entropy based on statistical mechanics principles. We define the free energy of a protein sequence by the corresponding partition function over rotamer states. The free energy is written in variational form in a pairwise approximation and minimized using the Belief Propagation algorithm. In this way, a free energy is associated to each amino acid sequence: we use this insight to rescore the results obtained with a standard minimization method, with the energy as the cost function. Then, we set up a design method that directly uses the free energy as a cost function in combination with a stochastic search in the sequence space. We validate the methods on the design of three superficial sites of a small SH3 domain, and then apply them to the complete redesign of 27 proteins. Our results indicate that accounting for entropic contribution in the score function affects the outcome in a highly nontrivial way, and might improve current computational design techniques based on protein stability.
Collapse
Affiliation(s)
- Daniele Sciretti
- Departamento de Física Teórica, Universidad de Zaragoza, c. Pedro Cerbuna 12, Zaragoza 50009, Spain
| | | | | | | | | |
Collapse
|
18
|
Abstract
MOTIVATION The task of engineering a protein to perform a target biological function is known as protein design. A commonly used paradigm casts this functional design problem as a structural one, assuming a fixed backbone. In probabilistic protein design, positional amino acid probabilities are used to create a random library of sequences to be simultaneously screened for biological activity. Clearly, certain choices of probability distributions will be more successful in yielding functional sequences. However, since the number of sequences is exponential in protein length, computational optimization of the distribution is difficult. RESULTS In this paper, we develop a computational framework for probabilistic protein design following the structural paradigm. We formulate the distribution of sequences for a structure using the Boltzmann distribution over their free energies. The corresponding probabilistic graphical model is constructed, and we apply belief propagation (BP) to calculate marginal amino acid probabilities. We test this method on a large structural dataset and demonstrate the superiority of BP over previous methods. Nevertheless, since the results obtained by BP are far from optimal, we thoroughly assess the paradigm using high-quality experimental data. We demonstrate that, for small scale sub-problems, BP attains identical results to those produced by exact inference on the paradigmatic model. However, quantitative analysis shows that the distributions predicted significantly differ from the experimental data. These findings, along with the excellent performance we observed using BP on the smaller problems, suggest potential shortcomings of the paradigm. We conclude with a discussion of how it may be improved in the future.
Collapse
Affiliation(s)
- Menachem Fromer
- School of Computer Science and Engineering, The Hebrew University of Jerusalem, Israel.
| | | |
Collapse
|
19
|
Wong TS, Roccatano D, Schwaneberg U. Steering directed protein evolution: strategies to manage combinatorial complexity of mutant libraries. Environ Microbiol 2007; 9:2645-59. [DOI: 10.1111/j.1462-2920.2007.01411.x] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
20
|
Biswas P, Zou J, Saven JG. Statistical theory for protein ensembles with designed energy landscapes. J Chem Phys 2007; 123:154908. [PMID: 16252973 DOI: 10.1063/1.2062047] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Combinatorial protein libraries provide a promising route to investigate the determinants and features of protein folding and to identify novel folding amino acid sequences. A library of sequences based on a pool of different monomer types are screened for folding molecules, consistent with a particular foldability criterion. The number of sequences grows exponentially with the length of the polymer, making both experimental and computational tabulations of sequences infeasible. Herein a statistical theory is extended to specify the properties of sequences having particular values of global energetic quantities that specify their energy landscape. The theory yields the site-specific monomer probabilities. A foldability criterion is derived that characterizes the properties of sequences by quantifying the energetic separation of the target state from low-energy states in the unfolded ensemble and the fluctuations of the energies in the unfolded state ensemble. For a simple lattice model of proteins, excellent agreement is observed between the theory and the results of exact enumeration. The theory may be used to provide a quantitative framework for the design and interpretation of combinatorial experiments.
Collapse
Affiliation(s)
- Parbati Biswas
- Department of Chemistry, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA.
| | | | | |
Collapse
|
21
|
Treynor TP, Vizcarra CL, Nedelcu D, Mayo SL. Computationally designed libraries of fluorescent proteins evaluated by preservation and diversity of function. Proc Natl Acad Sci U S A 2006; 104:48-53. [PMID: 17179210 PMCID: PMC1765474 DOI: 10.1073/pnas.0609647103] [Citation(s) in RCA: 70] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
To determine which of seven library design algorithms best introduces new protein function without destroying it altogether, seven combinatorial libraries of green fluorescent protein variants were designed and synthesized. Each was evaluated by distributions of emission intensity and color compiled from measurements made in vivo. Additional comparisons were made with a library constructed by error-prone PCR. Among the designed libraries, fluorescent function was preserved for the greatest fraction of samples in a library designed by using a structure-based computational method developed and described here. A trend was observed toward greater diversity of color in designed libraries that better preserved fluorescence. Contrary to trends observed among libraries constructed by error-prone PCR, preservation of function was observed to increase with a library's average mutation level among the four libraries designed with structure-based computational methods.
Collapse
Affiliation(s)
- Thomas P. Treynor
- Divisions of *Biology and Chemistry and
- Howard Hughes Medical Institute, California Institute of Technology, 1200 East California Boulevard, Pasadena, CA 91125
| | | | | | - Stephen L. Mayo
- Divisions of *Biology and Chemistry and
- Howard Hughes Medical Institute, California Institute of Technology, 1200 East California Boulevard, Pasadena, CA 91125
- To whom correspondence should be addressed. E-mail:
| |
Collapse
|
22
|
Meyer MM, Hochrein L, Arnold FH. Structure-guided SCHEMA recombination of distantly related β-lactamases. Protein Eng Des Sel 2006; 19:563-70. [PMID: 17090554 DOI: 10.1093/protein/gzl045] [Citation(s) in RCA: 71] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
We constructed a library of beta-lactamases by recombining three naturally occurring homologs (TEM-1, PSE-4, SED-1) that share 34-42% sequence identity. Most chimeras created by recombining such distantly related proteins are unfolded due to unfavorable side-chain interactions that destabilize the folded structure. To enhance the fraction of properly folded chimeras, we designed the library using SCHEMA, a structure-guided approach to choosing the least disruptive crossover locations. Recombination at seven selected crossover positions generated 6561 chimeric sequences that differ from their closest parent at an average of 66 positions. Of 553 unique characterized chimeras, 111 (20%) retained beta-lactamase activity; the library contains hundreds more novel beta-lactamases. The functional chimeras share as little as 70% sequence identity with any known sequence and are characterized by low SCHEMA disruption (E) compared to the average nonfunctional chimera. Furthermore, many nonfunctional chimeras with low E are readily rescued by low error-rate random mutagenesis or by the introduction of a known stabilizing mutation (TEM-1 M182T). These results show that structure-guided recombination effectively generates a family of diverse, folded proteins even when the parents exhibit only 34% sequence identity. Furthermore, the fraction of sequences that encode folded and functional proteins can be enhanced by utilizing previously stabilized parental sequences.
Collapse
Affiliation(s)
- Michelle M Meyer
- Biochemistry and Molecular Biophysics, California Institute of Technology Mail Code 210-21, California Institute of Technology Mail Code 210-41, Pasadena, CA 91125, USA
| | | | | |
Collapse
|
23
|
Saraf MC, Moore GL, Goodey NM, Cao VY, Benkovic SJ, Maranas CD. IPRO: an iterative computational protein library redesign and optimization procedure. Biophys J 2006; 90:4167-80. [PMID: 16513775 PMCID: PMC1459523 DOI: 10.1529/biophysj.105.079277] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
A number of computational approaches have been developed to reengineer promising chimeric proteins one at a time through targeted point mutations. In this article, we introduce the computational procedure IPRO (iterative protein redesign and optimization procedure) for the redesign of an entire combinatorial protein library in one step using energy-based scoring functions. IPRO relies on identifying mutations in the parental sequences, which when propagated downstream in the combinatorial library, improve the average quality of the library (e.g., stability, binding affinity, specific activity, etc.). Residue and rotamer design choices are driven by a globally convergent mixed-integer linear programming formulation. Unlike many of the available computational approaches, the procedure allows for backbone movement as well as redocking of the associated ligands after a prespecified number of design iterations. IPRO can also be used, as a limiting case, for the redesign of a single or handful of individual sequences. The application of IPRO is highlighted through the redesign of a 16-member library of Escherichia coli/Bacillus subtilis dihydrofolate reductase hybrids, both individually and through upstream parental sequence redesign, for improving the average binding energy. Computational results demonstrate that it is indeed feasible to improve the overall library quality as exemplified by binding energy scores through targeted mutations in the parental sequences.
Collapse
Affiliation(s)
- Manish C Saraf
- Department of Chemical Engineering, The Pennsylvania State University, University Park, PA 16802, USA
| | | | | | | | | | | |
Collapse
|
24
|
Patrick WM, Firth AE. Strategies and computational tools for improving randomized protein libraries. ACTA ACUST UNITED AC 2005; 22:105-12. [PMID: 16095966 DOI: 10.1016/j.bioeng.2005.06.001] [Citation(s) in RCA: 97] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2005] [Revised: 06/20/2005] [Accepted: 06/21/2005] [Indexed: 11/15/2022]
Abstract
In the last decade, directed evolution has become a routine approach for engineering proteins with novel or altered properties. Concurrently, a trend away from purely 'blind' randomization strategies and towards more 'semi-rational' approaches has also become apparent. In this review, we discuss ways in which structural information and predictive computational tools are playing an increasingly important role in guiding the design of randomized libraries: web servers such as ConSurf-HSSP and SCHEMA allow the prediction of sites to target for producing functional variants, while algorithms such as GLUE, PEDEL and DRIVeR are useful for estimating library completeness and diversity. In addition, we review recent methodological developments that facilitate the construction of unbiased libraries, which are inherently more diverse than biased libraries and therefore more likely to yield improved variants.
Collapse
Affiliation(s)
- Wayne M Patrick
- Center for Fundamental and Applied Molecular Evolution, Emory University, 1510 Clifton Road, Atlanta GA 30322, USA.
| | | |
Collapse
|
25
|
Wittrup KD. Directed evolution in chemical engineering. AIChE J 2005. [DOI: 10.1002/aic.10706] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
26
|
Kruus E, Thumfort P, Tang C, Wingreen NS. Gibbs sampling and helix-cap motifs. Nucleic Acids Res 2005; 33:5343-53. [PMID: 16174845 PMCID: PMC1234247 DOI: 10.1093/nar/gki842] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2005] [Revised: 08/08/2005] [Accepted: 08/30/2005] [Indexed: 11/25/2022] Open
Abstract
Protein backbones have characteristic secondary structures, including alpha-helices and beta-sheets. Which structure is adopted locally is strongly biased by the local amino acid sequence of the protein. Accurate (probabilistic) mappings from sequence to structure are valuable for both secondary-structure prediction and protein design. For the case of alpha-helix caps, we test whether the information content of the sequence-structure mapping can be self-consistently improved by using a relaxed definition of the structure. We derive helix-cap sequence motifs using database helix assignments for proteins of known structure. These motifs are refined using Gibbs sampling in competition with a null motif. Then Gibbs sampling is repeated, allowing for frameshifts of +/-1 amino acid residue, in order to find sequence motifs of higher total information content. All helix-cap motifs were found to have good generalization capability, as judged by training on a small set of non-redundant proteins and testing on a larger set. For overall prediction purposes, frameshift motifs using all training examples yielded the best results. Frameshift motifs using a fraction of all training examples performed best in terms of true positives among top predictions. However, motifs without frameshifts also performed well, despite a roughly one-third lower total information content.
Collapse
Affiliation(s)
- Erik Kruus
- NEC Laboratories America, Inc. 4 Independence Way, Princeton, NJ 08544, USA.
| | | | | | | |
Collapse
|
27
|
Hernández G, LeMaster DM. Hybrid native partitioning of interactions among nonconserved residues in chimeric proteins. Proteins 2005; 60:723-31. [PMID: 16021631 DOI: 10.1002/prot.20534] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Given any operational criterion for pairwise interatomic interactions, for a pair of structurally homologous proteins there exists for both proteins a unique equivalent partitioning of the nonconserved residue positions into mutually non-interacting clusters. In the formation of a chimeric protein derived from these two parental sequences, if nonnative-like interactions are to be avoided in its tertiary structure, then all of the nonconserved residues of each cluster must necessarily be either maintained or interchanged simultaneously. This hybrid native partitioning criterion is applied to known gene shuffling results. When the degree of estimated disruption is modest, the HybNat algorithm provides an efficient predictor of structural integrity. This supports the expectation that a substantial fraction of sequences that conform to the hybrid native partitioning criterion will yield tertiary structures that largely preserve the native-like interactions of the parental proteins.
Collapse
Affiliation(s)
- Griselda Hernández
- Wadsworth Center, New York State Department of Health and Department of Biomedical Sciences, University at Albany-SUNY, Empire State Plaza, Albany, New York 12201-0509, USA
| | | |
Collapse
|
28
|
Abstract
In this article we introduce a computational procedure, OPTCOMB (Optimal Pattern of Tiling for COMBinatorial library design), for designing protein hybrid libraries that optimally balance library size with quality. The proposed procedure is directly applicable to oligonucleotide ligation-based protocols such as GeneReassembly, DHR, SISDC, and many more. Given a set of parental sequences and the size ranges of the parental sequence fragments, OPTCOMB determines the optimal junction points (i.e., crossover positions) and the fragment contributing parental sequences at each one of the junction points. By rationally selecting the junction points and the contributing parental sequences, the number of clashes (i.e., unfavorable interactions) in the library is systematically minimized with the aim of improving the overall library quality. Using OPTCOMB, hybrid libraries containing fragments from three different dihydrofolate reductase sequences (Escherichia coli, Bacillus subtilis, and Lactobacillus casei) are computationally designed. Notably, we find that there exists an optimal library size when both the number of clashes between the fragments composing the library and the average number of clashes per hybrid in the library are minimized. Results reveal that the best library designs typically involve complex tiling patterns of parental segments of unequal size hard to infer without relying on computational means.
Collapse
Affiliation(s)
- Manish C Saraf
- Department of Chemical Engineering, The Pennsylvania State University, University Park, Pennsylvania 16082, USA
| | | | | |
Collapse
|
29
|
Saraf MC, Horswill AR, Benkovic SJ, Maranas CD. FamClash: a method for ranking the activity of engineered enzymes. Proc Natl Acad Sci U S A 2004; 101:4142-7. [PMID: 14981242 PMCID: PMC384708 DOI: 10.1073/pnas.0400065101] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
This article introduces the computational procedure FamClash for analyzing incompatibilities in engineered protein hybrids by using protein family sequence data. All pairs of residue positions in the sequence alignment that conserve the property triplet of charge, volume, and hydrophobicity are first identified, and significant deviations are denoted as residue-residue clashes. This approach moves beyond earlier efforts aimed at solely classifying hybrids as functional or nonfunctional by correlating the rank ordering of these hybrids based on their activity levels. Experimental testing of this approach was performed in parallel to assess the predictive ability of FamClash. As a model system, single-crossover ITCHY (incremental truncation for the creation of hybrid enzymes) libraries were prepared from the Escherichia coli and Bacillus subtilis dihydrofolate reductases, and the activities of functional hybrids were determined. Comparisons of the predicted clash map as a function of crossover position revealed good agreement with activity data, reproducing the observed V shape and matching the location of a local peak in activity.
Collapse
Affiliation(s)
- Manish C Saraf
- Department of Chemistry, 414 Wartik Laboratory, Pennsylvania State University, University Park, PA 16802, USA
| | | | | | | |
Collapse
|
30
|
Moore GL, Maranas CD. Computational challenges in combinatorial library design for protein engineering. AIChE J 2004. [DOI: 10.1002/aic.10025] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
31
|
Abstract
The seminal hypotheses proposed over the years for enzymatic catalysis are scrutinized. The historical record is explored from both biochemical and theoretical perspectives. Particular attention is given to the impact of molecular motions within the protein on the enzyme's catalytic properties. A case study for the enzyme dihydrofolate reductase provides evidence for coupled networks of predominantly conserved residues that influence the protein structure and motion. Such coupled networks have important implications for the origin and evolution of enzymes, as well as for protein engineering.
Collapse
Affiliation(s)
- Stephen J Benkovic
- Department of Chemistry, 152 Davey Laboratory, Pennsylvania State University, University Park, PA 16802, USA.
| | | |
Collapse
|