51
|
Kumar A, Grover S, Sharma J, Batish VK. Chymosin and other milk coagulants: sources and biotechnological interventions. Crit Rev Biotechnol 2010; 30:243-58. [DOI: 10.3109/07388551.2010.483459] [Citation(s) in RCA: 92] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
|
52
|
Piovan A, Cozza G, Caniato R, Moro S, Filippini R. A novel glucosyltransferase from Catharanthus roseus cell suspensions. Process Biochem 2010. [DOI: 10.1016/j.procbio.2009.12.014] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
53
|
Tufvesson P, Fu W, Jensen JS, Woodley JM. Process considerations for the scale-up and implementation of biocatalysis. FOOD AND BIOPRODUCTS PROCESSING 2010. [DOI: 10.1016/j.fbp.2010.01.003] [Citation(s) in RCA: 110] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
54
|
Lee SC. Implications of available design space for identification of non-immunogenic protein therapeutics. Biomed Microdevices 2009; 12:283-6. [PMID: 20012559 DOI: 10.1007/s10544-009-9383-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Immunogenicity/antibody responses are major issues for parenteral proteins and nanotherapeutics (nanovectors, diagnostics, theranostics, etc.), and robust antibody responses require T-helper epitopes. T-helper epitopes consist of specific amino acids at specific positions (anchor positions) in immunogens which contact the major histocompatibility complex (MHC), provide most of the energy for MHC binding and constitute the binding motif for the corresponding MHC alleles. We developed an algorithm that considers motifs to design vaccines lacking unwanted T-cell epitopes, and found numbers of such vaccines can be astronomical (Lee et al. 2009). The algorithm can be used to design reduced immunogenicity proteins, and numbers of predicted proteins are also immense. Reducing T-helper epitope content reduces protein immunogenicity, but the depth of mutagensis needed to eliminate immunogenicity is commonly assumed to be too great for retention of protein bioactivity. However, very deep, but successful substitution, insertion and deletion mutagenesis have been reported. These reports and design space the algorithm reveals suggest development of non-immunogenic therapeutics might be more feasible than commonly assumed.
Collapse
Affiliation(s)
- Stephen Craig Lee
- Department of Biomedical Engineering, The Ohio State University, Columbus, OH 43210, USA.
| |
Collapse
|
55
|
Liang J, Lalonde J, Borup B, Mitchell V, Mundorff E, Trinh N, Kochrekar DA, Nair Cherat R, Pai GG. Development of a Biocatalytic Process as an Alternative to the (−)-DIP-Cl-Mediated Asymmetric Reduction of a Key Intermediate of Montelukast. Org Process Res Dev 2009. [DOI: 10.1021/op900272d] [Citation(s) in RCA: 172] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Jack Liang
- Codexis, Inc., 200 Penobscot Drive, Redwood City, California 94063, U.S.A., and Arch PharmLabs Limited, “H” Wing, Fourth Floor, Tex Centre, Off Saki Vihar Road, Chandivali Andheri (East), Mumbai - 400 072, India
| | - James Lalonde
- Codexis, Inc., 200 Penobscot Drive, Redwood City, California 94063, U.S.A., and Arch PharmLabs Limited, “H” Wing, Fourth Floor, Tex Centre, Off Saki Vihar Road, Chandivali Andheri (East), Mumbai - 400 072, India
| | - Birthe Borup
- Codexis, Inc., 200 Penobscot Drive, Redwood City, California 94063, U.S.A., and Arch PharmLabs Limited, “H” Wing, Fourth Floor, Tex Centre, Off Saki Vihar Road, Chandivali Andheri (East), Mumbai - 400 072, India
| | - Vesna Mitchell
- Codexis, Inc., 200 Penobscot Drive, Redwood City, California 94063, U.S.A., and Arch PharmLabs Limited, “H” Wing, Fourth Floor, Tex Centre, Off Saki Vihar Road, Chandivali Andheri (East), Mumbai - 400 072, India
| | - Emily Mundorff
- Codexis, Inc., 200 Penobscot Drive, Redwood City, California 94063, U.S.A., and Arch PharmLabs Limited, “H” Wing, Fourth Floor, Tex Centre, Off Saki Vihar Road, Chandivali Andheri (East), Mumbai - 400 072, India
| | - Na Trinh
- Codexis, Inc., 200 Penobscot Drive, Redwood City, California 94063, U.S.A., and Arch PharmLabs Limited, “H” Wing, Fourth Floor, Tex Centre, Off Saki Vihar Road, Chandivali Andheri (East), Mumbai - 400 072, India
| | - D. A. Kochrekar
- Codexis, Inc., 200 Penobscot Drive, Redwood City, California 94063, U.S.A., and Arch PharmLabs Limited, “H” Wing, Fourth Floor, Tex Centre, Off Saki Vihar Road, Chandivali Andheri (East), Mumbai - 400 072, India
| | - Ramachandran Nair Cherat
- Codexis, Inc., 200 Penobscot Drive, Redwood City, California 94063, U.S.A., and Arch PharmLabs Limited, “H” Wing, Fourth Floor, Tex Centre, Off Saki Vihar Road, Chandivali Andheri (East), Mumbai - 400 072, India
| | - G. Ganesh Pai
- Codexis, Inc., 200 Penobscot Drive, Redwood City, California 94063, U.S.A., and Arch PharmLabs Limited, “H” Wing, Fourth Floor, Tex Centre, Off Saki Vihar Road, Chandivali Andheri (East), Mumbai - 400 072, India
| |
Collapse
|
56
|
Bougioukou D, Kille S, Taglieber A, Reetz M. Directed Evolution of an Enantioselective Enoate-Reductase: Testing the Utility of Iterative Saturation Mutagenesis. Adv Synth Catal 2009. [DOI: 10.1002/adsc.200900644] [Citation(s) in RCA: 150] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
57
|
Liang J, Mundorff E, Voladri R, Jenne S, Gilson L, Conway A, Krebber A, Wong J, Huisman G, Truesdell S, Lalonde J. Highly Enantioselective Reduction of a Small Heterocyclic Ketone: Biocatalytic Reduction of Tetrahydrothiophene-3-one to the Corresponding (R)-Alcohol. Org Process Res Dev 2009. [DOI: 10.1021/op9002714] [Citation(s) in RCA: 110] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Jack Liang
- Codexis, Inc., 200 Penobscot Drive, Redwood City, California 94063, U.S.A., and Pfizer Global R&D, Easten Point Road, Groton, Connecticut 06340, U.S.A
| | - Emily Mundorff
- Codexis, Inc., 200 Penobscot Drive, Redwood City, California 94063, U.S.A., and Pfizer Global R&D, Easten Point Road, Groton, Connecticut 06340, U.S.A
| | - Rama Voladri
- Codexis, Inc., 200 Penobscot Drive, Redwood City, California 94063, U.S.A., and Pfizer Global R&D, Easten Point Road, Groton, Connecticut 06340, U.S.A
| | - Stephan Jenne
- Codexis, Inc., 200 Penobscot Drive, Redwood City, California 94063, U.S.A., and Pfizer Global R&D, Easten Point Road, Groton, Connecticut 06340, U.S.A
| | - Lynne Gilson
- Codexis, Inc., 200 Penobscot Drive, Redwood City, California 94063, U.S.A., and Pfizer Global R&D, Easten Point Road, Groton, Connecticut 06340, U.S.A
| | - Aaron Conway
- Codexis, Inc., 200 Penobscot Drive, Redwood City, California 94063, U.S.A., and Pfizer Global R&D, Easten Point Road, Groton, Connecticut 06340, U.S.A
| | - Anke Krebber
- Codexis, Inc., 200 Penobscot Drive, Redwood City, California 94063, U.S.A., and Pfizer Global R&D, Easten Point Road, Groton, Connecticut 06340, U.S.A
| | - John Wong
- Codexis, Inc., 200 Penobscot Drive, Redwood City, California 94063, U.S.A., and Pfizer Global R&D, Easten Point Road, Groton, Connecticut 06340, U.S.A
| | - Gjalt Huisman
- Codexis, Inc., 200 Penobscot Drive, Redwood City, California 94063, U.S.A., and Pfizer Global R&D, Easten Point Road, Groton, Connecticut 06340, U.S.A
| | - Susan Truesdell
- Codexis, Inc., 200 Penobscot Drive, Redwood City, California 94063, U.S.A., and Pfizer Global R&D, Easten Point Road, Groton, Connecticut 06340, U.S.A
| | - James Lalonde
- Codexis, Inc., 200 Penobscot Drive, Redwood City, California 94063, U.S.A., and Pfizer Global R&D, Easten Point Road, Groton, Connecticut 06340, U.S.A
| |
Collapse
|
58
|
Enzyme engineering for enantioselectivity: from trial-and-error to rational design? Trends Biotechnol 2009; 28:46-54. [PMID: 19913316 DOI: 10.1016/j.tibtech.2009.10.001] [Citation(s) in RCA: 105] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2009] [Revised: 10/02/2009] [Accepted: 10/05/2009] [Indexed: 11/23/2022]
Abstract
The availability of tailored enzymes is crucial for the implementation of biocatalysis in organic chemistry. Enantioselectivity is one key parameter defining the usefulness of an enzyme and, therefore, the competitiveness of the corresponding industrial process. Hence, identification of enzymes with high enantioselectivity in the desired transformation is important. Currently, this is achieved by screening collections and libraries comprising natural or man-made diversity for the desired trait. Recently, a variety of improved methods have been developed to generate and screen this diversity more efficiently. Here, we present and discuss the most important advances in both library generation and screening. We also evaluate future trends, such as moving from random evolution to more rational.
Collapse
|
59
|
Ozen A, Gönen M, Alpaydan E, Haliloğlu T. Machine learning integration for predicting the effect of single amino acid substitutions on protein stability. BMC STRUCTURAL BIOLOGY 2009; 9:66. [PMID: 19840377 PMCID: PMC2777163 DOI: 10.1186/1472-6807-9-66] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/03/2009] [Accepted: 10/19/2009] [Indexed: 11/10/2022]
Abstract
BACKGROUND Computational prediction of protein stability change due to single-site amino acid substitutions is of interest in protein design and analysis. We consider the following four ways to improve the performance of the currently available predictors: (1) We include additional sequence- and structure-based features, namely, the amino acid substitution likelihoods, the equilibrium fluctuations of the alpha- and beta-carbon atoms, and the packing density. (2) By implementing different machine learning integration approaches, we combine information from different features or representations. (3) We compare classification vs. regression methods to predict the sign vs. the output of stability change. (4) We allow a reject option for doubtful cases where the risk of misclassification is high. RESULTS We investigate three different approaches: early, intermediate and late integration, which respectively combine features, kernels over feature subsets, and decisions. We perform simulations on two data sets: (1) S1615 is used in previous studies, (2) S2783 is the updated version (as of July 2, 2009) extracted also from ProTherm. For S1615 data set, our highest accuracy using both sequence and structure information is 0.842 on cross-validation and 0.904 on testing using early integration. Newly added features, namely, local compositional packing and the mobility extent of the mutated residues, improve accuracy significantly with intermediate integration. For S2783 data set, we also train regression methods to estimate not only the sign but also the amount of stability change and apply risk-based classification to reject when the learner has low confidence and the loss of misclassification is high. The highest accuracy is 0.835 on cross-validation and 0.832 on testing using only sequence information. The percentage of false positives can be decreased to less than 0.005 by rejecting 10 per cent using late integration. CONCLUSION We find that in both early and late integration, combining inputs or decisions is useful in increasing accuracy. Intermediate integration allows assessing the contributions of individual features by looking at the assigned weights. Overall accuracy of regression is not better than that of classification but it has less false positives, especially when combined with the reject option. The server for stability prediction for three integration approaches and the data sets are available at http://www.prc.boun.edu.tr/appserv/prc/mlsta.
Collapse
Affiliation(s)
- Ayşegül Ozen
- Department of Chemical Engineering, Polymer Research Center, Boğaziçi University, Istanbul, Turkey.
| | | | | | | |
Collapse
|
60
|
How to overcome limitations in biotechnological processes - examples from hydroxynitrile lyase applications. Trends Biotechnol 2009; 27:599-607. [DOI: 10.1016/j.tibtech.2009.07.005] [Citation(s) in RCA: 62] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2009] [Revised: 07/01/2009] [Accepted: 07/13/2009] [Indexed: 11/17/2022]
|
61
|
Reetz M, Soni P, Acevedo J, Sanchis J. Creation of an Amino Acid Network of Structurally Coupled Residues in the Directed Evolution of a Thermostable Enzyme. Angew Chem Int Ed Engl 2009; 48:8268-72. [DOI: 10.1002/anie.200904209] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
|
62
|
Reetz M, Soni P, Acevedo J, Sanchis J. Creation of an Amino Acid Network of Structurally Coupled Residues in the Directed Evolution of a Thermostable Enzyme. Angew Chem Int Ed Engl 2009. [DOI: 10.1002/ange.200904209] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
63
|
Carothers JM, Goler JA, Keasling JD. Chemical synthesis using synthetic biology. Curr Opin Biotechnol 2009; 20:498-503. [PMID: 19720519 DOI: 10.1016/j.copbio.2009.08.001] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2009] [Revised: 08/04/2009] [Accepted: 08/04/2009] [Indexed: 12/22/2022]
Abstract
An immense array of naturally occurring biological systems have evolved that convert simple substrates into the products that cells need for growth and persistence. Through the careful application of metabolic engineering and synthetic biology, this biotransformation potential can be harnessed to produce chemicals that address unmet clinical and industrial needs. Developing the capacity to utilize biology to perform chemistry is a matter of increasing control over both the function of synthetic biological systems and the engineering of those systems. Recent efforts have improved general techniques and yielded successes in the use of synthetic biology for the production of drugs, bulk chemicals, and fuels in microbial platform hosts. Synthetic promoter systems and novel RNA-based, or riboregulator, mechanisms give more control over gene expression. Improved methods for isolating, engineering, and evolving enzymes give more control over substrate and product specificity and better catalysis inside the cell. New computational tools and methods for high-throughput system assembly and analysis may lead to more rapid forward engineering. We highlight research that reduces reliance upon natural biological components and point to future work that may enable more rational design and assembly of synthetic biological systems for synthetic chemistry.
Collapse
Affiliation(s)
- James M Carothers
- California Institute for Quantitative Biosciences and Berkeley Center for Synthetic Biology, University of California, Berkeley, CA 94720, USA.
| | | | | |
Collapse
|
64
|
Affiliation(s)
- Ian W. Davies
- Merck, 126 East Lincoln Avenue, Rahway, NJ 07065–0900, USA
| | | |
Collapse
|
65
|
Buchholz F. Engineering DNA processing enzymes for the postgenomic era. Curr Opin Biotechnol 2009; 20:383-9. [DOI: 10.1016/j.copbio.2009.07.005] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2009] [Revised: 07/07/2009] [Accepted: 07/25/2009] [Indexed: 12/15/2022]
|
66
|
Reetz MT, Bocola M, Wang LW, Sanchis J, Cronin A, Arand M, Zou J, Archelas A, Bottalla AL, Naworyta A, Mowbray SL. Directed evolution of an enantioselective epoxide hydrolase: uncovering the source of enantioselectivity at each evolutionary stage. J Am Chem Soc 2009; 131:7334-43. [PMID: 19469578 DOI: 10.1021/ja809673d] [Citation(s) in RCA: 119] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Directed evolution of enzymes as enantioselective catalysts in organic chemistry is an alternative to traditional asymmetric catalysis using chiral transition-metal complexes or organocatalysts, the different approaches often being complementary. Moreover, directed evolution studies allow us to learn more about how enzymes perform mechanistically. The present study concerns a previously evolved highly enantioselective mutant of the epoxide hydrolase from Aspergillus niger in the hydrolytic kinetic resolution of racemic glycidyl phenyl ether. Kinetic data, molecular dynamics calculations, molecular modeling, inhibition experiments, and X-ray structural work for the wild-type (WT) enzyme and the best mutant reveal the basis of the large increase in enantioselectivity (E = 4.6 versus E = 115). The overall structures of the WT and the mutant are essentially identical, but dramatic differences are observed in the active site as revealed by the X-ray structures. All of the experimental and computational results support a model in which productive positioning of the preferred (S)-glycidyl phenyl ether, but not the (R)-enantiomer, forms the basis of enhanced enantioselectivity. Predictions regarding substrate scope and enantioselectivity of the best mutant are shown to be possible.
Collapse
Affiliation(s)
- Manfred T Reetz
- Max-Planck-Institut für Kohlenforschung, Kaiser-Wilhelm-Platz 1, D-45470 Mulheim/Ruhr, Germany.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
67
|
|
68
|
Damborsky J, Brezovsky J. Computational tools for designing and engineering biocatalysts. Curr Opin Chem Biol 2009; 13:26-34. [PMID: 19297237 DOI: 10.1016/j.cbpa.2009.02.021] [Citation(s) in RCA: 76] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2008] [Revised: 02/15/2009] [Accepted: 02/17/2009] [Indexed: 11/28/2022]
Abstract
Current computational tools to assist experimentalists for the design and engineering of proteins with desired catalytic properties are reviewed. The applications of these tools for de novo design of protein active sites, optimization of substrate access and product exit pathways, redesign of protein-protein interfaces, identification of neutral/advantageous/deleterious mutations in the libraries from directed evolution and stabilization of protein structures are described. Remarkable progress is seen in de novo design of enzymes catalyzing a chemical reaction for which a natural biocatalyst does not exist. Yet, constructed biocatalysts do not match natural enzymes in their efficiency, suggesting that more research is needed to capture all the important features of natural biocatalysts in theoretical designs.
Collapse
Affiliation(s)
- Jiri Damborsky
- Institute of Experimental Biology and National Centre for Biomolecular Research, Masaryk University, Brno, Czech Republic.
| | | |
Collapse
|
69
|
Shivange AV, Marienhagen J, Mundhada H, Schenk A, Schwaneberg U. Advances in generating functional diversity for directed protein evolution. Curr Opin Chem Biol 2009; 13:19-25. [DOI: 10.1016/j.cbpa.2009.01.019] [Citation(s) in RCA: 138] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2008] [Revised: 01/26/2009] [Accepted: 01/28/2009] [Indexed: 11/16/2022]
|
70
|
Reetz MT, Kahakeaw D, Sanchis J. Shedding light on the efficacy of laboratory evolution based on iterative saturation mutagenesis. ACTA ACUST UNITED AC 2009; 5:115-22. [DOI: 10.1039/b814862g] [Citation(s) in RCA: 61] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
|
71
|
Sanchis J, Fernández L, Carballeira JD, Drone J, Gumulya Y, Höbenreich H, Kahakeaw D, Kille S, Lohmer R, Peyralans JJP, Podtetenieff J, Prasad S, Soni P, Taglieber A, Wu S, Zilly FE, Reetz MT. Improved PCR method for the creation of saturation mutagenesis libraries in directed evolution: application to difficult-to-amplify templates. Appl Microbiol Biotechnol 2008; 81:387-97. [PMID: 18820909 PMCID: PMC7419347 DOI: 10.1007/s00253-008-1678-9] [Citation(s) in RCA: 112] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2008] [Revised: 08/20/2008] [Accepted: 08/21/2008] [Indexed: 11/24/2022]
Abstract
Saturation mutagenesis constitutes a powerful method in the directed evolution of enzymes. Traditional protocols of whole plasmid amplification such as Stratagene’s QuikChange™ sometimes fail when the templates are difficult to amplify. In order to overcome such restrictions, we have devised a simple two-primer, two-stage polymerase chain reaction (PCR) method which constitutes an improvement over existing protocols. In the first stage of the PCR, both the mutagenic primer and the antiprimer that are not complementary anneal to the template. In the second stage, the amplified sequence is used as a megaprimer. Sites composed of one or more residues can be randomized in a single PCR reaction, irrespective of their location in the gene sequence.The method has been applied to several enzymes successfully, including P450-BM3 from Bacillus megaterium, the lipases from Pseudomonas aeruginosa and Candida antarctica and the epoxide hydrolase from Aspergillus niger. Here, we show that megaprimer size as well as the direction and design of the antiprimer are determining factors in the amplification of the plasmid. Comparison of the results with the performances of previous protocols reveals the efficiency of the improved method.
Collapse
Affiliation(s)
- Joaquin Sanchis
- Max-Planck-Institut für Kohlenforschung, Kaiser-Wilhelm-Platz 1, 45470, Mülheim/Ruhr, Germany
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
72
|
Reetz MT, Sanchis J. Constructing and Analyzing the Fitness Landscape of an Experimental Evolutionary Process. Chembiochem 2008; 9:2260-7. [DOI: 10.1002/cbic.200800371] [Citation(s) in RCA: 78] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
73
|
Predicting positive p53 cancer rescue regions using Most Informative Positive (MIP) active learning. PLoS Comput Biol 2008; 5:e1000498. [PMID: 19756158 PMCID: PMC2742196 DOI: 10.1371/journal.pcbi.1000498] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2009] [Accepted: 08/04/2009] [Indexed: 11/19/2022] Open
Abstract
Many protein engineering problems involve finding mutations that produce proteins
with a particular function. Computational active learning is an attractive
approach to discover desired biological activities. Traditional active learning
techniques have been optimized to iteratively improve classifier accuracy, not
to quickly discover biologically significant results. We report here a novel
active learning technique, Most Informative Positive (MIP), which is tailored to
biological problems because it seeks novel and informative positive results. MIP
active learning differs from traditional active learning methods in two ways:
(1) it preferentially seeks Positive (functionally active) examples; and (2) it
may be effectively extended to select gene regions suitable for high throughput
combinatorial mutagenesis. We applied MIP to discover mutations in the tumor
suppressor protein p53 that reactivate mutated p53 found in human cancers. This
is an important biomedical goal because p53 mutants have been
implicated in half of all human cancers, and restoring active p53 in tumors
leads to tumor regression. MIP found Positive (cancer rescue) p53 mutants
in silico using 33% fewer experiments than
traditional non-MIP active learning, with only a minor decrease in classifier
accuracy. Applying MIP to in vivo experimentation yielded
immediate Positive results. Ten different p53 mutations found in human cancers
were paired in silico with all possible single amino acid
rescue mutations, from which MIP was used to select a Positive Region predicted
to be enriched for p53 cancer rescue mutants. In vivo assays
showed that the predicted Positive Region: (1) had significantly more
(p<0.01) new strong cancer rescue mutants than control regions (Negative,
and non-MIP active learning); (2) had slightly more new strong cancer rescue
mutants than an Expert region selected for purely biological considerations; and
(3) rescued for the first time the previously unrescuable p53 cancer mutant
P152L. Engineering proteins to acquire or enhance a particular useful function is at the
core of many biomedical problems. This paper presents Most Informative Positive
(MIP) active learning, a novel integrated computational/biological approach
designed to help guide biological discovery of novel and informative positive
mutants. A classifier, together with modeled structure-based features, helps
guide biological experiments and so accelerates protein engineering studies. MIP
reduces the number of expensive biological experiments needed to achieve novel
and informative positive results. We used the MIP method to discover novel p53
cancer rescue mutants. p53 is a tumor suppressor protein, and destructive p53
mutations have been implicated in half of all human cancers. Second-site cancer
rescue mutations restore p53 activity and eventually may facilitate rational
design of better cancer drugs. This paper shows that, even in the first round of
in vivo experiments, MIP significantly increased the discovery rate of novel and
informative positive mutants.
Collapse
|
74
|
Reetz MT, Kahakeaw D, Lohmer R. Addressing the Numbers Problem in Directed Evolution. Chembiochem 2008; 9:1797-804. [DOI: 10.1002/cbic.200800298] [Citation(s) in RCA: 334] [Impact Index Per Article: 20.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
75
|
Urban P, Truan G, Pompon D. High-throughput enzymology and combinatorial mutagenesis for mining cytochrome P450 functions. Expert Opin Drug Metab Toxicol 2008; 4:733-47. [DOI: 10.1517/17425255.4.6.733] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
|
76
|
Reetz MT, Wu S. Greatly reduced amino acid alphabets in directed evolution: making the right choice for saturation mutagenesis at homologous enzyme positions. Chem Commun (Camb) 2008:5499-501. [DOI: 10.1039/b813388c] [Citation(s) in RCA: 125] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
|