151
|
Ott F, Rabe KS, Niemeyer CM, Gygli G. Toward Reproducible Enzyme Modeling with Isothermal Titration Calorimetry. ACS Catal 2021. [DOI: 10.1021/acscatal.1c02076] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Affiliation(s)
- Felix Ott
- Institute for Biological Interfaces (IBG 1), Karlsruhe Institute of Technology (KIT), Hermann-von-Helmholtz-Platz 1, 76344 Eggenstein-Leopoldshafen, Germany
| | - Kersten S. Rabe
- Institute for Biological Interfaces (IBG 1), Karlsruhe Institute of Technology (KIT), Hermann-von-Helmholtz-Platz 1, 76344 Eggenstein-Leopoldshafen, Germany
| | - Christof M. Niemeyer
- Institute for Biological Interfaces (IBG 1), Karlsruhe Institute of Technology (KIT), Hermann-von-Helmholtz-Platz 1, 76344 Eggenstein-Leopoldshafen, Germany
| | - Gudrun Gygli
- Institute for Biological Interfaces (IBG 1), Karlsruhe Institute of Technology (KIT), Hermann-von-Helmholtz-Platz 1, 76344 Eggenstein-Leopoldshafen, Germany
| |
Collapse
|
152
|
Molina M, Cioci G, Moulis C, Séverac E, Remaud-Siméon M. Bacterial α-Glucan and Branching Sucrases from GH70 Family: Discovery, Structure-Function Relationship Studies and Engineering. Microorganisms 2021; 9:microorganisms9081607. [PMID: 34442685 PMCID: PMC8398850 DOI: 10.3390/microorganisms9081607] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Revised: 07/22/2021] [Accepted: 07/25/2021] [Indexed: 01/12/2023] Open
Abstract
Glucansucrases and branching sucrases are classified in the family 70 of glycoside hydrolases. They are produced by lactic acid bacteria occupying very diverse ecological niches (soil, buccal cavity, sourdough, intestine, dairy products, etc.). Usually secreted by their producer organisms, they are involved in the synthesis of α-glucans from sucrose substrate. They contribute to cell protection while promoting adhesion and colonization of different biotopes. Dextran, an α-1,6 linked linear α-glucan, was the first microbial polysaccharide commercialized for medical applications. Advances in the discovery and characterization of these enzymes have remarkably enriched the available diversity with new catalysts. Research into their molecular mechanisms has highlighted important features governing their peculiarities thus opening up many opportunities for engineering these catalysts to provide new routes for the transformation of sucrose into value-added molecules. This article reviews these different aspects with the ambition to show how they constitute the basis for promising future developments.
Collapse
|
153
|
Yi D, Bayer T, Badenhorst CPS, Wu S, Doerr M, Höhne M, Bornscheuer UT. Recent trends in biocatalysis. Chem Soc Rev 2021; 50:8003-8049. [PMID: 34142684 PMCID: PMC8288269 DOI: 10.1039/d0cs01575j] [Citation(s) in RCA: 115] [Impact Index Per Article: 38.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2020] [Indexed: 12/13/2022]
Abstract
Biocatalysis has undergone revolutionary progress in the past century. Benefited by the integration of multidisciplinary technologies, natural enzymatic reactions are constantly being explored. Protein engineering gives birth to robust biocatalysts that are widely used in industrial production. These research achievements have gradually constructed a network containing natural enzymatic synthesis pathways and artificially designed enzymatic cascades. Nowadays, the development of artificial intelligence, automation, and ultra-high-throughput technology provides infinite possibilities for the discovery of novel enzymes, enzymatic mechanisms and enzymatic cascades, and gradually complements the lack of remaining key steps in the pathway design of enzymatic total synthesis. Therefore, the research of biocatalysis is gradually moving towards the era of novel technology integration, intelligent manufacturing and enzymatic total synthesis.
Collapse
Affiliation(s)
- Dong Yi
- Department of Biotechnology & Enzyme Catalysis, Institute of Biochemistry, University GreifswaldFelix-Hausdorff-Str. 4D-17487 GreifswaldGermany
| | - Thomas Bayer
- Department of Biotechnology & Enzyme Catalysis, Institute of Biochemistry, University GreifswaldFelix-Hausdorff-Str. 4D-17487 GreifswaldGermany
| | - Christoffel P. S. Badenhorst
- Department of Biotechnology & Enzyme Catalysis, Institute of Biochemistry, University GreifswaldFelix-Hausdorff-Str. 4D-17487 GreifswaldGermany
| | - Shuke Wu
- Department of Biotechnology & Enzyme Catalysis, Institute of Biochemistry, University GreifswaldFelix-Hausdorff-Str. 4D-17487 GreifswaldGermany
| | - Mark Doerr
- Department of Biotechnology & Enzyme Catalysis, Institute of Biochemistry, University GreifswaldFelix-Hausdorff-Str. 4D-17487 GreifswaldGermany
| | - Matthias Höhne
- Department of Biotechnology & Enzyme Catalysis, Institute of Biochemistry, University GreifswaldFelix-Hausdorff-Str. 4D-17487 GreifswaldGermany
| | - Uwe T. Bornscheuer
- Department of Biotechnology & Enzyme Catalysis, Institute of Biochemistry, University GreifswaldFelix-Hausdorff-Str. 4D-17487 GreifswaldGermany
| |
Collapse
|
154
|
Alejaldre L, Pelletier JN, Quaglia D. Methods for enzyme library creation: Which one will you choose?: A guide for novices and experts to introduce genetic diversity. Bioessays 2021; 43:e2100052. [PMID: 34263468 DOI: 10.1002/bies.202100052] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2021] [Revised: 05/28/2021] [Accepted: 06/02/2021] [Indexed: 12/15/2022]
Abstract
Enzyme engineering allows to explore sequence diversity in search for new properties. The scientific literature is populated with methods to create enzyme libraries for engineering purposes, however, choosing a suitable method for the creation of mutant libraries can be daunting, in particular for the novices. Here, we address both novices and experts: how can one enter the arena of enzyme library design and what guidelines can advanced users apply to select strategies best suited to their purpose? Section I is dedicated to the novices and presents an overview of established and standard methods for library creation, as well as available commercial solutions. The expert will discover an up-to-date tool to freshen up their repertoire (Section I) and learn of the newest methods that are likely to become a mainstay (Section II). We focus primarily on in vitro methods, presenting the advantages of each method. Our ultimate aim is to offer a selection of methods/strategies that we believe to be most useful to the enzyme engineer, whether a first-timer or a seasoned user.
Collapse
Affiliation(s)
- Lorea Alejaldre
- Département de biochimie and Center for Green Chemistry and Catalysis (CGCC), Université de Montréal, Montréal, Quebec, Canada.,PROTEO, The Québec Network for Research on Protein Function, Engineering and Applications, Québec, Quebec, Canada
| | - Joelle N Pelletier
- Département de biochimie and Center for Green Chemistry and Catalysis (CGCC), Université de Montréal, Montréal, Quebec, Canada.,PROTEO, The Québec Network for Research on Protein Function, Engineering and Applications, Québec, Quebec, Canada.,Département de chimie, Université de Montréal, Montréal, Quebec, Canada
| | - Daniela Quaglia
- Département de chimie, Université de Montréal, Montréal, Quebec, Canada.,School of Chemistry, University of Nottingham, Nottingham, UK
| |
Collapse
|
155
|
Desagiacomo CCV, Alnoch RC, Pinheiro VE, Cereia M, Machado CB, Damasio A, Augusto MJ, Pedersoli W, Silva RN, Polizeli MDLTDM. Structural model and functional properties of an exo-polygalacturonase from Neosartorya glabra. Int J Biol Macromol 2021; 186:909-918. [PMID: 34274400 DOI: 10.1016/j.ijbiomac.2021.07.065] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2021] [Revised: 04/28/2021] [Accepted: 07/10/2021] [Indexed: 11/28/2022]
Abstract
A purified exo-polygalacturonase of Neosartorya glabra (EplNg) was successfully characterized. EplNg native presented 68.2 kDa, with 32% carbohydrate content. The deglycosylated form showed 46.3 kDa and isoelectric point of 5.4. The identity of EplNg was confirmed as an exo-polygalacturonase class I (EC 3.2.1.67) using mass spectrometry and Western-Blotting. Capillary electrophoresis indicated that only galacturonic acid was released by the action of EplNg on sodium polypectate, confirming an exoenzyme character. The structural model confers that EplNg has a core formed by twisted parallel β-sheets structure. Among twelve putative cysteines, ten were predicted to form disulfide bridges. The catalytic triad predicted is composed of Asp223, Asp245, and Asp246 aligned along with a distance in 4-5 Å, suggesting that EplNg probably does not perform the standard inverting catalytic mechanism described for the GH28 family. EplNg was active from 30 to 90 °C, with maximum activity at 65 °C, pH 5.0. The Km and Vmax determined using sodium polypectate were 6.9 mg·mL-1 and Vmax 690 μmol·min-1.mg-1, respectively. EplNg was active and stable over a wide range of pH values and temperatures, confirming the interesting properties EplNg and provide a basis for the development of the enzyme in different biotechnological processes.
Collapse
Affiliation(s)
- Carla Cristina Villela Desagiacomo
- Departamento de Bioquímica e Imunologia, Faculdade de Medicina de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto, São Paulo 14049-900, Brazil
| | - Robson Carlos Alnoch
- Departamento de Biologia, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto, São Paulo 14040-901, Brazil
| | - Vanessa Elisa Pinheiro
- Departamento de Bioquímica e Imunologia, Faculdade de Medicina de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto, São Paulo 14049-900, Brazil
| | - Mariana Cereia
- Departamento de Biologia, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto, São Paulo 14040-901, Brazil
| | - Carla Botelho Machado
- Centro Nacional de Pesquisa em Energia e Materiais, Laboratório Nacional de Ciência e Tecnologia do Bioetanol, Campinas 13083-970, Brazil
| | - André Damasio
- Instituto de Biologia, Universidade de Campinas - UNICAMP, Campinas, São Paulo 13083-862, Brazil
| | - Marlei Josiele Augusto
- Departamento de Patologia e Medicina Legal da Faculdade de Medicina de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto, São Paulo 14040-901, Brazil
| | - Wellington Pedersoli
- Departamento de Bioquímica e Imunologia, Faculdade de Medicina de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto, São Paulo 14049-900, Brazil
| | - Roberto Nascimento Silva
- Departamento de Bioquímica e Imunologia, Faculdade de Medicina de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto, São Paulo 14049-900, Brazil
| | - Maria de Lourdes Teixeira de Moraes Polizeli
- Departamento de Bioquímica e Imunologia, Faculdade de Medicina de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto, São Paulo 14049-900, Brazil; Departamento de Biologia, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto, São Paulo 14040-901, Brazil.
| |
Collapse
|
156
|
Siedhoff NE, Illig AM, Schwaneberg U, Davari MD. PyPEF-An Integrated Framework for Data-Driven Protein Engineering. J Chem Inf Model 2021; 61:3463-3476. [PMID: 34260225 DOI: 10.1021/acs.jcim.1c00099] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Data-driven strategies are gaining increased attention in protein engineering due to recent advances in access to large experimental databanks of proteins, next-generation sequencing (NGS), high-throughput screening (HTS) methods, and the development of artificial intelligence algorithms. However, the reliable prediction of beneficial amino acid substitutions, their combination, and the effect on functional properties remain the most significant challenges in protein engineering, which is applied to develop proteins and enzymes for biocatalysis, biomedicine, and life sciences. Here, we present a general-purpose framework (PyPEF: pythonic protein engineering framework) for performing data-driven protein engineering using machine learning methods combined with techniques from signal processing and statistical physics. PyPEF guides the identification and selection of beneficial proteins of a defined sequence space by systematically or randomly exploring the fitness of variants and by sampling random evolution pathways. The performance of PyPEF was evaluated concerning its predictive accuracy and throughput on four public protein and enzyme data sets using common regression models. It was proved that the program could efficiently predict the fitness of protein sequences for different target properties (predictive models with coefficient of determination values ranging from 0.58 to 0.92). By combining machine learning and protein evolution, PyPEF enabled the screening of proteins with various functions, reaching a screening capacity of more than 500,000 protein sequence variants in the timeframe of only a few minutes on a personal computer. PyPEF displayed significant accuracies on four public data sets (different proteins and properties) and underlined the potential of integrating data-driven technologies for covering different philosophies by either predicting the fitness of the variants to the highest accuracy accounting for epistatic effects or capturing the general trend of introduced mutations on the fitness in directed protein evolution campaigns. In essence, PyPEF can provide a powerful solution to current sequence exploration and combinatorial problems faced in protein engineering through exhaustive in silico screening of the sequence space.
Collapse
Affiliation(s)
- Niklas E Siedhoff
- Institute of Biotechnology, RWTH Aachen University, Worringer Weg 3, 52074 Aachen, Germany
| | | | - Ulrich Schwaneberg
- Institute of Biotechnology, RWTH Aachen University, Worringer Weg 3, 52074 Aachen, Germany.,DWI-Leibniz Institute for Interactive Materials, Forckenbeckstraße 50, 52074 Aachen, Germany
| | - Mehdi D Davari
- Institute of Biotechnology, RWTH Aachen University, Worringer Weg 3, 52074 Aachen, Germany
| |
Collapse
|
157
|
Wu L, Qin L, Nie Y, Xu Y, Zhao YL. Computer-aided understanding and engineering of enzymatic selectivity. Biotechnol Adv 2021; 54:107793. [PMID: 34217814 DOI: 10.1016/j.biotechadv.2021.107793] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2021] [Revised: 04/26/2021] [Accepted: 06/28/2021] [Indexed: 12/26/2022]
Abstract
Enzymes offering chemo-, regio-, and stereoselectivity enable the asymmetric synthesis of high-value chiral molecules. Unfortunately, the drawback that naturally occurring enzymes are often inefficient or have undesired selectivity toward non-native substrates hinders the broadening of biocatalytic applications. To match the demands of specific selectivity in asymmetric synthesis, biochemists have implemented various computer-aided strategies in understanding and engineering enzymatic selectivity, diversifying the available repository of artificial enzymes. Here, given that the entire asymmetric catalytic cycle, involving precise interactions within the active pocket and substrate transport in the enzyme channel, could affect the enzymatic efficiency and selectivity, we presented a comprehensive overview of the computer-aided workflow for enzymatic selectivity. This review includes a mechanistic understanding of enzymatic selectivity based on quantum mechanical calculations, rational design of enzymatic selectivity guided by enzyme-substrate interactions, and enzymatic selectivity regulation via enzyme channel engineering. Finally, we discussed the computational paradigm for designing enzyme selectivity in silico to facilitate the advancement of asymmetric biosynthesis.
Collapse
Affiliation(s)
- Lunjie Wu
- School of Biotechnology and Key Laboratory of Industrial Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Lei Qin
- School of Biotechnology and Key Laboratory of Industrial Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Yao Nie
- School of Biotechnology and Key Laboratory of Industrial Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China; Suqian Industrial Technology Research Institute of Jiangnan University, Suqian 223814, China.
| | - Yan Xu
- School of Biotechnology and Key Laboratory of Industrial Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China; State Key Laboratory of Food Science and Technology, Jiangnan University, Wuxi 214122, China.
| | - Yi-Lei Zhao
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic and Developmental Sciences, MOE-LSB & MOE-LSC, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| |
Collapse
|
158
|
Roda S, Robles-Martín A, Xiang R, Kazemi M, Guallar V. Structural-Based Modeling in Protein Engineering. A Must Do. J Phys Chem B 2021; 125:6491-6500. [PMID: 34106727 DOI: 10.1021/acs.jpcb.1c02545] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Biotechnological solutions will be a key aspect in our immediate future society, where optimized enzymatic processes through enzyme engineering might be an important solution for waste transformation, clean energy production, biodegradable materials, and green chemistry, for example. Here we advocate the importance of structural-based bioinformatics and molecular modeling tools in such developments. We summarize our recent experiences indicating a great prediction/success ratio, and we suggest that an early in silico phase should be performed in enzyme engineering studies. Moreover, we demonstrate the potential of a new technique combining Rosetta and PELE, which could provide a faster and more automated procedure, an essential aspect for a broader use.
Collapse
Affiliation(s)
- Sergi Roda
- Barcelona Supercomputing Center (BSC), Barcelona 08034, Spain
| | | | - Ruite Xiang
- Barcelona Supercomputing Center (BSC), Barcelona 08034, Spain
| | - Masoud Kazemi
- Barcelona Supercomputing Center (BSC), Barcelona 08034, Spain
| | - Victor Guallar
- Barcelona Supercomputing Center (BSC), Barcelona 08034, Spain.,Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona 08010, Spain
| |
Collapse
|
159
|
Schenkmayerova A, Pinto GP, Toul M, Marek M, Hernychova L, Planas-Iglesias J, Daniel Liskova V, Pluskal D, Vasina M, Emond S, Dörr M, Chaloupkova R, Bednar D, Prokop Z, Hollfelder F, Bornscheuer UT, Damborsky J. Engineering the protein dynamics of an ancestral luciferase. Nat Commun 2021; 12:3616. [PMID: 34127663 PMCID: PMC8203615 DOI: 10.1038/s41467-021-23450-z] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2020] [Accepted: 04/27/2021] [Indexed: 01/06/2023] Open
Abstract
Protein dynamics are often invoked in explanations of enzyme catalysis, but their design has proven elusive. Here we track the role of dynamics in evolution, starting from the evolvable and thermostable ancestral protein AncHLD-RLuc which catalyses both dehalogenase and luciferase reactions. Insertion-deletion (InDel) backbone mutagenesis of AncHLD-RLuc challenged the scaffold dynamics. Screening for both activities reveals InDel mutations localized in three distinct regions that lead to altered protein dynamics (based on crystallographic B-factors, hydrogen exchange, and molecular dynamics simulations). An anisotropic network model highlights the importance of the conformational flexibility of a loop-helix fragment of Renilla luciferases for ligand binding. Transplantation of this dynamic fragment leads to lower product inhibition and highly stable glow-type bioluminescence. The success of our approach suggests that a strategy comprising (i) constructing a stable and evolvable template, (ii) mapping functional regions by backbone mutagenesis, and (iii) transplantation of dynamic features, can lead to functionally innovative proteins.
Collapse
Affiliation(s)
- Andrea Schenkmayerova
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
| | - Gaspar P Pinto
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
| | - Martin Toul
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
| | - Martin Marek
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
| | - Lenka Hernychova
- Research Centre for Applied Molecular Oncology, Masaryk Memorial Cancer Institute, Brno, Czech Republic
| | - Joan Planas-Iglesias
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
| | - Veronika Daniel Liskova
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
| | - Daniel Pluskal
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
| | - Michal Vasina
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
| | - Stephane Emond
- Department of Biochemistry, University of Cambridge, Cambridge, UK
| | - Mark Dörr
- Department of Biotechnology and Enzyme Catalysis, Institute of Biochemistry, University of Greifswald, Greifswald, Germany
| | - Radka Chaloupkova
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
| | - David Bednar
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
| | - Zbynek Prokop
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
| | | | - Uwe T Bornscheuer
- Department of Biotechnology and Enzyme Catalysis, Institute of Biochemistry, University of Greifswald, Greifswald, Germany.
| | - Jiri Damborsky
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic.
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic.
| |
Collapse
|
160
|
Osadchy M, Kolodny R. How Deep Learning Tools Can Help Protein Engineers Find Good Sequences. J Phys Chem B 2021; 125:6440-6450. [PMID: 34105961 DOI: 10.1021/acs.jpcb.1c02449] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
The deep learning revolution introduced a new and efficacious way to address computational challenges in a wide range of fields, relying on large data sets and powerful computational resources. In protein engineering, we consider the challenge of computationally predicting properties of a protein and designing sequences with these properties. Indeed, accurate and fast deep network oracles for different properties of proteins have been developed. These learn to predict a property from an amino acid sequence by training on large sets of proteins that have this property. In particular, deep networks can learn from the set of all known protein sequences to identify ones that are protein-like. A fundamental challenge when engineering sequences that are both protein-like and satisfy a desired property is that these are rare instances within the vast space of all possible ones. When searching for these very rare instances, one would like to use good sampling procedures. Sampling approaches that are decoupled from the prediction of the property or in which the predictor uses only post-sampling to identify good instances are less efficient. The alternative is to use sampling methods that are geared to generate sequences satisfying and/or optimizing the predictor's desired properties. Deep learning has a class of architectures, denoted as generative models, which offer the capability of sampling from the learned distribution of a predicted property. Here, we review the use of deep learning tools to find good sequences for protein engineering, including developing oracles/predictors of a property of the proteins and methods that sample from a distribution of protein-like sequences to optimize the desired property.
Collapse
Affiliation(s)
- Margarita Osadchy
- Department of Computer Science, Jacobs Building, University of Haifa, 199 Aba Houshi Road, Mount Carmel, Haifa, Israel 3498838
| | - Rachel Kolodny
- Department of Computer Science, Jacobs Building, University of Haifa, 199 Aba Houshi Road, Mount Carmel, Haifa, Israel 3498838
| |
Collapse
|
161
|
Enhancement of protein thermostability by three consecutive mutations using loop-walking method and machine learning. Sci Rep 2021; 11:11883. [PMID: 34088952 PMCID: PMC8178419 DOI: 10.1038/s41598-021-91339-4] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2021] [Accepted: 05/25/2021] [Indexed: 01/22/2023] Open
Abstract
We developed a method to improve protein thermostability, “loop-walking method”. Three consecutive positions in 12 loops of Burkholderia cepacia lipase were subjected to random mutagenesis to make 12 libraries. Screening allowed us to identify L7 as a hot-spot loop having an impact on thermostability, and the P233G/L234E/V235M mutant was found from 214 variants in the L7 library. Although a more excellent mutant might be discovered by screening all the 8000 P233X/L234X/V235X mutants, it was difficult to assay all of them. We therefore employed machine learning. Using thermostability data of the 214 mutants, a computational discrimination model was constructed to predict thermostability potentials. Among 7786 combinations ranked in silico, 20 promising candidates were selected and assayed. The P233D/L234P/V235S mutant retained 66% activity after heat treatment at 60 °C for 30 min, which was higher than those of the wild-type enzyme (5%) and the P233G/L234E/V235M mutant (35%).
Collapse
|
162
|
Wu Z, Johnston KE, Arnold FH, Yang KK. Protein sequence design with deep generative models. Curr Opin Chem Biol 2021; 65:18-27. [PMID: 34051682 DOI: 10.1016/j.cbpa.2021.04.004] [Citation(s) in RCA: 52] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2021] [Revised: 04/02/2021] [Accepted: 04/07/2021] [Indexed: 12/20/2022]
Abstract
Protein engineering seeks to identify protein sequences with optimized properties. When guided by machine learning, protein sequence generation methods can draw on prior knowledge and experimental efforts to improve this process. In this review, we highlight recent applications of machine learning to generate protein sequences, focusing on the emerging field of deep generative methods.
Collapse
Affiliation(s)
- Zachary Wu
- Division of Chemistry and Chemical Engineering, California Institute of Technology, 1200 E California Blvd, Pasadena, 91125, CA, USA
| | - Kadina E Johnston
- Division of Biology and Biological Engineering, California Institute of Technology, 1200 E California Blvd, Pasadena, 91125, CA, USA
| | - Frances H Arnold
- Division of Chemistry and Chemical Engineering, California Institute of Technology, 1200 E California Blvd, Pasadena, 91125, CA, USA; Division of Biology and Biological Engineering, California Institute of Technology, 1200 E California Blvd, Pasadena, 91125, CA, USA
| | - Kevin K Yang
- Microsoft Research New England, 1 Memorial Drive, Cambridge, 02142, MA, USA.
| |
Collapse
|
163
|
Sequeiros-Borja CE, Surpeta B, Brezovsky J. Recent advances in user-friendly computational tools to engineer protein function. Brief Bioinform 2021; 22:bbaa150. [PMID: 32743637 PMCID: PMC8138880 DOI: 10.1093/bib/bbaa150] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2020] [Revised: 06/03/2020] [Accepted: 06/16/2020] [Indexed: 12/14/2022] Open
Abstract
Progress in technology and algorithms throughout the past decade has transformed the field of protein design and engineering. Computational approaches have become well-engrained in the processes of tailoring proteins for various biotechnological applications. Many tools and methods are developed and upgraded each year to satisfy the increasing demands and challenges of protein engineering. To help protein engineers and bioinformaticians navigate this emerging wave of dedicated software, we have critically evaluated recent additions to the toolbox regarding their application for semi-rational and rational protein engineering. These newly developed tools identify and prioritize hotspots and analyze the effects of mutations for a variety of properties, comprising ligand binding, protein-protein and protein-nucleic acid interactions, and electrostatic potential. We also discuss notable progress to target elusive protein dynamics and associated properties like ligand-transport processes and allosteric communication. Finally, we discuss several challenges these tools face and provide our perspectives on the further development of readily applicable methods to guide protein engineering efforts.
Collapse
Affiliation(s)
- Carlos Eduardo Sequeiros-Borja
- Laboratory of Biomolecular Interactions and Transport, Department of Gene Expression, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University and the International Institute of Molecular and Cell Biology in Warsaw, Warsaw, Poland
| | - Bartłomiej Surpeta
- Laboratory of Biomolecular Interactions and Transport, Department of Gene Expression, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University and the International Institute of Molecular and Cell Biology in Warsaw, Warsaw, Poland
| | - Jan Brezovsky
- Laboratory of Biomolecular Interactions and Transport, Department of Gene Expression, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University and the International Institute of Molecular and Cell Biology in Warsaw
| |
Collapse
|
164
|
Revolutionizing enzyme engineering through artificial intelligence and machine learning. Emerg Top Life Sci 2021; 5:113-125. [PMID: 33835131 DOI: 10.1042/etls20200257] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2020] [Revised: 03/17/2021] [Accepted: 03/22/2021] [Indexed: 12/20/2022]
Abstract
The combinatorial space of an enzyme sequence has astronomical possibilities and exploring it with contemporary experimental techniques is arduous and often ineffective. Multi-target objectives such as concomitantly achieving improved selectivity, solubility and activity of an enzyme have narrow plausibility under approaches of restricted mutagenesis and combinatorial search. Traditional enzyme engineering approaches have a limited scope for complex optimization due to the requirement of a priori knowledge or experimental burden of screening huge protein libraries. The recent surge in high-throughput experimental methods including Next Generation Sequencing and automated screening has flooded the field of molecular biology with big-data, which requires us to re-think our concurrent approaches towards enzyme engineering. Artificial Intelligence (AI) and Machine Learning (ML) have great potential to revolutionize smart enzyme engineering without the explicit need for a complete understanding of the underlying molecular system. Here, we portray the role and position of AI techniques in the field of enzyme engineering along with their scope and limitations. In addition, we explain how the traditional approaches of directed evolution and rational design can be extended through AI tools. Recent successful examples of AI-assisted enzyme engineering projects and their deviation from traditional approaches are highlighted. A comprehensive picture of current challenges and future avenues for AI in enzyme engineering are also discussed.
Collapse
|
165
|
Robotics for enzyme technology: innovations and technological perspectives. Appl Microbiol Biotechnol 2021; 105:4089-4097. [PMID: 33970318 DOI: 10.1007/s00253-021-11302-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2020] [Revised: 04/09/2021] [Accepted: 04/17/2021] [Indexed: 10/21/2022]
Abstract
The use of robotics in the life science sector has created a considerable and significant impact on a wide range of research areas, including enzyme technology due to their immense applications in enzyme and microbial engineering as an indispensable tool in high-throughput screening applications. Scientists are experiencing the advanced applications of various biological robots (nanobots), fabricated based on bottom-up or top-down approaches for making nanotechnology scaffolds. Nanobots and enzyme-powered nanomotors are particularly attractive because they are self-propelled vehicles, which consume biocompatible fuels. These smart nanostructures are widely used as drug delivery systems for the efficient treatment of various diseases. This review gives insights into the escalating necessity of robotics and nanobots and their ever-widening applications in enzyme technology, including biofuel production and biomedical applications. It also offers brief insights into high-throughput robotic platforms that are currently being used in enzyme screening applications for monitoring and control of microbial growth conditions. KEY POINTS: • Robotics and their applications in biotechnology are highlighted. • Robotics for high-throughput enzyme screening and microbial engineering are described. • Nanobots and enzyme-powered nanomotors as controllable drug delivery systems are reviewed.
Collapse
|
166
|
Gaur NK, Goyal VD, Kulkarni K, Makde RD. Machine learning classifiers aid virtual screening for efficient design of mini-protein therapeutics. Bioorg Med Chem Lett 2021; 38:127852. [PMID: 33609660 DOI: 10.1016/j.bmcl.2021.127852] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2020] [Revised: 02/01/2021] [Accepted: 02/05/2021] [Indexed: 11/15/2022]
Abstract
De novo design of mini-proteins (4-12 kDa) has recently been shown to produce new candidates for protein therapeutics. They are temperature stable molecules that bind to the drug target with high affinity for inhibiting its interactions. The development of mini-protein binders requires laboratory screening of tens of thousands of molecules for effective target binding. In this study we trained machine learning classifiers which can distinguish, with 90% accuracy and 80% precision, mini-protein binders from non-binding molecules designed for a particular target; this significantly reduces the number of mini protein candidates for experimental screening. Further, on the basis of our results we propose a multi-stage protocol where a small dataset (few hundred experimentally verified target-specific mini-proteins) can be used to train classifiers for improving the efficiency of mini-protein design for any specific target.
Collapse
Affiliation(s)
- Neeraj K Gaur
- Beamline Development and Application Section, Bhabha Atomic Research Centre, Mumbai 400085, India; Division of Biochemical Sciences, CSIR-National Chemical Laboratory, Pune 411008, India.
| | - Venuka Durani Goyal
- Beamline Development and Application Section, Bhabha Atomic Research Centre, Mumbai 400085, India
| | - Kiran Kulkarni
- Division of Biochemical Sciences, CSIR-National Chemical Laboratory, Pune 411008, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
| | - Ravindra D Makde
- Beamline Development and Application Section, Bhabha Atomic Research Centre, Mumbai 400085, India
| |
Collapse
|
167
|
Caldararu O, Blundell TL, Kepp KP. Three Simple Properties Explain Protein Stability Change upon Mutation. J Chem Inf Model 2021; 61:1981-1988. [PMID: 33848149 DOI: 10.1021/acs.jcim.1c00201] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Accurate prediction of protein stability upon mutation enables rational engineering of new proteins and insights into protein evolution and monogenetic diseases caused by single-point amino acid substitutions. Many tools have been developed to this aim, ranging from energy-based models to machine-learning methods that use large amounts of experimental data. However, as the methods become more complex, the interpretation of the chemistry underlying the protein stability effects becomes obscure. It is thus of interest to identify the simplest prediction model that retains complete amino acid specific interpretation; for a given number of input descriptors, we expect such a model to be almost universal. In this study, we identify such a limiting model, SimBa, a simple multilinear regression model trained on a substitution-type-balanced experimental data set. The model accounts only for the solvent accessibility of the site, volume difference, and polarity difference caused by mutation. Our results show that this very simple and directly applicable model performs comparably to other much more complex, widely used protein stability prediction methods. This suggests that a hard limit of ∼1 kcal/mol numerical accuracy and an R ∼ 0.5 trend accuracy exists and that new features, such as account of unfolded states, water colocalization, and amino acid correlations, are required to improve accuracy to, e.g., 1/2 kcal/mol.
Collapse
Affiliation(s)
- Octav Caldararu
- DTU Chemistry, Technical University of Denmark, Building 206, 2800 Kgs. Lyngby, Denmark
| | - Tom L Blundell
- Department of Biochemistry, University of Cambridge, Cambridge, CB2 1GA, United Kingdom
| | - Kasper P Kepp
- DTU Chemistry, Technical University of Denmark, Building 206, 2800 Kgs. Lyngby, Denmark
| |
Collapse
|
168
|
Ferguson AL, Ranganathan R. 100th Anniversary of Macromolecular Science Viewpoint: Data-Driven Protein Design. ACS Macro Lett 2021; 10:327-340. [PMID: 35549066 DOI: 10.1021/acsmacrolett.0c00885] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
The design of synthetic proteins with the desired function is a long-standing goal in biomolecular science, with broad applications in biochemical engineering, agriculture, medicine, and public health. Rational de novo design and experimental directed evolution have achieved remarkable successes but are challenged by the requirement to find functional "needles" in the vast "haystack" of protein sequence space. Data-driven models for fitness landscapes provide a predictive map between protein sequence and function and can prospectively identify functional candidates for experimental testing to greatly improve the efficiency of this search. This Viewpoint reviews the applications of machine learning and, in particular, deep learning as part of data-driven protein engineering platforms. We highlight recent successes, review promising computational methodologies, and provide an outlook on future challenges and opportunities. The article is written for a broad audience comprising both polymer and protein scientists and computer and data scientists interested in an up-to-date review of recent innovations and opportunities in this rapidly evolving field.
Collapse
Affiliation(s)
- Andrew L. Ferguson
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, Illinois 60637, United States
| | - Rama Ranganathan
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, Illinois 60637, United States
- Center for Physics of Evolving Systems, University of Chicago, Chicago, Illinois 60637, United States
- Biochemistry and Molecular Biology, University of Chicago, Chicago, Illinois 60637, United States
| |
Collapse
|
169
|
Zhu B, Wang D, Wei N. Enzyme Discovery and Engineering for Sustainable Plastic Recycling. Trends Biotechnol 2021; 40:22-37. [PMID: 33676748 DOI: 10.1016/j.tibtech.2021.02.008] [Citation(s) in RCA: 82] [Impact Index Per Article: 27.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2020] [Revised: 02/09/2021] [Accepted: 02/10/2021] [Indexed: 10/22/2022]
Abstract
The drastically increasing amount of plastic waste is causing an environmental crisis that requires innovative technologies for recycling post-consumer plastics to achieve waste valorization while meeting environmental quality goals. Biocatalytic depolymerization mediated by enzymes has emerged as an efficient and sustainable alternative for plastic treatment and recycling. A variety of plastic-degrading enzymes have been discovered from microbial sources. Meanwhile, protein engineering has been exploited to modify and optimize plastic-degrading enzymes. This review highlights the recent trends and up-to-date advances in mining novel plastic-degrading enzymes through state-of-the-art omics-based techniques and improving the enzyme catalytic efficiency and stability via various protein engineering strategies. Future research prospects and challenges are also discussed.
Collapse
Affiliation(s)
- Baotong Zhu
- Department of Civil and Environmental Engineering and Earth Sciences, University of Notre Dame, 156 Fitzpatrick Hall, Notre Dame, Indiana 46556, USA
| | - Dong Wang
- Department of Computer Science and Engineering, University of Notre Dame, 156 Fitzpatrick Hall, Notre Dame, Indiana 46556, USA
| | - Na Wei
- Department of Civil and Environmental Engineering and Earth Sciences, University of Notre Dame, 156 Fitzpatrick Hall, Notre Dame, Indiana 46556, USA.
| |
Collapse
|
170
|
Li G, Qin Y, Fontaine NT, Ng Fuk Chong M, Maria‐Solano MA, Feixas F, Cadet XF, Pandjaitan R, Garcia‐Borràs M, Cadet F, Reetz MT. Machine Learning Enables Selection of Epistatic Enzyme Mutants for Stability Against Unfolding and Detrimental Aggregation. Chembiochem 2021; 22:904-914. [PMID: 33094545 PMCID: PMC7984044 DOI: 10.1002/cbic.202000612] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2020] [Revised: 10/22/2020] [Indexed: 12/15/2022]
Abstract
Machine learning (ML) has pervaded most areas of protein engineering, including stability and stereoselectivity. Using limonene epoxide hydrolase as the model enzyme and innov'SAR as the ML platform, comprising a digital signal process, we achieved high protein robustness that can resist unfolding with concomitant detrimental aggregation. Fourier transform (FT) allows us to take into account the order of the protein sequence and the nonlinear interactions between positions, and thus to grasp epistatic phenomena. The innov'SAR approach is interpolative, extrapolative and makes outside-the-box, predictions not found in other state-of-the-art ML or deep learning approaches. Equally significant is the finding that our approach to ML in the present context, flanked by advanced molecular dynamics simulations, uncovers the connection between epistatic mutational interactions and protein robustness.
Collapse
Affiliation(s)
- Guangyue Li
- State Key Laboratory for Biology of Plant Diseases and Insect Pests Key Laboratory of Control of Biological Hazard Factors (Plant Origin) for Agri-product Quality and Safety Ministry of Agriculture, Institute of Plant ProtectionChinese Academy of Agricultural SciencesBeijing100081P. R. China
| | - Youcai Qin
- State Key Laboratory for Biology of Plant Diseases and Insect Pests Key Laboratory of Control of Biological Hazard Factors (Plant Origin) for Agri-product Quality and Safety Ministry of Agriculture, Institute of Plant ProtectionChinese Academy of Agricultural SciencesBeijing100081P. R. China
| | - Nicolas T. Fontaine
- PEACCELArtificial Intelligence Department6 Square Albin Cachot, Box 4275013ParisFrance) .
| | - Matthieu Ng Fuk Chong
- PEACCELArtificial Intelligence Department6 Square Albin Cachot, Box 4275013ParisFrance) .
| | - Miguel A. Maria‐Solano
- Institut de Química Computacional i Catàlisi and Departament de QuímicaUniversitat de Girona Campus Montilivi17003Girona, CataloniaSpain) .
| | - Ferran Feixas
- Institut de Química Computacional i Catàlisi and Departament de QuímicaUniversitat de Girona Campus Montilivi17003Girona, CataloniaSpain) .
| | - Xavier F. Cadet
- PEACCELArtificial Intelligence Department6 Square Albin Cachot, Box 4275013ParisFrance) .
| | - Rudy Pandjaitan
- PEACCELArtificial Intelligence Department6 Square Albin Cachot, Box 4275013ParisFrance) .
| | - Marc Garcia‐Borràs
- Institut de Química Computacional i Catàlisi and Departament de QuímicaUniversitat de Girona Campus Montilivi17003Girona, CataloniaSpain) .
| | - Frederic Cadet
- PEACCELArtificial Intelligence Department6 Square Albin Cachot, Box 4275013ParisFrance) .
| | - Manfred T. Reetz
- Department of ChemistryPhilipps-Universität35032MarburgGermany) .
- Max-Planck-Institut fuer Kohlenforschung45470MülheimGermany
- Tianjin Institute of Industrial BiotechnologyChinese Academy of Sciences32 West 7th Avenue, Tianjin Airport Economic Area300308TianjinP. R. China
| |
Collapse
|
171
|
Wang Z, Sundara Sekar B, Li Z. Recent advances in artificial enzyme cascades for the production of value-added chemicals. BIORESOURCE TECHNOLOGY 2021; 323:124551. [PMID: 33360113 DOI: 10.1016/j.biortech.2020.124551] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/03/2020] [Revised: 12/10/2020] [Accepted: 12/11/2020] [Indexed: 06/12/2023]
Abstract
Enzyme cascades are efficient tools to perform multi-step synthesis in one-pot in a green and sustainable manner, enabling non-natural synthesis of valuable chemicals from easily available substrates by artificially combining two or more enzymes. Bioproduction of many high-value chemicals such as chiral and highly functionalised molecules have been achieved by developing new enzyme cascades. This review summarizes recent advances on engineering and application of enzyme cascades to produce high-value chemicals (alcohols, aldehydes, ketones, amines, carboxylic acids, etc) from simple starting materials. While 2-step enzyme cascades are developed for versatile enantioselective synthesis, multi-step enzyme cascades are engineered to functionalise basic chemicals, such as styrenes, cyclic alkanes, and aromatic compounds. New cascade reactions have also been developed for producing valuable chemicals from bio-based substrates, such as ʟ-phenylalanine, and renewable feedstocks such as glucose and glycerol. The challenges in current process and future outlooks in the development of enzyme cascades are also addressed.
Collapse
Affiliation(s)
- Zilong Wang
- Department of Chemical and Biomolecular Engineering, National University of Singapore, Singapore 117585, Singapore
| | - Balaji Sundara Sekar
- Department of Chemical and Biomolecular Engineering, National University of Singapore, Singapore 117585, Singapore
| | - Zhi Li
- Department of Chemical and Biomolecular Engineering, National University of Singapore, Singapore 117585, Singapore.
| |
Collapse
|
172
|
Wittmann BJ, Johnston KE, Wu Z, Arnold FH. Advances in machine learning for directed evolution. Curr Opin Struct Biol 2021; 69:11-18. [PMID: 33647531 DOI: 10.1016/j.sbi.2021.01.008] [Citation(s) in RCA: 54] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2020] [Revised: 01/09/2021] [Accepted: 01/26/2021] [Indexed: 01/11/2023]
Abstract
Machine learning (ML) can expedite directed evolution by allowing researchers to move expensive experimental screens in silico. Gathering sequence-function data for training ML models, however, can still be costly. In contrast, raw protein sequence data is widely available. Recent advances in ML approaches use protein sequences to augment limited sequence-function data for directed evolution. We highlight contributions in a growing effort to use sequences to reduce or eliminate the amount of sequence-function data needed for effective in silico screening. We also highlight approaches that use ML models trained on sequences to generate new functional sequence diversity, focusing on strategies that use these generative models to efficiently explore vast regions of protein space.
Collapse
Affiliation(s)
- Bruce J Wittmann
- Division of Biology and Biological Engineering, California Institute of Technology, MC 210-41, 1200 E. California Boulevard, Pasadena, CA 91125, USA
| | - Kadina E Johnston
- Division of Biology and Biological Engineering, California Institute of Technology, MC 210-41, 1200 E. California Boulevard, Pasadena, CA 91125, USA
| | - Zachary Wu
- Division of Chemistry and Chemical Engineering, California Institute of Technology, MC 210-41, 1200 E. California Boulevard, Pasadena, CA 91125, USA; Present address: Google DeepMind, 6 Pancras Square, Kings Cross, London, N1C 4AG, UK
| | - Frances H Arnold
- Division of Biology and Biological Engineering, California Institute of Technology, MC 210-41, 1200 E. California Boulevard, Pasadena, CA 91125, USA; Division of Chemistry and Chemical Engineering, California Institute of Technology, MC 210-41, 1200 E. California Boulevard, Pasadena, CA 91125, USA.
| |
Collapse
|
173
|
Victorino da Silva Amatto I, Gonsales da Rosa-Garzon N, Antônio de Oliveira Simões F, Santiago F, Pereira da Silva Leite N, Raspante Martins J, Cabral H. Enzyme engineering and its industrial applications. Biotechnol Appl Biochem 2021; 69:389-409. [PMID: 33555054 DOI: 10.1002/bab.2117] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2020] [Accepted: 01/18/2021] [Indexed: 01/03/2023]
Abstract
Recently, there has been an increase in the demand for enzymes with modified activity, specificity, and stability. Enzyme engineering is an important tool to meet the demand for enzymes adjusted to different industrial processes. Knowledge of the structure and function of enzymes guides the choice of the best strategy for engineering enzymes. Each enzyme engineering strategy, such as rational design, directed evolution, and semi-rational design, has specific applications, as well as limitations, which must be considered when choosing a suitable strategy. Engineered enzymes can be optimized for different industrial applications by choosing the appropriate strategy. This review features engineered enzymes that have been applied in food, animal feed, pharmaceuticals, medical applications, bioremediation, biofuels, and detergents.
Collapse
Affiliation(s)
- Isabela Victorino da Silva Amatto
- Department of Pharmaceutical Sciences, School of Pharmaceutical Sciences of Ribeirão Preto, University of São Paulo, Ribeirão Preto, SP, Brazil.,Biosciences and Biotechnology Program, School of Pharmaceutical Sciences of Ribeirão Preto, University of São Paulo, Ribeirão Preto, SP, Brazil
| | - Nathalia Gonsales da Rosa-Garzon
- Department of Pharmaceutical Sciences, School of Pharmaceutical Sciences of Ribeirão Preto, University of São Paulo, Ribeirão Preto, SP, Brazil
| | - Flávio Antônio de Oliveira Simões
- Department of Pharmaceutical Sciences, School of Pharmaceutical Sciences of Ribeirão Preto, University of São Paulo, Ribeirão Preto, SP, Brazil.,Pharmaceutical Sciences Program, School of Pharmaceutical Sciences of Ribeirão Preto, University of São Paulo, Ribeirão Preto, SP, Brazil
| | - Fernanda Santiago
- Department of Pharmaceutical Sciences, School of Pharmaceutical Sciences of Ribeirão Preto, University of São Paulo, Ribeirão Preto, SP, Brazil.,Biosciences and Biotechnology Program, School of Pharmaceutical Sciences of Ribeirão Preto, University of São Paulo, Ribeirão Preto, SP, Brazil
| | - Nathália Pereira da Silva Leite
- Pharmaceutical Sciences Program, Faculty of Philosophy, Sciences and Letters at Ribeirão Preto, XUniversity of São Paulo, Ribeirão Preto, SP, Brazil
| | - Júlia Raspante Martins
- Department of Pharmaceutical Sciences, School of Pharmaceutical Sciences of Ribeirão Preto, University of São Paulo, Ribeirão Preto, SP, Brazil.,Biosciences and Biotechnology Program, School of Pharmaceutical Sciences of Ribeirão Preto, University of São Paulo, Ribeirão Preto, SP, Brazil
| | - Hamilton Cabral
- Department of Pharmaceutical Sciences, School of Pharmaceutical Sciences of Ribeirão Preto, University of São Paulo, Ribeirão Preto, SP, Brazil.,Biosciences and Biotechnology Program, School of Pharmaceutical Sciences of Ribeirão Preto, University of São Paulo, Ribeirão Preto, SP, Brazil.,Pharmaceutical Sciences Program, School of Pharmaceutical Sciences of Ribeirão Preto, University of São Paulo, Ribeirão Preto, SP, Brazil
| |
Collapse
|
174
|
|
175
|
Ionic liquids for regulating biocatalytic process: Achievements and perspectives. Biotechnol Adv 2021; 51:107702. [PMID: 33515671 DOI: 10.1016/j.biotechadv.2021.107702] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2020] [Revised: 12/26/2020] [Accepted: 01/15/2021] [Indexed: 12/26/2022]
Abstract
Biocatalysis has found enormous applications in sorts of fields as an alternative to chemical catalysis. In the pursue of green and sustainable chemistry, ionic liquids (ILs) have been considered as promising reaction media for biocatalysis, owing to their unique characteristics, such as nonvolatility, inflammability and tunable properties as regards polarity and water miscibility behavior, compared to organic solvents. In recent years, great developments have been achieved in respects to biocatalysis in ILs, especially for preparing various chemicals. This review tends to give illustrative examples with a focus on representative chemicals production by biocatalyst in ILs and elucidate the possible mechanism in such systems. It also discusses how to regulate the catalytic efficiency from several aspects and finally provides an outlook on the opportunities to broaden biocatalysis in ILs.
Collapse
|
176
|
Winkler C, Schrittwieser JH, Kroutil W. Power of Biocatalysis for Organic Synthesis. ACS CENTRAL SCIENCE 2021; 7:55-71. [PMID: 33532569 PMCID: PMC7844857 DOI: 10.1021/acscentsci.0c01496] [Citation(s) in RCA: 124] [Impact Index Per Article: 41.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/03/2020] [Indexed: 05/05/2023]
Abstract
Biocatalysis, using defined enzymes for organic transformations, has become a common tool in organic synthesis, which is also frequently applied in industry. The generally high activity and outstanding stereo-, regio-, and chemoselectivity observed in many biotransformations are the result of a precise control of the reaction in the active site of the biocatalyst. This control is achieved by exact positioning of the reagents relative to each other in a fine-tuned 3D environment, by specific activating interactions between reagents and the protein, and by subtle movements of the catalyst. Enzyme engineering enables one to adapt the catalyst to the desired reaction and process. A well-filled biocatalytic toolbox is ready to be used for various reactions. Providing nonnatural reagents and conditions and evolving biocatalysts enables one to play with the myriad of options for creating novel transformations and thereby opening new, short pathways to desired target molecules. Combining several biocatalysts in one pot to perform several reactions concurrently increases the efficiency of biocatalysis even further.
Collapse
Affiliation(s)
- Christoph
K. Winkler
- Institute
of Chemistry, University of Graz, NAWI Graz, Heinrichstraße
28, 8010 Graz, Austria
| | - Joerg H. Schrittwieser
- Institute
of Chemistry, University of Graz, NAWI Graz, Heinrichstraße
28, 8010 Graz, Austria
| | - Wolfgang Kroutil
- Institute
of Chemistry, University of Graz, NAWI Graz, Heinrichstraße
28, 8010 Graz, Austria
- Field
of Excellence BioHealth − University of Graz, 8010 Graz, Austria
- BioTechMed
Graz, 8010 Graz, Austria
| |
Collapse
|
177
|
Planas-Iglesias J, Marques SM, Pinto GP, Musil M, Stourac J, Damborsky J, Bednar D. Computational design of enzymes for biotechnological applications. Biotechnol Adv 2021; 47:107696. [PMID: 33513434 DOI: 10.1016/j.biotechadv.2021.107696] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Revised: 01/12/2021] [Accepted: 01/13/2021] [Indexed: 12/14/2022]
Abstract
Enzymes are the natural catalysts that execute biochemical reactions upholding life. Their natural effectiveness has been fine-tuned as a result of millions of years of natural evolution. Such catalytic effectiveness has prompted the use of biocatalysts from multiple sources on different applications, including the industrial production of goods (food and beverages, detergents, textile, and pharmaceutics), environmental protection, and biomedical applications. Natural enzymes often need to be improved by protein engineering to optimize their function in non-native environments. Recent technological advances have greatly facilitated this process by providing the experimental approaches of directed evolution or by enabling computer-assisted applications. Directed evolution mimics the natural selection process in a highly accelerated fashion at the expense of arduous laboratory work and economic resources. Theoretical methods provide predictions and represent an attractive complement to such experiments by waiving their inherent costs. Computational techniques can be used to engineer enzymatic reactivity, substrate specificity and ligand binding, access pathways and ligand transport, and global properties like protein stability, solubility, and flexibility. Theoretical approaches can also identify hotspots on the protein sequence for mutagenesis and predict suitable alternatives for selected positions with expected outcomes. This review covers the latest advances in computational methods for enzyme engineering and presents many successful case studies.
Collapse
Affiliation(s)
- Joan Planas-Iglesias
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Sérgio M Marques
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Gaspar P Pinto
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Milos Musil
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic; IT4Innovations Centre of Excellence, Faculty of Information Technology, Brno University of Technology, 61266 Brno, Czech Republic
| | - Jan Stourac
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Jiri Damborsky
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic.
| | - David Bednar
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5/A13, 625 00 Brno, Czech Republic.
| |
Collapse
|
178
|
Slagman S, Fessner WD. Biocatalytic routes to anti-viral agents and their synthetic intermediates. Chem Soc Rev 2021; 50:1968-2009. [DOI: 10.1039/d0cs00763c] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
An assessment of biocatalytic strategies for the synthesis of anti-viral agents, offering guidelines for the development of sustainable production methods for a future COVID-19 remedy.
Collapse
Affiliation(s)
- Sjoerd Slagman
- Institut für Organische Chemie und Biochemie
- Technische Universität Darmstadt
- Germany
| | - Wolf-Dieter Fessner
- Institut für Organische Chemie und Biochemie
- Technische Universität Darmstadt
- Germany
| |
Collapse
|
179
|
Benítez-Mateos AI, Contente ML, Roura Padrosa D, Paradisi F. Flow biocatalysis 101: design, development and applications. REACT CHEM ENG 2021. [DOI: 10.1039/d0re00483a] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Flow biocatalysis: where to start? This tutorial review aims to guide and inspire new-comers to the field to boost the potential of flow biocatalysis.
Collapse
Affiliation(s)
| | | | | | - Francesca Paradisi
- Department of Chemistry and Biochemistry
- University of Bern
- Bern
- Switzerland
| |
Collapse
|
180
|
Heckmann CM, Paradisi F. Looking Back: A Short History of the Discovery of Enzymes and How They Became Powerful Chemical Tools. ChemCatChem 2020; 12:6082-6102. [PMID: 33381242 PMCID: PMC7756376 DOI: 10.1002/cctc.202001107] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2020] [Revised: 09/02/2020] [Indexed: 12/20/2022]
Abstract
Enzymatic approaches to challenges in chemical synthesis are increasingly popular and very attractive to industry given their green nature and high efficiency compared to traditional methods. In this historical review we highlight the developments across several fields that were necessary to create the modern field of biocatalysis, with enzyme engineering and directed evolution at its core. We exemplify the modular, incremental, and highly unpredictable nature of scientific discovery, driven by curiosity, and showcase the resulting examples of cutting-edge enzymatic applications in industry.
Collapse
Affiliation(s)
- Christian M Heckmann
- School of Chemistry University of Nottingham University Park Nottingham NG7 2RD UK
| | - Francesca Paradisi
- School of Chemistry University of Nottingham University Park Nottingham NG7 2RD UK
- Department of Chemistry and Biochemistry University of Bern Freiestrasse 3 3012 Bern Switzerland
| |
Collapse
|
181
|
Voigt CA. Synthetic biology 2020-2030: six commercially-available products that are changing our world. Nat Commun 2020; 11:6379. [PMID: 33311504 PMCID: PMC7733420 DOI: 10.1038/s41467-020-20122-2] [Citation(s) in RCA: 90] [Impact Index Per Article: 22.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2020] [Accepted: 11/13/2020] [Indexed: 01/05/2023] Open
Abstract
Synthetic biology will transform how we grow food, what we eat, and where we source materials and medicines. Here I have selected six products that are now on the market, highlighting the underlying technologies and projecting forward to the future that can be expected over the next ten years.
Collapse
Affiliation(s)
- Christopher A Voigt
- Synthetic Biology Center, Department of Biological Engineering, Massachusetts Institute of Technology, Boston, USA.
| |
Collapse
|
182
|
Zurek PJ, Knyphausen P, Neufeld K, Pushpanath A, Hollfelder F. UMI-linked consensus sequencing enables phylogenetic analysis of directed evolution. Nat Commun 2020; 11:6023. [PMID: 33243970 PMCID: PMC7691348 DOI: 10.1038/s41467-020-19687-9] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2020] [Accepted: 10/12/2020] [Indexed: 11/09/2022] Open
Abstract
The success of protein evolution campaigns is strongly dependent on the sequence context in which mutations are introduced, stemming from pervasive non-additive interactions between a protein's amino acids ('intra-gene epistasis'). Our limited understanding of such epistasis hinders the correct prediction of the functional contributions and adaptive potential of mutations. Here we present a straightforward unique molecular identifier (UMI)-linked consensus sequencing workflow (UMIC-seq) that simplifies mapping of evolutionary trajectories based on full-length sequences. Attaching UMIs to gene variants allows accurate consensus generation for closely related genes with nanopore sequencing. We exemplify the utility of this approach by reconstructing the artificial phylogeny emerging in three rounds of directed evolution of an amine dehydrogenase biocatalyst via ultrahigh throughput droplet screening. Uniquely, we are able to identify lineages and their founding variant, as well as non-additive interactions between mutations within a full gene showing sign epistasis. Access to deep and accurate long reads will facilitate prediction of key beneficial mutations and adaptive potential based on in silico analysis of large sequence datasets.
Collapse
Affiliation(s)
- Paul Jannis Zurek
- Department of Biochemistry, University of Cambridge, Cambridge, CB2 1GA, UK
- Johnson Matthey Plc, Cambridge, CB4 0WE, UK
| | - Philipp Knyphausen
- Department of Biochemistry, University of Cambridge, Cambridge, CB2 1GA, UK
| | - Katharina Neufeld
- Department of Biochemistry, University of Cambridge, Cambridge, CB2 1GA, UK
- Johnson Matthey Plc, Cambridge, CB4 0WE, UK
| | | | - Florian Hollfelder
- Department of Biochemistry, University of Cambridge, Cambridge, CB2 1GA, UK.
| |
Collapse
|
183
|
Song H, Bremer BJ, Hinds EC, Raskutti G, Romero PA. Inferring Protein Sequence-Function Relationships with Large-Scale Positive-Unlabeled Learning. Cell Syst 2020; 12:92-101.e8. [PMID: 33212013 DOI: 10.1016/j.cels.2020.10.007] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2019] [Revised: 08/13/2020] [Accepted: 10/22/2020] [Indexed: 10/22/2022]
Abstract
Machine learning can infer how protein sequence maps to function without requiring a detailed understanding of the underlying physical or biological mechanisms. It is challenging to apply existing supervised learning frameworks to large-scale experimental data generated by deep mutational scanning (DMS) and related methods. DMS data often contain high-dimensional and correlated sequence variables, experimental sampling error and bias, and the presence of missing data. Notably, most DMS data do not contain examples of negative sequences, making it challenging to directly estimate how sequence affects function. Here, we develop a positive-unlabeled (PU) learning framework to infer sequence-function relationships from large-scale DMS data. Our PU learning method displays excellent predictive performance across ten large-scale sequence-function datasets, representing proteins of different folds, functions, and library types. The estimated parameters pinpoint key residues that dictate protein structure and function. Finally, we apply our statistical sequence-function model to design highly stabilized enzymes.
Collapse
Affiliation(s)
- Hyebin Song
- Department of Statistics, The Pennsylvania State University, State College, PA 16802, USA; Department of Statistics, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Bennett J Bremer
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Emily C Hinds
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Garvesh Raskutti
- Department of Statistics, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Philip A Romero
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI 53706, USA; Department of Chemical and Biological Engineering, University of Wisconsin-Madison, Madison, WI 53706, USA.
| |
Collapse
|
184
|
Foroozandeh Shahraki M, Ariaeenejad S, Fallah Atanaki F, Zolfaghari B, Koshiba T, Kavousi K, Salekdeh GH. MCIC: Automated Identification of Cellulases From Metagenomic Data and Characterization Based on Temperature and pH Dependence. Front Microbiol 2020; 11:567863. [PMID: 33193158 PMCID: PMC7645119 DOI: 10.3389/fmicb.2020.567863] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2020] [Accepted: 09/30/2020] [Indexed: 01/03/2023] Open
Abstract
As the availability of high-throughput metagenomic data is increasing, agile and accurate tools are required to analyze and exploit this valuable and plentiful resource. Cellulose-degrading enzymes have various applications, and finding appropriate cellulases for different purposes is becoming increasingly challenging. An in silico screening method for high-throughput data can be of great assistance when combined with the characterization of thermal and pH dependence. By this means, various metagenomic sources with high cellulolytic potentials can be explored. Using a sequence similarity-based annotation and an ensemble of supervised learning algorithms, this study aims to identify and characterize cellulolytic enzymes from a given high-throughput metagenomic data based on optimum temperature and pH. The prediction performance of MCIC (metagenome cellulase identification and characterization) was evaluated through multiple iterations of sixfold cross-validation tests. This tool was also implemented for a comparative analysis of four metagenomic sources to estimate their cellulolytic profile and capabilities. For experimental validation of MCIC’s screening and prediction abilities, two identified enzymes from cattle rumen were subjected to cloning, expression, and characterization. To the best of our knowledge, this is the first time that a sequence-similarity based method is used alongside an ensemble machine learning model to identify and characterize cellulase enzymes from extensive metagenomic data. This study highlights the strength of machine learning techniques to predict enzymatic properties solely based on their sequence. MCIC is freely available as a python package and standalone toolkit for Windows and Linux-based operating systems with several functions to facilitate the screening and thermal and pH dependence prediction of cellulases.
Collapse
Affiliation(s)
- Mehdi Foroozandeh Shahraki
- Laboratory of Complex Biological Systems and Bioinformatics, Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran
| | - Shohreh Ariaeenejad
- Department of Systems and Synthetic Biology, Agricultural Biotechnology Research Institute of Iran, Agricultural Research Education and Extension Organization, Karaj, Iran
| | - Fereshteh Fallah Atanaki
- Laboratory of Complex Biological Systems and Bioinformatics, Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran
| | - Behrouz Zolfaghari
- Computer Science and Engineering Department, Indian Institute of Technology Guwahati, Guwahati, India
| | - Takeshi Koshiba
- Department of Mathematics, Faculty of Education and Integrated Arts and Sciences, Waseda University, Tokyo, Japan
| | - Kaveh Kavousi
- Laboratory of Complex Biological Systems and Bioinformatics, Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran
| | - Ghasem Hosseini Salekdeh
- Department of Systems and Synthetic Biology, Agricultural Biotechnology Research Institute of Iran, Agricultural Research Education and Extension Organization, Karaj, Iran.,Department of Molecular Sciences, Macquarie University, Sydney, NSW, Australia
| |
Collapse
|
185
|
Carbonell P, Le Feuvre R, Takano E, Scrutton NS. In silico design and automated learning to boost next-generation smart biomanufacturing. Synth Biol (Oxf) 2020; 5:ysaa020. [PMID: 33344778 PMCID: PMC7737007 DOI: 10.1093/synbio/ysaa020] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2020] [Revised: 09/08/2020] [Accepted: 09/28/2020] [Indexed: 02/07/2023] Open
Abstract
The increasing demand for bio-based compounds produced from waste or sustainable sources is driving biofoundries to deliver a new generation of prototyping biomanufacturing platforms. Integration and automation of the design, build, test and learn (DBTL) steps in centers like SYNBIOCHEM in Manchester and across the globe (Global Biofoundries Alliance) are helping to reduce the delivery time from initial strain screening and prototyping towards industrial production. Notably, a portfolio of producer strains for a suite of material monomers was recently developed, some approaching industrial titers, in a tour de force by the Manchester Centre that was achieved in less than 90 days. New in silico design tools are providing significant contributions to the front end of the DBTL pipelines. At the same time, the far-reaching initiatives of modern biofoundries are generating a large amount of high-dimensional data and knowledge that can be integrated through automated learning to expedite the DBTL cycle. In this Perspective, the new design tools and the role of the learning component as an enabling technology for the next generation of automated biofoundries are discussed. Future biofoundries will operate under completely automated DBTL cycles driven by in silico optimal experimental planning, full biomanufacturing devices connectivity, virtualization platforms and cloud-based design. The automated generation of robotic build worklists and the integration of machine-learning algorithms will collectively allow high levels of adaptability and rapid design changes toward fully automated smart biomanufacturing.
Collapse
Affiliation(s)
- Pablo Carbonell
- Manchester Synthetic Biology Research Centre for Fine and Speciality Chemicals (SYNBIOCHEM) and Future Biomanufacturing Research Hub, Manchester Institute of Biotechnology, The University of Manchester, Manchester M1 7DN, UK.,Instituto Universitario de Automática e Informática Industrial, Universitat Politècnica de València, 46022 Valencia, Spain
| | - Rosalind Le Feuvre
- Manchester Synthetic Biology Research Centre for Fine and Speciality Chemicals (SYNBIOCHEM) and Future Biomanufacturing Research Hub, Manchester Institute of Biotechnology, The University of Manchester, Manchester M1 7DN, UK
| | - Eriko Takano
- Manchester Synthetic Biology Research Centre for Fine and Speciality Chemicals (SYNBIOCHEM) and Future Biomanufacturing Research Hub, Manchester Institute of Biotechnology, The University of Manchester, Manchester M1 7DN, UK
| | - Nigel S Scrutton
- Manchester Synthetic Biology Research Centre for Fine and Speciality Chemicals (SYNBIOCHEM) and Future Biomanufacturing Research Hub, Manchester Institute of Biotechnology, The University of Manchester, Manchester M1 7DN, UK
| |
Collapse
|
186
|
Markova K, Chmelova K, Marques SM, Carpentier P, Bednar D, Damborsky J, Marek M. Decoding the intricate network of molecular interactions of a hyperstable engineered biocatalyst. Chem Sci 2020; 11:11162-11178. [PMID: 34094357 PMCID: PMC8162949 DOI: 10.1039/d0sc03367g] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2020] [Accepted: 09/10/2020] [Indexed: 12/01/2022] Open
Abstract
Computational design of protein catalysts with enhanced stabilities for use in research and enzyme technologies is a challenging task. Using force-field calculations and phylogenetic analysis, we previously designed the haloalkane dehalogenase DhaA115 which contains 11 mutations that confer upon it outstanding thermostability (T m = 73.5 °C; ΔT m > 23 °C). An understanding of the structural basis of this hyperstabilization is required in order to develop computer algorithms and predictive tools. Here, we report X-ray structures of DhaA115 at 1.55 Å and 1.6 Å resolutions and their molecular dynamics trajectories, which unravel the intricate network of interactions that reinforce the αβα-sandwich architecture. Unexpectedly, mutations toward bulky aromatic amino acids at the protein surface triggered long-distance (∼27 Å) backbone changes due to cooperative effects. These cooperative interactions produced an unprecedented double-lock system that: (i) induced backbone changes, (ii) closed the molecular gates to the active site, (iii) reduced the volumes of the main and slot access tunnels, and (iv) occluded the active site. Despite these spatial restrictions, experimental tracing of the access tunnels using krypton derivative crystals demonstrates that transport of ligands is still effective. Our findings highlight key thermostabilization effects and provide a structural basis for designing new thermostable protein catalysts.
Collapse
Affiliation(s)
- Klara Markova
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University Kamenice 5 625 00 Brno Czech Republic
- International Clinical Research Center, St. Anne's University Hospital Brno Pekarska 53 656 91 Brno Czech Republic
| | - Klaudia Chmelova
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University Kamenice 5 625 00 Brno Czech Republic
- International Clinical Research Center, St. Anne's University Hospital Brno Pekarska 53 656 91 Brno Czech Republic
| | - Sérgio M Marques
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University Kamenice 5 625 00 Brno Czech Republic
- International Clinical Research Center, St. Anne's University Hospital Brno Pekarska 53 656 91 Brno Czech Republic
| | - Philippe Carpentier
- Université Grenoble Alpes, CNRS, CEA, Interdisciplinary Research Institute of Grenoble (IRIG), Laboratoire Chimie et Biologie des Métaux (LCBM) 17 Avenue des Martyrs 38054 Grenoble France
- European Synchrotron Radiation Facility (ESRF) 71 Avenue des Martyrs 38043 Grenoble France
| | - David Bednar
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University Kamenice 5 625 00 Brno Czech Republic
- International Clinical Research Center, St. Anne's University Hospital Brno Pekarska 53 656 91 Brno Czech Republic
| | - Jiri Damborsky
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University Kamenice 5 625 00 Brno Czech Republic
- International Clinical Research Center, St. Anne's University Hospital Brno Pekarska 53 656 91 Brno Czech Republic
| | - Martin Marek
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University Kamenice 5 625 00 Brno Czech Republic
- International Clinical Research Center, St. Anne's University Hospital Brno Pekarska 53 656 91 Brno Czech Republic
| |
Collapse
|
187
|
Zhou K, Ng W, Cortés-Peña Y, Wang X. Increasing metabolic pathway flux by using machine learning models. Curr Opin Biotechnol 2020; 66:179-185. [PMID: 32896771 DOI: 10.1016/j.copbio.2020.08.004] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2020] [Revised: 08/03/2020] [Accepted: 08/11/2020] [Indexed: 01/19/2023]
Abstract
Machine learning is transforming many industries through self-improving models that are fueled by big data and high computing power. The field of metabolic engineering, which uses cellular biochemical network to manufacture useful small molecules, has also witnessed the first wave of machine learning applications in the past five years, covering reaction route design, enzyme selection, pathway engineering and process optimization. This review focuses on pathway engineering, and uses a few recent studies to illustrate (1) how machine learning models can be useful in overcoming an evident rate-limiting step, and (2) how the models may be used to exhaustively search - or guide optimization algorithms to search - a large design space when the cellular regulation of the reaction network is more convoluted.
Collapse
Affiliation(s)
- Kang Zhou
- Department of Chemical and Biomolecular Engineering, National University of Singapore, 117585, Singapore.
| | - Wenfa Ng
- Department of Chemical and Biomolecular Engineering, National University of Singapore, 117585, Singapore
| | - Yoel Cortés-Peña
- Department of Chemical and Biomolecular Engineering, National University of Singapore, 117585, Singapore; Department of Civil and Environmental Engineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA; DOE Center for Advanced Bioenergy and Bioproducts Innovation (CABBI), University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Xiaonan Wang
- Department of Chemical and Biomolecular Engineering, National University of Singapore, 117585, Singapore
| |
Collapse
|
188
|
Mazurenko S. Predicting protein stability and solubility changes upon mutations: data perspective. ChemCatChem 2020. [DOI: 10.1002/cctc.202000933] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Affiliation(s)
- Stanislav Mazurenko
- Loschmidt Laboratories Department of Experimental Biology and RECETOX Faculty of Science Masaryk University Zerotinovo nam. 617/9 601 77 Brno Czech Republic
| |
Collapse
|
189
|
Osuna S. The challenge of predicting distal active site mutations in computational enzyme design. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2020. [DOI: 10.1002/wcms.1502] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Affiliation(s)
- Sílvia Osuna
- CompBioLab group, Institut de Química Computacional i Catàlisi (IQCC) and Departament de Química Universitat de Girona Girona Spain
- ICREA Barcelona Spain
| |
Collapse
|
190
|
Neun S, Zurek PJ, Kaminski TS, Hollfelder F. Ultrahigh throughput screening for enzyme function in droplets. Methods Enzymol 2020; 643:317-343. [PMID: 32896286 DOI: 10.1016/bs.mie.2020.06.002] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Water-in-oil droplets, made and handled in microfluidic devices, provide a new experimental format, in which ultrahigh throughput experiments can be conducted faster and with minimal reagent consumption. An increasing number of studies have emerged that applied this approach to directed evolution and metagenomic screening of enzyme catalysts. Here, we review the considerations necessary to implement robust workflows, based on choices of device design, detection modes, emulsion formulations and substrates, and scope out which enzyme classes have become amenable to droplet screening.
Collapse
Affiliation(s)
- Stefanie Neun
- Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
| | - Paul J Zurek
- Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
| | - Tomasz S Kaminski
- Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
| | - Florian Hollfelder
- Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom.
| |
Collapse
|
191
|
Crean RM, Gardner JM, Kamerlin SCL. Harnessing Conformational Plasticity to Generate Designer Enzymes. J Am Chem Soc 2020; 142:11324-11342. [PMID: 32496764 PMCID: PMC7467679 DOI: 10.1021/jacs.0c04924] [Citation(s) in RCA: 57] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2020] [Indexed: 02/08/2023]
Abstract
Recent years have witnessed an explosion of interest in understanding the role of conformational dynamics both in the evolution of new enzymatic activities from existing enzymes and in facilitating the emergence of enzymatic activity de novo on scaffolds that were previously non-catalytic. There are also an increasing number of examples in the literature of targeted engineering of conformational dynamics being successfully used to alter enzyme selectivity and activity. Despite the obvious importance of conformational dynamics to both enzyme function and evolvability, many (although not all) computational design approaches still focus either on pure sequence-based approaches or on using structures with limited flexibility to guide the design. However, there exist a wide variety of computational approaches that can be (re)purposed to introduce conformational dynamics as a key consideration in the design process. Coupled with laboratory evolution and more conventional existing sequence- and structure-based approaches, these techniques provide powerful tools for greatly expanding the protein engineering toolkit. This Perspective provides an overview of evolutionary studies that have dissected the role of conformational dynamics in facilitating the emergence of novel enzymes, as well as advances in computational approaches that allow one to target conformational dynamics as part of enzyme design. Harnessing conformational dynamics in engineering studies is a powerful paradigm with which to engineer the next generation of designer biocatalysts.
Collapse
Affiliation(s)
- Rory M. Crean
- Department of Chemistry -
BMC, Uppsala University, Box 576, 751 23 Uppsala, Sweden
| | - Jasmine M. Gardner
- Department of Chemistry -
BMC, Uppsala University, Box 576, 751 23 Uppsala, Sweden
| | - Shina C. L. Kamerlin
- Department of Chemistry -
BMC, Uppsala University, Box 576, 751 23 Uppsala, Sweden
| |
Collapse
|
192
|
Risso VA, Romero-Rivera A, Gutierrez-Rus LI, Ortega-Muñoz M, Santoyo-Gonzalez F, Gavira JA, Sanchez-Ruiz JM, Kamerlin SCL. Enhancing a de novo enzyme activity by computationally-focused ultra-low-throughput screening. Chem Sci 2020; 11:6134-6148. [PMID: 32832059 PMCID: PMC7407621 DOI: 10.1039/d0sc01935f] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2020] [Accepted: 05/18/2020] [Indexed: 01/02/2023] Open
Abstract
Directed evolution has revolutionized protein engineering. Still, enzyme optimization by random library screening remains sluggish, in large part due to futile probing of mutations that are catalytically neutral and/or impair stability and folding. FuncLib is a novel approach which uses phylogenetic analysis and Rosetta design to rank enzyme variants with multiple mutations, on the basis of predicted stability. Here, we use it to target the active site region of a minimalist-designed, de novo Kemp eliminase. The similarity between the Michaelis complex and transition state for the enzymatic reaction makes this system particularly challenging to optimize. Yet, experimental screening of a small number of active-site variants at the top of the predicted stability ranking leads to catalytic efficiencies and turnover numbers (∼2 × 104 M-1 s-1 and ∼102 s-1) for this anthropogenic reaction that compare favorably to those of modern natural enzymes. This result illustrates the promise of FuncLib as a powerful tool with which to speed up directed evolution, even on scaffolds that were not originally evolved for those functions, by guiding screening to regions of the sequence space that encode stable and catalytically diverse enzymes. Empirical valence bond calculations reproduce the experimental activation energies for the optimized eliminases to within ∼2 kcal mol-1 and indicate that the enhanced activity is linked to better geometric preorganization of the active site. This raises the possibility of further enhancing the stability-guidance of FuncLib by computational predictions of catalytic activity, as a generalized approach for computational enzyme design.
Collapse
Affiliation(s)
- Valeria A Risso
- Departamento de Química Física, Facultad de Ciencias , Unidad de Excelencia de Química aplicada a Biomedicina y Medioambiente (UEQ) , Universidad de Granada , 18071 Granada , Spain .
| | - Adrian Romero-Rivera
- Science for Life Laboratory , Department of Chemistry-BMC , Uppsala University , BMC Box 576 , S-751 23 Uppsala , Sweden .
| | - Luis I Gutierrez-Rus
- Departamento de Química Física, Facultad de Ciencias , Unidad de Excelencia de Química aplicada a Biomedicina y Medioambiente (UEQ) , Universidad de Granada , 18071 Granada , Spain .
| | - Mariano Ortega-Muñoz
- Departamento de Química Orgánica , Facultad de Ciencias , Unidad de Excelencia de Química aplicada a Biomedicina y Medioambiente (UEQ) , Universidad de Granada , 18071 Granada , Spain
| | - Francisco Santoyo-Gonzalez
- Departamento de Química Orgánica , Facultad de Ciencias , Unidad de Excelencia de Química aplicada a Biomedicina y Medioambiente (UEQ) , Universidad de Granada , 18071 Granada , Spain
| | - Jose A Gavira
- Laboratorio de Estudios Cristalográficos , Instituto Andaluz de Ciencias de la Tierra , CSIC, Unidad de Excelencia de Química aplicada a Biomedicina y Medioambiente (UEQ) , University of Granada , Avenida de las Palmeras 4 , 18100 Armilla , Granada , Spain
| | - Jose M Sanchez-Ruiz
- Departamento de Química Física, Facultad de Ciencias , Unidad de Excelencia de Química aplicada a Biomedicina y Medioambiente (UEQ) , Universidad de Granada , 18071 Granada , Spain .
| | - Shina C L Kamerlin
- Science for Life Laboratory , Department of Chemistry-BMC , Uppsala University , BMC Box 576 , S-751 23 Uppsala , Sweden .
| |
Collapse
|
193
|
Weinstein J, Khersonsky O, Fleishman SJ. Practically useful protein-design methods combining phylogenetic and atomistic calculations. Curr Opin Struct Biol 2020; 63:58-64. [PMID: 32505941 DOI: 10.1016/j.sbi.2020.04.003] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2020] [Accepted: 04/06/2020] [Indexed: 12/11/2022]
Abstract
Our ability to design new or improved biomolecular activities depends on understanding the sequence-function relationships in proteins. The large size and fold complexity of most proteins, however, obscure these relationships, and protein-optimization methods continue to rely on laborious experimental iterations. Recently, a deeper understanding of the roles of stability-threshold effects and biomolecular epistasis in proteins has led to the development of hybrid methods that combine phylogenetic analysis with atomistic design calculations. These methods enable reliable and even single-step optimization of protein stability, expressibility, and activity in proteins that were considered outside the scope of computational design. Furthermore, ancestral-sequence reconstruction produces insights on missing links in the evolution of enzymes and binders that may be used in protein design. Through the combination of phylogenetic and atomistic calculations, the long-standing goal of general computational methods that can be universally applied to study and optimize proteins finally seems within reach.
Collapse
Affiliation(s)
- Jonathan Weinstein
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Olga Khersonsky
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 7610001, Israel.
| | - Sarel J Fleishman
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 7610001, Israel.
| |
Collapse
|
194
|
Affiliation(s)
- Bernhard Hauer
- Institute of Biochemistry and Technical Biochemistry, Department of Technical Biochemistry, Universitaet Stuttgart, Allmandring 31, 70569 Stuttgart, Germany
| |
Collapse
|
195
|
Li RJ, Zhang Z, Acevedo-Rocha CG, Zhao J, Li A. Biosynthesis of organic molecules via artificial cascade reactions based on cytochrome P450 monooxygenases. GREEN SYNTHESIS AND CATALYSIS 2020. [DOI: 10.1016/j.gresc.2020.05.002] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
|
196
|
One Pot Use of Combilipases for Full Modification of Oils and Fats: Multifunctional and Heterogeneous Substrates. Catalysts 2020. [DOI: 10.3390/catal10060605] [Citation(s) in RCA: 42] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Lipases are among the most utilized enzymes in biocatalysis. In many instances, the main reason for their use is their high specificity or selectivity. However, when full modification of a multifunctional and heterogeneous substrate is pursued, enzyme selectivity and specificity become a problem. This is the case of hydrolysis of oils and fats to produce free fatty acids or their alcoholysis to produce biodiesel, which can be considered cascade reactions. In these cases, to the original heterogeneity of the substrate, the presence of intermediate products, such as diglycerides or monoglycerides, can be an additional drawback. Using these heterogeneous substrates, enzyme specificity can promote that some substrates (initial substrates or intermediate products) may not be recognized as such (in the worst case scenario they may be acting as inhibitors) by the enzyme, causing yields and reaction rates to drop. To solve this situation, a mixture of lipases with different specificity, selectivity and differently affected by the reaction conditions can offer much better results than the use of a single lipase exhibiting a very high initial activity or even the best global reaction course. This mixture of lipases from different sources has been called “combilipases” and is becoming increasingly popular. They include the use of liquid lipase formulations or immobilized lipases. In some instances, the lipases have been coimmobilized. Some discussion is offered regarding the problems that this coimmobilization may give rise to, and some strategies to solve some of these problems are proposed. The use of combilipases in the future may be extended to other processes and enzymes.
Collapse
|
197
|
Qu G, Li A, Acevedo‐Rocha CG, Sun Z, Reetz MT. Die zentrale Rolle der Methodenentwicklung in der gerichteten Evolution selektiver Enzyme. Angew Chem Int Ed Engl 2020. [DOI: 10.1002/ange.201901491] [Citation(s) in RCA: 35] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Affiliation(s)
- Ge Qu
- Tianjin Institute of Industrial Biotechnology Chinese Academy of Sciences 32 West 7th Avenue, Tianjin Airport Economic Area Tianjin 300308 China
| | - Aitao Li
- State Key Laboratory of Biocatalysis and Enzyme Engineering Hubei Collaborative Innovation Center for Green Transformation of Bio-resources Hubei Key Laboratory of Industrial Biotechnology College of Life Sciences Hubei University 368 Youyi Road Wuchang Wuhan 430062 China
| | | | - Zhoutong Sun
- Tianjin Institute of Industrial Biotechnology Chinese Academy of Sciences 32 West 7th Avenue, Tianjin Airport Economic Area Tianjin 300308 China
| | - Manfred T. Reetz
- Tianjin Institute of Industrial Biotechnology Chinese Academy of Sciences 32 West 7th Avenue, Tianjin Airport Economic Area Tianjin 300308 China
- Max-Planck-Institut für Kohlenforschung Kaiser-Wilhelm-Platz 1 45470 Mülheim Deutschland
- Department of Chemistry, Hans-Meerwein-Straße 4 Philipps-Universität 35032 Marburg Deutschland
| |
Collapse
|
198
|
Qu G, Li A, Acevedo‐Rocha CG, Sun Z, Reetz MT. The Crucial Role of Methodology Development in Directed Evolution of Selective Enzymes. Angew Chem Int Ed Engl 2020; 59:13204-13231. [PMID: 31267627 DOI: 10.1002/anie.201901491] [Citation(s) in RCA: 233] [Impact Index Per Article: 58.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2019] [Indexed: 12/14/2022]
Affiliation(s)
- Ge Qu
- Tianjin Institute of Industrial Biotechnology Chinese Academy of Sciences 32 West 7th Avenue, Tianjin Airport Economic Area Tianjin 300308 China
| | - Aitao Li
- State Key Laboratory of Biocatalysis and Enzyme Engineering Hubei Collaborative Innovation Center for Green Transformation of Bio-resources Hubei Key Laboratory of Industrial Biotechnology College of Life Sciences Hubei University 368 Youyi Road Wuchang Wuhan 430062 China
| | | | - Zhoutong Sun
- Tianjin Institute of Industrial Biotechnology Chinese Academy of Sciences 32 West 7th Avenue, Tianjin Airport Economic Area Tianjin 300308 China
| | - Manfred T. Reetz
- Tianjin Institute of Industrial Biotechnology Chinese Academy of Sciences 32 West 7th Avenue, Tianjin Airport Economic Area Tianjin 300308 China
- Max-Planck-Institut für Kohlenforschung Kaiser-Wilhelm-Platz 1 45470 Mülheim Germany
- Department of Chemistry, Hans-Meerwein-Strasse 4 Philipps-University 35032 Marburg Germany
| |
Collapse
|
199
|
Siedhoff NE, Schwaneberg U, Davari MD. Machine learning-assisted enzyme engineering. Methods Enzymol 2020; 643:281-315. [DOI: 10.1016/bs.mie.2020.05.005] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
|