Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Liao J, Warmuth MK, Govindarajan S, Ness JE, Wang RP, Gustafsson C, Minshull J. Engineering proteinase K using machine learning and synthetic genes. BMC Biotechnol 2007;7:16. [PMID: 17386103 PMCID: PMC1847811 DOI: 10.1186/1472-6750-7-16] [Citation(s) in RCA: 75] [Impact Index Per Article: 4.4] [Received: 09/06/2006] [Accepted: 03/26/2007] [Indexed: 11/10/2022] Open

Cited by Other Article(s)

Duan R, Wang S, Li Z, Zhang W, Wu J, Jiang Y, Lin Q, Yuan P, Yue X, Yao Y, Xiao X, Xiao Y, Wang Z. Computer-assisted semi-rational design enhanced the enzymatic activity and protein stability of Proteinase K in calcium-free conditions. Biochem Biophys Res Commun 2024;721:150109. [PMID: 38762932 DOI: 10.1016/j.bbrc.2024.150109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/04/2024] [Accepted: 05/12/2024] [Indexed: 05/21/2024]

Shanker VR, Bruun TUJ, Hie BL, Kim PS. Unsupervised evolution of protein and antibody complexes with a structure-informed language model. Science 2024;385:46-53. [PMID: 38963838 DOI: 10.1126/science.adk8946] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/18/2023] [Accepted: 05/29/2024] [Indexed: 07/06/2024]

Zhu X, Zhao YF, Wen HJ, Lu Y, You S, Herman RA, Wang J. Silkworm pupae protein co-degradation by magnetic nanoparticles immobilized proteinase K and Mucor circinelloides aspartic protease for further utilization of sericulture by-products. Environmental Research 2024;249:118385. [PMID: 38331140 DOI: 10.1016/j.envres.2024.118385] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/09/2023] [Revised: 01/18/2024] [Accepted: 01/30/2024] [Indexed: 02/10/2024]

Affiliation(s)

Xuan Zhu: Jiangsu Key Laboratory of Sericultural Biology and Biotechnology, School of Biotechnology, Jiangsu University of Science and Technology, Zhenjiang, 212100, China
Yi-Fan Zhao: Jiangsu Key Laboratory of Sericultural Biology and Biotechnology, School of Biotechnology, Jiangsu University of Science and Technology, Zhenjiang, 212100, China
Hong-Jian Wen: Jiangsu Key Laboratory of Sericultural Biology and Biotechnology, School of Biotechnology, Jiangsu University of Science and Technology, Zhenjiang, 212100, China
Yu Lu: Jiangsu Key Laboratory of Sericultural Biology and Biotechnology, School of Biotechnology, Jiangsu University of Science and Technology, Zhenjiang, 212100, China
Shuai You: Jiangsu Key Laboratory of Sericultural Biology and Biotechnology, School of Biotechnology, Jiangsu University of Science and Technology, Zhenjiang, 212100, China; Key Laboratory of Silkworm and Mulberry Genetic Improvement, Ministry of Agriculture and Rural Affairs, The Sericultural Research Institute, Chinese Academy of Agricultural Sciences, Zhenjiang, 212100, China
Richard Ansah Herman: Jiangsu Key Laboratory of Sericultural Biology and Biotechnology, School of Biotechnology, Jiangsu University of Science and Technology, Zhenjiang, 212100, China; Key Laboratory of Silkworm and Mulberry Genetic Improvement, Ministry of Agriculture and Rural Affairs, The Sericultural Research Institute, Chinese Academy of Agricultural Sciences, Zhenjiang, 212100, China
Jun Wang: Jiangsu Key Laboratory of Sericultural Biology and Biotechnology, School of Biotechnology, Jiangsu University of Science and Technology, Zhenjiang, 212100, China; Key Laboratory of Silkworm and Mulberry Genetic Improvement, Ministry of Agriculture and Rural Affairs, The Sericultural Research Institute, Chinese Academy of Agricultural Sciences, Zhenjiang, 212100, China

Gelman S, Johnson B, Freschlin C, D'Costa S, Gitter A, Romero PA. Biophysics-based protein language models for protein engineering. bioRxiv: the preprint server for biology 2024:2024.03.15.585128. [PMID: 38559182 PMCID: PMC10980077 DOI: 10.1101/2024.03.15.585128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 04/04/2024]

Wang X, Li A, Li X, Cui H. Empowering Protein Engineering through Recombination of Beneficial Substitutions. Chemistry 2024;30:e202303889. [PMID: 38288640 DOI: 10.1002/chem.202303889] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/04/2024] [Indexed: 02/24/2024]

Nam K, Shao Y, Major DT, Wolf-Watz M. Perspectives on Computational Enzyme Modeling: From Mechanisms to Design and Drug Development. ACS Omega 2024;9:7393-7412. [PMID: 38405524 PMCID: PMC10883025 DOI: 10.1021/acsomega.3c09084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/14/2023] [Revised: 01/15/2024] [Accepted: 01/19/2024] [Indexed: 02/27/2024]

Ismail A, Govindarajan S, Mannervik B. Human GST P1-1 Redesigned for Enhanced Catalytic Activity with the Anticancer Prodrug Telcyta and Improved Thermostability. Cancers (Basel) 2024;16:762. [PMID: 38398153 PMCID: PMC10887215 DOI: 10.3390/cancers16040762] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/14/2023] [Revised: 02/09/2024] [Accepted: 02/10/2024] [Indexed: 02/25/2024] Open

Shanker VR, Bruun TU, Hie BL, Kim PS. Inverse folding of protein complexes with a structure-informed language model enables unsupervised antibody evolution. bioRxiv: the preprint server for biology 2023:2023.12.19.572475. [PMID: 38187780 PMCID: PMC10769282 DOI: 10.1101/2023.12.19.572475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/09/2024]

Wang J, Chen C, Yao G, Ding J, Wang L, Jiang H. Intelligent Protein Design and Molecular Characterization Techniques: A Comprehensive Review. Molecules 2023;28:7865. [PMID: 38067593 PMCID: PMC10707872 DOI: 10.3390/molecules28237865] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/21/2023] [Revised: 11/13/2023] [Accepted: 11/23/2023] [Indexed: 12/18/2023] Open

Braun M, Gruber CC, Krassnigg A, Kummer A, Lutz S, Oberdorfer G, Siirola E, Snajdrova R. Accelerating Biocatalysis Discovery with Machine Learning: A Paradigm Shift in Enzyme Engineering, Discovery, and Design. ACS Catal 2023;13:14454-14469. [PMID: 37942268 PMCID: PMC10629211 DOI: 10.1021/acscatal.3c03417] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Received: 07/25/2023] [Revised: 09/29/2023] [Accepted: 10/03/2023] [Indexed: 11/10/2023]

Sun Y, Huang X, Osawa Y, Chen YE, Zhang H. The Versatile Biocatalyst of Cytochrome P450 CYP102A1: Structure, Function, and Engineering. Molecules 2023;28:5353. [PMID: 37513226 PMCID: PMC10383305 DOI: 10.3390/molecules28145353] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 06/20/2023] [Revised: 07/07/2023] [Accepted: 07/10/2023] [Indexed: 07/30/2023] Open

Ramírez-Palacios C, Marrink SJ. Super High-Throughput Screening of Enzyme Variants by Spectral Graph Convolutional Neural Networks. J Chem Theory Comput 2023. [PMID: 36961994 PMCID: PMC10373491 DOI: 10.1021/acs.jctc.2c01227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/26/2023]

Xu C, Battig A, Schartel B, Siegel R, Senker J, von der Forst I, Unverzagt C, Agarwal S, Möglich A, Greiner A. Investigation of the Thermal Stability of Proteinase K for the Melt Processing of Poly(l-lactide). Biomacromolecules 2022;23:4841-4850. [PMID: 36327974 PMCID: PMC9667878 DOI: 10.1021/acs.biomac.2c01008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/15/2022] [Revised: 09/14/2022] [Indexed: 11/06/2022]

Abstract

The enzymatic degradation of aliphatic polyesters offers unique opportunities for various use cases in materials science. Although evidently desirable, the implementation of enzymes in technical applications of polyesters is generally challenging due to the thermal lability of enzymes. To prospectively overcome this intrinsic limitation, we here explored the thermal stability of proteinase K at conditions applicable for polymer melt processing, given that this hydrolytic enzyme is well established for its ability to degrade poly(l-lactide) (PLLA). Using assorted spectroscopic methods and enzymatic assays, we investigated the effects of high temperatures on the structure and specific activity of proteinase K. Whereas in solution, irreversible unfolding occurred at temperatures above 75-80 °C, in the dry, bulk state, proteinase K withstood prolonged incubation at elevated temperatures. Unexpectedly little activity loss occurred during incubation at up to 130 °C, and intermediate levels of catalytic activity were preserved at up to 150 °C. The resistance of bulk proteinase K to thermal treatment was slightly enhanced by absorption into polyacrylamide (PAM) particles. Under these conditions, after 5 min at a temperature of 200 °C, which is required for the melt processing of PLLA, proteinase K was not completely denatured but retained around 2% enzymatic activity. Our findings reveal that the thermal processing of proteinase K in the dry state is principally feasible, but equally, they also identify needs and prospects for improvement. The experimental pipeline we establish for proteinase K analysis stands to benefit efforts directed to this end. More broadly, our work sheds light on enzymatically degradable polymers and the thermal processing of enzymes, which are of increasing economical and societal relevance.

Villalobos-Alva J, Ochoa-Toledo L, Villalobos-Alva MJ, Aliseda A, Pérez-Escamirosa F, Altamirano-Bustamante NF, Ochoa-Fernández F, Zamora-Solís R, Villalobos-Alva S, Revilla-Monsalve C, Kemper-Valverde N, Altamirano-Bustamante MM. Protein Science Meets Artificial Intelligence: A Systematic Review and a Biochemical Meta-Analysis of an Inter-Field. Front Bioeng Biotechnol 2022;10:788300. [PMID: 35875501 PMCID: PMC9301016 DOI: 10.3389/fbioe.2022.788300] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/02/2021] [Accepted: 05/25/2022] [Indexed: 11/23/2022] Open

Abstract

Proteins are some of the most fascinating and challenging molecules in the universe, and they pose a big challenge for artificial intelligence. The implementation of machine learning/AI in protein science gives rise to a world of knowledge adventures in the workhorse of the cell and proteome homeostasis, which are essential for making life possible. This opens up epistemic horizons thanks to a coupling of human tacit–explicit knowledge with machine learning power, the benefits of which are already tangible, such as important advances in protein structure prediction. Moreover, the driving force behind the protein processes of self-organization, adjustment, and fitness requires a space corresponding to gigabytes of life data in its order of magnitude. There are many tasks such as novel protein design, protein folding pathways, and synthetic metabolic routes, as well as protein-aggregation mechanisms, pathogenesis of protein misfolding and disease, and proteostasis networks that are currently unexplored or unrevealed. In this systematic review and biochemical meta-analysis, we aim to contribute to bridging the gap between what we call binomial artificial intelligence (AI) and protein science (PS), a growing research enterprise with exciting and promising biotechnological and biomedical applications. We undertake our task by exploring “the state of the art” in AI and machine learning (ML) applications to protein science in the scientific literature to address some critical research questions in this domain, including What kind of tasks are already explored by ML approaches to protein sciences? What are the most common ML algorithms and databases used? What is the situational diagnostic of the AI–PS inter-field? What do ML processing steps have in common? We also formulate novel questions such as Is it possible to discover what the rules of protein evolution are with the binomial AI–PS? How do protein folding pathways evolve? What are the rules that dictate the folds? 
What are the minimal nuclear protein structures? How do protein aggregates form and why do they exhibit different toxicities? What are the structural properties of amyloid proteins? How can we design an effective proteostasis network to deal with misfolded proteins? We are a cross-functional group of scientists from several academic disciplines, and we have conducted the systematic review using a variant of the PICO and PRISMA approaches. The search was carried out in four databases (PubMed, Bireme, OVID, and EBSCO Web of Science), resulting in 144 research articles. After three rounds of quality screening, 93 articles were finally selected for further analysis. A summary of our findings is as follows: regarding AI applications, there are mainly four types: 1) genomics, 2) protein structure and function, 3) protein design and evolution, and 4) drug design. In terms of the ML algorithms and databases used, supervised learning was the most common approach (85%). As for the databases used for the ML models, PDB and UniprotKB/Swissprot were the most common ones (21 and 8%, respectively). Moreover, we identified that approximately 63% of the articles organized their results into three steps, which we labeled pre-process, process, and post-process. A few studies combined data from several databases or created their own databases after the pre-process. Our main finding is that, as of today, there are no research road maps serving as guides to address gaps in our knowledge of the AI–PS binomial. All research efforts to collect, integrate multidimensional data features, and then analyze and validate them are, so far, uncoordinated and scattered throughout the scientific literature without a clear epistemic goal or connection between the studies. 
Therefore, our main contribution to the scientific literature is to offer a road map to help solve problems in drug design, protein structures, design, and function prediction while also presenting the “state of the art” on research in the AI–PS binomial until February 2021. Thus, we pave the way toward future advances in the synthetic redesign of novel proteins and protein networks and artificial metabolic pathways, learning lessons from nature for the welfare of humankind. Many of the novel proteins and metabolic pathways are currently non-existent in nature, nor are they used in the chemical industry or biomedical field.

Affiliation(s)

Jalil Villalobos-Alva: Unidad de Investigación en Enfermedades Metabólicas, Centro Médico Nacional Siglo XXI, Instituto Mexicano del Seguro Social, Mexico City, Mexico
Luis Ochoa-Toledo: Instituto de Ciencias Aplicadas y Tecnología (ICAT), Universidad Nacional Autónoma de México (UNAM), Mexico City, Mexico
Mario Javier Villalobos-Alva: Unidad de Investigación en Enfermedades Metabólicas, Centro Médico Nacional Siglo XXI, Instituto Mexicano del Seguro Social, Mexico City, Mexico
Atocha Aliseda: Instituto de Investigaciones Filosóficas, Universidad Nacional Autónoma de México (UNAM), Mexico City, Mexico
Fernando Pérez-Escamirosa: Instituto de Ciencias Aplicadas y Tecnología (ICAT), Universidad Nacional Autónoma de México (UNAM), Mexico City, Mexico
Nelly F. Altamirano-Bustamante: Instituto Nacional de Pediatría, Mexico City, Mexico
Francine Ochoa-Fernández: Unidad de Investigación en Enfermedades Metabólicas, Centro Médico Nacional Siglo XXI, Instituto Mexicano del Seguro Social, Mexico City, Mexico
Ricardo Zamora-Solís: Unidad de Investigación en Enfermedades Metabólicas, Centro Médico Nacional Siglo XXI, Instituto Mexicano del Seguro Social, Mexico City, Mexico
Sebastián Villalobos-Alva: Unidad de Investigación en Enfermedades Metabólicas, Centro Médico Nacional Siglo XXI, Instituto Mexicano del Seguro Social, Mexico City, Mexico
Cristina Revilla-Monsalve: Unidad de Investigación en Enfermedades Metabólicas, Centro Médico Nacional Siglo XXI, Instituto Mexicano del Seguro Social, Mexico City, Mexico
Nicolás Kemper-Valverde: Instituto de Ciencias Aplicadas y Tecnología (ICAT), Universidad Nacional Autónoma de México (UNAM), Mexico City, Mexico
Myriam M. Altamirano-Bustamante: Unidad de Investigación en Enfermedades Metabólicas, Centro Médico Nacional Siglo XXI, Instituto Mexicano del Seguro Social, Mexico City, Mexico; *Correspondence: Myriam M. Altamirano-Bustamante

Talluri S. Algorithms for protein design. Advances in Protein Chemistry and Structural Biology 2022;130:1-38. [PMID: 35534105 DOI: 10.1016/bs.apcsb.2022.01.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 11/16/2022]

Tatta ER, Imchen M, Moopantakath J, Kumavath R. Bioprospecting of microbial enzymes: current trends in industry and healthcare. Appl Microbiol Biotechnol 2022;106:1813-1835. [PMID: 35254498 DOI: 10.1007/s00253-022-11859-5] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Received: 02/15/2022] [Revised: 02/15/2022] [Accepted: 02/26/2022] [Indexed: 12/13/2022]

Kenny SE, Antaw F, Locke WJ, Howard CB, Korbie D, Trau M. Next-Generation Molecular Discovery: From Bottom-Up In Vivo and In Vitro Approaches to In Silico Top-Down Approaches for Therapeutics Neogenesis. Life (Basel) 2022;12:363. [PMID: 35330114 PMCID: PMC8950575 DOI: 10.3390/life12030363] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 02/09/2022] [Accepted: 02/23/2022] [Indexed: 12/02/2022] Open

Computational enzyme redesign: large jumps in function. Trends in Chemistry 2022. [DOI: 10.1016/j.trechm.2022.03.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/19/2022]

Vanella R, Kovacevic G, Doffini V, Fernández de Santaella J, Nash MA. High-throughput screening, next generation sequencing and machine learning: advanced methods in enzyme engineering. Chem Commun (Camb) 2022;58:2455-2467. [PMID: 35107442 PMCID: PMC8851469 DOI: 10.1039/d1cc04635g] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Indexed: 12/29/2022]

Yu Y, Wang R, Teo RD. Machine Learning Approaches for Metalloproteins. Molecules 2022;27:1277. [PMID: 35209064 PMCID: PMC8878495 DOI: 10.3390/molecules27041277] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 02/02/2022] [Revised: 02/10/2022] [Accepted: 02/11/2022] [Indexed: 01/10/2023]

Cadet XF, Gelly JC, van Noord A, Cadet F, Acevedo-Rocha CG. Learning Strategies in Protein Directed Evolution. Methods Mol Biol 2022;2461:225-275. [PMID: 35727454 DOI: 10.1007/978-1-0716-2152-3_15] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 12/13/2022]

Abstract

Synthetic biology is a fast-evolving research field that combines biology and engineering principles to develop new biological systems for medical, pharmacological, and industrial applications. Synthetic biologists use iterative "design, build, test, and learn" cycles to efficiently engineer genetic systems that are reliable, reproducible, and predictable. Protein engineering by directed evolution can benefit from such a systematic engineering approach for various reasons. Learning can be carried out before starting, throughout or after finalizing a directed evolution project. Computational tools, bioinformatics, and scanning mutagenesis methods can be excellent starting points, while molecular dynamics simulations and other strategies can guide engineering efforts. Similarly, studying protein intermediates along evolutionary pathways offers fascinating insights into the molecular mechanisms shaped by evolution. The learning step of the cycle is not only crucial for proteins or enzymes that are not suitable for high-throughput screening or selection systems, but it is also valuable for any platform that can generate a large amount of data that can be aided by machine learning algorithms. The main challenge in protein engineering is to predict the effect of a single mutation on one functional parameter-to say nothing of several mutations on multiple parameters. This is largely due to nonadditive mutational interactions, known as epistatic effects-beneficial mutations present in a genetic background may not be beneficial in another genetic background. In this work, we provide an overview of experimental and computational strategies that can guide the user to learn protein function at different stages in a directed evolution project. We also discuss how epistatic effects can influence the success of directed evolution projects. 
Since machine learning is gaining momentum in protein engineering and the field is becoming more interdisciplinary thanks to collaboration between mathematicians, computational scientists, engineers, molecular biologists, and chemists, we provide a general workflow that familiarizes nonexperts with the basic concepts, dataset requirements, learning approaches, model capabilities and performance metrics of this intriguing area. Finally, we also provide some practical recommendations on how machine learning can harness epistatic effects for engineering proteins in an "outside-the-box" way.

Saito Y, Oikawa M, Sato T, Nakazawa H, Ito T, Kameda T, Tsuda K, Umetsu M. Machine-Learning-Guided Library Design Cycle for Directed Evolution of Enzymes: The Effects of Training Data Composition on Sequence Space Exploration. ACS Catal 2021. [DOI: 10.1021/acscatal.1c03753] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Indexed: 01/02/2023]

Affiliation(s)

Yutaka Saito: Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), 2-4-7 Aomi, Koto-ku, Tokyo 135-0064, Japan; AIST-Waseda University Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), 3-4-1 Okubo, Shinjuku-ku, Tokyo 169-8555, Japan; Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8561, Japan; Center for Advanced Intelligence Project, RIKEN, 1-4-1 Nihombashi, Chuo-ku, Tokyo 103-0027, Japan
Misaki Oikawa: Department of Biomolecular Engineering, Graduate School of Engineering, Tohoku University, 6-6-11 Aoba, Aramaki, Aoba-ku, Sendai 980-8579, Japan
Takumi Sato: Department of Biomolecular Engineering, Graduate School of Engineering, Tohoku University, 6-6-11 Aoba, Aramaki, Aoba-ku, Sendai 980-8579, Japan
Hikaru Nakazawa: Department of Biomolecular Engineering, Graduate School of Engineering, Tohoku University, 6-6-11 Aoba, Aramaki, Aoba-ku, Sendai 980-8579, Japan
Tomoyuki Ito: Department of Biomolecular Engineering, Graduate School of Engineering, Tohoku University, 6-6-11 Aoba, Aramaki, Aoba-ku, Sendai 980-8579, Japan
Tomoshi Kameda: Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), 2-4-7 Aomi, Koto-ku, Tokyo 135-0064, Japan; Center for Advanced Intelligence Project, RIKEN, 1-4-1 Nihombashi, Chuo-ku, Tokyo 103-0027, Japan
Koji Tsuda: Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8561, Japan; Center for Advanced Intelligence Project, RIKEN, 1-4-1 Nihombashi, Chuo-ku, Tokyo 103-0027, Japan; Research and Services Division of Materials Data and Integrated System, National Institute for Materials Science, 1-2-1 Sengen, Tsukuba, Ibaraki 305-0047, Japan
Mitsuo Umetsu: Department of Biomolecular Engineering, Graduate School of Engineering, Tohoku University, 6-6-11 Aoba, Aramaki, Aoba-ku, Sendai 980-8579, Japan; Center for Advanced Intelligence Project, RIKEN, 1-4-1 Nihombashi, Chuo-ku, Tokyo 103-0027, Japan

Bertelsen AB, Hackney CM, Bayer CN, Kjelgaard LD, Rennig M, Christensen B, Sørensen ES, Safavi-Hemami H, Wulff T, Ellgaard L, Nørholm MHH. DisCoTune: versatile auxiliary plasmids for the production of disulphide-containing proteins and peptides in the E. coli T7 system. Microb Biotechnol 2021;14:2566-2580. [PMID: 34405535 PMCID: PMC8601162 DOI: 10.1111/1751-7915.13895] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 04/15/2021] [Revised: 06/15/2021] [Accepted: 07/04/2021] [Indexed: 11/28/2022] Open

Machine learning-guided acyl-ACP reductase engineering for improved in vivo fatty alcohol production. Nat Commun 2021;12:5825. [PMID: 34611172 PMCID: PMC8492656 DOI: 10.1038/s41467-021-25831-w] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Received: 05/21/2021] [Accepted: 09/01/2021] [Indexed: 02/04/2023] Open

Galanie S, Entwistle D, Lalonde J. Engineering biosynthetic enzymes for industrial natural product synthesis. Nat Prod Rep 2021;37:1122-1143. [PMID: 32364202 DOI: 10.1039/c9np00071b] [Citation(s) in RCA: 45] [Impact Index Per Article: 15.0] [Indexed: 12/12/2022]

Dutta K, Shityakov S, Khalifa I. New Trends in Bioremediation Technologies Toward Environment-Friendly Society: A Mini-Review. Front Bioeng Biotechnol 2021;9:666858. [PMID: 34409018 PMCID: PMC8365754 DOI: 10.3389/fbioe.2021.666858] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 02/11/2021] [Accepted: 05/26/2021] [Indexed: 01/29/2023] Open

Yi D, Bayer T, Badenhorst CPS, Wu S, Doerr M, Höhne M, Bornscheuer UT. Recent trends in biocatalysis. Chem Soc Rev 2021;50:8003-8049. [PMID: 34142684 PMCID: PMC8288269 DOI: 10.1039/d0cs01575j] [Citation(s) in RCA: 115] [Impact Index Per Article: 38.3] [Received: 12/19/2020] [Indexed: 12/13/2022]

Siedhoff NE, Illig AM, Schwaneberg U, Davari MD. PyPEF-An Integrated Framework for Data-Driven Protein Engineering. J Chem Inf Model 2021;61:3463-3476. [PMID: 34260225 DOI: 10.1021/acs.jcim.1c00099] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Indexed: 12/12/2022]

Abstract

Data-driven strategies are gaining increased attention in protein engineering due to recent advances in access to large experimental databanks of proteins, next-generation sequencing (NGS), high-throughput screening (HTS) methods, and the development of artificial intelligence algorithms. However, the reliable prediction of beneficial amino acid substitutions, their combination, and the effect on functional properties remain the most significant challenges in protein engineering, which is applied to develop proteins and enzymes for biocatalysis, biomedicine, and life sciences. Here, we present a general-purpose framework (PyPEF: pythonic protein engineering framework) for performing data-driven protein engineering using machine learning methods combined with techniques from signal processing and statistical physics. PyPEF guides the identification and selection of beneficial proteins of a defined sequence space by systematically or randomly exploring the fitness of variants and by sampling random evolution pathways. The performance of PyPEF was evaluated concerning its predictive accuracy and throughput on four public protein and enzyme data sets using common regression models. It was proved that the program could efficiently predict the fitness of protein sequences for different target properties (predictive models with coefficient of determination values ranging from 0.58 to 0.92). By combining machine learning and protein evolution, PyPEF enabled the screening of proteins with various functions, reaching a screening capacity of more than 500,000 protein sequence variants in the timeframe of only a few minutes on a personal computer. 
PyPEF displayed significant accuracies on four public data sets (different proteins and properties) and underlined the potential of integrating data-driven technologies for covering different philosophies by either predicting the fitness of the variants to the highest accuracy accounting for epistatic effects or capturing the general trend of introduced mutations on the fitness in directed protein evolution campaigns. In essence, PyPEF can provide a powerful solution to current sequence exploration and combinatorial problems faced in protein engineering through exhaustive in silico screening of the sequence space.


Wu Z, Johnston KE, Arnold FH, Yang KK. Protein sequence design with deep generative models. Curr Opin Chem Biol 2021;65:18-27. [PMID: 34051682 DOI: 10.1016/j.cbpa.2021.04.004] [Citation(s) in RCA: 52] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2021] [Revised: 04/02/2021] [Accepted: 04/07/2021] [Indexed: 12/20/2022]

Sunny JS, Nisha K, Natarajan A, Saleena LM. IND-enzymes: a repository for hydrolytic enzymes derived from thermophilic and psychrophilic bacterial species with potential industrial usage. Extremophiles 2021;25:319-325. [PMID: 33961119 DOI: 10.1007/s00792-021-01231-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2021] [Accepted: 04/22/2021] [Indexed: 10/21/2022]

Ferguson AL, Ranganathan R. 100th Anniversary of Macromolecular Science Viewpoint: Data-Driven Protein Design. ACS Macro Lett 2021;10:327-340. [PMID: 35549066 DOI: 10.1021/acsmacrolett.0c00885] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]

Li G, Qin Y, Fontaine NT, Ng Fuk Chong M, Maria‐Solano MA, Feixas F, Cadet XF, Pandjaitan R, Garcia‐Borràs M, Cadet F, Reetz MT. Machine Learning Enables Selection of Epistatic Enzyme Mutants for Stability Against Unfolding and Detrimental Aggregation. Chembiochem 2021;22:904-914. [PMID: 33094545 PMCID: PMC7984044 DOI: 10.1002/cbic.202000612] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2020] [Revised: 10/22/2020] [Indexed: 12/15/2022]

Affiliation(s)

Guangyue Li State Key Laboratory for Biology of Plant Diseases and Insect Pests, Key Laboratory of Control of Biological Hazard Factors (Plant Origin) for Agri-product Quality and Safety, Ministry of Agriculture, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing 100081, P. R. China
Youcai Qin State Key Laboratory for Biology of Plant Diseases and Insect Pests, Key Laboratory of Control of Biological Hazard Factors (Plant Origin) for Agri-product Quality and Safety, Ministry of Agriculture, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing 100081, P. R. China
Nicolas T. Fontaine PEACCEL, Artificial Intelligence Department, 6 Square Albin Cachot, Box 42, 75013 Paris, France
Matthieu Ng Fuk Chong PEACCEL, Artificial Intelligence Department, 6 Square Albin Cachot, Box 42, 75013 Paris, France
Miguel A. Maria‐Solano Institut de Química Computacional i Catàlisi and Departament de Química, Universitat de Girona, Campus Montilivi, 17003 Girona, Catalonia, Spain
Ferran Feixas Institut de Química Computacional i Catàlisi and Departament de Química, Universitat de Girona, Campus Montilivi, 17003 Girona, Catalonia, Spain
Xavier F. Cadet PEACCEL, Artificial Intelligence Department, 6 Square Albin Cachot, Box 42, 75013 Paris, France
Rudy Pandjaitan PEACCEL, Artificial Intelligence Department, 6 Square Albin Cachot, Box 42, 75013 Paris, France
Marc Garcia‐Borràs Institut de Química Computacional i Catàlisi and Departament de Química, Universitat de Girona, Campus Montilivi, 17003 Girona, Catalonia, Spain
Frederic Cadet PEACCEL, Artificial Intelligence Department, 6 Square Albin Cachot, Box 42, 75013 Paris, France
Manfred T. Reetz Department of Chemistry, Philipps-Universität, 35032 Marburg, Germany; Max-Planck-Institut fuer Kohlenforschung, 45470 Mülheim, Germany; Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, 32 West 7th Avenue, Tianjin Airport Economic Area, 300308 Tianjin, P. R. China


Zhao Y, Li D, Bai X, Luo M, Feng Y, Zhao Y, Ma F, Yang GY. Improved thermostability of proteinase K and recognizing the synergistic effect of Rosetta and FoldX approaches. Protein Eng Des Sel 2021;34:6404066. [PMID: 34671809 DOI: 10.1093/protein/gzab024] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2021] [Revised: 08/24/2021] [Accepted: 08/25/2021] [Indexed: 11/14/2022] Open

Unger EK, Keller JP, Altermatt M, Liang R, Matsui A, Dong C, Hon OJ, Yao Z, Sun J, Banala S, Flanigan ME, Jaffe DA, Hartanto S, Carlen J, Mizuno GO, Borden PM, Shivange AV, Cameron LP, Sinning S, Underhill SM, Olson DE, Amara SG, Temple Lang D, Rudnick G, Marvin JS, Lavis LD, Lester HA, Alvarez VA, Fisher AJ, Prescher JA, Kash TL, Yarov-Yarovoy V, Gradinaru V, Looger LL, Tian L. Directed Evolution of a Selective and Sensitive Serotonin Sensor via Machine Learning. Cell 2020;183:1986-2002.e26. [PMID: 33333022 PMCID: PMC8025677 DOI: 10.1016/j.cell.2020.11.040] [Citation(s) in RCA: 92] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2019] [Revised: 06/22/2020] [Accepted: 11/20/2020] [Indexed: 12/28/2022]

Affiliation(s)

Elizabeth K Unger Departments of Biochemistry and Molecular Medicine, Chemistry, Statistics, Molecular and Cellular Biology, and Physiology and Membrane Biology, the Center for Neuroscience, and Graduate Programs in Molecular, Cellular, and Integrative Physiology, Biochemistry, Molecular, Cellular and Developmental Biology and Neuroscience, University of California, Davis, Davis, CA 95616, USA
Jacob P Keller Janelia Research Campus, Howard Hughes Medical Institute, Ashburn, VA 20174, USA
Michael Altermatt Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA
Ruqiang Liang Departments of Biochemistry and Molecular Medicine, Chemistry, Statistics, Molecular and Cellular Biology, and Physiology and Membrane Biology, the Center for Neuroscience, and Graduate Programs in Molecular, Cellular, and Integrative Physiology, Biochemistry, Molecular, Cellular and Developmental Biology and Neuroscience, University of California, Davis, Davis, CA 95616, USA
Aya Matsui Laboratory on Neurobiology of Compulsive Behaviors, National Institute on Alcohol Abuse and Alcoholism, NIH, Bethesda, MD 20892, USA
Chunyang Dong Departments of Biochemistry and Molecular Medicine, Chemistry, Statistics, Molecular and Cellular Biology, and Physiology and Membrane Biology, the Center for Neuroscience, and Graduate Programs in Molecular, Cellular, and Integrative Physiology, Biochemistry, Molecular, Cellular and Developmental Biology and Neuroscience, University of California, Davis, Davis, CA 95616, USA
Olivia J Hon Bowles Center for Alcohol Studies, Department of Pharmacology, University of North Carolina School of Medicine, Chapel Hill, NC 27599, USA
Zi Yao Department of Chemistry, University of California, Irvine, Irvine, CA 92697, USA
Junqing Sun Departments of Biochemistry and Molecular Medicine, Chemistry, Statistics, Molecular and Cellular Biology, and Physiology and Membrane Biology, the Center for Neuroscience, and Graduate Programs in Molecular, Cellular, and Integrative Physiology, Biochemistry, Molecular, Cellular and Developmental Biology and Neuroscience, University of California, Davis, Davis, CA 95616, USA
Samba Banala Janelia Research Campus, Howard Hughes Medical Institute, Ashburn, VA 20174, USA
Meghan E Flanigan Bowles Center for Alcohol Studies, Department of Pharmacology, University of North Carolina School of Medicine, Chapel Hill, NC 27599, USA
David A Jaffe Departments of Biochemistry and Molecular Medicine, Chemistry, Statistics, Molecular and Cellular Biology, and Physiology and Membrane Biology, the Center for Neuroscience, and Graduate Programs in Molecular, Cellular, and Integrative Physiology, Biochemistry, Molecular, Cellular and Developmental Biology and Neuroscience, University of California, Davis, Davis, CA 95616, USA
Samantha Hartanto Departments of Biochemistry and Molecular Medicine, Chemistry, Statistics, Molecular and Cellular Biology, and Physiology and Membrane Biology, the Center for Neuroscience, and Graduate Programs in Molecular, Cellular, and Integrative Physiology, Biochemistry, Molecular, Cellular and Developmental Biology and Neuroscience, University of California, Davis, Davis, CA 95616, USA
Jane Carlen Departments of Biochemistry and Molecular Medicine, Chemistry, Statistics, Molecular and Cellular Biology, and Physiology and Membrane Biology, the Center for Neuroscience, and Graduate Programs in Molecular, Cellular, and Integrative Physiology, Biochemistry, Molecular, Cellular and Developmental Biology and Neuroscience, University of California, Davis, Davis, CA 95616, USA
Grace O Mizuno Departments of Biochemistry and Molecular Medicine, Chemistry, Statistics, Molecular and Cellular Biology, and Physiology and Membrane Biology, the Center for Neuroscience, and Graduate Programs in Molecular, Cellular, and Integrative Physiology, Biochemistry, Molecular, Cellular and Developmental Biology and Neuroscience, University of California, Davis, Davis, CA 95616, USA
Phillip M Borden Janelia Research Campus, Howard Hughes Medical Institute, Ashburn, VA 20174, USA
Amol V Shivange Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA
Lindsay P Cameron Departments of Biochemistry and Molecular Medicine, Chemistry, Statistics, Molecular and Cellular Biology, and Physiology and Membrane Biology, the Center for Neuroscience, and Graduate Programs in Molecular, Cellular, and Integrative Physiology, Biochemistry, Molecular, Cellular and Developmental Biology and Neuroscience, University of California, Davis, Davis, CA 95616, USA
Steffen Sinning Department of Pharmacology, Yale University School of Medicine, New Haven, CT 06520, USA
Suzanne M Underhill Laboratory of Molecular and Cellular Neurobiology, National Institute on Mental Health, NIH, Bethesda, MD 20892, USA
David E Olson Departments of Biochemistry and Molecular Medicine, Chemistry, Statistics, Molecular and Cellular Biology, and Physiology and Membrane Biology, the Center for Neuroscience, and Graduate Programs in Molecular, Cellular, and Integrative Physiology, Biochemistry, Molecular, Cellular and Developmental Biology and Neuroscience, University of California, Davis, Davis, CA 95616, USA
Susan G Amara Laboratory of Molecular and Cellular Neurobiology, National Institute on Mental Health, NIH, Bethesda, MD 20892, USA
Duncan Temple Lang Departments of Biochemistry and Molecular Medicine, Chemistry, Statistics, Molecular and Cellular Biology, and Physiology and Membrane Biology, the Center for Neuroscience, and Graduate Programs in Molecular, Cellular, and Integrative Physiology, Biochemistry, Molecular, Cellular and Developmental Biology and Neuroscience, University of California, Davis, Davis, CA 95616, USA
Gary Rudnick Department of Pharmacology, Yale University School of Medicine, New Haven, CT 06520, USA
Jonathan S Marvin Janelia Research Campus, Howard Hughes Medical Institute, Ashburn, VA 20174, USA
Luke D Lavis Janelia Research Campus, Howard Hughes Medical Institute, Ashburn, VA 20174, USA
Henry A Lester Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA
Veronica A Alvarez Laboratory on Neurobiology of Compulsive Behaviors, National Institute on Alcohol Abuse and Alcoholism, NIH, Bethesda, MD 20892, USA
Andrew J Fisher Departments of Biochemistry and Molecular Medicine, Chemistry, Statistics, Molecular and Cellular Biology, and Physiology and Membrane Biology, the Center for Neuroscience, and Graduate Programs in Molecular, Cellular, and Integrative Physiology, Biochemistry, Molecular, Cellular and Developmental Biology and Neuroscience, University of California, Davis, Davis, CA 95616, USA
Jennifer A Prescher Department of Chemistry, University of California, Irvine, Irvine, CA 92697, USA
Thomas L Kash Bowles Center for Alcohol Studies, Department of Pharmacology, University of North Carolina School of Medicine, Chapel Hill, NC 27599, USA
Vladimir Yarov-Yarovoy Departments of Biochemistry and Molecular Medicine, Chemistry, Statistics, Molecular and Cellular Biology, and Physiology and Membrane Biology, the Center for Neuroscience, and Graduate Programs in Molecular, Cellular, and Integrative Physiology, Biochemistry, Molecular, Cellular and Developmental Biology and Neuroscience, University of California, Davis, Davis, CA 95616, USA
Viviana Gradinaru Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA
Loren L Looger Janelia Research Campus, Howard Hughes Medical Institute, Ashburn, VA 20174, USA.
Lin Tian Departments of Biochemistry and Molecular Medicine, Chemistry, Statistics, Molecular and Cellular Biology, and Physiology and Membrane Biology, the Center for Neuroscience, and Graduate Programs in Molecular, Cellular, and Integrative Physiology, Biochemistry, Molecular, Cellular and Developmental Biology and Neuroscience, University of California, Davis, Davis, CA 95616, USA.


Song H, Bremer BJ, Hinds EC, Raskutti G, Romero PA. Inferring Protein Sequence-Function Relationships with Large-Scale Positive-Unlabeled Learning. Cell Syst 2020;12:92-101.e8. [PMID: 33212013 DOI: 10.1016/j.cels.2020.10.007] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2019] [Revised: 08/13/2020] [Accepted: 10/22/2020] [Indexed: 10/22/2022]

Troiano D, Orsat V, Dumont MJ. Status of Biocatalysis in the Production of 2,5-Furandicarboxylic Acid. ACS Catal 2020. [DOI: 10.1021/acscatal.0c02378] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]

Siedhoff NE, Schwaneberg U, Davari MD. Machine learning-assisted enzyme engineering. Methods Enzymol 2020;643:281-315. [DOI: 10.1016/bs.mie.2020.05.005] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]

Chowdhury R, Maranas CD. From directed evolution to computational enzyme engineering—A review. AIChE J 2019. [DOI: 10.1002/aic.16847] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]

Improving the catalytic performance of Proteinase K from Parengyodontium album for use in feather degradation. Int J Biol Macromol 2019;154:1586-1595. [PMID: 31706815 DOI: 10.1016/j.ijbiomac.2019.11.043] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2019] [Revised: 11/06/2019] [Accepted: 11/06/2019] [Indexed: 01/14/2023]

Yang KK, Wu Z, Arnold FH. Machine-learning-guided directed evolution for protein engineering. Nat Methods 2019;16:687-694. [PMID: 31308553 DOI: 10.1038/s41592-019-0496-6] [Citation(s) in RCA: 431] [Impact Index Per Article: 86.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Accepted: 06/17/2019] [Indexed: 02/06/2023]

Kent R, Dixon N. Systematic Evaluation of Genetic and Environmental Factors Affecting Performance of Translational Riboswitches. ACS Synth Biol 2019;8:884-901. [PMID: 30897329 PMCID: PMC6492952 DOI: 10.1021/acssynbio.9b00017] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]

Li G, Dong Y, Reetz MT. Can Machine Learning Revolutionize Directed Evolution of Selective Enzymes? Adv Synth Catal 2019. [DOI: 10.1002/adsc.201900149] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]

Moore JC, Rodriguez-Granillo A, Crespo A, Govindarajan S, Welch M, Hiraga K, Lexa K, Marshall N, Truppo MD. "Site and Mutation"-Specific Predictions Enable Minimal Directed Evolution Libraries. ACS Synth Biol 2018;7:1730-1741. [PMID: 29782150 DOI: 10.1021/acssynbio.7b00359] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Brown SR, Staff M, Lee R, Love J, Parker DA, Aves SJ, Howard TP. Design of Experiments Methodology to Build a Multifactorial Statistical Model Describing the Metabolic Interactions of Alcohol Dehydrogenase Isozymes in the Ethanol Biosynthetic Pathway of the Yeast Saccharomyces cerevisiae. ACS Synth Biol 2018;7:1676-1684. [PMID: 29976056 DOI: 10.1021/acssynbio.8b00112] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]

Rigoldi F, Donini S, Redaelli A, Parisini E, Gautieri A. Review: Engineering of thermostable enzymes for industrial applications. APL Bioeng 2018;2:011501. [PMID: 31069285 PMCID: PMC6481699 DOI: 10.1063/1.4997367] [Citation(s) in RCA: 155] [Impact Index Per Article: 25.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2017] [Accepted: 11/14/2017] [Indexed: 01/19/2023] Open

Getting Momentum: From Biocatalysis to Advanced Synthetic Biology. Trends Biochem Sci 2018;43:180-198. [DOI: 10.1016/j.tibs.2018.01.003] [Citation(s) in RCA: 58] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2017] [Revised: 01/08/2018] [Accepted: 01/10/2018] [Indexed: 11/20/2022]

Lutz S, Iamurri SM. Protein Engineering: Past, Present, and Future. Methods Mol Biol 2018;1685:1-12. [PMID: 29086300 DOI: 10.1007/978-1-4939-7366-8_1] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]

Learning epistatic interactions from sequence-activity data to predict enantioselectivity. J Comput Aided Mol Des 2017;31:1085-1096. [DOI: 10.1007/s10822-017-0090-x] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2017] [Accepted: 12/04/2017] [Indexed: 10/18/2022]

Musdal Y, Govindarajan S, Mannervik B. Exploring sequence-function space of a poplar glutathione transferase using designed information-rich gene variants. Protein Eng Des Sel 2017;30:543-549. [PMID: 28967959 PMCID: PMC5914380 DOI: 10.1093/protein/gzx045] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2017] [Accepted: 08/15/2017] [Indexed: 01/19/2023] Open

Carlin DA, Caster RW, Wang X, Betzenderfer SA, Chen CX, Duong VM, Ryklansky CV, Alpekin A, Beaumont N, Kapoor H, Kim N, Mohabbot H, Pang B, Teel R, Whithaus L, Tagkopoulos I, Siegel JB. Kinetic Characterization of 100 Glycoside Hydrolase Mutants Enables the Discovery of Structural Features Correlated with Kinetic Constants. PLoS One 2016;11:e0147596. [PMID: 26815142 PMCID: PMC4729467 DOI: 10.1371/journal.pone.0147596] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2015] [Accepted: 01/06/2016] [Indexed: 11/18/2022] Open

Affiliation(s)

Dylan Alexander Carlin Biophysics Graduate Group, University of California Davis, California, United States of America
Ryan W. Caster Genome Center, University of California Davis, Davis, California, United States of America
Xiaokang Wang Department of Biomedical Engineering, University of California Davis, Davis, California, United States of America
Stephanie A. Betzenderfer Genome Center, University of California Davis, Davis, California, United States of America
Claire X. Chen Genome Center, University of California Davis, Davis, California, United States of America
Veasna M. Duong Genome Center, University of California Davis, Davis, California, United States of America
Carolina V. Ryklansky Genome Center, University of California Davis, Davis, California, United States of America
Alp Alpekin Genome Center, University of California Davis, Davis, California, United States of America
Nathan Beaumont Genome Center, University of California Davis, Davis, California, United States of America
Harshul Kapoor Genome Center, University of California Davis, Davis, California, United States of America
Nicole Kim Genome Center, University of California Davis, Davis, California, United States of America
Hosna Mohabbot Genome Center, University of California Davis, Davis, California, United States of America
Boyu Pang Genome Center, University of California Davis, Davis, California, United States of America
Rachel Teel Genome Center, University of California Davis, Davis, California, United States of America
Lillian Whithaus Genome Center, University of California Davis, Davis, California, United States of America
Ilias Tagkopoulos Genome Center, University of California Davis, Davis, California, United States of America; Department of Computer Science, University of California Davis, Davis, California, United States of America
Justin B. Siegel Genome Center, University of California Davis, Davis, California, United States of America; Department of Chemistry, University of California Davis, Davis, California, United States of America; Department of Biochemistry & Molecular Medicine, University of California Davis, Davis, California, United States of America
