1
|
Zhou J, Huang M. Navigating the landscape of enzyme design: from molecular simulations to machine learning. Chem Soc Rev 2024. [PMID: 38990263 DOI: 10.1039/d4cs00196f] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/12/2024]
Abstract
Global environmental issues and sustainable development call for new technologies for fine chemical synthesis and waste valorization. Biocatalysis has attracted great attention as the alternative to the traditional organic synthesis. However, it is challenging to navigate the vast sequence space to identify those proteins with admirable biocatalytic functions. The recent development of deep-learning based structure prediction methods such as AlphaFold2 reinforced by different computational simulations or multiscale calculations has largely expanded the 3D structure databases and enabled structure-based design. While structure-based approaches shed light on site-specific enzyme engineering, they are not suitable for large-scale screening of potential biocatalysts. Effective utilization of big data using machine learning techniques opens up a new era for accelerated predictions. Here, we review the approaches and applications of structure-based and machine-learning guided enzyme design. We also provide our view on the challenges and perspectives on effectively employing enzyme design approaches integrating traditional molecular simulations and machine learning, and the importance of database construction and algorithm development in attaining predictive ML models to explore the sequence fitness landscape for the design of admirable biocatalysts.
Collapse
Affiliation(s)
- Jiahui Zhou
- School of Chemistry and Chemical Engineering, Queen's University, David Keir Building, Stranmillis Road, Belfast BT9 5AG, Northern Ireland, UK.
| | - Meilan Huang
- School of Chemistry and Chemical Engineering, Queen's University, David Keir Building, Stranmillis Road, Belfast BT9 5AG, Northern Ireland, UK.
| |
Collapse
|
2
|
Xu R, Bao Y, Jiao F, Li M, Zhang X, Zhang F, Guo J. Unraveling the atomic mechanisms underlying glyphosate insensitivity in EPSPS: implications of distal mutations. J Biomol Struct Dyn 2024:1-12. [PMID: 38400730 DOI: 10.1080/07391102.2024.2318472] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2023] [Accepted: 02/08/2024] [Indexed: 02/26/2024]
Abstract
5-enolpyruvyl shikimate-3-phosphate synthase (EPSPS), as an indispensable enzyme in the shikimate pathway, is the specific target of grasser killer glyphosate (GPJ). GPJ is a competitive inhibitor of phosphoenolpyruvate (PEP), which is the natural substrate of EPSPS. A novel Ls-EPSPS gene variant discovered from Liliaceae, named ELs-EPSPS, includes five distal mutations, E112V, D142N, T351S, D425G, and R496G, endowing high GPJ insensitivity. However, the implicit molecular mechanism of the enhanced tolerance/insensitivity of GPJ in ELs-EPSPS is not fully understood. Herein, we try to interpret the hidden molecular mechanism using computational methods. Computational results reveal the enhanced flexibility of apo EPSPS upon mutations. The enhanced affinity of the initial binding substrate shikimate-3-phosphate (S3P), and the higher probability of second ligands PEP/GPJ entering the pocket are observed in the ELs-EPSPS-S3P system. Docking and MD results further confirmed the decreased GPJ-induced EPSPS inhibition upon mutations. And, the alterations of K98 and R179 side-chain orientations upon mutations are detrimental to GPJ binding at the active site. Additionally, the oscillation of side chain K98, in charge of PEP location, improves the proximity effect for substrates in the dual-substrate systems upon mutations. Our results clarify that the enhanced GPJ tolerance of EPSPS is achieved from decreased competitive inhibition of GPJ at the atomic perspective, and this finding further contributes to the cultivation of EPSPS genes with higher GPJ tolerance/insensitivity and a mighty renovation for developing glyphosate-resistant crops.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Ran Xu
- Centre in Artificial Intelligence Driven Drug Discovery, Faculty of Applied Sciences, Macao Polytechnic University, Macao, China
- College of Life Sciences, Nanjing Agricultural University, Nanjing, China
| | - Yiqiong Bao
- College of Life Sciences, Nanjing Agricultural University, Nanjing, China
| | - Fangfang Jiao
- Centre in Artificial Intelligence Driven Drug Discovery, Faculty of Applied Sciences, Macao Polytechnic University, Macao, China
| | - Mengrong Li
- College of Life Sciences, Nanjing Agricultural University, Nanjing, China
| | - Xiaoxiao Zhang
- College of Life Sciences, Nanjing Agricultural University, Nanjing, China
| | - Feng Zhang
- College of Plant Protection, Nanjing Agricultural University, Nanjing, China
| | - Jingjing Guo
- Centre in Artificial Intelligence Driven Drug Discovery, Faculty of Applied Sciences, Macao Polytechnic University, Macao, China
- Engineering Research Centre of Applied Technology on Machine Translation and Artificial Intelligence, Macao Polytechnic University, Macao, China
| |
Collapse
|
3
|
Prešern U, Goličnik M. Enzyme Databases in the Era of Omics and Artificial Intelligence. Int J Mol Sci 2023; 24:16918. [PMID: 38069254 PMCID: PMC10707154 DOI: 10.3390/ijms242316918] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2023] [Revised: 11/24/2023] [Accepted: 11/26/2023] [Indexed: 12/18/2023] Open
Abstract
Enzyme research is important for the development of various scientific fields such as medicine and biotechnology. Enzyme databases facilitate this research by providing a wide range of information relevant to research planning and data analysis. Over the years, various databases that cover different aspects of enzyme biology (e.g., kinetic parameters, enzyme occurrence, and reaction mechanisms) have been developed. Most of the databases are curated manually, which improves reliability of the information; however, such curation cannot keep pace with the exponential growth in published data. Lack of data standardization is another obstacle for data extraction and analysis. Improving machine readability of databases is especially important in the light of recent advances in deep learning algorithms that require big training datasets. This review provides information regarding the current state of enzyme databases, especially in relation to the ever-increasing amount of generated research data and recent advancements in artificial intelligence algorithms. Furthermore, it describes several enzyme databases, providing the reader with necessary information for their use.
Collapse
Affiliation(s)
| | - Marko Goličnik
- Institute of Biochemistry and Molecular Genetics, Faculty of Medicine, University of Ljubljana, Vrazov trg 2, 1000 Ljubljana, Slovenia;
| |
Collapse
|
4
|
Kouba P, Kohout P, Haddadi F, Bushuiev A, Samusevich R, Sedlar J, Damborsky J, Pluskal T, Sivic J, Mazurenko S. Machine Learning-Guided Protein Engineering. ACS Catal 2023; 13:13863-13895. [PMID: 37942269 PMCID: PMC10629210 DOI: 10.1021/acscatal.3c02743] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 09/20/2023] [Indexed: 11/10/2023]
Abstract
Recent progress in engineering highly promising biocatalysts has increasingly involved machine learning methods. These methods leverage existing experimental and simulation data to aid in the discovery and annotation of promising enzymes, as well as in suggesting beneficial mutations for improving known targets. The field of machine learning for protein engineering is gathering steam, driven by recent success stories and notable progress in other areas. It already encompasses ambitious tasks such as understanding and predicting protein structure and function, catalytic efficiency, enantioselectivity, protein dynamics, stability, solubility, aggregation, and more. Nonetheless, the field is still evolving, with many challenges to overcome and questions to address. In this Perspective, we provide an overview of ongoing trends in this domain, highlight recent case studies, and examine the current limitations of machine learning-based methods. We emphasize the crucial importance of thorough experimental validation of emerging models before their use for rational protein design. We present our opinions on the fundamental problems and outline the potential directions for future research.
Collapse
Affiliation(s)
- Petr Kouba
- Loschmidt
Laboratories, Department of Experimental Biology and RECETOX, Faculty
of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech
Republic
- Czech Institute
of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
- Faculty of
Electrical Engineering, Czech Technical
University in Prague, Technicka 2, 166 27 Prague 6, Czech Republic
| | - Pavel Kohout
- Loschmidt
Laboratories, Department of Experimental Biology and RECETOX, Faculty
of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech
Republic
- International
Clinical Research Center, St. Anne’s
University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Faraneh Haddadi
- Loschmidt
Laboratories, Department of Experimental Biology and RECETOX, Faculty
of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech
Republic
- International
Clinical Research Center, St. Anne’s
University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Anton Bushuiev
- Czech Institute
of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
| | - Raman Samusevich
- Czech Institute
of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
- Institute
of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo nám. 2, 160 00 Prague 6, Czech Republic
| | - Jiri Sedlar
- Czech Institute
of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
| | - Jiri Damborsky
- Loschmidt
Laboratories, Department of Experimental Biology and RECETOX, Faculty
of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech
Republic
- International
Clinical Research Center, St. Anne’s
University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Tomas Pluskal
- Institute
of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo nám. 2, 160 00 Prague 6, Czech Republic
| | - Josef Sivic
- Czech Institute
of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
| | - Stanislav Mazurenko
- Loschmidt
Laboratories, Department of Experimental Biology and RECETOX, Faculty
of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech
Republic
- International
Clinical Research Center, St. Anne’s
University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| |
Collapse
|
5
|
Wen Y, Xu J, Pan D, Wang C. Removal of substrate inhibition of Acinetobacter baumannii xanthine oxidase by point mutation at Gln-201 enables efficient reduction of purine content in fish sauce. Food Chem X 2023; 17:100593. [PMID: 36845495 PMCID: PMC9944496 DOI: 10.1016/j.fochx.2023.100593] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Revised: 02/02/2023] [Accepted: 02/02/2023] [Indexed: 02/09/2023] Open
Abstract
Xanthine oxidase is an oxidase that has a molybdopterin structure with substrate inhibition. Here, we show that a single point mutation (Q201) in the Acinetobacter baumannii xanthine oxidase (AbXOD) obtained mutant Q201E (k cat =799.44 s-1, no inhibition) with high enzyme activity and decrease of substrate inhibition in 5 mmol/L high substrate model, and which cause two loops structure change at active center, characterized by complete loss of substrate inhibition without reduction of enzymatic activity. Molecular docking results showed that the change of flexible loop increased the affinity between substrate and enzyme, and the formation of a π-π bond and two hydrogen bonds made the substrate more stable in the active center. Ultimately, Q201E can still maintain better enzyme activity under high purine content (an approximately 7-fold improvement over the wild-type), indicating a broader application prospect in the manufacture of low-purine food.
Collapse
Affiliation(s)
| | | | | | - Chenghua Wang
- Corresponding author at: College of Light Industry and Food Engineering, 100 Daxue East Road, Nanning 530004, People’s Republic of China.
| |
Collapse
|
6
|
Wei GW, Soares TA, Wahab H, Zhu F. Computational Chemistry in Asia. J Chem Inf Model 2022; 62:5035-5037. [DOI: 10.1021/acs.jcim.2c01050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
7
|
Velecký J, Hamsikova M, Stourac J, Musil M, Damborsk J, Bednar D, Mazurenko S. SoluProtMutDB: a manually curated database of protein solubility changes upon mutations. Comput Struct Biotechnol J 2022; 20:6339-6347. [DOI: 10.1016/j.csbj.2022.11.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Revised: 11/04/2022] [Accepted: 11/04/2022] [Indexed: 11/11/2022] Open
|
8
|
Vasina M, Velecký J, Planas-Iglesias J, Marques SM, Skarupova J, Damborsky J, Bednar D, Mazurenko S, Prokop Z. Tools for computational design and high-throughput screening of therapeutic enzymes. Adv Drug Deliv Rev 2022; 183:114143. [PMID: 35167900 DOI: 10.1016/j.addr.2022.114143] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2021] [Revised: 02/04/2022] [Accepted: 02/09/2022] [Indexed: 12/16/2022]
Abstract
Therapeutic enzymes are valuable biopharmaceuticals in various biomedical applications. They have been successfully applied for fibrinolysis, cancer treatment, enzyme replacement therapies, and the treatment of rare diseases. Still, there is a permanent demand to find new or better therapeutic enzymes, which would be sufficiently soluble, stable, and active to meet specific medical needs. Here, we highlight the benefits of coupling computational approaches with high-throughput experimental technologies, which significantly accelerate the identification and engineering of catalytic therapeutic agents. New enzymes can be identified in genomic and metagenomic databases, which grow thanks to next-generation sequencing technologies exponentially. Computational design and machine learning methods are being developed to improve catalytically potent enzymes and predict their properties to guide the selection of target enzymes. High-throughput experimental pipelines, increasingly relying on microfluidics, ensure functional screening and biochemical characterization of target enzymes to reach efficient therapeutic enzymes.
Collapse
Affiliation(s)
- Michal Vasina
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; International Clinical Research Centre, St. Anne's University Hospital, Pekarska 53, Brno, Czech Republic
| | - Jan Velecký
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic
| | - Joan Planas-Iglesias
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; International Clinical Research Centre, St. Anne's University Hospital, Pekarska 53, Brno, Czech Republic
| | - Sergio M Marques
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; International Clinical Research Centre, St. Anne's University Hospital, Pekarska 53, Brno, Czech Republic
| | - Jana Skarupova
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic
| | - Jiri Damborsky
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; International Clinical Research Centre, St. Anne's University Hospital, Pekarska 53, Brno, Czech Republic; Enantis, INBIT, Kamenice 34, Brno, Czech Republic
| | - David Bednar
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; International Clinical Research Centre, St. Anne's University Hospital, Pekarska 53, Brno, Czech Republic.
| | - Stanislav Mazurenko
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; International Clinical Research Centre, St. Anne's University Hospital, Pekarska 53, Brno, Czech Republic.
| | - Zbynek Prokop
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; International Clinical Research Centre, St. Anne's University Hospital, Pekarska 53, Brno, Czech Republic.
| |
Collapse
|
9
|
Association Analysis between Genetic Variants of elovl5a and elovl5b and Poly-Unsaturated Fatty Acids in Common Carp (Cyprinus carpio). BIOLOGY 2022; 11:biology11030466. [PMID: 35336839 PMCID: PMC8945013 DOI: 10.3390/biology11030466] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/26/2022] [Revised: 03/10/2022] [Accepted: 03/14/2022] [Indexed: 01/14/2023]
Abstract
Simple Summary PUFAs have an essential impact on human health, but their availability constitutes a critical bottleneck in food production. Although fish is the traditional source of PUFAs, it is limited by the stagnation of fisheries. Many studies aim to increase the PUFA products of fish. Genetic markers are efficient in aquaculture breeding. Fatty acid desaturase 2 (fads2) and elongase 5 (elovl5) are the rate-limiting enzymes in the synthesis of PUFAs. The allo-tetraploid common carp is able to biosynthesize endogenous PUFAs. However, selective breeding common carp with high PUFA contents was hindered due to a lack of effective molecular markers. For future breeding common carp capable of producing endogenous PUFAs more effectively, we previously identified the polymorphisms in the coding regions of two duplicated fads2, fads2a and fads2b. However, the polymorphisms in the duplicated elovl5, elovl5a and elovl5b, were not detected. This study screened the genetic variants in the coding regions of elovl5a and elovl5b. Moreover, the joint effects of multiple coding SNPs in fads2b and elovl5b, two major genes regulating the PUFA biosynthesis, were evidenced with the increased explained percentages of the PUFA contents. These polymorphisms in these two genes were used to evaluate the breeding values of PUFAs. These SNPs would be potential markers for future selection to improve the PUFA contents in common carp. Abstract The allo-tetraploid common carp, one widely cultured food fish, is able to produce poly-unsaturated fatty acids (PUFAs). The genetic markers on the PUFA contents for breeding was limited. The polymorphisms in elovl5a and elovl5b, the rate-limiting enzymes in the PUFA biosynthesis, have not been investigated yet. Herein, we identified one coding SNP (cSNP) in elovl5a associated with the content of one PUFA and two cSNPs in elovl5b with the contents of eight PUFAs. The heterozygous genotypes in these three loci were associated with higher contents than the homozygotes. Together with previously identified two associated cSNPs in fads2b, we found the joint effect of these four cSNPs in fads2b and elovl5b on the PUFA contents with the increased explained percentages of PUFA contents. The genotype combinations of more heterozygotes were associated with higher PUFA contents than the other combinations. Using ten genomic selection programs with all cSNPs in fads2b and elovl5b, we obtained the high and positive correlations between the phenotypes and the estimated breeding values of eight PUFAs. These results suggested that elovl5b might be the major gene corresponding to common carp PUFA contents compared with elovl5a. The cSNP combinations in fads2b and elovl5b and the optimal genomic selection program will be used in the future selection breeding to improve the PUFA contents of common carp.
Collapse
|
10
|
Zha J, Li M, Kong R, Lu S, Zhang J. Explaining and Predicting Allostery with Allosteric Database and Modern Analytical Techniques. J Mol Biol 2022; 434:167481. [DOI: 10.1016/j.jmb.2022.167481] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Revised: 01/25/2022] [Accepted: 01/31/2022] [Indexed: 12/17/2022]
|