1
|
Zhou L, Tao C, Shen X, Sun X, Wang J, Yuan Q. Unlocking the potential of enzyme engineering via rational computational design strategies. Biotechnol Adv 2024; 73:108376. [PMID: 38740355 DOI: 10.1016/j.biotechadv.2024.108376] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2023] [Revised: 04/27/2024] [Accepted: 05/08/2024] [Indexed: 05/16/2024]
Abstract
Enzymes play a pivotal role in various industries by enabling efficient, eco-friendly, and sustainable chemical processes. However, the low turnover rates and poor substrate selectivity of enzymes limit their large-scale applications. Rational computational enzyme design, facilitated by computational algorithms, offers a more targeted and less labor-intensive approach. There has been notable advancement in employing rational computational protein engineering strategies to overcome these issues, it has not been comprehensively reviewed so far. This article reviews recent developments in rational computational enzyme design, categorizing them into three types: structure-based, sequence-based, and data-driven machine learning computational design. Case studies are presented to demonstrate successful enhancements in catalytic activity, stability, and substrate selectivity. Lastly, the article provides a thorough analysis of these approaches, highlights existing challenges and potential solutions, and offers insights into future development directions.
Collapse
Affiliation(s)
- Lei Zhou
- State Key Laboratory of Chemical Resource Engineering, Beijing University of Chemical Technology, Beijing 100029, China
| | - Chunmeng Tao
- State Key Laboratory of Chemical Resource Engineering, Beijing University of Chemical Technology, Beijing 100029, China
| | - Xiaolin Shen
- State Key Laboratory of Chemical Resource Engineering, Beijing University of Chemical Technology, Beijing 100029, China
| | - Xinxiao Sun
- State Key Laboratory of Chemical Resource Engineering, Beijing University of Chemical Technology, Beijing 100029, China
| | - Jia Wang
- State Key Laboratory of Chemical Resource Engineering, Beijing University of Chemical Technology, Beijing 100029, China.
| | - Qipeng Yuan
- State Key Laboratory of Chemical Resource Engineering, Beijing University of Chemical Technology, Beijing 100029, China.
| |
Collapse
|
2
|
Ouyang X, Liu G, Guo L, Wu G, Xu P, Zhao YL, Tang H. A multifunctional flavoprotein monooxygenase HspB for hydroxylation and C-C cleavage of 6-hydroxy-3-succinoyl-pyridine. Appl Environ Microbiol 2024; 90:e0225523. [PMID: 38415602 PMCID: PMC10952382 DOI: 10.1128/aem.02255-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Accepted: 01/26/2024] [Indexed: 02/29/2024] Open
Abstract
Flavoprotein monooxygenases catalyze reactions, including hydroxylation and epoxidation, involved in the catabolism, detoxification, and biosynthesis of natural substrates and industrial contaminants. Among them, the 6-hydroxy-3-succinoyl-pyridine (HSP) monooxygenase (HspB) from Pseudomonas putida S16 facilitates the hydroxylation and C-C bond cleavage of the pyridine ring in nicotine. However, the mechanism for biodegradation remains elusive. Here, we refined the crystal structure of HspB and elucidated the detailed mechanism behind the oxidative hydroxylation and C-C cleavage processes. Leveraging structural information about domains for binding the cofactor flavin adenine dinucleotide (FAD) and HSP substrate, we used molecular dynamics simulations and quantum/molecular mechanics calculations to demonstrate that the transfer of an oxygen atom from the reactive FAD peroxide species (C4a-hydroperoxyflavin) to the C3 atom in the HSP substrate constitutes a rate-limiting step, with a calculated reaction barrier of about 20 kcal/mol. Subsequently, the hydrogen atom was rebounded to the FAD cofactor, forming C4a-hydroxyflavin. The residue Cys218 then catalyzed the subsequent hydrolytic process of C-C cleavage. Our findings contribute to a deeper understanding of the versatile functions of flavoproteins in the natural transformation of pyridine and HspB in nicotine degradation.IMPORTANCEPseudomonas putida S16 plays a pivotal role in degrading nicotine, a toxic pyridine derivative that poses significant environmental challenges. This study highlights a key enzyme, HspB (6-hydroxy-3-succinoyl-pyridine monooxygenase), in breaking down nicotine through the pyrrolidine pathway. Utilizing dioxygen and a flavin adenine dinucleotide cofactor, HspB hydroxylates and cleaves the substrate's side chain. Structural analysis of the refined HspB crystal structure, combined with state-of-the-art computations, reveals its distinctive mechanism. The crucial function of Cys218 was never discovered in its homologous enzymes. Our findings not only deepen our understanding of bacterial nicotine degradation but also open avenues for applications in both environmental cleanup and pharmaceutical development.
Collapse
Affiliation(s)
- Xingyu Ouyang
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic and Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Gongquan Liu
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic and Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Lihua Guo
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic and Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Geng Wu
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic and Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Ping Xu
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic and Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Yi-Lei Zhao
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic and Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Hongzhi Tang
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic and Developmental Sciences, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| |
Collapse
|
3
|
Edholm F, Nandy A, Reinhardt CR, Kastner DW, Kulik HJ. Protein3D: Enabling analysis and extraction of metal-containing sites from the Protein Data Bank with molSimplify. J Comput Chem 2024; 45:352-361. [PMID: 37873926 DOI: 10.1002/jcc.27242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Revised: 09/27/2023] [Accepted: 10/03/2023] [Indexed: 10/25/2023]
Abstract
Metalloenzymes catalyze a wide range of chemical transformations, with the active site residues playing a key role in modulating chemical reactivity and selectivity. Unlike smaller synthetic catalysts, a metalloenzyme active site is embedded in a larger protein, which makes interrogation of electronic properties and geometric features with quantum mechanical calculations challenging. Here we implement the ability to fetch crystallographic structures from the Protein Data Bank and analyze the metal binding sites in the program molSimplify. We show the usefulness of the newly created protein3D class to extract the local environment around non-heme iron enzymes containing a two histidine motif and prepare 372 structures for quantum mechanical calculations. Our implementation of protein3D serves to expand the range of systems molSimplify can be used to analyze and will enable high-throughput study of metal-containing active sites in proteins.
Collapse
Affiliation(s)
- Freya Edholm
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| | - Aditya Nandy
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| | - Clorice R Reinhardt
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| | - David W Kastner
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| | - Heather J Kulik
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| |
Collapse
|
4
|
Jiang Y, Ding N, Shao Q, Stull SL, Cheng Z, Yang ZJ. Substrate Positioning Dynamics Involves a Non-Electrostatic Component to Mediate Catalysis. J Phys Chem Lett 2023; 14:11480-11489. [PMID: 38085952 PMCID: PMC11211065 DOI: 10.1021/acs.jpclett.3c02444] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2023]
Abstract
Substrate positioning dynamics (SPD) orients the substrate in the active site, thereby influencing catalytic efficiency. However, it remains unknown whether SPD effects originate primarily from electrostatic perturbation inside the enzyme or can independently mediate catalysis with a significant non-electrostatic component. In this work, we investigated how the non-electrostatic component of SPD affects transition state (TS) stabilization. Using high-throughput enzyme modeling, we selected Kemp eliminase variants with similar electrostatics inside the enzyme but significantly different SPD. The kinetic parameters of these mutants were experimentally characterized. We observed a valley-shaped, two-segment linear correlation between the TS stabilization free energy (converted from kinetic parameters) and substrate positioning index (a metric to quantify SPD). The energy varies by approximately 2 kcal/mol. Favorable SPD was observed for the distal mutant R154W, increasing the proportion of reactive conformations and leading to the lowest activation free energy. These results indicate the substantial contribution of the non-electrostatic component of SPD to enzyme catalytic efficiency.
Collapse
Affiliation(s)
- Yaoyukun Jiang
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Ning Ding
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Qianzhen Shao
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Sebastian L. Stull
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Zihao Cheng
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Zhongyue J. Yang
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
- Center for Structural Biology, Vanderbilt University, Nashville, Tennessee 37235, United States
- Vanderbilt Institute of Chemical Biology, Vanderbilt University, Nashville, Tennessee 37235, United States
- Data Science Institute, Vanderbilt University, Nashville, Tennessee 37235, United States
- Department of Chemical and Biomolecular Engineering, Vanderbilt University, Nashville, Tennessee 37235, United States
| |
Collapse
|
5
|
Yang ZJ, Shao Q, Jiang Y, Jurich C, Ran X, Juarez RJ, Yan B, Stull SL, Gollu A, Ding N. Mutexa: A Computational Ecosystem for Intelligent Protein Engineering. J Chem Theory Comput 2023; 19:7459-7477. [PMID: 37828731 PMCID: PMC10653112 DOI: 10.1021/acs.jctc.3c00602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2023] [Indexed: 10/14/2023]
Abstract
Protein engineering holds immense promise in shaping the future of biomedicine and biotechnology. This Review focuses on our ongoing development of Mutexa, a computational ecosystem designed to enable "intelligent protein engineering". In this vision, researchers will seamlessly acquire sequences of protein variants with desired functions as biocatalysts, therapeutic peptides, and diagnostic proteins through a finely-tuned computational machine, akin to Amazon Alexa's role as a versatile virtual assistant. The technical foundation of Mutexa has been established through the development of a database that combines and relates enzyme structures and their respective functions (e.g., IntEnzyDB), workflow software packages that enable high-throughput protein modeling (e.g., EnzyHTP and LassoHTP), and scoring functions that map the sequence-structure-function relationship of proteins (e.g., EnzyKR and DeepLasso). We will showcase the applications of these tools in benchmarking the convergence conditions of enzyme functional descriptors across mutants, investigating protein electrostatics and cavity distributions in SAM-dependent methyltransferases, and understanding the role of nonelectrostatic dynamic effects in enzyme catalysis. Finally, we will conclude by addressing the future steps and fundamental challenges in our endeavor to develop new Mutexa applications that assist the identification of beneficial mutants in protein engineering.
Collapse
Affiliation(s)
- Zhongyue J. Yang
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37235, United States
- Vanderbilt
Institute of Chemical Biology, Vanderbilt
University, Nashville, Tennessee 37235, United States
- Department
of Chemical and Biomolecular Engineering, Vanderbilt University, Nashville, Tennessee 37235, United States
- Data
Science Institute, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Qianzhen Shao
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Yaoyukun Jiang
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Christopher Jurich
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
- Vanderbilt
Institute of Chemical Biology, Vanderbilt
University, Nashville, Tennessee 37235, United States
| | - Xinchun Ran
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Reecan J. Juarez
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
- Chemical
and Physical Biology Program, Vanderbilt
University, Nashville, Tennessee 37235, United States
| | - Bailu Yan
- Department
of Biostatistics, Vanderbilt University, Nashville, Tennessee 37205, United States
| | - Sebastian L. Stull
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Anvita Gollu
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Ning Ding
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
| |
Collapse
|
6
|
Coimbra JTS, Fernandes PA, Ramos MJ. Deciphering the Catalytic Mechanism of Virginiamycin B Lyase with Multiscale Methods and Molecular Dynamics Simulations. J Chem Inf Model 2023; 63:6354-6365. [PMID: 37791530 DOI: 10.1021/acs.jcim.3c00962] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/05/2023]
Abstract
Due to the emergence of antibiotic resistance, the need to explore novel antibiotics and/or novel strategies to counter antibiotic resistance is of utmost importance. In this work, we explored the molecular and mechanistic details of the degradation of a streptogramin B antibiotic by virginiamycin B (Vgb) lyase of Staphylococcus aureus using classical molecular dynamics simulations and multiscale quantum mechanics/molecular mechanics methods. Our results were in line with available experimental kinetic information. Although we were able to identify a stepwise mechanism, in the wild-type enzyme, the intermediate is short-lived, showing a small barrier to decay to the product state. The impact of point mutations on the reaction was also assessed, showing not only the importance of active site residues to the reaction catalyzed by Vgb lyase but also of near positive and negative residues surrounding the active site. Using molecular dynamics simulations, we also predicted the most likely protonation state of the 3-hydroxypicolinic moiety of the antibiotic and the impact of mutants on antibiotic binding. All this information will expand our understanding of linearization reactions of cyclic antibiotics, which are crucial for the development of novel strategies that aim to tackle antibiotic resistance.
Collapse
Affiliation(s)
- João T S Coimbra
- LAQV, REQUIMTE, Departamento de Química e Bioquímica, Faculdade de Ciências Universidade do Porto, Rua do Campo Alegre, s/n, 4169-007 Porto, Portugal
| | - Pedro A Fernandes
- LAQV, REQUIMTE, Departamento de Química e Bioquímica, Faculdade de Ciências Universidade do Porto, Rua do Campo Alegre, s/n, 4169-007 Porto, Portugal
| | - Maria J Ramos
- LAQV, REQUIMTE, Departamento de Química e Bioquímica, Faculdade de Ciências Universidade do Porto, Rua do Campo Alegre, s/n, 4169-007 Porto, Portugal
| |
Collapse
|
7
|
Shao Q, Jiang Y, Yang ZJ. EnzyHTP Computational Directed Evolution with Adaptive Resource Allocation. J Chem Inf Model 2023; 63:5650-5659. [PMID: 37611241 PMCID: PMC11211066 DOI: 10.1021/acs.jcim.3c00618] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/25/2023]
Abstract
Directed evolution facilitates enzyme engineering via iterative rounds of mutagenesis. Despite the wide applications of high-throughput screening, building "smart libraries" to effectively identify beneficial variants remains a major challenge in the community. Here, we developed a new computational directed evolution protocol based on EnzyHTP, a software that we have previously reported to automate enzyme modeling. To enhance the throughput efficiency, we implemented an adaptive resource allocation strategy that dynamically allocates different types of computing resources (e.g., GPU/CPU) based on the specific need of an enzyme modeling subtask in the workflow. We implemented the strategy as a Python library and tested the library using fluoroacetate dehalogenase as a model enzyme. The results show that compared to fixed resource allocation where both CPU and GPU are on-call for use during the entire workflow, applying adaptive resource allocation can save 87% CPU hours and 14% GPU hours. Furthermore, we constructed a computational directed evolution protocol under the framework of adaptive resource allocation. The workflow was tested against two rounds of mutational screening in the directed evolution experiments of Kemp eliminase (KE07) with a total of 184 mutants. Using folding stability and electrostatic stabilization energy as computational readout, we identified all four experimentally observed target variants. Enabled by the workflow, the entire computation task (i.e., 18.4 μs MD and 18,400 QM single-point calculations) completes in 3 days of wall-clock time using ∼30 GPUs and ∼1000 CPUs.
Collapse
Affiliation(s)
- Qianzhen Shao
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Yaoyukun Jiang
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Zhongyue J. Yang
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
- Center for Structural Biology, Vanderbilt University, Nashville, Tennessee 37235, United States
- Vanderbilt Institute of Chemical Biology, Vanderbilt University, Nashville, Tennessee 37235, United States
- Data Science Institute, Vanderbilt University, Nashville, Tennessee 37235, United States
- Department of Chemical and Biomolecular Engineering, Vanderbilt University, Nashville, Tennessee 37235, United States
| |
Collapse
|
8
|
Jurich C, Yang ZJ. High-throughput computational investigation of protein electrostatics and cavity for SAM-dependent methyltransferases. Protein Sci 2023; 32:e4690. [PMID: 37278582 PMCID: PMC10273352 DOI: 10.1002/pro.4690] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2023] [Revised: 04/25/2023] [Accepted: 05/29/2023] [Indexed: 06/07/2023]
Abstract
S-adenosyl methionine (SAM)-dependent methyl transferases (MTases) are a ubiquitous class of enzymes catalyzing dozens of essential life processes. Despite targeting a large space of substrates with diverse intrinsic reactivity, SAM MTases have similar catalytic efficiency. While understanding of MTase mechanism has grown tremendously through the integration of structural characterization, kinetic assays, and multiscale simulations, it remains elusive how these enzymes have evolved to fit the diverse chemical needs of their respective substrates. In this work, we performed a high-throughput molecular modeling analysis of 91 SAM MTases to better understand how their properties (i.e., electric field [EF] strength and active site volumes) help achieve similar catalytic efficiency toward substrates of different reactivity. We found that EF strengths have largely adjusted to make the target atom a better methyl acceptor. For MTases that target RNA/DNA and histone proteins, our results suggest that EF strength accommodates formal hybridization state and variation in cavity volume trends with diversity of substrate classes. Metal ions in SAM MTases contribute negatively to EF strength for methyl donation and enzyme scaffolds tend to offset these contributions.
Collapse
Affiliation(s)
| | - Zhongyue J. Yang
- Department of ChemistryVanderbilt UniversityNashvilleTennesseeUSA
- Center for Structural BiologyVanderbilt UniversityNashvilleTennesseeUSA
- Vanderbilt Institute of Chemical Biology, Vanderbilt UniversityNashvilleTennesseeUSA
- Data Science InstituteVanderbilt UniversityNashvilleTennesseeUSA
- Department of Chemical and Biomolecular EngineeringVanderbilt UniversityNashvilleTennesseeUSA
| |
Collapse
|
9
|
Crawford B, Timalsina U, Quach CD, Craven NC, Gilmer JB, McCabe C, Cummings PT, Potoff JJ. MoSDeF-GOMC: Python Software for the Creation of Scientific Workflows for the Monte Carlo Simulation Engine GOMC. J Chem Inf Model 2023; 63:1218-1228. [PMID: 36791286 DOI: 10.1021/acs.jcim.2c01498] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/17/2023]
Abstract
MoSDeF-GOMC is a python interface for the Monte Carlo software GOMC to the Molecular Simulation Design Framework (MoSDeF) ecosystem. MoSDeF-GOMC automates the process of generating initial coordinates, assigning force field parameters, and writing coordinate (PDB), connectivity (PSF), force field parameter, and simulation control files. The software lowers entry barriers for novice users while allowing advanced users to create complex workflows that encapsulate simulation setup, execution, and data analysis in a single script. All relevant simulation parameters are encoded within the workflow, ensuring reproducible simulations. MoSDeF-GOMC's capabilities are illustrated through a number of examples, including prediction of the adsorption isotherm for CO2 in IRMOF-1, free energies of hydration for neon and radon over a broad temperature range, and the vapor-liquid coexistence curve of a four-component surrogate for the jet fuel S-8. The MoSDeF-GOMC software is available on GitHub at https://github.com/GOMC-WSU/MoSDeF-GOMC.
Collapse
Affiliation(s)
- Brad Crawford
- Department of Chemical Engineering, Wayne State University, Detroit, Michigan 48202-4050, United States
| | - Umesh Timalsina
- Institute for Software Integrated Systems (ISIS), Vanderbilt University, Nashville, Tennessee 37212, United States
| | - Co D Quach
- Department of Chemical and Biomolecular Engineering, Vanderbilt University, Nashville, Tennessee 37235-1604, United States.,Multiscale Modeling and Simulation (MuMS) Center, Vanderbilt University, Nashville, Tennessee 37212, United States
| | - Nicholas C Craven
- Multiscale Modeling and Simulation (MuMS) Center, Vanderbilt University, Nashville, Tennessee 37212, United States.,Interdisciplinary Material Science Program, Vanderbilt University, Nashville, Tennessee 37235-0106, United States
| | - Justin B Gilmer
- Multiscale Modeling and Simulation (MuMS) Center, Vanderbilt University, Nashville, Tennessee 37212, United States.,Interdisciplinary Material Science Program, Vanderbilt University, Nashville, Tennessee 37235-0106, United States
| | - Clare McCabe
- Department of Chemical and Biomolecular Engineering, Vanderbilt University, Nashville, Tennessee 37235-1604, United States.,Multiscale Modeling and Simulation (MuMS) Center, Vanderbilt University, Nashville, Tennessee 37212, United States
| | - Peter T Cummings
- Department of Chemical and Biomolecular Engineering, Vanderbilt University, Nashville, Tennessee 37235-1604, United States.,Multiscale Modeling and Simulation (MuMS) Center, Vanderbilt University, Nashville, Tennessee 37212, United States
| | - Jeffrey J Potoff
- Department of Chemical Engineering, Wayne State University, Detroit, Michigan 48202-4050, United States
| |
Collapse
|
10
|
Juarez RJ, Jiang Y, Tremblay M, Shao Q, Link AJ, Yang ZJ. LassoHTP: A High-Throughput Computational Tool for Lasso Peptide Structure Construction and Modeling. J Chem Inf Model 2023; 63:522-530. [PMID: 36594886 PMCID: PMC10117200 DOI: 10.1021/acs.jcim.2c00945] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Lasso peptides are a subclass of ribosomally synthesized and post-translationally modified peptides with a slipknot conformation. With superior thermal stability, protease resistance, and antimicrobial activity, lasso peptides are promising candidates for bioengineering and pharmaceutical applications. To enable high-throughput computational prediction and design of lasso peptides, we developed a software, LassoHTP, for automatic lasso peptide structure construction and modeling. LassoHTP consists of three modules, including the scaffold constructor, mutant generator, and molecular dynamics (MD) simulator. With a user-provided sequence and conformational annotation, LassoHTP can either generate the structure and conformational ensemble as is or conduct random mutagenesis. We used LassoHTP to construct eight known lasso peptide structures de novo and to simulate their conformational ensembles for 100 ns MD simulations. For benchmarking, we calculated the root mean square deviation (RMSD) of these ensembles with reference to their experimental crystal or NMR PDB structures; we also compared these RMSD values against those of the MD ensembles that are initiated from the PDB structures. Dihedral principal component analysis was also conducted. The results show that the LassoHTP-initiated ensembles are similar to those of the PDB-initiated ensembles. LassoHTP offers a computational platform to develop strategies for lasso peptide prediction and design.
Collapse
Affiliation(s)
- Reecan J. Juarez
- Chemical and Physical Biology Program, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Yaoyukun Jiang
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Matthew Tremblay
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Qianzhen Shao
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - A. James Link
- Department of Chemical and Biological Engineering, Chemistry and Molecular Biology, Princeton University, 207 Hoyt Laboratory, Princeton, New Jersey 08544, United States
| | - Zhongyue J. Yang
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
- Center for Structural Biology, Vanderbilt University, Nashville, Tennessee 37235, United States
- Vanderbilt Institute of Chemical Biology, Vanderbilt University, Nashville, Tennessee 37235, United States
- Data Science Institute, Vanderbilt University, Nashville, Tennessee 37235, United States
- Department of Chemical and Biomolecular Engineering, Vanderbilt University, Nashville, Tennessee 37235, United States
| |
Collapse
|
11
|
Jiang Y, Ran X, Yang ZJ. Data-driven enzyme engineering to identify function-enhancing enzymes. Protein Eng Des Sel 2023; 36:gzac009. [PMID: 36214500 PMCID: PMC10365845 DOI: 10.1093/protein/gzac009] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Revised: 08/08/2022] [Accepted: 09/28/2022] [Indexed: 01/22/2023] Open
Abstract
Identifying function-enhancing enzyme variants is a 'holy grail' challenge in protein science because it will allow researchers to expand the biocatalytic toolbox for late-stage functionalization of drug-like molecules, environmental degradation of plastics and other pollutants, and medical treatment of food allergies. Data-driven strategies, including statistical modeling, machine learning, and deep learning, have largely advanced the understanding of the sequence-structure-function relationships for enzymes. They have also enhanced the capability of predicting and designing new enzymes and enzyme variants for catalyzing the transformation of new-to-nature reactions. Here, we reviewed the recent progresses of data-driven models that were applied in identifying efficiency-enhancing mutants for catalytic reactions. We also discussed existing challenges and obstacles faced by the community. Although the review is by no means comprehensive, we hope that the discussion can inform the readers about the state-of-the-art in data-driven enzyme engineering, inspiring more joint experimental-computational efforts to develop and apply data-driven modeling to innovate biocatalysts for synthetic and pharmaceutical applications.
Collapse
Affiliation(s)
- Yaoyukun Jiang
- Department of Chemistry, Vanderbilt University, Nashville, TN 37235, USA
| | - Xinchun Ran
- Department of Chemistry, Vanderbilt University, Nashville, TN 37235, USA
| | - Zhongyue J Yang
- Department of Chemistry, Vanderbilt University, Nashville, TN 37235, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN 37235, USA
- Vanderbilt Institute of Chemical Biology, Vanderbilt University, Nashville, TN 37235, USA
- Data Science Institute, Vanderbilt University, Nashville, TN 37235, USA
- Department of Chemical and Biomolecular Engineering, Vanderbilt University, Nashville, TN 37235, USA
| |
Collapse
|
12
|
Jiang Y, Stull SL, Shao Q, Yang ZJ. Convergence in determining enzyme functional descriptors across Kemp eliminase variants. ELECTRONIC STRUCTURE (BRISTOL, ENGLAND) 2022; 4:044007. [PMID: 37425623 PMCID: PMC10327861 DOI: 10.1088/2516-1075/acad51] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/11/2023]
Abstract
Molecular simulations have been extensively employed to accelerate biocatalytic discoveries. Enzyme functional descriptors derived from molecular simulations have been leveraged to guide the search for beneficial enzyme mutants. However, the ideal active-site region size for computing the descriptors over multiple enzyme variants remains untested. Here, we conducted convergence tests for dynamics-derived and electrostatic descriptors on 18 Kemp eliminase variants across six active-site regions with various boundary distances to the substrate. The tested descriptors include the root-mean-square deviation of the active-site region, the solvent accessible surface area ratio between the substrate and active site, and the projection of the electric field (EF) on the breaking C-H bond. All descriptors were evaluated using molecular mechanics methods. To understand the effects of electronic structure, the EF was also evaluated using quantum mechanics/molecular mechanics methods. The descriptor values were computed for 18 Kemp eliminase variants. Spearman correlation matrices were used to determine the region size condition under which further expansion of the region boundary does not substantially change the ranking of descriptor values. We observed that protein dynamics-derived descriptors, including RMSDactive_site and SASAratio, converge at a distance cutoff of 5 Å from the substrate. The electrostatic descriptor, EFC-H, converges at 6 Å using molecular mechanics methods with truncated enzyme models and 4 Å using quantum mechanics/molecular mechanics methods with whole enzyme model. This study serves as a future reference to determine descriptors for predictive modeling of enzyme engineering.
Collapse
Affiliation(s)
- Yaoyukun Jiang
- Department of Chemistry, Vanderbilt University, Nashville, TN 37235, United States of America
| | - Sebastian L Stull
- Department of Chemistry, Vanderbilt University, Nashville, TN 37235, United States of America
| | - Qianzhen Shao
- Department of Chemistry, Vanderbilt University, Nashville, TN 37235, United States of America
| | - Zhongyue J Yang
- Department of Chemistry, Vanderbilt University, Nashville, TN 37235, United States of America
- Center for Structural Biology, Vanderbilt University, Nashville, TN 37235, United States of America
- Vanderbilt Institute of Chemical Biology, Vanderbilt University, Nashville, TN 37235, United States of America
- Data Science Institute, Vanderbilt University, Nashville, TN 37235, United States of America
- Department of Chemical and Biomolecular Engineering, Vanderbilt University, Nashville, TN 37235, United States of America
| |
Collapse
|
13
|
Yan B, Ran X, Gollu A, Cheng Z, Zhou X, Chen Y, Yang ZJ. IntEnzyDB: an Integrated Structure-Kinetics Enzymology Database. J Chem Inf Model 2022; 62:5841-5848. [PMID: 36286319 DOI: 10.1021/acs.jcim.2c01139] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Data-driven modeling has emerged as a new paradigm for biocatalyst design and discovery. Biocatalytic databases that integrate enzyme structure and function data are in urgent need. Here we describe IntEnzyDB as an integrated structure-kinetics database for facile statistical modeling and machine learning. IntEnzyDB employs a relational database architecture with a flattened data structure, which allows rapid data operation. This architecture also makes it easy for IntEnzyDB to incorporate more types of enzyme function data. IntEnzyDB contains enzyme kinetics and structure data from six enzyme commission classes. Using 1050 enzyme structure-kinetics pairs, we investigated the efficiency-perturbing propensities of mutations that are close or distal to the active site. The statistical results show that efficiency-enhancing mutations are globally encoded and that deleterious mutations are much more likely to occur in close mutations than in distal mutations. Finally, we describe a web interface that allows public users to access enzymology data stored in IntEnzyDB. IntEnzyDB will provide a computational facility for data-driven modeling in biocatalysis and molecular evolution.
Collapse
Affiliation(s)
- Bailu Yan
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States.,Department of Biostatistics, Vanderbilt University, Nashville, Tennessee 37205, United States
| | - Xinchun Ran
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Anvita Gollu
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Zihao Cheng
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Xiang Zhou
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Yiwen Chen
- Data Science Institute, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Zhongyue J Yang
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States.,Center for Structural Biology, Vanderbilt University, Nashville, Tennessee 37235, United States.,Vanderbilt Institute of Chemical Biology, Vanderbilt University, Nashville, Tennessee 37235, United States.,Data Science Institute, Vanderbilt University, Nashville, Tennessee 37235, United States.,Department of Chemical and Biomolecular Engineering, Vanderbilt University, Nashville, Tennessee 37205, United States
| |
Collapse
|
14
|
Feng S, Li Y, Zhang R, Zhang Q, Wang W. Origin of metabolites diversity and selectivity of P450 catalyzed benzo[a]pyrene metabolic activation. JOURNAL OF HAZARDOUS MATERIALS 2022; 435:129008. [PMID: 35490637 DOI: 10.1016/j.jhazmat.2022.129008] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Revised: 04/19/2022] [Accepted: 04/22/2022] [Indexed: 06/14/2023]
Abstract
Polycyclic Aromatic Hydrocarbon (PAHs) presents one of the most abundant class of environmental pollutants. Recent study shows a lab-synthesized PAHs derivative, helicenium, can selectively kill cancer cells rather than normal cells, calling for the in-depth understanding of the metabolic process. However, the origin of metabolites diversity and selectivity of P450 catalyzed PAHs metabolic activation is still unclear to a great extent. Here we systematically investigated P450 enzymes catalyzed activation mechanism of a representative PAHs, benzo[a]pyrene (BaP), and found the corresponding activation process mainly involves two elementary steps: electrophilic addition and epoxidation. Electrophilic addition step is evidenced to be rate determining step. Two representative binding modes of BaP with P450 were found, which enables the electrophilic addition of Heme (FeO) to almost all the carbons of BaP. This electrophilic addition was proposed to be accelerated by the P450 enzyme environment when compared with the gas phase and water solvent. To dig deeper on the origin of metabolites diversity, we built several linear regression models to explore the structural-energy relationships. The selectivity was eventually attributed to the integrated effects of structural (e.g. O-C distance and O-C-Fe angle) and electrostatic parameters (e.g. charge of C and O) from both BaP and P450.
Collapse
Affiliation(s)
- Shanshan Feng
- Environment Research Institute, Shandong University, Qingdao 266237, PR China
| | - Yanwei Li
- Environment Research Institute, Shandong University, Qingdao 266237, PR China.
| | - Ruiming Zhang
- Environment Research Institute, Shandong University, Qingdao 266237, PR China
| | - Qingzhu Zhang
- Environment Research Institute, Shandong University, Qingdao 266237, PR China
| | - Wenxing Wang
- Environment Research Institute, Shandong University, Qingdao 266237, PR China
| |
Collapse
|
15
|
Zhou J, Zhang X, Li Y, Feng S, Zhang Q, Wang W. Endocrine-disrupting metabolic activation of 2-nitrofluorene catalyzed by human cytochrome P450 1A1: A QM/MM approach. ENVIRONMENT INTERNATIONAL 2022; 166:107355. [PMID: 35751956 DOI: 10.1016/j.envint.2022.107355] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Revised: 05/25/2022] [Accepted: 06/14/2022] [Indexed: 06/15/2023]
Abstract
Nitropolycyclic aromatic hydrocarbons (NPAHs) present one of the most important airborne pollutants. Recent studies have shown that one of the most abundant NPAHs, 2-Nitrofluorene (NF), was supposed to be converted to endocrine-disrupting metabolites by cytochrome P450 1A1 (CYP1A1) in human cells. However, the mechanism is still largely unexplored. Here the metabolic activation and transformation mechanism of NF catalyzed by CYP1A1 were systematically studied with the aid of Molecular Dynamics, Density Functional Theory and Quantum Mechanics/Molecular Mechanics techniques. We evidence that CYP1A1 can activate NF through two elementary processes: (i) electrophilic addition (12.4 kcal·mol-1) or hydrogen abstraction (38.2 kcal·mol-1) and (ii) epoxidation (5.9 and 8.7 kcal·mol-1) or NIH shift (12.5 and 14.9 kcal·mol-1) or proton shuttle (12.1 kcal·mol-1). Electrophilic addition was found to be the rate-determining step while epoxidation rather than NIH shift or proton shuttle is the more feasible pathway after electrophilic addition. Metabolites 6,7-epoxide-2-nitrofluorene and 7,8-epoxide-2-nitrofluorene were identified as the major epoxidation products. Epoxides are unstable and easy to react with hydrated hydrogen ions and hydroxyls to produce endocrine disrupter 7-hydroxy-2-nitrofluorene. Toxic analysis shows that some of the metabolites are more toxic to model aquatic organisms (e.g. Green algea) than NF. Binding affinity analysis to human sex hormone binding globulin reveals that NF metabolites all have endocrine-disrupting potential. This study provides a comprehensive understanding on the biotransformation process of NF and may aid future studies on various NPAHs activation catalyzed by human P450 enzyme.
Collapse
Affiliation(s)
- Junhua Zhou
- Environment Research Institute, Shandong University, Qingdao 266237, PR China; School of Environmental Science and Engineering, Shandong University, Qingdao 266237, PR China
| | - Xin Zhang
- Environment Research Institute, Shandong University, Qingdao 266237, PR China; School of Environmental Science and Engineering, Shandong University, Qingdao 266237, PR China
| | - Yanwei Li
- Environment Research Institute, Shandong University, Qingdao 266237, PR China.
| | - Shanshan Feng
- Environment Research Institute, Shandong University, Qingdao 266237, PR China
| | - Qingzhu Zhang
- Environment Research Institute, Shandong University, Qingdao 266237, PR China
| | - Wenxing Wang
- Environment Research Institute, Shandong University, Qingdao 266237, PR China
| |
Collapse
|
16
|
Jiang Y, Yan B, Chen Y, Juarez RJ, Yang ZJ. Molecular Dynamics-Derived Descriptor Informs the Impact of Mutation on the Catalytic Turnover Number in Lactonase Across Substrates. J Phys Chem B 2022; 126:2486-2495. [PMID: 35324218 DOI: 10.1021/acs.jpcb.2c00142] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Molecular dynamics simulations have been extensively employed to reveal the roles of protein dynamics in mediating enzyme catalysis. However, simulation-derived predictive descriptors that inform the impacts of mutations on catalytic turnover numbers remain largely unexplored. In this work, we report the identification of molecular modeling-derived descriptors to predict mutation effect on the turnover number of lactonase SsoPox with both native and non-native substrates. The study consists of 10 enzyme-substrate complexes resulting from a combination of five enzyme variants with two substrates. For each complex, we derived 15 descriptors from molecular dynamics simulations and applied principal component analysis to rank the predictive capability of the descriptors. A top-ranked descriptor was identified, which is the solvent-accessible surface area (SASA) ratio of the substrate to the active site pocket. A uniform volcano-shaped plot was observed in the distribution of experimental activation free energy against the SASA ratio. To achieve efficient lactonase hydrolysis, a non-native substrate-bound enzyme variant needs to involve a similar range of the SASA ratio to the native substrate-bound wild-type enzyme. The descriptor reflects how well the enzyme active site pocket accommodates a substrate for reaction, which has the potential of guiding optimization of enzyme reaction turnover for non-native chemical transformations.
Collapse
Affiliation(s)
- Yaoyukun Jiang
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Bailu Yan
- Department of Biostatistics, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Yu Chen
- School of Chemistry and Chemical Engineering, Nanjing University, Nanjing, Jiangsu 210023, China
| | - Reecan J Juarez
- Chemical and Physical Biology Program, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Zhongyue J Yang
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States.,Center for Structural Biology, Vanderbilt University, Nashville, Tennessee 37235, United States.,Vanderbilt Institute of Chemical Biology, Vanderbilt University, Nashville, Tennessee 37235, United States.,Data Science Institute, Vanderbilt University, Nashville, Tennessee 37235, United States
| |
Collapse
|