1
|
Planas-Iglesias J, Borko S, Swiatkowski J, Elias M, Havlasek M, Salamon O, Grakova E, Kunka A, Martinovic T, Damborsky J, Martinovic J, Bednar D. AggreProt: a web server for predicting and engineering aggregation prone regions in proteins. Nucleic Acids Res 2024; 52:W159-W169. [PMID: 38801076 PMCID: PMC11223854 DOI: 10.1093/nar/gkae420] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2024] [Revised: 04/23/2024] [Accepted: 05/13/2024] [Indexed: 05/29/2024] Open
Abstract
Recombinant proteins play pivotal roles in numerous applications including industrial biocatalysts or therapeutics. Despite the recent progress in computational protein structure prediction, protein solubility and reduced aggregation propensity remain challenging attributes to design. Identification of aggregation-prone regions is essential for understanding misfolding diseases or designing efficient protein-based technologies, and as such has a great socio-economic impact. Here, we introduce AggreProt, a user-friendly webserver that automatically exploits an ensemble of deep neural networks to predict aggregation-prone regions (APRs) in protein sequences. Trained on experimentally evaluated hexapeptides, AggreProt compares to or outperforms state-of-the-art algorithms on two independent benchmark datasets. The server provides per-residue aggregation profiles along with information on solvent accessibility and transmembrane propensity within an intuitive interface with interactive sequence and structure viewers for comprehensive analysis. We demonstrate AggreProt efficacy in predicting differential aggregation behaviours in proteins on several use cases, which emphasize its potential for guiding protein engineering strategies towards decreased aggregation propensity and improved solubility. The webserver is freely available and accessible at https://loschmidt.chemi.muni.cz/aggreprot/.
Collapse
Affiliation(s)
- Joan Planas-Iglesias
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
| | - Simeon Borko
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
| | - Jan Swiatkowski
- IT4Innovations, VSB – Technical University of Ostrava, 17. listopadu 2172/15, 708 00 Ostrava-Poruba, Czech Republic
| | - Matej Elias
- IT4Innovations, VSB – Technical University of Ostrava, 17. listopadu 2172/15, 708 00 Ostrava-Poruba, Czech Republic
| | - Martin Havlasek
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
| | - Ondrej Salamon
- IT4Innovations, VSB – Technical University of Ostrava, 17. listopadu 2172/15, 708 00 Ostrava-Poruba, Czech Republic
| | - Ekaterina Grakova
- IT4Innovations, VSB – Technical University of Ostrava, 17. listopadu 2172/15, 708 00 Ostrava-Poruba, Czech Republic
| | - Antonín Kunka
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
| | - Tomas Martinovic
- IT4Innovations, VSB – Technical University of Ostrava, 17. listopadu 2172/15, 708 00 Ostrava-Poruba, Czech Republic
| | - Jiri Damborsky
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
| | - Jan Martinovic
- IT4Innovations, VSB – Technical University of Ostrava, 17. listopadu 2172/15, 708 00 Ostrava-Poruba, Czech Republic
| | - David Bednar
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
| |
Collapse
|
2
|
Narayanan KK, Weigle AT, Xu L, Mi X, Zhang C, Chen LQ, Procko E, Shukla D. Deep mutational scanning reveals sequence to function constraints for SWEET family transporters. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.28.601307. [PMID: 39005363 PMCID: PMC11244857 DOI: 10.1101/2024.06.28.601307] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/16/2024]
Abstract
Protein science is entering a transformative phase enabled by deep mutational scans that provide an unbiased view of the residue level interactions that mediate function. However, it has yet to be extensively used to characterize the mutational and evolutionary landscapes of plant proteins. Here, we apply the method to explore sequence-function relationships within the sugar transporter AtSWEET13. DMS results describe how mutational interrogation throughout different regions of the protein affects AtSWEET13 abundance and transport function. Our results identify novel transport-enhancing mutations that are validated using the FRET sensor assays. Extending DMS results to phylogenetic analyses reveal the role of transmembrane helix 4 (TM4) which makes the SWEET family transporters distinct from prokaryotic SemiSWEETs. We show that transmembrane helix 4 is intolerant to motif swapping with other clade-specific SWEET TM4 compositions, despite accommodating single point-mutations towards aromatic and charged polar amino acids. We further show that the transfer learning approaches based on physics and ML based In silico variant prediction tools have limited utility for engineering plant proteins as they were unable to reproduce our experimental results. We conclude that DMS can produce datasets which, when combined with the right predictive computational frameworks, can direct plant engineering efforts through derivative phenotype selection and evolutionary insights.
Collapse
Affiliation(s)
- Krishna K. Narayanan
- Department of Biochemistry, University of Illinois Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Austin T. Weigle
- Department of Chemistry, University of Illinois Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Lingyun Xu
- Department of Bioengineering, University of Illinois Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Xuenan Mi
- Center for Biophysics and Quantitative Biology, University of Illinois Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Chen Zhang
- Department of Plant Biology, University of Illinois Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Li-Qing Chen
- Department of Plant Biology, University of Illinois Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Erik Procko
- Department of Biochemistry, University of Illinois Urbana-Champaign, Urbana, Illinois 61801, United States
- Cyrus Biotechnology, Inc., Seattle, Washington 98121, United States
| | - Diwakar Shukla
- Department of Chemical & Biomolecular Engineering; Department of Plant Biology; Department of Bioengineering; Department of Chemistry, Center for Biophysics and Quantitative Biology, University of Illinois Urbana-Champaign, Urbana, Illinois 61801, United States
| |
Collapse
|
3
|
Notin P, Kollasch AW, Ritter D, van Niekerk L, Paul S, Spinner H, Rollins N, Shaw A, Weitzman R, Frazer J, Dias M, Franceschi D, Orenbuch R, Gal Y, Marks DS. ProteinGym: Large-Scale Benchmarks for Protein Design and Fitness Prediction. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.07.570727. [PMID: 38106144 PMCID: PMC10723403 DOI: 10.1101/2023.12.07.570727] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
Predicting the effects of mutations in proteins is critical to many applications, from understanding genetic disease to designing novel proteins that can address our most pressing challenges in climate, agriculture and healthcare. Despite a surge in machine learning-based protein models to tackle these questions, an assessment of their respective benefits is challenging due to the use of distinct, often contrived, experimental datasets, and the variable performance of models across different protein families. Addressing these challenges requires scale. To that end we introduce ProteinGym, a large-scale and holistic set of benchmarks specifically designed for protein fitness prediction and design. It encompasses both a broad collection of over 250 standardized deep mutational scanning assays, spanning millions of mutated sequences, as well as curated clinical datasets providing high-quality expert annotations about mutation effects. We devise a robust evaluation framework that combines metrics for both fitness prediction and design, factors in known limitations of the underlying experimental methods, and covers both zero-shot and supervised settings. We report the performance of a diverse set of over 70 high-performing models from various subfields (eg., alignment-based, inverse folding) into a unified benchmark suite. We open source the corresponding codebase, datasets, MSAs, structures, model predictions and develop a user-friendly website that facilitates data access and analysis.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Ada Shaw
- Applied Mathematics, Harvard University
| | | | | | - Mafalda Dias
- Centre for Genomic Regulation, Universitat Pompeu Fabra
| | | | | | - Yarin Gal
- Computer Science, University of Oxford
| | | |
Collapse
|
4
|
Christensen S, Wernersson C, André I. Facile Method for High-throughput Identification of Stabilizing Mutations. J Mol Biol 2023; 435:168209. [PMID: 37479080 DOI: 10.1016/j.jmb.2023.168209] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Revised: 07/13/2023] [Accepted: 07/13/2023] [Indexed: 07/23/2023]
Abstract
Characterizing the effects of mutations on stability is critical for understanding the function and evolution of proteins and improving their biophysical properties. High throughput folding and abundance assays have been successfully used to characterize missense mutations associated with reduced stability. However, screening for increased thermodynamic stability is more challenging since such mutations are rarer and their impact on assay readout is more subtle. Here, a multiplex assay for high throughput screening of protein folding was developed by combining deep mutational scanning, fluorescence-activated cell sorting, and deep sequencing. By analyzing a library of 2000 variants of Adenylate kinase we demonstrate that the readout of the method correlates with stability and that mutants with up to 13 °C increase in thermal melting temperature could be identified with low false positive rate. The discovery of many stabilizing mutations also enabled the analysis of general substitution patterns associated with increased stability in Adenylate kinase. This high throughput method to identify stabilizing mutations can be combined with functional screens to identify mutations that improve both stability and activity.
Collapse
Affiliation(s)
- Signe Christensen
- Department of Biochemistry and Structural Biology, Lund University, Lund, Sweden
| | - Camille Wernersson
- Department of Biochemistry and Structural Biology, Lund University, Lund, Sweden
| | - Ingemar André
- Department of Biochemistry and Structural Biology, Lund University, Lund, Sweden.
| |
Collapse
|
5
|
Smith MD, Case MA, Makowski EK, Tessier PM. Position-Specific Enrichment Ratio Matrix scores predict antibody variant properties from deep sequencing data. Bioinformatics 2023; 39:btad446. [PMID: 37478351 PMCID: PMC10477941 DOI: 10.1093/bioinformatics/btad446] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Revised: 06/21/2023] [Accepted: 07/20/2023] [Indexed: 07/23/2023] Open
Abstract
MOTIVATION Deep sequencing of antibody and related protein libraries after phage or yeast-surface display sorting is widely used to identify variants with increased affinity, specificity, and/or improvements in key biophysical properties. Conventional approaches for identifying optimal variants typically use the frequencies of observation in enriched libraries or the corresponding enrichment ratios. However, these approaches disregard the vast majority of deep sequencing data and often fail to identify the best variants in the libraries. RESULTS Here, we present a method, Position-Specific Enrichment Ratio Matrix (PSERM) scoring, that uses entire deep sequencing datasets from pre- and post-selections to score each observed protein variant. The PSERM scores are the sum of the site-specific enrichment ratios observed at each mutated position. We find that PSERM scores are much more reproducible and correlate more strongly with experimentally measured properties than frequencies or enrichment ratios, including for multiple antibody properties (affinity and non-specific binding) for a clinical-stage antibody (emibetuzumab). We expect that this method will be broadly applicable to diverse protein engineering campaigns. AVAILABILITY AND IMPLEMENTATION All deep sequencing datasets and code to perform the analyses presented within are available via https://github.com/Tessier-Lab-UMich/PSERM_paper.
Collapse
Affiliation(s)
- Matthew D Smith
- Department of Chemical Engineering, University of Michigan, Ann Arbor, MI 48109-2200, United States
- Biointerfaces Institute, University of Michigan, Ann Arbor, MI 48109-2200, United States
| | - Marshall A Case
- Department of Chemical Engineering, University of Michigan, Ann Arbor, MI 48109-2200, United States
| | - Emily K Makowski
- Biointerfaces Institute, University of Michigan, Ann Arbor, MI 48109-2200, United States
- Department of Pharmaceutical Sciences, University of Michigan, Ann Arbor, MI 48109-2200, United States
| | - Peter M Tessier
- Department of Chemical Engineering, University of Michigan, Ann Arbor, MI 48109-2200, United States
- Biointerfaces Institute, University of Michigan, Ann Arbor, MI 48109-2200, United States
- Department of Pharmaceutical Sciences, University of Michigan, Ann Arbor, MI 48109-2200, United States
- Department of Biomedical Engineering, University of Michigan, Ann Arbor, MI 48109-2200, United States
- Protein Folding Disease Initiative, University of Michigan, Ann Arbor, MI 48109-2200, United States
- Michigan Alzheimer’s Disease Center, University of Michigan, Ann Arbor, MI 48109-2200, United States
| |
Collapse
|
6
|
Smith MD, Case MA, Makowski EK, Tessier PM. Position-Specific Enrichment Ratio Matrix scores predict antibody variant properties from deep sequencing data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.10.548448. [PMID: 37503142 PMCID: PMC10369870 DOI: 10.1101/2023.07.10.548448] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
Motivation Deep sequencing of antibody and related protein libraries after phage or yeast-surface display sorting is widely used to identify variants with increased affinity, specificity and/or improvements in key biophysical properties. Conventional approaches for identifying optimal variants typically use the frequencies of observation in enriched libraries or the corresponding enrichment ratios. However, these approaches disregard the vast majority of deep sequencing data and often fail to identify the best variants in the libraries. Results Here, we present a method, Position-Specific Enrichment Ratio Matrix (PSERM) scoring, that uses entire deep sequencing datasets from pre- and post-selections to score each observed protein variant. The PSERM scores are the sum of the site-specific enrichment ratios observed at each mutated position. We find that PSERM scores are much more reproducible and correlate more strongly with experimentally measured properties than frequencies or enrichment ratios, including for multiple antibody properties (affinity and non-specific binding) for a clinical-stage antibody (emibetuzumab). We expect that this method will be broadly applicable to diverse protein engineering campaigns. Availability All deep sequencing datasets and code to do the analyses presented within are available via GitHub. Contact Peter Tessier, ptessier@umich.edu. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
|
7
|
Evaluation of strategies to narrow the product chain-length distribution of microbially synthesized free fatty acids. Metab Eng 2023; 77:21-31. [PMID: 36863604 DOI: 10.1016/j.ymben.2023.02.012] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Revised: 01/29/2023] [Accepted: 02/28/2023] [Indexed: 03/04/2023]
Abstract
The dominant strategy for tailoring the chain-length distribution of free fatty acids (FFA) synthesized by heterologous hosts is expression of a selective acyl-acyl carrier protein (ACP) thioesterase. However, few of these enzymes can generate a precise (greater than 90% of a desired chain-length) product distribution when expressed in a microbial or plant host. The presence of alternative chain-lengths can complicate purification in situations where blends of fatty acids are not desired. We report the assessment of several strategies for improving the dodecanoyl-ACP thioesterase from the California bay laurel to exhibit more selective production of medium-chain free fatty acids to near exclusivity. We demonstrated that matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-ToF MS) was an effective library screening technique for identification of thioesterase variants with favorable shifts in chain-length specificity. This strategy proved to be a more effective screening technique than several rational approaches discussed herein. With this data, we isolated four thioesterase variants which exhibited a more selective FFA distribution over wildtype when expressed in the fatty acid accumulating E. coli strain, RL08. We then combined mutations from the MALDI isolates to generate BTE-MMD19, a thioesterase variant capable of producing free fatty acids consisting of 90% of C12 products. Of the four mutations which conferred a specificity shift, we noted that three affected the shape of the binding pocket, while one occurred on the positively charged acyl carrier protein landing pad. Finally, we fused the maltose binding protein (MBP) from E. coli to the N - terminus of BTE-MMD19 to improve enzyme solubility and achieve a titer of 1.9 g per L of twelve-carbon fatty acids in a shake flask.
Collapse
|
8
|
Fu Y, Bedő J, Papenfuss AT, Rubin AF. Integrating deep mutational scanning and low-throughput mutagenesis data to predict the impact of amino acid variants. Gigascience 2022; 12:giad073. [PMID: 37721410 PMCID: PMC10506130 DOI: 10.1093/gigascience/giad073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Revised: 07/02/2023] [Accepted: 08/23/2023] [Indexed: 09/19/2023] Open
Abstract
BACKGROUND Evaluating the impact of amino acid variants has been a critical challenge for studying protein function and interpreting genomic data. High-throughput experimental methods like deep mutational scanning (DMS) can measure the effect of large numbers of variants in a target protein, but because DMS studies have not been performed on all proteins, researchers also model DMS data computationally to estimate variant impacts by predictors. RESULTS In this study, we extended a linear regression-based predictor to explore whether incorporating data from alanine scanning (AS), a widely used low-throughput mutagenesis method, would improve prediction results. To evaluate our model, we collected 146 AS datasets, mapping to 54 DMS datasets across 22 distinct proteins. CONCLUSIONS We show that improved model performance depends on the compatibility of the DMS and AS assays, and the scale of improvement is closely related to the correlation between DMS and AS results.
Collapse
Affiliation(s)
- Yunfan Fu
- The Walter and Eliza Hall Institute of Medical Research, Bioinformatics Division, 1G Royal Pde, Parkville, Victoria 3052, Australia
- The University of Melbourne, Department of Medical Biology, Parkville, Victoria 3010, Australia
| | - Justin Bedő
- The Walter and Eliza Hall Institute of Medical Research, Bioinformatics Division, 1G Royal Pde, Parkville, Victoria 3052, Australia
- The University of Melbourne, Department of Medical Biology, Parkville, Victoria 3010, Australia
| | - Anthony T Papenfuss
- The Walter and Eliza Hall Institute of Medical Research, Bioinformatics Division, 1G Royal Pde, Parkville, Victoria 3052, Australia
- The University of Melbourne, Department of Medical Biology, Parkville, Victoria 3010, Australia
- Peter MacCallum Cancer Centre, Melbourne, Victoria 3000, Australia
| | - Alan F Rubin
- The Walter and Eliza Hall Institute of Medical Research, Bioinformatics Division, 1G Royal Pde, Parkville, Victoria 3052, Australia
- The University of Melbourne, Department of Medical Biology, Parkville, Victoria 3010, Australia
| |
Collapse
|
9
|
Beltrán J, Steiner PJ, Bedewitz M, Wei S, Peterson FC, Li Z, Hughes BE, Hartley Z, Robertson NR, Medina-Cucurella AV, Baumer ZT, Leonard AC, Park SY, Volkman BF, Nusinow DA, Zhong W, Wheeldon I, Cutler SR, Whitehead TA. Rapid biosensor development using plant hormone receptors as reprogrammable scaffolds. Nat Biotechnol 2022; 40:1855-1861. [PMID: 35726092 PMCID: PMC9750858 DOI: 10.1038/s41587-022-01364-5] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Accepted: 05/17/2022] [Indexed: 01/14/2023]
Abstract
A general method to generate biosensors for user-defined molecules could provide detection tools for a wide range of biological applications. Here, we describe an approach for the rapid engineering of biosensors using PYR1 (Pyrabactin Resistance 1), a plant abscisic acid (ABA) receptor with a malleable ligand-binding pocket and a requirement for ligand-induced heterodimerization, which facilitates the construction of sense-response functions. We applied this platform to evolve 21 sensors with nanomolar to micromolar sensitivities for a range of small molecules, including structurally diverse natural and synthetic cannabinoids and several organophosphates. X-ray crystallography analysis revealed the mechanistic basis for new ligand recognition by an evolved cannabinoid receptor. We demonstrate that PYR1-derived receptors are readily ported to various ligand-responsive outputs, including enzyme-linked immunosorbent assay (ELISA)-like assays, luminescence by protein-fragment complementation and transcriptional circuits, all with picomolar to nanomolar sensitivity. PYR1 provides a scaffold for rapidly evolving new biosensors for diverse sense-response applications.
Collapse
Affiliation(s)
- Jesús Beltrán
- Department of Botany and Plant Sciences, University of California, Riverside, Riverside, CA, USA
- Institute for Integrative Genome Biology, University of California, Riverside, Riverside, CA, USA
| | - Paul J Steiner
- Department of Chemical and Biological Engineering, University of Colorado Boulder, Boulder, CO, USA
| | - Matthew Bedewitz
- Department of Chemical and Biological Engineering, University of Colorado Boulder, Boulder, CO, USA
| | - Shuang Wei
- Department of Biochemistry, University of California, Riverside, Riverside, CA, USA
| | - Francis C Peterson
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Zongbo Li
- Department of Chemistry, University of California, Riverside, Riverside, CA, USA
| | - Brigid E Hughes
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Zachary Hartley
- Department of Botany and Plant Sciences, University of California, Riverside, Riverside, CA, USA
- Institute for Integrative Genome Biology, University of California, Riverside, Riverside, CA, USA
| | - Nicholas R Robertson
- Department of Bioengineering, University of California, Riverside, Riverside, USA
| | | | - Zachary T Baumer
- Department of Chemical and Biological Engineering, University of Colorado Boulder, Boulder, CO, USA
| | - Alison C Leonard
- Department of Chemical and Biological Engineering, University of Colorado Boulder, Boulder, CO, USA
| | - Sang-Youl Park
- Department of Botany and Plant Sciences, University of California, Riverside, Riverside, CA, USA
| | - Brian F Volkman
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA
| | | | - Wenwan Zhong
- Department of Chemistry, University of California, Riverside, Riverside, CA, USA
| | - Ian Wheeldon
- Institute for Integrative Genome Biology, University of California, Riverside, Riverside, CA, USA.
- Department of Chemical and Environmental Engineering, University of California, Riverside, Riverside, CA, USA.
| | - Sean R Cutler
- Department of Botany and Plant Sciences, University of California, Riverside, Riverside, CA, USA.
- Institute for Integrative Genome Biology, University of California, Riverside, Riverside, CA, USA.
- Center for Plant Cell Biology, University of California, Riverside, Riverside, CA, USA.
| | - Timothy A Whitehead
- Department of Chemical and Biological Engineering, University of Colorado Boulder, Boulder, CO, USA.
| |
Collapse
|
10
|
Velecký J, Hamsikova M, Stourac J, Musil M, Damborsk J, Bednar D, Mazurenko S. SoluProtMutDB: a manually curated database of protein solubility changes upon mutations. Comput Struct Biotechnol J 2022; 20:6339-6347. [DOI: 10.1016/j.csbj.2022.11.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Revised: 11/04/2022] [Accepted: 11/04/2022] [Indexed: 11/11/2022] Open
|
11
|
Vasina M, Velecký J, Planas-Iglesias J, Marques SM, Skarupova J, Damborsky J, Bednar D, Mazurenko S, Prokop Z. Tools for computational design and high-throughput screening of therapeutic enzymes. Adv Drug Deliv Rev 2022; 183:114143. [PMID: 35167900 DOI: 10.1016/j.addr.2022.114143] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2021] [Revised: 02/04/2022] [Accepted: 02/09/2022] [Indexed: 12/16/2022]
Abstract
Therapeutic enzymes are valuable biopharmaceuticals in various biomedical applications. They have been successfully applied for fibrinolysis, cancer treatment, enzyme replacement therapies, and the treatment of rare diseases. Still, there is a permanent demand to find new or better therapeutic enzymes, which would be sufficiently soluble, stable, and active to meet specific medical needs. Here, we highlight the benefits of coupling computational approaches with high-throughput experimental technologies, which significantly accelerate the identification and engineering of catalytic therapeutic agents. New enzymes can be identified in genomic and metagenomic databases, which grow thanks to next-generation sequencing technologies exponentially. Computational design and machine learning methods are being developed to improve catalytically potent enzymes and predict their properties to guide the selection of target enzymes. High-throughput experimental pipelines, increasingly relying on microfluidics, ensure functional screening and biochemical characterization of target enzymes to reach efficient therapeutic enzymes.
Collapse
Affiliation(s)
- Michal Vasina
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; International Clinical Research Centre, St. Anne's University Hospital, Pekarska 53, Brno, Czech Republic
| | - Jan Velecký
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic
| | - Joan Planas-Iglesias
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; International Clinical Research Centre, St. Anne's University Hospital, Pekarska 53, Brno, Czech Republic
| | - Sergio M Marques
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; International Clinical Research Centre, St. Anne's University Hospital, Pekarska 53, Brno, Czech Republic
| | - Jana Skarupova
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic
| | - Jiri Damborsky
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; International Clinical Research Centre, St. Anne's University Hospital, Pekarska 53, Brno, Czech Republic; Enantis, INBIT, Kamenice 34, Brno, Czech Republic
| | - David Bednar
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; International Clinical Research Centre, St. Anne's University Hospital, Pekarska 53, Brno, Czech Republic.
| | - Stanislav Mazurenko
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; International Clinical Research Centre, St. Anne's University Hospital, Pekarska 53, Brno, Czech Republic.
| | - Zbynek Prokop
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; International Clinical Research Centre, St. Anne's University Hospital, Pekarska 53, Brno, Czech Republic.
| |
Collapse
|
12
|
Leonard AC, Weinstein JJ, Steiner PJ, Erbse AH, Fleishman SJ, Whitehead TA. Stabilization of the SARS-CoV-2 receptor binding domain by protein core redesign and deep mutational scanning. Protein Eng Des Sel 2022; 35:6553331. [PMID: 35325236 PMCID: PMC9077414 DOI: 10.1093/protein/gzac002] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Revised: 01/21/2022] [Accepted: 02/16/2022] [Indexed: 11/12/2022] Open
Abstract
Stabilizing antigenic proteins as vaccine immunogens or diagnostic reagents is a stringent case of protein engineering and design as the exterior surface must maintain recognition by receptor(s) and antigen-specific antibodies at multiple distinct epitopes. This is a challenge, as stability enhancing mutations must be focused on the protein core, whereas successful computational stabilization algorithms typically select mutations at solvent-facing positions. In this study, we report the stabilization of SARS-CoV-2 Wuhan Hu-1 Spike receptor binding domain using a combination of deep mutational scanning and computational design, including the FuncLib algorithm. Our most successful design encodes I358F, Y365W, T430I, and I513L receptor binding domain mutations, maintains recognition by the receptor ACE2 and a panel of different anti-receptor binding domain monoclonal antibodies, is between 1 and 2°C more thermally stable than the original receptor binding domain using a thermal shift assay, and is less proteolytically sensitive to chymotrypsin and thermolysin than the original receptor binding domain. Our approach could be applied to the computational stabilization of a wide range of proteins without requiring detailed knowledge of active sites or binding epitopes. We envision that this strategy may be particularly powerful for cases when there are multiple or unknown binding sites.
Collapse
Affiliation(s)
- Alison C Leonard
- Department of Chemical and Biological Engineering, University of Colorado, Boulder, CO 80303, USA
| | - Jonathan J Weinstein
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Paul J Steiner
- Department of Chemical and Biological Engineering, University of Colorado, Boulder, CO 80303, USA
| | - Annette H Erbse
- Department of Biochemistry, University of Colorado, Boulder, CO 80303, USA
| | - Sarel J Fleishman
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 7610001, Israel
| | | |
Collapse
|
13
|
Leonard AC, Weinstein JJ, Steiner PJ, Erbse AH, Fleishman SJ, Whitehead TA. Stabilization of the SARS-CoV-2 Receptor Binding Domain by Protein Core Redesign and Deep Mutational Scanning.. [PMID: 34845448 PMCID: PMC8629191 DOI: 10.1101/2021.11.22.469552] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Stabilizing antigenic proteins as vaccine immunogens or diagnostic reagents is a stringent case of protein engineering and design as the exterior surface must maintain recognition by receptor(s) and antigen—specific antibodies at multiple distinct epitopes. This is a challenge, as stability-enhancing mutations must be focused on the protein core, whereas successful computational stabilization algorithms typically select mutations at solvent-facing positions. In this study we report the stabilization of SARS-CoV-2 Wuhan Hu-1 Spike receptor binding domain (S RBD) using a combination of deep mutational scanning and computational design, including the FuncLib algorithm. Our most successful design encodes I358F, Y365W, T430I, and I513L RBD mutations, maintains recognition by the receptor ACE2 and a panel of different anti-RBD monoclonal antibodies, is between 1–2°C more thermally stable than the original RBD using a thermal shift assay, and is less proteolytically sensitive to chymotrypsin and thermolysin than the original RBD. Our approach could be applied to the computational stabilization of a wide range of proteins without requiring detailed knowledge of active sites or binding epitopes, particularly powerful for cases when there are multiple or unknown binding sites.
Collapse
|
14
|
Kuiper BP, Prins RC, Billerbeck S. Oligo Pools as an Affordable Source of Synthetic DNA for Cost-Effective Library Construction in Protein- and Metabolic Pathway Engineering. Chembiochem 2021; 23:e202100507. [PMID: 34817110 PMCID: PMC9300125 DOI: 10.1002/cbic.202100507] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Revised: 11/23/2021] [Indexed: 11/11/2022]
Abstract
The construction of custom libraries is critical for rational protein engineering and directed evolution. Array‐synthesized oligo pools of thousands of user‐defined sequences (up to ∼350 bases in length) have emerged as a low‐cost commercially available source of DNA. These pools cost ≤10 % (depending on error rate and length) of other commercial sources of custom DNA, and this significant cost difference can determine whether an enzyme engineering project can be realized on a given research budget. However, while being cheap, oligo pools do suffer from a low concentration of individual oligos and relatively high error rates. Several powerful techniques that specifically make use of oligo pools have been developed and proven valuable or even essential for next‐generation protein and pathway engineering strategies, such as sequence‐function mapping, enzyme minimization, or de‐novo design. Here we consolidate the knowledge on these techniques and their applications to facilitate the use of oligo pools within the protein engineering community.
Collapse
Affiliation(s)
- Bastiaan P Kuiper
- Molecular Microbiology, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Groningen, The Netherlands
| | - Rianne C Prins
- Molecular Microbiology, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Groningen, The Netherlands
| | - Sonja Billerbeck
- Molecular Microbiology, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Groningen, The Netherlands
| |
Collapse
|
15
|
Galanie S, Entwistle D, Lalonde J. Engineering biosynthetic enzymes for industrial natural product synthesis. Nat Prod Rep 2021; 37:1122-1143. [PMID: 32364202 DOI: 10.1039/c9np00071b] [Citation(s) in RCA: 45] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Covering: 2000 to 2020 Natural products and their derivatives are commercially important medicines, agrochemicals, flavors, fragrances, and food ingredients. Industrial strategies to produce these structurally complex molecules encompass varied combinations of chemical synthesis, biocatalysis, and extraction from natural sources. Interest in engineering natural product biosynthesis began with the advent of genetic tools for pathway discovery. Genes and strains can now readily be synthesized, mutated, recombined, and sequenced. Enzyme engineering has succeeded commercially due to the development of genetic methods, analytical technologies, and machine learning algorithms. Today, engineered biosynthetic enzymes from organisms spanning the tree of life are used industrially to produce diverse molecules. These biocatalytic processes include single enzymatic steps, multienzyme cascades, and engineered native and heterologous microbial strains. This review will describe how biosynthetic enzymes have been engineered to enable commercial and near-commercial syntheses of natural products and their analogs.
Collapse
Affiliation(s)
- Stephanie Galanie
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee, USA.
| | - David Entwistle
- Process Chemistry, Codexis, Inc., Redwood City, California, USA
| | - James Lalonde
- Microbial Digital Genome Engineering, Inscripta, Inc., Pleasanton, California, USA
| |
Collapse
|
16
|
Evolution-aided engineering of plant specialized metabolism. ABIOTECH 2021; 2:240-263. [PMID: 36303885 PMCID: PMC9590541 DOI: 10.1007/s42994-021-00052-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/05/2021] [Accepted: 06/04/2021] [Indexed: 02/07/2023]
Abstract
The evolution of new traits in living organisms occurs via the processes of mutation, recombination, genetic drift, and selection. These processes that have resulted in the immense biological diversity on our planet are also being employed in metabolic engineering to optimize enzymes and pathways, create new-to-nature reactions, and synthesize complex natural products in heterologous systems. In this review, we discuss two evolution-aided strategies for metabolic engineering-directed evolution, which improves upon existing genetic templates using the evolutionary process, and combinatorial pathway reconstruction, which brings together genes evolved in different organisms into a single heterologous host. We discuss the general principles of these strategies, describe the technologies involved and the molecular traits they influence, provide examples of their use, and discuss the roadblocks that need to be addressed for their wider adoption. A better understanding of these strategies can provide an impetus to research on gene function discovery and biochemical evolution, which is foundational for improved metabolic engineering. These evolution-aided approaches thus have a substantial potential for improving our understanding of plant metabolism in general, for enhancing the production of plant metabolites, and in sustainable agriculture.
Collapse
|
17
|
Baumer ZT, Whitehead TA. The inner workings of an enzyme. Science 2021; 373:391-392. [PMID: 34437105 DOI: 10.1126/science.abj8346] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Affiliation(s)
- Zachary T Baumer
- Department of Chemical and Biological Engineering, University of Colorado, Boulder, CO 80305, USA
| | - Timothy A Whitehead
- Department of Chemical and Biological Engineering, University of Colorado, Boulder, CO 80305, USA.
| |
Collapse
|
18
|
Song H, Bremer BJ, Hinds EC, Raskutti G, Romero PA. Inferring Protein Sequence-Function Relationships with Large-Scale Positive-Unlabeled Learning. Cell Syst 2020; 12:92-101.e8. [PMID: 33212013 DOI: 10.1016/j.cels.2020.10.007] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2019] [Revised: 08/13/2020] [Accepted: 10/22/2020] [Indexed: 10/22/2022]
Abstract
Machine learning can infer how protein sequence maps to function without requiring a detailed understanding of the underlying physical or biological mechanisms. It is challenging to apply existing supervised learning frameworks to large-scale experimental data generated by deep mutational scanning (DMS) and related methods. DMS data often contain high-dimensional and correlated sequence variables, experimental sampling error and bias, and the presence of missing data. Notably, most DMS data do not contain examples of negative sequences, making it challenging to directly estimate how sequence affects function. Here, we develop a positive-unlabeled (PU) learning framework to infer sequence-function relationships from large-scale DMS data. Our PU learning method displays excellent predictive performance across ten large-scale sequence-function datasets, representing proteins of different folds, functions, and library types. The estimated parameters pinpoint key residues that dictate protein structure and function. Finally, we apply our statistical sequence-function model to design highly stabilized enzymes.
Collapse
Affiliation(s)
- Hyebin Song
- Department of Statistics, The Pennsylvania State University, State College, PA 16802, USA; Department of Statistics, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Bennett J Bremer
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Emily C Hinds
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Garvesh Raskutti
- Department of Statistics, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Philip A Romero
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI 53706, USA; Department of Chemical and Biological Engineering, University of Wisconsin-Madison, Madison, WI 53706, USA.
| |
Collapse
|
19
|
Stimple SD, Smith MD, Tessier PM. Directed evolution methods for overcoming trade-offs between protein activity and stability. AIChE J 2020; 66. [PMID: 32719568 DOI: 10.1002/aic.16814] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Engineered proteins are being widely developed and employed in applications ranging from enzyme catalysts to therapeutic antibodies. Directed evolution, an iterative experimental process composed of mutagenesis and library screening, is a powerful technique for enhancing existing protein activities and generating entirely new ones not observed in nature. However, the process of accumulating mutations for enhanced protein activity requires chemical and structural changes that are often destabilizing, and low protein stability is a significant barrier to achieving large enhancements in activity during multiple rounds of directed evolution. Here we highlight advances in understanding the origins of protein activity/stability trade-offs for two important classes of proteins (enzymes and antibodies) as well as innovative experimental and computational methods for overcoming such trade-offs. These advances hold great potential for improving the generation of highly active and stable proteins that are needed to address key challenges related to human health, energy and the environment.
Collapse
Affiliation(s)
- Samuel D. Stimple
- Department of Pharmaceutical Sciences Biointerfaces Institute, University of Michigan Ann Arbor Michigan
- Department of Chemical Engineering Biointerfaces Institute, University of Michigan Ann Arbor Michigan
| | - Matthew D. Smith
- Department of Chemical Engineering Biointerfaces Institute, University of Michigan Ann Arbor Michigan
| | - Peter M. Tessier
- Department of Pharmaceutical Sciences Biointerfaces Institute, University of Michigan Ann Arbor Michigan
- Department of Chemical Engineering Biointerfaces Institute, University of Michigan Ann Arbor Michigan
- Department of Biomedical Engineering Biointerfaces Institute, University of Michigan Ann Arbor Michigan
| |
Collapse
|
20
|
Siedhoff NE, Schwaneberg U, Davari MD. Machine learning-assisted enzyme engineering. Methods Enzymol 2020; 643:281-315. [DOI: 10.1016/bs.mie.2020.05.005] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
|
21
|
Klesmith JR, Wu L, Lobb RR, Rennert PD, Hackel BJ. Fine Epitope Mapping of the CD19 Extracellular Domain Promotes Design. Biochemistry 2019; 58:4869-4881. [PMID: 31702909 DOI: 10.1021/acs.biochem.9b00808] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The B-cell surface protein CD19 is present throughout the cell life cycle and is uniformly expressed in leukemias, making it a target for chimeric antigen receptor engineered immune cell therapy. Identifying the sequence dependence of the binding of CD19 to antibodies empowers fundamental study and more tailored development of CD19-targeted therapeutics. To identify the antibody-binding epitopes on CD19, we screened a comprehensive single-site saturation mutation library of the human CD19 extracellular domain to identify mutations detrimental to binding FMC63-the dominant CD19 antibody used in chimeric antigen receptor development-as well as 4G7-2E3 and 3B10, which have been used in various types of CD19 research and development. All three antibodies had partially overlapping, yet distinct, epitopes near the published epitope of antibody B43. The FMC63 conformational epitope spans spatially adjacent, but genetically distant, loops in exons 3 and 4. The 3B10 epitope is a linear peptide sequence that binds CD19 with 440 pM affinity. Along with their primary goal of epitope mapping, the mutational tolerance data also empowered additional CD19 variant design and analysis. A designed CD19 variant with all N-linked glycosylation sites removed successfully bound antibody in the yeast display context, which provides a lead for aglycosylated applications. Screening for thermally stable variants identified mutations to guide further CD19 stabilization for fusion protein applications and revealed evolutionary affinity-stability trade-offs. These fundamental insights into CD19 sequence-function relationships enhance our understanding of antibody-mediated CD19-targeted therapeutics.
Collapse
Affiliation(s)
- Justin R Klesmith
- Department of Chemical Engineering and Materials Science , University of Minnesota-Twin Cities , 421 Washington Avenue SE , Minneapolis , Minnesota 55455 , United States
| | - Lan Wu
- Aleta Biotherapeutics , 27 Strathmore Road , Natick , Massachusetts 01760 , United States
| | - Roy R Lobb
- Aleta Biotherapeutics , 27 Strathmore Road , Natick , Massachusetts 01760 , United States
| | - Paul D Rennert
- Aleta Biotherapeutics , 27 Strathmore Road , Natick , Massachusetts 01760 , United States
| | - Benjamin J Hackel
- Department of Chemical Engineering and Materials Science , University of Minnesota-Twin Cities , 421 Washington Avenue SE , Minneapolis , Minnesota 55455 , United States
| |
Collapse
|
22
|
Engineering a microbial biosynthesis platform for de novo production of tropane alkaloids. Nat Commun 2019; 10:3634. [PMID: 31406117 PMCID: PMC6690885 DOI: 10.1038/s41467-019-11588-w] [Citation(s) in RCA: 52] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2019] [Accepted: 07/18/2019] [Indexed: 02/07/2023] Open
Abstract
Tropane alkaloids (TAs) are a class of phytochemicals produced by plants of the nightshade family used for treating diverse neurological disorders. Here, we demonstrate de novo production of tropine, a key intermediate in the biosynthetic pathway of medicinal TAs such as scopolamine, from simple carbon and nitrogen sources in yeast (Saccharomyces cerevisiae). Our engineered strain incorporates 15 additional genes, including 11 derived from diverse plants and bacteria, and 7 disruptions to yeast regulatory or biosynthetic proteins to produce tropine at titers of 6 mg/L. We also demonstrate the utility of our engineered yeast platform for the discovery of TA derivatives by combining biosynthetic modules from distant plant lineages to achieve de novo production of cinnamoyltropine, a non-canonical TA. Our engineered strain constitutes a starting point for future optimization efforts towards realizing industrial fermentation of medicinal TAs and a platform for the synthesis of TA derivatives with enhanced bioactivities. Tropane alkaloids (TAs) are a group of phytochemicals that are used to treat neurological disorders. Here, the authors engineer baker’s yeast to produce tropine, a key intermediate in the biosynthetic pathway of TAs, and cinnamoyltropine, a non-canonical TA, from simple carbon and nitrogen sources.
Collapse
|
23
|
Ribeiro LF, Amarelle V, Alves LDF, Viana de Siqueira GM, Lovate GL, Borelli TC, Guazzaroni ME. Genetically Engineered Proteins to Improve Biomass Conversion: New Advances and Challenges for Tailoring Biocatalysts. Molecules 2019; 24:molecules24162879. [PMID: 31398877 PMCID: PMC6719137 DOI: 10.3390/molecules24162879] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2019] [Revised: 07/30/2019] [Accepted: 08/06/2019] [Indexed: 01/02/2023] Open
Abstract
Protein engineering emerged as a powerful approach to generate more robust and efficient biocatalysts for bio-based economy applications, an alternative to ecologically toxic chemistries that rely on petroleum. On the quest for environmentally friendly technologies, sustainable and low-cost resources such as lignocellulosic plant-derived biomass are being used for the production of biofuels and fine chemicals. Since most of the enzymes used in the biorefinery industry act in suboptimal conditions, modification of their catalytic properties through protein rational design and in vitro evolution techniques allows the improvement of enzymatic parameters such as specificity, activity, efficiency, secretability, and stability, leading to better yields in the production lines. This review focuses on the current application of protein engineering techniques for improving the catalytic performance of enzymes used to break down lignocellulosic polymers. We discuss the use of both classical and modern methods reported in the literature in the last five years that allowed the boosting of biocatalysts for biomass degradation.
Collapse
Affiliation(s)
- Lucas Ferreira Ribeiro
- Department of Biology, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, University of São Paulo, Ribeirão Preto 14040-901, Brazil.
| | - Vanesa Amarelle
- Department of Microbial Biochemistry and Genomics, Biological Research Institute Clemente Estable, Montevideo, PC 11600, Uruguay
| | - Luana de Fátima Alves
- Department of Biochemistry and Immunology, Faculdade de Medicina de Ribeirão Preto, University of São Paulo, Ribeirão Preto 14049-900, Brazil
| | | | - Gabriel Lencioni Lovate
- Department of Biochemistry and Immunology, Faculdade de Medicina de Ribeirão Preto, University of São Paulo, Ribeirão Preto 14049-900, Brazil
| | - Tiago Cabral Borelli
- Department of Biology, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, University of São Paulo, Ribeirão Preto 14040-901, Brazil
| | - María-Eugenia Guazzaroni
- Department of Biology, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, University of São Paulo, Ribeirão Preto 14040-901, Brazil.
| |
Collapse
|