1
|
Spannenkrebs JB, Eiermann A, Zoll T, Hackenschmidt S, Kabisch J. Closing the loop: establishing an autonomous test-learn cycle to optimize induction of bacterial systems using a robotic platform. Front Bioeng Biotechnol 2025; 12:1528224. [PMID: 39911814 PMCID: PMC11795046 DOI: 10.3389/fbioe.2024.1528224] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2024] [Accepted: 12/30/2024] [Indexed: 02/07/2025] Open
Abstract
One goal of synthetic biology is to provide well-characterised biological parts that behave predictably in genetic assemblies. To achieve this, each part must be characterised in a time-resolved manner under relevant conditions. Robotic platforms can be used to automate this task and provide sufficiently large and reproducible data sets including provenance. Although robotics can significantly speed up the data collection process, the collation and analysis of the resulting data, needed to reprogram and refine workflows for future iterations, is often a manual process. As a result, even in times of rapidly advancing artificial intelligence, the common design-build-test-learn (DBTL) cycle is still not circular without human intervention. To move towards fully automated DBTL cycles, we developed a software framework to enable a robotic platform to autonomously adjust test parameters. This interdisciplinary work between computer science and biology thus transforms a static robotic platform into a dynamic one. The software framework includes software components such as an importer that retrieves measurement data from the platform's devices and writes it to a database. This is followed by an optimizer that selects the next measurement points based on a balance between exploration and exploitation. The platform is shown to be able to automatically and autonomously optimize the inducer concentration for a Bacillus subtilis system and the combination of inducer and feed release for a Escherichia coli system. As a target product the readily measurable green fluorescent reporter protein (GFP) is produced over multiple, consecutive iterations of testing. An evaluation of chosen (learning) algorithms for single and dual factor optimization was performed. In this article, we share the lessons learned from the development, implementation and execution of this automated design-build-test-learn cycles on a robotic platform.
Collapse
Affiliation(s)
| | | | - Thomas Zoll
- Computer-Aided Synthetic Biology, TU Darmstadt, Darmstadt, Germany
| | | | - Johannes Kabisch
- Institute for Biotechnology and Food Science, NTNU, Trondheim, Norway
- Proteineer GmbH, Neu-Isenburg, Germany
- Computer-Aided Synthetic Biology, TU Darmstadt, Darmstadt, Germany
| |
Collapse
|
2
|
Mey F, De Waele G, Demeester W, Guidi C, Duchi D, Diederen T, Kochuyt H, Van Huffel K, Zamboni N, Waegeman W, De Mey M. Machine learning reveals novel compound for the improved production of chitooligosaccharides in Escherichia coli. N Biotechnol 2025; 86:39-46. [PMID: 39827984 DOI: 10.1016/j.nbt.2025.01.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2024] [Revised: 01/10/2025] [Accepted: 01/16/2025] [Indexed: 01/22/2025]
Abstract
In order to improve predictability of outcome and reduce costly rounds of trial-and-error, machine learning models have been of increasing importance in the field of synthetic biology. Besides applications in predicting genome annotation, process parameters and transcription initiation frequency, such models have also been of help for pathway optimization. The latter is a common strategy in metabolic engineering and improves the production of a desirable compound by optimizing enzyme expression levels of the production pathway. However, engineering steps might not lead to sufficient improvement, and bottlenecks may remain hidden among the hundreds of metabolic reactions occurring in a living cell, especially if the production pathway is highly interconnected with other parts of the cell's metabolism. Here, we use the synthesis of chitooligosaccharides (COS) to show that the production from such complex pathways can be improved by using machine learning models and feature importance analysis to find new compounds with an impact on COS production. We screened Escherichia coli libraries of engineered transcription regulators with an expected broad range of metabolic diversity and trained several machine learning models to predict COS production titers. Subsequent feature analysis led to the finding of iron, whose addition we could show improved COS production in vivo up to two-fold. Additionally, the analysis revealed important clues for future engineering steps.
Collapse
Affiliation(s)
- Friederike Mey
- Center for Synthetic Biology, Department of Biotechnology, University of Ghent, Belgium
| | - Gaetan De Waele
- KERMIT, Department of Data Analysis and Mathematical Modelling, University of Ghent, Belgium
| | - Wouter Demeester
- Center for Synthetic Biology, Department of Biotechnology, University of Ghent, Belgium
| | - Chiara Guidi
- Center for Synthetic Biology, Department of Biotechnology, University of Ghent, Belgium
| | - Dries Duchi
- Center for Synthetic Biology, Department of Biotechnology, University of Ghent, Belgium
| | - Tomek Diederen
- Institute for Molecular Systems Biology, Department of Biology, ETH Zurich, Switzerland
| | - Hanne Kochuyt
- Center for Synthetic Biology, Department of Biotechnology, University of Ghent, Belgium
| | - Kirsten Van Huffel
- Center for Synthetic Biology, Department of Biotechnology, University of Ghent, Belgium
| | - Nicola Zamboni
- Institute for Molecular Systems Biology, Department of Biology, ETH Zurich, Switzerland
| | - Willem Waegeman
- KERMIT, Department of Data Analysis and Mathematical Modelling, University of Ghent, Belgium
| | - Marjan De Mey
- Center for Synthetic Biology, Department of Biotechnology, University of Ghent, Belgium.
| |
Collapse
|
3
|
Moreno-Paz S, Schmitz J, Suarez-Diez M. In silico analysis of design of experiment methods for metabolic pathway optimization. Comput Struct Biotechnol J 2024; 23:1959-1967. [PMID: 38736694 PMCID: PMC11087228 DOI: 10.1016/j.csbj.2024.04.062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2024] [Revised: 04/25/2024] [Accepted: 04/25/2024] [Indexed: 05/14/2024] Open
Abstract
Microbial cell factories allow the production of chemicals presenting an alternative to traditional fossil fuel-dependent production. However, finding the optimal expression of production pathway genes is crucial for the development of efficient production strains. Unlike sequential experimentation, combinatorial optimization captures the relationships between pathway genes and production, albeit at the cost of conducting multiple experiments. Fractional factorial designs followed by linear modeling and statistical analysis reduce the experimental workload while maximizing the information gained during experimentation. Although tools to perform and analyze these designs are available, guidelines for selecting appropriate factorial designs for pathway optimization are missing. In this study, we leverage a kinetic model of a seven-genes pathway to simulate the performance of a full factorial strain library. We compare this approach to resolution V, IV, III, and Plackett Burman (PB) designs. Additionally, we evaluate the performance of these designs as training sets for a random forest algorithm aimed at identifying best-producing strains. Evaluating the robustness of these designs to noise and missing data, traits inherent to biological datasets, we find that while resolution V designs capture most information present in full factorial data, they necessitate the construction of a large number of strains. On the other hand, resolution III and PB designs fall short in identifying optimal strains and miss relevant information. Besides, given the small number of experiments required for the optimization of a pathway with seven genes, linear models outperform random forest. Consequently, we propose the use of resolution IV designs followed by linear modeling in Design-Build-Test-Learn (DBTL) cycles targeting the screening of multiple factors. These designs enable the identification of optimal strains and provide valuable guidance for subsequent optimization cycles.
Collapse
Affiliation(s)
- Sara Moreno-Paz
- Laboratory of Systems and Synthetic Biology, Wageningen University & Research, 6708WE Wageningen, the Netherlands
| | - Joep Schmitz
- Department of Science and Research - dsm-firmenich, Science & Research, 2600 MA Delft, the Netherlands
| | - Maria Suarez-Diez
- Laboratory of Systems and Synthetic Biology, Wageningen University & Research, 6708WE Wageningen, the Netherlands
| |
Collapse
|
4
|
Bultelle M, Casas A, Kitney R. Engineering biology and automation-Replicability as a design principle. ENGINEERING BIOLOGY 2024; 8:53-68. [PMID: 39734660 PMCID: PMC11681252 DOI: 10.1049/enb2.12035] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2024] [Revised: 06/24/2024] [Accepted: 07/07/2024] [Indexed: 12/31/2024] Open
Abstract
Applications in engineering biology increasingly share the need to run operations on very large numbers of biological samples. This is a direct consequence of the application of good engineering practices, the limited predictive power of current computational models and the desire to investigate very large design spaces in order to solve the hard, important problems the discipline promises to solve. Automation has been proposed as a key component for running large numbers of operations on biological samples. This is because it is strongly associated with higher throughput, and with higher replicability (thanks to the reduction of human input). The authors focus on replicability and make the point that, far from being an additional burden for automation efforts, replicability should be considered central to the design of the automated pipelines processing biological samples at scale-as trialled in biofoundries. There cannot be successful automation without effective error control. Design principles for an IT infrastructure that supports replicability are presented. Finally, the authors conclude with some perspectives regarding the evolution of automation in engineering biology. In particular, they speculate that the integration of hardware and software will show rapid progress, and offer users a degree of control and abstraction of the robotic infrastructure on a level significantly greater than experienced today.
Collapse
Affiliation(s)
| | - Alexis Casas
- Department of BioengineeringImperial College LondonLondonUK
| | - Richard Kitney
- Department of BioengineeringImperial College LondonLondonUK
| |
Collapse
|
5
|
Omar MN, Minggu MM, Nor Muhammad NA, Abdul PM, Zhang Y, Ramzi AB. Towards consolidated bioprocessing of biomass and plastic substrates for semi-synthetic production of bio-poly(ethylene furanoate) (PEF) polymer using omics-guided construction of artificial microbial consortia. Enzyme Microb Technol 2024; 177:110429. [PMID: 38537325 DOI: 10.1016/j.enzmictec.2024.110429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 02/20/2024] [Accepted: 03/14/2024] [Indexed: 04/29/2024]
Abstract
Poly(ethylene furanoate) (PEF) plastic is a 100% renewable polyester that is currently being pursued for commercialization as the next-generation bio-based plastic. This is in line with growing demand for circular bioeconomy and new plastics economy that is aimed at minimizing plastic waste mismanagement and lowering carbon footprint of plastics. However, the current catalytic route for the synthesis of PEF is impeded with technical challenges including high cost of pretreatment and catalyst refurbishment. On the other hand, the semi-biosynthetic route of PEF plastic production is of increased biotechnological interest. In particular, the PEF monomers (Furan dicarboxylic acid and ethylene glycol) can be synthesized via microbial-based biorefinery and purified for subsequent catalyst-mediated polycondensation into PEF. Several bioengineering and bioprocessing issues such as efficient substrate utilization and pathway optimization need to be addressed prior to establishing industrial-scale production of the monomers. This review highlights current advances in semi-biosynthetic production of PEF monomers using consolidated waste biorefinery strategies, with an emphasis on the employment of omics-driven systems biology approaches in enzyme discovery and pathway construction. The roles of microbial protein transporters will be discussed, especially in terms of improving substrate uptake and utilization from lignocellulosic biomass, as well as from depolymerized plastic waste as potential bio-feedstock. The employment of artificial bioengineered microbial consortia will also be highlighted to provide streamlined systems and synthetic biology strategies for bio-based PEF monomer production using both plant biomass and plastic-derived substrates, which are important for circular and new plastics economy advances.
Collapse
Affiliation(s)
- Mohd Norfikri Omar
- Institute of Systems Biology (INBIOSIS), Universiti Kebangsaan Malaysia (UKM), UKM, Bangi, Selangor 43600, Malaysia
| | - Matthlessa Matthew Minggu
- Institute of Systems Biology (INBIOSIS), Universiti Kebangsaan Malaysia (UKM), UKM, Bangi, Selangor 43600, Malaysia
| | - Nor Azlan Nor Muhammad
- Institute of Systems Biology (INBIOSIS), Universiti Kebangsaan Malaysia (UKM), UKM, Bangi, Selangor 43600, Malaysia
| | - Peer Mohamed Abdul
- Department of Chemical and Process Engineering, Faculty of Engineering and Built Environment, Universiti Kebangsaan Malaysia, Bangi, Selangor 43600, Malaysia; Centre for Sustainable Process Technology (CESPRO), Faculty of Engineering and Built Environment, Universiti Kebangsaan Malaysia, Bangi, Selangor 43600, Malaysia
| | - Ying Zhang
- BBSRC/EPSRC Synthetic Biology Research Centre (SBRC), School of Life Sciences, University of Nottingham, University Park, Nottingham NG7 2RD, UK
| | - Ahmad Bazli Ramzi
- Institute of Systems Biology (INBIOSIS), Universiti Kebangsaan Malaysia (UKM), UKM, Bangi, Selangor 43600, Malaysia.
| |
Collapse
|
6
|
Li S, Ye Z, Moreb EA, Menacho-Melgar R, Golovsky M, Lynch MD. 2-Stage microfermentations. Metab Eng Commun 2024; 18:e00233. [PMID: 38665924 PMCID: PMC11043886 DOI: 10.1016/j.mec.2024.e00233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2024] [Revised: 03/27/2024] [Accepted: 04/03/2024] [Indexed: 04/28/2024] Open
Abstract
Cell based factories can be engineered to produce a wide variety of products. Advances in DNA synthesis and genome editing have greatly simplified the design and construction of these factories. It has never been easier to generate hundreds or even thousands of cell factory strain variants for evaluation. These advances have amplified the need for standardized, higher throughput means of evaluating these designs. Toward this goal, we have previously reported the development of engineered E. coli strains and associated 2-stage production processes to simplify and standardize strain engineering, evaluation and scale up. This approach relies on decoupling growth (stage 1), from production, which occurs in stationary phase (stage 2). Phosphate depletion is used as the trigger to stop growth as well as induce heterologous expression. Here, we describe in detail the development of protocols for the evaluation of engineered E. coli strains in 2-stage microfermentations. These protocols are readily adaptable to the evaluation of strains producing a wide variety of protein as well as small molecule products. Additionally, by detailing the approach to protocol development, these methods are also adaptable to additional cellular hosts, as well as other 2-stage processes with various additional triggers.
Collapse
Affiliation(s)
- Shuai Li
- Department of Chemistry, Duke University, Durham, NC, USA
| | - Zhixia Ye
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
| | - Eirik A. Moreb
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
| | | | | | - Michael D. Lynch
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
| |
Collapse
|
7
|
Shao M, Xu F, Ke X, Huang M, Chu J. Enhancing erythromycin production in Saccharopolyspora erythraea through rational engineering and fermentation refinement: A Design-Build-Test-Learn approach. Biotechnol J 2024; 19:e2400039. [PMID: 38797723 DOI: 10.1002/biot.202400039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2024] [Revised: 04/09/2024] [Accepted: 04/10/2024] [Indexed: 05/29/2024]
Abstract
Industrial production of bioactive compounds from actinobacteria, such as erythromycin and its derivatives, faces challenges in achieving optimal yields. To this end, the Design-Build-Test-Learn (DBTL) framework, a systematic metabolic engineering approach, was employed to enhance erythromycin production in Saccharopolyspora erythraea (S. erythraea) E3 strain. A genetically modified strain, S. erythraea E3-CymRP21-dcas9-sucC (S. erythraea CS), was developed by suppressing the sucC gene using an inducible promoter and dcas9 protein. The strain exhibited improved erythromycin synthesis, attributed to enhanced precursor synthesis and increased NADPH availability. Transcriptomic and metabolomic analyses revealed altered central carbon metabolism, amino acid metabolism, energy metabolism, and co-factor/vitamin metabolism in CS. Augmented amino acid metabolism led to nitrogen depletion, potentially causing cellular autolysis during later fermentation stages. By refining the fermentation process through ammonium sulfate supplementation, erythromycin yield reached 1125.66 mg L-1, a 43.5% increase. The results demonstrate the power of the DBTL methodology in optimizing erythromycin production, shedding light on its potential for revolutionizing antibiotic manufacturing in response to the global challenge of antibiotic resistance.
Collapse
Affiliation(s)
- Minghao Shao
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, Shanghai, People's Republic of China
| | - Feng Xu
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, Shanghai, People's Republic of China
| | - Xiang Ke
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, Shanghai, People's Republic of China
| | - Mingzhi Huang
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, Shanghai, People's Republic of China
| | - Ju Chu
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, Shanghai, People's Republic of China
| |
Collapse
|
8
|
Manoli MT, Gargantilla-Becerra Á, Del Cerro Sánchez C, Rivero-Buceta V, Prieto MA, Nogales J. A model-driven approach to upcycling recalcitrant feedstocks in Pseudomonas putida by decoupling PHA production from nutrient limitation. Cell Rep 2024; 43:113979. [PMID: 38517887 DOI: 10.1016/j.celrep.2024.113979] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Revised: 01/29/2024] [Accepted: 03/06/2024] [Indexed: 03/24/2024] Open
Abstract
Bacterial polyhydroxyalkanoates (PHAs) have emerged as promising eco-friendly alternatives to petroleum-based plastics since they are synthesized from renewable resources and offer exceptional properties. However, their production is limited to the stationary growth phase under nutrient-limited conditions, requiring customized strategies and costly two-phase bioprocesses. In this study, we tackle these challenges by employing a model-driven approach to reroute carbon flux and remove regulatory constraints using synthetic biology. We construct a collection of Pseudomonas putida-overproducing strains at the expense of plastics and lignin-related compounds using growth-coupling approaches. PHA production was successfully achieved during growth phase, resulting in the production of up to 46% PHA/cell dry weight while maintaining a balanced carbon-to-nitrogen ratio. Our strains are additionally validated under an upcycling scenario using enzymatically hydrolyzed polyethylene terephthalate as a feedstock. These findings have the potential to revolutionize PHA production and address the global plastic crisis by overcoming the complexities of traditional PHA production bioprocesses.
Collapse
Affiliation(s)
- Maria-Tsampika Manoli
- Polymer Biotechnology Group, Department of Microbial and Plant Biotechnology, Margarita Salas Center for Biological Research (CIB-CSIC), 28040 Madrid, Spain; Interdisciplinary Platform for Sustainable Plastics towards a Circular Economy-Spanish National Research Council (SusPlast-CSIC), Madrid, Spain
| | - Álvaro Gargantilla-Becerra
- Interdisciplinary Platform for Sustainable Plastics towards a Circular Economy-Spanish National Research Council (SusPlast-CSIC), Madrid, Spain; 3Systems Biotechnology Group, Department of Systems Biology, Centro Nacional de Biotecnología, CSIC, Madrid 28049, Spain
| | - Carlos Del Cerro Sánchez
- Polymer Biotechnology Group, Department of Microbial and Plant Biotechnology, Margarita Salas Center for Biological Research (CIB-CSIC), 28040 Madrid, Spain; Interdisciplinary Platform for Sustainable Plastics towards a Circular Economy-Spanish National Research Council (SusPlast-CSIC), Madrid, Spain
| | - Virginia Rivero-Buceta
- Polymer Biotechnology Group, Department of Microbial and Plant Biotechnology, Margarita Salas Center for Biological Research (CIB-CSIC), 28040 Madrid, Spain; Interdisciplinary Platform for Sustainable Plastics towards a Circular Economy-Spanish National Research Council (SusPlast-CSIC), Madrid, Spain
| | - M Auxiliadora Prieto
- Polymer Biotechnology Group, Department of Microbial and Plant Biotechnology, Margarita Salas Center for Biological Research (CIB-CSIC), 28040 Madrid, Spain; Interdisciplinary Platform for Sustainable Plastics towards a Circular Economy-Spanish National Research Council (SusPlast-CSIC), Madrid, Spain.
| | - Juan Nogales
- Interdisciplinary Platform for Sustainable Plastics towards a Circular Economy-Spanish National Research Council (SusPlast-CSIC), Madrid, Spain; 3Systems Biotechnology Group, Department of Systems Biology, Centro Nacional de Biotecnología, CSIC, Madrid 28049, Spain; CNB DNA Biofoundry (CNBio), CSIC, Madrid, Spain.
| |
Collapse
|
9
|
Moreno-Paz S, van der Hoek R, Eliana E, Zwartjens P, Gosiewska S, Martins dos Santos VAP, Schmitz J, Suarez-Diez M. Machine Learning-Guided Optimization of p-Coumaric Acid Production in Yeast. ACS Synth Biol 2024; 13:1312-1322. [PMID: 38545878 PMCID: PMC11036487 DOI: 10.1021/acssynbio.4c00035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2024] [Revised: 03/07/2024] [Accepted: 03/14/2024] [Indexed: 04/20/2024]
Abstract
Industrial biotechnology uses Design-Build-Test-Learn (DBTL) cycles to accelerate the development of microbial cell factories, required for the transition to a biobased economy. To use them effectively, appropriate connections between the phases of the cycle are crucial. Using p-coumaric acid (pCA) production in Saccharomyces cerevisiae as a case study, we propose the use of one-pot library generation, random screening, targeted sequencing, and machine learning (ML) as links during DBTL cycles. We showed that the robustness and flexibility of the ML models strongly enable pathway optimization and propose feature importance and Shapley additive explanation values as a guide to expand the design space of original libraries. This approach allowed a 68% increased production of pCA within two DBTL cycles, leading to a 0.52 g/L titer and a 0.03 g/g yield on glucose.
Collapse
Affiliation(s)
- Sara Moreno-Paz
- Laboratory
of Systems and Synthetic Biology, Wageningen
University & Research, 6708 WE Wageningen, The Netherlands
| | - Rianne van der Hoek
- Department
of Science and Research, dsm-firmenich,
Science & Research, 2600 MA Delft, The
Netherlands
| | - Elif Eliana
- Laboratory
of Systems and Synthetic Biology, Wageningen
University & Research, 6708 WE Wageningen, The Netherlands
| | - Priscilla Zwartjens
- Department
of Science and Research, dsm-firmenich,
Science & Research, 2600 MA Delft, The
Netherlands
| | - Silvia Gosiewska
- Department
of Science and Research, dsm-firmenich,
Science & Research, 2600 MA Delft, The
Netherlands
| | | | - Joep Schmitz
- Department
of Science and Research, dsm-firmenich,
Science & Research, 2600 MA Delft, The
Netherlands
| | - Maria Suarez-Diez
- Laboratory
of Systems and Synthetic Biology, Wageningen
University & Research, 6708 WE Wageningen, The Netherlands
| |
Collapse
|
10
|
Petersen SD, Levassor L, Pedersen CM, Madsen J, Hansen LG, Zhang J, Haidar AK, Frandsen RJN, Keasling JD, Weber T, Sonnenschein N, K. Jensen M. teemi: An open-source literate programming approach for iterative design-build-test-learn cycles in bioengineering. PLoS Comput Biol 2024; 20:e1011929. [PMID: 38457467 PMCID: PMC10954146 DOI: 10.1371/journal.pcbi.1011929] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Revised: 03/20/2024] [Accepted: 02/17/2024] [Indexed: 03/10/2024] Open
Abstract
Synthetic biology dictates the data-driven engineering of biocatalysis, cellular functions, and organism behavior. Integral to synthetic biology is the aspiration to efficiently find, access, interoperate, and reuse high-quality data on genotype-phenotype relationships of native and engineered biosystems under FAIR principles, and from this facilitate forward-engineering strategies. However, biology is complex at the regulatory level, and noisy at the operational level, thus necessitating systematic and diligent data handling at all levels of the design, build, and test phases in order to maximize learning in the iterative design-build-test-learn engineering cycle. To enable user-friendly simulation, organization, and guidance for the engineering of biosystems, we have developed an open-source python-based computer-aided design and analysis platform operating under a literate programming user-interface hosted on Github. The platform is called teemi and is fully compliant with FAIR principles. In this study we apply teemi for i) designing and simulating bioengineering, ii) integrating and analyzing multivariate datasets, and iii) machine-learning for predictive engineering of metabolic pathway designs for production of a key precursor to medicinal alkaloids in yeast. The teemi platform is publicly available at PyPi and GitHub.
Collapse
Affiliation(s)
- Søren D. Petersen
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Lucas Levassor
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kgs. Lyngby, Denmark
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Christine M. Pedersen
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Jan Madsen
- Department of Applied Mathematics and Computer Science, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Lea G. Hansen
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Jie Zhang
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Ahmad K. Haidar
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Rasmus J. N. Frandsen
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Jay D. Keasling
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kgs. Lyngby, Denmark
- Joint BioEnergy Institute, Emeryville, California, United States of America
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California, United States of America
- Department of Chemical and Biomolecular Engineering, Department of Bioengineering, University of California, Berkeley, California, United States of America
- Center for Synthetic Biochemistry, Institute for Synthetic Biology, Shenzhen Institutes of Advanced Technologies, Shenzhen, China
| | - Tilmann Weber
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Nikolaus Sonnenschein
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Michael K. Jensen
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kgs. Lyngby, Denmark
| |
Collapse
|
11
|
Peng B, Weintraub SJ, Lu Z, Evans S, Shen Q, McDonnell L, Plan M, Collier T, Cheah LC, Ji L, Howard CB, Anderson W, Trau M, Dumsday G, Bredeweg EL, Young EM, Speight R, Vickers CE. Integration of Yeast Episomal/Integrative Plasmid Causes Genotypic and Phenotypic Diversity and Improved Sesquiterpene Production in Metabolically Engineered Saccharomyces cerevisiae. ACS Synth Biol 2024; 13:141-156. [PMID: 38084917 DOI: 10.1021/acssynbio.3c00363] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2024]
Abstract
The variability in phenotypic outcomes among biological replicates in engineered microbial factories presents a captivating mystery. Establishing the association between phenotypic variability and genetic drivers is important to solve this intricate puzzle. We applied a previously developed auxin-inducible depletion of hexokinase 2 as a metabolic engineering strategy for improved nerolidol production in Saccharomyces cerevisiae, and biological replicates exhibit a dichotomy in nerolidol production of either 3.5 or 2.5 g L-1 nerolidol. Harnessing Oxford Nanopore's long-read genomic sequencing, we reveal a potential genetic cause─the chromosome integration of a 2μ sequence-based yeast episomal plasmid, encoding the expression cassettes for nerolidol synthetic enzymes. This finding was reinforced through chromosome integration revalidation, engineering nerolidol and valencene production strains, and generating a diverse pool of yeast clones, each uniquely fingerprinted by gene copy numbers, plasmid integrations, other genomic rearrangements, protein expression levels, growth rate, and target product productivities. Τhe best clone in two strains produced 3.5 g L-1 nerolidol and ∼0.96 g L-1 valencene. Comparable genotypic and phenotypic variations were also generated through the integration of a yeast integrative plasmid lacking 2μ sequences. Our work shows that multiple factors, including plasmid integration status, subchromosomal location, gene copy number, sesquiterpene synthase expression level, and genome rearrangement, together play a complicated determinant role on the productivities of sesquiterpene product. Integration of yeast episomal/integrative plasmids may be used as a versatile method for increasing the diversity and optimizing the efficiency of yeast cell factories, thereby uncovering metabolic control mechanisms.
Collapse
Affiliation(s)
- Bingyin Peng
- ARC Centre of Excellence in Synthetic Biology, Sydney, NSW 2109, Australia
- Centre of Agriculture and the Bioeconomy, School of Biology and Environmental Science, Faculty of Science, Queensland University of Technology, Brisbane, QLD 4000, Australia
- Australian Institute for Bioengineering and Nanotechnology (AIBN), The University of Queensland, Brisbane, QLD 4072, Australia
| | - Sarah J Weintraub
- Bioinformatics and Computational Biology, Worcester Polytechnic Institute, Worcester, Massachusetts 01609, United States of America
| | - Zeyu Lu
- ARC Centre of Excellence in Synthetic Biology, Sydney, NSW 2109, Australia
- Centre of Agriculture and the Bioeconomy, School of Biology and Environmental Science, Faculty of Science, Queensland University of Technology, Brisbane, QLD 4000, Australia
- Australian Institute for Bioengineering and Nanotechnology (AIBN), The University of Queensland, Brisbane, QLD 4072, Australia
| | - Samuel Evans
- ARC Centre of Excellence in Synthetic Biology, Sydney, NSW 2109, Australia
- Centre of Agriculture and the Bioeconomy, School of Biology and Environmental Science, Faculty of Science, Queensland University of Technology, Brisbane, QLD 4000, Australia
| | - Qianyi Shen
- ARC Centre of Excellence in Synthetic Biology, Sydney, NSW 2109, Australia
- Centre of Agriculture and the Bioeconomy, School of Biology and Environmental Science, Faculty of Science, Queensland University of Technology, Brisbane, QLD 4000, Australia
- Australian Institute for Bioengineering and Nanotechnology (AIBN), The University of Queensland, Brisbane, QLD 4072, Australia
- School of Chemistry and Molecular Biosciences (SCMB), The University of Queensland, Brisbane, QLD4072, Australia
| | - Liam McDonnell
- ARC Centre of Excellence in Synthetic Biology, Sydney, NSW 2109, Australia
- Centre of Agriculture and the Bioeconomy, School of Biology and Environmental Science, Faculty of Science, Queensland University of Technology, Brisbane, QLD 4000, Australia
| | - Manuel Plan
- Australian Institute for Bioengineering and Nanotechnology (AIBN), The University of Queensland, Brisbane, QLD 4072, Australia
- Metabolomics Australia (Queensland Node), Australian Institute for Bioengineering and Nanotechnology (AIBN), The University of Queensland, Brisbane, QLD 4072, Australia
| | - Thomas Collier
- ARC Centre of Excellence in Synthetic Biology, Sydney, NSW 2109, Australia
- School of Natural Sciences, Macquarie University, Sydney, NSW 2109, Australia
| | - Li Chen Cheah
- ARC Centre of Excellence in Synthetic Biology, Sydney, NSW 2109, Australia
- Australian Institute for Bioengineering and Nanotechnology (AIBN), The University of Queensland, Brisbane, QLD 4072, Australia
| | - Lei Ji
- Shandong Provincial Key Laboratory of Applied Microbiology, Ecology Institute, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250103, PR China
| | - Christopher B Howard
- Australian Institute for Bioengineering and Nanotechnology (AIBN), The University of Queensland, Brisbane, QLD 4072, Australia
| | - Will Anderson
- Australian Institute for Bioengineering and Nanotechnology (AIBN), The University of Queensland, Brisbane, QLD 4072, Australia
| | - Matt Trau
- Australian Institute for Bioengineering and Nanotechnology (AIBN), The University of Queensland, Brisbane, QLD 4072, Australia
- School of Chemistry and Molecular Biosciences (SCMB), The University of Queensland, Brisbane, QLD4072, Australia
| | | | - Erin L Bredeweg
- Functional and Systems Biology Group, Environmental Molecular Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Eric M Young
- Department of Chemical Engineering, Worcester Polytechnic Institute, Worcester, Massachusetts 01609, United States
| | - Robert Speight
- ARC Centre of Excellence in Synthetic Biology, Sydney, NSW 2109, Australia
- Centre of Agriculture and the Bioeconomy, School of Biology and Environmental Science, Faculty of Science, Queensland University of Technology, Brisbane, QLD 4000, Australia
- Advanced Engineering Biology Future Science Platform, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Black Mountain, ACT 2601, Australia
| | - Claudia E Vickers
- ARC Centre of Excellence in Synthetic Biology, Sydney, NSW 2109, Australia
- Centre of Agriculture and the Bioeconomy, School of Biology and Environmental Science, Faculty of Science, Queensland University of Technology, Brisbane, QLD 4000, Australia
| |
Collapse
|
12
|
Boob AG, Chen J, Zhao H. Enabling pathway design by multiplex experimentation and machine learning. Metab Eng 2024; 81:70-87. [PMID: 38040110 DOI: 10.1016/j.ymben.2023.11.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 11/01/2023] [Accepted: 11/25/2023] [Indexed: 12/03/2023]
Abstract
The remarkable metabolic diversity observed in nature has provided a foundation for sustainable production of a wide array of valuable molecules. However, transferring the biosynthetic pathway to the desired host often runs into inherent failures that arise from intermediate accumulation and reduced flux resulting from competing pathways within the host cell. Moreover, the conventional trial and error methods utilized in pathway optimization struggle to fully grasp the intricacies of installed pathways, leading to time-consuming and labor-intensive experiments, ultimately resulting in suboptimal yields. Considering these obstacles, there is a pressing need to explore the enzyme expression landscape and identify the optimal pathway configuration for enhanced production of molecules. This review delves into recent advancements in pathway engineering, with a focus on multiplex experimentation and machine learning techniques. These approaches play a pivotal role in overcoming the limitations of traditional methods, enabling exploration of a broader design space and increasing the likelihood of discovering optimal pathway configurations for enhanced production of molecules. We discuss several tools and strategies for pathway design, construction, and optimization for sustainable and cost-effective microbial production of molecules ranging from bulk to fine chemicals. We also highlight major successes in academia and industry through compelling case studies.
Collapse
Affiliation(s)
- Aashutosh Girish Boob
- Department of Chemical and Biomolecular Engineering, University of Illinois Urbana-Champaign, Urbana, IL, 61801, United States; Carl R. Woese Institute for Genomic Biology, University of Illinois Urbana-Champaign, Urbana, IL, 61801, United States; DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Junyu Chen
- Department of Bioengineering, University of Illinois Urbana-Champaign, Urbana, IL, 61801, United States; Carl R. Woese Institute for Genomic Biology, University of Illinois Urbana-Champaign, Urbana, IL, 61801, United States; DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Huimin Zhao
- Department of Chemical and Biomolecular Engineering, University of Illinois Urbana-Champaign, Urbana, IL, 61801, United States; Department of Bioengineering, University of Illinois Urbana-Champaign, Urbana, IL, 61801, United States; Carl R. Woese Institute for Genomic Biology, University of Illinois Urbana-Champaign, Urbana, IL, 61801, United States; DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois Urbana-Champaign, Urbana, Illinois 61801, United States.
| |
Collapse
|
13
|
Yin Y, Arneson R, Apostle A, Eriyagama AMDN, Chillar K, Burke E, Jahfetson M, Yuan Y, Fang S. Long oligodeoxynucleotides: chemical synthesis, isolation via catching-by-polymerization, verification via sequencing, and gene expression demonstration. Beilstein J Org Chem 2023; 19:1957-1965. [PMID: 38170048 PMCID: PMC10760481 DOI: 10.3762/bjoc.19.146] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2023] [Accepted: 12/14/2023] [Indexed: 01/05/2024] Open
Abstract
Long oligodeoxynucleotides (ODNs) are segments of DNAs having over one hundred nucleotides (nt). They are typically assembled using enzymatic methods such as PCR and ligation from shorter 20 to 60 nt ODNs produced by automated de novo chemical synthesis. While these methods have made many projects in areas such as synthetic biology and protein engineering possible, they have various drawbacks. For example, they cannot produce genes and genomes with long repeats and have difficulty to produce sequences containing stable secondary structures. Here, we report a direct de novo chemical synthesis of 400 nt ODNs, and their isolation from the complex reaction mixture using the catching-by-polymerization (CBP) method. To determine the authenticity of the ODNs, 399 and 401 nt ODNs were synthesized and purified with CBP. The two were joined together using Gibson assembly to give the 800 nt green fluorescent protein (GFP) gene construct. The sequence of the construct was verified via Sanger sequencing. To demonstrate the potential use of the long ODN synthesis method, the GFP gene was expressed in E. coli. The long ODN synthesis and isolation method presented here provides a pathway to the production of genes and genomes containing long repeats or stable secondary structures that cannot be produced or are highly challenging to produce using existing technologies.
Collapse
Affiliation(s)
- Yipeng Yin
- Department of Chemistry and Health Research Institute, Michigan Technological University, 1400 Townsend Drive, Houghton, MI 49931, USA,
| | - Reed Arneson
- College of Forest Resources and Environmental Science, Michigan Technological University, 1400 Townsend Drive, Houghton, MI 49931, USA
| | - Alexander Apostle
- Department of Chemistry and Health Research Institute, Michigan Technological University, 1400 Townsend Drive, Houghton, MI 49931, USA,
| | - Adikari M D N Eriyagama
- Department of Chemistry and Health Research Institute, Michigan Technological University, 1400 Townsend Drive, Houghton, MI 49931, USA,
| | - Komal Chillar
- Department of Chemistry and Health Research Institute, Michigan Technological University, 1400 Townsend Drive, Houghton, MI 49931, USA,
| | - Emma Burke
- College of Forest Resources and Environmental Science, Michigan Technological University, 1400 Townsend Drive, Houghton, MI 49931, USA
| | - Martina Jahfetson
- Department of Chemistry and Health Research Institute, Michigan Technological University, 1400 Townsend Drive, Houghton, MI 49931, USA,
| | - Yinan Yuan
- College of Forest Resources and Environmental Science, Michigan Technological University, 1400 Townsend Drive, Houghton, MI 49931, USA
| | - Shiyue Fang
- Department of Chemistry and Health Research Institute, Michigan Technological University, 1400 Townsend Drive, Houghton, MI 49931, USA,
| |
Collapse
|
14
|
Cavero Rozas GM, Mandujano JMC, Chombo YAF, Rencoret DVM, Ortiz Mora YM, Pescarmona MEG, Torres AJD. pyBrick-DNA: A Python-Based Environment for Automated Genetic Component Assembly. J Comput Biol 2023; 30:1315-1321. [PMID: 38010519 DOI: 10.1089/cmb.2023.0008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2023] Open
Abstract
Genetic component assembly is key in the simulation and implementation of genetic circuits. Automating this process, thus accelerating prototyping, is a necessity. We present pyBrick-DNA, a software written in Python, that assembles components for the construction of genetic circuits. pyBrick-DNA (colab.pyBrick.com) is a user-friendly environment where scientists can select genetic sequences or input custom sequences to build genetic assemblies. All components are modularly fused to generate a ready-to-go single DNA fragment. It includes Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) and plant gene-editing components. Hence, pyBrick-DNA can generate a functional CRISPR construct composed of a single-guided RNA integrated with Cas9, promoters, and terminator elements. The outcome is a DNA sequence, along with a graphical representation, composed of user-selected genetic parts, ready to be synthesized and cloned in vivo. Moreover, the sequence can be exported as a GenBank file allowing its use with other synthetic biology tools.
Collapse
Affiliation(s)
- Gladys M Cavero Rozas
- Department of Bioengineering and Chemical Engineering, University of Engineering and Technology (UTEC), Barranco, Lima, Peru
| | - Jose M Cisneros Mandujano
- Department of Bioengineering and Chemical Engineering, University of Engineering and Technology (UTEC), Barranco, Lima, Peru
| | - Yomali A Ferreyra Chombo
- Department of Bioengineering and Chemical Engineering, University of Engineering and Technology (UTEC), Barranco, Lima, Peru
| | - Daniela V Moreno Rencoret
- School of Informatics and Telecommunications, Faculty of Engineering and Sciences, Universidad Diego Portales, Santiago, Chile
| | - Yerko M Ortiz Mora
- School of Informatics and Telecommunications, Faculty of Engineering and Sciences, Universidad Diego Portales, Santiago, Chile
| | - Martín E Gutiérrez Pescarmona
- School of Informatics and Telecommunications, Faculty of Engineering and Sciences, Universidad Diego Portales, Santiago, Chile
| | - Alberto J Donayre Torres
- Department of Bioengineering and Chemical Engineering, University of Engineering and Technology (UTEC), Barranco, Lima, Peru
| |
Collapse
|
15
|
Khamwachirapithak P, Sae-Tang K, Mhuantong W, Tanapongpipat S, Zhao XQ, Liu CG, Wei DQ, Champreda V, Runguphan W. Optimizing Ethanol Production in Saccharomyces cerevisiae at Ambient and Elevated Temperatures through Machine Learning-Guided Combinatorial Promoter Modifications. ACS Synth Biol 2023; 12:2897-2908. [PMID: 37681736 PMCID: PMC10594650 DOI: 10.1021/acssynbio.3c00199] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Indexed: 09/09/2023]
Abstract
Bioethanol has gained popularity in recent decades as an ecofriendly alternative to fossil fuels due to increasing concerns about global climate change. However, economically viable ethanol fermentation remains a challenge. High-temperature fermentation can reduce production costs, but Saccharomyces cerevisiae yeast strains normally ferment poorly under high temperatures. In this study, we present a machine learning (ML) approach to optimize bioethanol production in S. cerevisiae by fine-tuning the promoter activities of three endogenous genes. We created 216 combinatorial strains of S. cerevisiae by replacing native promoters with five promoters of varying strengths to regulate ethanol production. Promoter replacement resulted in a 63% improvement in ethanol production at 30 °C. We created an ML-guided workflow by utilizing XGBoost to train high-performance models based on promoter strengths and cellular metabolite concentrations obtained from ethanol production of 216 combinatorial strains at 30 °C. This strategy was then applied to optimize ethanol production at 40 °C, where we selected 31 strains for experimental fermentation. This reduced experimental load led to a 7.4% increase in ethanol production in the second round of the ML-guided workflow. Our study offers a comprehensive library of promoter strength modifications for key ethanol production enzymes, showcasing how machine learning can guide yeast strain optimization and make bioethanol production more cost-effective and efficient. Furthermore, we demonstrate that metabolic engineering processes can be accelerated and optimized through this approach.
Collapse
Affiliation(s)
- Peerapat Khamwachirapithak
- National
Center for Genetic Engineering and Biotechnology (BIOTEC), National Science and Technology Development Agency
(NSTDA) 111 Thailand Science Park, Phahonyothin Road, Khlong
Nueng, Khlong Luang, Pathum Thani 12120, Thailand
| | - Kittapong Sae-Tang
- National
Center for Genetic Engineering and Biotechnology (BIOTEC), National Science and Technology Development Agency
(NSTDA) 111 Thailand Science Park, Phahonyothin Road, Khlong
Nueng, Khlong Luang, Pathum Thani 12120, Thailand
| | - Wuttichai Mhuantong
- National
Center for Genetic Engineering and Biotechnology (BIOTEC), National Science and Technology Development Agency
(NSTDA) 111 Thailand Science Park, Phahonyothin Road, Khlong
Nueng, Khlong Luang, Pathum Thani 12120, Thailand
| | - Sutipa Tanapongpipat
- National
Center for Genetic Engineering and Biotechnology (BIOTEC), National Science and Technology Development Agency
(NSTDA) 111 Thailand Science Park, Phahonyothin Road, Khlong
Nueng, Khlong Luang, Pathum Thani 12120, Thailand
| | - Xin-Qing Zhao
- State
Key Laboratory of Microbial Metabolism, Joint International Research
Laboratory of Metabolic & Developmental Sciences, School of Life
Sciences and Biotechnology, Shanghai Jiao
Tong University, Shanghai 200240, People’s
Republic of China
| | - Chen-Guang Liu
- State
Key Laboratory of Microbial Metabolism, Joint International Research
Laboratory of Metabolic & Developmental Sciences, School of Life
Sciences and Biotechnology, Shanghai Jiao
Tong University, Shanghai 200240, People’s
Republic of China
| | - Dong-Qing Wei
- Department
of Bioinformatics and Biological Statistics, School of Life Sciences
and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, People’s Republic of China
| | - Verawat Champreda
- National
Center for Genetic Engineering and Biotechnology (BIOTEC), National Science and Technology Development Agency
(NSTDA) 111 Thailand Science Park, Phahonyothin Road, Khlong
Nueng, Khlong Luang, Pathum Thani 12120, Thailand
| | - Weerawat Runguphan
- National
Center for Genetic Engineering and Biotechnology (BIOTEC), National Science and Technology Development Agency
(NSTDA) 111 Thailand Science Park, Phahonyothin Road, Khlong
Nueng, Khlong Luang, Pathum Thani 12120, Thailand
| |
Collapse
|
16
|
Michailidou F. The Scent of Change: Sustainable Fragrances Through Industrial Biotechnology. Chembiochem 2023; 24:e202300309. [PMID: 37668275 DOI: 10.1002/cbic.202300309] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2023] [Revised: 05/29/2023] [Indexed: 09/06/2023]
Abstract
Current environmental and safety considerations urge innovation to address the need for sustainable high-value chemicals that are embraced by consumers. This review discusses the concept of sustainable fragrances, as high-value, everyday and everywhere chemicals. Current and emerging technologies represent an opportunity to produce fragrances in an environmentally and socially responsible way. Biotechnology, including fermentation, biocatalysis, and genetic engineering, has the potential to reduce the environmental footprint of fragrance production while maintaining quality and consistency. Computational and in silico methods, including machine learning (ML), are also likely to augment the capabilities of sustainable fragrance production. Continued innovation and collaboration will be crucial to the future of sustainable fragrances, with a focus on developing novel sustainable ingredients, as well as ethical sourcing practices.
Collapse
Affiliation(s)
- Freideriki Michailidou
- Department of Health Sciences and Technology, ETH Zurich, Schmelzbergstrasse 9, 8092, Zürich, Switzerland
| |
Collapse
|
17
|
Wei X, Chen Z, Liu A, Yang L, Xu Y, Cao M, He N. Advanced strategies for metabolic engineering of Bacillus to produce extracellular polymeric substances. Biotechnol Adv 2023; 67:108199. [PMID: 37330153 DOI: 10.1016/j.biotechadv.2023.108199] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2023] [Revised: 05/24/2023] [Accepted: 06/11/2023] [Indexed: 06/19/2023]
Abstract
Extracellular polymeric substances are mainly synthesized via a variety of biosynthetic pathways in bacteria. Bacilli-sourced extracellular polymeric substances, such as exopolysaccharides (EPS) and poly-γ-glutamic acid (γ-PGA), can serve as active ingredients and hydrogels, and have other important industrial applications. However, the functional diversity and widespread applications of these extracellular polymeric substances, are hampered by their low yields and high costs. Biosynthesis of extracellular polymeric substances is very complex in Bacillus, and there is no detailed elucidation of the reactions and regulations among various metabolic pathways. Therefore, a better understanding of the metabolic mechanisms is required to broaden the functions and increase the yield of extracellular polymeric substances. This review systematically summarizes the biosynthesis and metabolic mechanisms of extracellular polymeric substances in Bacillus, providing an in-depth understanding of the relationships between EPS and γ-PGA synthesis. This review provides a better clarification of Bacillus metabolic mechanisms during extracellular polymeric substance secretion and thus benefits their application and commercialization.
Collapse
Affiliation(s)
- Xiaoyu Wei
- Department of Chemical and Biochemical Engineering, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China; The Key Lab for Synthetic Biotechnology of Xiamen City, Xiamen University, Xiamen 361005, China
| | - Zhen Chen
- College of Life Science, Xinyang Normal University, Xinyang 464000, China.
| | - Ailing Liu
- Department of Chemical and Biochemical Engineering, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China; The Key Lab for Synthetic Biotechnology of Xiamen City, Xiamen University, Xiamen 361005, China
| | - Lijie Yang
- Department of Chemical and Biochemical Engineering, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China; The Key Lab for Synthetic Biotechnology of Xiamen City, Xiamen University, Xiamen 361005, China
| | - Yiyuan Xu
- Department of Chemical and Biochemical Engineering, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China; The Key Lab for Synthetic Biotechnology of Xiamen City, Xiamen University, Xiamen 361005, China
| | - Mingfeng Cao
- Department of Chemical and Biochemical Engineering, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China; The Key Lab for Synthetic Biotechnology of Xiamen City, Xiamen University, Xiamen 361005, China; Innovation Laboratory for Sciences and Technologies of Energy Materials of Fujian Province (IKKEM), Xiamen 361005, China.
| | - Ning He
- Department of Chemical and Biochemical Engineering, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China; The Key Lab for Synthetic Biotechnology of Xiamen City, Xiamen University, Xiamen 361005, China.
| |
Collapse
|
18
|
Huttanus HM, Triola EKH, Velasquez-Guzman JC, Shin SM, Granja-Travez RS, Singh A, Dale T, Jha RK. Targeted mutagenesis and high-throughput screening of diversified gene and promoter libraries for isolating gain-of-function mutations. Front Bioeng Biotechnol 2023; 11:1202388. [PMID: 37545889 PMCID: PMC10400447 DOI: 10.3389/fbioe.2023.1202388] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2023] [Accepted: 06/25/2023] [Indexed: 08/08/2023] Open
Abstract
Targeted mutagenesis of a promoter or gene is essential for attaining new functions in microbial and protein engineering efforts. In the burgeoning field of synthetic biology, heterologous genes are expressed in new host organisms. Similarly, natural or designed proteins are mutagenized at targeted positions and screened for gain-of-function mutations. Here, we describe methods to attain complete randomization or controlled mutations in promoters or genes. Combinatorial libraries of one hundred thousands to tens of millions of variants can be created using commercially synthesized oligonucleotides, simply by performing two rounds of polymerase chain reactions. With a suitably engineered reporter in a whole cell, these libraries can be screened rapidly by performing fluorescence-activated cell sorting (FACS). Within a few rounds of positive and negative sorting based on the response from the reporter, the library can rapidly converge to a few optimal or extremely rare variants with desired phenotypes. Library construction, transformation and sequence verification takes 6-9 days and requires only basic molecular biology lab experience. Screening the library by FACS takes 3-5 days and requires training for the specific cytometer used. Further steps after sorting, including colony picking, sequencing, verification, and characterization of individual clones may take longer, depending on number of clones and required experiments.
Collapse
Affiliation(s)
- Herbert M. Huttanus
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, United States
- Agile BioFoundry, Emeryville, CA, United States
| | - Ellin-Kristina H. Triola
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, United States
- Agile BioFoundry, Emeryville, CA, United States
| | - Jeanette C. Velasquez-Guzman
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, United States
- Agile BioFoundry, Emeryville, CA, United States
| | - Sang-Min Shin
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, United States
- BOTTLE Consortium, Golden, CO, United States
| | - Rommel S. Granja-Travez
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, United States
- BOTTLE Consortium, Golden, CO, United States
| | - Anmoldeep Singh
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, United States
| | - Taraka Dale
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, United States
- Agile BioFoundry, Emeryville, CA, United States
- BOTTLE Consortium, Golden, CO, United States
| | - Ramesh K. Jha
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, United States
- Agile BioFoundry, Emeryville, CA, United States
- BOTTLE Consortium, Golden, CO, United States
| |
Collapse
|
19
|
Su H, Lin J. Biosynthesis pathways of expanding carbon chains for producing advanced biofuels. BIOTECHNOLOGY FOR BIOFUELS AND BIOPRODUCTS 2023; 16:109. [PMID: 37400889 DOI: 10.1186/s13068-023-02340-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Accepted: 05/11/2023] [Indexed: 07/05/2023]
Abstract
Because the thermodynamic property is closer to gasoline, advanced biofuels (C ≥ 6) are appealing for replacing non-renewable fossil fuels using biosynthesis method that has presented a promising approach. Synthesizing advanced biofuels (C ≥ 6), in general, requires the expansion of carbon chains from three carbon atoms to more than six carbon atoms. Despite some specific biosynthesis pathways that have been developed in recent years, adequate summary is still lacking on how to obtain an effective metabolic pathway. Review of biosynthesis pathways for expanding carbon chains will be conducive to selecting, optimizing and discovering novel synthetic route to obtain new advanced biofuels. Herein, we first highlighted challenges on expanding carbon chains, followed by presentation of two biosynthesis strategies and review of three different types of biosynthesis pathways of carbon chain expansion for synthesizing advanced biofuels. Finally, we provided an outlook for the introduction of gene-editing technology in the development of new biosynthesis pathways of carbon chain expansion.
Collapse
Affiliation(s)
- Haifeng Su
- Key Laboratory of Degraded and Unused Land Consolidation Engineering, The Ministry of Natural and Resources, Xian, 710075, Shanxi, China
| | - JiaFu Lin
- Antibiotics Research and Re-Evaluation Key Laboratory of Sichuan Province, Sichuan Industrial Institute of Antibiotics, School of Pharmacy, Chengdu University, Chengdu, 610106, China.
| |
Collapse
|
20
|
Muhamadali H, Winder CL, Dunn WB, Goodacre R. Unlocking the secrets of the microbiome: exploring the dynamic microbial interplay with humans through metabolomics and their manipulation for synthetic biology applications. Biochem J 2023; 480:891-908. [PMID: 37378961 PMCID: PMC10317162 DOI: 10.1042/bcj20210534] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Revised: 06/12/2023] [Accepted: 06/16/2023] [Indexed: 06/29/2023]
Abstract
Metabolomics is a powerful research discovery tool with the potential to measure hundreds to low thousands of metabolites. In this review, we discuss the application of GC-MS and LC-MS in discovery-based metabolomics research, we define metabolomics workflows and we highlight considerations that need to be addressed in order to generate robust and reproducible data. We stress that metabolomics is now routinely applied across the biological sciences to study microbiomes from relatively simple microbial systems to their complex interactions within consortia in the host and the environment and highlight this in a range of biological species and mammalian systems including humans. However, challenges do still exist that need to be overcome to maximise the potential for metabolomics to help us understanding biological systems. To demonstrate the potential of the approach we discuss the application of metabolomics in two broad research areas: (1) synthetic biology to increase the production of high-value fine chemicals and reduction in secondary by-products and (2) gut microbial interaction with the human host. While burgeoning in importance, the latter is still in its infancy and will benefit from the development of tools to detangle host-gut-microbial interactions and their impact on human health and diseases.
Collapse
Affiliation(s)
- Howbeer Muhamadali
- Centre for Metabolomics Research, Department of Biochemistry, Cell and Systems Biology, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, U.K
| | - Catherine L. Winder
- Centre for Metabolomics Research, Department of Biochemistry, Cell and Systems Biology, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, U.K
| | - Warwick B. Dunn
- Centre for Metabolomics Research, Department of Biochemistry, Cell and Systems Biology, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, U.K
| | - Royston Goodacre
- Centre for Metabolomics Research, Department of Biochemistry, Cell and Systems Biology, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, U.K
| |
Collapse
|
21
|
Hueso-Gil A, Calles B, de Lorenzo V. In Vivo Sampling of Intracellular Heterogeneity of Pseudomonas putida Enables Multiobjective Optimization of Genetic Devices. ACS Synth Biol 2023; 12:1667-1676. [PMID: 37196337 PMCID: PMC10278179 DOI: 10.1021/acssynbio.3c00009] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2023] [Indexed: 05/19/2023]
Abstract
The inner physicochemical heterogeneity of bacterial cells generates three-dimensional (3D)-dependent variations of resources for effective expression of given chromosomally located genes. This fact has been exploited for adjusting the most favorable parameters for implanting a complex device for optogenetic control of biofilm formation in the soil bacterium Pseudomonas putida. To this end, a DNA segment encoding a superactive variant of the Caulobacter crescendus diguanylate cyclase PleD expressed under the control of the cyanobacterial light-responsive CcaSR system was placed in a mini-Tn5 transposon vector and randomly inserted through the chromosome of wild-type and biofilm-deficient variants of P. putida lacking the wsp gene cluster. This operation delivered a collection of clones covering a whole range of biofilm-building capacities and dynamic ranges in response to green light. Since the phenotypic output of the device depends on a large number of parameters (multiple promoters, RNA stability, translational efficacy, metabolic precursors, protein folding, etc.), we argue that random chromosomal insertions enable sampling the intracellular milieu for an optimal set of resources that deliver a preset phenotypic specification. Results thus support the notion that the context dependency can be exploited as a tool for multiobjective optimization, rather than a foe to be suppressed in Synthetic Biology constructs.
Collapse
Affiliation(s)
| | - Belén Calles
- Systems Biology Department, Centro Nacional de Biotecnología-CSIC, Campus
de Cantoblanco, Madrid 28049, Spain
| | - Víctor de Lorenzo
- Systems Biology Department, Centro Nacional de Biotecnología-CSIC, Campus
de Cantoblanco, Madrid 28049, Spain
| |
Collapse
|
22
|
Kelwick RJR, Webb AJ, Freemont PS. Opportunities for engineering outer membrane vesicles using synthetic biology approaches. EXTRACELLULAR VESICLES AND CIRCULATING NUCLEIC ACIDS 2023; 4:255-261. [PMID: 39697987 PMCID: PMC11648402 DOI: 10.20517/evcna.2023.21] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/02/2023] [Revised: 06/05/2023] [Accepted: 06/08/2023] [Indexed: 12/20/2024]
Abstract
Gram-negative bacteria naturally shed lipid vesicles, which contain complex molecular cargoes, from their outer membrane. These outer membrane vesicles (OMVs) have important biological functions relating to microbial stress responses, microbiome regulation, and host-pathogen interactions. OMVs are also attractive vehicles for delivering drugs, vaccines, and other therapeutic agents because of their ability to interact with host cells and their natural immunogenic properties. OMVs are also set to have a positive impact on other biotechnological and medical applications including diagnostics, bioremediation, and metabolic engineering. We envision that the field of synthetic biology offers a compelling opportunity to further expand and accelerate the foundational research and downstream applications of OMVs in a range of applications including the provision of OMV-based healthcare technologies. In our opinion, we discuss how current and potential future synergies between OMV research and synthetic biology approaches might help to further accelerate OMV research and real-world applications for the benefit of animal and human health.
Collapse
Affiliation(s)
- Richard J. R. Kelwick
- Section of Structural and Synthetic Biology, Department of Infectious Disease, Imperial College London, London SW7 2AZ, UK
- Authors contributed equally
| | - Alexander J. Webb
- Section of Structural and Synthetic Biology, Department of Infectious Disease, Imperial College London, London SW7 2AZ, UK
- UK Dementia Research Institute Care Research and Technology Centre, Imperial College London, Hammersmith Campus, London W12 0NN, UK
- Authors contributed equally
| | - Paul S. Freemont
- Section of Structural and Synthetic Biology, Department of Infectious Disease, Imperial College London, London SW7 2AZ, UK
- UK Dementia Research Institute Care Research and Technology Centre, Imperial College London, Hammersmith Campus, London W12 0NN, UK
- The London Biofoundry, Imperial College Translation & Innovation Hub, White City Campus, London W12 0BZ, UK
| |
Collapse
|
23
|
Helleckes LM, Hemmerich J, Wiechert W, von Lieres E, Grünberger A. Machine learning in bioprocess development: from promise to practice. Trends Biotechnol 2023; 41:817-835. [PMID: 36456404 DOI: 10.1016/j.tibtech.2022.10.010] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Revised: 10/20/2022] [Accepted: 10/27/2022] [Indexed: 11/30/2022]
Abstract
Fostered by novel analytical techniques, digitalization, and automation, modern bioprocess development provides large amounts of heterogeneous experimental data, containing valuable process information. In this context, data-driven methods like machine learning (ML) approaches have great potential to rationally explore large design spaces while exploiting experimental facilities most efficiently. Herein we demonstrate how ML methods have been applied so far in bioprocess development, especially in strain engineering and selection, bioprocess optimization, scale-up, monitoring, and control of bioprocesses. For each topic, we will highlight successful application cases, current challenges, and point out domains that can potentially benefit from technology transfer and further progress in the field of ML.
Collapse
Affiliation(s)
- Laura M Helleckes
- Institute for Bio- and Geosciences (IBG-1), Forschungszentrum Jülich GmbH, 52428 Jülich, Germany; RWTH Aachen University, Templergraben 55, 52062 Aachen, Germany
| | - Johannes Hemmerich
- Institute for Bio- and Geosciences (IBG-1), Forschungszentrum Jülich GmbH, 52428 Jülich, Germany
| | - Wolfgang Wiechert
- Institute for Bio- and Geosciences (IBG-1), Forschungszentrum Jülich GmbH, 52428 Jülich, Germany; RWTH Aachen University, Templergraben 55, 52062 Aachen, Germany
| | - Eric von Lieres
- Institute for Bio- and Geosciences (IBG-1), Forschungszentrum Jülich GmbH, 52428 Jülich, Germany; RWTH Aachen University, Templergraben 55, 52062 Aachen, Germany
| | - Alexander Grünberger
- Multiscale Bioengineering, Technical Faculty, Bielefeld University, Universitätsstr. 25, 33615 Bielefeld, Germany; Center for Biotechnology (CeBiTec), Bielefeld University, Universitätsstr. 25, 33615 Bielefeld, Germany; Institute of Process Engineering in Life Sciences, Section III: Microsystems in Bioprocess Engineering, Karlsruhe Institute of Technology, Fritz-Haber-Weg 2, 76131, Karlsruhe, Germany.
| |
Collapse
|
24
|
Gurdo N, Volke DC, McCloskey D, Nikel PI. Automating the design-build-test-learn cycle towards next-generation bacterial cell factories. N Biotechnol 2023; 74:1-15. [PMID: 36736693 DOI: 10.1016/j.nbt.2023.01.002] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2022] [Revised: 01/15/2023] [Accepted: 01/22/2023] [Indexed: 02/04/2023]
Abstract
Automation is playing an increasingly significant role in synthetic biology. Groundbreaking technologies, developed over the past 20 years, have enormously accelerated the construction of efficient microbial cell factories. Integrating state-of-the-art tools (e.g. for genome engineering and analytical techniques) into the design-build-test-learn cycle (DBTLc) will shift the metabolic engineering paradigm from an almost artisanal labor towards a fully automated workflow. Here, we provide a perspective on how a fully automated DBTLc could be harnessed to construct the next-generation bacterial cell factories in a fast, high-throughput fashion. Innovative toolsets and approaches that pushed the boundaries in each segment of the cycle are reviewed to this end. We also present the most recent efforts on automation of the DBTLc, which heralds a fully autonomous pipeline for synthetic biology in the near future.
Collapse
Affiliation(s)
- Nicolás Gurdo
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800 Kongens, Lyngby, Denmark
| | - Daniel C Volke
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800 Kongens, Lyngby, Denmark
| | - Douglas McCloskey
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800 Kongens, Lyngby, Denmark
| | - Pablo Iván Nikel
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800 Kongens, Lyngby, Denmark.
| |
Collapse
|
25
|
Park H, Faulkner M, Toogood HS, Chen GQ, Scrutton N. Online Omics Platform Expedites Industrial Application of Halomonas bluephagenesis TD1.0. Bioinform Biol Insights 2023; 17:11779322231171779. [PMID: 37200674 PMCID: PMC10185862 DOI: 10.1177/11779322231171779] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Accepted: 04/07/2023] [Indexed: 05/20/2023] Open
Abstract
Multi-omic data mining has the potential to revolutionize synthetic biology especially in non-model organisms that have not been extensively studied. However, tangible engineering direction from computational analysis remains elusive due to the interpretability of large datasets and the difficulty in analysis for non-experts. New omics data are generated faster than our ability to use and analyse results effectively, resulting in strain development that proceeds through classic methods of trial-and-error without insight into complex cell dynamics. Here we introduce a user-friendly, interactive website hosting multi-omics data. Importantly, this new platform allows non-experts to explore questions in an industrially important chassis whose cellular dynamics are still largely unknown. The web platform contains a complete KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway enrichment analysis derived from principal components analysis, an interactive bio-cluster heatmap analysis of genes, and the Halomonas TD1.0 genome-scale metabolic (GEM) model. As a case study of the effectiveness of this platform, we applied unsupervised machine learning to determine key differences between Halomonas bluephagenesis TD1.0 cultivated under varied conditions. Specifically, cell motility and flagella apparatus are identified to drive energy expenditure usage at different osmolarities, and predictions were verified experimentally using microscopy and fluorescence labelled flagella staining. As more omics projects are completed, this landing page will facilitate exploration and targeted engineering efforts of the robust, industrial chassis H bluephagenesis for researchers without extensive bioinformatics background.
Collapse
Affiliation(s)
- Helen Park
- EPSRC/BBSRC Future Biomanufacturing Research Hub and BBSRC Synthetic Biology Research Centre SYNBIOCHEM, Manchester Institute of Biotechnology and Department of Chemistry, The University of Manchester, Manchester, UK
- Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua-Peking Center for Life Sciences, Tsinghua University, Beijing, China
| | - Matthew Faulkner
- EPSRC/BBSRC Future Biomanufacturing Research Hub and BBSRC Synthetic Biology Research Centre SYNBIOCHEM, Manchester Institute of Biotechnology and Department of Chemistry, The University of Manchester, Manchester, UK
| | - Helen S Toogood
- EPSRC/BBSRC Future Biomanufacturing Research Hub and BBSRC Synthetic Biology Research Centre SYNBIOCHEM, Manchester Institute of Biotechnology and Department of Chemistry, The University of Manchester, Manchester, UK
| | - Guo-Qiang Chen
- Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua-Peking Center for Life Sciences, Tsinghua University, Beijing, China
| | - Nigel Scrutton
- EPSRC/BBSRC Future Biomanufacturing Research Hub and BBSRC Synthetic Biology Research Centre SYNBIOCHEM, Manchester Institute of Biotechnology and Department of Chemistry, The University of Manchester, Manchester, UK
| |
Collapse
|
26
|
Cheng Y, Bi X, Xu Y, Liu Y, Li J, Du G, Lv X, Liu L. Machine learning for metabolic pathway optimization: A review. Comput Struct Biotechnol J 2023; 21:2381-2393. [PMID: 38213889 PMCID: PMC10781721 DOI: 10.1016/j.csbj.2023.03.045] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Revised: 03/24/2023] [Accepted: 03/25/2023] [Indexed: 03/29/2023] Open
Abstract
Optimizing the metabolic pathways of microbial cell factories is essential for establishing viable biotechnological production processes. However, due to the limited understanding of the complex setup of cellular machinery, building efficient microbial cell factories remains tedious and time-consuming. Machine learning (ML), a powerful tool capable of identifying patterns within large datasets, has been used to analyze biological datasets generated using various high-throughput technologies to build data-driven models for complex bioprocesses. In addition, ML can also be integrated with Design-Build-Test-Learn to accelerate development. This review focuses on recent ML applications in genome-scale metabolic model construction, multistep pathway optimization, rate-limiting enzyme engineering, and gene regulatory element designing. In addition, we have discussed some limitations of these methods as well as potential solutions.
Collapse
Affiliation(s)
- Yang Cheng
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
- Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Xinyu Bi
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
- Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Yameng Xu
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
- Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Yanfeng Liu
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
- Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Jianghua Li
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
- Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Guocheng Du
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
- Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Xueqin Lv
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
- Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Long Liu
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
- Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi 214122, China
| |
Collapse
|
27
|
Yang CT, Kristiani E, Leong YK, Chang JS. Big data and machine learning driven bioprocessing - Recent trends and critical analysis. BIORESOURCE TECHNOLOGY 2023; 372:128625. [PMID: 36642201 DOI: 10.1016/j.biortech.2023.128625] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Revised: 01/09/2023] [Accepted: 01/11/2023] [Indexed: 06/17/2023]
Abstract
Given the potential of machine learning algorithms in revolutionizing the bioengineering field, this paper examined and summarized the literature related to artificial intelligence (AI) in the bioprocessing field. Natural language processing (NLP) was employed to explore the direction of the research domain. All the papers from 2013 to 2022 with specific keywords of bioprocessing using AI were extracted from Scopus and grouped into two five-year periods of 2013-to-2017 and 2018-to-2022, where the past and recent research directions were compared. Based on this procedure, selected sample papers from recent five years were subjected to further review and analysis. The result shows that 50% of the publications in the past five-year focused on topics related to hybrid models, ANN, biopharmaceutical manufacturing, and biorefinery. The summarization and analysis of the outcome indicated that implementing AI could improve the design and process engineering strategies in bioprocessing fields.
Collapse
Affiliation(s)
- Chao-Tung Yang
- Department of Computer Science, Tunghai University, Taichung 407224, Taiwan; Research Center for Smart Sustainable Circular Economy, Tunghai University, Taichung 407224, Taiwan
| | - Endah Kristiani
- Department of Computer Science, Tunghai University, Taichung 407224, Taiwan; Department of Informatics, Krida Wacana Christian University, Jakarta 11470, Indonesia
| | - Yoong Kit Leong
- Research Center for Smart Sustainable Circular Economy, Tunghai University, Taichung 407224, Taiwan; Department of Chemical and Materials Engineering, Tunghai University, Taichung 407224, Taiwan
| | - Jo-Shu Chang
- Research Center for Smart Sustainable Circular Economy, Tunghai University, Taichung 407224, Taiwan; Department of Chemical and Materials Engineering, Tunghai University, Taichung 407224, Taiwan; Department of Chemical Engineering, National Cheng Kung University, Tainan 701, Taiwan.
| |
Collapse
|
28
|
Sanka I, Kusuma AB, Martha F, Hendrawan A, Pramanda IT, Wicaksono A, Jati AP, Mazaya M, Dwijayanti A, Izzati N, Maulana MF, Widyaningrum AR. Synthetic biology in Indonesia: Potential and projection in a country with mega biodiversity. BIOTECHNOLOGY NOTES (AMSTERDAM, NETHERLANDS) 2023; 4:41-48. [PMID: 39416916 PMCID: PMC11446346 DOI: 10.1016/j.biotno.2023.02.002] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/30/2022] [Revised: 02/14/2023] [Accepted: 02/14/2023] [Indexed: 10/19/2024]
Abstract
Synthetic biology has gained many interest around the globe in the last two decades, not only due to its rapid development but also the potential to provide addressable solutions using standardized design of biological systems. Considering its huge population, biodiversity, and natural resources, Indonesia could play an important role in shaping the future of synthetic biology towards a sustainable bio-circular economy. Here, we provide an overview of synthetic biology development in Indonesia, especially on exploring the potential of our biodiversity. We also discuss some potentials of synthetic biology in solving national issues. Furthermore, we also provide the projection and future landscape of synthetic biology development in Indonesia. In addition, we briefly explain the potential challenges that may arise during the development.
Collapse
Affiliation(s)
- Immanuel Sanka
- Synthetic Biology Indonesia (Synbio.id), Jl. Raya Lintas Sumbawa-Bima, Block AA No. 1, Boak Village, Subdistrict Unter Iwes, 84316, Sumbawa, Indonesia
- Department of Chemistry and Biotechnology, Tallinn University of Technology, Akadeemia tee 15, 12618, Tallinn, Estonia
| | - Ali Budhi Kusuma
- Synthetic Biology Indonesia (Synbio.id), Jl. Raya Lintas Sumbawa-Bima, Block AA No. 1, Boak Village, Subdistrict Unter Iwes, 84316, Sumbawa, Indonesia
- Indonesian Centre for Extremophile Bioresources and Biotechnology (ICEBB), Faculty of Life Sciences and Technology, Sumbawa University of Technology, Jl. Raya Olat Maras Sumbawa, 84371, Indonesia
| | - Faustina Martha
- Synthetic Biology Indonesia (Synbio.id), Jl. Raya Lintas Sumbawa-Bima, Block AA No. 1, Boak Village, Subdistrict Unter Iwes, 84316, Sumbawa, Indonesia
- Science Communication Steering Committee, iGEM Foundation, 45 Prospect St, Cambridge, MA, 02139, United States
| | - Andre Hendrawan
- Synthetic Biology Indonesia (Synbio.id), Jl. Raya Lintas Sumbawa-Bima, Block AA No. 1, Boak Village, Subdistrict Unter Iwes, 84316, Sumbawa, Indonesia
| | - Ihsan Tria Pramanda
- Synthetic Biology Indonesia (Synbio.id), Jl. Raya Lintas Sumbawa-Bima, Block AA No. 1, Boak Village, Subdistrict Unter Iwes, 84316, Sumbawa, Indonesia
- Department of Bio Technology, Indonesia International Institute for Life Sciences (i3L), Jl. Pulomas Barat Kav. 88, Pulomas, Jakarta, 13210, Indonesia
| | - Adhityo Wicaksono
- Synthetic Biology Indonesia (Synbio.id), Jl. Raya Lintas Sumbawa-Bima, Block AA No. 1, Boak Village, Subdistrict Unter Iwes, 84316, Sumbawa, Indonesia
- Division of Biotechnology, Genbinesia Foundation, Jalan Swadaya Barat no. 4, Gresik, 61171, Indonesia
| | - Afif Pranaya Jati
- Synthetic Biology Indonesia (Synbio.id), Jl. Raya Lintas Sumbawa-Bima, Block AA No. 1, Boak Village, Subdistrict Unter Iwes, 84316, Sumbawa, Indonesia
- Infection Program, Monash Biomedicine Discovery Institute and Department of Microbiology, Monash University, Melbourne, Victoria, Australia
| | - Maulida Mazaya
- Research Center for Computing, Research Organization for Electronics and Informatics, National Research and Innovation Agency (BRIN), Cibinong Science Center, Jl. Raya Jakarta-Bogor KM 46, Cibinong, 16911, West Java, Indonesia
| | - Ari Dwijayanti
- Synthetic Biology Indonesia (Synbio.id), Jl. Raya Lintas Sumbawa-Bima, Block AA No. 1, Boak Village, Subdistrict Unter Iwes, 84316, Sumbawa, Indonesia
| | - Nurul Izzati
- Synthetic Biology Indonesia (Synbio.id), Jl. Raya Lintas Sumbawa-Bima, Block AA No. 1, Boak Village, Subdistrict Unter Iwes, 84316, Sumbawa, Indonesia
- Indonesian Centre for Extremophile Bioresources and Biotechnology (ICEBB), Faculty of Life Sciences and Technology, Sumbawa University of Technology, Jl. Raya Olat Maras Sumbawa, 84371, Indonesia
| | - Muhammad Farhan Maulana
- Synthetic Biology Indonesia (Synbio.id), Jl. Raya Lintas Sumbawa-Bima, Block AA No. 1, Boak Village, Subdistrict Unter Iwes, 84316, Sumbawa, Indonesia
| | - Aulia Reski Widyaningrum
- Synthetic Biology Indonesia (Synbio.id), Jl. Raya Lintas Sumbawa-Bima, Block AA No. 1, Boak Village, Subdistrict Unter Iwes, 84316, Sumbawa, Indonesia
| |
Collapse
|
29
|
Singh B, Kumar A, Saini AK, Saini RV, Thakur R, Mohammed SA, Tuli HS, Gupta VK, Areeshi MY, Faidah H, Jalal NA, Haque S. Strengthening microbial cell factories for efficient production of bioactive molecules. Biotechnol Genet Eng Rev 2023:1-34. [PMID: 36809927 DOI: 10.1080/02648725.2023.2177039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2022] [Accepted: 01/21/2023] [Indexed: 02/24/2023]
Abstract
High demand of bioactive molecules (food additives, antibiotics, plant growth enhancers, cosmetics, pigments and other commercial products) is the prime need for the betterment of human life where the applicability of the synthetic chemical product is on the saturation due to associated toxicity and ornamentations. It has been noticed that the discovery and productivity of such molecules in natural scenarios are limited due to low cellular yields as well as less optimized conventional methods. In this respect, microbial cell factories timely fulfilling the requirement of synthesizing bioactive molecules by improving production yield and screening more promising structural homologues of the native molecule. Where the robustness of the microbial host can be potentially achieved by taking advantage of cell engineering approaches such as tuning functional and adjustable factors, metabolic balancing, adapting cellular transcription machinery, applying high throughput OMICs tools, stability of genotype/phenotype, organelle optimizations, genome editing (CRISPER/Cas mediated system) and also by developing accurate model systems via machine-learning tools. In this article, we provide an overview from traditional to recent trends and the application of newly developed technologies, for strengthening the systemic approaches and providing future directions for enhancing the robustness of microbial cell factories to speed up the production of biomolecules for commercial purposes.
Collapse
Affiliation(s)
- Bharat Singh
- Department of Biotechnology and Central Research Cell, MMEC, Maharishi Markandeshwar (Deemed to be University), Mullana-Ambala, India
| | - Ankit Kumar
- TERI-Deakin Nanobiotechnology Centre, TERI Gram, The Energy and Resources Institute, Gurugram, India
| | - Adesh Kumar Saini
- Department of Biotechnology and Central Research Cell, MMEC, Maharishi Markandeshwar (Deemed to be University), Mullana-Ambala, India
| | - Reena Vohra Saini
- Department of Biotechnology and Central Research Cell, MMEC, Maharishi Markandeshwar (Deemed to be University), Mullana-Ambala, India
| | - Rahul Thakur
- Department of Biotechnology and Central Research Cell, MMEC, Maharishi Markandeshwar (Deemed to be University), Mullana-Ambala, India
| | - Shakeel A Mohammed
- Department of Biotechnology and Central Research Cell, MMEC, Maharishi Markandeshwar (Deemed to be University), Mullana-Ambala, India
| | - Hardeep Singh Tuli
- Department of Biotechnology and Central Research Cell, MMEC, Maharishi Markandeshwar (Deemed to be University), Mullana-Ambala, India
| | - Vijai Kumar Gupta
- Biorefining and Advanced Materials Research Centre, Scotland's Rural College (SRUC), Edinburgh, UK
| | - Mohammed Y Areeshi
- Medical Laboratory Technology Department, College of Applied Medical Sciences, Jazan University, Jazan, Saudi Arabia
- Research and Scientific Studies Unit, College of Nursing and Allied Health Sciences, Jazan University, Jazan, Saudi Arabia
| | - Hani Faidah
- Department of Microbiology, Faculty of Medicine, Umm Al-Qura University, Makkah, Saudi Arabia
| | - Naif A Jalal
- Department of Microbiology, Faculty of Medicine, Umm Al-Qura University, Makkah, Saudi Arabia
| | - Shafiul Haque
- Research and Scientific Studies Unit, College of Nursing and Allied Health Sciences, Jazan University, Jazan, Saudi Arabia
- Gilbert and Rose-Marie Chagoury School of Medicine, Lebanese American University, Beirut, Lebanon
- Centre of Medical and Bio-Allied Health Sciences Research, Ajman University, Ajman, United Arab Emirates
| |
Collapse
|
30
|
Yu T, Boob AG, Volk MJ, Liu X, Cui H, Zhao H. Machine learning-enabled retrobiosynthesis of molecules. Nat Catal 2023. [DOI: 10.1038/s41929-022-00909-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/18/2023]
|
31
|
García-Franco A, Godoy P, Duque E, Ramos JL. Insights into the susceptibility of Pseudomonas putida to industrially relevant aromatic hydrocarbons that it can synthesize from sugars. Microb Cell Fact 2023; 22:22. [PMID: 36732770 PMCID: PMC9893694 DOI: 10.1186/s12934-023-02028-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Accepted: 01/21/2023] [Indexed: 02/04/2023] Open
Abstract
Pseudomonas putida DOT-T1E is a highly solvent tolerant strain for which many genetic tools have been developed. The strain represents a promising candidate host for the synthesis of aromatic compounds-opening a path towards a green alternative to petrol-derived chemicals. We have engineered this strain to produce phenylalanine, which can then be used as a raw material for the synthesis of styrene via trans-cinnamic acid. To understand the response of this strain to the bioproducts of interest, we have analyzed the in-depth physiological and genetic response of the strain to these compounds. We found that in response to the exposure to the toxic compounds that the strain can produce, the cell launches a multifactorial response to enhance membrane impermeabilization. This process occurs via the activation of a cis to trans isomerase that converts cis unsaturated fatty acids to their corresponding trans isomers. In addition, the bacterial cells initiate a stress response program that involves the synthesis of a number of chaperones and ROS removing enzymes, such as peroxidases and superoxide dismutases. The strain also responds by enhancing the metabolism of glucose through the specific induction of the glucose phosphorylative pathway, Entner-Doudoroff enzymes, Krebs cycle enzymes and Nuo. In step with these changes, the cells induce two efflux pumps to extrude the toxic chemicals. Through analyzing a wide collection of efflux pump mutants, we found that the most relevant pump is TtgGHI, which is controlled by the TtgV regulator.
Collapse
Affiliation(s)
- Ana García-Franco
- Estación Experimental del Zaidín. Consejo Superior de Investigaciones Científicas, c/Profesor Albareda nº 1, 18008, Granada, Spain
| | - Patricia Godoy
- Estación Experimental del Zaidín. Consejo Superior de Investigaciones Científicas, c/Profesor Albareda nº 1, 18008, Granada, Spain
| | - Estrella Duque
- Estación Experimental del Zaidín. Consejo Superior de Investigaciones Científicas, c/Profesor Albareda nº 1, 18008, Granada, Spain
| | - Juan Luis Ramos
- Estación Experimental del Zaidín. Consejo Superior de Investigaciones Científicas, c/Profesor Albareda nº 1, 18008, Granada, Spain.
| |
Collapse
|
32
|
Kang CK, Shin J, Cha Y, Kim MS, Choi MS, Kim T, Park YK, Choi YJ. Machine learning-guided prediction of potential engineering targets for microbial production of lycopene. BIORESOURCE TECHNOLOGY 2023; 369:128455. [PMID: 36503092 DOI: 10.1016/j.biortech.2022.128455] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Revised: 12/02/2022] [Accepted: 12/04/2022] [Indexed: 06/17/2023]
Abstract
The process of designing streamlined workflows for developing microbial strains using classical methods from vast amounts of biological big data has reached its limits. With the continuous increase in the amount of biological big data, data-driven machine learning approaches are being used to overcome the limits of classical approaches for strain development. Here, machine learning-guided engineering of Deinococcus radiodurans R1 for high-yield production of lycopene was demonstrated. The multilayer perceptron models were first trained using the mRNA expression levels of the key genes along with lycopene titers and yields obtained from 17 strains. Then, the potential overexpression targets from 2,047 possible combinations were predicted by the multilayer perceptron combined with a genetic algorithm. Through the machine learning-aided fine-tuning of the predicted genes, the final-engineered LY04 strain resulted in an 8-fold increase in the lycopene production, up to 1.25 g/L from glycerol, and a 6-fold increase in the lycopene yield.
Collapse
Affiliation(s)
- Chang Keun Kang
- School of Environmental Engineering, University of Seoul, Seoul 02504, Republic of Korea
| | - Jihoon Shin
- School of Environmental Engineering, University of Seoul, Seoul 02504, Republic of Korea
| | - YoonKyung Cha
- School of Environmental Engineering, University of Seoul, Seoul 02504, Republic of Korea
| | - Min Sun Kim
- School of Environmental Engineering, University of Seoul, Seoul 02504, Republic of Korea
| | - Min Sun Choi
- School of Environmental Engineering, University of Seoul, Seoul 02504, Republic of Korea
| | - TaeHo Kim
- School of Environmental Engineering, University of Seoul, Seoul 02504, Republic of Korea
| | - Young-Kwon Park
- School of Environmental Engineering, University of Seoul, Seoul 02504, Republic of Korea
| | - Yong Jun Choi
- School of Environmental Engineering, University of Seoul, Seoul 02504, Republic of Korea.
| |
Collapse
|
33
|
Deng K, Wang X, Ing N, Opgenorth P, de Raad M, Kim J, Simmons BA, Adams PD, Singh AK, Lee TS, Northen TR. Rapid quantification of alcohol production in microorganisms based on nanostructure-initiator mass spectrometry (NIMS). Anal Biochem 2023; 662:114997. [PMID: 36435200 DOI: 10.1016/j.ab.2022.114997] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2022] [Revised: 11/17/2022] [Accepted: 11/18/2022] [Indexed: 11/25/2022]
Abstract
We described a mass spectrometry-based assay to rapidly quantify the production of primary alcohols directly from cell cultures. This novel assay used the combination of TEMPO-based oxidation chemistry and oxime ligation, followed by product analysis based on Nanostructure-Initiator Mass Spectrometry. This assay enables quantitative monitor both C5 to C18 alcohols as well as glucose and gluconate in the growth medium to support strain characterization and optimization. We find that this assay yields similar results to gas chromatography for isoprenol production but required much less acquisition time per sample. We applied this assay to gain new insights into P. Putida's utilization of alcohols and find that this strain largely could not grow on heptanol and octanol.
Collapse
Affiliation(s)
- Kai Deng
- Joint BioEnergy Institute, Emeryville, CA, 94608, USA; Sandia National Laboratories, Livermore, CA, 94551, USA.
| | - Xi Wang
- Joint BioEnergy Institute, Emeryville, CA, 94608, USA; Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Nicole Ing
- Joint BioEnergy Institute, Emeryville, CA, 94608, USA; Sandia National Laboratories, Livermore, CA, 94551, USA
| | - Paul Opgenorth
- Joint BioEnergy Institute, Emeryville, CA, 94608, USA; Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Markus de Raad
- Joint BioEnergy Institute, Emeryville, CA, 94608, USA; Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Jinho Kim
- Joint BioEnergy Institute, Emeryville, CA, 94608, USA; Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Blake A Simmons
- Joint BioEnergy Institute, Emeryville, CA, 94608, USA; Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Paul D Adams
- Joint BioEnergy Institute, Emeryville, CA, 94608, USA; Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; University of California, Berkeley, CA, 94720, USA
| | - Anup K Singh
- Joint BioEnergy Institute, Emeryville, CA, 94608, USA; Lawrence Livermore National Laboratory, Livermore, 94550, USA
| | - Taek Soon Lee
- Joint BioEnergy Institute, Emeryville, CA, 94608, USA; Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Trent R Northen
- Joint BioEnergy Institute, Emeryville, CA, 94608, USA; Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA.
| |
Collapse
|
34
|
Zha J, Zhao Z, Xiao Z, Eng T, Mukhopadhyay A, Koffas MA, Tang YJ. Biosystem design of Corynebacterium glutamicum for bioproduction. Curr Opin Biotechnol 2023; 79:102870. [PMID: 36549106 DOI: 10.1016/j.copbio.2022.102870] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2022] [Revised: 11/13/2022] [Accepted: 11/24/2022] [Indexed: 12/24/2022]
Abstract
Corynebacterium glutamicum, a natural glutamate-producing bacterium adopted for industrial production of amino acids, has been extensively explored recently for high-level biosynthesis of amino acid derivatives, bulk chemicals such as organic acids and short-chain alcohols, aromatics, and natural products, including polyphenols and terpenoids. Here, we review the recent advances with a focus on biosystem design principles, metabolic characterization and modeling, omics analysis, utilization of nonmodel feedstock, emerging CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) tools for Corynebacterium strain engineering, biosensors, and novel strains of C. glutamicum. Future research directions for developing C. glutamicum cell factories are also discussed.
Collapse
Affiliation(s)
- Jian Zha
- School of Food and Biological Engineering, Shaanxi University of Science and Technology, Xi'an, Shaanxi 710021, China
| | - Zhen Zhao
- School of Food and Biological Engineering, Shaanxi University of Science and Technology, Xi'an, Shaanxi 710021, China
| | - Zhengyang Xiao
- Department of Energy, Environmental and Chemical Engineering, Washington University in Saint Louis, MO 63130, USA
| | - Thomas Eng
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Aindrila Mukhopadhyay
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Mattheos Ag Koffas
- Department of Chemical and Biological Engineering, Rensselaer Polytechnic Institute, Troy, NY 12180, USA; Center for Biotechnology and Interdisciplinary Studies, Rensselaer Polytechnic Institute, Troy, NY 12180, USA
| | - Yinjie J Tang
- Department of Energy, Environmental and Chemical Engineering, Washington University in Saint Louis, MO 63130, USA.
| |
Collapse
|
35
|
Patra P, B R D, Kundu P, Das M, Ghosh A. Recent advances in machine learning applications in metabolic engineering. Biotechnol Adv 2023; 62:108069. [PMID: 36442697 DOI: 10.1016/j.biotechadv.2022.108069] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2022] [Revised: 10/18/2022] [Accepted: 11/22/2022] [Indexed: 11/27/2022]
Abstract
Metabolic engineering encompasses several widely-used strategies, which currently hold a high seat in the field of biotechnology when its potential is manifesting through a plethora of research and commercial products with a strong societal impact. The genomic revolution that occurred almost three decades ago has initiated the generation of large omics-datasets which has helped in gaining a better understanding of cellular behavior. The itinerary of metabolic engineering that has occurred based on these large datasets has allowed researchers to gain detailed insights and a reasonable understanding of the intricacies of biosystems. However, the existing trail-and-error approaches for metabolic engineering are laborious and time-intensive when it comes to the production of target compounds with high yields through genetic manipulations in host organisms. Machine learning (ML) coupled with the available metabolic engineering test instances and omics data brings a comprehensive and multidisciplinary approach that enables scientists to evaluate various parameters for effective strain design. This vast amount of biological data should be standardized through knowledge engineering to train different ML models for providing accurate predictions in gene circuits designing, modification of proteins, optimization of bioprocess parameters for scaling up, and screening of hyper-producing robust cell factories. This review briefs on the premise of ML, followed by mentioning various ML methods and algorithms alongside the numerous omics datasets available to train ML models for predicting metabolic outcomes with high-accuracy. The combinative interplay between the ML algorithms and biological datasets through knowledge engineering have guided the recent advancements in applications such as CRISPR/Cas systems, gene circuits, protein engineering, metabolic pathway reconstruction, and bioprocess engineering. Finally, this review addresses the probable challenges of applying ML in metabolic engineering which will guide the researchers toward novel techniques to overcome the limitations.
Collapse
Affiliation(s)
- Pradipta Patra
- School School of Energy Science and Engineering, Indian Institute of Technology Kharagpur, West Bengal 721302, India
| | - Disha B R
- B.M.S College of Engineering, Basavanagudi, Bengaluru, Karnataka 560019, India
| | - Pritam Kundu
- School School of Energy Science and Engineering, Indian Institute of Technology Kharagpur, West Bengal 721302, India
| | - Manali Das
- School of Bioscience, Indian Institute of Technology Kharagpur, West Bengal 721302, India
| | - Amit Ghosh
- School School of Energy Science and Engineering, Indian Institute of Technology Kharagpur, West Bengal 721302, India; P.K. Sinha Centre for Bioenergy and Renewables, Indian Institute of Technology Kharagpur, West Bengal 721302, India.
| |
Collapse
|
36
|
Volk MJ, Tran VG, Tan SI, Mishra S, Fatma Z, Boob A, Li H, Xue P, Martin TA, Zhao H. Metabolic Engineering: Methodologies and Applications. Chem Rev 2022; 123:5521-5570. [PMID: 36584306 DOI: 10.1021/acs.chemrev.2c00403] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Metabolic engineering aims to improve the production of economically valuable molecules through the genetic manipulation of microbial metabolism. While the discipline is a little over 30 years old, advancements in metabolic engineering have given way to industrial-level molecule production benefitting multiple industries such as chemical, agriculture, food, pharmaceutical, and energy industries. This review describes the design, build, test, and learn steps necessary for leading a successful metabolic engineering campaign. Moreover, we highlight major applications of metabolic engineering, including synthesizing chemicals and fuels, broadening substrate utilization, and improving host robustness with a focus on specific case studies. Finally, we conclude with a discussion on perspectives and future challenges related to metabolic engineering.
Collapse
Affiliation(s)
- Michael J Volk
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Vinh G Tran
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Shih-I Tan
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Department of Chemical Engineering, National Cheng Kung University, Tainan 70101, Taiwan
| | - Shekhar Mishra
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Zia Fatma
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Aashutosh Boob
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Hongxiang Li
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Pu Xue
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Teresa A Martin
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Huimin Zhao
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,DOE Center for Advanced Bioenergy and Bioproducts Innovation, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| |
Collapse
|
37
|
Iman MN, Herawati E, Fukusaki E, Putri SP. Metabolomics-driven strain improvement: A mini review. Front Mol Biosci 2022; 9:1057709. [PMID: 36438656 PMCID: PMC9681786 DOI: 10.3389/fmolb.2022.1057709] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Accepted: 10/28/2022] [Indexed: 07/22/2023] Open
Abstract
In recent years, mass spectrometry-based metabolomics has been established as a powerful and versatile technique for studying cellular metabolism by comprehensive analysis of metabolites in the cell. Although there are many scientific reports on the use of metabolomics for the elucidation of mechanism and physiological changes occurring in the cell, there are surprisingly very few reports on its use for the identification of rate-limiting steps in a synthetic biological system that can lead to the actual improvement of the host organism. In this mini review, we discuss different strategies for improving strain performance using metabolomics data and compare the application of metabolomics-driven strain improvement techniques in different host microorganisms. Finally, we highlight several success stories on the use of metabolomics-driven strain improvement strategies, which led to significant bioproductivity improvements.
Collapse
Affiliation(s)
- Marvin Nathanael Iman
- Department of Biotechnology, Graduate School of Engineering, Osaka University, Osaka, Japan
| | - Elisa Herawati
- Department of Biology, Faculty of Mathematics and Natural Sciences, Universitas Sebelas Maret, Surakarta, Indonesia
| | - Eiichiro Fukusaki
- Department of Biotechnology, Graduate School of Engineering, Osaka University, Osaka, Japan
| | - Sastia Prama Putri
- Department of Biotechnology, Graduate School of Engineering, Osaka University, Osaka, Japan
| |
Collapse
|
38
|
DISA tool: Discriminative and informative subspace assessment with categorical and numerical outcomes. PLoS One 2022; 17:e0276253. [PMID: 36260602 PMCID: PMC9581374 DOI: 10.1371/journal.pone.0276253] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Accepted: 10/03/2022] [Indexed: 11/19/2022] Open
Abstract
Pattern discovery and subspace clustering play a central role in the biological domain, supporting for instance putative regulatory module discovery from omics data for both descriptive and predictive ends. In the presence of target variables (e.g. phenotypes), regulatory patterns should further satisfy delineate discriminative power properties, well-established in the presence of categorical outcomes, yet largely disregarded for numerical outcomes, such as risk profiles and quantitative phenotypes. DISA (Discriminative and Informative Subspace Assessment), a Python software package, is proposed to evaluate patterns in the presence of numerical outcomes using well-established measures together with a novel principle able to statistically assess the correlation gain of the subspace against the overall space. Results confirm the possibility to soundly extend discriminative criteria towards numerical outcomes without the drawbacks well-associated with discretization procedures. Results from four case studies confirm the validity and relevance of the proposed methods, further unveiling critical directions for research on biotechnology and biomedicine. Availability: DISA is freely available at https://github.com/JupitersMight/DISA under the MIT license.
Collapse
|
39
|
Yeo HC, Selvarajoo K. Machine learning alternative to systems biology should not solely depend on data. Brief Bioinform 2022; 23:6731718. [PMID: 36184188 PMCID: PMC9677488 DOI: 10.1093/bib/bbac436] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Revised: 08/24/2022] [Accepted: 09/09/2022] [Indexed: 12/14/2022] Open
Abstract
In recent years, artificial intelligence (AI)/machine learning has emerged as a plausible alternative to systems biology for the elucidation of biological phenomena and in attaining specified design objective in synthetic biology. Although considered highly disruptive with numerous notable successes so far, we seek to bring attention to both the fundamental and practical pitfalls of their usage, especially in illuminating emergent behaviors from chaotic or stochastic systems in biology. Without deliberating on their suitability and the required data qualities and pre-processing approaches beforehand, the research and development community could experience similar 'AI winters' that had plagued other fields. Instead, we anticipate the integration or combination of the two approaches, where appropriate, moving forward.
Collapse
Affiliation(s)
- Hock Chuan Yeo
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), Singapore
| | | |
Collapse
|
40
|
Boiko DA, Kozlov KS, Burykina JV, Ilyushenkova VV, Ananikov VP. Fully Automated Unconstrained Analysis of High-Resolution Mass Spectrometry Data with Machine Learning. J Am Chem Soc 2022; 144:14590-14606. [PMID: 35939718 DOI: 10.1021/jacs.2c03631] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Mass spectrometry (MS) is a convenient, highly sensitive, and reliable method for the analysis of complex mixtures, which is vital for materials science, life sciences fields such as metabolomics and proteomics, and mechanistic research in chemistry. Although it is one of the most powerful methods for individual compound detection, complete signal assignment in complex mixtures is still a great challenge. The unconstrained formula-generating algorithm, covering the entire spectra and revealing components, is a "dream tool" for researchers. We present the framework for efficient MS data interpretation, describing a novel approach for detailed analysis based on deisotoping performed by gradient-boosted decision trees and a neural network that generates molecular formulas from the fine isotopic structure, approaching the long-standing inverse spectral problem. The methods were successfully tested on three examples: fragment ion analysis in protein sequencing for proteomics, analysis of the natural samples for life sciences, and study of the cross-coupling catalytic system for chemistry.
Collapse
Affiliation(s)
- Daniil A Boiko
- Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Leninsky Prospekt 47, Moscow 119991, Russia
| | - Konstantin S Kozlov
- Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Leninsky Prospekt 47, Moscow 119991, Russia
| | - Julia V Burykina
- Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Leninsky Prospekt 47, Moscow 119991, Russia
| | - Valentina V Ilyushenkova
- Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Leninsky Prospekt 47, Moscow 119991, Russia
| | - Valentine P Ananikov
- Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Leninsky Prospekt 47, Moscow 119991, Russia
| |
Collapse
|
41
|
Shi Z, Liu P, Liao X, Mao Z, Zhang J, Wang Q, Sun J, Ma H, Ma Y. Data-Driven Synthetic Cell Factories Development for Industrial Biomanufacturing. BIODESIGN RESEARCH 2022; 2022:9898461. [PMID: 37850146 PMCID: PMC10521697 DOI: 10.34133/2022/9898461] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Accepted: 05/26/2022] [Indexed: 10/19/2023] Open
Abstract
Revolutionary breakthroughs in artificial intelligence (AI) and machine learning (ML) have had a profound impact on a wide range of scientific disciplines, including the development of artificial cell factories for biomanufacturing. In this paper, we review the latest studies on the application of data-driven methods for the design of new proteins, pathways, and strains. We first briefly introduce the various types of data and databases relevant to industrial biomanufacturing, which are the basis for data-driven research. Different types of algorithms, including traditional ML and more recent deep learning methods, are also presented. We then demonstrate how these data-based approaches can be applied to address various issues in cell factory development using examples from recent studies, including the prediction of protein function, improvement of metabolic models, and estimation of missing kinetic parameters, design of non-natural biosynthesis pathways, and pathway optimization. In the last section, we discuss the current limitations of these data-driven approaches and propose that data-driven methods should be integrated with mechanistic models to complement each other and facilitate the development of synthetic strains for industrial biomanufacturing.
Collapse
Affiliation(s)
- Zhenkun Shi
- Key Laboratory of Systems Microbial Technology, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
- National Technology Innovation Center of Synthetic Biology, Tianjin 300308China
| | - Pi Liu
- Key Laboratory of Systems Microbial Technology, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
- National Technology Innovation Center of Synthetic Biology, Tianjin 300308China
| | - Xiaoping Liao
- Key Laboratory of Systems Microbial Technology, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
- National Technology Innovation Center of Synthetic Biology, Tianjin 300308China
| | - Zhitao Mao
- Key Laboratory of Systems Microbial Technology, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
- National Technology Innovation Center of Synthetic Biology, Tianjin 300308China
| | - Jianqi Zhang
- Key Laboratory of Systems Microbial Technology, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
- National Technology Innovation Center of Synthetic Biology, Tianjin 300308China
| | - Qinhong Wang
- Key Laboratory of Systems Microbial Technology, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
- National Technology Innovation Center of Synthetic Biology, Tianjin 300308China
| | - Jibin Sun
- Key Laboratory of Systems Microbial Technology, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
- National Technology Innovation Center of Synthetic Biology, Tianjin 300308China
| | - Hongwu Ma
- Key Laboratory of Systems Microbial Technology, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
- National Technology Innovation Center of Synthetic Biology, Tianjin 300308China
| | - Yanhe Ma
- Key Laboratory of Systems Microbial Technology, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
- National Technology Innovation Center of Synthetic Biology, Tianjin 300308China
| |
Collapse
|
42
|
Zhang M, Holowko MB, Hayman Zumpe H, Ong CS. Machine Learning Guided Batched Design of a Bacterial Ribosome Binding Site. ACS Synth Biol 2022; 11:2314-2326. [PMID: 35704784 PMCID: PMC9295160 DOI: 10.1021/acssynbio.2c00015] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Optimization of gene expression levels is an essential part of the organism design process. Fine control of this process can be achieved by engineering transcription and translation control elements, including the ribosome binding site (RBS). Unfortunately, the design of specific genetic parts remains challenging because of the lack of reliable design methods. To address this problem, we have created a machine learning guided Design-Build-Test-Learn (DBTL) cycle for the experimental design of bacterial RBSs to demonstrate how small genetic parts can be reliably designed using relatively small, high-quality data sets. We used Gaussian Process Regression for the Learn phase of the cycle and the Upper Confidence Bound multiarmed bandit algorithm for the Design of genetic variants to be tested in vivo. We have integrated these machine learning algorithms with laboratory automation and high-throughput processes for reliable data generation. Notably, by Testing a total of 450 RBS variants in four DBTL cycles, we have experimentally validated RBSs with high translation initiation rates equaling or exceeding our benchmark RBS by up to 34%. Overall, our results show that machine learning is a powerful tool for designing RBSs, and they pave the way toward more complicated genetic devices.
Collapse
Affiliation(s)
- Mengyan Zhang
- Machine Learning and Artificial Intelligence Future Science Platform, CSIRO, Canberra, ACT 2601, Australia.,Department of Computer Science, Australian National University, Canberra, ACT 2601, Australia.,Data61, CSIRO, Canberra, ACT 2601, Australia
| | - Maciej Bartosz Holowko
- Synthetic Biology Future Science Platform, CSIRO, Canberra, ACT 2601, Australia.,Land and Water, CSIRO, Canberra, ACT 2601, Australia
| | - Huw Hayman Zumpe
- Synthetic Biology Future Science Platform, CSIRO, Canberra, ACT 2601, Australia.,Land and Water, CSIRO, Canberra, ACT 2601, Australia
| | - Cheng Soon Ong
- Machine Learning and Artificial Intelligence Future Science Platform, CSIRO, Canberra, ACT 2601, Australia.,Department of Computer Science, Australian National University, Canberra, ACT 2601, Australia.,Data61, CSIRO, Canberra, ACT 2601, Australia
| |
Collapse
|
43
|
Liang B, Xu D, Yu N, Xu Y, Ma X, Liu Q, Asif MS, Yan R, Liu M. Physics-Guided Neural-Network-Based Inverse Design of a Photonic -Plasmonic Nanodevice for Superfocusing. ACS APPLIED MATERIALS & INTERFACES 2022; 14:27397-27404. [PMID: 35649169 DOI: 10.1021/acsami.2c05083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Controlling the nanoscale light-matter interaction using superfocusing hybrid photonic-plasmonic devices has attracted significant research interest in tackling existing challenges, including converting efficiencies, working bandwidths, and manufacturing complexities. With the growth in demand for efficient photonic-plasmonic input-output interfaces to improve plasmonic device performances, sophisticated designs with multiple optimization parameters are required, which comes with an unaffordable computation cost. Machine learning methods can significantly reduce the cost of computations compared to numerical simulations, but the input-output dimension mismatch remains a challenging problem. Here, we introduce a physics-guided two-stage machine learning network that uses the improved coupled-mode theory for optical waveguides to guide the learning module and improve the accuracy of predictive engines to 98.5%. A near-unity coupling efficiency with symmetry-breaking selectivity is predicted by the inverse design. By fabricating photonic-plasmonic couplers using the predicted profiles, we demonstrate that the excitation efficiency of 83% on the radially polarized surface plasmon mode can be achieved, which paves the way for super-resolution optical imaging.
Collapse
Affiliation(s)
- Boqun Liang
- Materials Science and Engineering program, University of California─Riverside, Riverside, California 92521, United States
| | - Da Xu
- Department of Electrical and Computer Engineering, University of California─Riverside, Riverside, California 92521, United States
| | - Ning Yu
- Department of Chemical and Environmental Engineering, University of California─Riverside, Riverside, California 92521, United States
| | - Yaodong Xu
- Materials Science and Engineering program, University of California─Riverside, Riverside, California 92521, United States
| | - Xuezhi Ma
- Department of Electrical and Computer Engineering, University of California─Riverside, Riverside, California 92521, United States
| | - Qiushi Liu
- Department of Electrical and Computer Engineering, University of California─Riverside, Riverside, California 92521, United States
| | - M Salman Asif
- Department of Electrical and Computer Engineering, University of California─Riverside, Riverside, California 92521, United States
- Department of Computer Science and Engineering, University of California─Riverside, Riverside, California 92521, United States
| | - Ruoxue Yan
- Materials Science and Engineering program, University of California─Riverside, Riverside, California 92521, United States
- Department of Chemical and Environmental Engineering, University of California─Riverside, Riverside, California 92521, United States
| | - Ming Liu
- Materials Science and Engineering program, University of California─Riverside, Riverside, California 92521, United States
- Department of Electrical and Computer Engineering, University of California─Riverside, Riverside, California 92521, United States
| |
Collapse
|
44
|
Liao X, Ma H, Tang YJ. Artificial intelligence: a solution to involution of design–build–test–learn cycle. Curr Opin Biotechnol 2022; 75:102712. [DOI: 10.1016/j.copbio.2022.102712] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Revised: 02/05/2022] [Accepted: 03/01/2022] [Indexed: 01/08/2023]
|
45
|
León-Buitimea A, Balderas-Cisneros FDJ, Garza-Cárdenas CR, Garza-Cervantes JA, Morones-Ramírez JR. Synthetic Biology Tools for Engineering Microbial Cells to Fight Superbugs. Front Bioeng Biotechnol 2022; 10:869206. [PMID: 35600895 PMCID: PMC9114757 DOI: 10.3389/fbioe.2022.869206] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2022] [Accepted: 04/18/2022] [Indexed: 11/23/2022] Open
Abstract
With the increase in clinical cases of bacterial infections with multiple antibiotic resistance, the world has entered a health crisis. Overuse, inappropriate prescribing, and lack of innovation of antibiotics have contributed to the surge of microorganisms that can overcome traditional antimicrobial treatments. In 2017, the World Health Organization published a list of pathogenic bacteria, including Enterococcus faecium, Staphylococcus aureus, Klebsiella pneumoniae, Acinetobacter baumannii, Pseudomonas aeruginosa, and Escherichia coli (ESKAPE). These bacteria can adapt to multiple antibiotics and transfer their resistance to other organisms; therefore, studies to find new therapeutic strategies are needed. One of these strategies is synthetic biology geared toward developing new antimicrobial therapies. Synthetic biology is founded on a solid and well-established theoretical framework that provides tools for conceptualizing, designing, and constructing synthetic biological systems. Recent developments in synthetic biology provide tools for engineering synthetic control systems in microbial cells. Applying protein engineering, DNA synthesis, and in silico design allows building metabolic pathways and biological circuits to control cellular behavior. Thus, synthetic biology advances have permitted the construction of communication systems between microorganisms where exogenous molecules can control specific population behaviors, induce intracellular signaling, and establish co-dependent networks of microorganisms.
Collapse
Affiliation(s)
- Angel León-Buitimea
- Facultad de Ciencias Químicas, Universidad Autónoma de Nuevo León (UANL), San Nicolás de los Garza, Mexico
- Centro de Investigación en Biotecnología y Nanotecnología, Facultad de Ciencias Químicas, Parque de Investigación e Innovación Tecnológica, Universidad Autónoma de Nuevo León, Apodaca, Mexico
| | - Francisco de Jesús Balderas-Cisneros
- Facultad de Ciencias Químicas, Universidad Autónoma de Nuevo León (UANL), San Nicolás de los Garza, Mexico
- Centro de Investigación en Biotecnología y Nanotecnología, Facultad de Ciencias Químicas, Parque de Investigación e Innovación Tecnológica, Universidad Autónoma de Nuevo León, Apodaca, Mexico
| | - César Rodolfo Garza-Cárdenas
- Facultad de Ciencias Químicas, Universidad Autónoma de Nuevo León (UANL), San Nicolás de los Garza, Mexico
- Centro de Investigación en Biotecnología y Nanotecnología, Facultad de Ciencias Químicas, Parque de Investigación e Innovación Tecnológica, Universidad Autónoma de Nuevo León, Apodaca, Mexico
| | - Javier Alberto Garza-Cervantes
- Facultad de Ciencias Químicas, Universidad Autónoma de Nuevo León (UANL), San Nicolás de los Garza, Mexico
- Centro de Investigación en Biotecnología y Nanotecnología, Facultad de Ciencias Químicas, Parque de Investigación e Innovación Tecnológica, Universidad Autónoma de Nuevo León, Apodaca, Mexico
| | - José Rubén Morones-Ramírez
- Facultad de Ciencias Químicas, Universidad Autónoma de Nuevo León (UANL), San Nicolás de los Garza, Mexico
- Centro de Investigación en Biotecnología y Nanotecnología, Facultad de Ciencias Químicas, Parque de Investigación e Innovación Tecnológica, Universidad Autónoma de Nuevo León, Apodaca, Mexico
- *Correspondence: José Rubén Morones-Ramírez,
| |
Collapse
|
46
|
Bonturi N, Pinheiro MJ, Monteiro de Oliveira P, Rusadze E, Eichinger T, Liudžiūtė G, De Biaggi JS, Brauer A, Remm M, Miranda EA, Ledesma-Amaro R, Lahtvee PJ. Development of a dedicated Golden Gate Assembly Platform (RtGGA) for Rhodotorula toruloides. Metab Eng Commun 2022; 15:e00200. [PMID: 35662893 PMCID: PMC9157227 DOI: 10.1016/j.mec.2022.e00200] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2022] [Revised: 05/13/2022] [Accepted: 05/15/2022] [Indexed: 10/29/2022] Open
|
47
|
Gurdo N, Volke DC, Nikel PI. Merging automation and fundamental discovery into the design–build–test–learn cycle of nontraditional microbes. Trends Biotechnol 2022; 40:1148-1159. [DOI: 10.1016/j.tibtech.2022.03.004] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2021] [Revised: 03/12/2022] [Accepted: 03/16/2022] [Indexed: 12/29/2022]
|
48
|
Casas A, Bultelle M, Motraghi C, Kitney R. Removing the Bottleneck: Introducing cMatch - A Lightweight Tool for Construct-Matching in Synthetic Biology. Front Bioeng Biotechnol 2022; 9:785131. [PMID: 35083201 PMCID: PMC8784771 DOI: 10.3389/fbioe.2021.785131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2021] [Accepted: 12/14/2021] [Indexed: 11/30/2022] Open
Abstract
We present a software tool, called cMatch, to reconstruct and identify synthetic genetic constructs from their sequences, or a set of sub-sequences—based on two practical pieces of information: their modular structure, and libraries of components. Although developed for combinatorial pathway engineering problems and addressing their quality control (QC) bottleneck, cMatch is not restricted to these applications. QC takes place post assembly, transformation and growth. It has a simple goal, to verify that the genetic material contained in a cell matches what was intended to be built - and when it is not the case, to locate the discrepancies and estimate their severity. In terms of reproducibility/reliability, the QC step is crucial. Failure at this step requires repetition of the construction and/or sequencing steps. When performed manually or semi-manually QC is an extremely time-consuming, error prone process, which scales very poorly with the number of constructs and their complexity. To make QC frictionless and more reliable, cMatch performs an operation we have called “construct-matching” and automates it. Construct-matching is more thorough than simple sequence-matching, as it matches at the functional level-and quantifies the matching at the individual component level and across the whole construct. Two algorithms (called CM_1 and CM_2) are presented. They differ according to the nature of their inputs. CM_1 is the core algorithm for construct-matching and is to be used when input sequences are long enough to cover constructs in their entirety (e.g., obtained with methods such as next generation sequencing). CM_2 is an extension designed to deal with shorter data (e.g., obtained with Sanger sequencing), and that need recombining. Both algorithms are shown to yield accurate construct-matching in a few minutes (even on hardware with limited processing power), together with a set of metrics that can be used to improve the robustness of the decision-making process. To ensure reliability and reproducibility, cMatch builds on the highly validated pairwise-matching Smith-Waterman algorithm. All the tests presented have been conducted on synthetic data for challenging, yet realistic constructs - and on real data gathered during studies on a metabolic engineering example (lycopene production).
Collapse
Affiliation(s)
- Alexis Casas
- Department of Bioengineering, Imperial College London, London, United Kingdom
| | - Matthieu Bultelle
- Department of Bioengineering, Imperial College London, London, United Kingdom
| | - Charles Motraghi
- Department of Bioengineering, Imperial College London, London, United Kingdom
| | - Richard Kitney
- Department of Bioengineering, Imperial College London, London, United Kingdom
| |
Collapse
|
49
|
Tellechea-Luzardo J, Otero-Muras I, Goñi-Moreno A, Carbonell P. Fast biofoundries: coping with the challenges of biomanufacturing. Trends Biotechnol 2022; 40:831-842. [PMID: 35012773 DOI: 10.1016/j.tibtech.2021.12.006] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Revised: 12/13/2021] [Accepted: 12/13/2021] [Indexed: 11/16/2022]
Abstract
Biofoundries are highly automated facilities that enable the rapid and efficient design, build, test, and learn cycle of biomanufacturing and engineering biology, which is applicable to both research and industrial production. However, developing a biofoundry platform can be expensive and time consuming. A biofoundry should grow organically, starting from a basic platform but with a vision for automation, equipment interoperability, and efficiency. By thinking about strategies early in the process through process planning, simulation, and optimization, bottlenecks can be identified and resolved. Here, we provide a survey of technological solutions in biofoundries and their advantages and limitations. We explore possible pathways towards the creation of a functional, early-phase biofoundry, and strategies towards long-term sustainability.
Collapse
Affiliation(s)
- Jonathan Tellechea-Luzardo
- Institute of Industrial Control Systems and Computing (AI2), Universitat Politécnica de València (UPV), 46022 València, Spain
| | - Irene Otero-Muras
- Institute for Integrative Systems Biology I2SysBio, Universitat de València-CSIC, Catedrático Agustín Escardino Benlloch 9, Paterna, 46980 València, Spain
| | - Angel Goñi-Moreno
- Centro de Biotecnología y Genómica de Plantas, Universidad Politécnica de Madrid, Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria, Pozuelo de Alarcón, 28223 Madrid, Spain
| | - Pablo Carbonell
- Institute of Industrial Control Systems and Computing (AI2), Universitat Politécnica de València (UPV), 46022 València, Spain.
| |
Collapse
|
50
|
Cadet XF, Gelly JC, van Noord A, Cadet F, Acevedo-Rocha CG. Learning Strategies in Protein Directed Evolution. Methods Mol Biol 2022; 2461:225-275. [PMID: 35727454 DOI: 10.1007/978-1-0716-2152-3_15] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Synthetic biology is a fast-evolving research field that combines biology and engineering principles to develop new biological systems for medical, pharmacological, and industrial applications. Synthetic biologists use iterative "design, build, test, and learn" cycles to efficiently engineer genetic systems that are reliable, reproducible, and predictable. Protein engineering by directed evolution can benefit from such a systematic engineering approach for various reasons. Learning can be carried out before starting, throughout or after finalizing a directed evolution project. Computational tools, bioinformatics, and scanning mutagenesis methods can be excellent starting points, while molecular dynamics simulations and other strategies can guide engineering efforts. Similarly, studying protein intermediates along evolutionary pathways offers fascinating insights into the molecular mechanisms shaped by evolution. The learning step of the cycle is not only crucial for proteins or enzymes that are not suitable for high-throughput screening or selection systems, but it is also valuable for any platform that can generate a large amount of data that can be aided by machine learning algorithms. The main challenge in protein engineering is to predict the effect of a single mutation on one functional parameter-to say nothing of several mutations on multiple parameters. This is largely due to nonadditive mutational interactions, known as epistatic effects-beneficial mutations present in a genetic background may not be beneficial in another genetic background. In this work, we provide an overview of experimental and computational strategies that can guide the user to learn protein function at different stages in a directed evolution project. We also discuss how epistatic effects can influence the success of directed evolution projects. Since machine learning is gaining momentum in protein engineering and the field is becoming more interdisciplinary thanks to collaboration between mathematicians, computational scientists, engineers, molecular biologists, and chemists, we provide a general workflow that familiarizes nonexperts with the basic concepts, dataset requirements, learning approaches, model capabilities and performance metrics of this intriguing area. Finally, we also provide some practical recommendations on how machine learning can harness epistatic effects for engineering proteins in an "outside-the-box" way.
Collapse
Affiliation(s)
- Xavier F Cadet
- PEACCEL, Artificial Intelligence Department, Paris, France
| | - Jean Christophe Gelly
- Laboratoire d'Excellence GR-Ex, Paris, France
- BIGR, DSIMB, UMR_S1134, INSERM, University of Paris & University of Reunion, Paris, France
| | | | - Frédéric Cadet
- Laboratoire d'Excellence GR-Ex, Paris, France
- BIGR, DSIMB, UMR_S1134, INSERM, University of Paris & University of Reunion, Paris, France
| | | |
Collapse
|