1
|
Ding N, Yuan Z, Ma Z, Wu Y, Yin L. AI-Assisted Rational Design and Activity Prediction of Biological Elements for Optimizing Transcription-Factor-Based Biosensors. Molecules 2024; 29:3512. [PMID: 39124917 PMCID: PMC11313831 DOI: 10.3390/molecules29153512] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2024] [Revised: 07/22/2024] [Accepted: 07/24/2024] [Indexed: 08/12/2024] Open
Abstract
The rational design, activity prediction, and adaptive application of biological elements (bio-elements) are crucial research fields in synthetic biology. Currently, a major challenge in the field is efficiently designing desired bio-elements and accurately predicting their activity using vast datasets. The advancement of artificial intelligence (AI) technology has enabled machine learning and deep learning algorithms to excel in uncovering patterns in bio-element data and predicting their performance. This review explores the application of AI algorithms in the rational design of bio-elements, activity prediction, and the regulation of transcription-factor-based biosensor response performance using AI-designed elements. We discuss the advantages, adaptability, and biological challenges addressed by the AI algorithms in various applications, highlighting their powerful potential in analyzing biological data. Furthermore, we propose innovative solutions to the challenges faced by AI algorithms in the field and suggest future research directions. By consolidating current research and demonstrating the practical applications and future potential of AI in synthetic biology, this review provides valuable insights for advancing both academic research and practical applications in biotechnology.
Collapse
Affiliation(s)
- Nana Ding
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, Hangzhou 311300, China;
- Zhejiang Provincial Key Laboratory of Resources Protection and Innovation of Traditional Chinese Medicine, Zhejiang A&F University, Hangzhou 311300, China
| | - Zenan Yuan
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, Hangzhou 311300, China;
- Zhejiang Provincial Key Laboratory of Resources Protection and Innovation of Traditional Chinese Medicine, Zhejiang A&F University, Hangzhou 311300, China
| | - Zheng Ma
- Zhejiang Provincial Key Laboratory of Biometrology and Inspection & Quarantine, College of Life Sciences, China Jiliang University, Hangzhou 310018, China;
| | - Yefei Wu
- Zhejiang Qianjiang Biochemical Co., Ltd., Haining 314400, China;
| | - Lianghong Yin
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, Hangzhou 311300, China;
- Zhejiang Provincial Key Laboratory of Resources Protection and Innovation of Traditional Chinese Medicine, Zhejiang A&F University, Hangzhou 311300, China
| |
Collapse
|
2
|
Orsi E, Schada von Borzyskowski L, Noack S, Nikel PI, Lindner SN. Automated in vivo enzyme engineering accelerates biocatalyst optimization. Nat Commun 2024; 15:3447. [PMID: 38658554 PMCID: PMC11043082 DOI: 10.1038/s41467-024-46574-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Accepted: 03/04/2024] [Indexed: 04/26/2024] Open
Abstract
Achieving cost-competitive bio-based processes requires development of stable and selective biocatalysts. Their realization through in vitro enzyme characterization and engineering is mostly low throughput and labor-intensive. Therefore, strategies for increasing throughput while diminishing manual labor are gaining momentum, such as in vivo screening and evolution campaigns. Computational tools like machine learning further support enzyme engineering efforts by widening the explorable design space. Here, we propose an integrated solution to enzyme engineering challenges whereby ML-guided, automated workflows (including library generation, implementation of hypermutation systems, adapted laboratory evolution, and in vivo growth-coupled selection) could be realized to accelerate pipelines towards superior biocatalysts.
Collapse
Affiliation(s)
- Enrico Orsi
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800, Kongens Lyngby, Denmark
| | | | - Stephan Noack
- Institute of Bio- and Geosciences, IBG-1: Biotechnology, Forschungszentrum Jülich, 52425, Jülich, Germany
| | - Pablo I Nikel
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800, Kongens Lyngby, Denmark
| | - Steffen N Lindner
- Max Planck Institute of Molecular Plant Physiology, 14476, Potsdam-Golm, Germany.
- Department of Biochemistry, Charité Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt-Universität, 10117, Berlin, Germany.
| |
Collapse
|
3
|
Blau T, Chades I, Ong CS. Machine Learning for Biological Design. Methods Mol Biol 2024; 2760:319-344. [PMID: 38468097 DOI: 10.1007/978-1-0716-3658-9_19] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/13/2024]
Abstract
We briefly present machine learning approaches for designing better biological experiments. These approaches build on machine learning predictors and provide additional tools to guide scientific discovery. There are two different kinds of objectives when designing better experiments: to improve the predictive model or to improve the experimental outcome. We survey five different approaches for adaptive experimental design that iteratively search the space of possible experiments while adapting to measured data. The approaches are Bayesian optimization, bandits, reinforcement learning, optimal experimental design, and active learning. These machine learning approaches have shown promise in various areas of biology, and we provide broad guidelines to the practitioner and links to further resources.
Collapse
Affiliation(s)
- Tom Blau
- CSIRO, Data61, Eveleigh, NSW, Australia
| | | | | |
Collapse
|
4
|
Han T, Nazarbekov A, Zou X, Lee SY. Recent advances in systems metabolic engineering. Curr Opin Biotechnol 2023; 84:103004. [PMID: 37778304 DOI: 10.1016/j.copbio.2023.103004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2023] [Revised: 09/03/2023] [Accepted: 09/05/2023] [Indexed: 10/03/2023]
Abstract
Systems metabolic engineering, which integrates metabolic engineering with systems biology, synthetic biology, and evolutionary engineering, has revolutionized the sustainable production of fuels and materials through the creation of efficient microbial cell factories. Recent advancements in systems metabolic engineering targeting different biological components of the host cell have enabled the creation of highly productive microbial cell factories. This article provides a review of the recent tools and strategies used for enzyme-, genetic module-, pathway-, flux-, genome-, and cell-level engineering, supported by illustrative examples. Furthermore, we highlight recent trends in systems metabolic engineering, which involve the application of multiple tools discussed in this review. Finally, the paper addresses the challenges and perspectives of transitioning academic-level metabolic engineering studies to commercial-scale production.
Collapse
Affiliation(s)
- Taehee Han
- Metabolic and Biomolecular Engineering National Research Laboratory and Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, the Republic of Korea; KAIST Institute for the BioCentury and KAIST Institute for Artificial Intelligence, KAIST, Daejeon 34141, the Republic of Korea; BioProcess Engineering Research Center and BioInformatics Research Center, KAIST, 34141 Daejeon, the Republic of Korea
| | - Alisher Nazarbekov
- Metabolic and Biomolecular Engineering National Research Laboratory and Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, the Republic of Korea; KAIST Institute for the BioCentury and KAIST Institute for Artificial Intelligence, KAIST, Daejeon 34141, the Republic of Korea
| | - Xuan Zou
- Metabolic and Biomolecular Engineering National Research Laboratory and Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, the Republic of Korea; KAIST Institute for the BioCentury and KAIST Institute for Artificial Intelligence, KAIST, Daejeon 34141, the Republic of Korea; BioProcess Engineering Research Center and BioInformatics Research Center, KAIST, 34141 Daejeon, the Republic of Korea
| | - Sang Yup Lee
- Metabolic and Biomolecular Engineering National Research Laboratory and Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, the Republic of Korea; KAIST Institute for the BioCentury and KAIST Institute for Artificial Intelligence, KAIST, Daejeon 34141, the Republic of Korea; BioProcess Engineering Research Center and BioInformatics Research Center, KAIST, 34141 Daejeon, the Republic of Korea; Graduate School of Engineering Biology, KAIST, Daejeon 34141, the Republic of Korea.
| |
Collapse
|
5
|
Yang L, Liang X, Zhang N, Lu L. STAR: A Web Server for Assisting Directed Protein Evolution with Machine Learning. ACS OMEGA 2023; 8:44751-44756. [PMID: 38046324 PMCID: PMC10688154 DOI: 10.1021/acsomega.3c04832] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/04/2023] [Revised: 10/10/2023] [Accepted: 10/12/2023] [Indexed: 12/05/2023]
Abstract
Protein engineering has made significant contributions to industries such as agriculture, food, and pharmaceuticals. In recent years, directed evolution combined with artificial intelligence has emerged as a cutting-edge R&D approach. However, the application of machine learning techniques can be challenging for those without relevant experience and coding skills. To address this issue, we have developed a web-based protein sequence recommendation system: STAR (Sequence recommendaTion via ARtificial intelligence). Our system utilizes Bayesian optimization as its backbone and includes a filtering step using a regression model to enhance the success rate of recommended sequences. Additionally, we have incorporated an in silico-directed evolution approach to expand the exploration of the protein space. The Web site can be accessed at https://www.FindProteinStar.com/.
Collapse
Affiliation(s)
- Likun Yang
- Asymchem Life Science (Tianjin) Co.,
Ltd, Tianjin 300457, P. R. China
| | - Xiaoli Liang
- Asymchem Life Science (Tianjin) Co.,
Ltd, Tianjin 300457, P. R. China
| | - Na Zhang
- Asymchem Life Science (Tianjin) Co.,
Ltd, Tianjin 300457, P. R. China
| | - Lu Lu
- Asymchem Life Science (Tianjin) Co.,
Ltd, Tianjin 300457, P. R. China
| |
Collapse
|
6
|
Cheng Y, Bi X, Xu Y, Liu Y, Li J, Du G, Lv X, Liu L. Machine learning for metabolic pathway optimization: A review. Comput Struct Biotechnol J 2023; 21:2381-2393. [PMID: 38213889 PMCID: PMC10781721 DOI: 10.1016/j.csbj.2023.03.045] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Revised: 03/24/2023] [Accepted: 03/25/2023] [Indexed: 03/29/2023] Open
Abstract
Optimizing the metabolic pathways of microbial cell factories is essential for establishing viable biotechnological production processes. However, due to the limited understanding of the complex setup of cellular machinery, building efficient microbial cell factories remains tedious and time-consuming. Machine learning (ML), a powerful tool capable of identifying patterns within large datasets, has been used to analyze biological datasets generated using various high-throughput technologies to build data-driven models for complex bioprocesses. In addition, ML can also be integrated with Design-Build-Test-Learn to accelerate development. This review focuses on recent ML applications in genome-scale metabolic model construction, multistep pathway optimization, rate-limiting enzyme engineering, and gene regulatory element designing. In addition, we have discussed some limitations of these methods as well as potential solutions.
Collapse
Affiliation(s)
- Yang Cheng
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
- Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Xinyu Bi
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
- Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Yameng Xu
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
- Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Yanfeng Liu
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
- Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Jianghua Li
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
- Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Guocheng Du
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
- Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Xueqin Lv
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
- Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Long Liu
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
- Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi 214122, China
| |
Collapse
|
7
|
Folch JP, Lee RM, Shafei B, Walz D, Tsay C, van der Wilk M, Misener R. Combining multi-fidelity modelling and asynchronous batch Bayesian Optimization. Comput Chem Eng 2023. [DOI: 10.1016/j.compchemeng.2023.108194] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/24/2023]
|