1
|
Yan D, Zhou M, Adduri A, Zhuang Y, Guler M, Liu S, Shin H, Kovach T, Oh G, Liu X, Deng Y, Wang X, Cao L, Sherman DH, Schultz PJ, Kersten RD, Clement JA, Tripathi A, Behsaz B, Mohimani H. Discovering type I cis-AT polyketides through computational mass spectrometry and genome mining with Seq2PKS. Nat Commun 2024; 15:5356. [PMID: 38918378 PMCID: PMC11199612 DOI: 10.1038/s41467-024-49587-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Accepted: 06/12/2024] [Indexed: 06/27/2024] Open
Abstract
Type 1 polyketides are a major class of natural products used as antiviral, antibiotic, antifungal, antiparasitic, immunosuppressive, and antitumor drugs. Analysis of public microbial genomes leads to the discovery of over sixty thousand type 1 polyketide gene clusters. However, the molecular products of only about a hundred of these clusters are characterized, leaving most metabolites unknown. Characterizing polyketides relies on bioactivity-guided purification, which is expensive and time-consuming. To address this, we present Seq2PKS, a machine learning algorithm that predicts chemical structures derived from Type 1 polyketide synthases. Seq2PKS predicts numerous putative structures for each gene cluster to enhance accuracy. The correct structure is identified using a variable mass spectral database search. Benchmarks show that Seq2PKS outperforms existing methods. Applying Seq2PKS to Actinobacteria datasets, we discover biosynthetic gene clusters for monazomycin, oasomycin A, and 2-aminobenzamide-actiphenol.
Collapse
Affiliation(s)
- Donghui Yan
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Muqing Zhou
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Abhinav Adduri
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Yihao Zhuang
- Natural Products Discovery Core, University of Michigan, Ann Arbor, MI, USA
| | - Mustafa Guler
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Sitong Liu
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Hyonyoung Shin
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Torin Kovach
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Gloria Oh
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Xiao Liu
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Yuting Deng
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Xiaofeng Wang
- Department of Medicinal Chemistry, University of Michigan, Ann Arbor, MI, USA
| | - Liu Cao
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - David H Sherman
- Department of Medicinal Chemistry, University of Michigan, Ann Arbor, MI, USA
- Life Sciences Institute, University of Michigan, Ann Arbor, MI, USA
| | - Pamela J Schultz
- Natural Products Discovery Core, University of Michigan, Ann Arbor, MI, USA
- Life Sciences Institute, University of Michigan, Ann Arbor, MI, USA
| | - Roland D Kersten
- Department of Medicinal Chemistry, University of Michigan, Ann Arbor, MI, USA
| | | | - Ashootosh Tripathi
- Natural Products Discovery Core, University of Michigan, Ann Arbor, MI, USA.
- Department of Medicinal Chemistry, University of Michigan, Ann Arbor, MI, USA.
- Life Sciences Institute, University of Michigan, Ann Arbor, MI, USA.
| | - Bahar Behsaz
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA.
- Chemia Biosciences Inc, Pittsburgh, PA, USA.
| | - Hosein Mohimani
- Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA.
| |
Collapse
|
2
|
Qiu S, Yang A, Zeng H. Flux balance analysis-based metabolic modeling of microbial secondary metabolism: Current status and outlook. PLoS Comput Biol 2023; 19:e1011391. [PMID: 37619239 PMCID: PMC10449171 DOI: 10.1371/journal.pcbi.1011391] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/26/2023] Open
Abstract
In microorganisms, different from primary metabolism for cellular growth, secondary metabolism is for ecological interactions and stress responses and an important source of natural products widely used in various areas such as pharmaceutics and food additives. With advancements of sequencing technologies and bioinformatics tools, a large number of biosynthetic gene clusters of secondary metabolites have been discovered from microbial genomes. However, due to challenges from the difficulty of genome-scale pathway reconstruction and the limitation of conventional flux balance analysis (FBA) on secondary metabolism, the quantitative modeling of secondary metabolism is poorly established, in contrast to that of primary metabolism. This review first discusses current efforts on the reconstruction of secondary metabolic pathways in genome-scale metabolic models (GSMMs), as well as related FBA-based modeling techniques. Additionally, potential extensions of FBA are suggested to improve the prediction accuracy of secondary metabolite production. As this review posits, biosynthetic pathway reconstruction for various secondary metabolites will become automated and a modeling framework capturing secondary metabolism onset will enhance the predictive power. Expectedly, an improved FBA-based modeling workflow will facilitate quantitative study of secondary metabolism and in silico design of engineering strategies for natural product production.
Collapse
Affiliation(s)
- Sizhe Qiu
- School of Food and Health, Beijing Technology and Business University, Bejing, China
- Department of Engineering Science, University of Oxford, Oxford, United Kingdom
| | - Aidong Yang
- Department of Engineering Science, University of Oxford, Oxford, United Kingdom
| | - Hong Zeng
- School of Food and Health, Beijing Technology and Business University, Bejing, China
| |
Collapse
|
3
|
Metabolic pathway assembly using docking domains from type I cis-AT polyketide synthases. Nat Commun 2022; 13:5541. [PMID: 36130947 PMCID: PMC9492657 DOI: 10.1038/s41467-022-33272-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2022] [Accepted: 09/09/2022] [Indexed: 11/21/2022] Open
Abstract
Engineered metabolic pathways in microbial cell factories often have no natural organization and have challenging flux imbalances, leading to low biocatalytic efficiency. Modular polyketide synthases (PKSs) are multienzyme complexes that synthesize polyketide products via an assembly line thiotemplate mechanism. Here, we develop a strategy named mimic PKS enzyme assembly line (mPKSeal) that assembles key cascade enzymes to enhance biocatalytic efficiency and increase target production by recruiting cascade enzymes tagged with docking domains from type I cis-AT PKS. We apply this strategy to the astaxanthin biosynthetic pathway in engineered Escherichia coli for multienzyme assembly to increase astaxanthin production by 2.4-fold. The docking pairs, from the same PKSs or those from different cis-AT PKSs evidently belonging to distinct classes, are effective enzyme assembly tools for increasing astaxanthin production. This study addresses the challenge of cascade catalytic efficiency and highlights the potential for engineering enzyme assembly. Assembly artificial pathway in design connecting media can increase biosynthetic efficiency, but the choice of connecting media is limited. Here, the authors develop a new protein assembly strategy using a pool of docking peptides from polyketide synthase and show its application in astaxanthin biosynthesis in E. coli.
Collapse
|
4
|
Abt K, Castelo-Branco R, Leão PN. Biosynthesis of Chlorinated Lactylates in Sphaerospermopsis sp. LEGE 00249. JOURNAL OF NATURAL PRODUCTS 2021; 84:278-286. [PMID: 33444023 PMCID: PMC7923214 DOI: 10.1021/acs.jnatprod.0c00950] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/28/2020] [Indexed: 05/14/2023]
Abstract
Lactylates are an important group of molecules in the food and cosmetic industries. A series of natural halogenated 1-lactylates, chlorosphaerolactylates (1-4), were recently reported from Sphaerospermopsis sp. LEGE 00249. Here, we identify the cly biosynthetic gene cluster, containing all the necessary functionalities for the biosynthesis of the natural lactylates, based on in silico analyses. Using a combination of stable isotope incorporation experiments and bioinformatic analysis, we propose that dodecanoic acid and pyruvate are the key building blocks in the biosynthesis of 1-4. We additionally report minor analogues of these molecules with varying alkyl chains. This work paves the way to accessing industrially relevant lactylates through pathway engineering.
Collapse
Affiliation(s)
- Kathleen Abt
- Interdisciplinary
Centre of Marine and Environmental Research (CIIMAR/CIMAR), University of Porto, Avenida General Norton de Matos, s/n, 4450-208 Matosinhos, Portugal
- Institute
of Biomedical Sciences Abel Salazar (ICBAS), University of Porto, Rua de Jorge Viterbo Ferreira, 228, 4050-313 Porto, Portugal
| | - Raquel Castelo-Branco
- Interdisciplinary
Centre of Marine and Environmental Research (CIIMAR/CIMAR), University of Porto, Avenida General Norton de Matos, s/n, 4450-208 Matosinhos, Portugal
| | - Pedro N. Leão
- Interdisciplinary
Centre of Marine and Environmental Research (CIIMAR/CIMAR), University of Porto, Avenida General Norton de Matos, s/n, 4450-208 Matosinhos, Portugal
| |
Collapse
|
5
|
Medema MH. The year 2020 in natural product bioinformatics: an overview of the latest tools and databases. Nat Prod Rep 2021; 38:301-306. [PMID: 33533785 DOI: 10.1039/d0np00090f] [Citation(s) in RCA: 42] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Covering: 2020 Bioinformatic approaches to document and analyse chemical structures, biosynthetic gene clusters and analytical data play an important role in the study of natural products. Every year, such a large number of new algorithms, tools and databases are released, that it is difficult to keep track of all the latest developments. The aim of this short article is to provide a concise overview of and reference to the major tools, methods and databases that have been released in the past year.
Collapse
Affiliation(s)
- Marnix H Medema
- Bioinformatics Group, Wageningen University, Wageningen, The Netherlands.
| |
Collapse
|
6
|
Malico AA, Nichols L, Williams GJ. Synthetic biology enabling access to designer polyketides. Curr Opin Chem Biol 2020; 58:45-53. [PMID: 32758909 DOI: 10.1016/j.cbpa.2020.06.003] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2020] [Revised: 05/08/2020] [Accepted: 06/11/2020] [Indexed: 12/18/2022]
Abstract
The full potential of polyketide discovery has yet to be reached owing to a lack of suitable technologies and knowledge required to advance engineering of polyketide biosynthesis. Recent investigations on the discovery, enhancement, and non-natural use of these biosynthetic gene clusters via computational biology, metabolic engineering, structural biology, and enzymology-guided approaches have facilitated improved access to designer polyketides. Here, we discuss recent successes in gene cluster discovery, host strain engineering, precursor-directed biosynthesis, combinatorial biosynthesis, polyketide tailoring, and high-throughput synthetic biology, as well as challenges and outlooks for rapidly generating useful target polyketides.
Collapse
Affiliation(s)
- Alexandra A Malico
- Department of Chemistry, NC State University, Raleigh, NC, 27695, United States
| | - Lindsay Nichols
- Department of Chemistry, NC State University, Raleigh, NC, 27695, United States
| | - Gavin J Williams
- Department of Chemistry, NC State University, Raleigh, NC, 27695, United States; Comparative Medicine Institute, NC State University, Raleigh, NC, 27695, United States.
| |
Collapse
|